CN109957569A - Base editing system and method based on CPF1 protein - Google Patents
- Publication number
- CN109957569A CN201811578853.8A
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/04—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
- C12Y305/04001—Cytosine deaminase (3.5.4.1)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Description
Technical Field
The present invention relates to the field of genetic engineering. Specifically, it relates to a base editing system and method based on the Cpf1 protein. More specifically, it relates to a system and method for efficient base editing of target sequences in the genome of an organism (e.g., a plant) using a guide RNA-directed Cpf1-deaminase fusion protein, and to genetically modified organisms (e.g., plants) produced by the method and their progeny.
Background Art
Efficient crop improvement depends on obtaining new genetic mutations that can be readily introduced into modern cultivars. Genetic studies, especially genome-wide association studies, have shown that single-nucleotide changes are the main cause of differences in crop traits. Single-base variations lead to amino acid substitutions, and thereby to the evolution of superior alleles and superior traits. Before the advent of genome editing, Targeting Induced Local Lesions IN Genomes (TILLING) could be used to generate the mutations urgently needed for crop improvement. However, TILLING screening is time-consuming and labor-intensive, and the point mutations identified are often limited in number and type. Genome editing technologies, particularly those based on the CRISPR/Cas9 system, can introduce specific base substitutions at genomic loci through the homologous recombination (HR)-mediated DNA repair pathway. At present, however, successful use of this approach is severely limited, mainly because HR-mediated double-strand break repair occurs at very low frequency in plants. Delivering a sufficient amount of DNA repair template efficiently is a further major obstacle. Together, these problems make it a major challenge to achieve site-directed mutagenesis in plants by HR efficiently and simply.
In recent years, by combining the DNA-binding property of Cas9 with the activity of DNA deaminases, fusions of Cas9 with a deaminase have made it possible to convert a single cytosine (C) to thymine (T), or adenine (A) to guanine (G), precisely in a target gene. Current systems for C-to-T conversion mainly include SpnCas9-BE3, SpnCas9-AID, and fusions with Cas9 variants such as VQR-BE3, EQR-BE3 and VRER-BE3, as well as SaCas9-BE3 and its variant SaKKH-BE3. These combinations relax the PAM restriction on C-to-T conversion and make the range of the editing window more variable. In addition, the laboratory of David Liu at Harvard University recently used directed evolution to develop an adenine deaminase that acts on ssDNA; fused to Cas9, it forms the Cas9-ABE system, which converts A to G in DNA and further extends the scope of single-base editing.
Although these studies have greatly extended the range of single-base editing in DNA, current single-base editing technology still has several problems. First, because the PAMs of Cas9 and its variants are generally restricted to G/C-rich regions, the range of PAM types available to single-base editing systems needs to be broadened. Second, because Cas9 and the editing itself have relatively poor specificity, the specificity of single-base editing systems needs to be improved. Third, because nCas9-BE3 and its BE3 variants, as well as nCas9-ABE, usually nick the non-targeted strand at the target site, DNA deletions and insertions readily arise during mismatch repair alongside the intended single-base mutation, so the fidelity of single-base editing also needs to be improved. There therefore remains an urgent need in the art for new systems and methods for base editing of plant genomes.
Description of the Drawings
Figure 1. Optimization of Cpf1-mediated cleavage activity in plant genomes.
Figure 2. Cpf1-mediated C-to-T mutation in plant genomes.
Figure 3. Cpf1-mediated A-to-G mutation in plant genomes.
Figure 4. Simultaneous base editing at multiple sites using the RNA cleavage activity of Cpf1.
Summary of the Invention
1. Definitions
In the present invention, unless otherwise specified, the scientific and technical terms used herein have the meanings commonly understood by those skilled in the art. The terms and laboratory procedures relating to protein and nucleic acid chemistry, molecular biology, cell and tissue culture, microbiology, and immunology used herein are those widely used in the respective fields. For example, the standard recombinant DNA and molecular cloning techniques used in the present invention are well known to those skilled in the art and are described more fully in: Sambrook, J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter "Sambrook"). To aid understanding of the present invention, definitions and explanations of relevant terms are provided below.
"Cpf1 nuclease", "Cpf1 protein" and "Cpf1" are used interchangeably herein and refer to an RNA-guided nuclease comprising a Cpf1 protein or a fragment thereof. Cpf1 is a component of the CRISPR-Cpf1 genome editing system and, directed by a guide RNA (crRNA), can target and cleave a DNA target sequence to form a DNA double-strand break (DSB). The Cpf1 protein contains a DNA cleavage domain and a separate RNA cleavage domain. The RNA cleavage domain can process pre-crRNA into mature crRNA.
"Guide RNA" and "gRNA" are used interchangeably herein. The guide RNA of a Cpf1-mediated genome editing system typically consists only of a mature crRNA molecule, in which the crRNA contains a sequence sufficiently identical to the target sequence to hybridize with the complement of the target sequence and direct sequence-specific binding of the complex (Cpf1 + crRNA) to the target sequence.
"Deaminase" refers to an enzyme that catalyzes a deamination reaction. In some embodiments of the present invention, the deaminase is a cytosine deaminase, which catalyzes the deamination of cytidine or deoxycytidine to uridine or deoxyuridine, respectively. In some embodiments of the present invention, the deaminase is an adenine deaminase, which can catalyze the conversion of adenosine or deoxyadenosine (A) to inosine (I).
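The two deamination reactions defined above can be sketched as a small lookup. This is an illustrative model only: the function and table names are hypothetical, and the `READ_AS` mapping (U pairs like T, I pairs like G during replication) is standard biochemistry assumed here to connect the reactions to the C-to-T and A-to-G outcomes mentioned in the background.

```python
# Illustrative model of the two deamination reactions defined above.
# All names are hypothetical and chosen for this sketch only.

DEAMINATION = {
    "cytosine": ("C", "U"),  # cytidine/deoxycytidine -> uridine/deoxyuridine
    "adenine": ("A", "I"),   # adenosine/deoxyadenosine -> inosine
}

# Assumed pairing behavior of the products during replication:
# U is read as T, and I is read as G.
READ_AS = {"U": "T", "I": "G"}


def deaminate(base: str, deaminase: str) -> str:
    """Return the deamination product, or the base unchanged if it is not a substrate."""
    substrate, product = DEAMINATION[deaminase]
    return product if base == substrate else base


def net_substitution(base: str, deaminase: str) -> str:
    """Return the base that is eventually read at this position after deamination."""
    product = deaminate(base, deaminase)
    return READ_AS.get(product, product)


if __name__ == "__main__":
    for enzyme, (substrate, _) in DEAMINATION.items():
        print(enzyme, substrate, "->", net_substitution(substrate, enzyme))
```

Bases that are not a substrate of the chosen deaminase pass through unchanged, which mirrors the substrate specificity stated in the definition.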
"Genome", when applied to plant cells, encompasses not only the chromosomal DNA present in the nucleus but also the organelle DNA present in subcellular components of the cell (e.g., mitochondria, plastids).
As used herein, "organism" includes any organism suitable for genome editing, preferably a eukaryote. Examples of organisms include, but are not limited to, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle and cats; poultry such as chickens, ducks and geese; and plants, including monocotyledons and dicotyledons, such as rice, maize, wheat, sorghum, barley, soybean, peanut and Arabidopsis.
"Genetically modified organism" or "genetically modified cell" means an organism or cell that contains within its genome an exogenous polynucleotide or a modified gene or expression control sequence. For example, an exogenous polynucleotide can be stably integrated into the genome of the organism or cell and inherited over successive generations. The exogenous polynucleotide can be integrated into the genome alone or as part of a recombinant DNA construct. A modified gene or expression control sequence is one that comprises one or more deoxynucleotide substitutions, deletions and additions in the genome of the organism or cell. For example, a genetically modified organism obtained by the present invention may comprise one or more C-to-T or A-to-G substitutions relative to the wild type (the corresponding organism without the genetic modification).
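As a minimal sketch of what "one or more C-to-T or A-to-G substitutions relative to the wild type" means in practice, the following compares an edited sequence with its wild-type counterpart on the same strand. The function names and example sequences are hypothetical; the sketch handles substitutions only and does not account for edits that would appear as G-to-A or T-to-C when read from the opposite strand.

```python
from typing import List, Tuple

# The two substitution types named in the definition above.
BASE_EDITS = {("C", "T"), ("A", "G")}


def list_substitutions(wild_type: str, edited: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, wild-type base, edited base) for every mismatch."""
    if len(wild_type) != len(edited):
        raise ValueError("sequences must have equal length (substitutions only)")
    pairs = zip(wild_type.upper(), edited.upper())
    return [(i + 1, w, e) for i, (w, e) in enumerate(pairs) if w != e]


def only_base_edits(wild_type: str, edited: str) -> bool:
    """True if the sequences differ, and differ only by C-to-T or A-to-G substitutions."""
    subs = list_substitutions(wild_type, edited)
    return bool(subs) and all((w, e) in BASE_EDITS for _, w, e in subs)


if __name__ == "__main__":
    wt = "ACCGT"       # made-up wild-type fragment
    mutant = "ATCGT"   # made-up edited fragment
    print(list_substitutions(wt, mutant))
    print(only_base_edits(wt, mutant))
```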
"Exogenous", with respect to a sequence, means a sequence from a foreign species or, if from the same species, a sequence whose composition and/or genomic locus has been substantially altered from its native form by deliberate human intervention.
"Polynucleotide", "nucleic acid sequence", "nucleotide sequence" and "nucleic acid fragment" are used interchangeably and refer to a single- or double-stranded RNA or DNA polymer that optionally contains synthetic, non-natural or altered nucleotide bases. Nucleotides are referred to by their single-letter designations as follows: "A" for adenosine or deoxyadenosine (for RNA or DNA, respectively), "C" for cytidine or deoxycytidine, "G" for guanosine or deoxyguanosine, "U" for uridine, "T" for deoxythymidine, "R" for a purine (A or G), "Y" for a pyrimidine (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N" for any nucleotide.
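The single-letter designations above can be expressed as a lookup table and used to test whether a concrete DNA sequence fits a degenerate pattern. This is a minimal sketch: the names `CODES` and `matches` are hypothetical, only the DNA letters listed in the definition are included (U and I are left out), and the example patterns are made up.

```python
# Single-letter nucleotide designations from the definition above (DNA letters only).
CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # purine
    "Y": "CT",    # pyrimidine
    "K": "GT",
    "H": "ACT",
    "N": "ACGT",  # any nucleotide
}


def matches(pattern: str, sequence: str) -> bool:
    """True if each base of `sequence` is allowed by the code at the same position of `pattern`."""
    if len(pattern) != len(sequence):
        return False
    return all(
        base in CODES[code]
        for code, base in zip(pattern.upper(), sequence.upper())
    )


if __name__ == "__main__":
    print(matches("RYN", "ACG"))  # A is a purine, C a pyrimidine, G is any
    print(matches("RYN", "CCG"))  # C is not a purine
```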
"Polypeptide", "peptide" and "protein" are used interchangeably in the present invention and refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residues are artificial chemical analogs of the corresponding naturally occurring amino acids, as well as to naturally occurring amino acid polymers. The terms "polypeptide", "peptide", "amino acid sequence" and "protein" may also include modified forms, including but not limited to glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation.
As used in the present invention, "expression construct" refers to a vector, such as a recombinant vector, suitable for expressing a nucleotide sequence of interest in an organism. "Expression" refers to the production of a functional product. For example, expression of a nucleotide sequence can refer to transcription of the nucleotide sequence (e.g., to produce mRNA or functional RNA) and/or translation of RNA into a precursor or mature protein.
An "expression construct" of the present invention may be a linear nucleic acid fragment, a circular plasmid, a viral vector or, in some embodiments, a translatable RNA (e.g., mRNA).
An "expression construct" of the present invention may comprise regulatory sequences and a nucleotide sequence of interest from different sources, or regulatory sequences and a nucleotide sequence of interest from the same source but arranged differently from the way they normally occur in nature.
"Regulatory sequence" and "regulatory element" are used interchangeably and refer to a nucleotide sequence that is located upstream of (5' non-coding sequence), within, or downstream of (3' non-coding sequence) a coding sequence and that affects the transcription, RNA processing or stability, or translation of the associated coding sequence.
Regulatory sequences can include, but are not limited to, promoters, translation leader sequences, introns and polyadenylation recognition sequences.
"Promoter" refers to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment. In some embodiments of the present invention, the promoter is one capable of controlling gene transcription in cells of an organism, whether or not it is derived from that organism. The promoter may be a constitutive, tissue-specific, developmentally regulated or inducible promoter.
"Constitutive promoter" refers to a promoter that generally causes a gene to be expressed in most cell types under most circumstances. "Tissue-specific promoter" and "tissue-preferred promoter" are used interchangeably and refer to a promoter that is expressed primarily, but not necessarily exclusively, in one tissue or organ, and that may also be expressed in a particular cell or cell type. "Developmentally regulated promoter" refers to a promoter whose activity is determined by developmental events. An "inducible promoter" selectively expresses an operably linked DNA sequence in response to an endogenous or exogenous stimulus (environmental, hormonal, chemical signal, etc.).
As used herein, the term "operably linked" means that a regulatory element (for example, but not limited to, a promoter sequence or a transcription termination sequence) is linked to a nucleic acid sequence (e.g., a coding sequence or open reading frame) such that transcription of the nucleotide sequence is controlled and regulated by the transcriptional regulatory element. Techniques for operably linking a regulatory element region to a nucleic acid molecule are known in the art.
"Introducing" a nucleic acid molecule (e.g., a plasmid, a linear nucleic acid fragment, RNA) or a protein into an organism means transforming cells of the organism with the nucleic acid or protein so that it can function in the cells. "Transformation" as used in the present invention includes stable transformation and transient transformation.
"Stable transformation" refers to introducing an exogenous nucleotide sequence into the genome, resulting in stable inheritance of the exogenous gene. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the organism and of any successive generation thereof.
"Transient transformation" refers to introducing a nucleic acid molecule or protein into a cell, where it performs its function without stable inheritance of an exogenous gene. In transient transformation, the exogenous nucleic acid sequence is not integrated into the genome.
As used herein, the term "plant" includes a whole plant and any progeny, cell, tissue or part of a plant. The term "plant part" includes any part of a plant, including, for example and without limitation: seeds (including mature seeds, immature embryos without a seed coat, and immature seeds); plant cuttings; plant cells; plant cell cultures; and plant organs (e.g., pollen, embryos, flowers, fruits, shoots, leaves, roots, stems and related explants). A plant tissue or plant organ can be a seed, callus, or any other population of plant cells organized into a structural or functional unit. A plant cell or tissue culture may be capable of regenerating a plant that has the physiological and morphological characteristics of the plant from which the cell or tissue was derived, and that has substantially the same genotype as that plant. In contrast, some plant cells are not capable of regenerating into plants. Regenerable cells in a plant cell or tissue culture can be embryos, protoplasts, meristematic cells, callus, pollen, leaves, anthers, roots, root tips, silk, flowers, kernels, ears, cobs, husks or stalks.
Plant parts include harvestable parts and parts that can be used to propagate progeny plants. Plant parts that can be used for propagation include, for example and without limitation: seeds; fruits; cuttings; seedlings; tubers; and rootstocks. A harvestable part of a plant can be any useful part of the plant, including, for example and without limitation: flowers; pollen; seedlings; tubers; leaves; stems; fruits; seeds; and roots.
A plant cell is the structural and physiological unit of a plant. As used herein, plant cells include protoplasts and protoplasts with a partial cell wall. Plant cells can be in the form of isolated single cells or cell aggregates (e.g., friable callus and cultured cells), and can be part of a higher-order organized unit (e.g., plant tissues, plant organs and plants). Thus, a plant cell can be a protoplast, a gamete-producing cell, or a cell or collection of cells capable of regenerating into a whole plant. Accordingly, in embodiments herein, a seed that comprises multiple plant cells and is capable of regenerating into a whole plant is considered a "plant part".
As used herein, the term "protoplast" refers to a plant cell whose cell wall has been completely or partially removed and whose lipid bilayer membrane is exposed. Typically, a protoplast is an isolated plant cell without a cell wall that has the potential to regenerate into a cell culture or a whole plant.
"Progeny" of a plant includes any subsequent generation of the plant.
"Trait" refers to a physiological, morphological, biochemical or physical characteristic of a plant or of a particular plant material or cell. In some embodiments, these characteristics may be visible to the naked eye, such as seed or plant size; measurable by biochemical techniques, such as the protein, starch or oil content of seeds or leaves; observable as metabolic or physiological processes, such as resistance to water stress or to particular salt, sugar or nitrogen concentrations; detectable as gene expression levels; or observable as agronomic traits such as resistance to osmotic stress or yield. In some embodiments, traits also include the ploidy of the plant, for example haploidy, which is important for plant breeding. In some embodiments, traits also include resistance of the plant to herbicides.
"Agronomic traits" are measurable parameters, including but not limited to: leaf greenness, grain yield, growth rate, total biomass or rate of accumulation, fresh weight at maturity, dry weight at maturity, fruit yield, seed yield, total plant nitrogen content, fruit nitrogen content, seed nitrogen content, nitrogen content of vegetative tissue, total plant free amino acid content, fruit free amino acid content, seed free amino acid content, free amino acid content of vegetative tissue, total plant protein content, fruit protein content, seed protein content, protein content of vegetative tissue, drought resistance, nitrogen uptake, root lodging, harvest index, stalk lodging, plant height, ear height, ear length, disease resistance, cold resistance, salt resistance and tiller number.
2. Base editing system based on Cpf1 protein
The present invention provides a system for base editing of a target sequence in the genome of an organism, comprising at least one of the following i) to v):
i) a base editing fusion protein, and a guide RNA;
ii) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein, and a guide RNA;
iii) a base editing fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
iv) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
v) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein and a nucleotide sequence encoding a guide RNA;
wherein the base editing fusion protein comprises a Cpf1 lacking DNA cleavage activity and a deaminase, and the guide RNA is capable of targeting the base editing fusion protein to a target sequence in the genome, resulting in one or more C to T or A to G substitutions in the target sequence.
Cpf1 contains one DNA cleavage domain (RuvC); mutating this domain abolishes the DNA cleavage activity of Cpf1, yielding a "Cpf1 lacking DNA cleavage activity". The Cpf1 lacking DNA cleavage activity still retains gRNA-directed DNA binding ability. Thus, in principle, when fused to an additional protein, the Cpf1 lacking DNA cleavage activity can target that additional protein to almost any DNA sequence simply by co-expression with a suitable guide RNA.
The Cpf1 lacking DNA cleavage activity of the present invention may be derived from the Cpf1 of different species, for example the Cpf1 proteins derived from Francisella novicida U112, Acidaminococcus sp. BV3L6 and Lachnospiraceae bacterium ND2006, designated FnCpf1 (wild-type amino acid sequence shown, for example, in SEQ ID NO: 19), AsCpf1 (wild-type amino acid sequence shown, for example, in SEQ ID NO: 18) and LbCpf1 (wild-type amino acid sequence shown, for example, in SEQ ID NO: 20), respectively.
In some embodiments, the Cpf1 lacking DNA cleavage activity is an FnCpf1 lacking DNA cleavage activity. In some specific embodiments, the FnCpf1 lacking DNA cleavage activity comprises a D917A mutation relative to wild-type FnCpf1.
In some embodiments, the Cpf1 lacking DNA cleavage activity is an AsCpf1 lacking DNA cleavage activity. In some specific embodiments, the AsCpf1 lacking DNA cleavage activity comprises a D908A mutation relative to wild-type AsCpf1.
In some preferred embodiments, the Cpf1 lacking DNA cleavage activity is an LbCpf1 lacking DNA cleavage activity. In some specific embodiments, the LbCpf1 lacking DNA cleavage activity comprises a D832A mutation relative to wild-type LbCpf1.
In some embodiments, the Cpf1 lacking DNA cleavage activity retains its RNA cleavage activity and can therefore process pre-crRNA into mature crRNA. Accordingly, in some embodiments, the expression construct comprising a nucleotide sequence encoding a guide RNA in the system of the present invention may comprise a sequence encoding a tandem array of multiple different guide RNA (crRNA) precursors. After transcription, this array can be processed by the Cpf1 lacking DNA cleavage activity into multiple different guide RNAs (crRNAs), so that multiple different target sequences are targeted simultaneously.
In some embodiments of the present invention, the deaminase in the fusion protein is a cytidine deaminase, for example an apolipoprotein B mRNA editing complex (APOBEC) family deaminase.
Cytidine deaminase catalyzes the deamination of cytidine (C) on DNA to form uracil (U). The present inventors have surprisingly found that when a Cpf1 lacking DNA cleavage activity is fused to a cytidine deaminase, the fusion protein can be targeted to a target sequence in the genome under the direction of a guide RNA. Because the DNA cleavage activity of Cpf1 has been inactivated, the DNA double strand is not cleaved, while the cytidine deaminase in the fusion protein deaminates cytidine to U in the single-stranded DNA generated during formation of the Cpf1-guide RNA-DNA complex; a C to T substitution is then achieved through base mismatch repair.
The cytidine deaminase of the present invention is in particular a cytidine deaminase that can accept single-stranded DNA as a substrate. Examples of cytidine deaminases useful in the present invention include, but are not limited to, APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G and CDA1. In some specific embodiments of the present invention, the cytidine deaminase comprises the amino acid sequence shown in SEQ ID NO: 1.
Where the deaminase in the fusion protein is a cytidine deaminase, the base editing system of the present invention can mutate one or more C to T in the genomic target sequence, and is also referred to as the Cpf1-PBE system.
In cells, uracil DNA glycosylase catalyzes the removal of U from DNA and initiates base excision repair (BER), which repairs U:G back to C:G. Therefore, without being bound by any theory, including a uracil DNA glycosylase inhibitor in the base editing fusion protein of the present invention or in the system of the present invention is expected to increase the efficiency of base editing.
Accordingly, in some embodiments of the present invention relating to the Cpf1-PBE system, the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI). In some specific embodiments, the uracil DNA glycosylase inhibitor comprises the amino acid sequence shown in SEQ ID NO: 2.
In some embodiments of the present invention, the deaminase is an adenine deaminase.
Naturally occurring adenine deaminases usually act on RNA as substrate, converting adenosine on single-stranded RNA to inosine (I) by deamination. Recently, directed evolution of the Escherichia coli tRNA adenine deaminase TadA has yielded DNA-dependent adenine deaminases that accept single-stranded DNA as substrate and convert deoxyadenosine on single-stranded DNA to inosine (I). See Nicole M. Gaudelli et al., doi: 10.1038/nature24644, 2017.
The present inventors have surprisingly found that when a Cpf1 lacking DNA cleavage activity is fused to a DNA-dependent adenine deaminase, the fusion protein can be targeted to a target sequence in the plant genome under the direction of a guide RNA. Because the DNA cleavage activity of Cpf1 is absent, the DNA double strand is not cleaved, while the DNA-dependent adenine deaminase in the fusion protein deaminates adenosine to inosine (I) in the single-stranded DNA generated during formation of the Cpf1-guide RNA-DNA complex. Since DNA polymerase treats inosine (I) as guanine (G), an A to G substitution can be achieved through base mismatch repair. Therefore, where the deaminase in the fusion protein is a DNA-dependent adenine deaminase, the base editing system of the present invention can mutate one or more A to G in the genomic target sequence, and is also referred to as the Cpf1-ABE system.
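The expected sequence outcome of the two systems described above can be illustrated with a minimal sketch. The function name, the example protospacer and, in particular, the editing window are hypothetical; the description does not specify which positions are accessible to the deaminase, so the window here is purely an assumption for illustration.

```python
def apply_base_edit(protospacer: str, editor: str, window: range) -> str:
    """Return the sequence after converting every editable base inside `window`.

    editor: "PBE" models C to T conversion, "ABE" models A to G conversion.
    window: 0-based positions in the protospacer (assumed, not taken from the source).
    """
    source, product = {"PBE": ("C", "T"), "ABE": ("A", "G")}[editor]
    bases = list(protospacer.upper())
    for i in window:
        if i < len(bases) and bases[i] == source:
            bases[i] = product
    return "".join(bases)


site = "ACCTGACATTCAGGACCTGATCA"  # made-up 23-nt protospacer
pbe_result = apply_base_edit(site, "PBE", range(7, 13))
abe_result = apply_base_edit(site, "ABE", range(7, 13))
print(pbe_result)
print(abe_result)
```

In practice only a fraction of the bases in the accessible region are converted in any given cell, so this sketch shows the maximal outcome rather than a prediction of editing efficiency.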
In some embodiments of the present invention, the DNA-dependent adenine deaminase is a variant of the E. coli tRNA adenine deaminase TadA (ecTadA), in particular a variant that can accept single-stranded DNA as a substrate, the variant comprising, relative to wild-type ecTadA, one or more groups of mutations selected from the following:
1) A106V and D108N;
2) D147Y and E155V;
3) L84F, H123Y and I156F;
4) A142N;
5) H36L, R51L, S146C and K157N;
6) P48S/T/A;
7) A142N;
8) W23L/R;
9) R152H/P.
In one specific embodiment of the present invention, the DNA-dependent adenine deaminase (ABE version 7.9) comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, A142N, D147Y, E155V and R152P.
In one specific embodiment of the present invention, the DNA-dependent adenine deaminase (ABE version 7.10) comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, D147Y, E155V and R152P.
The wild-type ecTadA amino acid sequence is as follows: MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECAALLSDFFRMRRQEIKAQKKAQSSTD (SEQ ID NO: 3). In some embodiments, the initial methionine may be absent.
The amino acid sequence of the preferred ecTadA-derived DNA-dependent adenine deaminase (ABE version 7.10) is as follows: MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGILADECAALLCYFFRMPRQVFNAQKKAQSSTD (SEQ ID NO: 4). In some embodiments, the initial methionine may be absent.
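The relationship between the two sequences above can be checked mechanically: applying the fourteen ABE version 7.10 substitutions to the wild-type ecTadA sequence (SEQ ID NO: 3) should reproduce SEQ ID NO: 4. The following is a small sketch of that check; the function and variable names are illustrative and not part of the disclosure.

```python
import re

WT_ECTADA = (
    "MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIMALRQGGLVMQ"
    "NYRLIDATLYVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECA"
    "ALLSDFFRMRRQEIKAQKKAQSSTD"
)  # SEQ ID NO: 3
ABE_7_10 = (
    "MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLVMQ"
    "NYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGILADECA"
    "ALLCYFFRMPRQVFNAQKKAQSSTD"
)  # SEQ ID NO: 4
ABE_7_10_MUTATIONS = [
    "W23R", "H36L", "R51L", "S146C", "K157N", "A106V", "D108N",
    "P48A", "L84F", "H123Y", "I156F", "D147Y", "E155V", "R152P",
]


def apply_mutations(seq: str, mutations: list[str]) -> str:
    """Apply substitutions written as <old><1-based position><new>, e.g. W23R."""
    residues = list(seq)
    for mutation in mutations:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
        if match is None:
            raise ValueError(f"cannot parse mutation {mutation!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if residues[pos - 1] != old:
            # Guards against numbering that does not match the reference sequence.
            raise ValueError(f"{mutation}: found {residues[pos - 1]} at position {pos}")
        residues[pos - 1] = new
    return "".join(residues)


mutant = apply_mutations(WT_ECTADA, ABE_7_10_MUTATIONS)
print(mutant == ABE_7_10)
```

The residue check inside the loop also confirms that the mutation numbering counts the initial methionine as position 1.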
In some embodiments of the present invention, the deaminase is fused to the N-terminus of the Cpf1 lacking DNA cleavage activity. In some embodiments, the deaminase is fused to the C-terminus of the Cpf1 lacking DNA cleavage activity.
In some preferred embodiments, the corresponding wild-type adenine deaminase is fused to the N-terminus of the DNA-dependent adenine deaminase. Formation of a heterodimer between the DNA-dependent adenine deaminase and the wild-type adenine deaminase is expected to significantly increase the A to G editing activity of the fusion protein.
In some embodiments of the present invention, the deaminase and the Cpf1 lacking DNA cleavage activity are fused via a linker. The linker may be a non-functional amino acid sequence, free of secondary or higher-order structure, that is 1-50 (for example 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or 20-25, 25-50) or more amino acids in length. For example, the linker may be a flexible linker such as GGGGS, GS, GAP, (GGGGS)x3, GGS or (GGS)x7. In some specific embodiments, the linker is an XTEN linker. In some specific embodiments, the linker is 32 amino acids in length. In some specific embodiments, the amino acid sequence of the linker is: SGGSSGGSSGSETPGTSESATPESSGGSSGGS.
In some embodiments of the present invention, the base editing fusion protein of the present invention further comprises a nuclear localization sequence (NLS). In general, the one or more NLSs in the base editing fusion protein should be strong enough to drive accumulation of the base editing fusion protein in the nucleus of a plant cell in an amount sufficient for its base editing function. In general, the strength of nuclear localization activity is determined by the number and position of the NLSs in the base editing fusion protein, the particular NLS or NLSs used, or a combination of these factors.
In some embodiments of the present invention, the NLS of the base editing fusion protein of the present invention may be located at the N-terminus and/or the C-terminus. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs at or near the N-terminus. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs at or near the C-terminus. In some embodiments, the base editing fusion protein comprises a combination of these, such as one or more NLSs at the N-terminus and one or more NLSs at the C-terminus. When more than one NLS is present, each may be selected independently of the others. In some preferred embodiments of the present invention, the base editing fusion protein comprises 2 NLSs, for example one located at the N-terminus and one at the C-terminus.
In general, an NLS consists of one or more short sequences of positively charged lysines or arginines exposed on the protein surface, but other types of NLS are also known. Non-limiting examples of NLSs include: KKRKV (nucleotide sequence 5'-AAGAAGAGAAAGGTC-3'), PKKKRKV (nucleotide sequence 5'-CCCAAGAAGAAGAGGAAGGTG-3' or 5'-CCAAAGAAGAAGAGGAAGGTT-3'), and SGGSPKKKRKV (nucleotide sequence 5'-TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG-3').
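As a consistency check on the NLS examples above, the listed nucleotide sequences can be translated with the standard genetic code and compared with the stated peptides. The sketch below builds the standard codon table in its usual compact form; the helper name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence (5' to 3', length divisible by 3) to protein."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


nls_examples = {
    "AAGAAGAGAAAGGTC": "KKRKV",
    "CCCAAGAAGAAGAGGAAGGTG": "PKKKRKV",
    "CCAAAGAAGAAGAGGAAGGTT": "PKKKRKV",
    "TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG": "SGGSPKKKRKV",
}
for dna, peptide in nls_examples.items():
    print(dna, translate(dna), translate(dna) == peptide)
```

The two different nucleotide sequences given for PKKKRKV differ only in synonymous codons (CCC/CCA for proline, GTG/GTT for valine), which is the same degeneracy that codon optimization exploits.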
In some embodiments of the present invention, the N-terminus of the base editing fusion protein comprises an NLS having the amino acid sequence PKKKRKV. In some embodiments of the present invention, the C-terminus of the base editing fusion protein comprises an NLS having the amino acid sequence SGGSPKKKRKV or KRPAATKKAGQAKKKK.
In addition, depending on the location of the DNA to be edited, the base editing fusion protein of the present invention may also include other localization sequences, such as a cytoplasmic localization sequence, a chloroplast localization sequence or a mitochondrial localization sequence.
In some embodiments of the present invention relating to the Cpf1-PBE system, the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI), with two NLSs immediately adjacent to the N-terminus or C-terminus of the UGI. In some preferred embodiments, the base editing fusion protein of the present invention comprises an amino acid sequence selected from SEQ ID NOs: 24-29.
To obtain efficient expression, in some embodiments of the present invention, the nucleotide sequence encoding the base editing fusion protein is codon-optimized for the species of the organism to be base edited.
Codon optimization refers to a method of modifying a nucleic acid sequence to enhance expression in a host cell of interest by replacing at least one codon of the native sequence (for example about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50 or more codons) with a codon that is used more frequently or most frequently in the genes of that host cell, while maintaining the native amino acid sequence. Different species exhibit particular preferences for certain codons of a given amino acid. Codon bias (differences in codon usage between organisms) often correlates with the translation efficiency of messenger RNA (mRNA), which in turn is thought to depend on the properties of the codons being translated and on the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell generally reflects the codons most frequently used in peptide synthesis. Genes can therefore be tailored by codon optimization for optimal expression in a given organism. Codon usage tables are readily available, for example in the Codon Usage Database at www.kazusa.or.jp/codon/, and these tables can be adapted in different ways. See Nakamura Y. et al., "Codon usage tabulated from the international DNA sequence databases: status for the year 2000." Nucl. Acids Res., 28:292 (2000).
In some specific embodiments, the base editing fusion protein of the present invention is encoded by a nucleotide sequence selected from SEQ ID NOs: 8-9, 11-12 or 14-15.
In some embodiments of the present invention, the nucleotide sequence encoding the base editing fusion protein and/or the nucleotide sequence encoding the guide RNA is operably linked to an expression control element such as a promoter.
Examples of promoters that can be used in the present invention include, but are not limited to, polymerase (pol) I, pol II and pol III promoters. Examples of pol I promoters include the chicken RNA pol I promoter. Examples of pol II promoters include, but are not limited to, the cytomegalovirus immediate early (CMV) promoter, the Rous sarcoma virus long terminal repeat (RSV-LTR) promoter and the simian virus 40 (SV40) immediate early promoter. Examples of pol III promoters include the U6 and H1 promoters. Inducible promoters such as the metallothionein promoter may be used. Other examples of promoters include the T7 phage promoter, the T3 phage promoter, the β-galactosidase promoter and the Sp6 phage promoter. For use in plants, the promoter may be the cauliflower mosaic virus 35S promoter, the maize Ubi-1 promoter, the wheat U6 promoter, the rice U3 promoter, the maize U3 promoter or the rice actin promoter.
Preferably, the guide RNA (crRNA) is expressed from the Ubi-1 promoter and matured by cleavage with a ribozyme, for example the HDV ribozyme.
In one embodiment, adding an intron after the Ubi-1 promoter can enhance expression of the protein or RNA of interest.
In some specific embodiments, the expression construct for expressing the base editing fusion protein of the present invention comprises the expression cassette shown in SEQ ID NO: 10 or 13. Alternatively, the expression construct comprises the expression control sequence shown in SEQ ID NO: 30.
3. Methods of producing genetically modified organisms
In another aspect, the present invention provides a method of producing a genetically modified organism (such as a plant), comprising introducing the system of the present invention for base editing of a target sequence in the genome of an organism into a cell of the organism, whereby the guide RNA targets the base editing fusion protein to a target sequence in the plant genome, resulting in one or more C being replaced by T, or one or more A being replaced by G, in the target sequence.
For the design of target sequences that can be recognized and targeted by the complex of Cpf1 protein and guide RNA (that is, crRNA), or of crRNA coding sequences, see for example Zhang et al., Cell 163, 1-13, October 22, 2015. In general, the 5' end of the target sequence targeted by the genome editing system of the present invention must contain a protospacer adjacent motif (PAM), 5'-TTTN or 5'-YTN, where each N is independently selected from A, G, C and T, and Y is selected from C and T.
For example, in some embodiments of the present invention, the target sequence has the following structure: 5'-TTTN-Nx-3' or 5'-YTN-Nx-3', where each N is independently selected from A, G, C and T; Y is selected from C and T; X is an integer with 15 ≤ X ≤ 35; and Nx denotes X consecutive nucleotides.
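The target structure described above lends itself to a simple pattern search. The following is a minimal sketch of how candidate sites of the form 5'-TTTN-Nx-3' or 5'-YTN-Nx-3' might be located on one strand of a sequence; the function name, the default X = 23 and the example sequence are assumptions made for illustration, and a real design workflow would also scan the reverse complement and check for off-target matches.

```python
import re

IUPAC = {"N": "[ACGT]", "Y": "[CT]"}


def find_cpf1_targets(seq: str, pam: str = "TTTN", x: int = 23):
    """Return (start, PAM, protospacer) for each 5'-PAM-Nx-3' site on the given strand.

    x is the number of nucleotides following the PAM; the text allows 15 <= X <= 35.
    """
    if not 15 <= x <= 35:
        raise ValueError("X must satisfy 15 <= X <= 35")
    pam_re = "".join(IUPAC.get(base, base) for base in pam.upper())
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(rf"(?=({pam_re})([ACGT]{{{x}}}))")
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq.upper())]


example = "GGTTTACCTGACATTCAGGACCTGATCAGG"  # made-up sequence
hits = find_cpf1_targets(example, pam="TTTN", x=23)
for start, pam, protospacer in hits:
    print(start, pam, protospacer)
```

Passing `pam="YTN"` relaxes the search to the shorter motif, which typically returns many more candidate sites on the same sequence.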
In the present invention, the target sequence to be modified may be located anywhere in the genome, for example within a functional gene such as a protein-coding gene, or in a gene expression regulatory region such as a promoter region or an enhancer region, so as to modify the function of the gene or the expression of the gene.
A to G or C to T base edits in the target sequence of the cell can be detected by T7EI, PCR/RE or sequencing methods.
In the method of the present invention, the base editing system can be introduced into cells by various methods well known to those skilled in the art. Methods that can be used to introduce the genome editing system of the present invention into cells include, but are not limited to: calcium phosphate transfection, protoplast fusion, electroporation, lipofection, microinjection, viral infection (such as baculovirus, vaccinia virus, adenovirus, adeno-associated virus, lentivirus and other viruses), the gene gun method, PEG-mediated protoplast transformation and Agrobacterium-mediated transformation.
Cells that can be genome edited by the method of the present invention may be derived from, for example, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle and cats; poultry such as chickens, ducks and geese; and plants, including monocotyledons and dicotyledons, such as rice, maize, wheat, sorghum, barley, soybean, peanut and Arabidopsis.
The method of the present invention is particularly suitable for producing genetically modified plants, for example crop plants. In the method of the present invention for producing a genetically modified plant, the base editing system can be introduced into plants by various methods well known to those skilled in the art. Methods that can be used to introduce the base editing system of the present invention into plants include, but are not limited to: the gene gun method, PEG-mediated protoplast transformation, Agrobacterium-mediated transformation, plant virus-mediated transformation, the pollen tube pathway method and the ovary injection method.
In the method of the present invention for producing a genetically modified plant, modification of the target sequence can be achieved simply by introducing or producing the base editing fusion protein and the guide RNA in a plant cell, and the modification can be stably inherited without stably transforming the plant with the base editing system. This avoids the potential off-target effects of a stably present base editing system and also avoids integration of exogenous nucleotide sequences into the plant genome, giving higher biosafety.
In some preferred embodiments, the introduction is performed in the absence of selection pressure, thereby avoiding integration of exogenous nucleotide sequences into the plant genome.
In some embodiments, the introduction comprises transforming the base editing system of the present invention into an isolated plant cell or tissue and then regenerating the transformed plant cell or tissue into a whole plant. Preferably, the regeneration is performed in the absence of selection pressure, that is, without using any selection agent against the selection gene carried on the expression vector during tissue culture. Omitting the selection agent can increase the regeneration efficiency of the plant and yield a modified plant free of exogenous nucleotide sequences.
In other embodiments, the base editing system of the present invention can be transformed into a particular part of an intact plant, such as a leaf, shoot tip, pollen tube, young ear or hypocotyl. This is particularly suitable for transforming plants that are difficult to regenerate by tissue culture.
In some embodiments of the present invention, an in vitro expressed protein and/or an in vitro transcribed RNA molecule is directly transformed into the plant. The protein and/or RNA molecule can carry out base editing in the plant cell and is subsequently degraded by the cell, avoiding integration of exogenous nucleotide sequences into the plant genome.
Plants that can be base edited by the method of the present invention include monocotyledons and dicotyledons. For example, the plant may be a crop plant such as wheat, rice, maize, soybean, sunflower, sorghum, rapeseed, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava or potato.
In some embodiments of the present invention, the target sequence is associated with a plant trait such as an agronomic trait, whereby the base editing results in the plant having an altered trait relative to a wild-type plant.
In the present invention, the target sequence to be modified may be located anywhere in the genome, for example within a functional gene such as a protein-coding gene, or in a gene expression regulatory region such as a promoter region or an enhancer region, so as to modify the function of the gene or the expression of the gene. Accordingly, in some embodiments of the present invention, the C to T or A to G substitution results in an amino acid substitution in the target protein or in truncation of the target protein (through generation of a stop codon). In other embodiments of the present invention, the C to T or A to G substitution results in a change in expression of the target gene.
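The truncation case mentioned above can be made concrete by enumerating which codons become stop codons after C to T conversion. The sketch below considers only C to T changes read on the coding strand; the function name is illustrative. Edits made on the opposite strand appear as G to A changes on the coding strand and are not covered here.

```python
from itertools import combinations, product

STOP_CODONS = {"TAA", "TAG", "TGA"}


def stop_codons_from_c_to_t(codon: str) -> list[str]:
    """Return the stop codons reachable by converting one or more C in `codon` to T."""
    codon = codon.upper()
    c_positions = [i for i, base in enumerate(codon) if base == "C"]
    reachable = set()
    # Any non-empty subset of the C positions may be converted.
    for size in range(1, len(c_positions) + 1):
        for subset in combinations(c_positions, size):
            edited = "".join("T" if i in subset else b for i, b in enumerate(codon))
            if edited in STOP_CODONS:
                reachable.add(edited)
    return sorted(reachable)


all_codons = ["".join(c) for c in product("ACGT", repeat=3)]
susceptible = {c: stop_codons_from_c_to_t(c) for c in all_codons if stop_codons_from_c_to_t(c)}
print(susceptible)
```

Under the standard genetic code the susceptible codons are two glutamine codons and one arginine codon, so a target site containing one of these in frame is a candidate for introducing a premature stop with a C to T editor.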
In some embodiments of the present invention, the method further comprises obtaining progeny of the genetically modified plant.
In another aspect, the present invention also provides a genetically modified plant, or a progeny or part thereof, wherein the plant is obtained by the above method of the present invention.
In another aspect, the present invention also provides a plant breeding method, comprising crossing a genetically modified first plant obtained by the above method of the present invention with a second plant that does not contain the genetic modification, thereby introducing the genetic modification into the second plant.
Examples
Construction of Ubi-CPF1-PBE/ABE expression vectors
The ABE, XTEN and dCPF1 sequences were codon-optimized for plants and ordered from GenScript (Nanjing). The full-length dCPF1-ABE fragment was amplified using the primer pair HindIII-F (containing a HindIII restriction site) and EcoRI (containing an EcoRI restriction site). The PCR product was digested with HindIII and EcoRI and then inserted into the pJIT163-GFP vector (vector sequence shown in SEQ ID NO: 16) that had been digested with the same two enzymes, generating the fusion expression vector dCPF1-ABE.
The PBE, XTEN and dCPF1 sequences were codon-optimized for plants and ordered from GenScript (Nanjing). The full-length dCPF1-PBE fragment was amplified using the primer pair HindIII-F (containing a HindIII restriction site) and EcoRI (containing an EcoRI restriction site). The PCR product was digested with HindIII and EcoRI and then inserted into the pJIT163-GFP vector (vector sequence shown in SEQ ID NO: 16) that had been digested with the same two enzymes, generating the fusion expression vector dCPF1-PBE.
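When cloning with HindIII and EcoRI as described above, it is common practice to check that the fragment itself contains no internal recognition site for either enzyme. The Python sketch below shows one way such a check might look. The function name and the test sequence are hypothetical; only the recognition sequences (AAGCTT for HindIII, GAATTC for EcoRI) are standard facts. Because both sites are palindromic, scanning one strand is sufficient.

```python
# Recognition sequences of the two enzymes used for cloning.
SITES = {"HindIII": "AAGCTT", "EcoRI": "GAATTC"}


def internal_sites(insert):
    """Return {enzyme: [0-based start positions]} for each recognition
    site found in `insert` (a hypothetical DNA string)."""
    seq = insert.upper()
    found = {}
    for name, site in SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        found[name] = positions
    return found


if __name__ == "__main__":
    # Toy fragment with flanking HindIII sites and one EcoRI site.
    print(internal_sites("ccAAGCTTggGAATTCaaAAGCTT"))
```

A fragment that reports positions other than those deliberately added by the primers would be cut internally during digestion, so a different enzyme pair or a silent mutation of the site would be needed.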
Construction of sgRNA expression vectors
sgRNA expression vectors were constructed as previously described (Wang, Y. et al. Simultaneous editing of three homoeoalleles in hexaploid bread wheat confers heritable resistance to powdery mildew. Nat. Biotechnol. 32, 947-951, 2014; Shan, Q. et al. Targeted genome modification of crop plants using a CRISPR-Cas system. Nat. Biotechnol. 31, 686-688, 2013; and Liang, Z. et al. Targeted mutagenesis in Zea mays using TALENs and the CRISPR/Cas system. J. Genet. Genomics 41, 63-68, 2014), based on pTaU6-sgRNA (Addgene ID 53062), pOsU3-sgRNA (Addgene ID 53063), pZmU3-sgRNA (Addgene ID 5306) or OsU3/TaU6-tRNA-sgRNA (Zhang et al. 2017. Genome Biology. DOI: 10.1186/s13059-017-1325-9). In addition, crRNA can be produced by using a type II promoter to drive expression of a hammerhead ribozyme together with the crRNA (Tang et al., Nature Plants, doi: 10.1038/nplants.2017.18).
pUbi-mGFPP-crRNA, pUbi-DEP1-sgRNA, pUbi-DEP1-crRNA, pUbi-DME-crRNA.
BFP and GFP expression vectors
pUbi-mGFP; the vector sequence is shown in SEQ ID NO: 17.
Protoplast assay
The wheat variety Bobwhite and the rice variety Nipponbare were used in this study. Protoplasts were transformed as described below. The average transformation efficiency was 55-70%. 10 μg of each plasmid was introduced by PEG-mediated transformation; after 48 hours, the protoplasts were collected and DNA was extracted for T7EI and PCR-RE assays.
Preparation and transformation of wheat protoplasts
1) Take young wheat leaves, cut the middle portion into 0.5-1 mm strips, and incubate them in 0.6 M mannitol solution for 10 min in the dark. Filter through a strainer, transfer the strips into 50 ml of enzyme solution, and digest for 5 h at 20-25°C in the dark with gentle shaking at 10 rpm.
2) Add 10 ml of W5 to dilute the digest, and filter it through a 75 μm nylon membrane into a 50 ml round-bottom centrifuge tube.
3) Centrifuge at 100 g for 3 min at 23°C and discard the supernatant.
4) Gently resuspend in 10 ml of W5, leave on ice for 30 min so that the protoplasts gradually settle, and discard the supernatant.
5) Resuspend in an appropriate volume of MMG and keep on ice until transformation.
6) To a 2 ml centrifuge tube add 10-20 μg of plasmid, 200 μl of protoplasts (about 4×10^5 cells) and 220 μl of freshly prepared PEG solution, mix, and leave at room temperature in the dark for 10-20 min to induce transformation.
7) After induction, slowly add 880 μl of W5 solution, mix by gentle inversion, centrifuge horizontally at 100 g for 3 min, and discard the supernatant.
8) Resuspend in 2 ml of W5 solution, transfer to a six-well plate, and culture in the dark at room temperature (or 25°C). For extraction of protoplast genomic DNA, culture for 48 h.
Preparation and transformation of rice protoplasts:
1) Use the leaf sheaths of seedlings for protoplast isolation, and cut them into strips about 0.5 mm wide with a sharp blade.
2) Immediately after cutting, transfer the strips into 0.6 M mannitol solution and leave in the dark for 10 min.
3) Filter off the mannitol solution, transfer the strips into the enzyme solution, and apply a vacuum for 30 min in the dark.
4) Digest for 5-6 h in the dark with gentle shaking (decolorizing shaker, speed 10).
5) After digestion, add an equal volume of W5 and shake horizontally for 10 s to release the protoplasts.
6) Filter the protoplasts through a 40 μm nylon membrane into a 50 ml round-bottom centrifuge tube, and rinse with W5 solution.
7) Centrifuge horizontally at 250 g for 3 min to pellet the protoplasts, and discard the supernatant.
8) Resuspend the protoplasts in 10 ml of W5, centrifuge at 250 g for 3 min, and discard the supernatant.
9) Resuspend the protoplasts in an appropriate volume of MMG solution to a concentration of 2×10^6/ml.
Note: all of the above steps are performed at room temperature.
10) To a 2 ml centrifuge tube add 10-20 μg of plasmid, 200 μl of protoplasts (about 4×10^5 cells) and 220 μl of freshly prepared PEG solution, mix, and leave at room temperature in the dark for 10-20 min to induce transformation.
11) After induction, slowly add 880 μl of W5 solution, mix by gentle inversion, centrifuge horizontally at 250 g for 3 min, and discard the supernatant.
12) Resuspend in 2 ml of WI solution, transfer to a six-well plate, and culture in the dark at room temperature (or 25°C). For extraction of protoplast genomic DNA, culture for 48 h.
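The cell numbers in the protocols above are mutually consistent: 200 μl of a suspension at 2×10^6 protoplasts/ml contains about 4×10^5 cells. The short Python sketch below reproduces this arithmetic and shows how the MMG resuspension volume might be chosen for a given yield. The function names and the example yield of 1×10^7 cells are illustrative assumptions, not values from the patent.

```python
def cells_per_transformation(conc_per_ml, volume_ul):
    """Number of protoplasts contained in `volume_ul` microlitres of a
    suspension at `conc_per_ml` cells per millilitre."""
    return conc_per_ml * volume_ul / 1000.0


def resuspension_volume_ml(total_cells, target_conc_per_ml=2e6):
    """Volume of MMG (ml) needed to bring `total_cells` protoplasts to
    the target concentration (default 2 x 10^6 per ml, as in step 9)."""
    return total_cells / target_conc_per_ml


if __name__ == "__main__":
    # 200 ul at 2e6 cells/ml, as used for each transformation.
    print(cells_per_transformation(2e6, 200))
    # Hypothetical yield of 1e7 protoplasts from one isolation.
    print(resuspension_volume_ml(1e7))
```

In practice the total yield would come from a haemocytometer count, which the sketch takes as given.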
PCR/RE detection:
1) Extract plant genomic DNA.
2) Synthesize gene-specific primers and amplify a fragment containing the target site, 350-1000 bp in length.
3) Typical reaction conditions: initial denaturation at 94°C for 5 min; 30 to 35 cycles of denaturation at 94°C for 30 s, annealing at 58°C for 30 s, and extension at 72°C for 30 s; final extension at 72°C for 5 min; hold at 12°C. Check 5 μl of the PCR product by electrophoresis.
4) Digest the PCR product with the restriction endonuclease. A typical digestion reaction is as follows:
5) Digest at 37°C for 2-3 h, and analyze by 1.2% agarose gel electrophoresis.
6) Recover and purify the uncut (mutant) band from the digested PCR product and perform TA cloning. The reaction is set up as follows:
7) Ligate at 22°C for 10 min, transform into E. coli competent cells, spread on LB agar plates (Amp100, IPTG and X-gal), incubate for 12-16 h, pick white colonies to identify positive clones, and send them for sequencing.
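The PCR/RE assay above relies on the fact that an edit within a restriction site makes the amplicon resistant to digestion, so the edited allele appears as an uncut band. The Python sketch below predicts fragment sizes for a wild-type and an edited amplicon. The amplicon, the choice of EcoRI and the function name are hypothetical and serve only to illustrate the principle; the sketch treats the amplicon as linear and ignores partial digestion.

```python
def digest_fragments(amplicon, site, cut_offset):
    """Return the fragment lengths produced by cutting `amplicon` at every
    occurrence of `site`; `cut_offset` is the cut position within the site
    on the top strand (1 for EcoRI, G^AATTC)."""
    seq = amplicon.upper()
    site = site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Hypothetical 306 bp amplicon with one EcoRI site at position 100.
    wild_type = "A" * 100 + "GAATTC" + "T" * 200
    # A C-to-T edit at the last base of the site destroys it.
    edited = wild_type[:105] + "T" + wild_type[106:]
    print(digest_fragments(wild_type, "GAATTC", 1))
    print(digest_fragments(edited, "GAATTC", 1))
```

On a gel, the wild-type amplicon would give two bands while the edited amplicon would stay at full length, which is the band recovered for TA cloning in step 6.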
Deep sequencing
The different sgRNA expression vectors were each co-transformed with the Ubi-CPF1-PBE/ABE expression vector into wheat and rice protoplasts. After 48 hours, the protoplasts were collected and DNA was extracted for deep sequencing. In the first round of PCR, the target region was amplified using site-specific primers. In the second round of PCR, forward and reverse barcodes were added to the ends of the PCR products for library construction. Equal amounts of the different PCR products were pooled. The samples were then sequenced on an Illumina HiSeq 4000 at the Beijing Genomics Institute.
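Once amplicon reads are available, base editing efficiency at a given position is typically estimated as the fraction of reads carrying the substituted base. The Python sketch below illustrates this calculation under simplifying assumptions that are not stated in the patent: reads are already aligned to the reference, have no insertions or deletions, and are given as plain strings. The function name and the toy reads are hypothetical.

```python
from collections import Counter


def substitution_frequency(reads, reference, position,
                           ref_base="C", alt_base="T"):
    """Fraction of reads showing `alt_base` at `position` (0-based),
    where the reference carries `ref_base`.

    Assumes gap-free reads aligned to the start of `reference`.
    """
    if reference[position].upper() != ref_base:
        raise ValueError("reference does not carry ref_base at this position")
    counts = Counter(
        read[position].upper() for read in reads if len(read) > position
    )
    total = sum(counts.values())
    return counts[alt_base] / total if total else 0.0


if __name__ == "__main__":
    reference = "ACGT"
    reads = ["ACGT", "ATGT", "ATGT", "ACGT"]
    # Two of four toy reads carry T at position 1.
    print(substitution_frequency(reads, reference, 1))
```

For an adenine base editor the same function could be called with `ref_base="A"` and `alt_base="G"`. Real data would first pass through demultiplexing, quality filtering and alignment, none of which is shown here.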
Examples
Example 1. Optimization of CPF1-mediated plant genome cleavage activity.
The reported editing activity of CPF1 in plant cells varies widely between publications, and cleavage activity also differs considerably among the different types of CPF1.
In this example, the cleavage activity of CPF1 in plant cells was improved by optimizing the nuclear localization of AsCPF1, FnCPF1 and LbCPF1 and by optimizing the promoter driving the crRNA. Vectors carrying AsCPF1, FnCPF1 and LbCPF1 with 1 to 4 NLSs were constructed, together with different vectors in which the crRNA is driven by U3/U6 or by UBI and released by ribozymes (Fig. 1). The PCR/RE results show that all three CPF1 proteins carrying 2 NLSs are functional, and that LbCPF1 is the most efficient (SEQ ID NOs: 5-7 are the coding sequences of ASCPF1-2NLS, FNCPF1-2NLS and LBCPF1-2NLS, respectively, from which the corresponding amino acid sequences can readily be derived). At the target site in the OsPDS gene, 2NLS-LbCPF1 is more efficient than NLS-LbCPF1 and than some other reported constructs.
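The statement that the amino acid sequence can readily be derived from a coding sequence can be illustrated with a minimal Python translation sketch using the standard genetic code. The function name is hypothetical; the input used in the example is the first 30 nucleotides of SEQ ID NO: 5 (ASCPF1-2NLS) as given in the sequence listing.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino
    acids, stopping at the first stop codon. Whitespace is ignored."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of SEQ ID NO: 5, copied from the sequence listing.
    print(translate("atgacgcagt tcgaggggtt caccaacctc"))
```

Sequences containing ambiguous bases would raise a `KeyError` in this sketch; a production tool would handle them explicitly.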
Example 2. CPF1-mediated C to T mutation in plant genomes (CPF1-PBE)
Based on the characteristics of CPF1 cleavage activity in plant cells, the following dCPF1-PBE systems were constructed: dAsCPF1-2NLS-PBE, dFnCPF1-2NLS-PBE and dLbCPF1-2NLS-PBE. The C-terminal NLS was placed either at one end of UGI or at both ends of UGI. The crRNA was driven by UBI1 and released by ribozyme cleavage. PCR/RE assays showed editing activity for dFnCPF1 and dLbCPF1, with higher activity when the NLS was at one end of UGI (SEQ ID NOs: 8 and 9 show the coding sequences of dFNCPF1-PBE-2NLS and dLbCPF1-2NLS-PBE, respectively, from which the corresponding amino acid sequences can readily be derived). In addition, an enhanced version, dCPF1-PBE2-X, was constructed by adding an intron after the ZmUbi-1 promoter to increase expression of dCPF1-PBE (SEQ ID NO: 10 shows the dLBCPF1-PBE-2NLS expression cassette including the ZmUbi-1 promoter and the intron sequence).
Example 3. CPF1-mediated A to G mutation in plant genomes (CPF1-ABE)
The following CPF1-ABE systems were constructed: dAsCPF1-1NLS-ABE, dFnCPF1-NLS-ABE and dLbCPF1-1NLS-ABE, as well as dAsCPF1-2NLS-ABE, dFnCPF1-2NLS-ABE and dLbCPF1-2NLS-ABE, where the ABE was used in two versions, ABE7.9 and ABE7.10. The crRNA was driven by UBI1 and released by ribozyme cleavage.
Results obtained with the GFP base editing reporter system of Fig. 3E show that dFnCPF1-ABE7.10 (SEQ ID NO: 11), dLbCPF1-ABE7.9 and dLbCPF1-ABE7.10 (SEQ ID NO: 12) are all functional, and that ABE7.10 is more active than ABE7.9 (Fig. 3F).
PCR/RE assays detected activity for dLbCPF1-ABE7.10, with the 2NLS version more active than the 1NLS version. In addition, enhanced versions of dCPF1-ABE2 were constructed: in one, an intron was added after the UBI1 promoter to increase expression of dCPF1-ABE (dCPF1-ABE2-X1) (SEQ ID NO: 13); in the others, the ABE was fused to the C-terminus of CPF1 (dCPF1-ABE2-X2/X3) (SEQ ID NOs: 14 and 15). With the GFP base editing reporter system of Fig. 3E, the editing activity of dCPF1-ABE2-X2/X3 was higher than that of dLbCPF1-ABE7.10 (Fig. 3G).
Example 4. Optimization of CPF1-mediated gene editing
To further improve the editing efficiency of CPF1, we continued to optimize the CPF1 system. First, all expression vectors for CPF1-mediated editing were driven by the BdUbi10 promoter to increase their expression. In addition, the crRNA was driven by a type II promoter, and the crRNA array was placed in the 5' UTR or 3' UTR of the expressed gene, so as to improve CPF1 editing efficiency by increasing mRNA expression.
Description of the sequences:
SEQ ID NO: 1 Cytidine deaminase amino acid sequence
SEQ ID NO: 2 Uracil DNA glycosylase inhibitor (UGI) amino acid sequence
SEQ ID NO: 3 Wild-type ecTadA amino acid sequence
SEQ ID NO: 4 ecTadA-derived DNA-dependent adenine deaminase (ABE version 7.10) amino acid sequence
SEQ ID NO: 5 ASCPF1-2NLS coding sequence
SEQ ID NO: 6 FNCPF1-2NLS coding sequence
SEQ ID NO: 7 LBCPF1-2NLS coding sequence
SEQ ID NO: 8 dFNCPF1-PBE-2NLS coding sequence
SEQ ID NO: 9 dLBCPF1-PBE-2NLS coding sequence
SEQ ID NO: 10 Promoter + intron + dLBCPF1-PBE-2NLS coding sequence
SEQ ID NO: 11 dFNCPF1-ABE7.10-2NLS coding sequence
SEQ ID NO: 12 dLBCPF1-ABE7.10-2NLS coding sequence
SEQ ID NO: 13 Promoter + intron + dLBCPF1-ABE2-x coding sequence
SEQ ID NO: 14 LBCPF1-ABE2-x2 coding sequence
SEQ ID NO: 15 LBCPF1-ABE2-x3 coding sequence
SEQ ID NO: 16 pJIT163-GFP
SEQ ID NO: 17 pBUI-mGFP
SEQ ID NO: 18 ASCPF1 amino acid sequence
SEQ ID NO: 19 FNCPF1 amino acid sequence
SEQ ID NO: 20 LBCPF1 amino acid sequence
SEQ ID NO: 21 ASCPF1-2NLS amino acid sequence
SEQ ID NO: 22 FNCPF1-2NLS amino acid sequence
SEQ ID NO: 23 LBCPF1-2NLS amino acid sequence
SEQ ID NO: 24 dFNCPF1-PBE-2NLS amino acid sequence
SEQ ID NO: 25 dLBCPF1-PBE-2NLS amino acid sequence
SEQ ID NO: 26 dFNCPF1-ABE7.10-2NLS amino acid sequence
SEQ ID NO: 27 dLBCPF1-ABE7.10-2NLS amino acid sequence
SEQ ID NO: 28 LBCPF1-ABE2-x2 amino acid sequence
SEQ ID NO: 29 LBCPF1-ABE2-x3 amino acid sequence
SEQ ID NO: 30 Promoter + intron nucleotide sequence.
Sequence Listing
<110> Institute of Genetics and Developmental Biology, Chinese Academy of Sciences
<120> Base editing system and method based on CPF1 protein
<130> 149132
<150> 201711403490.X
<151> 2017-12-22
<160> 30
<170> PatentIn version 3.5
<210> 1
<211> 227
<212> PRT
<213> Rattus norvegicus
<400> 1
Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg Arg
1 5 10 15
Ile Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu Arg
20 25 30
Lys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp Gly Gly Arg His Ser
35 40 45
Ile Trp Arg His Thr Ser Gln Asn Thr Asn Lys His Val Glu Val Asn
50 55 60
Phe Ile Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys Pro Asn Thr Arg
65 70 75 80
Cys Ser Ile Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys Ser
85 90 95
Arg Ala Ile Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu Phe
100 105 110
Ile Tyr Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg Gln
115 120 125
Gly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln Ile Met Thr
130 135 140
Glu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser Pro
145 150 155 160
Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu Trp Val Arg Leu
165 170 175
Tyr Val Leu Glu Leu Tyr Cys Ile Ile Leu Gly Leu Pro Pro Cys Leu
180 185 190
Asn Ile Leu Arg Arg Lys Gln Pro Gln Leu Thr Phe Phe Thr Ile Ala
195 200 205
Leu Gln Ser Cys His Tyr Gln Arg Leu Pro Pro His Ile Leu Trp Ala
210 215 220
Thr Gly Leu
225
<210> 2
<211> 87
<212> PRT
<213> Bacillus phage
<400> 2
Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val
1 5 10 15
Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile
20 25 30
Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu
35 40 45
Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr
50 55 60
Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile
65 70 75 80
Lys Met Leu Ser Gly Gly Ser
85
<210> 3
<211> 167
<212> PRT
<213> Escherichia coli
<400> 3
Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu
1 5 10 15
Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala
20 25 30
Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro
35 40 45
Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg
50 55 60
Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu
65 70 75 80
Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His
85 90 95
Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly
100 105 110
Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His
115 120 125
Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu
130 135 140
Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys
145 150 155 160
Lys Ala Gln Ser Ser Thr Asp
165
<210> 4
<211> 167
<212> PRT
<213> Escherichia coli
<400> 4
Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu
1 5 10 15
Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala
20 25 30
Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala
35 40 45
Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg
50 55 60
Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu
65 70 75 80
Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His
85 90 95
Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr Gly
100 105 110
Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn His
115 120 125
Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu
130 135 140
Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys
145 150 155 160
Lys Ala Gln Ser Ser Thr Asp
165
<210> 5
<211> 3990
<212> DNA
<213> Artificial Sequence
<220>
<223> ASCPF1-2NLS
<400> 5
atgacgcagt tcgaggggtt caccaacctc taccaggtca gcaagacgct gcggttcgag 60
ctgattccgc agggcaagac cctcaagcac atccaggagc agggctttat cgaggaggac 120
aaagcgcgga acgaccacta caaggagctc aagccgatca tcgaccggat ctacaagacc 180
tacgcggacc agtgcctgca gctcgtgcag ctcgactggg agaacctctc ggccgccatc 240
gactcctacc gcaaggagaa gacggaggag acacgcaacg cgctcatcga agaacaggct 300
acgtatcgca acgctatcca cgactacttc atcgggcgca cagataatct caccgacgcc 360
atcaacaagc gccacgccga aatctataag gggctgttta aggccgagct gttcaatggc 420
aaagtgctga agcaactggg caccgtcacg accacagagc atgagaacgc cctcctccgg 480
tcgttcgaca agtttacgac atactttagc ggcttttacg agaaccgcaa aaacgtgttc 540
agcgccgagg acattagcac cgccatcccg cataggatcg tgcaagacaa cttcccgaag 600
ttcaaggaga actgccacat cttcacccgc ctcatcacgg ccgtgccgtc cctccgcgag 660
cacttcgaga acgtgaagaa ggccatcggg atcttcgtgt ccacctcgat cgaagaggtg 720
ttctccttcc ccttctacaa tcagctgctg acgcagaccc agatcgacct ctacaaccag 780
ctcctcggcg gcatctcccg cgaggccggc accgagaaga tcaagggcct gaacgaggtg 840
ctcaatctcg ccatccagaa gaacgacgaa accgcgcaca tcattgcctc actcccacat 900
aggtttatcc ccctgtttaa gcagatcctc tccgaccgca acacgttgtc cttcatcctc 960
gaggagttca agagcgacga ggaggtcatc cagtccttct gcaagtacaa gaccctcctc 1020
cgcaacgaga atgtgctcga aaccgccgag gcgctgttca atgagctcaa tagcatcgat 1080
ctcacccaca tcttcatctc ccataagaag ctggaaacca tctccagcgc cctgtgcgac 1140
cactgggaca ccctccgcaa cgccctctac gagcggcgca tcagcgagct caccggcaag 1200
atcacgaagt cggcgaaaga gaaagtgcaa aggagcctca agcacgagga cattaacctg 1260
caggagatca tctccgccgc gggcaaggag ctgtccgagg ctttcaagca gaagacctcg 1320
gagatcctct ctcacgccca cgcggccctg gaccagccgc tgccgacgac cctgaaaaag 1380
caagaggaaa aggaaatcct caagtcccag ctggacagcc tcctggggct gtaccacctt 1440
ctcgactggt tcgccgtgga cgagtccaac gaggtcgatc cggagtttag cgcccgcctc 1500
accgggatca agcttgagat ggaacctagc ttgagcttct ataataaggc gcgcaattac 1560
gcgaccaaga agccgtattc cgtggagaag ttcaagctga acttccaaat gcccaccctg 1620
gccagcgggt gggacgttaa caaagagaag aacaacggag ccattctttt cgtgaaaaat 1680
gggttgtatt atttgggaat tatgccgaaa caaaaaggca ggtacaaggc gctcagcttc 1740
gagccaactg agaagacctc cgagggcttc gacaagatgt actacgatta ttttcctgac 1800
gctgcaaaga tgataccgaa gtgcagcact cagcttaagg cggtgacggc gcactttcag 1860
acccatacca cccccatcct cctctccaac aacttcatcg agccgctcga gatcaccaag 1920
gagatatacg atctgaataa tccagaaaag gaacccaaga agttccagac cgcctacgcc 1980
aagaagacgg gcgatcaaaa ggggtataga gaggcgctct gcaagtggat cgacttcacg 2040
cgcgatttcc tcagcaagta caccaagaca acctccatcg atctctcttc cctccgcccc 2100
tcttcccagt acaaggacct cggggagtac tacgccgaac tcaacccact cctgtatcac 2160
atctcgtttc agcgtatcgc ggaaaaggag atcatggacg ccgtcgaaac cggcaagttg 2220
tatcttttcc aaatctataa caaggacttc gcgaagggcc accacgggaa gccaaacctg 2280
cacaccctct actggacagg cctcttcagc ccggaaaatc tcgcgaagac gagcataaag 2340
ctgaacggcc aggcagaact cttctacagg ccgaagtcca ggatgaagcg catggctcat 2400
cgcctcggtg agaagatgct gaacaagaag ctgaaagatc aaaagacgcc aatccctgat 2460
acactgtatc aggagctgta cgattacgtg aaccaccgcc tctcacacga cctcagcgac 2520
gaggcccgcg cgctcctgcc aaacgtcatc acgaaggagg tcagccacga gatcataaag 2580
gatcggcggt ttacctctga caagttcttt ttccatgtcc ccatcacgct gaactaccag 2640
gccgcgaata gcccgtccaa gttcaaccag cgggtcaacg cgtatctcaa ggagcaccca 2700
gagacaccca taatcgggat tgaccggggg gagcggaacc tcatctacat caccgtcatc 2760
gactccaccg gaaagattct cgagcaacgg tcgctcaata ccatccagca gttcgactac 2820
cagaagaagc tcgacaaccg ggagaaggaa cgcgtcgccg cgaggcaggc ctggtccgta 2880
gtgggcacga tcaaagacct gaagcagggc tatctcagcc aggtcatcca tgagatagtg 2940
gatctcatga tccactacca agccgtcgtg gtcctcgaga atctcaattt cggattcaaa 3000
tccaagcgca caggcatcgc cgagaaggcg gtgtaccaac agttcgagaa aatgcttatc 3060
gacaagctca attgcctggt gctcaaggac tatccggcgg agaaggtcgg gggggtcctc 3120
aatccgtatc agctgaccga ccagtttacg tcatttgcga agatgggcac ccagagcggc 3180
ttccttttct atgtcccggc cccatatacc tcaaagattg atcccttgac cggattcgtg 3240
gacccgtttg tctggaagac catcaagaac catgagtcgc gtaagcattt cctggagggt 3300
ttcgacttcc tgcactatga tgtaaaaacc ggagacttca tcctgcattt caagatgaat 3360
cggaacctct ccttccagcg gggactccct ggcttcatgc ccgcttggga tatcgttttt 3420
gagaaaaatg aaacccaatt cgacgccaaa ggcacgcctt tcatcgcggg caagaggatt 3480
gtccctgtaa ttgagaacca tagattcacc gggcgttacc gtgacctgta ccccgcaaac 3540
gaactcatcg ccctcctgga ggagaaaggc atcgttttcc gcgacgggtc aaatatcctc 3600
cccaaactgc tcgagaacga tgatagccac gctattgaca cgatggtagc gctcatcaga 3660
tccgtgctgc aaatgagaaa ttcaaatgct gccactgggg aggattacat caactcccct 3720
gtgcgtgatc tcaatggcgt gtgcttcgat tctagatttc agaatcctga gtggccgatg 3780
gatgccgatg ctaacggcgc ataccacata gcattgaaag gacaactgtt gttgaaccat 3840
ctcaaggaga gcaaggacct taagctgcag aacggcatca gcaaccagga ttggcttgcc 3900
tatatccaag agctccgcaa ttccggcggc agcccaaaga agaagaggaa ggtgagcggc 3960
ggcagcccaa agaagaagcg caaggtctag 3990
<210> 6<210> 6
<211> 3969<211> 3969
<212> DNA<212> DNA
<213> Artificial Sequence<213> Artificial Sequence
<220><220>
<223> FNCPF1-2NLS<223> FNCPF1-2NLS
<400> 6<400> 6
atgtccatct accaggagtt cgtcaataag tactcactct ctaagaccct gcggttcgag 60
ctgatcccgc agggcaagac actcgagaac atcaaggcgc gcggcctgat tctcgacgat 120
gagaagcggg ccaaggacta caagaaggcg aagcagatca ttgataagta ccaccagttc 180
ttcatcgagg agattctgtc cagcgtgtgc atctctgagg atctcctgca gaattactcc 240
gacgtctact tcaagctcaa gaagtctgac gatgacaacc tgcagaagga tttcaagtcc 300
gccaaggaca ccatcaagaa gcagatttct gagtacatca aggattccga gaagttcaag 360
aatctcttca accagaatct gattgatgcg aagaagggcc aggagtctga cctgatcctc 420
tggctgaagc agtccaagga caatggcatt gagctgttca aggccaacag cgatatcacc 480
gatattgacg aggcgctgga gatcattaag tcattcaagg gctggaccac atacttcaag 540
ggcttccatg agaaccggaa gaatgtgtac tcatctaacg acattccgac ctccatcatc 600
tacaggatcg tcgatgacaa tctgccaaag ttcctcgaga acaaggccaa gtacgagtcc 660
ctcaaggaca aggccccgga ggcgattaat tacgagcaga tcaagaagga tctggcggag 720
gagctgacct tcgatatcga ctacaagaca agcgaggtga accagagggt gttctccctc 780
gatgaggtgt tcgagatcgc caatttcaac aattacctga accagtccgg cattaccaag 840
ttcaatacaa tcattggcgg caagttcgtc aacggcgaga ataccaagcg caagggcatt 900
aacgagtaca tcaatctcta ctcccagcag atcaacgaca agaccctgaa gaagtacaag 960
atgtctgtgc tcttcaagca gatcctgtcc gatacagagt ccaagagctt cgtcattgat 1020
aagctcgagg acgacagcga cgtggtcacc acaatgcagt cattctacga gcagatcgcc 1080
gcgttcaaga ccgtggagga gaagagcatt aaggagacac tctcactcct gttcgatgac 1140
ctgaaggccc agaagctcga cctgagcaag atctacttca agaacgataa gagcctcaca 1200
gacctgtcac agcaggtgtt cgatgactac tcagtgattg gcaccgccgt cctcgagtac 1260
attacacagc agatcgcgcc aaagaacctc gataatcctt ctaagaagga gcaggagctg 1320
atcgccaaga aaaccgagaa ggcgaagtac ctctccctgg agacaattaa gctcgccctg 1380
gaggagttca ataagcacag ggatattgac aagcagtgcc gcttcgagga gatcctcgcg 1440
aacttcgccg cgatcccaat gattttcgat gagatcgccc agaacaagga caatctggcg 1500
cagatctcta ttaagtacca gaaccagggc aagaaggacc tcctgcaggc ctccgcagag 1560
gacgacgtga aggccatcaa ggatctcctg gaccagacca acaatctcct gcacaagctc 1620
aagatcttcc atatttcaca gtctgaggat aaggccaata tcctcgataa ggacgagcat 1680
ttctacctgg tgttcgagga gtgctacttc gagctggcga acattgtccc tctgtacaac 1740
aagattagga attacatcac acagaagccg tacagcgacg agaagttcaa gctcaacttc 1800
gagaattcaa ccctggccaa cggctgggat aagaataagg agcctgacaa cacagcgatc 1860
ctcttcatca aggacgacaa gtactacctg ggcgtgatga ataagaagaa caataagatc 1920
ttcgatgaca aggccattaa ggagaacaag ggcgagggct acaagaagat cgtgtacaag 1980
ctcctgcctg gcgccaataa gatgctcccg aaggtgttct tctccgcgaa gtccattaag 2040
ttctacaacc caagcgagga tatcctcagg atcaggaacc actctaccca tacaaagaac 2100
ggctcccctc agaagggcta cgagaagttc gagttcaata tcgaggattg ccggaagttc 2160
attgacttct acaagcagtc catcagcaag caccctgagt ggaaggattt cggcttccgc 2220
ttcagcgaca cccagcggta caactcaatc gatgagttct acagggaggt ggagaatcag 2280
ggctacaagc tcacattcga gaacatttca gagtcttaca tcgactccgt ggtcaatcag 2340
ggcaagctct acctgttcca gatctacaac aaggatttca gcgcctactc aaagggcagg 2400
ccgaacctcc ataccctgta ctggaaggcg ctcttcgatg agcgcaatct gcaggacgtg 2460
gtctacaagc tcaacggcga ggccgagctg ttctaccgca agcagtctat tccgaagaag 2520
atcacacacc cagcgaagga ggccatcgcg aacaagaata aggacaatcc gaagaaggag 2580
tccgtgttcg agtacgatct cattaaggac aagcggttca ccgaggataa gttcttcttc 2640
cattgcccaa tcacaattaa cttcaagtcc agcggcgcca acaagttcaa tgacgagatc 2700
aatctcctgc tcaaggagaa ggcgaacgat gtgcatatcc tctccattga ccgcggcgag 2760
cggcatctcg cctactacac cctggtcgat ggcaagggca atatcattaa gcaggatacc 2820
ttcaacatca ttggcaatga ccgcatgaaa accaactacc acgataagct cgccgcgatc 2880
gagaaggata gggacagcgc gcgcaaggac tggaagaaga tcaacaatat taaggagatg 2940
aaggagggct acctctcaca ggtggtccat gagattgcca agctcgtgat cgagtacaat 3000
gcgattgtgg tgttcgagga cctcaacttc ggcttcaagc ggggcaggtt caaggtggag 3060
aagcaggtct accagaagct ggagaagatg ctcatcgaga agctcaatta cctggtgttc 3120
aaggataacg agttcgacaa gaccggagga gtcctcaggg cataccagct gaccgcgcca 3180
ttcgagacat tcaagaagat gggcaagcag acaggcatca tctactacgt gccagccggc 3240
ttcacctcca agatctgccc tgtgacaggc ttcgtcaacc agctctaccc taagtacgag 3300
tccgtcagca agtcacagga gttcttcagc aagttcgata agatttgcta caatctggac 3360
aagggctact tcgagttctc attcgattac aagaacttcg gcgacaaggc cgcgaagggc 3420
aagtggacca ttgcgtcttt cggctcccgg ctcatcaact tcaggaatag cgataagaac 3480
cacaattggg acaccaggga ggtgtacccg acaaaggagc tggagaagct gctcaaggac 3540
tactcaatcg agtacggcca tggcgagtgc attaaggccg cgatctgcgg cgagagcgat 3600
aagaagttct tcgccaagct gacctcagtc ctcaatacaa tcctgcagat gaggaactct 3660
aagaccggca cagagctgga ctacctgatt tccccagtgg ccgatgtcaa cggcaatttc 3720
ttcgacagcc gccaggcgcc gaagaatatg ccacaggacg ccgacgccaa cggcgcatac 3780
cacatcggcc tcaagggcct gatgctgctc ggacgcatta agaacaatca ggagggcaag 3840
aagctcaacc tcgtgatcaa gaacgaggag tacttcgagt tcgtccaaaa ccggaacaat 3900
tccggcggca gcccaaagaa gaagaggaag gtgagcggcg gcagcccaaa gaagaagcgc 3960
aaggtctag 3969
<210> 7
<211> 3753
<212> DNA
<213> Artificial Sequence
<220>
<223> LBCPF1-2NLS
<400> 7
atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60
gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120
gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180
tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240
ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300
ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360
aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420
gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480
atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540
actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600
gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660
gaaggtgagt tttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720
atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780
ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840
ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900
ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960
ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020
ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080
aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140
tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200
caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagat aatcatccaa 1260
aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320
gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380
gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440
aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500
gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560
ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620
gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680
aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740
atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800
aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860
ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920
gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980
gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040
tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100
atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160
acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220
ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280
gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340
tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400
gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460
aagcatgacg acaatcctta cgtcattggg attgatcggg gcgagaggaa cctcctctat 2520
attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580
aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640
aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700
gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760
gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820
caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880
aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940
gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000
acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060
attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120
gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180
aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240
aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300
aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360
aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420
attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480
ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540
gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600
gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660
tatgctcaga catccgtaaa gcattccggc ggcagcccaa agaagaagag gaaggtgagc 3720
ggcggcagcc caaagaagaa gcgcaaggtc tag 3753
<210> 8
<211> 4995
<212> DNA
<213> Artificial Sequence
<220>
<223> dFNCPF1-PBE-2NLS
<400> 8
atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60
ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120
aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180
accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240
aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300
ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360
atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420
ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480
aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540
tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600
aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660
cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720
acgccaggca cctccgagag cgctacgcct gaacttaagt ccatctacca ggagttcgtc 780acgccaggca cctccgagag cgctacgcct gaacttaagt ccatctacca ggagttcgtc 780
aataagtact cactctctaa gaccctgcgg ttcgagctga tcccgcaggg caagacactc 840aataagtact cactctctaa gaccctgcgg ttcgagctga tcccgcaggg caagacactc 840
gagaacatca aggcgcgcgg cctgattctc gacgatgaga agcgggccaa ggactacaag 900gagaacatca aggcgcgcgg cctgattctc gacgatgaga agcgggccaa ggactacaag 900
aaggcgaagc agatcattga taagtaccac cagttcttca tcgaggagat tctgtccagc 960aaggcgaagc agatcattga taagtaccac cagttcttca tcgaggagat tctgtccagc 960
gtgtgcatct ctgaggatct cctgcagaat tactccgacg tctacttcaa gctcaagaag 1020gtgtgcatct ctgaggatct cctgcagaat tactccgacg tctacttcaa gctcaagaag 1020
tctgacgatg acaacctgca gaaggatttc aagtccgcca aggacaccat caagaagcag 1080tctgacgatg acaacctgca gaaggatttc aagtccgcca aggacaccat caagaagcag 1080
atttctgagt acatcaagga ttccgagaag ttcaagaatc tcttcaacca gaatctgatt 1140atttctgagt acatcaagga ttccgagaag ttcaagaatc tcttcaacca gaatctgatt 1140
gatgcgaaga agggccagga gtctgacctg atcctctggc tgaagcagtc caaggacaat 1200gatgcgaaga agggccagga gtctgacctg atcctctggc tgaagcagtc caaggacaat 1200
ggcattgagc tgttcaaggc caacagcgat atcaccgata ttgacgaggc gctggagatc 1260ggcattgagc tgttcaaggc caacagcgat atcaccgata ttgacgaggc gctggagatc 1260
attaagtcat tcaagggctg gaccacatac ttcaagggct tccatgagaa ccggaagaat 1320attaagtcat tcaagggctg gaccacatac ttcaagggct tccatgagaa ccggaagaat 1320
gtgtactcat ctaacgacat tccgacctcc atcatctaca ggatcgtcga tgacaatctg 1380gtgtactcat ctaacgacat tccgacctcc atcatctaca ggatcgtcga tgacaatctg 1380
ccaaagttcc tcgagaacaa ggccaagtac gagtccctca aggacaaggc cccggaggcg 1440ccaaagttcc tcgagaacaa ggccaagtac gagtccctca aggacaaggc cccggaggcg 1440
attaattacg agcagatcaa gaaggatctg gcggaggagc tgaccttcga tatcgactac 1500attaattacg agcagatcaa gaaggatctg gcggaggagc tgaccttcga tatcgactac 1500
aagacaagcg aggtgaacca gagggtgttc tccctcgatg aggtgttcga gatcgccaat 1560aagacaagcg aggtgaacca gagggtgttc tccctcgatg aggtgttcga gatcgccaat 1560
ttcaacaatt acctgaacca gtccggcatt accaagttca atacaatcat tggcggcaag 1620ttcaacaatt acctgaacca gtccggcatt accaagttca atacaatcat tggcggcaag 1620
ttcgtcaacg gcgagaatac caagcgcaag ggcattaacg agtacatcaa tctctactcc 1680ttcgtcaacg gcgagaatac caagcgcaag ggcattaacg agtacatcaa tctctactcc 1680
cagcagatca acgacaagac cctgaagaag tacaagatgt ctgtgctctt caagcagatc 1740cagcagatca acgacaagac cctgaagaag tacaagatgt ctgtgctctt caagcagatc 1740
ctgtccgata cagagtccaa gagcttcgtc attgataagc tcgaggacga cagcgacgtg 1800ctgtccgata cagagtccaa gagcttcgtc attgataagc tcgaggacga cagcgacgtg 1800
gtcaccacaa tgcagtcatt ctacgagcag atcgccgcgt tcaagaccgt ggaggagaag 1860gtcaccacaa tgcagtcatt ctacgagcag atcgccgcgt tcaagaccgt ggaggagaag 1860
agcattaagg agacactctc actcctgttc gatgacctga aggcccagaa gctcgacctg 1920agcattaagg agacactctc actcctgttc gatgacctga aggcccagaa gctcgacctg 1920
agcaagatct acttcaagaa cgataagagc ctcacagacc tgtcacagca ggtgttcgat 1980agcaagatct acttcaagaa cgataagagc ctcacagacc tgtcacagca ggtgttcgat 1980
gactactcag tgattggcac cgccgtcctc gagtacatta cacagcagat cgcgccaaag 2040gactactcag tgattggcac cgccgtcctc gagtacatta cacagcagat cgcgccaaag 2040
aacctcgata atccttctaa gaaggagcag gagctgatcg ccaagaaaac cgagaaggcg 2100aacctcgata atccttctaa gaaggagcag gagctgatcg ccaagaaaac cgagaaggcg 2100
aagtacctct ccctggagac aattaagctc gccctggagg agttcaataa gcacagggat 2160aagtacctct ccctggagac aattaagctc gccctggagg agttcaataa gcacagggat 2160
attgacaagc agtgccgctt cgaggagatc ctcgcgaact tcgccgcgat cccaatgatt 2220attgacaagc agtgccgctt cgaggagatc ctcgcgaact tcgccgcgat cccaatgatt 2220
ttcgatgaga tcgcccagaa caaggacaat ctggcgcaga tctctattaa gtaccagaac 2280ttcgatgaga tcgcccagaa caaggacaat ctggcgcaga tctctattaa gtaccagaac 2280
cagggcaaga aggacctcct gcaggcctcc gcagaggacg acgtgaaggc catcaaggat 2340cagggcaaga aggacctcct gcaggcctcc gcagaggacg acgtgaaggc catcaaggat 2340
ctcctggacc agaccaacaa tctcctgcac aagctcaaga tcttccatat ttcacagtct 2400ctcctggacc agaccaacaa tctcctgcac aagctcaaga tcttccatat ttcacagtct 2400
gaggataagg ccaatatcct cgataaggac gagcatttct acctggtgtt cgaggagtgc 2460gaggataagg ccaatatcct cgataaggac gagcatttct acctggtgtt cgaggagtgc 2460
tacttcgagc tggcgaacat tgtccctctg tacaacaaga ttaggaatta catcacacag 2520tacttcgagc tggcgaacat tgtccctctg tacaacaaga ttaggaatta catcacacag 2520
aagccgtaca gcgacgagaa gttcaagctc aacttcgaga attcaaccct ggccaacggc 2580aagccgtaca gcgacgagaa gttcaagctc aacttcgaga attcaaccct ggccaacggc 2580
tgggataaga ataaggagcc tgacaacaca gcgatcctct tcatcaagga cgacaagtac 2640tgggataaga ataaggagcc tgacaacaca gcgatcctct tcatcaagga cgacaagtac 2640
tacctgggcg tgatgaataa gaagaacaat aagatcttcg atgacaaggc cattaaggag 2700tacctgggcg tgatgaataa gaagaacaat aagatcttcg atgacaaggc cattaaggag 2700
aacaagggcg agggctacaa gaagatcgtg tacaagctcc tgcctggcgc caataagatg 2760aacaagggcg agggctacaa gaagatcgtg tacaagctcc tgcctggcgc caataagatg 2760
ctcccgaagg tgttcttctc cgcgaagtcc attaagttct acaacccaag cgaggatatc 2820ctcccgaagg tgttcttctc cgcgaagtcc attaagttct acaacccaag cgaggatatc 2820
ctcaggatca ggaaccactc tacccataca aagaacggct cccctcagaa gggctacgag 2880ctcaggatca ggaaccactc tacccataca aagaacggct cccctcagaa gggctacgag 2880
aagttcgagt tcaatatcga ggattgccgg aagttcattg acttctacaa gcagtccatc 2940aagttcgagt tcaatatcga ggattgccgg aagttcattg acttctacaa gcagtccatc 2940
agcaagcacc ctgagtggaa ggatttcggc ttccgcttca gcgacaccca gcggtacaac 3000agcaagcacc ctgagtggaa ggatttcggc ttccgcttca gcgacaccca gcggtacaac 3000
tcaatcgatg agttctacag ggaggtggag aatcagggct acaagctcac attcgagaac 3060tcaatcgatg agttctacag ggaggtggag aatcagggct acaagctcac attcgagaac 3060
atttcagagt cttacatcga ctccgtggtc aatcagggca agctctacct gttccagatc 3120atttcagagt cttacatcga ctccgtggtc aatcagggca agctctacct gttccagatc 3120
tacaacaagg atttcagcgc ctactcaaag ggcaggccga acctccatac cctgtactgg 3180tacaacaagg atttcagcgc ctactcaaag ggcaggccga acctccatac cctgtactgg 3180
aaggcgctct tcgatgagcg caatctgcag gacgtggtct acaagctcaa cggcgaggcc 3240aaggcgctct tcgatgagcg caatctgcag gacgtggtct acaagctcaa cggcgaggcc 3240
gagctgttct accgcaagca gtctattccg aagaagatca cacacccagc gaaggaggcc 3300gagctgttct accgcaagca gtctattccg aagaagatca cacacccagc gaaggaggcc 3300
atcgcgaaca agaataagga caatccgaag aaggagtccg tgttcgagta cgatctcatt 3360atcgcgaaca agaataagga caatccgaag aaggagtccg tgttcgagta cgatctcatt 3360
aaggacaagc ggttcaccga ggataagttc ttcttccatt gcccaatcac aattaacttc 3420aaggacaagc ggttcaccga ggataagttc ttcttccatt gcccaatcac aattaacttc 3420
aagtccagcg gcgccaacaa gttcaatgac gagatcaatc tcctgctcaa ggagaaggcg 3480aagtccagcg gcgccaacaa gttcaatgac gagatcaatc tcctgctcaa ggagaaggcg 3480
aacgatgtgc atatcctctc cattgcccgc ggcgagcggc atctcgccta ctacaccctg 3540aacgatgtgc atatcctctc cattgcccgc ggcgagcggc atctcgccta ctacaccctg 3540
gtcgatggca agggcaatat cattaagcag gataccttca acatcattgg caatgaccgc 3600gtcgatggca agggcaatat cattaagcag gataccttca acatcattgg caatgaccgc 3600
atgaaaacca actaccacga taagctcgcc gcgatcgaga aggataggga cagcgcgcgc 3660atgaaaacca actaccacga taagctcgcc gcgatcgaga aggataggga cagcgcgcgc 3660
aaggactgga agaagatcaa caatattaag gagatgaagg agggctacct ctcacaggtg 3720aaggactgga agaagatcaa caatattaag gagatgaagg agggctacct ctcacaggtg 3720
gtccatgaga ttgccaagct cgtgatcgag tacaatgcga ttgtggtgtt cgaggacctc 3780gtccatgaga ttgccaagct cgtgatcgag tacaatgcga ttgtggtgtt cgaggacctc 3780
aacttcggct tcaagcgggg caggttcaag gtggagaagc aggtctacca gaagctggag 3840aacttcggct tcaagcgggg caggttcaag gtggagaagc aggtctacca gaagctggag 3840
aagatgctca tcgagaagct caattacctg gtgttcaagg ataacgagtt cgacaagacc 3900aagatgctca tcgagaagct caattacctg gtgttcaagg ataacgagtt cgacaagacc 3900
ggaggagtcc tcagggcata ccagctgacc gcgccattcg agacattcaa gaagatgggc 3960ggaggagtcc tcagggcata ccagctgacc gcgccattcg agacattcaa gaagatgggc 3960
aagcagacag gcatcatcta ctacgtgcca gccggcttca cctccaagat ctgccctgtg 4020aagcagacag gcatcatcta ctacgtgcca gccggcttca cctccaagat ctgccctgtg 4020
acaggcttcg tcaaccagct ctaccctaag tacgagtccg tcagcaagtc acaggagttc 4080
ttcagcaagt tcgataagat ttgctacaat ctggacaagg gctacttcga gttctcattc 4140
gattacaaga acttcggcga caaggccgcg aagggcaagt ggaccattgc gtctttcggc 4200
tcccggctca tcaacttcag gaatagcgat aagaaccaca attgggacac cagggaggtg 4260
tacccgacaa aggagctgga gaagctgctc aaggactact caatcgagta cggccatggc 4320
gagtgcatta aggccgcgat ctgcggcgag agcgataaga agttcttcgc caagctgacc 4380
tcagtcctca atacaatcct gcagatgagg aactctaaga ccggcacaga gctggactac 4440
ctgatttccc cagtggccga tgtcaacggc aatttcttcg acagccgcca ggcgccgaag 4500
aatatgccac aggacgccga cgccaacggc gcataccaca tcggcctcaa gggcctgatg 4560
ctgctcggac gcattaagaa caatcaggag ggcaagaagc tcaacctcgt gatcaagaac 4620
gaggagtact tcgagttcgt ccaaaaccgg aacaatacgc gtgactccgg cggcagcacc 4680
aacctgtccg acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc 4740
ctcatgctgc cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg 4800
gtgcacaccg cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc 4860
ccagagtaca agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag 4920
atgctgtcgg gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag 4980
aagcgcaagg tctag 4995
<210> 9
<211> 4779
<212> DNA
<213> Artificial Sequence
<220>
<223> dLBCPF1-PBE-2NLS
<400> 9
atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60
ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120
aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180
accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240
aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300
ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360
atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420
ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480
aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540
tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600
aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660
cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720
acgccaggca cctccgagag cgctacgcct gaacttaagt caaagctcga gaaattcacc 780
aactgttatt cgttgagcaa aacactgcgg tttaaagcga ttccagtcgg caagactcaa 840
gagaatatag acaataagcg gctgttggtg gaagatgaaa agcgcgcgga agactacaaa 900
ggggtgaaga agttgttgga cagatactac ctctctttta tcaatgatgt cttgcactca 960
atcaaattga agaatctgaa caactacatc tccctcttca gaaagaaaac aaggacagaa 1020
aaggagaata aggaacttga aaatttggag atcaatctga ggaaagagat cgcgaaagcc 1080
tttaaaggca acgaaggata caaaagtctg ttcaagaagg atataattga gacaattttg 1140
ccagagttcc tcgatgacaa ggacgagatt gcgctggtca attcgttcaa cggattcaca 1200
acagcattca caggcttctt tgataatcgg gaaaatatgt tctctgagga ggcaaagtcc 1260
acttctattg cgttcaggtg tatcaatgag aatctcacta ggtacatttc caacatggat 1320
atctttgaga aggttgacgc aatttttgac aagcacgaag ttcaggagat taaggagaag 1380
atcctcaatt ccgattatga cgttgaggac ttcttcgaag gtgagttttt taatttcgtg 1440
ctcactcaag agggtatcga cgtgtataat gcgatcatcg gtgggttcgt gactgagtcc 1500
ggtgaaaaga ttaagggatt gaacgagtat atcaaccttt acaaccaaaa gacgaaacag 1560
aagctgccaa agttcaagcc tctttacaaa caggttcttt cagaccgcga gtcactctcg 1620
ttctatgggg agggctacac ttcggatgag gaagtcctgg aggtgttcag gaatactctc 1680
aataagaatt cggagatttt ctcttctata aaaaaactgg aaaagttgtt taagaatttt 1740
gacgaatact ctagcgccgg catatttgtg aaaaacggcc cggccatatc aacgataagt 1800
aaagatatct tcggcgaatg gaacgtgatc agagacaaat ggaacgcgga gtatgacgat 1860
attcacctga agaagaaggc tgtcgtaacg gagaagtacg aggatgatcg caggaaaagc 1920
ttcaaaaaga tcggaagttt cagcctggaa cagttgcagg agtatgctga cgccgatctt 1980
agcgtcgtcg agaagttgaa ggagataatc atccaaaagg tcgacgagat atataaagtc 2040
tatggatcaa gtgaaaaact gttcgacgcc gacttcgttt tggagaagtc cctgaagaag 2100
aacgacgctg ttgttgccat tatgaaggat ctgctcgaca gcgtgaagag tttcgagaac 2160
tatattaagg cttttttcgg ggaggggaag gagactaaca gagatgagtc cttctacgga 2220
gacttcgtcc tcgcgtacga tatactcctt aaggtagacc acatctacga cgcaatcaga 2280
aattacgtga cacaaaagcc gtacagcaag gacaagttca aactctactt ccagaacccc 2340
cagttcatgg gcggctggga caaggacaag gaaacggatt acagggctac gatcctgagg 2400
tatggttcaa aatactactt ggcgattatg gacaagaagt acgccaagtg tctccagaag 2460
attgacaaag acgatgtcaa tggcaattat gagaagatca actacaagct gcttccgggt 2520
ccgaacaaga tgctcccaaa ggttttcttc agcaagaaat ggatggccta ctataaccca 2580
agcgaggaca tccagaagat ttataagaac ggtacgttca agaagggcga catgttcaat 2640
cttaacgact gtcacaagct gatcgacttc ttcaaagact caattagccg gtacccaaag 2700
tggtctaacg cctatgactt caacttttcg gaaaccgaga agtacaagga tatagccgga 2760
ttttatagag aggtggaaga gcagggctac aaggtgtcat tcgagtccgc cagcaagaag 2820
gaagtggaca agctcgtgga agagggtaag ctctacatgt tccagattta taataaagac 2880
tttagcgata agagccacgg gacacctaat ctccacacaa tgtatttcaa gctgctcttc 2940
gacgagaata accacggcca aatcaggttg tcaggagggg ctgaactctt catgcggcgc 3000
gctagcctta agaaggagga gcttgtagtc caccctgcga atagtccaat tgcgaataag 3060
aacccggaca atcctaaaaa gactacaaca ttgagctacg acgtgtacaa ggataagagg 3120
ttttccgagg atcagtacga gctccacatc ccgattgcga tcaacaagtg cccaaagaat 3180
attttcaaga taaacacaga ggtgcgtgta ctcctgaagc atgacgacaa tccttacgtc 3240
attgggattg ctcggggcga gaggaacctc ctctatattg tggtggtgga cgggaagggg 3300
aacatagtcg aacagtactc ccttaacgaa ataattaaca atttcaacgg catccgtatc 3360
aagaccgact accattcgtt gctggacaag aaggagaagg agagatttga ggcgcggcaa 3420
aattggacaa gtatcgagaa catcaaggaa ctcaaagcag gttatatctc tcaagttgtg 3480
cataagatat gcgagctggt tgagaagtat gacgcagtga tcgctcttga ggacctcaac 3540
tcgggcttta agaattctag agttaaagtg gagaagcagg tctatcaaaa gttcgagaag 3600
atgcttatag ataagctcaa ctacatggtc gataagaaat cgaacccatg tgccaccggc 3660
ggcgcactca aaggttacca aataacaaac aaattcgagt ccttcaaatc gatgagtact 3720
cagaatgggt tcatatttta tataccggcg tggcttacgt ctaagatcga cccgtcaact 3780
ggttttgtca acctgttgaa gacgaaatac acgtccattg ccgattcgaa aaagttcata 3840
tctagttttg atcgtattat gtacgtccca gaggaagatc ttttcgagtt tgctctcgac 3900
tacaaaaact tttcgcggac cgatgcggat tacattaaaa aatggaaact ctattcgtac 3960
ggcaacagaa tcaggatttt tcgcaaccct aagaagaata acgtctttga ttgggaggaa 4020
gtttgcttga ctagcgcgta caaggagctc tttaataagt atggcattaa ctaccaacag 4080
ggtgatatca gagcactgct ttgcgaacaa tctgacaagg ctttctactc atccttcatg 4140
gctttgatga gcctgatgct ccagatgaga aattcaatta caggcagaac cgacgtggat 4200
ttcttgatct ccccggttaa aaattctgat ggcatctttt acgatagcag gaactatgaa 4260
gcgcaagaga atgcgattct gccaaaaaat gcagacgcca acggtgccta taacatcgcc 4320
aggaaagtcc tgtgggcgat cggccagttc aaaaaggccg aagacgaaaa attggacaag 4380
gtcaaaatcg ctatcagcaa caaagagtgg ctggagtatg ctcagacatc cgtaaagcat 4440
acgcgtgact ccggcggcag caccaacctg tccgacatca tcgagaagga gacgggcaag 4500
caactcgtga tccaggagag catcctcatg ctgccagagg aggtggagga ggtcatcggc 4560
aacaagccag agtccgacat cctggtgcac accgcctacg acgagtccac cgacgagaac 4620
gtcatgctcc tgaccagcga cgccccagag tacaagccat gggccctcgt catccaggac 4680
agcaacgggg agaacaagat caagatgctg tcggggggga gcccaaagaa gaagcggaag 4740
gtgagcggcg gcagcccaaa gaagaagcgc aaggtctag 4779
<210> 10
<211> 6305
<212> DNA
<213> Artificial Sequence
<220>
<223> BDUBI10-dLBCPF1-PBE-2NLS
<400> 10
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660
cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720
ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780
gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840
caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900
ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960
tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020
taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080
atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140
cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200
ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500
tctgcaggtc gaagcttgaa gcaaacatgc caaagaagaa gaggaaggtt tcatcggaga 1560
ccggccctgt tgctgttgac cccaccctgc ggcggagaat cgagccacac gagttcgagg 1620
tgttcttcga cccaagggag ctccgcaagg agacgtgcct cctgtacgag atcaactggg 1680
gcggcaggca ctccatctgg aggcacacca gccaaaacac caacaagcac gtggaggtca 1740
acttcatcga gaagttcacc accgagaggt acttctgccc aaacacccgc tgctccatca 1800
cctggttcct gtcctggagc ccatgcggcg agtgctccag ggccatcacc gagttcctca 1860
gccgctaccc acacgtcacc ctgttcatct acatcgccag gctctaccac cacgccgacc 1920
caaggaacag gcagggcctc cgcgacctga tctccagcgg cgtgaccatc caaatcatga 1980
ccgagcagga gtccggctac tgctggagga acttcgtcaa ctactcccca agcaacgagg 2040
cccactggcc aaggtaccca cacctctggg tgcgcctcta cgtgctcgag ctgtactgca 2100
tcatcctcgg cctgccacca tgcctcaaca tcctgaggcg caagcaacca cagctgacct 2160
tcttcaccat cgccctccaa agctgccact accagaggct cccaccacac atcctgtggg 2220
ctaccggcct caagtccggc agcgagacgc caggcacctc cgagagcgct acgcctgaac 2280
ttaagtcaaa gctcgagaaa ttcaccaact gttattcgtt gagcaaaaca ctgcggttta 2340
aagcgattcc agtcggcaag actcaagaga atatagacaa taagcggctg ttggtggaag 2400
atgaaaagcg cgcggaagac tacaaagggg tgaagaagtt gttggacaga tactacctct 2460
cttttatcaa tgatgtcttg cactcaatca aattgaagaa tctgaacaac tacatctccc 2520
tcttcagaaa gaaaacaagg acagaaaagg agaataagga acttgaaaat ttggagatca 2580
atctgaggaa agagatcgcg aaagccttta aaggcaacga aggatacaaa agtctgttca 2640
agaaggatat aattgagaca attttgccag agttcctcga tgacaaggac gagattgcgc 2700
tggtcaattc gttcaacgga ttcacaacag cattcacagg cttctttgat aatcgggaaa 2760
atatgttctc tgaggaggca aagtccactt ctattgcgtt caggtgtatc aatgagaatc 2820
tcactaggta catttccaac atggatatct ttgagaaggt tgacgcaatt tttgacaagc 2880
acgaagttca ggagattaag gagaagatcc tcaattccga ttatgacgtt gaggacttct 2940
tcgaaggtga gttttttaat ttcgtgctca ctcaagaggg tatcgacgtg tataatgcga 3000
tcatcggtgg gttcgtgact gagtccggtg aaaagattaa gggattgaac gagtatatca 3060
acctttacaa ccaaaagacg aaacagaagc tgccaaagtt caagcctctt tacaaacagg 3120
ttctttcaga ccgcgagtca ctctcgttct atggggaggg ctacacttcg gatgaggaag 3180
tcctggaggt gttcaggaat actctcaata agaattcgga gattttctct tctataaaaa 3240
aactggaaaa gttgtttaag aattttgacg aatactctag cgccggcata tttgtgaaaa 3300
acggcccggc catatcaacg ataagtaaag atatcttcgg cgaatggaac gtgatcagag 3360
acaaatggaa cgcggagtat gacgatattc acctgaagaa gaaggctgtc gtaacggaga 3420
agtacgagga tgatcgcagg aaaagcttca aaaagatcgg aagtttcagc ctggaacagt 3480
tgcaggagta tgctgacgcc gatcttagcg tcgtcgagaa gttgaaggag ataatcatcc 3540
aaaaggtcga cgagatatat aaagtctatg gatcaagtga aaaactgttc gacgccgact 3600
tcgttttgga gaagtccctg aagaagaacg acgctgttgt tgccattatg aaggatctgc 3660
tcgacagcgt gaagagtttc gagaactata ttaaggcttt tttcggggag gggaaggaga 3720
ctaacagaga tgagtccttc tacggagact tcgtcctcgc gtacgatata ctccttaagg 3780
tagaccacat ctacgacgca atcagaaatt acgtgacaca aaagccgtac agcaaggaca 3840
agttcaaact ctacttccag aacccccagt tcatgggcgg ctgggacaag gacaaggaaa 3900
cggattacag ggctacgatc ctgaggtatg gttcaaaata ctacttggcg attatggaca 3960
agaagtacgc caagtgtctc cagaagattg acaaagacga tgtcaatggc aattatgaga 4020
agatcaacta caagctgctt ccgggtccga acaagatgct cccaaaggtt ttcttcagca 4080
agaaatggat ggcctactat aacccaagcg aggacatcca gaagatttat aagaacggta 4140
cgttcaagaa gggcgacatg ttcaatctta acgactgtca caagctgatc gacttcttca 4200
aagactcaat tagccggtac ccaaagtggt ctaacgccta tgacttcaac ttttcggaaa 4260
ccgagaagta caaggatata gccggatttt atagagaggt ggaagagcag ggctacaagg 4320
tgtcattcga gtccgccagc aagaaggaag tggacaagct cgtggaagag ggtaagctct 4380
acatgttcca gatttataat aaagacttta gcgataagag ccacgggaca cctaatctcc 4440
acacaatgta tttcaagctg ctcttcgacg agaataacca cggccaaatc aggttgtcag 4500
gaggggctga actcttcatg cggcgcgcta gccttaagaa ggaggagctt gtagtccacc 4560
ctgcgaatag tccaattgcg aataagaacc cggacaatcc taaaaagact acaacattga 4620
gctacgacgt gtacaaggat aagaggtttt ccgaggatca gtacgagctc cacatcccga 4680
ttgcgatcaa caagtgccca aagaatattt tcaagataaa cacagaggtg cgtgtactcc 4740
tgaagcatga cgacaatcct tacgtcattg ggattgctcg gggcgagagg aacctcctct 4800
atattgtggt ggtggacggg aaggggaaca tagtcgaaca gtactccctt aacgaaataa 4860
ttaacaattt caacggcatc cgtatcaaga ccgactacca ttcgttgctg gacaagaagg 4920
agaaggagag atttgaggcg cggcaaaatt ggacaagtat cgagaacatc aaggaactca 4980
aagcaggtta tatctctcaa gttgtgcata agatatgcga gctggttgag aagtatgacg 5040
cagtgatcgc tcttgaggac ctcaactcgg gctttaagaa ttctagagtt aaagtggaga 5100
agcaggtcta tcaaaagttc gagaagatgc ttatagataa gctcaactac atggtcgata 5160
agaaatcgaa cccatgtgcc accggcggcg cactcaaagg ttaccaaata acaaacaaat 5220
tcgagtcctt caaatcgatg agtactcaga atgggttcat attttatata ccggcgtggc 5280
ttacgtctaa gatcgacccg tcaactggtt ttgtcaacct gttgaagacg aaatacacgt 5340
ccattgccga ttcgaaaaag ttcatatcta gttttgatcg tattatgtac gtcccagagg 5400
aagatctttt cgagtttgct ctcgactaca aaaacttttc gcggaccgat gcggattaca 5460
ttaaaaaatg gaaactctat tcgtacggca acagaatcag gatttttcgc aaccctaaga 5520
agaataacgt ctttgattgg gaggaagttt gcttgactag cgcgtacaag gagctcttta 5580
ataagtatgg cattaactac caacagggtg atatcagagc actgctttgc gaacaatctg 5640
acaaggcttt ctactcatcc ttcatggctt tgatgagcct gatgctccag atgagaaatt 5700
caattacagg cagaaccgac gtggatttct tgatctcccc ggttaaaaat tctgatggca 5760
tcttttacga tagcaggaac tatgaagcgc aagagaatgc gattctgcca aaaaatgcag 5820
acgccaacgg tgcctataac atcgccagga aagtcctgtg ggcgatcggc cagttcaaaa 5880
aggccgaaga cgaaaaattg gacaaggtca aaatcgctat cagcaacaaa gagtggctgg 5940
agtatgctca gacatccgta aagcatacgc gtgactccgg cggcagcacc aacctgtccg 6000
acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc ctcatgctgc 6060
cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg gtgcacaccg 6120
cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc ccagagtaca 6180
agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag atgctgtcgg 6240
gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag aagcgcaagg 6300
tctag 6305
<210> 11
<211> 5196
<212> DNA
<213> Artificial Sequence
<220>
<223> dFNCPF1-ABE7.10-2NLS
<400> 11
atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60
tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120
gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180
ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240
atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300
tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360
aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420
gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480
aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540
ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600
gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660
atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720
gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780
catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840
aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900
ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960
ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020
atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080
ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140
tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200
tctggaggat ctagcggagg atctcttaag tccatctacc aggagttcgt caataagtac 1260
tcactctcta agaccctgcg gttcgagctg atcccgcagg gcaagacact cgagaacatc 1320
aaggcgcgcg gcctgattct cgacgatgag aagcgggcca aggactacaa gaaggcgaag 1380
cagatcattg ataagtacca ccagttcttc atcgaggaga ttctgtccag cgtgtgcatc 1440
tctgaggatc tcctgcagaa ttactccgac gtctacttca agctcaagaa gtctgacgat 1500
gacaacctgc agaaggattt caagtccgcc aaggacacca tcaagaagca gatttctgag 1560
tacatcaagg attccgagaa gttcaagaat ctcttcaacc agaatctgat tgatgcgaag 1620
aagggccagg agtctgacct gatcctctgg ctgaagcagt ccaaggacaa tggcattgag 1680
ctgttcaagg ccaacagcga tatcaccgat attgacgagg cgctggagat cattaagtca 1740
ttcaagggct ggaccacata cttcaagggc ttccatgaga accggaagaa tgtgtactca 1800
tctaacgaca ttccgacctc catcatctac aggatcgtcg atgacaatct gccaaagttc 1860
ctcgagaaca aggccaagta cgagtccctc aaggacaagg ccccggaggc gattaattac 1920
gagcagatca agaaggatct ggcggaggag ctgaccttcg atatcgacta caagacaagc 1980
gaggtgaacc agagggtgtt ctccctcgat gaggtgttcg agatcgccaa tttcaacaat 2040
tacctgaacc agtccggcat taccaagttc aatacaatca ttggcggcaa gttcgtcaac 2100
ggcgagaata ccaagcgcaa gggcattaac gagtacatca atctctactc ccagcagatc 2160
aacgacaaga ccctgaagaa gtacaagatg tctgtgctct tcaagcagat cctgtccgat 2220
acagagtcca agagcttcgt cattgataag ctcgaggacg acagcgacgt ggtcaccaca 2280
atgcagtcat tctacgagca gatcgccgcg ttcaagaccg tggaggagaa gagcattaag 2340
gagacactct cactcctgtt cgatgacctg aaggcccaga agctcgacct gagcaagatc 2400
tacttcaaga acgataagag cctcacagac ctgtcacagc aggtgttcga tgactactca 2460
gtgattggca ccgccgtcct cgagtacatt acacagcaga tcgcgccaaa gaacctcgat 2520
aatccttcta agaaggagca ggagctgatc gccaagaaaa ccgagaaggc gaagtacctc 2580
tccctggaga caattaagct cgccctggag gagttcaata agcacaggga tattgacaag 2640
cagtgccgct tcgaggagat cctcgcgaac ttcgccgcga tcccaatgat tttcgatgag 2700
atcgcccaga acaaggacaa tctggcgcag atctctatta agtaccagaa ccagggcaag 2760
aaggacctcc tgcaggcctc cgcagaggac gacgtgaagg ccatcaagga tctcctggac 2820
cagaccaaca atctcctgca caagctcaag atcttccata tttcacagtc tgaggataag 2880
gccaatatcc tcgataagga cgagcatttc tacctggtgt tcgaggagtg ctacttcgag 2940
ctggcgaaca ttgtccctct gtacaacaag attaggaatt acatcacaca gaagccgtac 3000
agcgacgaga agttcaagct caacttcgag aattcaaccc tggccaacgg ctgggataag 3060
aataaggagc ctgacaacac agcgatcctc ttcatcaagg acgacaagta ctacctgggc 3120
gtgatgaata agaagaacaa taagatcttc gatgacaagg ccattaagga gaacaagggc 3180
gagggctaca agaagatcgt gtacaagctc ctgcctggcg ccaataagat gctcccgaag 3240gagggctaca agaagatcgt gtacaagctc ctgcctggcg ccaataagat gctcccgaag 3240
gtgttcttct ccgcgaagtc cattaagttc tacaacccaa gcgaggatat cctcaggatc 3300gtgttcttct ccgcgaagtc cattaagttc tacaacccaa gcgaggatat cctcaggatc 3300
aggaaccact ctacccatac aaagaacggc tcccctcaga agggctacga gaagttcgag 3360aggaaccact ctacccatac aaagaacggc tcccctcaga agggctacga gaagttcgag 3360
ttcaatatcg aggattgccg gaagttcatt gacttctaca agcagtccat cagcaagcac 3420ttcaatatcg aggattgccg gaagttcatt gacttctaca agcagtccat cagcaagcac 3420
cctgagtgga aggatttcgg cttccgcttc agcgacaccc agcggtacaa ctcaatcgat 3480cctgagtgga aggatttcgg cttccgcttc agcgacaccc agcggtacaa ctcaatcgat 3480
gagttctaca gggaggtgga gaatcagggc tacaagctca cattcgagaa catttcagag 3540gagttctaca gggaggtgga gaatcagggc tacaagctca cattcgagaa catttcagag 3540
tcttacatcg actccgtggt caatcagggc aagctctacc tgttccagat ctacaacaag 3600tcttacatcg actccgtggt caatcagggc aagctctacc tgttccagat ctacaacaag 3600
gatttcagcg cctactcaaa gggcaggccg aacctccata ccctgtactg gaaggcgctc 3660gatttcagcg cctactcaaa gggcaggccg aacctccata ccctgtactg gaaggcgctc 3660
ttcgatgagc gcaatctgca ggacgtggtc tacaagctca acggcgaggc cgagctgttc 3720ttcgatgagc gcaatctgca ggacgtggtc tacaagctca acggcgaggc cgagctgttc 3720
taccgcaagc agtctattcc gaagaagatc acacacccag cgaaggaggc catcgcgaac 3780taccgcaagc agtctattcc gaagaagatc acacacccag cgaaggaggc catcgcgaac 3780
aagaataagg acaatccgaa gaaggagtcc gtgttcgagt acgatctcat taaggacaag 3840aagaataagg acaatccgaa gaaggagtcc gtgttcgagt acgatctcat taaggacaag 3840
cggttcaccg aggataagtt cttcttccat tgcccaatca caattaactt caagtccagc 3900cggttcaccg aggataagtt cttcttccat tgcccaatca caattaactt caagtccagc 3900
ggcgccaaca agttcaatga cgagatcaat ctcctgctca aggagaaggc gaacgatgtg 3960ggcgccaaca agttcaatga cgagatcaat ctcctgctca aggagaaggc gaacgatgtg 3960
catatcctct ccattgcccg cggcgagcgg catctcgcct actacaccct ggtcgatggc 4020catatcctct ccattgcccg cggcgagcgg catctcgcct actacaccct ggtcgatggc 4020
aagggcaata tcattaagca ggataccttc aacatcattg gcaatgaccg catgaaaacc 4080aagggcaata tcattaagca ggataccttc aacatcattg gcaatgaccg catgaaaacc 4080
aactaccacg ataagctcgc cgcgatcgag aaggataggg acagcgcgcg caaggactgg 4140aactaccacg ataagctcgc cgcgatcgag aaggataggg acagcgcgcg caaggactgg 4140
aagaagatca acaatattaa ggagatgaag gagggctacc tctcacaggt ggtccatgag 4200aagaagatca acaatattaa ggagatgaag gagggctacc tctcacaggt ggtccatgag 4200
attgccaagc tcgtgatcga gtacaatgcg attgtggtgt tcgaggacct caacttcggc 4260attgccaagc tcgtgatcga gtacaatgcg attgtggtgt tcgaggacct caacttcggc 4260
ttcaagcggg gcaggttcaa ggtggagaag caggtctacc agaagctgga gaagatgctc 4320ttcaagcggg gcaggttcaa ggtggagaag caggtctacc agaagctgga gaagatgctc 4320
atcgagaagc tcaattacct ggtgttcaag gataacgagt tcgacaagac cggaggagtc 4380atcgagaagc tcaattacct ggtgttcaag gataacgagt tcgacaagac cggaggagtc 4380
ctcagggcat accagctgac cgcgccattc gagacattca agaagatggg caagcagaca 4440ctcagggcat accagctgac cgcgccattc gagacattca agaagatggg caagcagaca 4440
ggcatcatct actacgtgcc agccggcttc acctccaaga tctgccctgt gacaggcttc 4500ggcatcatct actacgtgcc agccggcttc acctccaaga tctgccctgt gacaggcttc 4500
gtcaaccagc tctaccctaa gtacgagtcc gtcagcaagt cacaggagtt cttcagcaag 4560gtcaaccagc tctaccctaa gtacgagtcc gtcagcaagt cacaggagtt cttcagcaag 4560
ttcgataaga tttgctacaa tctggacaag ggctacttcg agttctcatt cgattacaag 4620ttcgataaga tttgctacaa tctggacaag ggctacttcg agttctcatt cgattacaag 4620
aacttcggcg acaaggccgc gaagggcaag tggaccattg cgtctttcgg ctcccggctc 4680aacttcggcg acaaggccgc gaagggcaag tggaccattg cgtctttcgg ctcccggctc 4680
atcaacttca ggaatagcga taagaaccac aattgggaca ccagggaggt gtacccgaca 4740atcaacttca ggaatagcga taagaaccac aattgggaca ccagggaggt gtacccgaca 4740
aaggagctgg agaagctgct caaggactac tcaatcgagt acggccatgg cgagtgcatt 4800aaggagctgg agaagctgct caaggactac tcaatcgagt acggccatgg cgagtgcatt 4800
aaggccgcga tctgcggcga gagcgataag aagttcttcg ccaagctgac ctcagtcctc 4860aaggccgcga tctgcggcga gagcgataag aagttcttcg ccaagctgac ctcagtcctc 4860
aatacaatcc tgcagatgag gaactctaag accggcacag agctggacta cctgatttcc 4920aatacaatcc tgcagatgag gaactctaag accggcacag agctggacta cctgatttcc 4920
ccagtggccg atgtcaacgg caatttcttc gacagccgcc aggcgccgaa gaatatgcca 4980ccagtggccg atgtcaacgg caatttcttc gacagccgcc aggcgccgaa gaatatgcca 4980
caggacgccg acgccaacgg cgcataccac atcggcctca agggcctgat gctgctcgga 5040caggacgccg acgccaacgg cgcataccac atcggcctca agggcctgat gctgctcgga 5040
cgcattaaga acaatcagga gggcaagaag ctcaacctcg tgatcaagaa cgaggagtac 5100cgcattaaga acaatcagga gggcaagaag ctcaacctcg tgatcaagaa cgaggagtac 5100
ttcgagttcg tccaaaaccg gaacaattcc ggcggcagcc caaagaagaa gaggaaggtg 5160ttcgagttcg tccaaaaccg gaacaattcc ggcggcagcc caaagaagaa gaggaaggtg 5160
agcggcggca gcccaaagaa gaagcgcaag gtctag 5196agcggcggca gcccaaagaa gaagcgcaag gtctag 5196
<210> 12<210> 12
<211> 4980<211> 4980
<212> DNA<212> DNA
<213> Artificial Sequence<213> Artificial Sequence
<220><220>
<223> dLBCPF1-ABE7.10-2NLS<223> dLBCPF1-ABE7.10-2NLS
<400> 12<400> 12
atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60
tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120
gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180
ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240
atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300
tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360
aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420
gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480
aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540
ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600
gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660
atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720
gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780
catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840
aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900
ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960
ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020
atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080
ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140
tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200
tctggaggat ctagcggagg atctcttaag tcaaagctcg agaaattcac caactgttat 1260
tcgttgagca aaacactgcg gtttaaagcg attccagtcg gcaagactca agagaatata 1320
gacaataagc ggctgttggt ggaagatgaa aagcgcgcgg aagactacaa aggggtgaag 1380
aagttgttgg acagatacta cctctctttt atcaatgatg tcttgcactc aatcaaattg 1440
aagaatctga acaactacat ctccctcttc agaaagaaaa caaggacaga aaaggagaat 1500
aaggaacttg aaaatttgga gatcaatctg aggaaagaga tcgcgaaagc ctttaaaggc 1560
aacgaaggat acaaaagtct gttcaagaag gatataattg agacaatttt gccagagttc 1620
ctcgatgaca aggacgagat tgcgctggtc aattcgttca acggattcac aacagcattc 1680
acaggcttct ttgataatcg ggaaaatatg ttctctgagg aggcaaagtc cacttctatt 1740
gcgttcaggt gtatcaatga gaatctcact aggtacattt ccaacatgga tatctttgag 1800
aaggttgacg caatttttga caagcacgaa gttcaggaga ttaaggagaa gatcctcaat 1860
tccgattatg acgttgagga cttcttcgaa ggtgagtttt ttaatttcgt gctcactcaa 1920
gagggtatcg acgtgtataa tgcgatcatc ggtgggttcg tgactgagtc cggtgaaaag 1980
attaagggat tgaacgagta tatcaacctt tacaaccaaa agacgaaaca gaagctgcca 2040
aagttcaagc ctctttacaa acaggttctt tcagaccgcg agtcactctc gttctatggg 2100
gagggctaca cttcggatga ggaagtcctg gaggtgttca ggaatactct caataagaat 2160
tcggagattt tctcttctat aaaaaaactg gaaaagttgt ttaagaattt tgacgaatac 2220
tctagcgccg gcatatttgt gaaaaacggc ccggccatat caacgataag taaagatatc 2280
ttcggcgaat ggaacgtgat cagagacaaa tggaacgcgg agtatgacga tattcacctg 2340
aagaagaagg ctgtcgtaac ggagaagtac gaggatgatc gcaggaaaag cttcaaaaag 2400
atcggaagtt tcagcctgga acagttgcag gagtatgctg acgccgatct tagcgtcgtc 2460
gagaagttga aggagataat catccaaaag gtcgacgaga tatataaagt ctatggatca 2520
agtgaaaaac tgttcgacgc cgacttcgtt ttggagaagt ccctgaagaa gaacgacgct 2580
gttgttgcca ttatgaagga tctgctcgac agcgtgaaga gtttcgagaa ctatattaag 2640
gcttttttcg gggaggggaa ggagactaac agagatgagt ccttctacgg agacttcgtc 2700
ctcgcgtacg atatactcct taaggtagac cacatctacg acgcaatcag aaattacgtg 2760
acacaaaagc cgtacagcaa ggacaagttc aaactctact tccagaaccc ccagttcatg 2820
ggcggctggg acaaggacaa ggaaacggat tacagggcta cgatcctgag gtatggttca 2880
aaatactact tggcgattat ggacaagaag tacgccaagt gtctccagaa gattgacaaa 2940
gacgatgtca atggcaatta tgagaagatc aactacaagc tgcttccggg tccgaacaag 3000
atgctcccaa aggttttctt cagcaagaaa tggatggcct actataaccc aagcgaggac 3060
atccagaaga tttataagaa cggtacgttc aagaagggcg acatgttcaa tcttaacgac 3120
tgtcacaagc tgatcgactt cttcaaagac tcaattagcc ggtacccaaa gtggtctaac 3180
gcctatgact tcaacttttc ggaaaccgag aagtacaagg atatagccgg attttataga 3240
gaggtggaag agcagggcta caaggtgtca ttcgagtccg ccagcaagaa ggaagtggac 3300
aagctcgtgg aagagggtaa gctctacatg ttccagattt ataataaaga ctttagcgat 3360
aagagccacg ggacacctaa tctccacaca atgtatttca agctgctctt cgacgagaat 3420
aaccacggcc aaatcaggtt gtcaggaggg gctgaactct tcatgcggcg cgctagcctt 3480
aagaaggagg agcttgtagt ccaccctgcg aatagtccaa ttgcgaataa gaacccggac 3540
aatcctaaaa agactacaac attgagctac gacgtgtaca aggataagag gttttccgag 3600
gatcagtacg agctccacat cccgattgcg atcaacaagt gcccaaagaa tattttcaag 3660
ataaacacag aggtgcgtgt actcctgaag catgacgaca atccttacgt cattgggatt 3720
gctcggggcg agaggaacct cctctatatt gtggtggtgg acgggaaggg gaacatagtc 3780
gaacagtact cccttaacga aataattaac aatttcaacg gcatccgtat caagaccgac 3840
taccattcgt tgctggacaa gaaggagaag gagagatttg aggcgcggca aaattggaca 3900
agtatcgaga acatcaagga actcaaagca ggttatatct ctcaagttgt gcataagata 3960
tgcgagctgg ttgagaagta tgacgcagtg atcgctcttg aggacctcaa ctcgggcttt 4020
aagaattcta gagttaaagt ggagaagcag gtctatcaaa agttcgagaa gatgcttata 4080
gataagctca actacatggt cgataagaaa tcgaacccat gtgccaccgg cggcgcactc 4140
aaaggttacc aaataacaaa caaattcgag tccttcaaat cgatgagtac tcagaatggg 4200
ttcatatttt atataccggc gtggcttacg tctaagatcg acccgtcaac tggttttgtc 4260
aacctgttga agacgaaata cacgtccatt gccgattcga aaaagttcat atctagtttt 4320
gatcgtatta tgtacgtccc agaggaagat cttttcgagt ttgctctcga ctacaaaaac 4380
ttttcgcgga ccgatgcgga ttacattaaa aaatggaaac tctattcgta cggcaacaga 4440
atcaggattt ttcgcaaccc taagaagaat aacgtctttg attgggagga agtttgcttg 4500
actagcgcgt acaaggagct ctttaataag tatggcatta actaccaaca gggtgatatc 4560
agagcactgc tttgcgaaca atctgacaag gctttctact catccttcat ggctttgatg 4620
agcctgatgc tccagatgag aaattcaatt acaggcagaa ccgacgtgga tttcttgatc 4680
tccccggtta aaaattctga tggcatcttt tacgatagca ggaactatga agcgcaagag 4740
aatgcgattc tgccaaaaaa tgcagacgcc aacggtgcct ataacatcgc caggaaagtc 4800
ctgtgggcga tcggccagtt caaaaaggcc gaagacgaaa aattggacaa ggtcaaaatc 4860
gctatcagca acaaagagtg gctggagtat gctcagacat ccgtaaagca ttccggcggc 4920
agcccaaaga agaagaggaa ggtgagcggc ggcagcccaa agaagaagcg caaggtctag 4980
<210> 13
<211> 6506
<212> DNA
<213> Artificial Sequence
<220>
<223> dLBCPF1-ABE2-X
<400> 13
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660
cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720
ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780
gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840
caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900
ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960
tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020
taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080
atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140
cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200
ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500
tctgcaggtc gaagcttgaa gcaaacatgc caaaaaagaa gagaaaggtt tcaggcggct 1560
cctccgaggt ggagttctct cacgagtatt ggatgaggca cgctcttaca cttgctaaga 1620
gagcttggga cgaaagagaa gtgccagttg gcgccgttct tgtgcataat aatagggtga 1680
tcggcgaggg ttggaataga ccaattggaa ggcatgatcc aacagctcac gcagagatta 1740
tggctctcag acaaggcggc ctcgttatgc agaactacag gctcattgac gctacactct 1800
acgtgacact cgaaccttgc gttatgtgcg ccggagctat gattcattct aggattggca 1860
gggtcgtgtt tggagctagg gacgctaaaa caggagccgc cggatctctt atggacgtgt 1920
tgcatcatcc aggcatgaac catagggtgg agattacaga gggcattctt gcagacgagt 1980
gcgctgctct tctttccgat ttcttcagga tgagaaggca ggagattaag gcccagaaga 2040
aggctcagtc ttctacagat agcggaggat cttccggagg atctagcggc tccgagacac 2100
caggaacatc cgaaagcgct acaccagaat ctagcggagg ctcttccgga ggatcttctg 2160
aagtggagtt ctcccacgag tattggatga ggcacgctct tacacttgct aaaagggcta 2220
gggacgaaag ggaagttcca gttggagctg ttctcgtgct caataacagg gtgattggcg 2280
agggttggaa tagagccatt ggactccatg atccaacagc tcacgcagag attatggctc 2340
ttagacaagg cggcctcgtt atgcagaatt acagactcat cgacgccaca ctctacgtta 2400
ccttcgaacc ttgcgttatg tgcgccggag ctatgatcca ttctaggatt ggcagggtcg 2460
tgttcggcgt tagaaacgct aagacaggag ctgcaggctc tcttatggac gttcttcatt 2520
acccaggcat gaatcataga gtggagatca cagaaggcat tcttgcagac gagtgcgcag 2580
ctctcctttg ctatttcttc aggatgccga ggcaagtttt caacgctcag aagaaggccc 2640
agtcttctac agattccggc ggatcttctg gaggatctag cggctccgag acaccaggaa 2700
catccgaatc cgctacacca gagtcttctg gaggatctag cggaggatct cttaagtcaa 2760
agctcgagaa attcaccaac tgttattcgt tgagcaaaac actgcggttt aaagcgattc 2820
cagtcggcaa gactcaagag aatatagaca ataagcggct gttggtggaa gatgaaaagc 2880
gcgcggaaga ctacaaaggg gtgaagaagt tgttggacag atactacctc tcttttatca 2940
atgatgtctt gcactcaatc aaattgaaga atctgaacaa ctacatctcc ctcttcagaa 3000
agaaaacaag gacagaaaag gagaataagg aacttgaaaa tttggagatc aatctgagga 3060
aagagatcgc gaaagccttt aaaggcaacg aaggatacaa aagtctgttc aagaaggata 3120
taattgagac aattttgcca gagttcctcg atgacaagga cgagattgcg ctggtcaatt 3180
cgttcaacgg attcacaaca gcattcacag gcttctttga taatcgggaa aatatgttct 3240
ctgaggaggc aaagtccact tctattgcgt tcaggtgtat caatgagaat ctcactaggt 3300
acatttccaa catggatatc tttgagaagg ttgacgcaat ttttgacaag cacgaagttc 3360
aggagattaa ggagaagatc ctcaattccg attatgacgt tgaggacttc ttcgaaggtg 3420
agttttttaa tttcgtgctc actcaagagg gtatcgacgt gtataatgcg atcatcggtg 3480
ggttcgtgac tgagtccggt gaaaagatta agggattgaa cgagtatatc aacctttaca 3540
accaaaagac gaaacagaag ctgccaaagt tcaagcctct ttacaaacag gttctttcag 3600
accgcgagtc actctcgttc tatggggagg gctacacttc ggatgaggaa gtcctggagg 3660
tgttcaggaa tactctcaat aagaattcgg agattttctc ttctataaaa aaactggaaa 3720
agttgtttaa gaattttgac gaatactcta gcgccggcat atttgtgaaa aacggcccgg 3780
ccatatcaac gataagtaaa gatatcttcg gcgaatggaa cgtgatcaga gacaaatgga 3840
acgcggagta tgacgatatt cacctgaaga agaaggctgt cgtaacggag aagtacgagg 3900
atgatcgcag gaaaagcttc aaaaagatcg gaagtttcag cctggaacag ttgcaggagt 3960
atgctgacgc cgatcttagc gtcgtcgaga agttgaagga gataatcatc caaaaggtcg 4020
acgagatata taaagtctat ggatcaagtg aaaaactgtt cgacgccgac ttcgttttgg 4080
agaagtccct gaagaagaac gacgctgttg ttgccattat gaaggatctg ctcgacagcg 4140
tgaagagttt cgagaactat attaaggctt ttttcgggga ggggaaggag actaacagag 4200
atgagtcctt ctacggagac ttcgtcctcg cgtacgatat actccttaag gtagaccaca 4260
tctacgacgc aatcagaaat tacgtgacac aaaagccgta cagcaaggac aagttcaaac 4320
tctacttcca gaacccccag ttcatgggcg gctgggacaa ggacaaggaa acggattaca 4380
gggctacgat cctgaggtat ggttcaaaat actacttggc gattatggac aagaagtacg 4440
ccaagtgtct ccagaagatt gacaaagacg atgtcaatgg caattatgag aagatcaact 4500
acaagctgct tccgggtccg aacaagatgc tcccaaaggt tttcttcagc aagaaatgga 4560
tggcctacta taacccaagc gaggacatcc agaagattta taagaacggt acgttcaaga 4620
agggcgacat gttcaatctt aacgactgtc acaagctgat cgacttcttc aaagactcaa 4680
ttagccggta cccaaagtgg tctaacgcct atgacttcaa cttttcggaa accgagaagt 4740
acaaggatat agccggattt tatagagagg tggaagagca gggctacaag gtgtcattcg 4800
agtccgccag caagaaggaa gtggacaagc tcgtggaaga gggtaagctc tacatgttcc 4860
agatttataa taaagacttt agcgataaga gccacgggac acctaatctc cacacaatgt 4920
atttcaagct gctcttcgac gagaataacc acggccaaat caggttgtca ggaggggctg 4980
aactcttcat gcggcgcgct agccttaaga aggaggagct tgtagtccac cctgcgaata 5040
gtccaattgc gaataagaac ccggacaatc ctaaaaagac tacaacattg agctacgacg 5100
tgtacaagga taagaggttt tccgaggatc agtacgagct ccacatcccg attgcgatca 5160
acaagtgccc aaagaatatt ttcaagataa acacagaggt gcgtgtactc ctgaagcatg 5220
acgacaatcc ttacgtcatt gggattgctc ggggcgagag gaacctcctc tatattgtgg 5280
tggtggacgg gaaggggaac atagtcgaac agtactccct taacgaaata attaacaatt 5340
tcaacggcat ccgtatcaag accgactacc attcgttgct ggacaagaag gagaaggaga 5400
gatttgaggc gcggcaaaat tggacaagta tcgagaacat caaggaactc aaagcaggtt 5460
atatctctca agttgtgcat aagatatgcg agctggttga gaagtatgac gcagtgatcg 5520
ctcttgagga cctcaactcg ggctttaaga attctagagt taaagtggag aagcaggtct 5580
atcaaaagtt cgagaagatg cttatagata agctcaacta catggtcgat aagaaatcga 5640
acccatgtgc caccggcggc gcactcaaag gttaccaaat aacaaacaaa ttcgagtcct 5700
tcaaatcgat gagtactcag aatgggttca tattttatat accggcgtgg cttacgtcta 5760
agatcgaccc gtcaactggt tttgtcaacc tgttgaagac gaaatacacg tccattgccg 5820
attcgaaaaa gttcatatct agttttgatc gtattatgta cgtcccagag gaagatcttt 5880
tcgagtttgc tctcgactac aaaaactttt cgcggaccga tgcggattac attaaaaaat 5940
ggaaactcta ttcgtacggc aacagaatca ggatttttcg caaccctaag aagaataacg 6000
tctttgattg ggaggaagtt tgcttgacta gcgcgtacaa ggagctcttt aataagtatg 6060
gcattaacta ccaacagggt gatatcagag cactgctttg cgaacaatct gacaaggctt 6120
tctactcatc cttcatggct ttgatgagcc tgatgctcca gatgagaaat tcaattacag 6180
gcagaaccga cgtggatttc ttgatctccc cggttaaaaa ttctgatggc atcttttacg 6240
atagcaggaa ctatgaagcg caagagaatg cgattctgcc aaaaaatgca gacgccaacg 6300
gtgcctataa catcgccagg aaagtcctgt gggcgatcgg ccagttcaaa aaggccgaag 6360
acgaaaaatt ggacaaggtc aaaatcgcta tcagcaacaa agagtggctg gagtatgctc 6420
agacatccgt aaagcattcc ggcggcagcc caaagaagaa gaggaaggtg agcggcggca 6480
gcccaaagaa gaagcgcaag gtctag 6506
<210> 14
<211> 4971
<212> DNA
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X2
<400> 14
atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60
gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120
gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180
tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240
ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300
ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360
aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420
gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480
atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540
actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600
gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660
gaaggtgagt tttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720
atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780
ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840
ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900
ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960
ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020
ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080
aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140
tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200
caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagat aatcatccaa 1260
aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320
gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380
gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440
aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500
gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560
ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620
gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680
aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740
atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800
aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860
ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920
gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980
gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040
tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100
atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160
acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220
ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280
gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340
tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400
gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460
aagcatgacg acaatcctta cgtcattggg attgctcggg gcgagaggaa cctcctctat 2520
attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580
aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640
aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700
gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760
gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820
caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880
aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940
gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000
acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060
attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120
gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180
aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240
aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300
aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360
aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420
attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480
ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540
gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600
gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660
tatgctcaga catccgtaaa gcataagctt atgccaaaaa agaagagaaa ggtttcaggc 3720
ggctcctccg aggtggagtt ctctcacgag tattggatga ggcacgctct tacacttgct 3780
aagagagctt gggacgaaag agaagtgcca gttggcgccg ttcttgtgca taataatagg 3840
gtgatcggcg agggttggaa tagaccaatt ggaaggcatg atccaacagc tcacgcagag 3900
attatggctc tcagacaagg cggcctcgtt atgcagaact acaggctcat tgacgctaca 3960
ctctacgtga cactcgaacc ttgcgttatg tgcgccggag ctatgattca ttctaggatt 4020
ggcagggtcg tgtttggagc tagggacgct aaaacaggag ccgccggatc tcttatggac 4080
gtgttgcatc atccaggcat gaaccatagg gtggagatta cagagggcat tcttgcagac 4140
gagtgcgctg ctcttctttc cgatttcttc aggatgagaa ggcaggagat taaggcccag 4200
aagaaggctc agtcttctac agatagcgga ggatcttccg gaggatctag cggctccgag 4260
acaccaggaa catccgaaag cgctacacca gaatctagcg gaggctcttc cggaggatct 4320
tctgaagtgg agttctccca cgagtattgg atgaggcacg ctcttacact tgctaaaagg 4380
gctagggacg aaagggaagt tccagttgga gctgttctcg tgctcaataa cagggtgatt 4440
ggcgagggtt ggaatagagc cattggactc catgatccaa cagctcacgc agagattatg 4500
gctcttagac aaggcggcct cgttatgcag aattacagac tcatcgacgc cacactctac 4560
gttaccttcg aaccttgcgt tatgtgcgcc ggagctatga tccattctag gattggcagg 4620
gtcgtgttcg gcgttagaaa cgctaagaca ggagctgcag gctctcttat ggacgttctt 4680
cattacccag gcatgaatca tagagtggag atcacagaag gcattcttgc agacgagtgc 4740
gcagctctcc tttgctattt cttcaggatg ccgaggcaag ttttcaacgc tcagaagaag 4800
gcccagtctt ctacagattc cggcggatct tctggaggat ctagcggctc cgagacacca 4860
ggaacatccg aatccgctac accagagtct tctggaggat ctagcggagg atctcttaag 4920
aagagaccag cagctacaaa gaaggccgga caagctaaga agaagaagta g 4971
<210> 15
<211> 4992
<212> DNA
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X3
<400> 15
atgccaaaaa agaagagaaa ggtttcaaag ctcgagaaat tcaccaactg ttattcgttg 60
agcaaaacac tgcggtttaa agcgattcca gtcggcaaga ctcaagagaa tatagacaat 120
aagcggctgt tggtggaaga tgaaaagcgc gcggaagact acaaaggggt gaagaagttg 180
ttggacagat actacctctc ttttatcaat gatgtcttgc actcaatcaa attgaagaat 240
ctgaacaact acatctccct cttcagaaag aaaacaagga cagaaaagga gaataaggaa 300
cttgaaaatt tggagatcaa tctgaggaaa gagatcgcga aagcctttaa aggcaacgaa 360
ggatacaaaa gtctgttcaa gaaggatata attgagacaa ttttgccaga gttcctcgat 420
gacaaggacg agattgcgct ggtcaattcg ttcaacggat tcacaacagc attcacaggc 480
ttctttgata atcgggaaaa tatgttctct gaggaggcaa agtccacttc tattgcgttc 540
aggtgtatca atgagaatct cactaggtac atttccaaca tggatatctt tgagaaggtt 600
gacgcaattt ttgacaagca cgaagttcag gagattaagg agaagatcct caattccgat 660
tatgacgttg aggacttctt cgaaggtgag ttttttaatt tcgtgctcac tcaagagggt 720
atcgacgtgt ataatgcgat catcggtggg ttcgtgactg agtccggtga aaagattaag 780
ggattgaacg agtatatcaa cctttacaac caaaagacga aacagaagct gccaaagttc 840
aagcctcttt acaaacaggt tctttcagac cgcgagtcac tctcgttcta tggggagggc 900
tacacttcgg atgaggaagt cctggaggtg ttcaggaata ctctcaataa gaattcggag 960
attttctctt ctataaaaaa actggaaaag ttgtttaaga attttgacga atactctagc 1020
gccggcatat ttgtgaaaaa cggcccggcc atatcaacga taagtaaaga tatcttcggc 1080
gaatggaacg tgatcagaga caaatggaac gcggagtatg acgatattca cctgaagaag 1140
aaggctgtcg taacggagaa gtacgaggat gatcgcagga aaagcttcaa aaagatcgga 1200
agtttcagcc tggaacagtt gcaggagtat gctgacgccg atcttagcgt cgtcgagaag 1260
ttgaaggaga taatcatcca aaaggtcgac gagatatata aagtctatgg atcaagtgaa 1320
aaactgttcg acgccgactt cgttttggag aagtccctga agaagaacga cgctgttgtt 1380
gccattatga aggatctgct cgacagcgtg aagagtttcg agaactatat taaggctttt 1440
ttcggggagg ggaaggagac taacagagat gagtccttct acggagactt cgtcctcgcg 1500
tacgatatac tccttaaggt agaccacatc tacgacgcaa tcagaaatta cgtgacacaa 1560
aagccgtaca gcaaggacaa gttcaaactc tacttccaga acccccagtt catgggcggc 1620
tgggacaagg acaaggaaac ggattacagg gctacgatcc tgaggtatgg ttcaaaatac 1680
tacttggcga ttatggacaa gaagtacgcc aagtgtctcc agaagattga caaagacgat 1740
gtcaatggca attatgagaa gatcaactac aagctgcttc cgggtccgaa caagatgctc 1800
ccaaaggttt tcttcagcaa gaaatggatg gcctactata acccaagcga ggacatccag 1860
aagatttata agaacggtac gttcaagaag ggcgacatgt tcaatcttaa cgactgtcac 1920
aagctgatcg acttcttcaa agactcaatt agccggtacc caaagtggtc taacgcctat 1980
gacttcaact tttcggaaac cgagaagtac aaggatatag ccggatttta tagagaggtg 2040
gaagagcagg gctacaaggt gtcattcgag tccgccagca agaaggaagt ggacaagctc 2100
gtggaagagg gtaagctcta catgttccag atttataata aagactttag cgataagagc 2160
cacgggacac ctaatctcca cacaatgtat ttcaagctgc tcttcgacga gaataaccac 2220
ggccaaatca ggttgtcagg aggggctgaa ctcttcatgc ggcgcgctag ccttaagaag 2280
gaggagcttg tagtccaccc tgcgaatagt ccaattgcga ataagaaccc ggacaatcct 2340
aaaaagacta caacattgag ctacgacgtg tacaaggata agaggttttc cgaggatcag 2400
tacgagctcc acatcccgat tgcgatcaac aagtgcccaa agaatatttt caagataaac 2460
acagaggtgc gtgtactcct gaagcatgac gacaatcctt acgtcattgg gattgctcgg 2520
ggcgagagga acctcctcta tattgtggtg gtggacggga aggggaacat agtcgaacag 2580
tactccctta acgaaataat taacaatttc aacggcatcc gtatcaagac cgactaccat 2640
tcgttgctgg acaagaagga gaaggagaga tttgaggcgc ggcaaaattg gacaagtatc 2700
gagaacatca aggaactcaa agcaggttat atctctcaag ttgtgcataa gatatgcgag 2760
ctggttgaga agtatgacgc agtgatcgct cttgaggacc tcaactcggg ctttaagaat 2820
tctagagtta aagtggagaa gcaggtctat caaaagttcg agaagatgct tatagataag 2880
ctcaactaca tggtcgataa gaaatcgaac ccatgtgcca ccggcggcgc actcaaaggt 2940
taccaaataa caaacaaatt cgagtccttc aaatcgatga gtactcagaa tgggttcata 3000
ttttatatac cggcgtggct tacgtctaag atcgacccgt caactggttt tgtcaacctg 3060
ttgaagacga aatacacgtc cattgccgat tcgaaaaagt tcatatctag ttttgatcgt 3120
attatgtacg tcccagagga agatcttttc gagtttgctc tcgactacaa aaacttttcg 3180
cggaccgatg cggattacat taaaaaatgg aaactctatt cgtacggcaa cagaatcagg 3240
atttttcgca accctaagaa gaataacgtc tttgattggg aggaagtttg cttgactagc 3300
gcgtacaagg agctctttaa taagtatggc attaactacc aacagggtga tatcagagca 3360
ctgctttgcg aacaatctga caaggctttc tactcatcct tcatggcttt gatgagcctg 3420
atgctccaga tgagaaattc aattacaggc agaaccgacg tggatttctt gatctccccg 3480
gttaaaaatt ctgatggcat cttttacgat agcaggaact atgaagcgca agagaatgcg 3540
attctgccaa aaaatgcaga cgccaacggt gcctataaca tcgccaggaa agtcctgtgg 3600
gcgatcggcc agttcaaaaa ggccgaagac gaaaaattgg acaaggtcaa aatcgctatc 3660
agcaacaaag agtggctgga gtatgctcag acatccgtaa agcataagct tatgccaaaa 3720
aagaagagaa aggtttcagg cggctcctcc gaggtggagt tctctcacga gtattggatg 3780
aggcacgctc ttacacttgc taagagagct tgggacgaaa gagaagtgcc agttggcgcc 3840
gttcttgtgc ataataatag ggtgatcggc gagggttgga atagaccaat tggaaggcat 3900
gatccaacag ctcacgcaga gattatggct ctcagacaag gcggcctcgt tatgcagaac 3960
tacaggctca ttgacgctac actctacgtg acactcgaac cttgcgttat gtgcgccgga 4020
gctatgattc attctaggat tggcagggtc gtgtttggag ctagggacgc taaaacagga 4080
gccgccggat ctcttatgga cgtgttgcat catccaggca tgaaccatag ggtggagatt 4140
acagagggca ttcttgcaga cgagtgcgct gctcttcttt ccgatttctt caggatgaga 4200
aggcaggaga ttaaggccca gaagaaggct cagtcttcta cagatagcgg aggatcttcc 4260
ggaggatcta gcggctccga gacaccagga acatccgaaa gcgctacacc agaatctagc 4320
ggaggctctt ccggaggatc ttctgaagtg gagttctccc acgagtattg gatgaggcac 4380
gctcttacac ttgctaaaag ggctagggac gaaagggaag ttccagttgg agctgttctc 4440
gtgctcaata acagggtgat tggcgagggt tggaatagag ccattggact ccatgatcca 4500
acagctcacg cagagattat ggctcttaga caaggcggcc tcgttatgca gaattacaga 4560
ctcatcgacg ccacactcta cgttaccttc gaaccttgcg ttatgtgcgc cggagctatg 4620
atccattcta ggattggcag ggtcgtgttc ggcgttagaa acgctaagac aggagctgca 4680
ggctctctta tggacgttct tcattaccca ggcatgaatc atagagtgga gatcacagaa 4740
ggcattcttg cagacgagtg cgcagctctc ctttgctatt tcttcaggat gccgaggcaa 4800
gttttcaacg ctcagaagaa ggcccagtct tctacagatt ccggcggatc ttctggagga 4860
tctagcggct ccgagacacc aggaacatcc gaatccgcta caccagagtc ttctggagga 4920
tctagcggag gatctcttaa gaagagacca gcagctacaa agaaggccgg acaagctaag 4980
aagaagaagt ag 4992
<210> 16
<211> 5686
<212> DNA
<213> Artificial Sequence
<220>
<223> PJIT163-GFP
<400> 16
gagctcggta cctgacccgg tcgtgcccct ctctagagat aatgagcatt gcatgtctaa 60
gttataaaaa attaccacat attttttttg tcacacttgt ttgaagtgca gtttatctat 120
ctttatacat atatttaaac tttactctac gaataatata atctatagta ctacaataat 180
atcagtgttt tagagaatca tataaatgaa cagttagaca tggtctaaag gacaattgag 240
tattttgaca acaggactct acagttttat ctttttagtg tgcatgtgtt ctcctttttt 300
tttgcaaata gcttcaccta tataatactt catccatttt attagtacat ccatttaggg 360
tttagggtta atggttttta tagactaatt tttttagtac atctatttta ttctatttta 420
gcctctaaat taagaaaact aaaactctat tttagttttt ttatttaata atttagatat 480
aaaatagaat aaaataaagt gactaaaaat taaacaaata ccctttaaga aattaaaaaa 540
actaaggaaa catttttctt gtttcgagta gataatgcca gcctgttaaa cgccgtcgac 600
gagtctaacg gacaccaacc agcgaaccag cagcgtcgcg tcgggccaag cgaagcagac 660
ggcacggcat ctctgtcgct gcctctggac ccctctcgat cgagagttcc gctccaccgt 720
tggacttgct ccgctgtcgg catccagaaa ttgcgtggcg gagcggcaga cgtgagccgg 780
cacggcaggc ggcctcctcc tcctctcacg gcaccggcag ctacggggga ttcctttccc 840
accgctcctt cgctttccct tcctcgcccg ccgtaataaa tagacacccc ctccacaccc 900
tctttcccca acctcgtgtt gttcggagcg cacacacaca caaccagatc tcccccaaat 960
ccacccgtcg gcacctccgc ttcaaggtac gccgctcgtc ctcccccccc ccccctctct 1020
accttctcta gatcggcgtt ccggtccatg gttagggccc ggtagttcta cttctgttca 1080
tgtttgtgtt agatccgtgt ttgtgttaga tccgtgctgc tagcgttcgt acacggatgc 1140
gacctgtacg tcagacacgt tctgattgct aacttgccag tgtttctctt tggggaatcc 1200
tgggatggct ctagccgttc cgcagacggg atcgatttca tgattttttt tgtttcgttg 1260
catagggttt ggtttgccct tttcctttat ttcaatatat gccgtgcact tgtttgtcgg 1320
gtcatctttt catgcttttt tttgtcttgg ttgtgatgat gtggtctggt tgggcggtcg 1380
ttctagatcg gagtagaatt aattctgttt caaactacct ggtggattta ttaattttgg 1440
atctgtatgt gtgtgccata catattcata gttacgaatt gaagatgatg gatggaaata 1500
tcgatctagg ataggtatac atgttgatgc gggttttact gatgcatata cagagatgct 1560
ttttgttcgc ttggttgtga tgatgtggtg tggttgggcg gtcgttcatt cgttctagat 1620
cggagtagaa tactgtttca aactacctgg tgtatttatt aattttggaa ctgtatgtgt 1680
gtgtcataca tcttcatagt tacgagttta agatggatgg aaatatcgat ctaggatagg 1740
tatacatgtt gatgtgggtt ttactgatgc atatacatga tggcatatgc agcatctatt 1800
catatgctct aaccttgagt acctatctat tataataaac aagtatgttt tataattatt 1860
ttgatcttga tatacttgga tgatggcata tgcagcagct atatgtggat ttttttagcc 1920
ctgccttcat acgctattta tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt 1980
gtttggtgtt acttctgcaa agcttgtcga cggatccatg gtgagcaagg gcgaggagct 2040
gttcaccggg gtggtgccca tcctggtcga gctggacggc gacgtaaacg gccacaagtt 2100
cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc aagctgaccc tgaagttcat 2160
ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc gtgaccacct tcacctacgg 2220
cgtgcagtgc ttcagccgct accccgacca catgaagcag cacgacttct tcaagtccgc 2280
catgcccgaa ggctacgtcc aggagcgcac catcttcttc aaggacgacg gcaactacaa 2340
gacccgcgcc gaggtgaagt tcgagggcga caccctggtg aaccgcatcg agctgaaggg 2400
catcgacttc aaggaggacg gcaacatcct ggggcacaag ctggagtaca actacaacag 2460
ccacaacgtc tatatcatgg ccgacaagca gaagaacggc atcaaggtga acttcaagat 2520
ccgccacaac atcgaggacg gcagcgtgca gctcgccgac cactaccagc agaacacccc 2580
catcggcgac ggccccgtgc tgctgcccga caaccactac ctgagcaccc agtccgccct 2640
gagcaaagac cccaacgaga agcgcgatca catggtcctg ctggagttcg tgaccgccgc 2700
cgggatcact cacggcatgg acgagctgta caagtaaccc gggaattcgg tacgctgaaa 2760
tcaccagtct ctctctacaa atctatctct ctctattttc tccataaata atgtgtgagt 2820
agtttcccga taagggaaat tagggttctt atagggtttc gctcatgtgt tgagcatata 2880
agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa tttctaattc 2940
ctaaaaccaa aatccagtac taaaatccag atctcctaaa gtccctatag atctttgtcg 3000
tgaatataaa ccagacacga gacgactaaa cctggagccc agacgccgtt cgaagctaga 3060
agtaccgctt aggcaggagg ccgttaggga aaagatgcta aggcagggtt ggttacgttg 3120
actcccccgt aggtttggtt taaatatgat gaagtggacg gaaggaagga ggaagacaag 3180
gaaggataag gttgcaggcc ctgtgcaagg taagaagatg gaaatttgat agaggtacgc 3240
tactatactt atactatacg ctaagggaat gcttgtattt ataccctata ccccctaata 3300
accccttatc aatttaagaa ataatccgca taagcccccg cttaaaaatt ggtatcagag 3360
ccatgaatag gtctatgacc aaaactcaag aggataaaac ctcaccaaaa tacgaaagag 3420
ttcttaactc taaagataaa agatctttca agatcaaaac tagttccctc acaccggagc 3480
atgcgatatc ctcgagagat ctaggcgtaa tcatggtcat agctgtttcc tgtgtgaaat 3540
tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg 3600
ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag 3660
tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 3720
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 3780
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 3840
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 3900
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 3960
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 4020
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 4080
tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta tctcagttcg 4140
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 4200
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 4260
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 4320
ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 4380
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 4440
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 4500
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 4560
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4620
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 4680
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 4740
gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 4800
gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 4860
ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 4920
attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 4980
gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 5040
tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 5100
agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 5160
gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 5220
actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 5280
tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 5340
attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 5400
tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 5460
tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 5520
aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 5580
tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 5640
cgcacatttc cccgaaaagt gccacctgcc agtgccaagc taattc 5686
<210> 17
<211> 720
<212> DNA
<213> Artificial Sequence
<220>
<223> pBUI-mGFP
<400> 17
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180
ctcgtgacca ccttcaccta cggcgtgtag tgcttcagcc gctaccccga ccacatgaag 240
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420
aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480
ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660
ctgctggagt tcgtgaccgc cgccgggatc actcacggca tggacgagct gtacaagtaa 720
<210> 18
<211> 1307
<212> PRT
<213> Acidaminococcus sp.
<400> 18
Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln
20 25 30
Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys
35 40 45
Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln
50 55 60
Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile
65 70 75 80
Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile
85 90 95
Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly
100 105 110
Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile
115 120 125
Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys
130 135 140
Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg
145 150 155 160
Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg
165 170 175
Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg
180 185 190
Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe
195 200 205
Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn
210 215 220
Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val
225 230 235 240
Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp
245 250 255
Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu
260 265 270
Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn
275 280 285
Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro
290 295 300
Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu
305 310 315 320
Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr
325 330 335
Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu
340 345 350
Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His
355 360 365
Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr
370 375 380
Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys
385 390 395 400
Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu
405 410 415
Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser
420 425 430
Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala
435 440 445
Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys
450 455 460
Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu
465 470 475 480
Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe
485 490 495
Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser
500 505 510
Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val
515 520 525
Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp
530 535 540
Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn
545 550 555 560
Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys
565 570 575
Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys
580 585 590
Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys
595 600 605
Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr
610 615 620
Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys
625 630 635 640
Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln
645 650 655
Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala
660 665 670
Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr
675 680 685
Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr
690 695 700
Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His
705 710 715 720
Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu
725 730 735
Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys
740 745 750
Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu
755 760 765
Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln
770 775 780
Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His
785 790 795 800
Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr
805 810 815
Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His
820 825 830
Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn
835 840 845
Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe
850 855 860
Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln
865 870 875 880
Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu
885 890 895
Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg
900 905 910
Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu
915 920 925
Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu
930 935 940
Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val
945 950 955 960
Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile
965 970 975
His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu
980 985 990
Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu
995 1000 1005
Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu
1010 1015 1020
Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly
1025 1030 1035
Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala
1040 1045 1050
Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro
1055 1060 1065
Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe
1070 1075 1080
Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu
1085 1090 1095
Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe
1100 1105 1110
Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly
1115 1120 1125
Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn
1130 1135 1140
Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys
1145 1150 1155
Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr
1160 1165 1170
Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu
1175 1180 1185
Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu
1190 1195 1200
Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu
1205 1210 1215
Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly
1220 1225 1230
Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys
1235 1240 1245
Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp
1250 1255 1260
Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu
1265 1270 1275
Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile
1280 1285 1290
Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn
1295 1300 1305
<210> 19
<211> 1300
<212> PRT
<213> Francisella novicida
<400> 19
Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys
20 25 30
Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys
35 40 45
Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu
50 55 60
Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser
65 70 75 80
Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys
85 90 95
Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr
100 105 110
Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile
115 120 125
Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln
130 135 140
Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr
145 150 155 160
Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr
165 170 175
Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser
180 185 190
Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu
195 200 205
Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys
210 215 220
Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu
225 230 235 240
Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg
245 250 255
Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr
260 265 270
Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys
275 280 285
Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile
290 295 300
Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys
305 310 315 320
Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser
325 330 335
Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met
340 345 350
Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys
355 360 365
Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln
370 375 380
Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr
385 390 395 400
Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala
405 410 415
Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn
420 425 430
Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala
435 440 445
Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn
450 455 460
Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala
465 470 475 480
Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys
485 490 495
Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys
500 505 510
Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp
515 520 525
Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His
530 535 540
Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His
545 550 555 560
Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val
565 570 575
Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser
580 585 590
Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly
595 600 605
Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys
610 615 620
Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile
625 630 635 640
Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys
645 650 655
Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val
660 665 670
Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile
675 680 685
Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
690 695 700
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe
705 710 715 720
Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp
725 730 735
Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu
740 745 750
Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn
755 760 765
Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr
770 775 780
Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg
785 790 795 800
Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
805 810 815
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr
820 825 830
Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala
835 840 845
Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu
850 855 860
Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe
865 870 875 880
His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe
885 890 895
Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His
900 905 910
Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu
915 920 925
Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
930 935 940
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile
945 950 955 960
Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn
965 970 975
Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile
980 985 990
Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu
995 1000 1005
Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val
1010 1015 1020
Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu
1025 1030 1035
Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg
1040 1045 1050
Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly
1055 1060 1065
Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser
1070 1075 1080
Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys
1085 1090 1095
Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp
1100 1105 1110
Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe
1115 1120 1125
Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr
1130 1135 1140
Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp
1145 1150 1155
Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu
1160 1165 1170
Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly
1175 1180 1185
Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe
1190 1195 1200
Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg
1205 1210 1215
Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val
1220 1225 1230
Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys
1235 1240 1245
Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly
1250 1255 1260
Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu
1265 1270 1275
Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu
1280 1285 1290 1280 1285 1290
Phe Val Gln Asn Arg Asn AsnPhe Val Gln Asn Arg Asn Asn
1295 1300 1295 1300
<210> 20<210> 20
<211> 1228<211> 1228
<212> PRT<212> PRT
<213> Lachnospiraceae bacterium<213> Lachnospiraceae bacterium
<400> 20<400> 20
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile
165 170 175
Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys
180 185 190
Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys
195 200 205
Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe
210 215 220
Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile
225 230 235 240
Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn
245 250 255
Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys
260 265 270
Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser
275 280 285
Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe
290 295 300
Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys
305 310 315 320
Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile
325 330 335
Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe
340 345 350
Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp
355 360 365
Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp
370 375 380
Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu
385 390 395 400
Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu
405 410 415
Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser
420 425 430
Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys
435 440 445
Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys
450 455 460
Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr
465 470 475 480
Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile
485 490 495
Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr
500 505 510
Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro
515 520 525
Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala
530 535 540
Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys
545 550 555 560
Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly
565 570 575
Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met
580 585 590
Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro
595 600 605
Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly
610 615 620
Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys
625 630 635 640
Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn
645 650 655
Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu
660 665 670
Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys
675 680 685
Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
690 695 700
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His
705 710 715 720
Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile
725 730 735
Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys
740 745 750
Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys
755 760 765
Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr
770 775 780
Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile
785 790 795 800
Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
805 810 815
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp
820 825 830
Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly
835 840 845
Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn
850 855 860
Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu
865 870 875 880
Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile
885 890 895
Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys
900 905 910
Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn
915 920 925
Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
930 935 940
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys
945 950 955 960
Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile
965 970 975
Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe
980 985 990
Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr
995 1000 1005
Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp
1010 1015 1020
Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro
1025 1030 1035
Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser
1040 1045 1050
Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr
1055 1060 1065
Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val
1070 1075 1080
Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu
1085 1090 1095
Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala
1100 1105 1110
Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met
1115 1120 1125
Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly
1130 1135 1140
Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp
1145 1150 1155
Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala
1160 1165 1170
Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala
1175 1180 1185
Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp
1190 1195 1200
Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp
1205 1210 1215
Leu Glu Tyr Ala Gln Thr Ser Val Lys His
1220 1225
<210> 21
<211> 1329
<212> PRT
<213> Artificial Sequence
<220>
<223> ASCPF1-2NLS
<400> 21
Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln
20 25 30
Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys
35 40 45
Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln
50 55 60
Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile
65 70 75 80
Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile
85 90 95
Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly
100 105 110
Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile
115 120 125
Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys
130 135 140
Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg
145 150 155 160
Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg
165 170 175
Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg
180 185 190
Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe
195 200 205
Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn
210 215 220
Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val
225 230 235 240
Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp
245 250 255
Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu
260 265 270
Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn
275 280 285
Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro
290 295 300
Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu
305 310 315 320
Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr
325 330 335
Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu
340 345 350
Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His
355 360 365
Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr
370 375 380
Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys
385 390 395 400
Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu
405 410 415
Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser
420 425 430
Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala
435 440 445
Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys
450 455 460
Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu
465 470 475 480
Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe
485 490 495
Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser
500 505 510
Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val
515 520 525
Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp
530 535 540
Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn
545 550 555 560
Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys
565 570 575
Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys
580 585 590
Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys
595 600 605
Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr
610 615 620
Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys
625 630 635 640
Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln
645 650 655
Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala
660 665 670
Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr
675 680 685
Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr
690 695 700
Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His
705 710 715 720
Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu
725 730 735
Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys
740 745 750
Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu
755 760 765
Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln
770 775 780
Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His
785 790 795 800
Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr
805 810 815
Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His
820 825 830
Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn
835 840 845
Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe
850 855 860
Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln
865 870 875 880
Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu
885 890 895
Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg
900 905 910
Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu
915 920 925
Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu
930 935 940
Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val
945 950 955 960
Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile
965 970 975
His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu
980 985 990
Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu
995 1000 1005
Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu
1010 1015 1020
Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly
1025 1030 1035
Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala
1040 1045 1050
Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro
1055 1060 1065
Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe
1070 1075 1080
Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu
1085 1090 1095
Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe
1100 1105 1110
Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly
1115 1120 1125
Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn
1130 1135 1140
Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys
1145 1150 1155
Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr
1160 1165 1170
Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu
1175 1180 1185
Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu
1190 1195 1200
Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu
1205 1210 1215
Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly
1220 1225 1230
Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys
1235 1240 1245
Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp
1250 1255 1260
Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu
1265 1270 1275
Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile
1280 1285 1290
Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn Ser
1295 1300 1305
Gly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro
1310 1315 1320
Lys Lys Lys Arg Lys Val
1325
<210> 22
<211> 1322
<212> PRT
<213> Artificial Sequence
<220>
<223> FNCPF1-2NLS
<400> 22
Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys
20 25 30
Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys
35 40 45
Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu
50 55 60
Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser
65 70 75 80
Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys
85 90 95
Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr
100 105 110
Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile
115 120 125
Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln
130 135 140
Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr
145 150 155 160
Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr
165 170 175
Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser
180 185 190
Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu
195 200 205
Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys
210 215 220
Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu
225 230 235 240
Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg
245 250 255
Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr
260 265 270
Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys
275 280 285
Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile
290 295 300
Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys
305 310 315 320
Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser
325 330 335
Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met
340 345 350
Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys
355 360 365
Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln
370 375 380
Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr
385 390 395 400
Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr AlaAsp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala
405 410 415 405 410 415
Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp AsnVal Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn
420 425 430 420 425 430
Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys AlaPro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala
435 440 445 435 440 445
Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe AsnLys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn
450 455 460 450 455 460
Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu AlaLys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala
465 470 475 480465 470 475 480
Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn LysAsn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys
485 490 495 485 490 495
Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys LysAsp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys
500 505 510 500 505 510
Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys AspAsp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp
515 520 525 515 520 525
Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe HisLeu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His
530 535 540 530 535 540
Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His
545 550 555 560
Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val
565 570 575
Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser
580 585 590
Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly
595 600 605
Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys
610 615 620
Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile
625 630 635 640
Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys
645 650 655
Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val
660 665 670
Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile
675 680 685
Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
690 695 700
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe
705 710 715 720
Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp
725 730 735
Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu
740 745 750
Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn
755 760 765
Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr
770 775 780
Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg
785 790 795 800
Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
805 810 815
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr
820 825 830
Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala
835 840 845
Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu
850 855 860
Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe
865 870 875 880
His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe
885 890 895
Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His
900 905 910
Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu
915 920 925
Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
930 935 940
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile
945 950 955 960
Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn
965 970 975
Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile
980 985 990
Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu
995 1000 1005
Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val
1010 1015 1020
Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu
1025 1030 1035
Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg
1040 1045 1050
Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly
1055 1060 1065
Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser
1070 1075 1080
Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys
1085 1090 1095
Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp
1100 1105 1110
Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe
1115 1120 1125
Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr
1130 1135 1140
Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp
1145 1150 1155
Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu
1160 1165 1170
Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly
1175 1180 1185
Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe
1190 1195 1200
Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg
1205 1210 1215
Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val
1220 1225 1230
Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys
1235 1240 1245
Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly
1250 1255 1260
Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu
1265 1270 1275
Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu
1280 1285 1290
Phe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser Pro Lys Lys Lys
1295 1300 1305
Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1310 1315 1320
<210> 23
<211> 1252
<212> PRT
<213> Artificial Sequence
<220>
<223> LBCPF1-2NLS
<400> 23
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Glu Thr Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg
165 170 175
Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe
180 185 190
Glu Lys Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys
195 200 205
Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly
210 215 220
Glu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn
225 230 235 240
Ala Ile Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly
245 250 255
Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu
260 265 270
Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser
275 280 285
Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu
290 295 300
Val Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile
305 310 315 320
Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala
325 330 335
Gly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp
340 345 350
Ile Phe Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr
355 360 365
Asp Asp Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu
370 375 380
Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu
385 390 395 400
Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu
405 410 415
Lys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly
420 425 430
Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu
435 440 445
Lys Lys Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser
450 455 460
Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys
465 470 475 480
Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr
485 490 495
Asp Ile Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr
500 505 510
Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln
515 520 525
Asn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr
530 535 540
Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met
545 550 555 560
Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val
565 570 575
Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn
580 585 590
Lys Met Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr
595 600 605
Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys
610 615 620
Lys Gly Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe
625 630 635 640
Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp
645 650 655
Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr
660 665 670
Arg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser
675 680 685
Lys Lys Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe
690 695 700
Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn
705 710 715 720
Leu His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly
725 730 735
Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser
740 745 750
Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala
755 760 765
Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp
770 775 780
Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile
785 790 795 800
Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr
805 810 815
Glu Val Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly
820 825 830
Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly
835 840 845
Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn
850 855 860
Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys
865 870 875 880
Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu
885 890 895
Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys
900 905 910
Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp
915 920 925
Leu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val
930 935 940
Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val
945 950 955 960
Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr
965 970 975
Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn
980 985 990
Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro
995 1000 1005
Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile
1010 1015 1020
Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr
1025 1030 1035
Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn
1040 1045 1050
Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr
1055 1060 1065
Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn
1070 1075 1080
Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys
1085 1090 1095
Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile
1100 1105 1110
Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser
1115 1120 1125
Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile
1130 1135 1140
Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn
1145 1150 1155
Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu
1160 1165 1170
Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn
1175 1180 1185
Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala
1190 1195 1200
Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys
1205 1210 1215
Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser Gly Gly
1220 1225 1230
Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys
1235 1240 1245
Lys Arg Lys Val
1250
<210> 24
<211> 1664
<212> PRT
<213> Artificial Sequence
<220>
<223> dFNCPF1-PBE-2NLS
<400> 24
Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala
1 5 10 15
Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val
20 25 30
Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu
35 40 45
Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn
50 55 60
Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu
65 70 75 80
Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser
85 90 95
Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser
100 105 110
Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His
115 120 125
His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser
130 135 140
Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp
145 150 155 160
Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg
165 170 175
Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile
180 185 190
Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro
195 200 205
Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg
210 215 220
Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu
225 230 235 240
Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Ile Tyr
245 250 255
Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu
260 265 270
Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu
275 280 285
Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln
290 295 300
Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser
305 310 315 320
Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe
325 330 335
Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser
340 345 350
Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser
355 360 365
Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys
370 375 380
Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn
385 390 395 400
Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu
405 410 415
Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys
420 425 430
Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro
435 440 445
Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu
450 455 460
Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala
465 470 475 480
Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe
485 490 495
Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu
500 505 510
Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser
515 520 525
Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly
530 535 540
Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser
545 550 555 560
Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu
565 570 575
Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp
580 585 590
Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr
595 600 605
Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu
610 615 620
Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu
625 630 635 640
Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln
645 650 655
Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr
660 665 670
Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys
675 680 685
Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser
690 695 700
Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp
705 710 715 720
Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala
725 730 735
Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala
740 745 750
Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln
755 760 765
Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln
770 775 780
Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser
785 790 795 800
Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val
805 810 815
Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn
820 825 830
Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe
835 840 845
Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn
850 855 860
Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr
865 870 875 880
Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp Lys
885 890 895
Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr Lys
900 905 910
Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe Ser Ala
915 920 925
Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu Arg Ile Arg
930 935 940
Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln Lys Gly Tyr Glu
945 950 955 960
Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe Ile Asp Phe Tyr
965 970 975
Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp Phe Gly Phe Arg
980 985 990
Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu Phe Tyr Arg Glu
995 1000 1005
Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn Ile Ser Glu
1010 1015 1020
Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr Leu Phe
1025 1030 1035
Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg Pro
1040 1045 1050
Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
1055 1060 1065
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe
1070 1075 1080
Tyr Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys
1085 1090 1095
Glu Ala Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser
1100 1105 1110
Val Phe Glu Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp
1115 1120 1125
Lys Phe Phe Phe His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser
1130 1135 1140
Gly Ala Asn Lys Phe Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu
1145 1150 1155
Lys Ala Asn Asp Val His Ile Leu Ser Ile Ala Arg Gly Glu Arg
1160 1165 1170
His Leu Ala Tyr Tyr Thr Leu Val Asp Gly Lys Gly Asn Ile Ile
1175 1180 1185
Lys Gln Asp Thr Phe Asn Ile Ile Gly Asn Asp Arg Met Lys Thr
1190 1195 1200
Asn Tyr His Asp Lys Leu Ala Ala Ile Glu Lys Asp Arg Asp Ser
1205 1210 1215
Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn Ile Lys Glu Met Lys
1220 1225 1230
Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile Ala Lys Leu Val
1235 1240 1245
Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu Asn Phe Gly
1250 1255 1260
Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val Tyr Gln LysPhe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val Tyr Gln Lys
1265 1270 1275 1265 1270 1275
Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu Val Phe LysLeu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu Val Phe Lys
1280 1285 1290 1280 1285 1290
Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg Ala Tyr GlnAsp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg Ala Tyr Gln
1295 1300 1305 1295 1300 1305
Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly Lys Gln ThrLeu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly Lys Gln Thr
1310 1315 1320 1310 1315 1320
Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser Lys Ile CysGly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser Lys Ile Cys
1325 1330 1335 1325 1330 1335
Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys Tyr Glu SerPro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys Tyr Glu Ser
1340 1345 1350 1340 1345 1350
Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp Lys Ile Cys
1355 1360 1365
Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe Asp Tyr Lys
1370 1375 1380
Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr Ile Ala Ser
1385 1390 1395
Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp Lys Asn His
1400 1405 1410
Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu Leu Glu Lys
1415 1420 1425
Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly Glu Cys Ile
1430 1435 1440
Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe Phe Ala Lys
1445 1450 1455
Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg Asn Ser Lys
1460 1465 1470
Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val Ala Asp Val
1475 1480 1485
Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys Asn Met Pro
1490 1495 1500
Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly Leu Lys Gly
1505 1510 1515
Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu Gly Lys Lys
1520 1525 1530
Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu Phe Val Gln
1535 1540 1545
Asn Arg Asn Asn Thr Arg Asp Ser Gly Gly Ser Thr Asn Leu Ser
1550 1555 1560
Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val Ile Gln Glu
1565 1570 1575
Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile Gly Asn
1580 1585 1590
Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu Ser
1595 1600 1605
Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr
1610 1615 1620
Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys
1625 1630 1635
Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1640 1645 1650
Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1655 1660
<210> 25
<211> 1592
<212> PRT
<213> Artificial Sequence
<220>
<223> dLBCPF1-PBE-2NLS
<400> 25
Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala
1 5 10 15
Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val
20 25 30
Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu
35 40 45
Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn
50 55 60
Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu
65 70 75 80
Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser
85 90 95
Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser
100 105 110
Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His
115 120 125
His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser
130 135 140
Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp
145 150 155 160
Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg
165 170 175
Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile
180 185 190
Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro
195 200 205
Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg
210 215 220
Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu
225 230 235 240
Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Lys Leu
245 250 255
Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys
260 265 270
Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu
275 280 285
Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys
290 295 300
Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser
305 310 315 320
Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys
325 330 335
Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn
340 345 350
Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys
355 360 365
Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu
370 375 380
Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr
385 390 395 400
Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu
405 410 415
Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu
420 425 430
Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile
435 440 445
Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser
450 455 460
Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val
465 470 475 480
Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe
485 490 495
Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn
500 505 510
Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu
515 520 525
Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu
530 535 540
Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu
545 550 555 560
Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu
565 570 575
Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn
580 585 590
Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn
595 600 605
Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys
610 615 620
Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser
625 630 635 640
Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala
645 650 655
Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln
660 665 670
Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe
675 680 685
Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val
690 695 700
Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn
705 710 715 720
Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu
725 730 735
Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val
740 745 750
Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr
755 760 765
Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly
770 775 780
Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg
785 790 795 800
Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys
805 810 815
Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys
820 825 830
Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val
835 840 845
Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile
850 855 860
Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn
865 870 875 880
Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser
885 890 895
Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr
900 905 910
Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln
915 920 925
Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys
930 935 940
Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp
945 950 955 960
Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe
965 970 975
Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly
980 985 990
Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu
995 1000 1005
Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp
1010 1015 1020
Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp
1025 1030 1035
Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile Ala
1040 1045 1050
Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
1055 1060 1065
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile
1070 1075 1080
Ala Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly
1085 1090 1095
Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn
1100 1105 1110
Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu
1115 1120 1125
Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr
1130 1135 1140
Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln
1145 1150 1155
Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val
1160 1165 1170
Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val
1175 1180 1185
Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile
1190 1195 1200
Asp Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala
1205 1210 1215
Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu
1220 1225 1230
Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile
1235 1240 1245
Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr Gly Phe Val
1250 1255 1260
Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp Ser Lys Lys
1265 1270 1275
Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro Glu Glu Asp
1280 1285 1290
Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp
1295 1300 1305
Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg
1310 1315 1320
Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val Phe Asp Trp
1325 1330 1335
Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys
1340 1345 1350
Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys
1355 1360 1365
Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met Ala Leu Met
1370 1375 1380
Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly Arg Thr Asp
1385 1390 1395
Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp Gly Ile Phe
1400 1405 1410
Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala Ile Leu Pro
1415 1420 1425
Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala Arg Lys Val
1430 1435 1440
Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp Glu Lys Leu
1445 1450 1455
Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp Leu Glu Tyr
1460 1465 1470
Ala Gln Thr Ser Val Lys His Thr Arg Asp Ser Gly Gly Ser Thr
1475 1480 1485
Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val
1490 1495 1500
Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val
1505 1510 1515
Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr
1520 1525 1530
Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala
1535 1540 1545
Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly
1550 1555 1560
Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys
1565 1570 1575
Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1580 1585 1590
<210> 26
<211> 1731
<212> PRT
<213> Artificial Sequence
<220>
<223> dFNCPF1-ABE7.10-2NLS
<400> 26
Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu
1 5 10 15
Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg
20 25 30
Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn
35 40 45
Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His Asp
50 55 60
Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val
65 70 75 80
Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu
85 90 95
Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg
100 105 110
Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu
115 120 125
Met Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr
130 135 140
Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe
145 150 155 160
Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser
165 170 175
Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro
180 185 190
Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly
195 200 205
Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala
210 215 220
Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly
225 230 235 240
Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg
245 250 255
Ala Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu
260 265 270
Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr
275 280 285
Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile
290 295 300
His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr
305 310 315 320
Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn
325 330 335
His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala
340 345 350
Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln
355 360 365
Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser
370 375 380
Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser
385 390 395 400
Ser Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Ile Tyr Gln Glu Phe
405 410 415
Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu Leu Ile Pro
420 425 430
Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu Ile Leu Asp
435 440 445
Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln Ile Ile Asp
450 455 460
Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser Val Cys Ile
465 470 475 480
Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe Lys Leu Lys
485 490 495
Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser Ala Lys Asp
500 505 510
Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser Glu Lys Phe
515 520 525
Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys Gly Gln Glu
530 535 540
Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn Gly Ile Glu
545 550 555 560
Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu Ala Leu Glu
565 570 575
Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys Gly Phe His
580 585 590
Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro Thr Ser Ile
595 600 605
Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu Glu Asn Lys
610 615 620
Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala Ile Asn Tyr
625 630 635 640
Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe Asp Ile Asp
645 650 655
Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu Asp Glu Val
660 665 670
Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser Gly Ile Thr
675 680 685
Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly Glu Asn Thr
690 695 700
Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser Gln Gln Ile
705 710 715 720
Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu Phe Lys Gln
725 730 735
Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp Lys Leu Glu
740 745 750
Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr Glu Gln Ile
755 760 765
Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu Thr Leu Ser
770 775 780
Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu Ser Lys Ile
785 790 795 800
Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln Gln Val Phe
805 810 815
Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr Ile Thr Gln
820 825 830
Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys Glu Gln Glu
835 840 845
Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser Leu Glu Thr
850 855 860
Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp Ile Asp Lys
865 870 875 880
Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala Ile Pro Met
885 890 895
Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala Gln Ile Ser
900 905 910
Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln Ala Ser Ala
915 920 925
Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln Thr Asn Asn
930 935 940
Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser Glu Asp Lys
945 950 955 960
Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val Phe Glu Glu
965 970 975
Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn Lys Ile Arg
980 985 990
Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe Lys Leu Asn
995 1000 1005
Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn Lys Glu
1010 1015 1020
Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr Tyr
1025 1030 1035
Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp Lys
1040 1045 1050
Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr
1055 1060 1065
Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe
1070 1075 1080
Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu
1085 1090 1095
Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
1100 1105 1110
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys
1115 1120 1125
Phe Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp
1130 1135 1140
Lys Asp Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser
1145 1150 1155
Ile Asp Glu Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu
1160 1165 1170
Thr Phe Glu Asn Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn
1175 1180 1185
Gln Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe SerGln Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser
1190 1195 1200 1190 1195 1200
Ala Tyr Ser Lys Gly Arg Pro Asn Leu His Thr Leu Tyr Trp LysAla Tyr Ser Lys Gly Arg Pro Asn Leu His Thr Leu Tyr Trp Lys
1205 1210 1215 1205 1210 1215
Ala Leu Phe Asp Glu Arg Asn Leu Gln Asp Val Val Tyr Lys LeuAla Leu Phe Asp Glu Arg Asn Leu Gln Asp Val Val Tyr Lys Leu
1220 1225 1230 1220 1225 1230
Asn Gly Glu Ala Glu Leu Phe Tyr Arg Lys Gln Ser Ile Pro LysAsn Gly Glu Ala Glu Leu Phe Tyr Arg Lys Gln Ser Ile Pro Lys
1235 1240 1245 1235 1240 1245
Lys Ile Thr His Pro Ala Lys Glu Ala Ile Ala Asn Lys Asn LysLys Ile Thr His Pro Ala Lys Glu Ala Ile Ala Asn Lys Asn Lys
1250 1255 1260 1250 1255 1260
Asp Asn Pro Lys Lys Glu Ser Val Phe Glu Tyr Asp Leu Ile LysAsp Asn Pro Lys Lys Glu Ser Val Phe Glu Tyr Asp Leu Ile Lys
1265 1270 1275 1265 1270 1275
Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe His Cys Pro IleAsp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe His Cys Pro Ile
1280 1285 1290 1280 1285 1290
Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe Asn Asp GluThr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe Asn Asp Glu
1295 1300 1305 1295 1300 1305
Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His Ile LeuIle Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His Ile Leu
1310 1315 1320 1310 1315 1320
Ser Ile Ala Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu ValSer Ile Ala Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu Val
1325 1330 1335 1325 1330 1335
Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile IleAsp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
1340 1345 1350 1340 1345 1350
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala AlaGly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala
1355 1360 1365 1355 1360 1365
Ile Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys IleIle Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile
1370 1375 1380 1370 1375 1380
Asn Asn Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val ValAsn Asn Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val
1385 1390 1395 1385 1390 1395
His Glu Ile Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val ValHis Glu Ile Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val
1400 1405 1410 1400 1405 1410
Phe Glu Asp Leu Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val
1415 1420 1425
Glu Lys Gln Val Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys
1430 1435 1440
Leu Asn Tyr Leu Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly
1445 1450 1455
Gly Val Leu Arg Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe
1460 1465 1470
Lys Lys Met Gly Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala
1475 1480 1485
Gly Phe Thr Ser Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln
1490 1495 1500
Leu Tyr Pro Lys Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe
1505 1510 1515
Ser Lys Phe Asp Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe
1520 1525 1530
Glu Phe Ser Phe Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys
1535 1540 1545
Gly Lys Trp Thr Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe
1550 1555 1560
Arg Asn Ser Asp Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr
1565 1570 1575
Pro Thr Lys Glu Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu
1580 1585 1590
Tyr Gly His Gly Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser
1595 1600 1605
Asp Lys Lys Phe Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile
1610 1615 1620
Leu Gln Met Arg Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu
1625 1630 1635
Ile Ser Pro Val Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg
1640 1645 1650
Gln Ala Pro Lys Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala
1655 1660 1665
Tyr His Ile Gly Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys
1670 1675 1680
Asn Asn Gln Glu Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu
1685 1690 1695
Glu Tyr Phe Glu Phe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser
1700 1705 1710
Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys
1715 1720 1725
Arg Lys Val
1730
<210> 27
<211> 1659
<212> PRT
<213> Artificial Sequence
<220>
<223> dLBCPF1-ABE7.10-2NLS
<400> 27
Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu
1 5 10 15
Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg
20 25 30
Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn
35 40 45
Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His Asp
50 55 60
Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val
65 70 75 80
Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu
85 90 95
Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg
100 105 110
Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu
115 120 125
Met Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr
130 135 140
Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe
145 150 155 160
Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser
165 170 175
Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro
180 185 190
Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly
195 200 205
Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala
210 215 220
Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly
225 230 235 240
Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg
245 250 255
Ala Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu
260 265 270
Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr
275 280 285
Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile
290 295 300
His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr
305 310 315 320
Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn
325 330 335
His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala
340 345 350
Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln
355 360 365
Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser
370 375 380
Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser
385 390 395 400
Ser Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Lys Leu Glu Lys Phe
405 410 415
Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro
420 425 430
Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu
435 440 445
Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp
450 455 460
Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu
465 470 475 480
Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr
485 490 495
Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys
500 505 510
Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe
515 520 525
Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys
530 535 540
Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe
545 550 555 560
Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys
565 570 575
Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr
580 585 590
Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys
595 600 605
His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp
610 615 620
Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln
625 630 635 640
Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu
645 650 655
Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn
660 665 670
Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln
675 680 685
Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr
690 695 700
Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn
705 710 715 720
Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn
725 730 735
Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala
740 745 750
Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg
755 760 765
Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala
770 775 780
Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys
785 790 795 800
Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp
805 810 815
Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp
820 825 830
Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp
835 840 845
Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile
850 855 860
Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys
865 870 875 880
Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr
885 890 895
Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile
900 905 910
Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp
915 920 925
Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp
930 935 940
Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser
945 950 955 960
Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln
965 970 975
Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr
980 985 990
Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser
995 1000 1005
Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys
1010 1015 1020
Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu
1025 1030 1035
Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser
1040 1045 1050
Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu
1055 1060 1065
Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu
1070 1075 1080
Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu
1085 1090 1095
Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
1100 1105 1110
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu
1115 1120 1125
His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly
1130 1135 1140
Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala
1145 1150 1155
Ser Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro
1160 1165 1170
Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu
1175 1180 1185
Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr
1190 1195 1200
Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile
1205 1210 1215
Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp
1220 1225 1230
Asn Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu
1235 1240 1245
Tyr Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr
1250 1255 1260
Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys
1265 1270 1275
Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe
1280 1285 1290
Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu
1295 1300 1305
Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys Glu Leu
1310 1315 1320
Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn Ser
1325 1330 1335
Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
1340 1345 1350
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp
1355 1360 1365
Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr
1370 1375 1380
Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln
1385 1390 1395
Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile
1400 1405 1410
Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr
1415 1420 1425
Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile
1430 1435 1440
Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr
1445 1450 1455
Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys
1460 1465 1470
Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys
1475 1480 1485
Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala
1490 1495 1500
Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly
1505 1510 1515
Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr
1520 1525 1530
Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn
1535 1540 1545
Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val
1550 1555 1560
Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala
1565 1570 1575
Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala
1580 1585 1590
Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys
1595 1600 1605
Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser
1610 1615 1620
Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser
1625 1630 1635
Gly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro
1640 1645 1650
Lys Lys Lys Arg Lys Val
1655
<210> 28
<211> 1656
<212> PRT
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X2
<400> 28
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile
165 170 175
Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys
180 185 190
Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys
195 200 205
Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe
210 215 220
Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile
225 230 235 240
Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn
245 250 255
Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys
260 265 270
Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser
275 280 285
Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe
290 295 300
Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys
305 310 315 320
Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile
325 330 335
Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe
340 345 350
Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp
355 360 365
Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp
370 375 380
Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu
385 390 395 400
Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu
405 410 415
Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser
420 425 430
Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys
435 440 445
Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys
450 455 460
Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr
465 470 475 480
Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile
485 490 495
Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr
500 505 510
Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro
515 520 525
Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala
530 535 540
Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys
545 550 555 560
Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly
565 570 575
Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met
580 585 590
Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro
595 600 605
Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly
610 615 620
Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys
625 630 635 640
Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn
645 650 655
Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu
660 665 670
Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys
675 680 685
Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
690 695 700
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His
705 710 715 720
Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile
725 730 735
Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys
740 745 750
Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys
755 760 765
Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr
770 775 780
Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile
785 790 795 800
Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
805 810 815
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Ala
820 825 830
Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly
835 840 845
Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn
850 855 860
Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu
865 870 875 880
Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile
885 890 895
Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys
900 905 910
Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn
915 920 925
Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
930 935 940
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys
945 950 955 960
Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile
965 970 975
Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe
980 985 990
Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr
995 1000 1005
Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp
1010 1015 1020
Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro
1025 1030 1035
Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser
1040 1045 1050
Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr
1055 1060 1065
Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val
1070 1075 1080
Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu
1085 1090 1095
Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala
1100 1105 1110
Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met
1115 1120 1125
Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly
1130 1135 1140
Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp
1145 1150 1155
Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala
1160 1165 1170
Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala
1175 1180 1185
Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu AspArg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp
1190 1195 1200 1190 1195 1200
Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu TrpGlu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp
1205 1210 1215 1205 1210 1215
Leu Glu Tyr Ala Gln Thr Ser Val Lys His Lys Leu Met Pro LysLeu Glu Tyr Ala Gln Thr Ser Val Lys His Lys Leu Met Pro Lys
1220 1225 1230 1220 1225 1230
Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu Phe SerLys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu Phe Ser
1235 1240 1245 1235 1240 1245
His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg AlaHis Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala
1250 1255 1260 1250 1255 1260
Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His AsnTrp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn
1265 1270 1275 1265 1270 1275
Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg HisAsn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His
1280 1285 1290 1280 1285 1290
Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly GlyAsp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly
1295 1300 1305 1295 1300 1305
Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val
1310 1315 1320
Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser
1325 1330 1335
Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly
1340 1345 1350
Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn
1355 1360 1365
His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala
1370 1375 1380
Ala Leu Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys
1385 1390 1395
Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser
1400 1405 1410
Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala
1415 1420 1425
Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ser Glu Val
1430 1435 1440
Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala
1445 1450 1455
Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu
1460 1465 1470
Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala Ile
1475 1480 1485
Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg
1490 1495 1500
Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr
1505 1510 1515
Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met
1520 1525 1530
Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala
1535 1540 1545
Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro
1550 1555 1560
Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp
1565 1570 1575
Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln
1580 1585 1590
Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly
1595 1600 1605
Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser
1610 1615 1620
Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser
1625 1630 1635
Leu Lys Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1640 1645 1650
Lys Lys Lys
1655
<210> 29
<211> 1663
<212> PRT
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X3
<400> 29
Met Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr Asn
1 5 10 15
Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val Gly
20 25 30
Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp Glu
35 40 45
Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg Tyr
50 55 60
Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys Asn
65 70 75 80
Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu Lys
85 90 95
Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu Ile
100 105 110
Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys Lys
115 120 125
Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp Glu
130 135 140
Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr Gly
145 150 155 160
Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser Thr
165 170 175
Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser
180 185 190
Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His Glu
195 200 205
Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu
210 215 220
Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly
225 230 235 240
Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser Gly
245 250 255
Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys
260 265 270
Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu
275 280 285
Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp
290 295 300
Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu
305 310 315 320
Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp
325 330 335
Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser
340 345 350
Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp Lys
355 360 365
Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val Val
370 375 380
Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly
385 390 395 400
Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser
405 410 415
Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile
420 425 430
Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe Val
435 440 445
Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met Lys
450 455 460
Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe
465 470 475 480
Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp
485 490 495
Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr Asp
500 505 510
Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe
515 520 525
Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp
530 535 540
Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr
545 550 555 560
Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile
565 570 575
Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu
580 585 590
Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys Lys
595 600 605
Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys
610 615 620
Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys His
625 630 635 640
Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp
645 650 655
Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp
660 665 670
Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser
675 680 685
Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu Gly
690 695 700
Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser
705 710 715 720
His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe Asp
725 730 735
Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe
740 745 750
Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro Ala
755 760 765
Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr
770 775 780
Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln
785 790 795 800
Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile
805 810 815
Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp Asn
820 825 830
Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu Tyr Ile
835 840 845
Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn
850 855 860
Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His
865 870 875 880
Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn
885 890 895
Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser
900 905 910
Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val
915 920 925
Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys
930 935 940
Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys
945 950 955 960
Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly
965 970 975
Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser
980 985 990
Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr
995 1000 1005
Ser Lys Ile Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr
1010 1015 1020
Lys Tyr Thr Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe
1025 1030 1035
Asp Arg Ile Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala
1040 1045 1050
Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys
1055 1060 1065
Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg
1070 1075 1080
Asn Pro Lys Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu
1085 1090 1095
Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr
1100 1105 1110
Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys
1115 1120 1125
Ala Phe Tyr Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln
1130 1135 1140
Met Arg Asn Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile
1145 1150 1155
Ser Pro Val Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn
1160 1165 1170
Tyr Glu Ala Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala
1175 1180 1185
Asn Gly Ala Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly
1190 1195 1200
Gln Phe Lys Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile
1205 1210 1215
Ala Ile Ser Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val
1220 1225 1230
Lys His Lys Leu Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly
1235 1240 1245
Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala
1250 1255 1260
Leu Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val
1265 1270 1275
Gly Ala Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp
1280 1285 1290
Asn Arg Pro Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile
1295 1300 1305
Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu
1310 1315 1320
Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu Pro Cys Val Met Cys
1325 1330 1335
Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly
1340 1345 1350
Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val
1355 1360 1365
Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly
1370 1375 1380
Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe Arg
1385 1390 1395
Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser
1400 1405 1410
Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr
1415 1420 1425
Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser
1430 1435 1440
Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met
1445 1450 1455
Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu
1460 1465 1470
Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly
1475 1480 1485
Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His
1490 1495 1500
Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn
1505 1510 1515
Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys
1520 1525 1530
Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val
1535 1540 1545
Val Phe Gly Val Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu
1550 1555 1560
Met Asp Val Leu His Tyr Pro Gly Met Asn His Arg Val Glu Ile
1565 1570 1575
Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Cys Tyr
1580 1585 1590
Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys Lys Ala
1595 1600 1605
Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly
1610 1615 1620
Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser
1625 1630 1635
Gly Gly Ser Ser Gly Gly Ser Leu Lys Lys Arg Pro Ala Ala Thr
1640 1645 1650
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys
1655 1660
<210> 30
<211> 1526
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial Sequence
<400> 30
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660
cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720
ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780
gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840
caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900
ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960
tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020
taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080
atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140
cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200
ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500
tctgcaggtc gaagcttgaa gcaaac 1526
Claims (20)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711403490 | 2017-12-22 | | |
CN201711403490X | 2017-12-22 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109957569A (en) | 2019-07-02 |
CN109957569B (en) | 2022-10-25 |
Family
ID=66992485
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN201811578853.8A | Active | CN109957569B (en) | 2017-12-22 | 2018-12-21 | Base editing system and method based on CPF1 protein |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN109957569B (en) |
AR (1) | AR114014A1 (en) |
WO (1) | WO2019120310A1 (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110964741A (en) * | 2019-12-20 | 2020-04-07 | 北京市农林科学院 | Nuclear localization signal FNB and application thereof in improving base editing efficiency |
CN111019967A (en) * | 2019-11-27 | 2020-04-17 | 南京农业大学 | Application of GmU3-19g-1 and GmU6-16g-1 promoters in soybean polygene editing system |
CN111518794A (en) * | 2020-04-13 | 2020-08-11 | 中山大学 | Preparation and application of induced mutant protein based on activation-induced cytidine deaminase |
WO2021032155A1 (en) * | 2019-08-20 | 2021-02-25 | 中国科学院遗传与发育生物学研究所 | Base editing system and use method therefor |
CN112851776A (en) * | 2020-04-20 | 2021-05-28 | 中国科学院天津工业生物技术研究所 | Gene site-directed mutagenesis method and stress resistance breeding application thereof |
CN113005141A (en) * | 2021-01-05 | 2021-06-22 | 温州医科大学 | Gene editing tool composed of high-activity mutant, preparation method and method for repairing congenital retinoschisis disease pathogenic gene |
CN114045302A (en) * | 2021-11-12 | 2022-02-15 | 三亚中国农业科学院国家南繁研究院 | Single-base editing vector and construction and application thereof |
CN114375335A (en) * | 2019-07-19 | 2022-04-19 | 成对植物服务股份有限公司 | Optimized protein linkers and methods of use |
CN114667149A (en) * | 2019-09-09 | 2022-06-24 | 比姆医疗股份有限公司 | Novel nucleobase editor and methods of use |
CN114761547A (en) * | 2019-09-26 | 2022-07-15 | 先正达农作物保护股份公司 | Methods and compositions for DNA base editing |
CN114829595A (en) * | 2019-10-17 | 2022-07-29 | 成对植物服务股份有限公司 | Variants of CAS12A nuclease, methods of making and uses thereof |
CN114835818A (en) * | 2022-03-17 | 2022-08-02 | 江南大学 | A gene editing fusion protein, its constructed adenine base editor and its application |
CN114867852A (en) * | 2019-10-30 | 2022-08-05 | 成对植物服务股份有限公司 | V-type CRISPR-CAS base editor and method of use thereof |
WO2022223010A1 (en) * | 2021-04-21 | 2022-10-27 | 浙江大学 | Negative-strand rna viral vector and plant genome editing method without transformation |
CN115704015A (en) * | 2021-08-12 | 2023-02-17 | 清华大学 | Targeted mutagenesis system based on adenine and cytosine dual base editor |
CN116286734A (en) * | 2022-11-29 | 2023-06-23 | 武汉大学 | Mutants of wild-type LbCas12a protein and its use in SNP detection |
CN116376948A (en) * | 2022-07-25 | 2023-07-04 | 广州医科大学 | Plasmid vector and method for preparing MS2 phage-like particles displaying foreign proteins |
CN116497067A (en) * | 2019-02-13 | 2023-07-28 | 比姆医疗股份有限公司 | Compositions and methods for treating hemoglobinopathies |
CN116751799A (en) * | 2023-06-14 | 2023-09-15 | 江南大学 | Multi-site double-base editor and application thereof |
CN116836962A (en) * | 2023-06-28 | 2023-10-03 | 微光基因(苏州)有限公司 | Engineered adenosine deaminase and base editor |
WO2023207607A1 (en) * | 2022-04-29 | 2023-11-02 | 北京大学 | Deaminase mutant, composition, and method for modifying mitochondrial dna |
US12133884B2 (en) | 2018-05-11 | 2024-11-05 | Beam Therapeutics Inc. | Methods of substituting pathogenic amino acids using programmable base editor systems |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SG11202106977PA (en) * | 2018-12-27 | 2021-07-29 | Lifeedit Therapeutics Inc | Polypeptides useful for gene editing and methods of use |
CN110551752B (en) * | 2019-08-30 | 2023-03-14 | 北京市农林科学院 | xCas9n-epBE base editing system and application thereof in genome base replacement |
US20230257761A1 (en) | 2019-09-12 | 2023-08-17 | Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences | Herbicide resistant plant |
EP4185699A1 (en) | 2020-07-21 | 2023-05-31 | Pairwise Plants Services, Inc. | Optimized protein linkers and methods of use |
CN114317596B (en) * | 2020-09-30 | 2024-01-16 | 北京市农林科学院 | A method for mutating A to G in target sequence of plant genome |
CN114317518B (en) * | 2020-09-30 | 2024-01-12 | 北京市农林科学院 | Application of SpRYn-CBE base editing system in base replacement in plant genomes |
CN114317589B (en) * | 2020-09-30 | 2024-01-16 | 北京市农林科学院 | Application of SpRYn-ABE base editing system in plant genome base substitution |
CN112430622A (en) * | 2020-10-26 | 2021-03-02 | 扬州大学 | FokI and dCpf1 fusion protein expression vector and site-directed gene editing method mediated by same |
EP4499819A1 (en) * | 2022-03-30 | 2025-02-05 | Basf Agricultural Solutions Seed Us Llc | Optimized base editors |
CN114686456B (en) * | 2022-05-10 | 2023-02-17 | 中山大学 | Base editing system based on bimolecular deaminase complementation and application thereof |
CN115820691B (en) * | 2022-07-25 | 2023-08-22 | 安徽农业大学 | LbCPf1 variant-based rice base editing system and application |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017184786A1 (en) * | 2016-04-19 | 2017-10-26 | The Broad Institute Inc. | Cpf1 complexes with reduced indel activity |
WO2017184768A1 (en) * | 2016-04-19 | 2017-10-26 | The Broad Institute Inc. | Novel crispr enzymes and systems |
WO2017189308A1 (en) * | 2016-04-19 | 2017-11-02 | The Broad Institute Inc. | Novel crispr enzymes and systems |
CN110214183A (en) * | 2016-08-03 | 2019-09-06 | 哈佛大学的校长及成员们 | Adenosine nucleobase editors and uses thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3374494A4 (en) * | 2015-11-11 | 2019-05-01 | Coda Biotherapeutics, Inc. | Crispr compositions and methods of using the same for gene therapy |
EP3405570A1 (en) * | 2016-01-22 | 2018-11-28 | The Broad Institute, Inc. | Crystal structure of crispr cpf1 |
-
2018
- 2018-12-21 AR ARP180103817A patent/AR114014A1/en unknown
- 2018-12-21 CN CN201811578853.8A patent/CN109957569B/en active Active
- 2018-12-24 WO PCT/CN2018/123158 patent/WO2019120310A1/en active Application Filing
Non-Patent Citations (1)
Title |
---|
Nicole M. Gaudelli et al.: "Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage", Nature * |
Cited By (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12133884B2 (en) | 2018-05-11 | 2024-11-05 | Beam Therapeutics Inc. | Methods of substituting pathogenic amino acids using programmable base editor systems |
US12016908B2 (en) | 2019-02-13 | 2024-06-25 | Beam Therapeutics Inc. | Compositions and methods for treating hemoglobinopathies |
CN116497067B (en) * | 2019-02-13 | 2025-01-24 | Beam Therapeutics Inc. | Compositions and methods for treating hemoglobinopathies |
CN116497067A (en) * | 2019-02-13 | 2023-07-28 | Beam Therapeutics Inc. | Compositions and methods for treating hemoglobinopathies |
CN114375335A (en) * | 2019-07-19 | 2022-04-19 | Pairwise Plants Services, Inc. | Optimized protein linkers and methods of use |
WO2021032155A1 (en) * | 2019-08-20 | 2021-02-25 | Institute of Genetics and Developmental Biology, Chinese Academy of Sciences | Base editing system and use method therefor |
CN114945670B (en) * | 2019-08-20 | 2024-12-10 | Suzhou Qihe Biotechnology Co., Ltd. | A base editing system and method of use thereof |
CN114945670A (en) * | 2019-08-20 | 2022-08-26 | Shanghai Blue Cross Medical Science Research Institute | Base editing system and use method thereof |
CN114667149A (en) * | 2019-09-09 | 2022-06-24 | Beam Therapeutics Inc. | Novel nucleobase editor and methods of use |
CN114761547B (en) * | 2019-09-26 | 2025-02-18 | Syngenta Crop Protection AG | Methods and compositions for DNA base editing |
CN114761547A (en) * | 2019-09-26 | 2022-07-15 | Syngenta Crop Protection AG | Methods and compositions for DNA base editing |
CN114829595A (en) * | 2019-10-17 | 2022-07-29 | Pairwise Plants Services, Inc. | Variants of CAS12A nuclease, methods of making and uses thereof |
CN114867852A (en) * | 2019-10-30 | 2022-08-05 | Pairwise Plants Services, Inc. | V-type CRISPR-CAS base editor and method of use thereof |
CN111019967A (en) * | 2019-11-27 | 2020-04-17 | Nanjing Agricultural University | Application of GmU3-19g-1 and GmU6-16g-1 promoters in soybean polygene editing system |
CN110964741B (en) * | 2019-12-20 | 2022-03-01 | Beijing Academy of Agriculture and Forestry Sciences | A nuclear localization signal FNB and its application in improving base editing efficiency |
CN110964741A (en) * | 2019-12-20 | 2020-04-07 | Beijing Academy of Agriculture and Forestry Sciences | Nuclear localization signal FNB and application thereof in improving base editing efficiency |
WO2021208501A1 (en) * | 2020-04-13 | 2021-10-21 | Sun Yat-sen University | Preparation and use of induced mutant protein based on activated induced cytidine deaminase |
CN111518794B (en) * | 2020-04-13 | 2023-05-16 | Sun Yat-sen University | Preparation and application of inducible mutant protein based on activation-inducible cytidine deaminase |
CN111518794A (en) * | 2020-04-13 | 2020-08-11 | Sun Yat-sen University | Preparation and application of induced mutant protein based on activation-induced cytidine deaminase |
CN112851776A (en) * | 2020-04-20 | 2021-05-28 | Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences | Gene site-directed mutagenesis method and stress resistance breeding application thereof |
CN113005141A (en) * | 2021-01-05 | 2021-06-22 | Wenzhou Medical University | Gene editing tool composed of high-activity mutant, preparation method and method for repairing congenital retinoschisis disease pathogenic gene |
WO2022223010A1 (en) * | 2021-04-21 | 2022-10-27 | Zhejiang University | Negative-strand rna viral vector and plant genome editing method without transformation |
CN115704015A (en) * | 2021-08-12 | 2023-02-17 | Tsinghua University | Targeted mutagenesis system based on adenine and cytosine dual base editor |
CN114045302A (en) * | 2021-11-12 | 2022-02-15 | National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences | Single-base editing vector and construction and application thereof |
CN114835818A (en) * | 2022-03-17 | 2022-08-02 | Jiangnan University | A gene editing fusion protein, its constructed adenine base editor and its application |
CN114835818B (en) * | 2022-03-17 | 2024-03-22 | Jiangnan University | Gene editing fusion protein, adenine base editor constructed by same and application thereof |
WO2023207607A1 (en) * | 2022-04-29 | 2023-11-02 | Peking University | Deaminase mutant, composition, and method for modifying mitochondrial dna |
CN116376948A (en) * | 2022-07-25 | 2023-07-04 | Guangzhou Medical University | Plasmid vector and preparation method of MS2 phage similar particles for displaying exogenous proteins |
CN116376948B (en) * | 2022-07-25 | 2023-12-15 | Guangzhou Medical University | Plasmid vector and preparation method of MS2 phage similar particles for displaying exogenous proteins |
CN116286734B (en) * | 2022-11-29 | 2024-04-02 | Wuhan University | Mutants of wild-type LbCas12a protein and their uses for SNP detection |
CN116286734A (en) * | 2022-11-29 | 2023-06-23 | Wuhan University | Mutants of wild-type LbCas12a protein and its use in SNP detection |
CN116751799A (en) * | 2023-06-14 | 2023-09-15 | Jiangnan University | Multi-site double-base editor and application thereof |
CN116751799B (en) * | 2023-06-14 | 2024-01-26 | Jiangnan University | Multi-site double-base editor and application thereof |
CN116836962B (en) * | 2023-06-28 | 2024-04-05 | 微光基因(苏州)有限公司 | Engineered adenosine deaminase and base editors |
CN116836962A (en) * | 2023-06-28 | 2023-10-03 | 微光基因(苏州)有限公司 | Engineered adenosine deaminase and base editor |
Also Published As
Publication number | Publication date |
---|---|
AR114014A1 (en) | 2020-07-08 |
CN109957569B (en) | 2022-10-25 |
WO2019120310A1 (en) | 2019-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109957569B (en) | Base editing system and method based on CPF1 protein | |
CN110157727A (en) | Alkaloid edit methods | |
KR102061438B1 (en) | A method for converting monocot genome sequences in which a nucleic acid base in a targeting DNA sequence is specifically converted, and a molecular complex used therein. | |
AU2022201319B2 (en) | Viral vectors encoding recombinant fviii variants with increased expression for gene therapy of hemophilia a | |
KR102589523B1 (en) | Cell lines and methods for producing polyketides in yeast | |
DK2324120T3 (en) | Manipulating SNF1 protein kinase OF REVISION OF OIL CONTENT IN OLEAGINOUS ORGANISMS | |
BRPI0806354A2 (en) | transgender oilseeds, seeds, oils, food or food analogues, medicinal food products or medicinal food analogues, pharmaceuticals, beverage formulas for babies, nutritional supplements, pet food, aquaculture feed, animal feed, whole seed products , mixed oil products, partially processed products, by-products and by-products | |
AU2016354550A1 (en) | Viral vectors encoding recombinant fviii variants with increased expression for gene therapy of hemophilia a | |
CN106544361A (en) | Mammalian cell expression vector, expression system, preparation method and application | |
CN108913718A (en) | A kind of preparation method and application of the CAR-T cell of targeting EGFR v III | |
CN112608940B (en) | Construction method and application of animal model of congenital cataract disease | |
CN114540424A (en) | IGFBP7 muscle tissue specific knockout mouse animal model and construction method thereof | |
KR20210034013A (en) | Genetic treatment of hemophilia using viral vectors encoding recombinant FVIII variants with increased expression | |
CN113943737A (en) | Application of chicken CTGF gene in inhibiting differentiation of chicken preadipocytes | |
KR102009270B1 (en) | Recombinant foot-and-mouth disease virus expressing protective antigen of type O-Thi60 | |
CN114395020B (en) | Application of GmRALF1 protein in promoting phosphorus element absorption of plants | |
CN111100874B (en) | Targeting vector and method and application of integrating exogenous gene into mouse DC-SIGN exon 7 site to construct BAC clone | |
CN114134170A (en) | Preparation method and application of HA tag fusion expression vector | |
CN101300358A (en) | Astaxanthine biosynthesis in eukaryotes | |
KR102247462B1 (en) | Recombinant Lactic acid bacteria having enhanced Productivity for Riboflavin | |
KR101246884B1 (en) | Transformed Pseudomonas Aeruginosa for Over-expression of Fatty Acid Biosynthesis Pathway and Method of Preparing the Same | |
JP2003274972A (en) | Flowering control gene and its use | |
CN107345210A (en) | The disinsection engineering bacteria strain and its application of one plant of furocoumarin genetic transformation | |
CN116555061A (en) | A mutant of Saccharomyces cerevisiae and its application in the preparation of alcohol | |
KR20120094788A (en) | The developed pseudomonas aeruginosa strains for improving the content of fatty acids and manufacturing method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | Effective date of registration: 20220623. Address after: Room D340, F3, building 2, No. 2250, Pudong South Road, Pudong New Area, Shanghai 200120. Applicant after: Shanghai Blue Cross Medical Science Research Institute. Address before: No. 2, No. 1, Beichen West Road, Beichen, Beijing. Applicant before: INSTITUTE OF GENETICS AND DEVELOPMENTAL BIOLOGY, CHINESE ACADEMY OF SCIENCES |
TA01 | Transfer of patent application right | Effective date of registration: 20220921. Address after: Unit E598, 5th Floor, Lecheng Plaza, Phase II, Biomedical Industrial Park, No. 218, Sangtian Street, Suzhou Industrial Park, Suzhou Area, China (Jiangsu) Pilot Free Trade Zone, Suzhou City, Jiangsu Province, 215127. Applicant after: Suzhou Qihe Biotechnology Co.,Ltd. Address before: Room D340, F3, building 2, No. 2250, Pudong South Road, Pudong New Area, Shanghai 200120. Applicant before: Shanghai Blue Cross Medical Science Research Institute |
GR01 | Patent grant | ||