CN110050063B

CN110050063B - Acid alpha-glucosidase variants and their uses

Info

Publication number: CN110050063B
Application number: CN201780055024.9A
Authority: CN
Inventors: 费德里克·敏果兹; 基斯配·罗兹提
Original assignee: Evry Wald Esson University; Institut National de la Sante et de la Recherche Medicale INSERM; Genethon
Current assignee: Evry Wald Esson University; Institut National de la Sante et de la Recherche Medicale INSERM; Genethon
Priority date: 2016-09-12
Filing date: 2017-09-12
Publication date: 2023-11-03
Anticipated expiration: 2037-09-12
Also published as: PE20250258A1; MX2019002842A; US11339406B2; CN110050063A; CA3035868A1; PE20191203A1; RU2019110773A3; KR102617502B1; US20220275396A1; JP7245155B2; CN117384890A; AU2017322376B2; CL2019000609A1; WO2018046774A1; ZA201902141B; SG11201901494UA; MX2024004420A; US20190390225A1; AU2017322376A1; BR112019004785A2

Abstract

本发明涉及酸性α‑葡萄糖苷酶的变体及其用途。The present invention relates to variants of acid alpha-glucosidases and their uses.

Description

Acid alpha-glucosidase variants and their uses

本发明涉及酸性α-葡萄糖苷酶(GAA)的变体及其用途。The present invention relates to variants of acid alpha-glucosidase (GAA) and their uses.

庞贝氏病(Pompe disease)，也被称为II型糖原贮积病(GSD)和酸性麦芽糖酶缺乏症，是一种由溶酶体酶酸性α-葡萄糖苷酶(GAA)的缺乏引起的常染色体隐性代谢性肌病。GAA是一种外切-1,4和1,6-α-葡萄糖苷酶，其在溶酶体中将糖原水解成葡萄糖。GAA的缺乏导致溶酶体中糖原积累，并引起呼吸肌、心肌和骨骼肌的渐进性损伤。所述疾病的范围从通常在1-2岁之前致死的快速发展的婴儿期过程，到在儿童和成年人中引起显著发病和早期死亡的发展更慢的非均相过程。Hirschhorn RR，遗传病的代谢和分子基础(The Metabolicand Molecular Bases of Inherited Disease),3:3389-3420(2001,McGraw-Hill)；Vander Ploeg和Reuser，Lancet 372:1342-1351(2008)。Pompe disease, also known as glycogen storage disease type II (GSD) and acid maltase deficiency, is caused by a deficiency of the lysosomal enzyme acid alpha-glucosidase (GAA) Autosomal recessive metabolic myopathies. GAA is an exo-1,4 and 1,6-alpha-glucosidase that hydrolyzes glycogen to glucose in lysosomes. GAA deficiency leads to glycogen accumulation in lysosomes and causes progressive damage to respiratory, cardiac, and skeletal muscles. The disease ranges from a rapidly developing course in infancy that is usually fatal before 1-2 years of age, to a more slowly developing heterogeneous process causing significant morbidity and early death in children and adults. Hirschhorn RR, The Metabolic and Molecular Bases of Inherited Disease, 3:3389-3420 (2001, McGraw-Hill); Vander Ploeg and Reuser, Lancet 372:1342-1351 (2008).

当前用于治疗庞贝氏病的人类疗法包括给药重组人类GAA，也被称为酶替代疗法(ERT)。已证实ERT对严重的婴儿期GSD II有效。然而，酶疗法的益处受到需要频繁输注和产生针对重组hGAA的抑制性抗体的限制(Amalfitano,A.等，(2001)Genet.In Med.3:132-138)。此外，ERT不能高效地校正整个身体，可能是由于所述蛋白质在外周静脉递送后的不良生物分配、几种组织摄入的不足和高免疫原性的组合。Current human therapies for the treatment of Pompe disease include the administration of recombinant human GAA, also known as enzyme replacement therapy (ERT). ERT has been shown to be effective in severe GSD II in infancy. However, the benefits of enzyme therapy are limited by the need for frequent infusions and the production of inhibitory antibodies against recombinant hGAA (Amalfitano, A. et al., (2001) Genet. In Med. 3:132-138). Furthermore, ERT is not efficient in correcting the entire body, possibly due to a combination of poor biodistribution of the protein after peripheral intravenous delivery, insufficient uptake by several tissues, and high immunogenicity.

作为ERT的可替选或附属方案，研究了基因治疗方法治疗GSD-II的可行性。(Amalfitano,A.等，(1999)Proc.Natl.Acad.Sci.USA96:8861-8866；Ding,E.等，(2002)Mol.Ther.5:436-446；Fraites,T.J.等，(2002)Mol.Ther.5:571-578；Tsujino,S.等，(1998)Hum.Gene Ther.9:1609-1616)。然而，用于校正遗传缺陷的肌肉定向基因转移必须面对疾病的系统性本质和转入基因的肌肉表达与其他组织相比倾向于免疫原性更高这一事实的限制。As an alternative or adjunct to ERT, the feasibility of gene therapy for GSD-II has been studied. (Amalfitano, A. et al., (1999) Proc. Natl. Acad. Sci. USA 96: 8861-8866; Ding, E. et al., (2002) Mol. Ther. 5: 436-446; Fraites, T. J. et al., (2002) ) Mol. Ther. 5:571-578; Tsujino, S. et al. (1998) Hum. Gene Ther. 9: 1609-1616). However, muscle-directed gene transfer to correct genetic defects must face limitations due to the systemic nature of the disease and the fact that muscle expression of the transferred gene tends to be more immunogenic compared with other tissues.

Doerfler等，2016描述了编码人类密码子优化的GAA的两种构建物的组合给药，一种构建物在肝特异性启动子的控制之下，另一种在肌肉特异性启动子的控制之下。肝特异性启动子驱动的GAA表达被用于在Gaa^-/-小鼠模型中促进对GAA的免疫耐受性，而肌肉特异性启动子驱动的GAA表达在疗法所靶向的一部分组织中提供治疗性蛋白的表达。然而，这种策略不完全令人满意，因为它需要使用多个构建物，并且它不产生GAA的全身性表达。Doerfler et al., 2016 described the combined administration of two constructs encoding human codon-optimized GAA, one under the control of a liver-specific promoter and the other under the control of a muscle-specific promoter. Down. Liver-specific promoter-driven GAA expression was used to promote immune tolerance to GAA in a Gaa ^−/− mouse model, whereas muscle-specific promoter-driven GAA expression was provided in a subset of tissues targeted by the therapy Expression of therapeutic proteins. However, this strategy is not entirely satisfactory as it requires the use of multiple constructs and it does not result in systemic expression of GAA.

过去已提出使用修饰的GAA蛋白来改进溶酶体贮积病治疗。具体来说，申请WO2004064750和Sun等，2006，公开了一种包含可操作连接到GAA的信号肽的嵌合GAA多肽，作为提高所述蛋白质向分泌途径的靶向的一种方式。The use of modified GAA proteins has been proposed in the past to improve lysosomal storage disease treatment. Specifically, application WO2004064750 and Sun et al., 2006, disclose a chimeric GAA polypeptide containing a signal peptide operably linked to GAA as a way to increase targeting of the protein to the secretory pathway.

然而，患者可用的疗法并不完全令人满意，并且在本领域中仍需要改进的GAA多肽和GAA生产。具体来说，对使用GAA的治疗的长期功效、高水平的GAA生产、对产生的GAA多肽的提高的免疫耐受性和GAA被需要它的细胞和组织的增加的摄取，仍存在着需求。此外，在WO2004064750和Sun等，2006中，其中公开的嵌合GAA多肽的组织分布不完全令人满意。因此，对允许在如果不是全部也是大多数目标组织中校正糖原积累的全面治疗性GAA多肽，仍存在需求。However, the therapies available to patients are not entirely satisfactory, and there is still a need in the art for improved GAA polypeptides and GAA production. Specifically, there remains a need for long-term efficacy of treatments using GAA, high levels of GAA production, improved immune tolerance to the produced GAA polypeptides, and increased uptake of GAA by cells and tissues that require it. Furthermore, in WO2004064750 and Sun et al., 2006, the tissue distribution of the chimeric GAA polypeptides disclosed therein is not entirely satisfactory. Therefore, there remains a need for comprehensive therapeutic GAA polypeptides that allow correction of glycogen accumulation in most, if not all, target tissues.

发明内容Contents of the invention

本发明涉及GAA变体，其与野生型GAA蛋白相比以更高的水平表达和分泌，引发对全身范围的糖原病理性积累的改进的校正，并引起对GAA的免疫耐受性的诱导。The present invention relates to GAA variants that are expressed and secreted at higher levels than wild-type GAA protein, triggering improved correction of systemic glycogen pathological accumulation, and causing the induction of immune tolerance to GAA .

根据一个方面，本发明涉及一种截短的GAA多肽，其包含从亲本GAA多肽的N-端末端缺失至少一个氨基酸，其中所述亲本多肽对应于不含信号肽的GAA多肽的前体形式。在特定实施方式中，所述截短的GAA多肽与所述亲本GAA多肽相比在其N-端末端处缺失了至少2个，特别是至少2个、特别是至少3个、特别是至少4个、特别是至少5个、特别是至少6个、特别是至少7个、特别是至少8个连续氨基酸。在另一个实施方式中，所述截短的GAA多肽与所述亲本GAA多肽相比在其N-端末端处缺失了至多75个，特别是至多70个、特别是至多60个、特别是至多55个、特别是至多50个、特别是至多47个、特别是至多46个、特别是至多45个、特别是至多44个、特别是至多43个连续氨基酸。在其他特定实施方式中，所述截短的GAA多肽与所述亲本GAA多肽相比在其N-端末端处缺失了至多47个，特别是至多46个、特别是至多45个、特别是至多44个、特别是至多43个连续氨基酸。在另一个特定实施方式中，所述截短的GAA多肽与所述亲本GAA多肽相比在其N-端末端处缺失了1至75个，特别是1至47个、特别是1至46个、特别是1至45个、特别是1至44个、特别是1至43个连续氨基酸。在另一个实施方式中，所述截短的GAA多肽与所述亲本GAA多肽相比在其N-端末端处缺失了2至43个，特别是3至43个、特别是4至43个、特别是5至43个、特别是6至43个、特别是7至43个、特别是8至43个连续氨基酸。在更特定实施方式中，所述截短的GAA多肽与亲本GAA多肽相比在其N-端末端处缺失了6、7、8、9、10、27、28、29、30、31、40、41、42、43、44、45、46或47个连续氨基酸，特别是与亲本GAA多肽相比在其N-端末端处截短了7、8、9、28、29、30、41、42、43或44个、更特别地8、29、42或43个连续氨基酸。在其他特定实施方式中，所述亲本多肽是人类GAA(hGAA)，特别是具有在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的氨基酸序列的hGAA。According to one aspect, the invention relates to a truncated GAA polypeptide comprising a deletion of at least one amino acid from the N-terminal terminus of a parent GAA polypeptide, wherein said parent polypeptide corresponds to a precursor form of a GAA polypeptide that does not contain a signal peptide. In a specific embodiment, the truncated GAA polypeptide has at least 2, especially at least 2, especially at least 3, especially at least 4, deleted at its N-terminal end compared to the parent GAA polypeptide. , especially at least 5, especially at least 6, especially at least 7, especially at least 8 consecutive amino acids. In another embodiment, the truncated GAA polypeptide has at most 75, in particular at most 70, in particular at most 60, especially at most 60, at its N-terminal end deleted compared to said parent GAA polypeptide. 55, in particular at most 50, in particular at most 47, in particular at most 46, in particular at most 45, in particular at most 44, in particular at most 43, consecutive amino acids. In other specific embodiments, said truncated GAA polypeptide has at most 47, in particular at most 46, in particular at most 45, especially at most 45, at its N-terminal end deleted compared to said parent GAA polypeptide. 44, especially up to 43 consecutive amino acids. In another specific embodiment, the truncated GAA polypeptide has from 1 to 75, in particular from 1 to 47, especially from 1 to 46, deleted at its N-terminal end compared to the parent GAA polypeptide. , especially 1 to 45, especially 1 to 44, especially 1 to 43 consecutive amino acids. In another embodiment, the truncated GAA polypeptide has 2 to 43, in particular 3 to 43, especially 4 to 43, deleted at its N-terminal end compared to the parent GAA polypeptide. In particular 5 to 43, especially 6 to 43, especially 7 to 43, especially 8 to 43 consecutive amino acids. In a more specific embodiment, the truncated GAA polypeptide has 6, 7, 8, 9, 10, 27, 28, 29, 30, 31, 40 deleted at its N-terminal end compared to the parent GAA polypeptide. , 41, 42, 43, 44, 45, 46 or 47 consecutive amino acids, especially 7, 8, 9, 28, 29, 30, 41, truncated at its N-terminal end compared with the parent GAA polypeptide. 42, 43 or 44, more particularly 8, 29, 42 or 43 consecutive amino acids. In other specific embodiments, the parent polypeptide is human GAA (hGAA), particularly hGAA having the amino acid sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 33, particularly SEQ ID NO: 1.

在特定实施方式中，本发明的截短的GAA多肽具有在SEQ ID NO：27、SEQ ID NO：28、SEQ ID NO：34和SEQ ID NO：35中示出的序列。In specific embodiments, the truncated GAA polypeptides of the invention have the sequences set forth in SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:34, and SEQ ID NO:35.

此外，本发明的截短的GAA多肽还可以包含融合到其N-端末端的信号肽，特别是选自SEQ ID NO：3至7的信号肽，特别是SEQ ID NO：3的信号肽。Furthermore, the truncated GAA polypeptide of the invention may also comprise a signal peptide fused to its N-terminal end, in particular a signal peptide selected from the group consisting of SEQ ID NO: 3 to 7, in particular the signal peptide of SEQ ID NO: 3.

另一方面，本发明涉及一种核酸分子，其编码如上所述的截短的GAA多肽，所述截短的GAA多肽任选地通过其N-端末端融合到信号肽。在某些实施方式中，所述核酸分子具有被优化以在体内、特别是在人类对象中提高所述截短的GAA多肽的表达和/或提高对所述截短的GAA多肽的免疫耐受性的核苷酸序列。In another aspect, the invention relates to a nucleic acid molecule encoding a truncated GAA polypeptide as described above, optionally fused via its N-terminal end to a signal peptide. In certain embodiments, the nucleic acid molecules have properties optimized to increase expression of the truncated GAA polypeptide and/or increase immune tolerance to the truncated GAA polypeptide in vivo, particularly in human subjects. sexual nucleotide sequence.

另一方面，本发明涉及一种核酸构建物，其包含可操作连接到一种或多种调控序列例如启动子、内含子、多腺苷化信号和/或增强子(例如顺式调控模块或CRM)的本发明的核酸分子。在特定实施方式中，所述启动子是优选地选自α-1抗胰蛋白酶启动子(hAAT)、甲状腺素运载蛋白启动子、白蛋白启动子和甲状腺素结合性球蛋白(TBG)启动子的肝特异性启动子。在另一个特定实施方式中，所述启动子是肌肉特异性启动子例如Spc5-12、MCK和肌间线蛋白启动子。在另一个实施方式中，所述启动子是遍在启动子例如CMV、CAG和PGK启动子。所述核酸构建物还可以任选地包含内含子，特别是选自人类β球蛋白b2(或HBB2)内含子、FIX内含子、鸡β-球蛋白内含子和SV40内含子的内含子，其中所述内含子任选地是修饰的内含子例如SEQ ID NO：17的修饰的HBB2内含子、SEQ ID NO：19的修饰的FIX内含子或SEQID NO：21的修饰的鸡β-球蛋白内含子。在本发明的核酸构建物的特定实施方式中，所述核酸构建物优选地以下述顺序包含：增强子；内含子；启动子，特别是肝特异性启动子；编码所述GAA蛋白的核酸序列；和多腺苷化信号，所述构建物优选地以下述顺序包含：ApoE控制区；HBB2内含子，特别是修饰的HBB2内含子；hAAT启动子；编码所述截短的GAA多肽的核酸序列；和牛生长激素多腺苷化信号。在特定实施方式中，所述核酸构建物更特别地包含SEQ IDNO：22至26任一者的核苷酸序列。In another aspect, the invention relates to a nucleic acid construct comprising a nucleic acid construct operably linked to one or more regulatory sequences such as a promoter, an intron, a polyadenylation signal and/or an enhancer (e.g., a cis-regulatory module or CRM) of the nucleic acid molecule of the present invention. In a specific embodiment, the promoter is preferably selected from the group consisting of alpha-1 antitrypsin promoter (hAAT), transthyretin promoter, albumin promoter and thyroxine binding globulin (TBG) promoter liver-specific promoter. In another specific embodiment, the promoter is a muscle-specific promoter such as the Spc5-12, MCK and desmin promoters. In another embodiment, the promoter is a ubiquitous promoter such as the CMV, CAG and PGK promoters. The nucleic acid construct may also optionally comprise an intron, in particular selected from the group consisting of human β-globin b2 (or HBB2) intron, FIX intron, chicken β-globin intron and SV40 intron. An intron, wherein the intron is optionally a modified intron such as the modified HBB2 intron of SEQ ID NO: 17, the modified FIX intron of SEQ ID NO: 19 or the modified FIX intron of SEQ ID NO: Modified chicken beta-globin intron of 21. In a specific embodiment of the nucleic acid construct of the invention, the nucleic acid construct preferably comprises in the following order: an enhancer; an intron; a promoter, in particular a liver-specific promoter; a nucleic acid encoding the GAA protein sequence; and a polyadenylation signal, said construct preferably comprising in the following order: an ApoE control region; an HBB2 intron, in particular a modified HBB2 intron; an hAAT promoter; encoding said truncated GAA polypeptide Nucleic acid sequence of; and bovine growth hormone polyadenylation signal. In a specific embodiment, the nucleic acid construct more specifically comprises the nucleotide sequence of any one of SEQ ID NOs: 22 to 26.

另一方面，本发明涉及一种载体，其包含本文中公开的核酸分子或核酸构建物。本发明的载体可以特别是病毒载体，优选为反转录病毒载体例如慢病毒载体，或AAV载体。优选地，所述载体是单链或双链自身互补的AAV载体，优选为具有AAV来源的衣壳例如AAV1、AAV2、变体AAV2、AAV3、变体AAV3、AAV3B、变体AAV3B、AAV4、AAV5、AAV6、变体AAV6、AAV7、AAV8、AAV9、AAV10例如AAVcy10和AAVrh10、AAVrh74、AAVdj、AAV-Anc80、AAV-LK03、AAV2i8和猪AAV例如AAVpo4和AAVpo6衣壳或具有嵌合衣壳的AAV载体。在特定实施方式中，所述载体是具有AAV8、AAV9、AAVrh74或AAV2i8衣壳，特别是AAV8、AAV9或AAVrh74衣壳，更特别是AAV8衣壳的AAV载体。In another aspect, the invention relates to a vector comprising a nucleic acid molecule or nucleic acid construct disclosed herein. The vector of the present invention may in particular be a viral vector, preferably a retroviral vector such as a lentiviral vector, or an AAV vector. Preferably, the vector is a single-stranded or double-stranded self-complementary AAV vector, preferably a capsid having an AAV origin such as AAV1, AAV2, variant AAV2, AAV3, variant AAV3, AAV3B, variant AAV3B, AAV4, AAV5 , AAV6, variant AAV6, AAV7, AAV8, AAV9, AAV10 such as AAVcy10 and AAVrh10, AAVrh74, AAVdj, AAV-Anc80, AAV-LK03, AAV2i8 and porcine AAV such as AAVpo4 and AAVpo6 capsids or AAV vectors with chimeric capsids . In a specific embodiment, the vector is an AAV vector having an AAV8, AAV9, AAVrh74 or AAV2i8 capsid, particularly an AAV8, AAV9 or AAVrh74 capsid, more particularly an AAV8 capsid.

另一方面，本发明提供了一种细胞，其用本发明的核酸分子、核酸构建物或载体转化。更特别地，所述细胞是肝细胞或肌细胞。In another aspect, the invention provides a cell transformed with a nucleic acid molecule, nucleic acid construct or vector of the invention. More specifically, the cells are hepatocytes or myocytes.

在特定情况下，本发明提供了一种药物组合物，其在可药用载体中包含本发明的截短的GAA多肽、核酸分子、核酸构建物、载体或细胞。In certain instances, the invention provides a pharmaceutical composition comprising a truncated GAA polypeptide, nucleic acid molecule, nucleic acid construct, vector or cell of the invention in a pharmaceutically acceptable carrier.

本发明还涉及本发明的截短的GAA多肽、核酸分子、核酸构建物、载体或细胞，其用作药物。The invention also relates to a truncated GAA polypeptide, nucleic acid molecule, nucleic acid construct, vector or cell of the invention for use as a medicament.

本发明还提供了本发明的截短的GAA多肽、核酸分子、核酸构建物、载体或细胞，其用于治疗糖原贮积病的方法中。在特定实施方式中，所述糖原贮积病是GSDI、GSDII、GSDIII、GSDIV、GSDV、GSDVI、GSDVII、GSDVIII或心脏的致死性先天性糖原贮积病。在更特定实施方式中，所述糖原贮积病选自GSDI、GSDII和GSDIII，更特别地选自GSDII和GSDIII。在甚至更特别的实施方式中，所述糖原贮积病是GSDII。The invention also provides a truncated GAA polypeptide, nucleic acid molecule, nucleic acid construct, vector or cell of the invention for use in a method of treating glycogen storage disease. In specific embodiments, the glycogen storage disease is GSDI, GSDII, GSDIII, GSDIV, GSDV, GSDVI, GSDVII, GSDVIII, or a lethal congenital glycogen storage disease of the heart. In a more specific embodiment, the glycogen storage disease is selected from GSDI, GSDII and GSDIII, more particularly from GSDII and GSDIII. In an even more specific embodiment, the glycogen storage disease is GSDII.

附图说明Description of drawings

图1.hGAA的部分的缺失提高了它在体外的分泌。图A.使用Lipofectamine^TM，将人类肝细胞瘤细胞(Huh7)用表达绿色荧光蛋白(GFP)的对照质粒或表达野生型hGAA(hGAA)或按照两种不同算法进行序列优化的hGAA(分别为hGAAco1和co2)的质粒转染。不同的hGAA构建物含有野生型或人类α-1-抗胰蛋白酶信号肽(sp2)。截短的hGAA通过在信号肽后缺失8个氨基酸来获得(Δ8)。在转染后48小时，通过产荧光酶测定法测量培养基中的hGAA活性，并针对在“材料和方法”中指明的反应产物的标准曲线评估GAA活性。柱状图显示了源自于三个不同实验的分泌的hGAA水平的平均值±SE。统计分析通过配对t-检验来进行，在柱状图中报告了获得的p-值(*＝正如所指示的p<0.05)。图B.使用Lipofectamine，将人类肝细胞瘤细胞(Huh7)用表达GFP的对照质粒或表达具有野生型或胰凝乳蛋白酶原B1信号肽(sp7)的hGAAco1的质粒转染。hGAA蛋白已通过在信号肽后移除8或42个氨基酸来截短(分别为Δ8和Δ42)。转染后48小时，通过如上所指示的产荧光测定法测量培养基中的hGAA活性。柱状图显示了源自于三个不同实验的分泌的hGAA水平的平均值±SE。统计分析通过ANOVA来进行(*＝正如所指示的p<0.05)。Figure 1. Partial deletion of hGAA improves its secretion in vitro. Figure A. Using Lipofectamine ^TM , human hepatoma cells (Huh7) were treated with a control plasmid expressing green fluorescent protein (GFP) or expressing wild-type hGAA (hGAA) or hGAA sequence-optimized according to two different algorithms (hGAAco1, respectively). and co2) plasmid transfection. Different hGAA constructs contain wild-type or human α-1-antitrypsin signal peptide (sp2). Truncated hGAA is obtained by deleting 8 amino acids after the signal peptide (Δ8). At 48 hours after transfection, hGAA activity in the culture medium was measured by luciferase assay and GAA activity was assessed against a standard curve of reaction products specified in “Materials and Methods”. The bar graph shows the mean ± SE of secreted hGAA levels derived from three different experiments. Statistical analysis was performed by paired t-test and the obtained p-values are reported in histograms (*=p<0.05 as indicated). Panel B. Human hepatoma cells (Huh7) were transfected with a control plasmid expressing GFP or a plasmid expressing hGAAco1 with wild-type or chymotrypsinogen B1 signal peptide (sp7) using Lipofectamine. The hGAA protein has been truncated by removing 8 or 42 amino acids after the signal peptide (Δ8 and Δ42, respectively). 48 hours after transfection, hGAA activity in the culture medium was measured by the fluorogenic assay as indicated above. The bar graph shows the mean ± SE of secreted hGAA levels derived from three different experiments. Statistical analysis was performed by ANOVA (*=p<0.05 as indicated).

图2.在庞贝氏病小鼠模型中hGAA的部分的缺失提高了它在血流中的分泌。将3月龄GAA^-/-小鼠(n＝4-5只小鼠/组)用PBS或2E12vg/kg的AAV8载体静脉内注射，所述载体在肝特异性启动子的转录控制之下表达序列优化的hGAA(hGAAco1)。hGAA的野生型信号肽已用胰凝乳蛋白酶原信号肽(sp7)替换，并且hGAA的序列作为全长本源序列使用或通过在信号肽后移除8或42的氨基酸来截短(分别为Δ8和Δ42)。在注射后一个月，将小鼠放血，并使用产荧光测定法测量血清中的hGAA活性。统计分析通过ANOVA来进行(*＝正如所指示的p<0.05)。Figure 2. Partial deletion of hGAA enhances its secretion in the bloodstream in a mouse model of Pompe disease. Three-month-old GAA ^−/− mice (n = 4-5 mice/group) were injected intravenously with PBS or 2E12 vg/kg of AAV8 vector expressed under the transcriptional control of a liver-specific promoter. Sequence optimized hGAA (hGAAco1). The wild-type signal peptide of hGAA has been replaced with the chymotrypsinogen signal peptide (sp7), and the sequence of hGAA is used as the full-length native sequence or truncated by removing 8 or 42 amino acids after the signal peptide (Δ8, respectively and Δ42). One month after injection, the mice were bled, and hGAA activity in the serum was measured using a fluorogenic assay. Statistical analysis was performed by ANOVA (*=p<0.05 as indicated).

图3.信号肽提高hGAA的分泌。通过Lipofectamine^TM，将人类肝细胞瘤细胞(Huh7)用对照质粒(GFP)、表达野生型hGAA(被称为sp1)的质粒或表达与信号肽6-8(sp6-8)融合的序列优化的Δ8hGAA(hGAAco)的质粒转染。在转染后48小时，通过产荧光酶测定法测量培养基中的hGAA活性，并针对4-甲基伞形酮的标准曲线评估GAA活性。柱状图显示了源自于三个不同实验的分泌的hGAA水平的平均值±SE。统计分析通过ANOVA来进行(*＝相对于模拟转染的细胞p<0.05)。Figure 3. Signal peptide enhances secretion of hGAA. Human hepatoma cells (Huh7) were treated with a control plasmid (GFP), a plasmid expressing wild-type hGAA (termed sp1), or a sequence optimized for expression fused to signal peptide 6-8 (sp6-8) by Lipofectamine ^™ . Plasmid transfection of Δ8hGAA (hGAAco). At 48 hours after transfection, hGAA activity in the culture medium was measured by luciferase assay and GAA activity was assessed against a standard curve for 4-methylumbelliferone. The bar graph shows the mean ± SE of secreted hGAA levels derived from three different experiments. Statistical analysis was performed by ANOVA (*=p<0.05 vs. mock-transfected cells).

图4.截短的Δ8hGAA在庞贝氏病的小鼠模型中高效校正糖原积累。将4月龄野生型(WT)和GAA^-/-小鼠(n＝4-5只小鼠/组)用PBS或6E11vg/kg的AAV8载体静脉内注射，所述载体在人类α-1-抗胰蛋白酶启动子的转录控制之下表达与信号肽1、2、7和8(sp1、2、7、8)融合的序列优化的Δ8hGAA(hGAAco)。图A.柱状图示出了在载体注射后3个月通过产荧光测定法在血液中测量到的hGAA活性。统计分析通过ANOVA来进行，在所述柱状图中报道了相对于PBS处理的GAA-/-动物获得的p-值(*＝p<0.05)。图B-D.心脏、隔膜和股四头肌中糖原含量的生物化学校正。将4月龄GAA^-/-小鼠如上所述进行处理。在注射后三个月，将小鼠处死并评估糖原含量。柱状图示出了在心脏(图B)、隔膜(图C)和股四头肌(图D)中测量到的糖原含量，其被表示为在糖原的酶消化后释放的葡萄糖。统计分析通过ANOVA来进行(*＝相对于PBS注射的GAA-/-小鼠p<0.05)。Figure 4. Truncated Δ8hGAA efficiently corrects glycogen accumulation in a mouse model of Pompe disease. Four-month-old wild-type (WT) and GAA ^−/− mice (n = 4-5 mice/group) were injected intravenously with PBS or 6E11 vg/kg of AAV8 vector, which is expressed in human α-1- Sequence-optimized Δ8hGAA (hGAAco) fused to signal peptides 1, 2, 7, and 8 (sp1, 2, 7, 8) was expressed under the transcriptional control of the antitrypsin promoter. Panel A. Bar graph showing hGAA activity measured in blood by fluorometry 3 months after vector injection. Statistical analysis was performed by ANOVA, and the p-values obtained relative to PBS-treated GAA-/- animals are reported in the bar graphs (*=p<0.05). Figure BD. Biochemical correction of glycogen content in the heart, diaphragm, and quadriceps muscles. Four-month-old GAA ^−/− mice were processed as described above. Three months after injection, mice were sacrificed and glycogen content was assessed. The bar graph shows the glycogen content measured in the heart (Panel B), diaphragm (Panel C) and quadriceps muscle (Panel D), expressed as glucose released after enzymatic digestion of glycogen. Statistical analysis was performed by ANOVA (*=p<0.05 vs. PBS-injected GAA-/- mice).

图5.在庞贝氏病小鼠模型中高度分泌的hGAA降低了体液应答。将4月龄GAA-/-小鼠用PBS或两种不同剂量(5E11或2E12vg/kg)的AAV8载体静脉内注射，所述载体包含在人类α-1-抗胰蛋白酶启动子的转录控制之下编码融合到信号肽1(co)、信号肽2(sp2-Δ8-co)、信号肽7(sp7-Δ8-co)或信号肽8(sp8-Δ8-co)的Δ8hGAA的优化序列。在注射后1个月，通过ELISA分析血清中抗hGAA抗体的存在。使用纯化的小鼠IgG作为标准品进行定量。统计分析通过ANOVA和Dunnett’s事后检验来进行(*＝p<0.01)。Figure 5. Highly secreted hGAA reduces humoral responses in a mouse model of Pompe disease. Four-month-old GAA-/- mice were injected intravenously with PBS or two different doses (5E11 or 2E12 vg/kg) of the AAV8 vector contained under the transcriptional control of the human α-1-antitrypsin promoter. The following is an optimized sequence encoding Δ8hGAA fused to signal peptide 1 (co), signal peptide 2 (sp2-Δ8-co), signal peptide 7 (sp7-Δ8-co), or signal peptide 8 (sp8-Δ8-co). One month after injection, sera were analyzed for the presence of anti-hGAA antibodies by ELISA. Quantification was performed using purified mouse IgG as standard. Statistical analysis was performed by ANOVA and Dunnett's post hoc test (*=p<0.01).

图6.在NHP中AAV8-hAAT-sp7-Δ8-hGAAco1注射引起hGAA在血液中的高效分泌和在肌肉中的摄取。在第0天用2E12vg/kg的AAV8-hAAT-sp7-Δ8-hGAAco1注射两只食蟹猴(Macaca Fascicularis)。图A对在载体给药之前12天和之后30天从两只猴获得的血清进行的hGAA蛋白质印迹。在左侧指示了与样品平行运行的分子量标志物的条带位置。图B在载体注入后3个月，将猴处死并收获组织，用于hGAA摄取的生物化学评估。对从二头肌和隔膜获得的组织提取物进行hGAA蛋白质印迹。使用抗微管蛋白抗体作为载样对照。在左侧指示了与样品平行运行的分子量标志物的条带位置。Figure 6. AAV8-hAAT-sp7-Δ8-hGAAco1 injection causes efficient secretion of hGAA in blood and uptake in muscle in NHP. Two cynomolgus monkeys (Macaca Fascicularis) were injected on day 0 with 2E12 vg/kg of AAV8-hAAT-sp7-Δ8-hGAAco1. Panel A Western blot of hGAA performed on sera obtained from two monkeys 12 days before and 30 days after vehicle administration. Band positions for molecular weight markers run parallel to the sample are indicated on the left. Panel B Three months after vector injection, monkeys were sacrificed and tissues were harvested for biochemical assessment of hGAA uptake. hGAA Western blotting was performed on tissue extracts obtained from biceps and diaphragm. Anti-tubulin antibody was used as a loading control. Band positions for molecular weight markers run parallel to the sample are indicated on the left.

图7.用hGAA表达载体注射的GDE-/-动物的肝脏中糖原含量的生物化学校正。将3月龄野生型(WT)或GDE-/-小鼠用PBS或表达在人类α-1-抗胰蛋白酶启动子的转录控制之下并与信号肽7融合的密码子优化的hGAA的AAV8载体(AAV8-hAAT-sp7--Δ8-hGAAco1)以1E11或1E12vg/小鼠的剂量静脉内注射。柱状图示出了在肝脏中测量到的糖原含量，其被表示为在糖原的酶消化后释放的葡萄糖。统计分析通过ANOVA来进行(*＝相对于PBS注射的GDE-/-小鼠p<0.05，§＝相对于PBS注射的WT动物p<0.05)。Figure 7. Biochemical calibration of glycogen content in livers of GDE-/- animals injected with hGAA expression vector. Three-month-old wild-type (WT) or GDE-/- mice were treated with PBS or AAV8 expressing codon-optimized hGAA under the transcriptional control of the human α-1-antitrypsin promoter and fused to signal peptide 7. The vector (AAV8-hAAT-sp7--Δ8-hGAAco1) was injected intravenously at a dose of 1E11 or 1E12 vg/mouse. The bar graph shows the glycogen content measured in the liver, expressed as glucose released after enzymatic digestion of glycogen. Statistical analysis was performed by ANOVA (*=p<0.05 vs. PBS-injected GDE-/- mice, §=p<0.05 vs. PBS-injected WT animals).

图8.在用编码不同GAA变体的质粒转染的细胞的培养基中的GAA活性。在质粒转染后24小时(图A)和48小时(图B)测量HuH7细胞的培养基中的GAA活性，所述质粒包含编码与本源GAA sp1信号肽组合的本源GAA(co)或编码包括与异源sp7信号肽组合的本源GAA的工程化GAA(sp7-co)的优化序列。评估了在sp7信号肽之后的GAA编码序列中的不同缺失的影响(sp7-Δ8-co，sp7-Δ29-co，sp7-Δ42-co，sp7-Δ43-co，sp7-Δ47-co，sp7-Δ62-co)。使用编码eGFP的质粒作为阴性对照。统计分析通过单向ANOVA和Tukey事后检验来进行。条中的散列标志(#)示出了相对于co的统计学显著差异；tau符号(τ)示出了相对于sp7-Δ8-co、sp7-Δ29-co、sp7-Δ42-co、sp7-Δ43-co的统计学显著差异。数据是两个独立实验的平均值±SD。除了使用不同符号的情况之外，^*p<0.05，^**p<0.01，^***p<0.001，^****p<0.0001。Figure 8. GAA activity in the culture medium of cells transfected with plasmids encoding different GAA variants. GAA activity was measured in the culture medium of HuH7 cells at 24 hours (Panel A) and 48 hours (Panel B) after transfection with a plasmid containing a plasmid encoding native GAA(co) in combination with a native GAA sp1 signal peptide or encoding including Optimized sequence of engineered GAA(sp7-co) of native GAA combined with heterologous sp7 signal peptide. The effect of different deletions in the GAA coding sequence following the sp7 signal peptide was evaluated (sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7-Δ43-co, sp7-Δ47-co, sp7- Δ62-co). A plasmid encoding eGFP was used as a negative control. Statistical analysis was performed by one-way ANOVA and Tukey's post hoc test. Hash symbols (#) in bars show statistically significant differences relative to co; tau symbols (τ) show statistically significant differences relative to sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7 -Statistically significant difference for Δ43-co. Data are means ± SD of two independent experiments. ^* p<0.05, ^** p<0.01, ^*** p<0.001, ^**** p<0.0001, except when different signs are used.

图9.不同GAA变体的细胞内GAA活性。在质粒转染后48小时测量HuH7细胞的裂解液中的GAA活性，所述质粒包含编码与本源GAA sp1信号肽组合的本源GAA(co)或编码包括与异源sp7信号肽组合的本源GAA的工程化GAA(sp7-co)的优化序列。评估了在所述信号肽之后的GAA编码序列中的不同缺失的影响(sp7-Δ8-co，sp7-Δ29-co，sp7-Δ42-co，sp7-Δ43-co，sp7-Δ47-co，sp7-Δ62-co)。使用编码eGFP的质粒作为阴性对照。统计分析通过单向ANOVA和Tukey事后检验来进行。Tau符号(τ)示出了相对于sp7-co、sp7-Δ8-co、sp7-Δ29-co、sp7-Δ42-co、sp7-Δ43-co的统计学显著差异。数据是两个独立实验的平均值±SD。除了使用不同符号的情况之外，^*p<0.05，^**p<0.01，^***p<0.001，^****p<0.0001。Figure 9. Intracellular GAA activity of different GAA variants. GAA activity was measured in lysates of HuH7 cells 48 hours after transfection with plasmids containing plasmids encoding native GAA(co) in combination with native GAA sp1 signal peptide or encoding native GAA in combination with heterologous sp7 signal peptide. Optimized sequence of engineered GAA(sp7-co). The effect of different deletions in the GAA coding sequence following the signal peptide was evaluated (sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7-Δ43-co, sp7-Δ47-co, sp7 -Δ62-co). A plasmid encoding eGFP was used as a negative control. Statistical analysis was performed by one-way ANOVA and Tukey's post hoc test. Tau symbols (τ) show statistically significant differences relative to sp7-co, sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7-Δ43-co. Data are means ± SD of two independent experiments. ^* p<0.05, ^** p<0.01, ^*** p<0.001, ^**** p<0.0001, except when different signs are used.

图10.使用与sp6或sp8信号肽组合的Δ8缺失，在细胞培养基中提高的GAA活性。在质粒转染后48小时测量了HuH7细胞的培养基(图A)和裂解液(图B)中的GAA活性，所述质粒包含编码与本源GAA sp1信号肽组合的本源GAA(co)或编码包括与异源sp6或sp8信号肽组合的本源GAA的工程化GAA(sp6-co或sp8-co)的优化序列。评估了在信号肽之后的GAA编码序列中的8个氨基酸的缺失的影响(sp6-Δ8-co、sp8-Δ8-co)。使用编码eGFP的质粒作为阴性对照。统计分析通过单向ANOVA和Tukey事后检验来进行。条中的星号示出了相对于co的统计学显著差异。数据是两个独立实验的平均值±SD。除了使用不同符号的情况之外，^*p<0.05，^**p<0.01，^***p<0.001，^****p<0.0001。Figure 10. Improved GAA activity in cell culture medium using Δ8 deletion in combination with sp6 or sp8 signal peptide. GAA activity was measured in the culture medium (Panel A) and lysate (Panel B) of HuH7 cells 48 hours after transfection with a plasmid containing a plasmid encoding native GAA (co) in combination with native GAA sp1 signal peptide or encoding Optimized sequences of engineered GAA (sp6-co or sp8-co) including native GAA combined with heterologous sp6 or sp8 signal peptide. The effect of deletion of 8 amino acids in the GAA coding sequence following the signal peptide was evaluated (sp6-Δ8-co, sp8-Δ8-co). A plasmid encoding eGFP was used as a negative control. Statistical analysis was performed by one-way ANOVA and Tukey's post hoc test. Asterisks in bars show statistically significant differences relative to co. Data are means ± SD of two independent experiments. ^* p<0.05, ^** p<0.01, ^*** p<0.001, ^**** p<0.0001, except when different signs are used.

发明详述Detailed description of the invention

本发明涉及一种截短的GAA多肽，编码这种截短的GAA多肽的核酸分子，包含所述核酸的核酸构建物，包含所述核酸构建物的载体，包含所述核酸分子或构建物或载体的细胞，以及包含本发明的多肽、核酸分子、核酸构建物、载体或细胞的药物组合物。发明人已令人吃惊地显示，本发明的GAA的截短形式极大提高GAA分泌并同时降低它的免疫原性。The present invention relates to a truncated GAA polypeptide, a nucleic acid molecule encoding such a truncated GAA polypeptide, a nucleic acid construct comprising the nucleic acid, a vector comprising the nucleic acid construct, a nucleic acid molecule or construct comprising the nucleic acid molecule or cells of the vector, and pharmaceutical compositions comprising the polypeptides, nucleic acid molecules, nucleic acid constructs, vectors or cells of the invention. The inventors have surprisingly shown that the truncated form of GAA of the present invention greatly increases GAA secretion while simultaneously reducing its immunogenicity.

溶酶体酸性α-葡萄糖苷酶或“GAA”(E.C.3.2.1.20)(1,4-α-D-葡聚糖葡萄糖水解酶)是一种外切-1,4-α-D-葡萄糖苷酶，其水解寡糖的α-1,4和α-1,6键两者，以释放葡萄糖。GAA的缺乏引起II型糖原贮积病(GSDII)，也被称为庞贝氏病(尽管这个术语以前是指这种疾病的婴儿期发作形式)。它催化糖原的完全降解，并在分支点处减慢。在17号染色体上28kb的人类酸性α-葡萄糖苷酶基因编码3.6kb mRNA，其产生952个氨基酸的多肽(Hoefsloot等，(1988)EMBO J.7:1697；Martiniuk等，(1990)DNA and Cell Biology 9:85)。所述酶在内质网中接受与翻译同时的N-连接糖基化。它被合成为110-kDa的前体形式，其经过大量的糖基化修饰、磷酸化并经过蛋白水解加工，通过大约90-kDa的内体中间体，成熟为最终的溶酶体76和67kDa形式(Hoefsloot，(1988)EMBO J.7:1697；Hoefsloot等，(1990)Biochem.J.272:485；Wisselaar等，(1993)J.Biol.Chem.268:2223；Hermans等，(1993)Biochem.J.289:681)。Lysosomal acid alpha-glucosidase or "GAA" (E.C.3.2.1.20) (1,4-alpha-D-glucan glucohydrolase) is an exo-1,4-alpha-D-glucose Glycodase, which hydrolyzes both the α-1,4 and α-1,6 linkages of oligosaccharides to release glucose. A deficiency in GAA causes glycogen storage disease type II (GSDII), also known as Pompe disease (although this term previously referred to the infantile-onset form of the disease). It catalyzes the complete degradation of glycogen and slows it down at the branch point. The 28 kb human acid alpha-glucosidase gene on chromosome 17 encodes a 3.6 kb mRNA that produces a 952 amino acid polypeptide (Hoefsloot et al., (1988) EMBO J. 7:1697; Martiniuk et al., (1990) DNA and Cell Biology 9:85). The enzyme undergoes N-linked glycosylation in the endoplasmic reticulum concurrent with translation. It is synthesized as a 110-kDa precursor form, which undergoes extensive glycosylation modification, phosphorylation, and undergoes proteolytic processing through an approximately 90-kDa endosomal intermediate to mature into the final lysosomal 76 and 67 kDa Form (Hoefsloot, (1988) EMBO J. 7:1697; Hoefsloot et al., (1990) Biochem. J. 272:485; Wisselaar et al., (1993) J. Biol. Chem. 268:2223; Hermans et al., (1993) Biochem. J. 289:681).

在患有GSD II的患者中，酸性α-葡萄糖苷酶的缺乏引起糖原在溶酶体中大量积累，破坏细胞功能(Hirschhorn,R.和Reuser,A.J.，(2001)，在《遗传病的代谢和分子基础》(The Metabolic and Molecular Basis for Inherited Disease)，Scriver,C.R.等主编，第3389-3419页(McGraw-Hill，New York)中)。在最常见的婴儿期形式中，患者表现出渐进性肌肉变性和心肌病，并在2岁之前死亡。在青少年和成年人发作形式中，存在严重衰弱。In patients with GSD II, a deficiency of acid alpha-glucosidase causes glycogen to accumulate in lysosomes, disrupting cellular function (Hirschhorn, R. and Reuser, A.J., (2001), in "Genetic Diseases" "The Metabolic and Molecular Basis for Inherited Disease", edited by Scriver, C.R. et al., pp. 3389-3419 (McGraw-Hill, New York). In the most common infantile form, patients exhibit progressive muscle degeneration and cardiomyopathy and die before the age of 2 years. In adolescent and adult-onset forms, there is severe debilitation.

此外，患有其他GSD的患者也可能从优化形式的GAA的给药获益。例如，已显示(Sun等，(2013)Mol Genet Metab 108(2):145；WO2010/005565)GAA的给药在来自于III型糖原贮积病(GSD III)患者的原代成肌细胞中降低糖原。Additionally, patients with other GSDs may also benefit from administration of optimized forms of GAA. For example, it has been shown (Sun et al., (2013) Mol Genet Metab 108(2):145; WO2010/005565) that administration of GAA in primary myoblasts from patients with glycogen storage disease type III (GSD III) Moderately lower glycogen.

特别地，在本发明的情形中，“GAA的前体形式”是GAA多肽的包含其天然信号肽的形式。例如，SEQ ID NO：2的序列是人类GAA(hGAA)的前体形式。在SEQ ID NO：2中，1-27位氨基酸残基对应于所述hGAA多肽的信号肽。这个hGAA的信号肽的序列也显示在SEQ ID NO：4中。In particular, in the context of the present invention, a "precursor form of GAA" is a form of a GAA polypeptide that contains its native signal peptide. For example, the sequence of SEQ ID NO: 2 is the precursor form of human GAA (hGAA). In SEQ ID NO: 2, amino acid residues 1-27 correspond to the signal peptide of the hGAA polypeptide. The sequence of this hGAA signal peptide is also shown in SEQ ID NO:4.

在本发明的情形中，本发明的截短的GAA多肽源自于亲本GAA多肽。根据本发明，“亲本GAA多肽”可以是如上所定义的功能性前体GAA序列，但不含其信号肽。例如，参考典型的野生型人类GAA多肽，完整的野生型GAA多肽(即GAA的前体形式)显示在SEQ ID NO：2或SEQ ID NO：30中并具有信号肽(对应于SEQ ID NO：2或SEQ ID NO：30的1-27位氨基酸)，而充当这些野生型人类GAA多肽的截短的GAA形式的基础的亲本GAA多肽分别显示在SEQ IDNO：1和SEQ ID NO：33中，并且没有信号肽。在这个实例中，对应于SEQ ID NO：2的28-952位氨基酸和SEQ ID NO：30的28-952位氨基酸的后者，被称为亲本GAA多肽。In the context of the present invention, the truncated GAA polypeptides of the invention are derived from the parent GAA polypeptide. According to the present invention, a "parent GAA polypeptide" may be a functional precursor GAA sequence as defined above, but without its signal peptide. For example, with reference to a typical wild-type human GAA polypeptide, the complete wild-type GAA polypeptide (i.e., the precursor form of GAA) is shown in SEQ ID NO: 2 or SEQ ID NO: 30 and has a signal peptide (corresponding to SEQ ID NO: 2 or amino acids 1-27 of SEQ ID NO: 30), whereas the parent GAA polypeptides that serve as the basis for the truncated GAA forms of these wild-type human GAA polypeptides are shown in SEQ ID NO: 1 and SEQ ID NO: 33, respectively, And there is no signal peptide. In this example, the latter corresponding to amino acids 28-952 of SEQ ID NO:2 and amino acids 28-952 of SEQ ID NO:30 are referred to as the parent GAA polypeptide.

根据本发明，本发明的截短的GAA多肽是功能性GAA多肽，即它具有野生型GAA多肽的功能。正如上文所定义的，野生型GAA的功能是水解寡糖和多糖、更特别是糖原的α-1,4和α-1,6链两者，以释放出葡萄糖。由本发明的核酸编码的功能性GAA蛋白与SEQ ID NO：1或SEQ ID NO：33的野生型GAA多肽相比，可以具有至少50％、60％、70％、80％、90％、95％、99％或至少100％的对糖原的水解活性。由本发明的核酸编码的GAA蛋白的活性甚至可能为SEQ ID NO：1或SEQ ID NO：33的野生型GAA蛋白的活性的超过100％，例如超过110％、120％、130％、140％或甚至超过150％。According to the present invention, the truncated GAA polypeptide of the invention is a functional GAA polypeptide, that is, it has the functions of a wild-type GAA polypeptide. As defined above, the function of wild-type GAA is to hydrolyze both oligosaccharides and polysaccharides, more particularly both the alpha-1,4 and alpha-1,6 chains of glycogen, to release glucose. The functional GAA protein encoded by the nucleic acid of the present invention can have at least 50%, 60%, 70%, 80%, 90%, 95% of the wild-type GAA polypeptide of SEQ ID NO: 1 or SEQ ID NO: 33. , 99% or at least 100% glycogen hydrolytic activity. The activity of the GAA protein encoded by the nucleic acid of the invention may even be more than 100% of the activity of the wild-type GAA protein of SEQ ID NO: 1 or SEQ ID NO: 33, for example more than 110%, 120%, 130%, 140% or Even more than 150%.

所述亲本GAA多肽的氨基酸序列或其编码序列可以源自于任何来源，包括鸟类和哺乳动物物种。当在本文中使用时，术语“鸟类”包括但不限于鸡、鸭、鹅、鹌鹑、火鸡和野鸡。当在本文中使用时，术语“哺乳动物”包括但不限于人类、猿猴和其他非人类灵长动物、牛科动物、绵羊、山羊、马、猫科动物、犬科动物、兔形目动物等。在本发明的实施方式中，所述亲本GAA多肽是人类、小鼠或鹌鹑，特别是人类的GAA多肽。The amino acid sequence of the parent GAA polypeptide or its coding sequence can be derived from any source, including avian and mammalian species. As used herein, the term "birds" includes, but is not limited to, chickens, ducks, geese, quail, turkeys, and pheasants. As used herein, the term "mammal" includes, but is not limited to, humans, apes and other non-human primates, bovines, sheep, goats, horses, felines, canines, lagomorphs, and the like. . In an embodiment of the invention, the parent GAA polypeptide is a human, mouse or quail, particularly a human GAA polypeptide.

此外，所述亲本GAA多肽可以是GAA多肽的功能性变体，其与GAA多肽相比包含一个或多个氨基酸修饰例如氨基酸插入、缺失和/或替换。例如，所述亲本多肽可以是人类GAA多肽例如SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1的多肽的功能性衍生物，与该人类GAA多肽具有至少80、85、90、95、96、97、98或至少99％的序列同一性。例如，除了上文中定义的截短之外，GAA多肽的功能性变体与所述亲本多肽例如在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本GAA多肽相比可以具有0至50个之间、0至30个之间、0至20个之间、0至15个之间、0至10个之间或0至5个之间的氨基酸变化。特别地，所述亲本GAA多肽可以由具有在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的氨基酸序列的人类GAA多肽构成。Furthermore, the parent GAA polypeptide may be a functional variant of the GAA polypeptide that contains one or more amino acid modifications such as amino acid insertions, deletions and/or substitutions compared to the GAA polypeptide. For example, the parent polypeptide may be a functional derivative of a human GAA polypeptide such as SEQ ID NO: 1 or SEQ ID NO: 33, especially a polypeptide of SEQ ID NO: 1, that shares at least 80, 85, 90, 95, 96, 97, 98 or at least 99% sequence identity. For example, in addition to the truncations defined above, functional variants of a GAA polypeptide are identical to said parent polypeptide, for example the parent set forth in SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1 GAA polypeptides may have between 0 and 50, between 0 and 30, between 0 and 20, between 0 and 15, between 0 and 10, or between 0 and 5 amino acid changes. In particular, the parent GAA polypeptide may consist of a human GAA polypeptide having the amino acid sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1.

术语“同一性”及其偏差当指称多肽时，意味着当两个被比较的多肽序列中的一个位置被同一氨基酸占据时(例如如果两个多肽的每一者中的一个位置被亮氨酸占据)，则所述多肽在该位置处具有同一性。两个多肽之间的同一性百分数是所述两个序列共有的匹配位置的数目除以被比较的位置的数目X 100的函数。例如，如果在两个多肽中10个位置中的6个匹配，则所述两个序列是60％一致的。通常，在将两个序列对齐以给出最大同一性时做出比较。本领域技术人员已知的各种不同的生物信息学工具可用于比对核酸序列，例如BLAST或FASTA。The term "identity" and its deviations when referring to polypeptides means that when a position in the two polypeptide sequences being compared is occupied by the same amino acid (for example if a position in each of the two polypeptides is occupied by a leucine occupies), then the polypeptide has identity at that position. The percent identity between two polypeptides is a function of the number of matching positions common to the two sequences divided by the number of positions compared X 100. For example, if 6 out of 10 positions in two polypeptides match, the two sequences are 60% identical. Typically, comparisons are made when two sequences are aligned to give maximum identity. A variety of different bioinformatics tools known to those skilled in the art can be used to align nucleic acid sequences, such as BLAST or FASTA.

所述亲本GAA多肽也可以是GAA变体，例如由Kunita等，(1997)Biochemica etBiophysica Acta 1362:269所描述的GAA II；由Hirschhorn,R.和Reuser,A.J.(2001)在《遗传病的代谢和分子基础》(The Metabolic and Molecular Basis for InheritedDisease)(Scriver,C.R.、Beaudet,A.L.、Sly,W.S.和Valle,D.主编)，第3389-3419页，McGraw-Hill，New York中所描述的GAA多态性和SNP，参见第3403-3405页。可以使用本领域中已知的任何变体GAA多肽作为定义亲本GAA多肽的基础。示例性的变体GAA多肽包括SEQID NO：2(NCBI参考序列NP_000143.2)；SEQ ID NO：29(GenBank AAA52506.1)；SEQ ID NO：30(GenBank CAA68763.1)；SEQ ID NO：31(GenBank：EAW89583.1)和SEQ ID NO：32(GenBankABI53718.1)。其他有用的变体包括在Hoefsloot等，(1988)EMBO J.7:1697和Van Hove等，(1996)Proc.Natl.Acad.Sci.USA 93:65(人类)以及GenBank登记号NM_008064(小鼠)中所描述的。其他变体GAA多肽包括在WO2012/145644、WO00/34451和US6,858,425中所描述的那些。在特定实施方式中，所述亲本GAA多肽源自于在SEQ ID NO：2或SEQ ID NO：30中示出的氨基酸序列。The parent GAA polypeptide may also be a GAA variant, such as GAA II as described by Kunita et al., (1997) Biochemica et Biophysica Acta 1362:269; by Hirschhorn, R. and Reuser, A.J. (2001) Metabolism of Genetic Diseases GAA described in The Metabolic and Molecular Basis for Inherited Disease (Scriver, C.R., Beaudet, A.L., Sly, W.S., and Valle, D., editors), pp. 3389-3419, McGraw-Hill, New York Polymorphisms and SNPs, see pages 3403-3405. Any variant GAA polypeptide known in the art can be used as a basis for defining the parent GAA polypeptide. Exemplary variant GAA polypeptides include SEQ ID NO: 2 (NCBI reference sequence NP_000143.2); SEQ ID NO: 29 (GenBank AAA52506.1); SEQ ID NO: 30 (GenBank CAA68763.1); SEQ ID NO: 31 (GenBank: EAW89583.1) and SEQ ID NO: 32 (GenBankABI53718.1). Other useful variants are included in Hoefsloot et al., (1988) EMBO J. 7:1697 and Van Hove et al., (1996) Proc. Natl. Acad. Sci. USA 93:65 (human) and GenBank accession number NM_008064 (mouse ) as described in. Other variant GAA polypeptides include those described in WO2012/145644, WO00/34451 and US6,858,425. In specific embodiments, the parent GAA polypeptide is derived from the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:30.

本发明的GAA的截短形式是亲本GAA多肽的N-端截短形式，其中从所述亲本GAA多肽的N-端末端缺失至少一个氨基酸。Truncated forms of GAA of the present invention are N-terminally truncated forms of a parent GAA polypeptide in which at least one amino acid is deleted from the N-terminus of the parent GAA polypeptide.

“截短形式”意味着包含从亲本GAA多肽的N-端部分缺失一个或几个连续氨基酸的GAA多肽。例如，所述GAA组成部分与所述亲本GAA蛋白相比，可以从其N-端末端截短1至75个连续氨基酸或超过75个连续氨基酸。具体来说，所述截短的GAA多肽与所述亲本GAA蛋白相比，可以从其N-端末端截短1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74或75个连续氨基酸(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。使用可替选的命名法，由所述亲本GAA多肽中1个氨基酸的截短产生的GAA多肽被称为Δ1GAA截短形式，由从N-端末端截短2个连续氨基酸产生的GAA多肽被称为Δ2GAA截短形式，由所述亲本GAA多肽中3个连续氨基酸的截短产生的GAA多肽被称为Δ3GAA截短形式，等等。在特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45、Δ46、Δ47、Δ48、Δ49、Δ50、Δ51、Δ52、Δ53、Δ54、Δ55、Δ56、Δ57、Δ58、Δ59、Δ60、Δ61、Δ62、Δ63、Δ64、Δ65、Δ66、Δ67、Δ68、Δ69、Δ70、Δ71、Δ72、Δ73、Δ74或Δ75GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。"Truncated form" means a GAA polypeptide that contains one or several consecutive amino acids deleted from the N-terminal portion of the parent GAA polypeptide. For example, the GAA moiety may be truncated from its N-terminal end by 1 to 75 contiguous amino acids or by more than 75 contiguous amino acids compared to the parent GAA protein. Specifically, the truncated GAA polypeptide can be truncated 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 from its N-terminal end compared with the parent GAA protein. ,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36 ,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61 , 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74 or 75 consecutive amino acids (especially in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ A truncated form of the parent hGAA protein shown in ID NO: 1). Using alternative nomenclature, the GAA polypeptide resulting from the truncation of 1 amino acid in the parent GAA polypeptide is referred to as the Δ1 GAA truncated form, and the GAA polypeptide resulting from the truncation of 2 consecutive amino acids from the N-terminus is is called the Δ2GAA truncated form, the GAA polypeptide resulting from the truncation of 3 consecutive amino acids in the parent GAA polypeptide is called the Δ3GAA truncated form, and so on. In specific embodiments, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18 , Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42, Δ43 , Δ44, Δ45, Δ46, Δ47, Δ48, Δ49, Δ50, Δ51, Δ52, Δ53, Δ54, Δ55, Δ56, Δ57, Δ58, Δ59, Δ60, Δ61, Δ62, Δ63, Δ64, Δ65, Δ66, Δ67, Δ68 , Δ69, Δ70, Δ71, Δ72, Δ73, Δ74 or Δ75GAA truncated forms (especially truncated forms of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 form).

在另一个特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45、Δ46或Δ47GAA截短形式(特别是在SEQID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In another specific embodiment, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17 , Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 , Δ43, Δ44, Δ45, Δ46 or Δ47 GA truncated form (especially the truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在另一个特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45或Δ46GAA截短形式(特别是在SEQ IDNO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In another specific embodiment, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17 , Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 , Δ43, Δ44, Δ45 or Δ46 GA truncated form (especially the truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在另一个特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44或Δ45GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In another specific embodiment, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17 , Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 , Δ43, Δ44 or Δ45 GA truncated form (especially the truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43或Δ44GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42, A Δ43 or Δ44 GA truncated form (especially a truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, or Δ43GAA truncated form (in particular the truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA Short forms (especially truncated forms of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA Short forms (especially truncated forms of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20,Δ21,Δ22,Δ23,Δ24,Δ25,Δ26,Δ27,Δ28,Δ29,Δ30,Δ31,Δ32,Δ33,Δ34,Δ35,Δ36,Δ37,Δ38,Δ39,Δ40,Δ41,Δ42 or Δ43GAA truncated form (In particular the truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ IDNO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form (especially is a truncated form of the parent hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated forms (especially in SEQ ID NO: 1 or SEQ ID NO: 33, in particular a truncated form of the parent hGAA protein shown in SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated forms (especially in SEQ ID NO: 1 or SEQ ID NO: 33, in particular a truncated form of the parent hGAA protein shown in SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated forms (especially in SEQ ID NO: 1 or SEQ ID NO:33, in particular a truncated form of the parent hGAA protein shown in SEQ ID NO:1).

在其他特定实施方式中，本发明的截短的GAA多肽是Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的亲本hGAA蛋白的截短形式)。In other specific embodiments, the truncated GAA polypeptides of the invention are Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated forms (especially in SEQ ID NO: 1 or SEQ ID NO:33, in particular a truncated form of the parental hGAA protein shown in SEQ ID NO:1).

在其他特定实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ6、Δ7、Δ8、Δ9或Δ10截短形式，特别是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ7、Δ8或Δ9截短形式，更特别是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ8截短形式。In other specific embodiments, the truncated GAA polypeptide of the invention is Δ6, Δ7, Δ8, Δ9 or Δ10 truncated forms, in particular Δ7, Δ8 or Δ9 of GAA (especially the hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1) A truncated form, more particularly a Δ8 truncated form of GAA (especially the hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ27、Δ28、Δ29、Δ30或Δ31截短形式，特别是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ28、Δ29或Δ30截短形式，更特别是GAA(特别是在SEQ ID NO：1或SEQ IDNO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ29截短形式。In other specific embodiments, the truncated GAA polypeptide of the invention is Δ27, Δ28, Δ29, Δ30 or Δ31 truncated forms, in particular Δ28, Δ29 or Δ30 of GAA (especially the hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1) A truncated form, more particularly a Δ29 truncated form of GAA (especially the hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在另一个特定实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ40、Δ41、Δ42、Δ43或Δ44截短形式，特别是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ41、Δ42或Δ43截短形式，更特别是GAA(特别是在SEQ ID NO：1或SEQ IDNO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ42截短形式。In another specific embodiment, the truncated GAA polypeptide of the invention is the Δ40 of GAA (especially the hGAA protein set forth in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1) , Δ41, Δ42, Δ43 or Δ44 truncated forms, especially Δ41, Δ42 or truncated forms of GAA (especially the hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1) A Δ43 truncated form, more particularly a Δ42 truncated form of GAA (especially the hGAA protein set forth in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1).

在其他特定实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1中示出的hGAA蛋白)的Δ41、Δ42、Δ43、Δ44或Δ45截短形式，特别是GAA(特别是在SEQ IDNO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ42、Δ43或Δ44截短形式，更特别是GAA(特别是在SEQ ID NO：1中示出的hGAA蛋白)的Δ43截短形式。In other specific embodiments, the truncated GAA polypeptide of the invention is a Δ41, Δ42, Δ43, Δ44 or Δ45 truncated form of GAA (especially the hGAA protein set forth in SEQ ID NO: 1), especially GAA Δ42, Δ43 or Δ44 truncated form of the hGAA protein (especially the hGAA protein shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1), more particularly GAA (especially the hGAA protein shown in SEQ ID NO: 1 : Δ43 truncated form of hGAA protein shown in 1).

在另一个实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ6、Δ7、Δ8、Δ9、Δ10、Δ27、Δ28、Δ29、Δ30、Δ31、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45、Δ46或Δ47截短形式。In another embodiment, the truncated GAA polypeptide of the invention is Δ6, Δ7, Δ8, Δ9, Δ10, Δ27, Δ28, Δ29, Δ30, Δ31, Δ40, Δ41, Δ42, Δ43, Δ44, Δ45, Δ46 or Δ47 truncated forms.

在另一个实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ7、Δ8、Δ9、Δ28、Δ29、Δ30、Δ41、Δ42、Δ43或Δ44截短形式。In another embodiment, the truncated GAA polypeptide of the invention is Δ7, Δ8, Δ9, Δ28, Δ29, Δ30, Δ41, Δ42, Δ43 or Δ44 truncated forms.

在另一个实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ6、Δ7、Δ8、Δ9、Δ10、Δ40、Δ41、Δ42、Δ43或Δ44截短形式。In another embodiment, the truncated GAA polypeptide of the invention is Δ6, Δ7, Δ8, Δ9, Δ10, Δ40, Δ41, Δ42, Δ43 or Δ44 truncated forms.

在另一个实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ8、Δ29、Δ42、Δ43或Δ47截短形式。In another embodiment, the truncated GAA polypeptide of the invention is Δ8, Δ29, Δ42, Δ43 or Δ47 truncated forms.

在另一个实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ8、Δ29、Δ42或Δ43截短形式。In another embodiment, the truncated GAA polypeptide of the invention is Δ8, Δ29, Δ42 or Δ43 truncated forms.

在另一个实施方式中，本发明的截短的GAA多肽是GAA(特别是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA蛋白)的Δ8或Δ42截短形式。In another embodiment, the truncated GAA polypeptide of the invention is Δ8 or Δ42 truncated form.

在本发明的特定实施方式中，本发明的截短的GAA多肽是功能性人类GAA多肽的截短形式。在其他特定实施方式中，所述亲本hGAA多肽是在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1中示出的hGAA多肽。在这个实施方式的变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45、Δ46、Δ47、Δ48、Δ49、Δ50、Δ51、Δ52、Δ53、Δ54、Δ55、Δ56、Δ57、Δ58、Δ59、Δ60、Δ61、Δ62、Δ63、Δ64、Δ65、Δ66、Δ67、Δ68、Δ69、Δ70、Δ71、Δ72、Δ73、Δ74或Δ75GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In specific embodiments of the invention, the truncated GAA polypeptides of the invention are truncated forms of functional human GAA polypeptides. In other specific embodiments, the parent hGAA polypeptide is the hGAA polypeptide set forth in SEQ ID NO: 1 or SEQ ID NO: 33, particularly SEQ ID NO: 1. In a variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly represented in SEQ ID NO: 1 Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20 of the hGAA polypeptide or functional variant thereof , Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42, Δ43, Δ44, Δ45 , Δ46, Δ47, Δ48, Δ49, Δ50, Δ51, Δ52, Δ53, Δ54, Δ55, Δ56, Δ57, Δ58, Δ59, Δ60, Δ61, Δ62, Δ63, Δ64, Δ65, Δ66, Δ67, Δ68, Δ69, Δ70 , Δ71, Δ72, Δ73, Δ74 or Δ75GAA truncated form, the functional variant comprising an amino acid substitution in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and At least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 sex.

在这个实施方式的变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45、Δ46或Δ47GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In a variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly represented in SEQ ID NO: 1 Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20 of the hGAA polypeptide or functional variant thereof , Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42, Δ43, Δ44, Δ45 , Δ46 or Δ47GAA truncated forms, the functional variants comprise amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and are identical to SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1, having at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44、Δ45或Δ46GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In a variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly represented in SEQ ID NO: 1 Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20 of the hGAA polypeptide or functional variant thereof , Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42, Δ43, Δ44, Δ45 Or a truncated form of Δ46GAA, the functional variant comprising an amino acid substitution in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and being identical to SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1, has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43、Δ44或Δ45GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In a variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly represented in SEQ ID NO: 1 Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20 of the hGAA polypeptide or functional variant thereof , Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42, Δ43, Δ44 or Δ45GAA Truncated forms, said functional variants comprise amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and are identical to SEQ ID NO: 1 or SEQ ID NO :33, especially SEQ ID NO:1 having at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42、Δ43或Δ44GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, or Δ44GAA truncated form, the functional variant comprising amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and is identical to SEQ ID NO: 1 or SEQ ID NO:33, particularly SEQ ID NO:1, has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA Short form, said functional variant comprises amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1, and is identical to SEQ ID NO: 1 or SEQ ID NO: 33. In particular, SEQ ID NO: 1 has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ1、Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ1, Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19,Δ20,Δ21,Δ22,Δ23,Δ24,Δ25,Δ26,Δ27,Δ28,Δ29,Δ30,Δ31,Δ32,Δ33,Δ34,Δ35,Δ36,Δ37,Δ38,Δ39,Δ40,Δ41 or Δ42GAA truncated form , the functional variant contains amino acid substitutions in SEQ ID NO: 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1, and is identical to SEQ ID NO: 1 or SEQ ID NO: 33, In particular, SEQ ID NO: 1 has an identity of at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99%.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, so Said functional variants comprise amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and are identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 is at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, the function Sexual variants comprise amino acid substitutions in the sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1, and are identical to SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1 has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, the functional changes The entity contains amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and is identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, the functional variant is in The sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and has the same amino acid substitution as SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 At least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ IDNO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ IDNO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, the functional variant is in SEQ ID NO. : 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1, contains amino acid substitutions, and has at least 75 or 80 0% similarity with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, the functional variant is in SEQ ID NO: 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1, containing amino acid substitutions, and having at least 75 or 80 % differences with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41或Δ42GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41 or Δ42GAA truncated form, the functional variant is in SEQ ID NO: 1 or The sequence shown in SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and is at least 75, 80, 85, 90 with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ2、Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ2, Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20,Δ21,Δ22,Δ23,Δ24,Δ25,Δ26,Δ27,Δ28,Δ29,Δ30,Δ31,Δ32,Δ33,Δ34,Δ35,Δ36,Δ37,Δ38,Δ39,Δ40,Δ41,Δ42 or Δ43GAA truncated form , the functional variant contains amino acid substitutions in SEQ ID NO: 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1, and is identical to SEQ ID NO: 1 or SEQ ID NO: 33, In particular, SEQ ID NO: 1 has an identity of at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99%.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ3、Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ3, Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form, so Said functional variants comprise amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and are identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 is at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ4、Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ4, Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form, the function Sexual variants comprise amino acid substitutions in the sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1, and are identical to SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1 has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ5、Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ5, Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form, the functional changes The entity contains amino acid substitutions in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, and is identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ6、Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ6, Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form, the functional variant is in The sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and has the same amino acid substitution as SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 At least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ7、Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ IDNO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ IDNO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ7, Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form, the functional variant is in SEQ ID NO. : 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1, contains amino acid substitutions, and has at least 75 or 80 0% similarity with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、甚至更特别是在SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ8、Δ9、Δ10、Δ11、Δ12、Δ13、Δ14、Δ15、Δ16、Δ17、Δ18、Δ19、Δ20、Δ21、Δ22、Δ23、Δ24、Δ25、Δ26、Δ27、Δ28、Δ29、Δ30、Δ31、Δ32、Δ33、Δ34、Δ35、Δ36、Δ37、Δ38、Δ39、Δ40、Δ41、Δ42或Δ43GAA截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少75、80、85、90、91、92、93、94、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly in SEQ ID NO: 1 or SEQ ID NO: 33, even more particularly in SEQ ID NO: Δ8, Δ9, Δ10, Δ11, Δ12, Δ13, Δ14, Δ15, Δ16, Δ17, Δ18, Δ19, Δ20, Δ21, Δ22, Δ23, Δ24, Δ25, Δ26, Δ27, Δ28, Δ29, Δ30, Δ31, Δ32, Δ33, Δ34, Δ35, Δ36, Δ37, Δ38, Δ39, Δ40, Δ41, Δ42 or Δ43GAA truncated form, the functional variant is in SEQ ID NO: 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1, containing amino acid substitutions, and having at least 75 or 80 % differences with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ6、Δ7、Δ8、Δ9或Δ10、特别是Δ7、Δ8或Δ9、更特别是Δ8截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ6, Δ7, Δ8, Δ9 or Δ10, especially Δ7, Δ8 or Δ9, more particularly Δ8 truncated forms of hGAA polypeptides or functional variants thereof, said functional variants in SEQ ID NO: 1 or The sequence shown in SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and has at least 80, 85 or 90 0% similarity with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ27、Δ28、Δ29、Δ30或Δ31、特别是Δ28、Δ29或Δ30、更特别是Δ29截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ27, Δ28, Δ29, Δ30 or Δ31, especially Δ28, Δ29 or Δ30, more particularly Δ29 truncated forms of the hGAA polypeptide or functional variants thereof, said functional variants in SEQ ID NO: 1 or The sequence shown in SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and has at least 80, 85 or 90 0% similarity with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ40、Δ41、Δ42、Δ43或Δ44、特别是Δ41、Δ42或Δ43、更特别是Δ42截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ40, Δ41, Δ42, Δ43 or Δ44, especially Δ41, Δ42 or Δ43, more particularly Δ42 truncated forms of the hGAA polypeptide or functional variants thereof, said functional variants in SEQ ID NO: 1 or The sequence shown in SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and has at least 80, 85 or 90 0% similarity with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ41、Δ42、Δ43、Δ44或Δ45、特别是Δ42、Δ43或Δ44、更特别是Δ43截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ41, Δ42, Δ43, Δ44 or Δ45, especially Δ42, Δ43 or Δ44, more particularly Δ43 truncated forms of the hGAA polypeptide or functional variants thereof, said functional variants in SEQ ID NO: 1 or The sequence shown in SEQ ID NO: 33, especially SEQ ID NO: 1, contains amino acid substitutions and has at least 80, 85 or 90 0% similarity with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 , 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ6、Δ7、Δ8、Δ9、Δ10、Δ27、Δ28、Δ29、Δ30、Δ31、Δ40、Δ41、Δ42、Δ43、Δ44或Δ45、特别是Δ7、Δ8、Δ9、Δ28、Δ29、Δ30、Δ41、Δ42、Δ43或Δ44、特别是Δ8、Δ29、Δ42或Δ43截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ6, Δ7, Δ8, Δ9, Δ10, Δ27, Δ28, Δ29, Δ30, Δ31, Δ40, Δ41, Δ42, Δ43, Δ44 or Δ45 of the hGAA polypeptide or functional variant thereof, especially Δ7, Δ8, Δ9 , Δ28, Δ29, Δ30, Δ41, Δ42, Δ43 or Δ44, especially Δ8, Δ29, Δ42 or Δ43 truncated forms, the functional variants are in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ The sequence shown in ID NO: 1 contains amino acid substitutions and has at least 80, 85, 90, 95, 96, 97, 98 or more similarities with SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 99% identical.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ6、Δ7、Δ8、Δ9、Δ10、Δ40、Δ41、Δ42、Δ43或Δ44、特别是Δ8或Δ42截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ6, Δ7, Δ8, Δ9, Δ10, Δ40, Δ41, Δ42, Δ43 or Δ44, especially Δ8 or Δ42 truncated forms of the hGAA polypeptide or functional variants thereof, which functional variants are in SEQ ID NO. : 1 or SEQ ID NO: 33, especially the sequence shown in SEQ ID NO: 1 contains amino acid substitutions, and has at least 80, 85, 90, 95, 96, 97, 98 or 99% identity.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ8、Δ29、Δ42、Δ43或Δ47截短形式，所述功能性变体在SEQ ID NO：1或SEQ IDNO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ IDNO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ8, Δ29, Δ42, Δ43 or Δ47 truncated form of the hGAA polypeptide or a functional variant thereof, which functional variant is represented by SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 The sequence shown contains amino acid substitutions and is at least 80, 85, 90, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ8、Δ29、Δ42或Δ43截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 Δ8, Δ29, Δ42 or Δ43 truncated form of the hGAA polypeptide or a functional variant thereof, the functional variant is shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 The sequence contains amino acid substitutions and is at least 80, 85, 90, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1 or SEQ ID NO: 33, particularly SEQ ID NO: 1.

在这个实施方式的另一种变化形式中，本发明的截短的GAA多肽是hGAA多肽、更特别是在SEQ ID NO：1或SEQ ID NO：33中、特别是SEQ ID NO：1中示出的hGAA多肽或其功能性变体的Δ8或Δ42截短形式，所述功能性变体在SEQ ID NO：1或SEQ ID NO：33、特别是SEQID NO：1所示的序列中包含氨基酸替换，并且与SEQ ID NO：1或SEQ ID NO：33、特别是SEQID NO：1具有至少80、85、90、95、96、97、98或99％的同一性。In another variation of this embodiment, the truncated GAA polypeptide of the invention is an hGAA polypeptide, more particularly represented in SEQ ID NO: 1 or SEQ ID NO: 33, particularly in SEQ ID NO: 1 A Δ8 or Δ42 truncated form of the hGAA polypeptide or a functional variant thereof comprising amino acids in the sequence shown in SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1 substitution, and is at least 80, 85, 90, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1 or SEQ ID NO: 33, especially SEQ ID NO: 1.

在特定实施方式中，本发明的截短的hGAA多肽具有由SEQ ID NO：27、SEQ ID NO：28、SEQ ID NO：34、SEQ ID NO：35或SEQ ID NO：36中示出的序列或其功能性变体构成的氨基酸序列，所述功能性变体与SEQ ID NO：27、SEQ ID NO：28、SEQ ID NO：34、SEQ ID NO：35或SEQ ID NO：36中示出的序列相比包含1至5个，特别是1至4个、特别是1至3个、更特别是1至2个、特别是1个氨基酸替换。在另一个特定实施方式中，本发明的截短的hGAA多肽具有由SEQ ID NO：27、SEQ ID NO：28、SEQ ID NO：34或SEQ ID NO：35中示出的序列或其功能性变体构成的氨基酸序列，所述功能性变体与SEQ ID NO：27、SEQ ID NO：28、SEQ ID NO：34或SEQ ID NO：35中示出的序列相比包含1至5个氨基酸替换。在特定实施方式中，本发明的截短的hGAA多肽具有由SEQ ID NO：27或SEQ ID NO：28中示出的序列或其功能性变体构成的氨基酸序列，所述功能性变体与SEQ ID NO：27或SEQ ID NO：28中示出的序列相比包含1至5个，特别是1至4个、特别是1至3个、更特别是1至2个、特别是1个氨基酸替换。In specific embodiments, the truncated hGAA polypeptide of the invention has the sequence set forth in SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 34, SEQ ID NO: 35 or SEQ ID NO: 36 or an amino acid sequence consisting of a functional variant thereof, which functional variant is identical to that shown in SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 34, SEQ ID NO: 35 or SEQ ID NO: 36 The sequence comparison contains 1 to 5, in particular 1 to 4, in particular 1 to 3, more in particular 1 to 2, in particular 1 amino acid substitutions. In another specific embodiment, the truncated hGAA polypeptide of the invention has the sequence set forth in SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 34 or SEQ ID NO: 35 or the functionality thereof Amino acid sequences consisting of variants, said functional variants comprising 1 to 5 amino acids compared to the sequence set forth in SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 34 or SEQ ID NO: 35 replace. In a specific embodiment, the truncated hGAA polypeptide of the invention has an amino acid sequence consisting of the sequence set forth in SEQ ID NO: 27 or SEQ ID NO: 28, or a functional variant thereof, which functional variant is identical to The sequence shown in SEQ ID NO: 27 or SEQ ID NO: 28 contains 1 to 5, especially 1 to 4, especially 1 to 3, more particularly 1 to 2, especially 1 Amino acid substitutions.

本发明的截短的GAA多肽还可以包含信号肽，例如GAA的天然信号肽或源自于另一种分泌蛋白的可替选信号肽。这些信号肽的非限制性实例包括在SEQ ID NO：3至7中示出的那些。发明人已令人吃惊地显示，将本发明的截短的GAA多肽融合到可替选信号肽甚至进一步提高了它的分泌。因此，本发明提供了一种嵌合GAA多肽，其包含信号组成部分和截短的GAA多肽组成部分，所述截短的GAA多肽组成部分是如上所定义的截短的GAA多肽。在特定实施方式中，所述信号肽是GAA的天然信号肽，例如SEQ ID NO：4中示出的hGAA的信号肽。在另一个实施方式中，所述信号肽是源自于不同于GAA的蛋白质的外源(或可替选)信号肽。在特定实施方式中，所述可替选信号肽选自SEQ ID NO：3、5、6和7或其如下文所定义的功能性衍生物。The truncated GAA polypeptides of the invention may also comprise a signal peptide, such as the native signal peptide of GAA or an alternative signal peptide derived from another secreted protein. Non-limiting examples of these signal peptides include those set forth in SEQ ID NOs: 3 to 7. The inventors have surprisingly shown that fusing a truncated GAA polypeptide of the invention to an alternative signal peptide enhances its secretion even further. Accordingly, the present invention provides a chimeric GAA polypeptide comprising a signal moiety and a truncated GAA polypeptide moiety, the truncated GAA polypeptide moiety being a truncated GAA polypeptide as defined above. In a specific embodiment, the signal peptide is a native signal peptide of GAA, such as the signal peptide of hGAA set forth in SEQ ID NO:4. In another embodiment, the signal peptide is an exogenous (or alternative) signal peptide derived from a protein other than GAA. In a specific embodiment, the alternative signal peptide is selected from the group consisting of SEQ ID NOs: 3, 5, 6 and 7 or functional derivatives thereof as defined below.

发明人已显示，融合到所述GAA蛋白的剩余部分的外源信号肽与包含其天然信号肽的相应GAA多肽相比，提高了得到的嵌合GAA多肽的分泌。此外，所述截短的GAA多肽组成部分与包含融合到所述亲本GAA多肽的同一信号肽的嵌合GAA多肽相比，也提高了所述嵌合GAA多肽(包括信号肽和截短的GAA多肽两者)的分泌。The inventors have shown that an exogenous signal peptide fused to the remainder of the GAA protein increases secretion of the resulting chimeric GAA polypeptide compared to the corresponding GAA polypeptide containing its native signal peptide. In addition, the truncated GAA polypeptide moiety also has improved performance compared to a chimeric GAA polypeptide comprising the same signal peptide fused to the parent GAA polypeptide. secretion of both polypeptides).

可以在本发明中工作的特定信号肽包括来自于胰凝乳蛋白酶原B2的1-20位氨基酸(SEQ ID NO：3)、人类α-1-抗胰蛋白酶的信号肽(SEQ ID NO：5)、来自于艾杜糖醛酸-2-硫酸酯酶的1-25位氨基酸(SEQ ID NO：6)和来自于蛋白酶C1抑制剂的1-23位氨基酸(SEQ IDNO：7)。当与包含天然信号肽的GAA相比时，SEQ ID NO：3和SEQ ID NO：5至SEQ ID NO：7的信号肽允许所述嵌合GAA蛋白在体外和体内两者更高地分泌。在特定实施方式中，所述信号肽具有SEQ ID NO：3至7中示出的序列，或者是其功能性衍生物，即与SEQ ID NO：3至7中示出的序列相比包含1至5个，特别是1至5个、特别是1至3个、更特别是1至2个、特别是1个氨基酸缺失、插入或替换的序列，只要所得到的序列对应于功能性信号肽，即允许GAA蛋白分泌的信号肽即可。在特定实施方式中，所述信号肽组成部分序列由选自SEQ ID NO：3至7的序列构成。Specific signal peptides that may work in the present invention include amino acids 1-20 from chymotrypsinogen B2 (SEQ ID NO: 3), the signal peptide from human alpha-1-antitrypsin (SEQ ID NO: 5 ), amino acids 1-25 from iduronate-2-sulfatase (SEQ ID NO: 6) and amino acids 1-23 from protease C1 inhibitor (SEQ ID NO: 7). The signal peptides of SEQ ID NO:3 and SEQ ID NO:5 to SEQ ID NO:7 allow for higher secretion of the chimeric GAA protein both in vitro and in vivo when compared to GAA containing the native signal peptide. In a specific embodiment, the signal peptide has the sequence set forth in SEQ ID NO: 3 to 7, or is a functional derivative thereof, that is, compared to the sequence set forth in SEQ ID NO: 3 to 7, it contains 1 to 5, in particular 1 to 5, in particular 1 to 3, more in particular 1 to 2, in particular 1, amino acid deletions, insertions or substitutions in the sequence, provided that the resulting sequence corresponds to a functional signal peptide , that is, the signal peptide that allows GAA protein to be secreted. In a specific embodiment, the signal peptide component sequence consists of a sequence selected from SEQ ID NO: 3 to 7.

在特定实施方式中，本发明的GAA多肽选自：In a specific embodiment, the GAA polypeptide of the invention is selected from:

-SEQ ID NO：3与GAA的Δ8截短形式，例如SEQ ID NO：27中显示的hGAA的Δ8截短形式的组合；- a combination of SEQ ID NO: 3 with a Δ8 truncated form of GAA, such as the Δ8 truncated form of hGAA shown in SEQ ID NO: 27;

-SEQ ID NO：4与GAA的Δ8截短形式，例如SEQ ID NO：27中显示的hGAA的Δ8截短形式的组合；- a combination of SEQ ID NO: 4 with a Δ8 truncated form of GAA, such as the Δ8 truncated form of hGAA shown in SEQ ID NO: 27;

-SEQ ID NO：5与GAA的Δ8截短形式，例如SEQ ID NO：27中显示的hGAA的Δ8截短形式的组合；- a combination of SEQ ID NO: 5 with a Δ8 truncated form of GAA, such as the Δ8 truncated form of hGAA shown in SEQ ID NO: 27;

-SEQ ID NO：6与GAA的Δ8截短形式，例如SEQ ID NO：27中显示的hGAA的Δ8截短形式的组合；- a combination of SEQ ID NO: 6 with a Δ8 truncated form of GAA, such as the Δ8 truncated form of hGAA shown in SEQ ID NO: 27;

-SEQ ID NO：7与GAA的Δ8截短形式，例如SEQ ID NO：27中显示的hGAA的Δ8截短形式的组合；- a combination of SEQ ID NO: 7 with a Δ8 truncated form of GAA, such as the Δ8 truncated form of hGAA shown in SEQ ID NO: 27;

-SEQ ID NO：3与GAA的Δ29截短形式，例如SEQ ID NO：34中显示的hGAA的Δ29截短形式的组合；- a combination of SEQ ID NO: 3 with a Δ29 truncated form of GAA, such as the Δ29 truncated form of hGAA shown in SEQ ID NO: 34;

-SEQ ID NO：4与GAA的Δ29截短形式，例如SEQ ID NO：34中显示的hGAA的Δ29截短形式的组合；- a combination of SEQ ID NO: 4 with a Δ29 truncated form of GAA, such as the Δ29 truncated form of hGAA shown in SEQ ID NO: 34;

-SEQ ID NO：5与GAA的Δ29截短形式，例如SEQ ID NO：34中显示的hGAA的Δ29截短形式的组合；- a combination of SEQ ID NO: 5 with a Δ29 truncated form of GAA, such as the Δ29 truncated form of hGAA shown in SEQ ID NO: 34;

-SEQ ID NO：6与GAA的Δ29截短形式，例如SEQ ID NO：34中显示的hGAA的Δ29截短形式的组合；- a combination of SEQ ID NO: 6 with a Δ29 truncated form of GAA, such as the Δ29 truncated form of hGAA shown in SEQ ID NO: 34;

-SEQ ID NO：7与GAA的Δ29截短形式，例如SEQ ID NO：34中显示的hGAA的Δ29截短形式的组合；- a combination of SEQ ID NO: 7 with a Δ29 truncated form of GAA, such as the Δ29 truncated form of hGAA shown in SEQ ID NO: 34;

-SEQ ID NO：3与GAA的Δ42截短形式，例如SEQ ID NO：28中显示的hGAA的Δ42截短形式的组合；- a combination of SEQ ID NO: 3 with a Δ42 truncated form of GAA, such as the Δ42 truncated form of hGAA shown in SEQ ID NO: 28;

-SEQ ID NO：4与GAA的Δ42截短形式，例如SEQ ID NO：28中显示的hGAA的Δ42截短形式的组合；- a combination of SEQ ID NO: 4 with a Δ42 truncated form of GAA, such as the Δ42 truncated form of hGAA shown in SEQ ID NO: 28;

-SEQ ID NO：5与GAA的Δ42截短形式，例如SEQ ID NO：28中显示的hGAA的Δ42截短形式的组合；- a combination of SEQ ID NO: 5 with a Δ42 truncated form of GAA, such as the Δ42 truncated form of hGAA shown in SEQ ID NO: 28;

-SEQ ID NO：6与GAA的Δ42截短形式，例如SEQ ID NO：28中显示的hGAA的Δ42截短形式的组合；- a combination of SEQ ID NO: 6 with a Δ42 truncated form of GAA, such as the Δ42 truncated form of hGAA shown in SEQ ID NO: 28;

-SEQ ID NO：7与GAA的Δ42截短形式，例如SEQ ID NO：28中显示的hGAA的Δ42截短形式的组合；- a combination of SEQ ID NO: 7 with a Δ42 truncated form of GAA, such as the Δ42 truncated form of hGAA shown in SEQ ID NO: 28;

-SEQ ID NO：3与GAA的Δ43截短形式，例如SEQ ID NO：35中显示的hGAA的Δ43截短形式的组合；- a combination of SEQ ID NO: 3 with a Δ43 truncated form of GAA, such as the Δ43 truncated form of hGAA shown in SEQ ID NO: 35;

-SEQ ID NO：4与GAA的Δ43截短形式，例如SEQ ID NO：35中显示的hGAA的Δ43截短形式的组合；- a combination of SEQ ID NO: 4 with a Δ43 truncated form of GAA, such as the Δ43 truncated form of hGAA shown in SEQ ID NO: 35;

-SEQ ID NO：5与GAA的Δ43截短形式，例如SEQ ID NO：35中显示的hGAA的Δ43截短形式的组合；- a combination of SEQ ID NO: 5 with a Δ43 truncated form of GAA, such as the Δ43 truncated form of hGAA shown in SEQ ID NO: 35;

-SEQ ID NO：6与GAA的Δ43截短形式，例如SEQ ID NO：35中显示的hGAA的Δ43截短形式的组合；和- a combination of SEQ ID NO: 6 with a Δ43 truncated form of GAA, such as the Δ43 truncated form of hGAA shown in SEQ ID NO: 35; and

-SEQ ID NO：7与GAA的Δ43截短形式，例如SEQ ID NO：35中显示的hGAA的Δ43截短形式的组合；- a combination of SEQ ID NO: 7 with a Δ43 truncated form of GAA, such as the Δ43 truncated form of hGAA shown in SEQ ID NO: 35;

-SEQ ID NO：3与GAA的Δ47截短形式，例如SEQ ID NO：36中显示的hGAA的Δ47截短形式的组合；- a combination of SEQ ID NO: 3 with a Δ47 truncated form of GAA, such as the Δ47 truncated form of hGAA shown in SEQ ID NO: 36;

-SEQ ID NO：4与GAA的Δ47截短形式，例如SEQ ID NO：36中显示的hGAA的Δ47截短形式的组合；- a combination of SEQ ID NO: 4 with a Δ47 truncated form of GAA, such as the Δ47 truncated form of hGAA shown in SEQ ID NO: 36;

-SEQ ID NO：5与GAA的Δ47截短形式，例如SEQ ID NO：36中显示的hGAA的Δ47截短形式的组合；- a combination of SEQ ID NO: 5 with a Δ47 truncated form of GAA, such as the Δ47 truncated form of hGAA shown in SEQ ID NO: 36;

-SEQ ID NO：6与GAA的Δ47截短形式，例如SEQ ID NO：36中显示的hGAA的Δ47截短形式的组合；和- a combination of SEQ ID NO: 6 with a Δ47 truncated form of GAA, such as the Δ47 truncated form of hGAA shown in SEQ ID NO: 36; and

-SEQ ID NO：7与GAA的Δ47截短形式，例如SEQ ID NO：36中显示的hGAA的Δ47截短形式的组合；- a combination of SEQ ID NO: 7 with a Δ47 truncated form of GAA, such as the Δ47 truncated form of hGAA shown in SEQ ID NO: 36;

或者是其功能性衍生物，所述功能性衍生物与所得到的序列组合具有至少90％的同一性，特别是至少95％、至少96％、至少97％、至少98％或至少99％的同一性。在这些实施方式中，正如上面提到的，所述信号肽组成部分可以是与SEQ ID NO：3至7中示出的序列相比包含1至5个，特别是1至4个、特别是1至3个、更特别是1至2个、特别是1个氨基酸缺失、插入或替换的序列，只要所得到的序列对应于功能性信号肽，即允许得到的嵌合的截短的GAA蛋白分泌的信号肽即可。Or a functional derivative thereof, which has at least 90% identity with the resulting sequence combination, especially at least 95%, at least 96%, at least 97%, at least 98% or at least 99%. Identity. In these embodiments, as mentioned above, the signal peptide component may comprise from 1 to 5, in particular from 1 to 4, especially from the sequence shown in SEQ ID NO: 3 to 7. Sequences in which 1 to 3, more particularly 1 to 2, in particular 1 amino acid are deleted, inserted or substituted, allow the resulting chimeric truncated GAA protein as long as the resulting sequence corresponds to a functional signal peptide The secreted signal peptide is sufficient.

从细胞分泌的新合成的GAA的相对比例可以通过本领域中已知的和实施例中所描述的方法常规地确定。分泌的蛋白质可以通过在细胞培养基、血清、奶等中直接测量所述蛋白质本身(例如通过蛋白质印迹)或通过蛋白质活性测定法(例如酶测定法)来检测。The relative proportions of newly synthesized GAA secreted from cells can be routinely determined by methods known in the art and described in the Examples. Secreted proteins can be detected by direct measurement of the protein itself in cell culture medium, serum, milk, etc. (eg by Western blotting) or by protein activity assays (eg enzymatic assays).

本领域技术人员还应该理解，所述截短的GAA多肽或嵌合GAA多肽可以含有另外的氨基酸，例如作为核酸构建物的操作例如添加限制性位点的结果，只要这些另外的氨基酸不使所述信号肽或GAA多肽不具有功能即可。所述另外的氨基酸可以被切除或者可以被成熟多肽保留，只要保留不产生无功能的多肽即可。It will also be understood by those skilled in the art that the truncated GAA polypeptide or chimeric GAA polypeptide may contain additional amino acids, for example as a result of manipulations of nucleic acid constructs such as the addition of restriction sites, so long as these additional amino acids do not It is sufficient that the signal peptide or GAA polypeptide does not have a function. The additional amino acids may be cleaved or may be retained by the mature polypeptide so long as the retention does not result in a non-functional polypeptide.

另一方面，本发明涉及一种核酸分子，其编码本发明的截短的GAA多肽或本发明的嵌合GAA多肽。In another aspect, the invention relates to a nucleic acid molecule encoding a truncated GAA polypeptide of the invention or a chimeric GAA polypeptide of the invention.

编码截短的GAA的本发明的核酸分子的序列针对在体内表达所述GAA多肽被优化。序列优化可以包括核酸序列的大量改变，包括密码子优化，提高GC含量，减少CpG岛的数目，减少可选开放阅读框(ARF)的数目和减少拼接供体和拼接受体位点的数目。由于遗传密码的简并性，不同核酸分子可能编码相同蛋白质。也已公知，不同生物体中的遗传密码通常偏向于使用编码同一氨基酸的几个密码子中的一者超过其他密码子。通过密码子优化，将改变引入到核苷酸序列中，其利用给定细胞背景中存在的密码子偏好，以使得到的密码子优化的核苷酸序列与未经密码子优化的序列相比更可能以相对高的水平在这种给定的细胞背景中表达。在本发明的优选实施方式中，这种编码截短的GAA的序列优化过的核苷酸序列被密码子优化，以与编码同一截短的GAA蛋白的未经密码子优化的核苷酸序列相比提高它在人类细胞中的表达，例如通过利用人类特异性密码子使用偏好。The sequence of the nucleic acid molecule of the invention encoding a truncated GAA is optimized for expression of the GAA polypeptide in vivo. Sequence optimization can include numerous changes to the nucleic acid sequence, including codon optimization, increasing GC content, reducing the number of CpG islands, reducing the number of alternative open reading frames (ARFs) and reducing the number of splice donor and splice acceptor sites. Due to the degeneracy of the genetic code, different nucleic acid molecules may encode the same protein. It is also known that the genetic code in different organisms often favors the use of one of several codons encoding the same amino acid over the others. With codon optimization, changes are introduced into a nucleotide sequence that take advantage of the codon preferences present in a given cellular context such that the resulting codon-optimized nucleotide sequence compares to a sequence that has not been codon-optimized is more likely to be expressed at relatively high levels in this given cellular context. In a preferred embodiment of the invention, this sequence-optimized nucleotide sequence encoding a truncated GAA is codon-optimized to match the non-codon-optimized nucleotide sequence encoding the same truncated GAA protein. Compared to improving its expression in human cells, for example by exploiting human-specific codon usage preferences.

在特定实施方式中，所述优化过的GAA编码序列是密码子优化的，和/或与SEQ IDNO：8的野生型hGAA编码序列的82-2859位核苷酸相比具有提高的GC含量和/或具有减少的可选开放阅读框数目和/或具有减少的拼接供体和/或拼接受体位点数目。例如，本发明的核酸序列引起所述GAA序列中与野生型GAA序列的序列相比GC含量提高至少2、3、4、5或10％。在特定实施方式中，本发明的核酸序列引起所述GAA序列中与野生型GAA核苷酸序列的序列相比GC含量提高2、3、4或更特别地5％或10％(特别是5％)。在特定实施方式中，编码功能性GAA多肽的本发明的核酸序列与SEQ ID NO：8中示出的序列的82-2859位核苷酸“基本上相同”，即具有约70％同一性、更优选地约80％同一性、甚至更优选地约90％同一性、甚至更优选地约95％同一性、甚至更优选地约97％、98％或甚至99％同一性。正如上文提到的，除了GC含量和/或ARF数目之外，序列优化还可以包括所述序列中CpG岛数目的减少和/或拼接供体和受体位点数目的减少。当然，正如本领域技术人员公知的，序列优化是所有这些参数之间的平衡，意味着如果上述参数中的至少一者改善而一种或多种其他参数没有改善，则序列可以被认为是优化的，只要所述优化的序列引起所述转入基因的改进例如表达提高和/或在体内对所述转入基因的免疫应答降低即可。In a specific embodiment, the optimized GAA coding sequence is codon-optimized and/or has increased GC content compared to nucleotides 82-2859 of the wild-type hGAA coding sequence of SEQ ID NO: 8. /or have a reduced number of alternative open reading frames and/or have a reduced number of splice donor and/or splice acceptor sites. For example, the nucleic acid sequence of the invention causes an increase in the GC content of the GAA sequence by at least 2, 3, 4, 5 or 10% compared to the sequence of the wild-type GAA sequence. In a specific embodiment, the nucleic acid sequence of the invention causes an increase in the GC content of said GAA sequence by 2, 3, 4 or more specifically 5% or 10% (especially 5%) compared to the sequence of the wild-type GAA nucleotide sequence. %). In a specific embodiment, the nucleic acid sequence of the invention encoding a functional GAA polypeptide is "substantially identical", i.e., has about 70% identity, to nucleotides 82-2859 of the sequence set forth in SEQ ID NO: 8. More preferably about 80% identity, even more preferably about 90% identity, even more preferably about 95% identity, even more preferably about 97%, 98% or even 99% identity. As mentioned above, in addition to GC content and/or ARF number, sequence optimization may also include a reduction in the number of CpG islands and/or a reduction in the number of splice donor and acceptor sites in the sequence. Of course, as is well known to those skilled in the art, sequence optimization is a balance between all these parameters, meaning that a sequence can be considered optimized if at least one of the above parameters improves while one or more other parameters do not. , as long as the optimized sequence results in an improvement of the transferred gene, such as an increase in expression and/or a reduction in the immune response to the transferred gene in vivo.

此外，编码功能性GAA的核苷酸序列对人类细胞的密码子使用的适应性可以被表示为密码子适应指数(CAI)。密码子适应指数在本文中被定义为基因的密码子使用对高表达的人类基因的密码子使用的相对适应性的度量。每个密码子的相对适应性(w)是每个密码子的使用与同一氨基酸的最丰富的密码子的使用的比率。所述CAI被定义为这些相对适应性值的几何平均值。排除非同义密码子和终止密码子(取决于遗传密码)。CAI值的范围为0至1，更高的值指示最丰富密码子的更高比例(参见Sharp和Li，1987,Nucleic AcidsResearch 15:1281-1295；也参见：Kim等，Gene.1997,199:293-301；zurMegede等，Journalof Virology,2000,74:2628-2635)。优选地，编码GAA的核酸分子具有至少0.75(特别是0.77)、0.8、0.85、0.90、0.92或0.94的CAI。Furthermore, the adaptability of a nucleotide sequence encoding a functional GAA to the codon usage of human cells can be expressed as a codon adaptation index (CAI). The codon adaptation index is defined herein as a measure of the relative adaptability of a gene's codon usage to the codon usage of highly expressed human genes. The relative fitness (w) of each codon is the ratio of the usage of each codon to the usage of the most abundant codon for the same amino acid. The CAI is defined as the geometric mean of these relative fitness values. Exclude non-synonymous codons and stop codons (depending on the genetic code). CAI values range from 0 to 1, with higher values indicating a higher proportion of the most abundant codons (see Sharp and Li, 1987, Nucleic Acids Research 15:1281-1295; see also: Kim et al., Gene. 1997, 199: 293-301; zurMegede et al., Journal of Virology, 2000, 74: 2628-2635). Preferably, the GAA-encoding nucleic acid molecule has a CAI of at least 0.75 (especially 0.77), 0.8, 0.85, 0.90, 0.92 or 0.94.

术语“核酸序列”(或核酸分子)是指采取单链或双链形式的DNA或RNA分子，特别是编码本发明的GAA蛋白的DNA。The term "nucleic acid sequence" (or nucleic acid molecule) refers to a DNA or RNA molecule in single- or double-stranded form, in particular DNA encoding the GAA protein of the invention.

发明人已发现，上文描述的截短的GAA多肽当从编码它的核酸分子表达时，与野生型GAA cDNA相比，令人吃惊地引起功能性GAA蛋白在体外和体内两者的高水平表达。此外，也正如本发明人所示，从表达本发明的核酸分子的肝和肌细胞产生的截短的GAA蛋白不诱导免疫应答。这意味着该核酸分子可用于生产高水平的GAA多肽并提供治疗益处，例如避免求助于免疫抑制性治疗，允许低剂量的免疫抑制性治疗，和允许向需要的对象重复给药本发明的核酸分子。因此，本发明的截短的GAA多肽和本发明的核酸分子在缺乏GAA表达和/或活性或GAA的高水平表达可以改善疾病例如糖原贮积病的情形中是特别令人感兴趣的。具体来说，所述糖原贮积病可以是GSDI(冯·吉尔克氏病(von Gierke's disease))、GSDII(庞贝氏病)、GSDIII(科里氏病(Cori disease))、GSDIV、GSDV、GSDVI、GSDVII、GSDVIII或心脏的致死性先天性糖原贮积病。更具体来说，所述糖原贮积病选自GSDI、GSDII和GSDIII，甚至更特别地选自GSDII和GSDIII。在甚至更特定的实施方式中，所述糖原贮积病是GSDII。具体来说，本发明的核酸分子可用于基因疗法中，以治疗缺乏GAA的病症或与糖原的积累相关的其他病症，例如GSDI(冯·吉尔克氏病)、GSDII(庞贝氏病)、GSDIII(科里氏病)、GSDIV、GSDV、GSDVI、GSDVII、GSDVIII和心脏的致死性先天性糖原贮积病，更特别是GSDI、GSDII或GSDIII，甚至更特别是GSDII和GSDIII。在甚至更特定的实施方式中，本发明的核酸分子可用于基因疗法中，以治疗GSDII。The inventors have discovered that the truncated GAA polypeptide described above, when expressed from the nucleic acid molecule encoding it, surprisingly causes high levels of functional GAA protein both in vitro and in vivo compared to wild-type GAA cDNA. Express. Furthermore, as also shown by the present inventors, truncated GAA proteins produced from liver and muscle cells expressing the nucleic acid molecules of the invention do not induce an immune response. This means that the nucleic acid molecules can be used to produce high levels of GAA polypeptides and provide therapeutic benefits, such as avoiding recourse to immunosuppressive treatments, allowing low doses of immunosuppressive treatments, and allowing repeated administration of the nucleic acids of the invention to subjects in need. molecular. Therefore, the truncated GAA polypeptides of the invention and the nucleic acid molecules of the invention are of particular interest in situations where a lack of GAA expression and/or activity or where high levels of GAA expression may ameliorate diseases, such as glycogen storage diseases. Specifically, the glycogen storage disease may be GSDI (von Gierke's disease), GSDII (Pompe's disease), GSDIII (Cori disease), GSDIV, GSDV, GSDVI, GSDVII, GSDVIII, or fatal congenital glycogen storage diseases of the heart. More specifically, the glycogen storage disease is selected from GSDI, GSDII and GSDIII, even more particularly from GSDII and GSDIII. In an even more specific embodiment, the glycogen storage disease is GSDII. In particular, the nucleic acid molecules of the present invention can be used in gene therapy to treat GAA-deficient disorders or other disorders related to the accumulation of glycogen, such as GSDI (von Gierke's disease), GSDII (Pompe's disease) , GSDIII (Corey's disease), GSDIV, GSDV, GSDVI, GSDVII, GSDVIII and fatal congenital glycogen storage diseases of the heart, more particularly GSDI, GSDII or GSDIII, even more particularly GSDII and GSDIII. In an even more specific embodiment, the nucleic acid molecules of the invention may be used in gene therapy to treat GSDII.

在本发明的另一个实施方式中，本发明的核酸分子的编码所述截短的GAA多肽组成部分的部分与编码SEQ ID NO：1或SEQ ID NO：33、特别是SEQ ID NO：1的核苷酸序列的相应部分具有至少75％(例如77,7％)或至少80％或至少82％(例如83.1％)的同一性，后者是不含信号肽的野生型hGAA多肽的序列。In another embodiment of the invention, the portion of the nucleic acid molecule of the invention encoding said truncated GAA polypeptide moiety is identical to the portion encoding SEQ ID NO: 1 or SEQ ID NO: 33, in particular SEQ ID NO: 1 Corresponding portions of the nucleotide sequence have an identity of at least 75% (eg 77,7%) or at least 80% or at least 82% (eg 83.1%) to the sequence of a wild-type hGAA polypeptide without a signal peptide.

本发明的核酸分子的截短的GAA组成部分与作为序列优化过的序列的SEQ ID NO：10或11的核苷酸序列优选地具有至少85％、更优选地至少90％、甚至更优选地至少92％同一性、特别是至少95％的同一性，例如至少98、99或100％的同一性。The truncated GAA component of the nucleic acid molecule of the invention preferably has at least 85%, more preferably at least 90%, even more preferably, the nucleotide sequence of SEQ ID NO: 10 or 11 as a sequence-optimized sequence. At least 92% identity, in particular at least 95% identity, for example at least 98, 99 or 100% identity.

术语“同一性”及其偏差是指两个核酸分子之间的序列同一性。当两个被比较序列中的一个位置被同一碱基占据时(例如如果两个DNA分子中的每一者中的一个位置被腺嘌呤占据)，则所述分子在该位置处是一致的。两个序列之间的同一性百分数是所述两个序列共有的匹配位置的数目除以被比较的位置的数目X 100的函数。例如，如果在两个序列中10个位置中的6个匹配，则所述两个序列是60％一致的。通常，在将两个序列对齐以给出最大同一性时做出比较。本领域技术人员已知的各种不同的生物信息学工具可用于比对核酸序列，例如BLAST或FASTA。The term "identity" and its deviations refer to the sequence identity between two nucleic acid molecules. When a position in two compared sequences is occupied by the same base (eg if a position in each of two DNA molecules is occupied by adenine), the molecules are identical at that position. The percent identity between two sequences is a function of the number of matching positions common to the two sequences divided by the number of positions compared X 100. For example, if 6 out of 10 positions in two sequences match, the two sequences are 60% identical. Typically, comparisons are made when two sequences are aligned to give maximum identity. A variety of different bioinformatics tools known to those skilled in the art can be used to align nucleic acid sequences, such as BLAST or FASTA.

此外，本发明的核酸分子编码功能性GAA蛋白，即它编码人类GAA蛋白，所述蛋白在被表达时具有野生型GAA蛋白的功能。正如上文所定义的，野生型GAA的功能是水解寡糖和多糖、更特别是糖原的α-1,4和α-1,6连接两者，以释放出葡萄糖。由本发明的核酸编码的功能性GAA蛋白与SEQ ID NO：1、2、30或33的野生型GAA蛋白相比，可以具有至少50％、60％、70％、80％、90％、95％、99％或至少100％的对糖原的水解活性。由本发明的核酸编码的GAA蛋白的活性甚至可以为SEQ ID NO：1、2、30或33的野生型GAA蛋白的活性的超过100％，例如超过110％、120％、130％、140％或甚至超过150％。Furthermore, the nucleic acid molecule of the invention encodes a functional GAA protein, ie it encodes a human GAA protein which, when expressed, has the functionality of a wild-type GAA protein. As defined above, the function of wild-type GAA is to hydrolyze both oligosaccharides and polysaccharides, more particularly the alpha-1,4 and alpha-1,6 linkages of glycogen, to release glucose. The functional GAA protein encoded by the nucleic acid of the present invention may have at least 50%, 60%, 70%, 80%, 90%, 95% compared to the wild-type GAA protein of SEQ ID NO: 1, 2, 30 or 33. , 99% or at least 100% glycogen hydrolytic activity. The activity of the GAA protein encoded by the nucleic acid of the present invention may even be more than 100% of the activity of the wild-type GAA protein of SEQ ID NO: 1, 2, 30 or 33, such as more than 110%, 120%, 130%, 140% or Even more than 150%.

专业技术人员能够容易地确定本发明的核酸是否表达功能性GAA蛋白。适合的方法对于本领域技术人员来说是显而易见的。例如，一种适合的体外方法包括将所述核酸插入到载体例如质粒或病毒载体中，用所述载体转染或转导宿主细胞例如293T或HeLa细胞或其他细胞例如Huh7，并测定GAA活性。可替选地，适合的体内方法包括将含有所述核酸的载体转导到庞贝氏病或另一种糖原贮积病的小鼠模型中，并测定小鼠血浆中的功能性GAA和组织中GAA的存在。适合的方法更详细描述在下面的实验部分中。A skilled person can easily determine whether the nucleic acid of the invention expresses a functional GAA protein. Suitable methods will be apparent to those skilled in the art. For example, one suitable in vitro method involves inserting the nucleic acid into a vector such as a plasmid or viral vector, transfecting or transducing a host cell such as 293T or HeLa cells or other cells such as Huh7 with the vector, and assaying for GAA activity. Alternatively, a suitable in vivo method involves transducing a vector containing the nucleic acid into a mouse model of Pompe disease or another glycogen storage disease and determining functional GAA and GAA in the mouse plasma. The presence of GAA in the organization. Suitable methods are described in more detail in the experimental section below.

在特定实施方式中，本发明的核酸分子包含在SEQ ID NO：12或SEQ ID NO：13中示出的序列，其编码具有SEQ ID NO：27中示出的氨基酸序列的多肽；在SEQ ID NO：48或SEQID NO：49中示出的序列，其编码具有SEQ ID NO：28中示出的氨基酸序列的多肽；在SEQ IDNO：50或SEQ ID NO：51中示出的序列，其编码具有SEQ ID NO：35中示出的氨基酸序列的多肽；或在SEQ ID NO：52或SEQ ID NO：53中示出的序列，其编码具有SEQ ID NO：36中示出的氨基酸序列的多肽。在进一步的实施方式中，本发明的核酸分子包含在SEQ ID NO：12或SEQID NO：13中示出的序列，其编码具有SEQ ID NO：27中示出的氨基酸序列的多肽；在SEQ IDNO：48或SEQ ID NO：49中示出的序列，其编码具有SEQ ID NO：28中示出的氨基酸序列的多肽；或在SEQ ID NO：50或SEQ ID NO：51中示出的序列，其编码具有SEQ ID NO：35中示出的氨基酸序列的多肽。在特定实施方式中，本发明的核酸分子包含在SEQ ID NO：12或SEQ IDNO：13中示出的序列，其编码具有SEQ ID NO：27中示出的氨基酸序列的多肽。In a specific embodiment, the nucleic acid molecule of the invention comprises the sequence set forth in SEQ ID NO: 12 or SEQ ID NO: 13, which encodes a polypeptide having the amino acid sequence set forth in SEQ ID NO: 27; in SEQ ID The sequence shown in NO: 48 or SEQ ID NO: 49, which encodes a polypeptide having the amino acid sequence shown in SEQ ID NO: 28; the sequence shown in SEQ ID NO: 50 or SEQ ID NO: 51, which encodes A polypeptide having the amino acid sequence set forth in SEQ ID NO: 35; or a sequence set forth in SEQ ID NO: 52 or SEQ ID NO: 53 encoding a polypeptide having the amino acid sequence set forth in SEQ ID NO: 36 . In a further embodiment, the nucleic acid molecule of the invention comprises the sequence set forth in SEQ ID NO: 12 or SEQ ID NO: 13, which encodes a polypeptide having the amino acid sequence set forth in SEQ ID NO: 27; in SEQ ID NO : 48 or the sequence set forth in SEQ ID NO: 49, which encodes a polypeptide having the amino acid sequence set forth in SEQ ID NO: 28; or the sequence set forth in SEQ ID NO: 50 or SEQ ID NO: 51, It encodes a polypeptide having the amino acid sequence shown in SEQ ID NO:35. In a specific embodiment, the nucleic acid molecule of the invention comprises the sequence set forth in SEQ ID NO: 12 or SEQ ID NO: 13, which encodes a polypeptide having the amino acid sequence set forth in SEQ ID NO: 27.

本发明还涉及包含本发明的核酸分子的核酸构建物。所述核酸构建物可以对应于表达盒，其包含可操作连接到一个或多个表达控制序列和/或提高转入基因的表达的其他序列和/或增强被编码蛋白质的分泌的序列和/或增强被编码蛋白质的摄入的序列的本发明的核酸序列。当在本文中使用时，术语“可操作连接”是指多核苷酸元件以功能性关系相连。当核酸被放置成与另一个核酸序列处于功能性关系时，它被“可操作连接”。例如，启动子或另一个转录调控序列，如果它影响编码序列的转录，则被可操作连接到所述编码序列。这些表达控制序列在本领域中是已知的，例如启动子、增强子(例如顺式调控模块(CRM))、内含子、polyA信号等。The invention also relates to nucleic acid constructs comprising the nucleic acid molecules of the invention. The nucleic acid construct may correspond to an expression cassette comprising other sequences operably linked to one or more expression control sequences and/or other sequences that increase expression of the transferred gene and/or sequences that enhance secretion of the encoded protein and/or Nucleic acid sequences of the invention enhance the uptake of the encoded protein. When used herein, the term "operably linked" means that polynucleotide elements are linked in a functional relationship. A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For example, a promoter or another transcriptional control sequence is operably linked to a coding sequence if it affects the transcription of the coding sequence. These expression control sequences are known in the art, such as promoters, enhancers (eg, cis-regulatory modules (CRM)), introns, polyA signals, and the like.

具体来说，所述表达盒可以包含启动子。所述启动子可以是遍在或组织特异性启动子，特别是能够在需要GAA表达的细胞或组织中，例如在缺乏GAA的患者中的需要GAA表达的细胞或组织中促进表达的启动子。在特定实施方式中，所述启动子是肝特异性启动子例如α-1抗胰蛋白酶启动子(hAAT)(SEQ ID NO：14)、甲状腺素运载蛋白启动子、白蛋白启动子、甲状腺素结合性球蛋白(TBG)启动子、LSP启动子(包含甲状腺素结合性球蛋白启动子序列、两个拷贝的α1-微球蛋白/双库尼茨抑制剂(bikunin)增强子序列和前导序列-34。Ill,C.R.等，(1997)，用于甲型血友病的基因疗法的人类因子VIII互补DNA表达质粒的优化(Optimization of the human factor VIII complementary DNA expression plasmidfor gene therapy of hemophilia A)，Blood Coag.Fibrinol.8:S23–S30)等。其他有用的肝特异性启动子在本领域中是已知的，例如在冷泉港实验室(Cold Spring HarborLaboratory)编纂的肝特异性基因启动子数据库(http://rulai.cshl.edu/LSPD/)中列出的启动子。在本发明的情形中，优选的启动子是hAAT启动子。在另一个实施方式中，所述启动子是在一种目标组织或细胞中(例如在肌细胞中)和在肝细胞中指导表达的启动子。例如，在一定程度上，对肌细胞特异的启动子例如肌间线蛋白、Spc5-12和MCK启动子可能在肝细胞中存在一定的表达泄漏，这对于诱导对象对从本发明的核酸表达的GAA多肽的免疫耐受性来说可以是有利的。In particular, the expression cassette may comprise a promoter. The promoter may be a ubiquitous or tissue-specific promoter, in particular a promoter capable of promoting expression in cells or tissues requiring GAA expression, such as in cells or tissues requiring GAA expression in a GAA-deficient patient. In specific embodiments, the promoter is a liver-specific promoter such as alpha-1 antitrypsin promoter (hAAT) (SEQ ID NO: 14), transthyretin promoter, albumin promoter, thyroxine Binding globulin (TBG) promoter, LSP promoter (containing thyroxine-binding globulin promoter sequence, two copies of α1-microglobulin/bikunin enhancer sequence and leader sequence -34. Ill, C.R. et al., (1997), Optimization of the human factor VIII complementary DNA expression plasmid for gene therapy of hemophilia A, Blood Coag. Fibrinol. 8:S23–S30), etc. Other useful liver-specific promoters are known in the art, for example in the liver-specific gene promoter database compiled by Cold Spring Harbor Laboratory (http://rulai.cshl.edu/LSPD/ ). In the context of the present invention, a preferred promoter is the hAAT promoter. In another embodiment, the promoter is one that directs expression in a tissue or cell of interest (eg, in muscle cells) and in hepatocytes. For example, to a certain extent, promoters specific to muscle cells, such as desmin, Spc5-12 and MCK promoters, may have a certain expression leakage in hepatocytes, which is useful for inducing subjects to express expression from the nucleic acid of the invention. GAA polypeptides may be advantageous in terms of immune tolerance.

其他组织特异性或非组织特异性启动子在本发明的实践中也可以是有用的。例如，所述表达盒可以包含组织特异性启动子，其是不同于肝特异性启动子的启动子。例如，所述启动子可以是肌肉特异性的，例如肌间线蛋白启动子(和肌间线蛋白启动子变体例如包含天然或人工增强子的肌间线蛋白启动子)、SPc5-12或MCK启动子。在另一个实施方式中，所述启动子是对其他细胞谱系特异的启动子例如红细胞生成素启动子，用于从红细胞谱系的细胞表达所述GAA多肽。Other tissue-specific or non-tissue-specific promoters may also be useful in the practice of the invention. For example, the expression cassette may comprise a tissue-specific promoter that is different from a liver-specific promoter. For example, the promoter may be muscle specific, such as the desmin promoter (and desmin promoter variants such as the desmin promoter containing a natural or artificial enhancer), SPc5-12 or MCK promoter. In another embodiment, the promoter is a promoter specific for other cell lineages, such as an erythropoietin promoter, for expression of the GAA polypeptide from cells of the erythroid lineage.

在另一个实施方式中，所述启动子是遍在启动子。代表性的遍在启动子包括巨细胞病毒增强子/鸡β肌动蛋白(CAG)启动子、巨细胞病毒增强子/启动子(CMV)、PGK启动子、SV40早期启动子等。In another embodiment, the promoter is a ubiquitous promoter. Representative ubiquitous promoters include cytomegalovirus enhancer/chicken β-actin (CAG) promoter, cytomegalovirus enhancer/promoter (CMV), PGK promoter, SV40 early promoter, etc.

此外，所述启动子也可以是内源启动子，例如白蛋白启动子或GAA启动子。Furthermore, the promoter may also be an endogenous promoter, such as an albumin promoter or a GAA promoter.

在特定实施方式中，所述启动子被结合到增强子序列例如顺式调控模块(CRM)或人工增强子序列。例如，所述启动子可以被结合到增强子序列例如人类ApoE控制区(或人类载脂蛋白E/C-I基因座，肝控制区HCR-1——Genbank登记号U32510，示出在SEQ ID NO：15中)。在特定实施方式中，增强子序列例如ApoE序列被结合到肝特异性启动子例如上面列出的那些启动子，特别是例如hAAT启动子。在本发明的实践中有用的其他CRM包括在Rincon等，MolTher.2015Jan；23(1):43-52；Chuah等，MolTher.2014Sep；22(9):1605-13或Nair等，Blood.2014May 15；123(20):3195-9中所描述的CRM。In certain embodiments, the promoter is bound to an enhancer sequence such as a cis-acting regulatory module (CRM) or an artificial enhancer sequence. For example, the promoter can be bound to an enhancer sequence such as the human ApoE control region (or human apolipoprotein E/C-I locus, liver control region HCR-1 - Genbank accession number U32510, shown in SEQ ID NO: 15). In particular embodiments, enhancer sequences, such as ApoE sequences, are bound to liver-specific promoters, such as those listed above, in particular, for example, the hAAT promoter. Other CRMs useful in the practice of the present invention include those in Rincon et al. MolTher. 2015 Jan; 23(1):43-52; Chuah et al. MolTher. 2014 Sep; 22(9):1605-13 or Nair et al. Blood. 2014 May CRM as described in 15;123(20):3195-9.

在另一个特定实施方式中，所述核酸构建物包含内含子，特别是置于所述启动子与GAA编码序列之间的内含子。内含子可以被引入以提高mRNA的稳定性和蛋白质的生产。在其他实施方式中，所述核酸构建物包含人类β球蛋白b2(或HBB2)内含子、凝血因子IX(FIX)内含子、SV40内含子或鸡β-球蛋白内含子。在另一个其他实施方式中，本发明的核酸构建物含有修饰的内含子(特别是修饰的HBB2或FIX内含子)，其被设计用于减少在所述内含子中存在的可选开放阅读框(ARF)的数目或甚至完全除去ARF。优选地，除去长度跨度超过50bp并具有与起始密码子同框的终止密码子的ARF。ARF可以通过修改所述内含子的序列来去除。例如，可以利用核苷酸替换、插入或缺失，优选地通过核苷酸替换进行修饰。作为实例，可以替换在目标内含子序列中存在的ATG或GTG起始密码子中的一个或多个核苷酸，特别是一个核苷酸，产生非起始密码子。例如，可以将目标内含子序列内的ATG或GTG用不是起始密码子的CTG代替。In another specific embodiment, the nucleic acid construct comprises an intron, in particular an intron placed between the promoter and the GAA coding sequence. Introns can be introduced to improve mRNA stability and protein production. In other embodiments, the nucleic acid construct comprises a human beta-globin b2 (or HBB2) intron, a factor IX (FIX) intron, an SV40 intron, or a chicken beta-globin intron. In yet other embodiments, the nucleic acid constructs of the invention contain modified introns (especially modified HBB2 or FIX introns) designed to reduce the optional number of open reading frames (ARFs) or even complete removal of ARFs. Preferably, ARFs that span more than 50 bp in length and have a stop codon in-frame with the start codon are removed. ARF can be removed by modifying the sequence of the intron. For example, nucleotide substitutions, insertions or deletions may be utilized, preferably modification by nucleotide substitutions. As an example, one or more nucleotides, in particular one nucleotide, in an ATG or GTG start codon present in the intron sequence of interest can be replaced, creating a non-start codon. For example, the ATG or GTG within the target intron sequence can be replaced with a CTG that is not the start codon.

在核酸构建物中使用的经典HBB2内含子示出在SEQ ID NO：16中。例如，该HBB2内含子可以通过消除所述内含子中的起始密码子(ATG和GTG密码子)进行修饰。在特定实施方式中，包含在所述构建物中的修饰的HBB2内含子具有在SEQ ID NO：17中示出的序列。在核酸构建物中使用的经典FIX内含子源自于人类FIX的第一内含子，并示出在SEQ ID NO：18中。FIX内含子可以通过消除所述内含子中的起始密码子(ATG和GTG密码子)进行修饰。在特定实施方式中，包含在本发明的构建物中的修饰的FIX内含子具有在SEQ ID NO：19中示出的序列。在核酸构建物中使用的经典鸡β-球蛋白内含子示出在SEQ ID NO：20中。鸡β-球蛋白内含子可以通过消除所述内含子中的起始密码子(ATG和GTG密码子)进行修饰。在特定实施方式中，包含在本发明的构建物中的修饰的鸡β-球蛋白内含子具有在SEQ ID NO：21中示出的序列。The classic HBB2 intron used in nucleic acid constructs is shown in SEQ ID NO:16. For example, the HBB2 intron can be modified by eliminating the initiation codon (ATG and GTG codons) in the intron. In a specific embodiment, the modified HBB2 intron comprised in the construct has the sequence set forth in SEQ ID NO: 17. The classic FIX intron used in nucleic acid constructs is derived from the first intron of human FIX and is shown in SEQ ID NO:18. The FIX intron can be modified by eliminating the start codon (ATG and GTG codons) in the intron. In a specific embodiment, the modified FIX intron comprised in the constructs of the invention has the sequence set forth in SEQ ID NO: 19. The classic chicken beta-globin intron used in nucleic acid constructs is shown in SEQ ID NO:20. The chicken beta-globin intron can be modified by eliminating the initiation codon (ATG and GTG codons) in the intron. In a specific embodiment, the modified chicken beta-globin intron comprised in the constructs of the invention has the sequence set forth in SEQ ID NO:21.

发明人以前已在WO2015/162302中显示，这种修饰的内含子特别是修饰的HBB2或FIX内含子具有有利的性质，并且可以显著提高转入基因的表达。The inventors have previously shown in WO2015/162302 that this modified intron, especially the modified HBB2 or FIX intron, has advantageous properties and can significantly improve the expression of the transferred gene.

在特定实施方式中，本发明的核酸构建物是一种表达盒，其以5'至3'的方向包含任选地在前面带有增强子的启动子、本发明的编码序列(即本发明的优化的截短的GAA编码序列、本发明的嵌合GAA编码序列或本发明的嵌合且序列优化过的GAA编码序列)和多腺苷化信号(例如牛生长激素多腺苷化信号、SV40多腺苷化信号或另一种天然存在的或人工多腺苷化信号)。在特定实施方式中，本发明的核酸构建物是一种表达盒，其以5'至3'的方向包含任选地在前面带有增强子(例如ApoE控制区)的启动子、内含子(特别是如上所定义的内含子)、本发明的编码序列和多腺苷化信号。在另一个特定实施方式中，本发明的核酸构建物是一种表达盒，其以5'至3'的方向包含增强子例如ApoE控制区、启动子、内含子(特别是如上所定义的内含子)、本发明的编码序列和多腺苷化信号。在本发明的其他特定实施方式中，所述表达盒以5'至3'的方向包含ApoE控制区、hAAT-肝特异性启动子、HBB2内含子(特别是如上所定义的修饰的HBB2内含子)、本发明的编码序列和牛生长激素多腺苷化信号，例如在下述序列中示出的构建物：In a specific embodiment, the nucleic acid construct of the invention is an expression cassette comprising in a 5' to 3' direction a promoter optionally preceded by an enhancer, the coding sequence of the invention (i.e. the coding sequence of the invention An optimized truncated GAA coding sequence, a chimeric GAA coding sequence of the invention or a chimeric and sequence-optimized GAA coding sequence of the invention) and a polyadenylation signal (e.g., a bovine growth hormone polyadenylation signal, SV40 polyadenylation signal or another naturally occurring or artificial polyadenylation signal). In a specific embodiment, the nucleic acid construct of the invention is an expression cassette comprising a promoter, an intron, in a 5' to 3' orientation, optionally preceded by an enhancer (e.g., an ApoE control region) (especially an intron as defined above), the coding sequence of the invention and the polyadenylation signal. In another specific embodiment, the nucleic acid construct of the invention is an expression cassette comprising in the 5' to 3' direction an enhancer such as an ApoE control region, a promoter, an intron (in particular as defined above intron), the coding sequence of the invention and the polyadenylation signal. In other specific embodiments of the invention, the expression cassette comprises in the 5' to 3' direction an ApoE control region, a hAAT-liver-specific promoter, an HBB2 intron (especially a modified HBB2 intron as defined above). intron), the coding sequence of the invention and the bovine growth hormone polyadenylation signal, for example the construct shown in the following sequence:

-SEQ ID NO：22，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式并编码SEQ ID NO：5的信号肽的未优化的核苷酸序列；- SEQ ID NO: 22, comprising an unoptimized nucleotide sequence encoding a Δ8 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 and encoding the signal peptide of SEQ ID NO: 5;

-SEQ ID NO：23，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：5的信号肽的优化的序列；- SEQ ID NO: 23, which includes a Δ8 truncated form of GAA encoding the parental hGAA of SEQ ID NO: 1 (a nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding SEQ ID NO: 1 Optimized sequence of signal peptide of NO:5;

-SEQ ID NO：24，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)并编码SEQ ID NO：5的信号肽的另一个优化的序列；- SEQ ID NO: 24, which includes a Δ8 truncated form of GAA encoding the parental hGAA of SEQ ID NO: 1 (a nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and encoding SEQ ID NO: 1 Another optimized sequence of the signal peptide of NO:5;

-SEQ ID NO：25，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的优化的序列；- SEQ ID NO: 25, which includes a Δ8 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and SEQ ID NO : Optimized sequence of signal peptide 3;

-SEQ ID NO：26，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的优化的序列；- SEQ ID NO: 26, which includes the Δ42 truncated form of GAA encoding the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and SEQ ID NO : Optimized sequence of signal peptide 3;

-SEQ ID NO：37，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ29截短形式(源自于SEQ ID NO：9的未优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的未优化的序列；- SEQ ID NO: 37, which includes a Δ29 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the unoptimized sequence of SEQ ID NO: 9) and SEQ ID NO: 1 The unoptimized sequence of the signal peptide of NO:3;

-SEQ ID NO：38，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ29截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的优化的序列；- SEQ ID NO: 38, which includes a Δ29 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and SEQ ID NO : Optimized sequence of signal peptide 3;

-SEQ ID NO：39，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ29截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的另一个优化的序列；- SEQ ID NO: 39, which includes the Δ29 truncated form of GAA encoding the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and SEQ ID NO : Another optimized sequence of the signal peptide of 3;

-SEQ ID NO：40，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式(源自于SEQ ID NO：9的未优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的未优化的序列；- SEQ ID NO: 40, which includes the Δ42 truncated form of GAA encoding the parental hGAA of SEQ ID NO: 1 (the nucleotide sequence derived from the unoptimized sequence of SEQ ID NO: 9) and SEQ ID NO: 1 The unoptimized sequence of the signal peptide of NO:3;

-SEQ ID NO：41，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的另一个优化的序列；- SEQ ID NO: 41, which includes the Δ42 truncated form of GAA encoding the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and SEQ ID NO : Another optimized sequence of the signal peptide of 3;

-SEQ ID NO：42，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ43截短形式(源自于SEQ ID NO：9的未优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的未优化的序列；- SEQ ID NO: 42, which includes a Δ43 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the unoptimized sequence of SEQ ID NO: 9) and SEQ ID NO: 1 The unoptimized sequence of the signal peptide of NO:3;

-SEQ ID NO：43，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ43截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的优化的序列；和- SEQ ID NO: 43, which includes a Δ43 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and SEQ ID NO : The optimized sequence of the signal peptide of 3; and

-SEQ ID NO：44，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ43截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的另一个优化的序列；- SEQ ID NO: 44, which includes a Δ43 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and SEQ ID NO : Another optimized sequence of the signal peptide of 3;

-SEQ ID NO：45，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ47截短形式(源自于SEQ ID NO：9的未优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的未优化的序列；- SEQ ID NO: 45, which includes a Δ47 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the unoptimized sequence of SEQ ID NO: 9) and SEQ ID NO: 1 The unoptimized sequence of the signal peptide of NO:3;

-SEQ ID NO：46，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ47截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的优化的序列；和- SEQ ID NO: 46, which includes a Δ47 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and SEQ ID NO : The optimized sequence of the signal peptide of 3; and

-SEQ ID NO：47，其包括编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ47截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)和SEQ ID NO：3的信号肽的另一个优化的序列。- SEQ ID NO: 47, which includes a Δ47 truncated form encoding GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and SEQ ID NO :Another optimized sequence of the signal peptide of 3.

本发明的其他表达盒可以包括下述核酸序列：Other expression cassettes of the invention may include the following nucleic acid sequences:

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式并编码SEQ ID NO：4、6或7的信号肽的未优化的核苷酸序列；- an unoptimized nucleotide sequence encoding a Δ8 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 and encoding the signal peptide of SEQ ID NO: 4, 6 or 7;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ29截短形式并编码SEQ ID NO：4、6或7的信号肽的未优化的核苷酸序列；- an unoptimized nucleotide sequence encoding a Δ29 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 and encoding the signal peptide of SEQ ID NO: 4, 6 or 7;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式并编码SEQ ID NO：4、6或7的信号肽的未优化的核苷酸序列；- an unoptimized nucleotide sequence encoding a Δ42 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 and encoding the signal peptide of SEQ ID NO: 4, 6 or 7;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ43截短形式并编码SEQ ID NO：4、6或7的信号肽的未优化的核苷酸序列；- an unoptimized nucleotide sequence encoding a Δ43 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 and encoding the signal peptide of SEQ ID NO: 4, 6 or 7;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ47截短形式并编码SEQ ID NO：4、6或7的信号肽的未优化的核苷酸序列；- an unoptimized nucleotide sequence encoding a Δ47 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 and encoding the signal peptide of SEQ ID NO: 4, 6 or 7;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ8 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ8截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ8 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ29截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ29 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ29截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ29 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ42 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ42 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ43截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ43 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ43截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ43 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ47截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列；- encoding a Δ47 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide;

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ47截短形式(源自于SEQ ID NO：13的优化序列的核苷酸序列)并编码SEQ ID NO：4、6或7的信号肽的优化的核苷酸序列。- encoding a Δ47 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 13) and encoding SEQ ID NO: 4, 6 or 7 Optimized nucleotide sequence of signal peptide.

在这些特定构建物的可替选实施方式中，编码SEQ ID NO：1的序列被编码SEQ IDNO：33的序列代替。In alternative embodiments of these particular constructs, the sequence encoding SEQ ID NO:1 is replaced by the sequence encoding SEQ ID NO:33.

在特定实施方式中，所述表达盒包含ApoE控制区、hAAT-肝特异性启动子、密码子优化的HBB2内含子、本发明的编码序列和牛生长激素多腺苷化信号。In specific embodiments, the expression cassette contains an ApoE control region, a hAAT-liver-specific promoter, a codon-optimized HBB2 intron, the coding sequence of the invention, and a bovine growth hormone polyadenylation signal.

在设计本发明的核酸构建物时，本领域技术人员应该注意用于将所述构建物递送到细胞和器官的载体的尺寸限制。具体来说，本领域技术人员知道AAV载体的主要限制是它的运载容量，其可能随着AAV血清型而变，但据认为被限制在母体病毒基因组的尺寸左右。例如，5kb通常被认为是包装在AAV8衣壳中的最大尺寸(Wu Z.等，MolTher.,2010,18(1):80-86；Lai Y.等，Mol Ther.,2010,18(1):75-79；Wang Y.等，Hum Gene Ther Methods,2012,23(4):225-33)。因此，本领域技术人员在本发明的实践中应该注意选择本发明的核酸构建物的组分，以使得到的核酸序列，包括编码AAV 5'-至3'-ITR的序列，优选地不超过所使用的AAV载体的运载容量的110％，具体来说优选地不超过5.5kb。When designing nucleic acid constructs of the invention, those skilled in the art should be aware of the size limitations of vectors used to deliver the constructs to cells and organs. Specifically, those skilled in the art know that a major limitation of an AAV vector is its carrying capacity, which may vary with AAV serotype but is thought to be limited to around the size of the parent viral genome. For example, 5 kb is generally considered to be the maximum size packaged in AAV8 capsids (Wu Z. et al., Mol Ther., 2010, 18(1):80-86; Lai Y. et al., Mol Ther., 2010, 18(1) ):75-79; Wang Y. et al., Hum Gene Ther Methods, 2012, 23(4):225-33). Therefore, those skilled in the art should take care in the practice of the present invention to select the components of the nucleic acid construct of the present invention so that the resulting nucleic acid sequence, including the sequence encoding the AAV 5'- to 3'-ITR, preferably does not exceed 110% of the carrying capacity of the AAV vector used, specifically preferably no more than 5.5 kb.

本发明还涉及包含本文中公开的核酸分子或构建物的载体。具体来说，本发明的载体是适合于蛋白质表达，优选地用于基因疗法的载体。在一个实施方式中，所述载体是质粒载体。在另一个实施方式中，所述载体是含有本发明的核酸分子、特别是编码本发明的GAA多肽的信使RNA的纳米粒子。在另一个实施方式中，所述载体是基于转座子的系统，允许将本发明的核酸分子或构建物整合到靶细胞的基因组中，例如极度活跃的睡美人(Sleeping Beauty)(SB100X)转座子系统(Mates等，2009)。在另一个实施方式中，所述载体是适用于基因疗法的病毒载体，靶向任何目标细胞例如肝组织或细胞、肌细胞、CNS细胞(例如脑细胞)或造血干细胞例如红细胞谱系的细胞(例如红细胞)。在这种情况下，本发明的核酸构建物还含有本领域中公知的适合于生产高效病毒载体的序列。在特定实施方式中，所述病毒载体源自于整合病毒。具体来说，所述病毒载体可以源自于反转录病毒或慢病毒。在另一个特定实施方式中，所述病毒载体是AAV载体，例如适用于转导肝组织或细胞的AAV载体，更特别是AAV-1、AAV-2和AAV-2变体(例如包含具有Y44+500+730F+T491V改变的工程化衣壳的四重突变的衣壳优化的AAV-2，其公开在Ling等，2016Jul 18,Hum Gene TherMethods.[Epub ahead of print]中)、AAV-3和AAV-3变体(例如包含具有两个氨基酸变化S663V+T492V的工程化AAV3衣壳的AAV3-ST变体，其公开在Vercauteren等，2016,Mol.Ther.Vol.24(6),p.1042中)、AAV-3B和AAV-3B变体、AAV-4、AAV-5、AAV-6和AAV-6变体(例如包含三重突变的AAV6衣壳Y731F/Y705F/T492V形式的AAV6变体，其公开在Rosario等，2016,Mol Ther Methods Clin Dev.3,p.16026中)、AAV-7、AAV-8、AAV-9、AAV-10例如AAV-cy10和AAV-rh10、AAV-rh74、AAV-dj、Anc80、LK03、AAV2i8、猪AAV血清型例如AAVpo4和AAVpo6等载体，或反转录病毒载体例如慢病毒载体和α-反转录病毒。正如在本领域中已知的，取决于考虑使用的具体病毒载体，将其他适合的序列引入到本发明的核酸构建物中，用于获得功能性病毒载体。适合的序列包括用于AAV载体的AAV ITR或用于慢病毒载体的LTR。因此，本发明还涉及如上所述的表达盒，其在每一侧带有ITR或LTR。The invention also relates to vectors comprising the nucleic acid molecules or constructs disclosed herein. In particular, the vector of the present invention is a vector suitable for protein expression, preferably for gene therapy. In one embodiment, the vector is a plasmid vector. In another embodiment, the vector is a nanoparticle containing a nucleic acid molecule of the invention, in particular messenger RNA encoding a GAA polypeptide of the invention. In another embodiment, the vector is a transposon-based system that allows integration of the nucleic acid molecules or constructs of the invention into the genome of the target cell, such as the hyperactive Sleeping Beauty (SB100X) transposon seat system (Mates et al., 2009). In another embodiment, the vector is a viral vector suitable for gene therapy targeting any target cell such as liver tissue or cells, myocytes, CNS cells (e.g. brain cells) or hematopoietic stem cells such as cells of the erythroid lineage (e.g. red blood cells). In this case, the nucleic acid constructs of the invention also contain sequences known in the art that are suitable for the production of efficient viral vectors. In specific embodiments, the viral vector is derived from an integrating virus. In particular, the viral vectors may be derived from retroviruses or lentiviruses. In another specific embodiment, the viral vector is an AAV vector, such as an AAV vector suitable for transduction of liver tissue or cells, more particularly AAV-1, AAV-2 and AAV-2 variants (e.g. containing Y44 +500+730F+T491V modified quadruple mutated capsid-optimized AAV-2 engineered capsid, which was disclosed in Ling et al., 2016Jul 18, Hum Gene TherMethods. [Epub ahead of print]), AAV-3 and AAV-3 variants (such as the AAV3-ST variant comprising an engineered AAV3 capsid with two amino acid changes S663V+T492V, disclosed in Vercauteren et al., 2016, Mol. Ther. Vol. 24(6), p .1042), AAV-3B and AAV-3B variants, AAV-4, AAV-5, AAV-6 and AAV-6 variants (e.g. AAV6 variants containing triple mutations in the Y731F/Y705F/T492V form of the AAV6 capsid entities, which are disclosed in Rosario et al., 2016, Mol Ther Methods Clin Dev. 3, p. 16026), AAV-7, AAV-8, AAV-9, AAV-10 such as AAV-cy10 and AAV-rh10, AAV- Vectors such as rh74, AAV-dj, Anc80, LK03, AAV2i8, porcine AAV serotypes such as AAVpo4 and AAVpo6, or retroviral vectors such as lentiviral vectors and alpha-retroviruses. As is known in the art, depending on the particular viral vector contemplated for use, other suitable sequences can be introduced into the nucleic acid constructs of the invention for obtaining functional viral vectors. Suitable sequences include AAV ITR for AAV vectors or LTR for lentiviral vectors. Therefore, the invention also relates to an expression cassette as described above, carrying an ITR or LTR on each side.

病毒载体的优点在本公开下面的部分中讨论。病毒载体对于递送本发明的核酸分子或构建物来说是优选的，例如反转录病毒载体如慢病毒载体，或非致病性细小病毒，更优选为AAV载体。人类细小病毒腺相关病毒(AAV)是一种天然复制有缺陷的依赖病毒，其能够整合到被感染细胞的基因组中以建立潜伏感染。最后一种性质在哺乳动物病毒中似乎是独特的，因为所述整合发生在人类基因组中被称为AAVS1的特异性位点处，其位于19号染色体上(19q13.3-qter)。Advantages of viral vectors are discussed in the remainder of this disclosure. Viral vectors are preferred for delivery of the nucleic acid molecules or constructs of the invention, such as retroviral vectors such as lentiviral vectors, or non-pathogenic parvoviruses, more preferably AAV vectors. The human parvovirus adeno-associated virus (AAV) is a naturally replication-deficient, dependent virus capable of integrating into the genome of infected cells to establish latent infection. This last property appears to be unique among mammalian viruses because the integration occurs at a specific site in the human genome called AAVS1, located on chromosome 19 (19q13.3-qter).

因此，AAV载体作为用于人类基因疗法的潜在载体引起了相当大的兴趣。所述病毒的有利性质包括它与任何人类疾病缺乏相关性，它感染分裂细胞和非分裂细胞两者的能力，以及可以被感染的源自于不同组织的广范围的细胞系。Therefore, AAV vectors have attracted considerable interest as potential vectors for human gene therapy. Advantageous properties of the virus include its lack of association with any human disease, its ability to infect both dividing and non-dividing cells, and the wide range of cell lines originating from different tissues that can be infected.

在从人类或非人类灵长动物(NHP)分离并被充分表征的AAV的血清型中，人类血清型2是被开发作为基因转移载体的第一种AAV，其他目前使用的AAV血清型包括AAV-1、AAV-2变体(例如包含具有Y44+500+730F+T491V改变的工程化衣壳的四重突变的衣壳优化的AAV-2，其公开在Ling等，2016Jul 18,Hum Gene Ther Methods.[Epub ahead of print]中)、AAV-3和AAV-3变体(例如包含具有两个氨基酸变化S663V+T492V的工程化AAV3衣壳的AAV3-ST变体，其公开在Vercauteren等，2016,Mol.Ther.Vol.24(6),p.1042中)、AAV-3B和AAV-3B变体、AAV-4、AAV-5、AAV-6和AAV-6变体(例如包含三重突变的AAV6衣壳Y731F/Y705F/T492V形式的AAV6变体，其公开在Rosario等，2016,Mol Ther Methods Clin Dev.3,p.16026中)、AAV-7、AAV-8、AAV-9、AAV-10例如AAV-cy10和AAV-rh10、AAV-rh74、AAV-dj、Anc80、LK03、AAV2i8、猪AAV血清型例如AAVpo4和AAVpo6，以及AAV血清型的酪氨酸、赖氨酸和丝氨酸衣壳突变体等。此外，其他非天然的工程化变体和嵌合AAV也可以是有用的。Among the serotypes of AAV isolated and well characterized from humans or non-human primates (NHP), human serotype 2 was the first AAV to be developed as a gene transfer vector. Other currently used AAV serotypes include AAV -1. AAV-2 variants (e.g., capsid-optimized AAV-2 containing quadruple mutations of engineered capsids with Y44+500+730F+T491V changes, which was disclosed in Ling et al., 2016Jul 18, Hum Gene Ther Methods. [Epub ahead of print]), AAV-3, and AAV-3 variants (e.g., AAV3-ST variants containing an engineered AAV3 capsid with two amino acid changes S663V+T492V, disclosed in Vercauteren et al., 2016, Mol.Ther.Vol.24(6), p.1042), AAV-3B and AAV-3B variants, AAV-4, AAV-5, AAV-6 and AAV-6 variants (e.g. containing triple AAV6 variants in the mutated AAV6 capsid Y731F/Y705F/T492V form as disclosed in Rosario et al., 2016, Mol Ther Methods Clin Dev. 3, p. 16026), AAV-7, AAV-8, AAV-9, AAV-10 such as AAV-cy10 and AAV-rh10, AAV-rh74, AAV-dj, Anc80, LK03, AAV2i8, porcine AAV serotypes such as AAVpo4 and AAVpo6, and tyrosine, lysine and serine coats of AAV serotypes Shell mutants, etc. Additionally, other non-natural engineered variants and chimeric AAVs may also be useful.

AAV病毒可以使用常规的分子生物学技术进行工程化改造，使得可以优化这些粒子用于核酸序列的细胞特异性递送，用于最小化免疫原性，用于调节稳定性和粒子寿命，用于高效降解，用于精确递送到核。AAV viruses can be engineered using conventional molecular biology techniques, allowing these particles to be optimized for cell-specific delivery of nucleic acid sequences, for minimizing immunogenicity, for modulating stability and particle lifetime, for high efficiency Degradation for precise delivery to the nucleus.

用于组装成载体的理想的AAV片段包括cap蛋白，包括vp1、vp2、vp3和高变区，rep蛋白，包括rep 78、rep 68、rep 52和rep 40，以及编码这些蛋白质的序列。这些片段可以被容易地使用在各种不同的载体系统和宿主细胞中。Ideal AAV fragments for assembly into vectors include the cap protein, including vp1, vp2, vp3, and the hypervariable region, the rep protein, including rep 78, rep 68, rep 52, and rep 40, and the sequences encoding these proteins. These fragments can be readily used in a variety of different vector systems and host cells.

缺少Rep蛋白的基于AAV的重组载体以低效能整合到宿主的基因组中，并主要作为可以在靶细胞中存留数年的稳定的环状游离体存在。AAV-based recombinant vectors lacking the Rep protein integrate into the host genome with low efficiency and exist primarily as stable circular episomes that can persist in target cells for years.

除了使用AAV天然血清型之外，在本发明的情形中也可以使用人工AAV血清型，包括但不限于具有非天然存在的衣壳蛋白的AAV。这种人工衣壳可以通过任何适合的技术，使用所选的AAV序列(例如vp1衣壳蛋白的片段)与可以从不同的所选AAV血清型、同一AAV血清型的非毗连部分、非AAV病毒来源或非病毒来源获得的异源序列相组合来产生。人工AAV血清型可以是但不限于嵌合AAV衣壳、重组AAV衣壳或“人源化”AAV衣壳。In addition to using native serotypes of AAV, artificial AAV serotypes may also be used in the context of the present invention, including but not limited to AAV with non-naturally occurring capsid proteins. Such artificial capsids can be produced by any suitable technique using selected AAV sequences (e.g. fragments of the vp1 capsid protein) and can be obtained from different selected AAV serotypes, non-contiguous portions of the same AAV serotype, non-AAV viruses Produced by combining heterologous sequences obtained from viral or non-viral sources. Artificial AAV serotypes may be, but are not limited to, chimeric AAV capsids, recombinant AAV capsids, or "humanized" AAV capsids.

因此，本发明涉及包含本发明的核酸分子或构建物的AAV载体。在本发明的情形中，所述AAV载体包含能够转导目标靶细胞、特别是肝细胞的AAV衣壳。根据特定实施方式，所述AAV载体具有AAV-1、AAV-2、AAV-2变体(例如包含具有Y44+500+730F+T491V改变的工程化衣壳的四重突变的衣壳优化的AAV-2，其公开在Ling等，2016Jul 18,Hum Gene TherMethods.[Epub ahead of print]中)、AAV-3和AAV-3变体(例如包含具有两个氨基酸变化S663V+T492V的工程化AAV3衣壳的AAV3-ST变体，其公开在Vercauteren等，2016,Mol.Ther.Vol.24(6),p.1042中)、AAV-3B和AAV-3B变体、AAV-4、AAV-5、AAV-6和AAV-6变体(例如包含三重突变的AAV6衣壳Y731F/Y705F/T492V形式的AAV6变体，其公开在Rosario等，2016,Mol Ther Methods Clin Dev.3,p.16026中)、AAV-7、AAV-8、AAV-9、AAV-10例如AAV-cy10和AAV-rh10、AAV-rh74、AAV-dj、Anc80、LK03、AAV2i8、猪AAV例如AAVpo4和AAVpo6，以及AAV血清型的酪氨酸、赖氨酸和丝氨酸衣壳突变体等的血清型。在特定实施方式中，所述AAV载体具有AAV8、AAV9、AAVrh74或AAV2i8血清型(即所述AAV载体具有AAV8、AAV9、AAVrh74或AAV2i8血清型的衣壳)。在另一个特定实施方式中，所述AAV载体是假型载体，即它的基因组和衣壳源自于不同血清型的AAV。例如，所述假型AAV载体可以是其基因组源自于上文提到的血清型之一，并且衣壳源自于另一种血清型的载体。例如，所述假型载体的基因组可以具有源自于AAV8、AAV9、AAVrh74或AAV2i8血清型的衣壳，并且它的基因组可以源自于不同的血清型。在特定实施方式中，所述AAV载体具有AAV8、AAV9或AAVrh74血清型，特别是AAV8或AAV9血清型，更特别是AAV8血清型的衣壳。Accordingly, the present invention relates to AAV vectors comprising the nucleic acid molecules or constructs of the invention. In the context of the present invention, the AAV vector contains an AAV capsid capable of transducing target cells of interest, in particular hepatocytes. According to a specific embodiment, the AAV vector has AAV-1, AAV-2, AAV-2 variants (e.g., a capsid-optimized AAV comprising a quadruple mutation of an engineered capsid with Y44+500+730F+T491V changes -2, which is disclosed in Ling et al., 2016Jul 18, Hum Gene TherMethods. [Epub ahead of print]), AAV-3 and AAV-3 variants (e.g., an engineered AAV3 coat containing two amino acid changes S663V+T492V AAV3-ST variant of the shell, which is disclosed in Vercauteren et al., 2016, Mol.Ther.Vol.24(6), p.1042), AAV-3B and AAV-3B variants, AAV-4, AAV-5 , AAV-6, and AAV-6 variants (e.g., AAV6 variants containing triple mutated AAV6 capsid Y731F/Y705F/T492V forms, disclosed in Rosario et al., 2016, Mol Ther Methods Clin Dev. 3, p. 16026 ), AAV-7, AAV-8, AAV-9, AAV-10 such as AAV-cy10 and AAV-rh10, AAV-rh74, AAV-dj, Anc80, LK03, AAV2i8, porcine AAV such as AAVpo4 and AAVpo6, and AAV sera Serotypes of tyrosine, lysine, and serine capsid mutants. In specific embodiments, the AAV vector is of the AAV8, AAV9, AAVrh74, or AAV2i8 serotype (i.e., the AAV vector has a capsid of the AAV8, AAV9, AAVrh74, or AAV2i8 serotype). In another specific embodiment, the AAV vector is a pseudotyped vector, ie, its genome and capsid are derived from a different serotype of AAV. For example, the pseudotyped AAV vector may be one in which the genome is derived from one of the serotypes mentioned above and the capsid is derived from another serotype. For example, the genome of the pseudotyped vector can have a capsid derived from the AAV8, AAV9, AAVrh74 or AAV2i8 serotypes, and its genome can be derived from a different serotype. In a specific embodiment, the AAV vector has a capsid of the AAV8, AAV9 or AAVrh74 serotype, particularly the AAV8 or AAV9 serotype, more particularly the AAV8 serotype.

在特定实施方式中，在所述载体被用于将转入基因递送到肌细胞的情形中，所述AAV载体可以选自AAV8、AAV9和AAVrh74等。In specific embodiments, where the vector is used to deliver transgenes to muscle cells, the AAV vector may be selected from the group consisting of AAV8, AAV9, AAVrh74, and the like.

在另一个特定实施方式中，在所述载体被用于将转入基因递送到肝细胞的情形中，所述AAV载体可以选自AAV5、AAV8、AAV9、AAV-LK03、AAV-Anc80和AAV3B等。In another specific embodiment, where the vector is used to deliver transgenes to hepatocytes, the AAV vector may be selected from the group consisting of AAV5, AAV8, AAV9, AAV-LK03, AAV-Anc80, AAV3B, etc. .

在另一个实施方式中，所述衣壳是修饰的衣壳。在本发明的情形中，“修饰的衣壳”可以是嵌合衣壳或包含源自于一种或多种野生型AAV VP衣壳蛋白的一种或多种变体VP衣壳蛋白的衣壳。In another embodiment, the capsid is a modified capsid. In the context of the present invention, a "modified capsid" may be a chimeric capsid or a capsid comprising one or more variant VP capsid proteins derived from one or more wild-type AAV VP capsid proteins. shell.

在特定实施方式中，所述AAV载体是嵌合载体，即它的衣壳包含源自于至少两种不同AAV血清型的VP衣壳蛋白，或包含至少一种将源自于至少两种AAV血清型的VP蛋白区域或结构域组合的嵌合VP蛋白。可用于转导肝细胞的这种嵌合AAV载体的实例描述在Shen等，Molecular Therapy,2007和Tenney等，Virology,2014中。例如，嵌合AAV载体可以源自于AAV8衣壳序列与不同于AAV8血清型的AAV血清型例如上文具体提到的任一种AAV血清型的序列的组合。在另一个实施方式中，所述AAV载体的衣壳包含一种或多种变体VP衣壳蛋白，例如在WO2015013313中所公开的，特别是表现出高的肝趋向性的RHM4-1、RHM15-1、RHM15-2、RHM15-3/RHM15-5、RHM15-4和RHM15-6衣壳变体。In a specific embodiment, the AAV vector is a chimeric vector, i.e., its capsid contains VP capsid proteins derived from at least two different AAV serotypes, or contains at least one VP capsid protein that would be derived from at least two AAV serotypes. Chimeric VP proteins based on VP protein regions or domain combinations of serotypes. Examples of such chimeric AAV vectors that can be used to transduce hepatocytes are described in Shen et al., Molecular Therapy, 2007 and Tenney et al., Virology, 2014. For example, a chimeric AAV vector may be derived from a combination of AAV8 capsid sequences and sequences from an AAV serotype that is different from the AAV8 serotype, such as any of the AAV serotypes specifically mentioned above. In another embodiment, the capsid of the AAV vector comprises one or more variant VP capsid proteins, such as those disclosed in WO2015013313, in particular RHM4-1, RHM15 which exhibit high liver tropism -1, RHM15-2, RHM15-3/RHM15-5, RHM15-4 and RHM15-6 capsid variants.

在另一个实施方式中，所述修饰的衣壳也可源自于通过易错PCR和/或肽插入(例如在Bartel等，2011中所述)插入的衣壳修饰。此外，衣壳变体可以包括单氨基酸变化例如酪氨酸突变体(例如在Zhong等，2008中所述)。In another embodiment, the modified capsids may also be derived from capsid modifications inserted by error-prone PCR and/or peptide insertion (eg as described in Bartel et al., 2011). Additionally, capsid variants may include single amino acid changes such as tyrosine mutants (eg as described in Zhong et al., 2008).

此外，所述AAV载体的基因组可以是单链或自身互补的双链基因组(McCarty等，Gene Therapy，2003)。自身互补的双链AAV载体通过从AAV末端重复序列之一中缺失掉末端解链位点(trs)来产生。这些其复制的基因组是野生型AAV基因组长度的一半的修饰的载体，具有包装DNA二聚体的倾向性。在优选实施方式中，在本发明的实践中使用的AAV载体具有单链基因组，并且更优选地包含AAV8、AAV9、AAVrh74或AAV2i8衣壳，特别是AAV8、AAV9或AAVrh74衣壳例如AAV8或AAV9衣壳，更特别是AAV8衣壳。Furthermore, the genome of the AAV vector may be single-stranded or self-complementary, double-stranded (McCarty et al., Gene Therapy, 2003). Self-complementary double-stranded AAV vectors are generated by deleting the terminal melting site (trs) from one of the AAV terminal repeats. These modified vectors, whose replicated genomes are half the length of wild-type AAV genomes, have a tendency to package DNA dimers. In preferred embodiments, the AAV vectors used in the practice of the invention have a single-stranded genome, and more preferably comprise an AAV8, AAV9, AAVrh74 or AAV2i8 capsid, particularly an AAV8, AAV9 or AAVrh74 capsid such as an AAV8 or AAV9 capsid. capsids, more specifically AAV8 capsids.

在特别优选的实施方式中，本发明涉及一种AAV载体，其在单链或双链的自身互补的基因组(例如单链基因组)中包含本发明的核酸构建物。在一个实施方式中，所述AAV载体包含AAV8、AAV9、AAVrh74或AAV2i8衣壳，特别是AAV8、AAV9或AAVrh74衣壳例如AAV8或AAV9衣壳，更特别是AAV8衣壳。在另一个特定实施方式中，所述核酸被可操作连接到启动子，特别是遍在或肝特异性启动子。根据特定变体的实施方式，所述启动子是遍在启动子，例如巨细胞病毒增强子/鸡β-肌动蛋白(CAG)启动子、巨细胞病毒增强子/启动子(CMV)、PGK启动子和SV40早期启动子。在特定变体中，所述遍在启动子是CAG启动子。根据另一种变体，所述启动子是肝特异性启动子，例如α-1抗胰蛋白酶启动子(hAAT)、甲状腺素运载蛋白启动子、白蛋白启动子和甲状腺素结合性球蛋白(TBG)启动子。在特定变体中，所述肝特异性启动子是SEQ ID NO：14的hAAT肝特异性启动子。在另一个特定实施方式中，包含在本发明的AAV载体的基因组中的核酸构建物还包含如上所述的内含子，例如置于所述启动子与编码GAA编码序列(即本发明的优化的GAA编码序列、本发明的嵌合GAA编码序列或本发明的嵌合且优化的GAA编码序列)的核酸序列之间的内含子。可以包含在引入到所述AAV载体中的核酸构建物内的代表性内含子包括但不限于人类β球蛋白b2(或HBB2)内含子、FIX内含子和鸡β-球蛋白内含子。所述AAV载体的基因组中的所述内含子可以是经典(或未修饰的)内含子或被设计以减少所述内含子中的可选开放阅读框(ARF)的数目或甚至完全除去ARF的修饰的内含子。可以在这种将本发明的核酸引入到AAV载体内的实施方式的实践中使用的修饰和未修饰的内含子，已在上文充分描述。在特定实施方式中，本发明的AAV载体，特别是包含AAV8、AAV9、AAVrh74或AAV2i8衣壳，特别是AAV8、AAV9或AAVrh74衣壳例如AAV8或AAV9衣壳，更特别是AAV8衣壳的AAV载体，在其基因组内包括修饰(或优化的)内含子，例如SEQ ID NO：17的修饰的HBB2内含子、SEQ ID NO：19的修饰的FIX内含子和SEQ ID NO：21的修饰的鸡β-球蛋白内含子。在另一个特定实施方式中，本发明的载体是包含AAV8、AAV9、AAVrh74或AAV2i8衣壳、特别是AAV8、AAV9或AAVrh74衣壳例如AAV8或AAV9衣壳，更特别是AAV8衣壳的AAV载体，其包含的基因组以5’至3’方向含有：AAV 5'-ITR(例如AAV2 5’-ITR)，ApoE控制区，hAAT-肝特异性启动子，HBB2内含子(特别是如上所定义的修饰的HBB2内含子)，本发明的GAA编码序列，牛生长激素多腺苷化信号和AAV 3'-ITR(例如AAV2 3'-ITR)，例如所述基因组包含在SEQ ID NO：22至26和SEQ ID NO：37至47中示出的核酸构建物，并在侧翼带有AAV 5'-ITR(例如AAV2 5’-ITR)和AAV 3'-ITR(例如AAV2 3'-ITR)。在本发明的实践中有用的其他核酸构建物包括上文描述的那些，包括：In a particularly preferred embodiment, the invention relates to an AAV vector comprising a nucleic acid construct of the invention in a single- or double-stranded self-complementary genome (eg a single-stranded genome). In one embodiment, the AAV vector comprises an AAV8, AAV9, AAVrh74 or AAV2i8 capsid, particularly an AAV8, AAV9 or AAVrh74 capsid such as an AAV8 or AAV9 capsid, more particularly an AAV8 capsid. In another specific embodiment, the nucleic acid is operably linked to a promoter, in particular a ubiquitous or liver-specific promoter. According to specific variant embodiments, the promoter is a ubiquitous promoter, such as cytomegalovirus enhancer/chicken beta-actin (CAG) promoter, cytomegalovirus enhancer/promoter (CMV), PGK promoter and SV40 early promoter. In certain variants, the ubiquitous promoter is the CAG promoter. According to another variant, said promoter is a liver-specific promoter, such as alpha-1 antitrypsin promoter (hAAT), transthyretin promoter, albumin promoter and thyroxine-binding globulin ( TBG) promoter. In certain variants, the liver-specific promoter is the hAAT liver-specific promoter of SEQ ID NO:14. In another specific embodiment, the nucleic acid construct contained in the genome of the AAV vector of the present invention also contains an intron as described above, for example, placed between the promoter and the encoding GAA coding sequence (i.e., the optimization of the present invention The intron between the nucleic acid sequence of the GAA coding sequence, the chimeric GAA coding sequence of the invention or the chimeric and optimized GAA coding sequence of the invention). Representative introns that may be included within nucleic acid constructs introduced into the AAV vector include, but are not limited to, human beta-globin b2 (or HBB2) intron, FIX intron, and chicken beta-globin intron. son. The introns in the genome of the AAV vector may be classical (or unmodified) introns or designed to reduce the number of alternative open reading frames (ARFs) in the introns or even completely Removing the modified intron of ARF. Modified and unmodified introns that may be used in the practice of this embodiment of introducing the nucleic acid of the invention into an AAV vector have been fully described above. In particular embodiments, the AAV vectors of the invention, in particular AAV vectors comprising AAV8, AAV9, AAVrh74 or AAV2i8 capsids, in particular AAV8, AAV9 or AAVrh74 capsids such as AAV8 or AAV9 capsids, more particularly AAV8 capsids , including modified (or optimized) introns in its genome, such as the modified HBB2 intron of SEQ ID NO: 17, the modified FIX intron of SEQ ID NO: 19 and the modification of SEQ ID NO: 21 The chicken β-globin intron. In another specific embodiment, the vector of the invention is an AAV vector comprising an AAV8, AAV9, AAVrh74 or AAV2i8 capsid, in particular an AAV8, AAV9 or AAVrh74 capsid such as an AAV8 or AAV9 capsid, more particularly an AAV8 capsid, It contains a genome containing in 5' to 3' direction: AAV 5'-ITR (e.g. AAV2 5'-ITR), ApoE control region, hAAT-liver-specific promoter, HBB2 intron (in particular as defined above Modified HBB2 intron), the GAA coding sequence of the invention, the bovine growth hormone polyadenylation signal and the AAV 3'-ITR (e.g., AAV2 3'-ITR), for example, the genome is contained in SEQ ID NO: 22 to 26 and the nucleic acid constructs shown in SEQ ID NOs: 37 to 47, and flanked by an AAV 5'-ITR (eg, AAV2 5'-ITR) and an AAV 3'-ITR (eg, AAV2 3'-ITR). Other nucleic acid constructs useful in the practice of the invention include those described above, including:

-编码源自于SEQ ID NO：1的亲本hGAA的GAA的Δ42截短形式(源自于SEQ ID NO：12的优化序列的核苷酸序列)并编码SEQ ID NO：4或6的信号肽的优化的核苷酸序列；- encoding a Δ42 truncated form of GAA derived from the parental hGAA of SEQ ID NO: 1 (nucleotide sequence derived from the optimized sequence of SEQ ID NO: 12) and encoding the signal peptide of SEQ ID NO: 4 or 6 The optimized nucleotide sequence;

在本发明的特定实施方式中，本发明的核酸构建物包含如上所定义的肝特异性启动子，并且所述载体是如上所述的能够转导肝组织或细胞的病毒载体。发明人在下文展示的数据显示，得益于这个实施方式开发了高效且优化的载体以在肝细胞中表达高度可分泌形式的GAA并诱导对所述蛋白质的免疫耐受，肝的促耐受原和代谢性质被有利地实现。In a specific embodiment of the invention, the nucleic acid construct of the invention comprises a liver-specific promoter as defined above, and the vector is a viral vector capable of transducing liver tissue or cells as described above. The inventors present data below showing that thanks to this embodiment a highly efficient and optimized vector was developed to express a highly secretable form of GAA in hepatocytes and induce immune tolerance to the protein, pro-tolerance of the liver Protochemical and metabolic properties are advantageously achieved.

此外，在另一个特定实施方式中，本发明提供了两种载体例如两种病毒载体、特别是两种AAV载体的组合，用于在目标细胞中改进基因递送和治疗功效。例如，所述两种载体可以携带编码本发明的GAA蛋白的本发明的核酸分子，并且其在这两种载体的每一种中，在一个不同的启动子控制之下。在特定实施方式中，一种载体包含作为肝特异性启动子(如上文描述的之一)的启动子，另一种载体包含对用于治疗糖原贮积病的另一种目标组织具有特异性的启动子例如肌肉特异性启动子，例如肌间线蛋白启动子。在这个实施方式的特定变化形式中，这种载体组合对应于如WO2015196179中所述产生的多种共包装AAV载体。Furthermore, in another specific embodiment, the present invention provides a combination of two vectors, such as two viral vectors, in particular two AAV vectors, for improved gene delivery and therapeutic efficacy in target cells. For example, the two vectors may carry a nucleic acid molecule of the invention encoding a GAA protein of the invention and under the control of a different promoter in each of the two vectors. In a specific embodiment, one vector contains a promoter that is a liver-specific promoter (such as one described above) and the other vector contains a promoter specific for another target tissue for the treatment of glycogen storage disease. Specific promoters include muscle-specific promoters, such as the desmin promoter. In a specific variation of this embodiment, this combination of vectors corresponds to multiple co-packaged AAV vectors generated as described in WO2015196179.

本发明还涉及用本发明的核酸分子或构建物转化的细胞例如肝细胞，这与用于离体基因疗法的情况相同。本发明的细胞可以通过任何适合的给药途径递送到需要它们的对象例如缺乏GAA的患者，例如通过注射到所述对象的肝脏中或血流中。在特定实施方式中，本发明包括将本发明的核酸分子、核酸构建物或载体、特别是慢病毒载体引入到肝细胞中，特别是待治疗对象的肝细胞中，并将其中引入有所述核酸的转化的肝细胞给药到所述对象。有利的是，这个实施方式对从所述细胞分泌GAA有用。在特定实施方式中，所述肝细胞是来自于待治疗患者的肝细胞，或者是肝干细胞，其被进一步转化并在体外分化成肝细胞，用于随后给药到所述患者。The invention also relates to cells, such as hepatocytes, transformed with the nucleic acid molecules or constructs of the invention, as is the case for ex vivo gene therapy. The cells of the invention may be delivered to a subject in need thereof, such as a GAA-deficient patient, by any suitable route of administration, for example by injection into the subject's liver or bloodstream. In a specific embodiment, the present invention includes introducing the nucleic acid molecule, nucleic acid construct or vector of the invention, especially the lentiviral vector, into hepatocytes, especially the hepatocytes of the subject to be treated, and introducing therein said The transformed hepatocytes of the nucleic acid are administered to the subject. Advantageously, this embodiment is useful for secreting GAA from said cells. In specific embodiments, the hepatocytes are hepatocytes from the patient to be treated, or are liver stem cells that are further transformed and differentiated into hepatocytes in vitro for subsequent administration to the patient.

本发明还涉及一种转基因非人类动物，在其基因组中包含编码本发明的GAA多肽的核酸分子或构建物。在特定实施方式中，所述动物是小鼠。The invention also relates to a transgenic non-human animal comprising in its genome a nucleic acid molecule or construct encoding a GAA polypeptide of the invention. In specific embodiments, the animal is a mouse.

除了在下文实施例中呈现的特定递送系统之外，各种不同的递送系统是已知的，并且可用于给药本发明的核酸分子或构建物，例如包封在脂质体、微粒、微胶囊、能够表达本发明的编码序列的重组细胞中，受体介导的胞吞作用，构建治疗性核酸作为反转录病毒或其他载体的一部分等。In addition to the specific delivery systems presented in the examples below, a variety of different delivery systems are known and can be used to administer the nucleic acid molecules or constructs of the invention, e.g. encapsulated in liposomes, microparticles, microparticles Capsules, recombinant cells capable of expressing the coding sequences of the invention, receptor-mediated endocytosis, constructing therapeutic nucleic acids as part of a retrovirus or other vector, etc.

根据一个实施方式，可能希望将本发明的GAA多肽、核酸分子、核酸构建物或细胞通过任何适合的途径引入到所述对象的肝脏中。除了裸露的DNA之外，可以将例如微环和转座子用于慢病毒载体的递送。此外，基因编辑技术例如锌指核酸酶、巨核酸酶(meganuclease)、TALEN和CRISPR，也可用于递送本发明的编码序列。According to one embodiment, it may be desired to introduce a GAA polypeptide, nucleic acid molecule, nucleic acid construct or cell of the invention into the liver of the subject via any suitable route. In addition to naked DNA, minicircles and transposons, for example, can be used for lentiviral vector delivery. In addition, gene editing technologies such as zinc finger nucleases, meganucleases, TALENs and CRISPR can also be used to deliver the coding sequences of the invention.

本发明还提供了包含本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞的药物组合物。这些组合物包含治疗有效量的所述治疗剂(本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞)和可药用载体。在特定实施方式中，术语“可药用的”意味着由联邦或州政府的监管机构批准或列于美国或欧洲药典或其他公认的药典中，用于在动物和人类中使用的。术语“载体”是指稀释剂、佐剂、赋形剂或介质，所述治疗剂与它们一起给药。这些药用载体可以是无菌液体例如水和油，包括石油、动物、植物或合成来源的油，例如花生油、大豆油、矿物油、芝麻油等。当所述药物组合物被静脉内给药时，水是优选的载体。盐水溶液和右旋糖和甘油水溶液也可用作液体载体，特别是对于可注射溶液。适合的药物赋形剂包括淀粉、葡萄糖、乳糖、蔗糖、硬脂酸钠、单硬脂酸甘油酯、滑石、氯化钠、脱脂奶粉、甘油、丙二醇、水、乙醇等。The invention also provides pharmaceutical compositions comprising the nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the invention. These compositions comprise a therapeutically effective amount of the therapeutic agent (nucleic acid molecule, nucleic acid construct, vector, GAA polypeptide or cell of the invention) and a pharmaceutically acceptable carrier. In certain embodiments, the term "pharmaceutically acceptable" means approved by a regulatory agency of the federal or state government or listed in the United States or European Pharmacopoeia or other recognized pharmacopeia for use in animals and humans. The term "carrier" refers to a diluent, adjuvant, excipient, or medium with which the therapeutic agent is administered. These pharmaceutical carriers can be sterile liquids such as water and oils, including oils of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil, and the like. When the pharmaceutical composition is administered intravenously, water is the preferred carrier. Saline solutions and aqueous dextrose and glycerol solutions may also be used as liquid carriers, particularly for injectable solutions. Suitable pharmaceutical excipients include starch, glucose, lactose, sucrose, sodium stearate, glyceryl monostearate, talc, sodium chloride, skimmed milk powder, glycerol, propylene glycol, water, ethanol, etc.

如果需要，所述组合物还可以含有少量润湿剂或乳化剂或pH缓冲剂。这些组合物可以采取溶液、悬液、乳液、片剂、丸剂、胶囊、粉剂、缓释剂等的形式。口服配方可以包含标准的载体例如制药级甘露糖醇、乳糖、淀粉、硬脂酸镁、糖精钠、纤维素、碳酸镁等。适合的药物载体的实例描述在E.W.Martin的《Remington制药学》(Remington's PharmaceuticalSciences)中。这些组合物应该含有治疗有效量的所述治疗剂、优选是采取纯化形式的治疗剂，以及适合量的载体以便提供适合于给药到所述对象的形式。在特定实施方式中，本发明的核酸、载体或细胞被配制在包含磷酸盐缓冲盐水并增补有0.25％人血清白蛋白的组合物中。在另一个特定实施方式中，本发明的核酸、载体或细胞被配制在包含林格乳酸盐溶液和以总组合物的重量计终浓度为0.01-0.0001％、例如浓度为0.001％的非离子型表面活性剂例如pluronic F68的组合物中。所述组合物还可以包含血清白蛋白，特别是人血清白蛋白，例如0.25％的人血清白蛋白。其他适用于储存或给药的配方在本领域中是已知的，特别是从WO 2005/118792或Allay等，2011。If desired, the compositions may also contain minor amounts of wetting or emulsifying agents or pH buffering agents. These compositions may take the form of solutions, suspensions, emulsions, tablets, pills, capsules, powders, sustained release formulations, and the like. Oral formulations may contain standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharin, cellulose, magnesium carbonate, and the like. Examples of suitable pharmaceutical carriers are described in Remington's Pharmaceutical Sciences by E.W. Martin. These compositions should contain a therapeutically effective amount of the therapeutic agent, preferably in purified form, and a suitable amount of carrier so as to provide a form suitable for administration to the subject. In specific embodiments, the nucleic acids, vectors or cells of the invention are formulated in a composition comprising phosphate buffered saline supplemented with 0.25% human serum albumin. In another specific embodiment, the nucleic acids, vectors or cells of the invention are formulated in a solution containing Ringer's lactate solution and a final concentration of non-ionic acid at a final concentration of 0.01-0.0001%, e.g. a concentration of 0.001% by weight of the total composition. surfactant such as pluronic F68. The composition may also comprise serum albumin, in particular human serum albumin, for example 0.25% human serum albumin. Other formulations suitable for storage or administration are known in the art, inter alia from WO 2005/118792 or Allay et al., 2011.

在优选实施方式中，所述组合物按照常规程序配制成适合于静脉内给药到人类的药物组合物。通常，用于静脉内给药的组合物是在无菌等渗水性缓冲液中的溶液。在必要时，所述组合物还可以包含增溶剂和局部麻醉剂例如利诺卡因，以缓解注射位点处的疼痛。In preferred embodiments, the composition is formulated according to conventional procedures into a pharmaceutical composition suitable for intravenous administration to humans. Typically, compositions for intravenous administration are solutions in sterile isotonic aqueous buffer. Where necessary, the composition may also contain solubilizers and local anesthetics such as lignocaine to relieve pain at the injection site.

在一个实施方式中，本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞可以在囊泡、特别是脂质体中递送。在另一个实施方式中，本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞可以在受控释放系统中递送。In one embodiment, the nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the invention can be delivered in vesicles, especially liposomes. In another embodiment, the nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the invention can be delivered in a controlled release system.

本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞的给药方法包括但不限于真皮内、肌肉内、腹膜内、静脉内、皮下、鼻内、硬膜外和口服途径。在特定实施方式中，所述给药通过静脉内或肌肉内途径。本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞，不论是否被载体化，都可以通过任何方便的途径给药，例如通过输注或快速浓注，通过经上皮或粘膜衬(例如口腔粘膜、直肠和肠粘膜等)的吸收，并且可以与其他生物活性药剂一起给药。给药可以是系统性或局部的。Methods of administration of the nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the invention include, but are not limited to, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural and oral routes. In specific embodiments, the administration is by intravenous or intramuscular route. The nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the invention, whether vectored or not, may be administered by any convenient route, such as by infusion or bolus injection, by transepithelial or mucosal linings (e.g. oral mucosa, rectal and intestinal mucosa, etc.) and can be administered together with other bioactive agents. Administration may be systemic or local.

在特定实施方式中，可能希望将本发明的药物组合物局部给药到需要治疗的区域，例如肝脏。这可以例如利用植入物来实现，所述植入物是多孔、无孔或胶状材料，包括膜例如硅橡胶膜或纤维。In certain embodiments, it may be desirable to administer pharmaceutical compositions of the present invention topically to the area in need of treatment, such as the liver. This can be achieved, for example, using implants that are porous, non-porous or gel-like materials, including membranes such as silicone rubber membranes or fibers.

在糖原贮积病的治疗中有效的本发明的治疗剂(即本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞)的量可以通过标准的临床技术来确定。此外，可以任选地使用体内和/或体外测定法来帮助预测最适剂量范围。在所述制剂中使用的精确剂量也取决于给药途径和疾病的严重性，并且应该根据执业医师的判断和每位患者的情况来决定。本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞的给药到需要它们的对象的剂量将随着几种因素而变，包括但不限于给药途径、治疗的具体疾病、对象的年龄或实现治疗效果所必需的表达水平。本领域技术人员可以在本领域知识的基础上，容易地根据这些因素和其他因素确定所需的剂量范围。在治疗包括向对象给药病毒载体例如AAV载体的情况下，所述载体的典型剂量为至少1x10⁸个载体基因组每千克体重(vg/kg)，例如至少1x10⁹vg/kg、至少1x10¹⁰vg/kg、至少1x10¹¹vg/kg、至少1x10¹²vg/kg、至少1x10¹³vg/kg或至少1x10¹⁴vg/kg。The amount of a therapeutic agent of the invention (ie, a nucleic acid molecule, nucleic acid construct, vector, GAA polypeptide or cell of the invention) that is effective in the treatment of glycogen storage disease can be determined by standard clinical techniques. In addition, in vivo and/or in vitro assays may optionally be used to aid in predicting optimal dosage ranges. The precise dosage used in the formulations will also depend on the route of administration and the severity of the disease, and should be determined according to the judgment of the practitioner and the circumstances of each patient. The dosage of the nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the invention administered to a subject in need thereof will vary depending on several factors, including, but not limited to, the route of administration, the specific disease being treated, the condition of the subject age or the level of expression necessary to achieve a therapeutic effect. Those skilled in the art can readily determine the desired dosage range based on these and other factors based on knowledge in the art. Where the treatment involves administering to the subject a viral vector, such as an AAV vector, a typical dosage of the vector is at least ^1x10 vector genomes per kilogram of body weight (vg/kg), for example at least ^1x10 vg/kg, at least ^1x10 vg /kg, at least 1x10 ¹¹ vg/kg, at least 1x10 ¹² vg/kg, at least 1x10 ¹³ vg/kg or at least 1x10 ¹⁴ vg/kg.

本发明还涉及一种用于治疗糖原贮积病的方法，所述方法包括向需要的对象递送治疗有效量的本发明的核酸、载体、GAA多肽、药物组合物或细胞的步骤。The invention also relates to a method for treating glycogen storage disease, said method comprising the step of delivering a therapeutically effective amount of a nucleic acid, vector, GAA polypeptide, pharmaceutical composition or cell of the invention to a subject in need thereof.

本发明还涉及一种用于治疗糖原贮积病的方法，所述方法不诱导针对所述转入基因(即针对本发明的GAA多肽)的免疫应答，或诱导针对所述转入基因的降低的免疫应答，所述方法包括向需要的对象递送治疗有效量的本发明的核酸分子、核酸构建物、载体、药物组合物或细胞的步骤。本发明还涉及一种用于治疗糖原贮积病的方法，所述方法包括向需要的对象重复给药治疗有效量的本发明的核酸分子、核酸构建物、载体、药物组合物或细胞。在这种情况下，本发明的核酸分子或核酸构建物包含在肝细胞中有功能的启动子，从而允许从其产生针对所表达的GAA多肽的免疫耐受性。同样地，在这种情况下，在这种情形中使用的药物组合物包含的核酸分子或核酸构建物包含在肝细胞中有功能的启动子。在递送肝细胞的情况下，所述细胞可以是以前从所述需要治疗的对象收集并通过在其中引入本发明的核酸分子或核酸构建物进行工程化改造从而使它们能够生产本发明的GAA多肽的细胞。根据一个实施方式，在包含重复给药的情况下，所述给药可以重复至少一次或更多次，并且甚至可以被认为按照定期时间表进行，例如每周、每月或每年一次。所述定期时间表也可以包括每2、3、4、5、6、7、8、9或10年或超过10年给药一次。在另一个特定实施方式中，本发明的病毒载体的每次给药，对于每次连续的给药来说使用不同的病毒来进行，从而避免由于针对以前给药的病毒载体的可能免疫应答而造成的效能降低。例如，第一次给药可以使用包含AAV8衣壳的病毒载体，然后给药包含AAV9衣壳的载体或甚至给药与AAV无关的病毒例如反转录病毒或慢病毒载体。The present invention also relates to a method for treating glycogen storage disease, which method does not induce an immune response against the transferred gene (ie, against the GAA polypeptide of the invention), or induces an immune response against the transferred gene. Reduced immune response, the method comprising the step of delivering a therapeutically effective amount of a nucleic acid molecule, nucleic acid construct, vector, pharmaceutical composition or cell of the invention to a subject in need thereof. The present invention also relates to a method for treating glycogen storage disease, said method comprising repeatedly administering to a subject in need thereof a therapeutically effective amount of a nucleic acid molecule, nucleic acid construct, vector, pharmaceutical composition or cell of the invention. In this case, the nucleic acid molecule or nucleic acid construct of the invention contains a promoter that is functional in hepatocytes, thereby allowing immune tolerance to the expressed GAA polypeptide to be generated therefrom. Likewise, in this case, the pharmaceutical composition used in this case contains a nucleic acid molecule or a nucleic acid construct that contains a promoter that is functional in hepatocytes. In the case of delivering hepatocytes, the cells may have been previously collected from the subject in need of treatment and engineered by introducing therein a nucleic acid molecule or nucleic acid construct of the invention such that they are capable of producing a GAA polypeptide of the invention. Cell. According to one embodiment, where repeated administration is involved, said administration may be repeated at least once or more, and may even be considered to occur on a regular schedule, such as weekly, monthly or yearly. The regular schedule may also include dosing every 2, 3, 4, 5, 6, 7, 8, 9 or 10 years or more. In another specific embodiment, each administration of a viral vector of the invention is performed using a different virus for each successive administration, thereby avoiding possible immune responses to previously administered viral vectors. resulting in reduced performance. For example, a viral vector containing AAV8 capsids may be administered first, followed by a vector containing AAV9 capsids or even a virus unrelated to AAV such as a retroviral or lentiviral vector.

根据本发明，治疗可以包括治愈、缓解或预防作用。因此，治疗性和预防性治疗包括特定糖原贮积病的症状的改善或阻止或以其他方式降低发生特定糖原贮积病的风险。术语“预防性”可以被认为是降低特定病症的严重性或发生率。“预防性”还包括在以前被诊断为患有特定病症的患者中阻止所述病症的复发。“治疗性”也可以降低现有病症的严重性。术语“治疗”在本文中用于指称可以有益于动物、特别是哺乳动物、更特别是人类对象的任何方案。According to the present invention, treatment may include curative, palliative or preventive effects. Accordingly, therapeutic and preventive treatments include amelioration of symptoms of a particular glycogen storage disease or prevention of or otherwise reducing the risk of developing a particular glycogen storage disease. The term "preventative" may be thought of as reducing the severity or incidence of a particular condition. "Prophylactic" also includes preventing the recurrence of a particular condition in a patient previously diagnosed with that condition. "Therapeutic" can also reduce the severity of existing conditions. The term "treatment" is used herein to refer to any regimen that may benefit an animal, particularly a mammal, and more particularly a human subject.

本发明还涉及一种用于治疗糖原贮积病的离体基因治疗方法，所述方法包括将本发明的核酸分子或核酸构建物引入到需要治疗的患者的分离的细胞例如分离的造血干细胞中，并将所述细胞引入到所述需要治疗的患者中。在这种情况的特定实施方式中，将所述核酸分子或构建物用如上所定义的载体引入到所述细胞中。在特定实施方式中，所述载体是整合型病毒载体。在另一个特定实施方式中，所述病毒载体是反转录病毒载体例如慢病毒载体。例如，在van Til等，2010,Blood,115(26),p.5329中所公开的慢病毒载体可用于本发明的方法的实践中。The present invention also relates to an ex vivo gene therapy method for treating glycogen storage diseases, said method comprising introducing the nucleic acid molecule or nucleic acid construct of the invention into isolated cells, such as isolated hematopoietic stem cells, of a patient in need of treatment. and introducing the cells into the patient in need of treatment. In a specific embodiment of this case, said nucleic acid molecule or construct is introduced into said cell using a vector as defined above. In specific embodiments, the vector is an integrating viral vector. In another specific embodiment, the viral vector is a retroviral vector such as a lentiviral vector. For example, the lentiviral vectors disclosed in van Til et al., 2010, Blood, 115(26), p. 5329 can be used in the practice of the methods of the invention.

本发明还涉及本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞，其用作药物。The invention also relates to a nucleic acid molecule, nucleic acid construct, vector, GAA polypeptide or cell of the invention for use as a medicament.

本发明还涉及本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞，其用于治疗由GAA基因中的突变引起的疾病的方法中，特别是用于治疗庞贝氏病的方法中。本发明还涉及本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞，其用于治疗糖原贮积病的方法中，所述糖原贮积病例如为GSDI(冯·吉尔克氏病)、GSDII(庞贝氏病)、GSDIII(科里氏病)、GSDIV、GSDV、GSDVI、GSDVII、GSDVIII和心脏的致死性先天性糖原贮积病，更特别是GSDI、GSDII或GSDIII，甚至更特别是GSDII和GSDIII，最特别是GSDII。本发明的截短的GAA多肽可以被给药到需要的患者，用于酶替代疗法(ERT)中，例如用于糖原贮积病的酶替代疗法中，所述糖原贮积病例如GSDI(冯·吉尔克氏病)、GSDII(庞贝氏病)、GSDIII(科里氏病)、GSDIV、GSDV、GSDVI、GSDVII、GSDVIII和心脏的致死性先天性糖原贮积病，更特别是GSDI、GSDII或GSDIII，甚至更特别是GSDII和GSDIII，最特别是GSDII。The invention also relates to the nucleic acid molecule, nucleic acid construct, vector, GAA polypeptide or cell of the invention for use in a method for treating diseases caused by mutations in the GAA gene, in particular for use in methods for treating Pompe disease. . The invention also relates to the nucleic acid molecule, nucleic acid construct, vector, GAA polypeptide or cell of the invention for use in a method of treating a glycogen storage disease, such as GSDI (von Gierke's disease), GSDII (Pompe disease), GSDIII (Corey's disease), GSDIV, GSDV, GSDVI, GSDVII, GSDVIII and fatal congenital glycogen storage diseases of the heart, more particularly GSDI, GSDII or GSDIII, Even more specifically GSDII and GSDIII, most specifically GSDII. Truncated GAA polypeptides of the invention may be administered to a patient in need thereof for use in enzyme replacement therapy (ERT), for example for glycogen storage diseases such as GSDI (Von Gierke's disease), GSDII (Pompe's disease), GSDIII (Corrie's disease), GSDIV, GSDV, GSDVI, GSDVII, GSDVIII and fatal congenital glycogen storage diseases of the heart, more particularly GSDI, GSDII or GSDIII, even more particularly GSDII and GSDIII, most particularly GSDII.

本发明还涉及本发明的核酸分子、核酸构建物、载体、GAA多肽或细胞在制造药物中的用途，所述药物可用于治疗糖原贮积病例如GSDI(冯·吉尔克氏病)、GSDII(庞贝氏病)、GSDIII(科里氏病)、GSDIV、GSDV、GSDVI、GSDVII、GSDVIII和心脏的致死性先天性糖原贮积病，更特别是GSDI、GSDII或GSDIII，甚至更特别是GSDII和GSDIII，最特别是GSDII。The present invention also relates to the use of the nucleic acid molecules, nucleic acid constructs, vectors, GAA polypeptides or cells of the present invention in the manufacture of medicaments, which can be used to treat glycogen storage diseases such as GSDI (von Gierke's disease), GSDII (Pompe's disease), GSDIII (Corrie's disease), GSDIV, GSDV, GSDVI, GSDVII, GSDVIII and fatal congenital glycogen storage diseases of the heart, more particularly GSDI, GSDII or GSDIII, even more particularly GSDII and GSDIII, most especially GSDII.

实施例Example

通过参考下面的实验实施例和附图，对本发明进行更详细地描述。提供这些实施例仅仅是出于说明的目的，并且不打算是限制性的。The invention is described in more detail with reference to the following experimental examples and drawings. These examples are provided for illustrative purposes only and are not intended to be limiting.

材料和方法Materials and methods

GAA活性GAA activity

在将冷冻的组织样品在蒸馏水中匀浆后测量GAA活性。称出50-100mg组织并匀浆，然后以10000x g离心20分钟。在96孔板中，使用10μl上清液和20μl底物4MUα-D-葡萄糖苷设置反应。将所述反应混合物在37℃温浴1小时，然后通过添加150μl pH 10.5的碳酸钠缓冲液终止反应。使用EnSpireα读板器(Perkin-Elmer)在449nm(发射)和360nm(激发)下，使用标准曲线(0-2500pmol/μl的4MU)来测量从各个反应混合物释放的荧光4MU。所述澄清的上清液的蛋白质浓度通过BCA(Thermo Fisher Scientific)来定量。为了计算GAA活性，用释放的4MU浓度除以样品的蛋白质浓度，并将活性以nmol/小时/mg蛋白为单位报告。GAA activity was measured after homogenizing frozen tissue samples in distilled water. Weigh out 50-100mg of tissue and homogenize, then centrifuge at 10,000xg for 20 minutes. In a 96-well plate, set up the reaction using 10 μl of supernatant and 20 μl of substrate 4MU α-D-glucoside. The reaction mixture was incubated at 37°C for 1 hour and then stopped by adding 150 μl of sodium carbonate buffer pH 10.5. Fluorescent 4MU released from each reaction mixture was measured using an EnSpire alpha plate reader (Perkin-Elmer) at 449 nm (emission) and 360 nm (excitation) using a standard curve (0-2500 pmol/μl of 4MU). The protein concentration of the clarified supernatant was quantified by BCA (Thermo Fisher Scientific). To calculate GAA activity, divide the released 4MU concentration by the protein concentration of the sample and report the activity in nmol/hour/mg protein.

小鼠研究mouse research

Gaa-/-小鼠通过定向破坏外显子6来产生，并维持在C57BL/6J/129X1/SvJ背景上(Raben N.等，1998)。以0.2ml的体积通过尾静脉递送载体。每月收集血清样品以监测分泌的hGAA的水平。使用PBS注射的患病动物和野生型同窝仔畜作为对照。Gaa-/- mice were generated by targeted disruption of exon 6 and maintained on a C57BL/6J/129X1/SvJ background (Raben N. et al., 1998). The vector was delivered via the tail vein in a volume of 0.2 ml. Serum samples were collected monthly to monitor levels of secreted hGAA. PBS-injected diseased animals and wild-type littermates were used as controls.

NHP研究NHP research

将雄性食蟹猴饲养在不锈钢笼子中并维持在12小时光/暗周期下。在开始研究之前，所有食蟹猴具有<1:5的中和抗体滴度。通过隐静脉输注一剂2E12vg/kg的AAV8-hAAT-sp7-Δ8-hGAAco1。在所述注射之前12天和之后30天通过股静脉获取血液样品。将全血收集在含有EDTA的管中并离心以分离血清。在载体给药后三个月，将所有食蟹猴安乐死。首先将动物用氯胺酮/右旋美托咪啶的混合物麻醉，然后使用IV注射的戊巴比妥钠进行安乐死。立即收集组织并在液氮中冷冻。Male cynomolgus monkeys were housed in stainless steel cages and maintained under a 12-h light/dark cycle. All cynomolgus monkeys had neutralizing antibody titers <1:5 before starting the study. A dose of 2E12 vg/kg of AAV8-hAAT-sp7-Δ8-hGAAco1 was infused via the saphenous vein. Blood samples were obtained via the femoral vein 12 days before and 30 days after the injection. Whole blood was collected in EDTA-containing tubes and centrifuged to separate serum. Three months after vehicle administration, all cynomolgus monkeys were euthanized. The animals were first anesthetized with a ketamine/dexmedetomidine mixture and then euthanized using an IV injection of sodium pentobarbital. Tissues were collected immediately and frozen in liquid nitrogen.

蛋白质印迹分析Western blot analysis

从冷冻的肌肉获得总匀浆物。通过Pierce BCA蛋白质测定法(Thermo FisherScientific)，按照制造商的说明书确定提取物中的蛋白质浓度。蛋白质印迹使用抗hGAA抗体(Abcam)来进行。使用抗微管蛋白抗体(Sigma Aldrich)作为载样对照。Total homogenates were obtained from frozen muscles. Protein concentration in the extracts was determined by Pierce BCA protein assay (Thermo Fisher Scientific) following the manufacturer's instructions. Western blotting was performed using anti-hGAA antibody (Abcam). Anti-tubulin antibody (Sigma Aldrich) was used as a loading control.

结果result

怀着设计具有提高的分泌和降低的免疫原性的GAA的新形式的目的，我们决定产生GAA的截短形式，并任选地将它们与可替选信号肽组合。With the aim of designing new forms of GAA with improved secretion and reduced immunogenicity, we decided to generate truncated forms of GAA and optionally combine them with alternative signal peptides.

SEQ ID NO：2中示出的人类GAA充当设计这些新形式的基础。SEQ ID NO：1对应于SEQ ID NO：2的序列，但不含GAA的相应的天然信号肽(SEQ ID NO：2的1-27位氨基酸)。设计了核酸构建物以编码源自于SEQ ID NO：1的、在N-端末端处截短的GAA多肽。我们从在野生型hGAA编码序列(SEQ ID NO：9，其对应于作为包括信号肽编码序列的野生型hGAA编码序列的SEQ ID NO：8的82-2859位核苷酸)的基础上设计缺失了对应于SEQ ID NO：1的前8个氨基酸的密码子的核酸序列(Δ8)开始。除了野生型hGAA编码序列之外，我们还设计了编码所述Δ8截短的hGAA多肽的优化的核酸序列(SEQ ID NO：10和SEQ ID NO：11分别对应于hGAAco1和hGAAco2优化的编码序列)，以排除可能的序列特异性影响。The human GAA shown in SEQ ID NO:2 served as the basis for designing these new forms. SEQ ID NO: 1 corresponds to the sequence of SEQ ID NO: 2, but does not contain the corresponding native signal peptide of GAA (amino acids 1-27 of SEQ ID NO: 2). A nucleic acid construct was designed to encode a GAA polypeptide derived from SEQ ID NO: 1 truncated at the N-terminal end. We designed deletions on the basis of the wild-type hGAA coding sequence (SEQ ID NO:9, which corresponds to nucleotides 82-2859 of SEQ ID NO:8 as the wild-type hGAA coding sequence including the signal peptide coding sequence) Beginning with the nucleic acid sequence (Δ8) corresponding to the codons for the first 8 amino acids of SEQ ID NO:1. In addition to the wild-type hGAA coding sequence, we also designed optimized nucleic acid sequences encoding the Δ8 truncated hGAA polypeptide (SEQ ID NO: 10 and SEQ ID NO: 11 correspond to the optimized coding sequences of hGAAco1 and hGAAco2, respectively) , to rule out possible sequence-specific effects.

表1.优化的序列的描述。此表说明了两种hGAA优化序列与野生型的特征的比较。a)密码子适应指数和b)GC含量使用稀有密码子分析工具来计算(http://www.genscript.com)。c)和d)分别是在5’至3’(aORF 5'→3')和3’至5’(aORF 3'→5')链上计算的可选开放阅读框。e)和f)分别是使用拼接位点在线预测工具计算的受体(SA)和供体(SD)拼接位点(http://www.fruitfly.org/seq_tools/splice.html)。g)和h)分别是针对野生型(wt)和优化的co1序列计算的百分同一性。i)CpG岛使用MethDB在线工具计算(http://www.methdb.de/links.html)。CpG岛是长度超过100bp、GC含量>60％并且观察/预期比率>0.6的序列。Table 1. Description of optimized sequences. This table illustrates the comparison of the characteristics of the two hGAA optimized sequences with those of the wild type. a) Codon adaptation index and b) GC content were calculated using the rare codon analysis tool (http://www.genscript.com). c) and d) are the alternative open reading frames calculated on the 5’ to 3’ (aORF 5’→3’) and 3’ to 5’ (aORF 3’→5’) strands, respectively. e) and f) are the acceptor (SA) and donor (SD) splice sites calculated using the splice site online prediction tool (http://www.fruitfly.org/seq_tools/splice.html), respectively. g) and h) are percent identities calculated for the wild-type (wt) and optimized co1 sequences, respectively. i) CpG islands were calculated using the MethDB online tool (http://www.methdb.de/links.html). CpG islands are sequences longer than 100 bp, with a GC content >60% and an observed/expected ratio >0.6.

将hGAA序列的1-27位氨基酸(对应于hGAA的天然信号肽，在这里被定义为sp1；其序列示出在SEQ ID NO：4中)用在这里被定义为sp2的人类α-1-抗胰蛋白酶(NP_000286.3)的序列的1-24位氨基酸(序列示出在SEQ ID NO：5中)代替。我们将编码截短的hGAA的构建物与它们的全尺寸版本平行地转染到人类肝细胞瘤细胞(Huh-7)中，并在48小时后测量释放在培养基中的hGAA的量(图1A)。对于野生型(hGAA)和密码子优化的(hGAAco2)序列两者来说，hGAA序列的Δ8缺失引起分泌水平显著的、50％的提高。在不同的密码子优化的序列(hGAAco1)上进行的相同的截短也以相同的程度提高了hGAA的分泌。Amino acids 1-27 of the hGAA sequence (corresponding to the native signal peptide of hGAA, defined here as sp1; its sequence is shown in SEQ ID NO: 4) were used with human α-1-, defined here as sp2. Amino acids 1-24 of the sequence of antitrypsin (NP_000286.3) (sequence shown in SEQ ID NO: 5) were substituted. We transfected constructs encoding truncated hGAA into human hepatoma cells (Huh-7) in parallel with their full-size versions and measured the amount of hGAA released in the culture medium after 48 h (Fig. 1A). The Δ8 deletion of the hGAA sequence resulted in a significant, 50% increase in secretion levels for both wild-type (hGAA) and codon-optimized (hGAAco2) sequences. The same truncation performed on a different codon-optimized sequence (hGAAco1) also enhanced hGAA secretion to the same extent.

为了证实信号肽之后的序列的变化可以提高hGAA的分泌，我们进一步截短了所述hGAA多肽。我们从hGAAco1构建物消除了对应于hGAA的前42个氨基酸的密码子(Δ42)，并将它们用源自于胰凝乳蛋白酶原B1的信号肽(sp7；序列示出在SEQ ID NO：3中)代替。然后，我们将使用这种新的缺失的构建物获得的分泌的功效与使用与sp7信号肽融合的Δ8版本和带有sp1或sp7的全尺寸hGAAco1进行了比较。我们将这些构建物转染到Huh-7细胞中，并在转染后48小时测量培养基中的hGAA活性。正如预期的，我们可以在全尺寸hGAAco1转染后测量到hGAA活性(相对于GFP p＝0.055)，并且通过用sp7替换野生型信号肽，它的分泌提高2倍(相对于hGAAco1，p＝0.006)。令人吃惊的是，与全尺寸序列相比，与sp7信号肽融合的Δ8和Δ42hGAA序列两者都显示出分泌的hGAA的2倍的提高(相对于sp7-hGAAco1分别为p＝0.0002和0.0003，图1B)。To confirm that changes in the sequence after the signal peptide can enhance hGAA secretion, we further truncated the hGAA polypeptide. We eliminated the codons corresponding to the first 42 amino acids of hGAA (Δ42) from the hGAAco1 construct and replaced them with a signal peptide derived from chymotrypsinogen B1 (sp7; sequence shown in SEQ ID NO: 3 middle) instead. We then compared the efficacy of the secretion obtained using this new deleted construct with that using the Δ8 version fused to the sp7 signal peptide and the full-size hGAAco1 with sp1 or sp7. We transfected these constructs into Huh-7 cells and measured hGAA activity in the culture medium 48 hours after transfection. As expected, we could measure hGAA activity after transfection of full-size hGAAco1 (p=0.055 vs. GFP), and by replacing the wild-type signal peptide with sp7, its secretion was improved 2-fold (p=0.006 vs. hGAAco1 ). Surprisingly, both the Δ8 and Δ42 hGAA sequences fused to the sp7 signal peptide showed a 2-fold increase in secreted hGAA compared to the full-size sequence (p=0.0002 and 0.0003, respectively, relative to sp7-hGAAco1, Figure 1B).

合在一起，这些数据证实了hGAA序列的截短与高效信号肽相结合，能够在体外提高所述蛋白质的分泌。此外，所述截短与本源序列的突变相比具有一个重要优势，因为它不产生大量的新抗原，这在治疗性产品的工程化改造中是一个优点。Taken together, these data demonstrate that truncation of the hGAA sequence combined with a highly efficient signal peptide can enhance secretion of the protein in vitro. Furthermore, the truncation has an important advantage over mutations of the native sequence in that it does not generate large amounts of neoantigens, which is an advantage in the engineering of therapeutic products.

然后，我们在体内，在庞贝氏病小鼠模型中验证了这些发现。我们用AAV8载体注射GAA-/-小鼠(Raben等，J.Bio.Chem.1998)，所述AAV8载体在源自于载脂蛋白B增强子和人类α-1-抗胰蛋白酶启动子的融合体的高强度肝特异性启动子(hAAT)的转录控制之下表达与sp7信号肽融合的全尺寸、Δ8或Δ42hGAAco1。在注射2E12vg/kg的如上所述的载体后一个月，将小鼠放血并测量血清中的hGAA活性。用表达与sp7融合的全长hGAAco1的载体处理的小鼠，在血流中显示出hGAA水平的提高(相对于PBS，p＝0.115)。令人吃惊的是，截短的hGAAΔ8和Δ42两者都引起血清中hGAA水平的显著提高(分别为p＝0.014和0.013)。We then validated these findings in vivo, in a mouse model of Pompe disease. We injected GAA-/- mice (Raben et al., J. Bio. Chem. 1998) with an AAV8 vector derived from the apolipoprotein B enhancer and the human alpha-1-antitrypsin promoter. Full-size, Δ8 or Δ42 hGAAco1 fused to the sp7 signal peptide is expressed under the transcriptional control of the fusion's high-strength liver-specific promoter (hAAT). One month after injection of 2E12 vg/kg of vector as described above, mice were bled and hGAA activity in serum was measured. Mice treated with a vector expressing full-length hGAAco1 fused to sp7 showed increased hGAA levels in the bloodstream (p=0.115 vs. PBS). Surprisingly, both truncated hGAA Δ8 and Δ42 caused significant increases in hGAA levels in serum (p=0.014 and 0.013, respectively).

这些数据表明，hGAA的前端氨基酸的缺失引起血流中分泌的hGAA的水平显著提高。These data indicate that deletion of the front amino acid of hGAA causes a significant increase in the levels of secreted hGAA in the bloodstream.

此外，将对应于艾杜糖醛酸-2-硫酸酯酶的1-25位氨基酸(sp6；SEQ ID NO：6)的另一个信号肽融合到hGAA的Δ8截短形式。与表达与sp1、sp2、sp6、sp7或sp8融合的优化的hGAA(hGAAco1)的质粒平行地，我们用表达GFP或野生型hGAA(hGAA；对应于SEQ ID NO：30的28-952位氨基酸残基的亲本多肽)的质粒转染肝细胞瘤细胞(Huh-7)。在转染后48小时，分析生长培养基中hGAA的存在。值得注意的是，这些构建物引起hGAA分泌水平显著高于在GFP转染的细胞所代表的阴性对照中观察到的分泌水平(图3)。Additionally, another signal peptide corresponding to amino acids 1-25 of iduronate-2-sulfatase (sp6; SEQ ID NO: 6) was fused to the Δ8 truncated form of hGAA. In parallel with plasmids expressing optimized hGAA (hGAAco1) fused to sp1, sp2, sp6, sp7 or sp8, we used plasmids expressing GFP or wild-type hGAA (hGAA; corresponding to amino acid residues 28-952 of SEQ ID NO: 30 Hepatoma cells (Huh-7) were transfected with a plasmid based on the parent polypeptide). At 48 hours after transfection, the growth medium was analyzed for the presence of hGAA. Notably, these constructs caused hGAA secretion levels that were significantly higher than those observed in the negative control represented by GFP-transfected cells (Fig. 3).

然后，我们评估了如上所述用Δ8-hGAA处理的GAA-/-小鼠的心脏、隔膜和股四头肌中的糖原含量。值得注意的是，在用Δ8-hGAAco表达载体处理后，我们在组织中观察到高水平的hGAA(数据未示出)，其与所有考虑的组织中糖原含量的显著降低相关(图4B-D)。具体来说，在心脏中(图4B)，在用带有高效信号肽sp7和8的载体处理后测量到的糖原水平与在未患病的动物中观察到的水平不可区分(相对于WT分别为p＝0.983和0.996)。重要的是，与PBS注射或用与sp1信号肽融合的hGAAco表达载体处理的GAA-/-动物相比，在用sp7和sp8载体两者处理后观察到的水平显著降低。We then assessed glycogen content in the heart, diaphragm, and quadriceps muscles of GAA-/- mice treated with Δ8-hGAA as described above. Notably, we observed high levels of hGAA in tissues after treatment with the Δ8-hGAAco expression vector (data not shown), which correlated with a significant decrease in glycogen content in all tissues considered (Figure 4B- D). Specifically, in the heart (Fig. 4B), glycogen levels measured after treatment with vehicle bearing efficient signal peptides sp7 and 8 were indistinguishable from those observed in undiseased animals (relative to WT p=0.983 and 0.996 respectively). Importantly, the levels observed after treatment with both sp7 and sp8 vectors were significantly reduced compared to GAA-/- animals injected with PBS or treated with the hGAAco expression vector fused to the sp1 signal peptide.

我们还试验了使用我们的载体的肝转导是否诱导针对所述转入基因的体液应答。将小鼠用在肝特异性启动子的转录控制之下表达带有本源sp1信号肽的hGAAco1(co)或与sp2、sp7或sp8融合的Δ8-hGAAco1的AAV8载体静脉内注射。结果呈现在图5中。用在组成性启动子的转录控制之下表达Δ8-hGAAco1的AAV肌肉内注射的Gaa-/-显示出非常高水平的总IgG(～150μg/mL)，而在肝中表达同一蛋白的载体总的来说显示出更低的体液应答水平。有趣的是，用表达sp1hGAAco1(co)的载体注射的小鼠在两种剂量下显示出可检测的抗体水平，而用工程化的高分泌载体注射的小鼠具有不可检测的IgG水平。这些数据表明，转入基因在肝中的表达对于外周耐受的诱导来说是基础性的，它们还提供了下述指示，即通过与高效信号肽融合获得的高的循环hGAA水平，诱导针对所述蛋白质本身的体液应答的降低。We also tested whether hepatic transduction using our vector induces a humoral response to the introduced gene. Mice were injected intravenously with AAV8 vectors expressing hGAAco1(co) with the native sp1 signal peptide or Δ8-hGAAco1 fused to sp2, sp7, or sp8 under the transcriptional control of a liver-specific promoter. The results are presented in Figure 5. Gaa-/- injected intramuscularly with AAV expressing Δ8-hGAAco1 under the transcriptional control of a constitutive promoter showed very high levels of total IgG (∼150 μg/mL), whereas vectors expressing the same protein in the liver total showed lower humoral response levels. Interestingly, mice injected with a vector expressing sp1hGAAco1(co) showed detectable antibody levels at both doses, whereas mice injected with the engineered high-secretion vector had undetectable IgG levels. These data indicate that expression of the transgene in the liver is fundamental for the induction of peripheral tolerance and they also provide an indication that the high circulating hGAA levels achieved by fusion with a highly efficient signal peptide induces resistance to Reduction in humoral response to the protein itself.

将在小鼠研究中选出的性能最好的载体注射到两只非人类灵长动物(NHP，食蟹猴)中，以验证我们的载体的分泌效能和在肌肉中的摄取。我们用2E12vg/kg的AAV8-hAAT-sp7-Δ8-hGAAco1注射两只猴。在注射后一个月，我们使用特异性抗hGAA抗体，通过蛋白质印迹测量了两只动物的血清中的hGAA水平。我们在两只猴中观察到大小与hGAA相符的清晰条带。这个条带在载体注射前12天获得的血清样品中不存在，从而证实了我们的检测方法的特异性(图6A)。我们在注射后三个月处死动物并获取组织，以确认从肝脏分泌到血流中的hGAA是否被肌肉高效摄取。我们使用特异性针对hGAA的抗体对从两只猴的二头肌和隔膜获得的总裂解液进行蛋白质印迹。有趣的是，我们能够在2号动物中观察到清晰的条带，所述动物也在血流中显示出最高水平的hGAA(图6B)。在1号动物中，我们也能在两种被分析的肌肉中观察到分子量与hGAA相一致的较弱的条带。这些数据表明，在NHP中AAV8-hAAT-sp7-Δ8-hGAAco1载体高效转导肝。它们也证实了分泌在血流中的蛋白被高效摄取到肌肉中，并且这种摄取与在血液中测量到的hGAA水平相关。The best-performing vector selected in the mouse study was injected into two non-human primates (NHP, cynomolgus monkeys) to verify the secretion potency and uptake into muscle of our vector. We injected two monkeys with 2E12vg/kg of AAV8-hAAT-sp7-Δ8-hGAAco1. One month after injection, we measured hGAA levels in the serum of both animals by Western blotting using specific anti-hGAA antibodies. We observed clear bands in two monkeys with a size consistent with hGAA. This band was absent in serum samples obtained 12 days before vector injection, confirming the specificity of our assay (Fig. 6A). We sacrificed the animals three months after injection and harvested tissues to confirm whether hGAA secreted from the liver into the bloodstream was efficiently taken up by the muscles. We performed Western blots of total lysates obtained from biceps and septum of two monkeys using antibodies specific for hGAA. Interestingly, we were able to observe clear bands in animal 2, which also showed the highest levels of hGAA in the bloodstream (Figure 6B). In animal 1, we could also observe weaker bands with molecular weights consistent with hGAA in both analyzed muscles. These data demonstrate that the AAV8-hAAT-sp7-Δ8-hGAAco1 vector efficiently transduces the liver in NHP. They also demonstrate that proteins secreted in the bloodstream are efficiently taken up into muscle, and that this uptake correlates with hGAA levels measured in the blood.

我们还确定了在小鼠研究中选出的最佳性能载体(AAV8-hAAT-sp7-Δ8-hGAAco1)在GSDIII小鼠模型中的效果。我们开发了糖原脱支酶(GDE)的敲除小鼠模型。这种模型重现了在患有III型糖原贮积病(GSDIII)的人类中观察到的疾病表型。具体来说，完全缺乏GDE活性的GDE-/-小鼠具有肌肉强度受损并在不同组织中积累糖原。有趣的是，它们也在肝脏中积累糖原，这在人类中也被观察到。在这里，我们试验了肝脏中sp7-Δ8-hGAA的过表达是否挽救在GDE-/-小鼠中观察到的糖原积累。我们用1E11或1E12vg/小鼠的AAV8-hAAT-sp7-Δ8-hGAAco1注射GDE-/-小鼠。作为对照，我们用PBS平行地注射野生型(WT)和GDE-/-小鼠。在载体给药后三个月，将小鼠处死，并对肝脏中的糖原水平进行定量。结果报告在图7中。正如已经报道的(Pagliarani等和我们的模型)，GDE-/-小鼠显示出肝脏中糖原积累的显著增加(p＝1.3E-7)，当与野生型动物相比时具有多5倍的糖原。令人吃惊的是，使用1E11和1E12vg/小鼠的AAV8-hAAT-sp7-Δ8-hGAAco1载体的处理引起糖原含量的统计学显著的降低(分别为p＝4.5E-5和1.4E-6)。重要的是，在用AAV8-hAAT-sp7-Δ8-hGAAco1载体注射的小鼠的肝脏中测量到的糖原水平与在野生型动物中测量到的水平不可区分，特别是在最高剂量下(对于1E11剂量组群来说p＝0.053，对于1E12剂量组群来说为0.244)。We also determined the effect of the vector selected for best performance in mouse studies (AAV8-hAAT-sp7-Δ8-hGAAco1) in the GSDIII mouse model. We developed a knockout mouse model of glycogen debranching enzyme (GDE). This model recapitulates the disease phenotype observed in humans with glycogen storage disease type III (GSDIII). Specifically, GDE-/- mice that completely lack GDE activity have impaired muscle strength and accumulate glycogen in different tissues. Interestingly, they also accumulate glycogen in the liver, which is also observed in humans. Here, we tested whether overexpression of sp7-Δ8-hGAA in the liver rescues the glycogen accumulation observed in GDE-/- mice. We injected GDE-/- mice with 1E11 or 1E12 vg/mouse of AAV8-hAAT-sp7-Δ8-hGAAco1. As controls, we injected wild-type (WT) and GDE-/- mice in parallel with PBS. Three months after vehicle administration, mice were sacrificed and glycogen levels in the liver were quantified. The results are reported in Figure 7. As already reported (Pagliarani et al. and our model), GDE-/- mice showed a significant increase in glycogen accumulation in the liver (p=1.3E-7), with 5-fold more when compared to wild-type animals. of glycogen. Surprisingly, treatment with 1E11 and 1E12 vg/mouse AAV8-hAAT-sp7-Δ8-hGAAco1 vector caused a statistically significant reduction in glycogen content (p=4.5E-5 and 1.4E-6, respectively ). Importantly, glycogen levels measured in the livers of mice injected with the AAV8-hAAT-sp7-Δ8-hGAAco1 vector were indistinguishable from those measured in wild-type animals, especially at the highest dose (for p=0.053 for the 1E11 dose cohort and 0.244 for the 1E12 dose cohort).

我们在用不同版本的GAA(全都密码子优化过)转染的HuH7细胞的培养基和裂解物中进行了GAA活性的分析：1.包括本源sp1GAA信号肽的本源GAA(co)，2.含有异源sp7信号肽的工程化的GAA(sp7-co)，和3.含有异源sp7信号肽然后缺失了不同数目的氨基酸的工程化的GAA(sp7-Δ8-co、sp7-Δ29-co、sp7-Δ42-co、sp7-Δ43-co、sp7-Δ47-co和sp7-Δ62-co，其中分别缺失了SEQ ID NO：1的前8、29、42、47和62个N-端氨基酸)。所述分析显示(图8)，与工程化的未缺失的GAA(sp7-co)和本源GAA(co)两者相比，在用Δ8、Δ29、Δ42和Δ43GAA版本转染的细胞的培养基中GAA活性明显更高。相反，与其他工程化的GAA版本[缺失的(sp7-Δ8-co、sp7-Δ29-co、sp7-Δ42-co、sp7-Δ43-co)和未缺失的(sp7-co)]相比，在用Δ47和Δ62GAA版本转染的细胞的培养基中观察到明显更低的GAA活性。有趣的是，(图9)在生产性缺失(sp7-Δ8-co、sp7-Δ29-co、sp7-Δ42-co、sp7-Δ43-co)和未缺失的版本(sp7-co)之间细胞内GAA活性没有差异，表明它们都在细胞内高效地生产和加工。相反，对于sp7-Δ47-co和sp7-Δ62-co版本来说，细胞内GAA活性非常低，并且与所有其他工程化的版本[缺失的(sp7-Δ8-co、sp7-Δ29-co、sp7-Δ42-co、sp7-Δ43-co)和未缺失的(sp7-co)]相比明显更低。We performed analyzes of GAA activity in the culture medium and lysates of HuH7 cells transfected with different versions of GAA (all codon optimized): 1. native GAA(co) including the native sp1GAA signal peptide, 2. Engineered GAA with heterologous sp7 signal peptide (sp7-co), and 3. Engineered GAA containing heterologous sp7 signal peptide and then deleted different numbers of amino acids (sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7-Δ43-co, sp7-Δ47-co and sp7-Δ62-co, in which the first 8, 29, 42, 47 and 62 N-terminal amino acids of SEQ ID NO: 1 are deleted respectively) . The analysis showed (Fig. 8) that in the culture medium of cells transfected with the Δ8, Δ29, Δ42 and Δ43 GAA versions compared to both engineered non-deleted GAA (sp7-co) and native GAA (co) GAA activity was significantly higher. In contrast, compared with other engineered GAA versions [deleted (sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7-Δ43-co) and non-deleted (sp7-co)], Significantly lower GAA activity was observed in the culture medium of cells transfected with the Δ47 and Δ62 GAA versions. Interestingly (Fig. 9) between productive deleted (sp7-Δ8-co, sp7-Δ29-co, sp7-Δ42-co, sp7-Δ43-co) and non-deleted versions (sp7-co) cells There were no differences in intracellular GAA activity, indicating that they are both efficiently produced and processed within the cell. In contrast, intracellular GAA activity was very low for the sp7-Δ47-co and sp7-Δ62-co versions and was consistent with all other engineered versions [deleted (sp7-Δ8-co, sp7-Δ29-co, sp7 -Δ42-co, sp7-Δ43-co) and not deleted (sp7-co)] were significantly lower.

我们还在用不同版本的GAA(全都密码子优化过)转染的HuH7细胞的培养基和裂解物中进行了GAA活性的分析：1.包括本源sp1GAA信号肽的本源GAA(co)，2.含有异源sp6或sp8信号肽的工程化的GAA(sp6-co、sp8-co)，和3.含有异源sp6或sp8信号肽然后缺失了8个氨基酸的工程化的GAA(sp6-Δ8-co、sp8-Δ8-co)。所述分析显示(图10)，与i.它们相应的工程化的未缺失的GAA版本(sp6-co或sp8-co)和ii.本源GAA(co)相比，在用Δ8版本转染的细胞的培养基中GAA活性明显更高。有趣的是，在所有工程化的GAA版本(缺失和未缺失的两者)之间，细胞内GAA活性没有差异，表明它们都在细胞内高效地生产和加工(细胞裂解液图)。相反，与工程化的版本相比，使用本源GAA(co)时细胞内GAA活性明显更高，表明本源GAA主要保留在细胞内。We also performed analyzes of GAA activity in the culture medium and lysates of HuH7 cells transfected with different versions of GAA (all codon optimized): 1. native GAA(co) including the native sp1GAA signal peptide, 2. Engineered GAA containing heterologous sp6 or sp8 signal peptide (sp6-co, sp8-co), and 3. Engineered GAA containing heterologous sp6 or sp8 signal peptide and then deleted 8 amino acids (sp6-Δ8- co, sp8-Δ8-co). The analysis showed (Figure 10) that, compared with i. their corresponding engineered non-deleted GAA versions (sp6-co or sp8-co) and ii. native GAA (co), the GAA activity was significantly higher in the cells' culture medium. Interestingly, there was no difference in intracellular GAA activity between all engineered versions of GAA (both deleted and non-deleted), indicating that they are all efficiently produced and processed within the cell (cell lysate plot). In contrast, intracellular GAA activity was significantly higher when native GAA(co) was used compared with the engineered version, indicating that native GAA was primarily retained within the cell.

序列表sequence list

<110> 吉尼松公司等<110> Genezon Company, etc.

<120> 酸性α-葡萄糖苷酶变体及其用途<120> Acid alpha-glucosidase variants and their uses

<130> B2299PC00<130> B2299PC00

<160> 53<160> 53

<170> PatentIn version 3.3<170>PatentIn version 3.3

<210> 1<210> 1

<211> 925<211> 925

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAwt w/o sp<223> hGAAwt w/o sp

<400> 1<400> 1

Gly His Ile Leu Leu His Asp Phe Leu Leu Val Pro Arg Glu Leu SerGly His Ile Leu Leu His Asp Phe Leu Leu Val Pro Arg Glu Leu Ser

1 5 10 151 5 10 15

Gly Ser Ser Pro Val Leu Glu Glu Thr His Pro Ala His Gln Gln GlyGly Ser Ser Pro Val Leu Glu Glu Thr His Pro Ala His Gln Gln Gly

20 25 30 20 25 30

Ala Ser Arg Pro Gly Pro Arg Asp Ala Gln Ala His Pro Gly Arg ProAla Ser Arg Pro Gly Pro Arg Asp Ala Gln Ala His Pro Gly Arg Pro

35 40 45 35 40 45

Arg Ala Val Pro Thr Gln Cys Asp Val Pro Pro Asn Ser Arg Phe AspArg Ala Val Pro Thr Gln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp

50 55 60 50 55 60

Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg GlyCys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly

65 70 75 8065 70 75 80

Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln Gly Ala Gln Met GlyCys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln Gly Ala Gln Met Gly

85 90 95 85 90 95

Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu GluGln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu

100 105 110 100 105 110

Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg ThrAsn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr

115 120 125 115 120 125

Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp ValThr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val

130 135 140 130 135 140

Met Met Glu Thr Glu Asn Arg Leu His Phe Thr Ile Lys Asp Pro AlaMet Met Glu Thr Glu Asn Arg Leu His Phe Thr Ile Lys Asp Pro Ala

145 150 155 160145 150 155 160

Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro His Val His Ser ArgAsn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro His Val His Ser Arg

165 170 175 165 170 175

Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe GlyAla Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly

180 185 190 180 185 190

Val Ile Val Arg Arg Gln Leu Asp Gly Arg Val Leu Leu Asn Thr ThrVal Ile Val Arg Arg Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr

195 200 205 195 200 205

Val Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr SerVal Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser

210 215 220 210 215 220

Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu His Leu Ser Pro LeuLeu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu His Leu Ser Pro Leu

225 230 235 240225 230 235 240

Met Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp LeuMet Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu

245 250 255 245 250 255

Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr LeuAla Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu

260 265 270 260 265 270

Ala Leu Glu Asp Gly Gly Ser Ala His Gly Val Phe Leu Leu Asn SerAla Leu Glu Asp Gly Gly Ser Ala His Gly Val Phe Leu Leu Asn Ser

275 280 285 275 280 285

Asn Ala Met Asp Val Val Leu Gln Pro Ser Pro Ala Leu Ser Trp ArgAsn Ala Met Asp Val Val Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg

290 295 300 290 295 300

Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu ProSer Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro

305 310 315 320305 310 315 320

Lys Ser Val Val Gln Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe MetLys Ser Val Val Gln Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met

325 330 335 325 330 335

Pro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys Arg Trp Gly Tyr SerPro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser

340 345 350 340 345 350

Ser Thr Ala Ile Thr Arg Gln Val Val Glu Asn Met Thr Arg Ala HisSer Thr Ala Ile Thr Arg Gln Val Val Glu Asn Met Thr Arg Ala His

355 360 365 355 360 365

Phe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser ArgPhe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg

370 375 380 370 375 380

Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala MetArg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met

385 390 395 400385 390 395 400

Val Gln Glu Leu His Gln Gly Gly Arg Arg Tyr Met Met Ile Val AspVal Gln Glu Leu His Gln Gly Gly Arg Arg Tyr Met Met Ile Val Asp

405 410 415 405 410 415

Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr AspPro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp

420 425 430 420 425 430

Glu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn Glu Thr Gly Gln ProGlu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro

435 440 445 435 440 445

Leu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe ThrLeu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr

450 455 460 450 455 460

Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met Val Ala Glu Phe HisAsn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met Val Ala Glu Phe His

465 470 475 480465 470 475 480

Asp Gln Val Pro Phe Asp Gly Met Trp Ile Asp Met Asn Glu Pro SerAsp Gln Val Pro Phe Asp Gly Met Trp Ile Asp Met Asn Glu Pro Ser

485 490 495 485 490 495

Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu GluAsn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu

500 505 510 500 505 510

Asn Pro Pro Tyr Val Pro Gly Val Val Gly Gly Thr Leu Gln Ala AlaAsn Pro Pro Tyr Val Pro Gly Val Val Gly Gly Thr Leu Gln Ala Ala

515 520 525 515 520 525

Thr Ile Cys Ala Ser Ser His Gln Phe Leu Ser Thr His Tyr Asn LeuThr Ile Cys Ala Ser Ser His Gln Phe Leu Ser Thr His Tyr Asn Leu

530 535 540 530 535 540

His Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala Ser His Arg Ala LeuHis Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala Ser His Arg Ala Leu

545 550 555 560545 550 555 560

Val Lys Ala Arg Gly Thr Arg Pro Phe Val Ile Ser Arg Ser Thr PheVal Lys Ala Arg Gly Thr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe

565 570 575 565 570 575

Ala Gly His Gly Arg Tyr Ala Gly His Trp Thr Gly Asp Val Trp SerAla Gly His Gly Arg Tyr Ala Gly His Trp Thr Gly Asp Val Trp Ser

580 585 590 580 585 590

Ser Trp Glu Gln Leu Ala Ser Ser Val Pro Glu Ile Leu Gln Phe AsnSer Trp Glu Gln Leu Ala Ser Ser Val Pro Glu Ile Leu Gln Phe Asn

595 600 605 595 600 605

Leu Leu Gly Val Pro Leu Val Gly Ala Asp Val Cys Gly Phe Leu GlyLeu Leu Gly Val Pro Leu Val Gly Ala Asp Val Cys Gly Phe Leu Gly

610 615 620 610 615 620

Asn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr Gln Leu Gly Ala PheAsn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe

625 630 635 640625 630 635 640

Tyr Pro Phe Met Arg Asn His Asn Ser Leu Leu Ser Leu Pro Gln GluTyr Pro Phe Met Arg Asn His Asn Ser Leu Leu Ser Leu Pro Gln Glu

645 650 655 645 650 655

Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala Met Arg Lys Ala LeuPro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu

660 665 670 660 665 670

Thr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr Thr Leu Phe His GlnThr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr Thr Leu Phe His Gln

675 680 685 675 680 685

Ala His Val Ala Gly Glu Thr Val Ala Arg Pro Leu Phe Leu Glu PheAla His Val Ala Gly Glu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe

690 695 700 690 695 700

Pro Lys Asp Ser Ser Thr Trp Thr Val Asp His Gln Leu Leu Trp GlyPro Lys Asp Ser Ser Thr Trp Thr Val Asp His Gln Leu Leu Trp Gly

705 710 715 720705 710 715 720

Glu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala Gly Lys Ala Glu ValGlu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala Gly Lys Ala Glu Val

725 730 735 725 730 735

Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val ProThr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro

740 745 750 740 745 750

Val Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg GluVal Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu

755 760 765 755 760 765

Pro Ala Ile His Ser Glu Gly Gln Trp Val Thr Leu Pro Ala Pro LeuPro Ala Ile His Ser Glu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu

770 775 780 770 775 780

Asp Thr Ile Asn Val His Leu Arg Ala Gly Tyr Ile Ile Pro Leu GlnAsp Thr Ile Asn Val His Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln

785 790 795 800785 790 795 800

Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln Gln Pro Met Ala LeuGly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu

805 810 815 805 810 815

Ala Val Ala Leu Thr Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe TrpAla Val Ala Leu Thr Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp

820 825 830 820 825 830

Asp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr GlnAsp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln

835 840 845 835 840 845

Val Ile Phe Leu Ala Arg Asn Asn Thr Ile Val Asn Glu Leu Val ArgVal Ile Phe Leu Ala Arg Asn Asn Thr Ile Val Asn Glu Leu Val Arg

850 855 860 850 855 860

Val Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln Lys Val Thr Val LeuVal Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln Lys Val Thr Val Leu

865 870 875 880865 870 875 880

Gly Val Ala Thr Ala Pro Gln Gln Val Leu Ser Asn Gly Val Pro ValGly Val Ala Thr Ala Pro Gln Gln Val Leu Ser Asn Gly Val Pro Val

885 890 895 885 890 895

Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val Leu Asp Ile Cys ValSer Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val Leu Asp Ile Cys Val

900 905 910 900 905 910

Ser Leu Leu Met Gly Glu Gln Phe Leu Val Ser Trp CysSer Leu Leu Met Gly Glu Gln Phe Leu Val Ser Trp Cys

915 920 925 915 920 925

<210> 2<210> 2

<211> 952<211> 952

<212> PRT<212> PRT

<213> 智人<213> Homo sapiens

<400> 2<400> 2

Met Gly Val Arg His Pro Pro Cys Ser His Arg Leu Leu Ala Val CysMet Gly Val Arg His Pro Pro Cys Ser His Arg Leu Leu Ala Val Cys

1 5 10 151 5 10 15

Ala Leu Val Ser Leu Ala Thr Ala Ala Leu Leu Gly His Ile Leu LeuAla Leu Val Ser Leu Ala Thr Ala Ala Leu Leu Gly His Ile Leu Leu

20 25 30 20 25 30

His Asp Phe Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro ValHis Asp Phe Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val

35 40 45 35 40 45

Leu Glu Glu Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro GlyLeu Glu Glu Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly

50 55 60 50 55 60

Pro Arg Asp Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro ThrPro Arg Asp Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr

65 70 75 8065 70 75 80

Gln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp LysGln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys

85 90 95 85 90 95

Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile ProAla Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro

100 105 110 100 105 110

Ala Lys Gln Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys PheAla Lys Gln Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe

115 120 125 115 120 125

Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser SerPhe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser

130 135 140 130 135 140

Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe PheGlu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe

145 150 155 160145 150 155 160

Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr GluPro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu

165 170 175 165 170 175

Asn Arg Leu His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr GluAsn Arg Leu His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu

180 185 190 180 185 190

Val Pro Leu Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro LeuVal Pro Leu Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu

195 200 205 195 200 205

Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg ArgTyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg

210 215 220 210 215 220

Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu PheGln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe

225 230 235 240225 230 235 240

Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln TyrPhe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr

245 250 255 245 250 255

Ile Thr Gly Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr SerIle Thr Gly Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser

260 265 270 260 265 270

Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro GlyTrp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly

275 280 285 275 280 285

Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp GlyAla Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly

290 295 300 290 295 300

Gly Ser Ala His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp ValGly Ser Ala His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val

305 310 315 320305 310 315 320

Val Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly IleVal Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile

325 330 335 325 330 335

Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val GlnLeu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln

340 345 350 340 345 350

Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp GlyGln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly

355 360 365 355 360 365

Leu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile ThrLeu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr

370 375 380 370 375 380

Arg Gln Val Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp ValArg Gln Val Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val

385 390 395 400385 390 395 400

Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr PheGln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe

405 410 415 405 410 415

Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu HisAsn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His

420 425 430 420 425 430

Gln Gly Gly Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser SerGln Gly Gly Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser

435 440 445 435 440 445

Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg ArgSer Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg

450 455 460 450 455 460

Gly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys ValGly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val

465 470 475 480465 470 475 480

Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala LeuTrp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu

485 490 495 485 490 495

Ala Trp Trp Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro PheAla Trp Trp Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe

500 505 510 500 505 510

Asp Gly Met Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg GlyAsp Gly Met Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly

515 520 525 515 520 525

Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr ValSer Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val

530 535 540 530 535 540

Pro Gly Val Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala SerPro Gly Val Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser

545 550 555 560545 550 555 560

Ser His Gln Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr GlySer His Gln Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly

565 570 575 565 570 575

Leu Thr Glu Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg GlyLeu Thr Glu Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly

580 585 590 580 585 590

Thr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly ArgThr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg

595 600 605 595 600 605

Tyr Ala Gly His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln LeuTyr Ala Gly His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu

610 615 620 610 615 620

Ala Ser Ser Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val ProAla Ser Ser Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro

625 630 635 640625 630 635 640

Leu Val Gly Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu GluLeu Val Gly Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu

645 650 655 645 650 655

Leu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met ArgLeu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg

660 665 670 660 665 670

Asn His Asn Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe SerAsn His Asn Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser

675 680 685 675 680 685

Glu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr AlaGlu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala

690 695 700 690 695 700

Leu Leu Pro His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala GlyLeu Leu Pro His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly

705 710 715 720705 710 715 720

Glu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser SerGlu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser

725 730 735 725 730 735

Thr Trp Thr Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu IleThr Trp Thr Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile

740 745 750 740 745 750

Thr Pro Val Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe ProThr Pro Val Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro

755 760 765 755 760 765

Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu GlyLeu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly

770 775 780 770 775 780

Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His SerSer Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser

785 790 795 800785 790 795 800

Glu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn ValGlu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val

805 810 815 805 810 815

His Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu ThrHis Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr

820 825 830 820 825 830

Thr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu ThrThr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr

835 840 845 835 840 845

Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu SerLys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser

850 855 860 850 855 860

Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu AlaLeu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala

865 870 875 880865 870 875 880

Arg Asn Asn Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu GlyArg Asn Asn Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly

885 890 895 885 890 895

Ala Gly Leu Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr AlaAla Gly Leu Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala

900 905 910 900 905 910

Pro Gln Gln Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr TyrPro Gln Gln Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr

915 920 925 915 920 925

Ser Pro Asp Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met GlySer Pro Asp Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly

930 935 940 930 935 940

Glu Gln Phe Leu Val Ser Trp CysGlu Gln Phe Leu Val Ser Trp Cys

945 950945 950

<210> 3<210> 3

<211> 18<211> 18

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7<223> sp7

<400> 3<400> 3

Met Ala Phe Leu Trp Leu Leu Ser Cys Trp Ala Leu Leu Gly Thr ThrMet Ala Phe Leu Trp Leu Leu Ser Cys Trp Ala Leu Leu Gly Thr Thr

1 5 10 151 5 10 15

Phe GlyPhe Gly

<210> 4<210> 4

<211> 27<211> 27

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp1<223> sp1

<400> 4<400> 4

1 5 10 151 5 10 15

Ala Leu Val Ser Leu Ala Thr Ala Ala Leu LeuAla Leu Val Ser Leu Ala Thr Ala Ala Leu Leu

20 25 20 25

<210> 5<210> 5

<211> 24<211> 24

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp2<223> sp2

<400> 5<400> 5

Met Pro Ser Ser Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu CysMet Pro Ser Ser Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu Cys

1 5 10 151 5 10 15

Cys Leu Val Pro Val Ser Leu AlaCys Leu Val Pro Val Ser Leu Ala

20 20

<210> 6<210> 6

<211> 25<211> 25

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp6<223> sp6

<400> 6<400> 6

Met Pro Pro Pro Arg Thr Gly Arg Gly Leu Leu Trp Leu Gly Leu ValMet Pro Pro Pro Arg Thr Gly Arg Gly Leu Leu Trp Leu Gly Leu Val

1 5 10 151 5 10 15

Leu Ser Ser Val Cys Val Ala Leu GlyLeu Ser Ser Val Cys Val Ala Leu Gly

20 25 20 25

<210> 7<210> 7

<211> 22<211> 22

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp8<223> sp8

<400> 7<400> 7

Met Ala Ser Arg Leu Thr Leu Leu Thr Leu Leu Leu Leu Leu Leu AlaMet Ala Ser Arg Leu Thr Leu Leu Thr Leu Leu Leu Leu Leu Leu Ala

1 5 10 151 5 10 15

Gly Asp Arg Ala Ser SerGly Asp Arg Ala Ser Ser

20 20

<210> 8<210> 8

<211> 2859<211> 2859

<212> DNA<212> DNA

<213> 智人<213> Homo sapiens

<400> 8<400> 8

atgggagtga ggcacccgcc ctgctcccac cggctcctgg ccgtctgcgc cctcgtgtcc 60atgggagtga ggcacccgcc ctgctcccac cggctcctgg ccgtctgcgc cctcgtgtcc 60

ttggcaaccg cagcgctcct ggggcacatc ctactccatg atttcctgct ggttccccga 120ttggcaaccg cagcgctcct ggggcacatc ctactccatg atttcctgct ggttccccga 120

gagctgagtg gctcctcccc agtcctggag gagactcacc cagctcacca gcagggagcc 180gagctgagtg gctcctcccc agtcctggag gagactcacc cagctcacca gcagggagcc 180

agcagaccag ggccccggga tgcccaggca caccccgggc ggccgcgagc agtgcccaca 240agcagaccag ggccccggga tgcccaggca caccccgggc ggccgcgagc agtgcccaca 240

cagtgcgacg tcccccccaa cagccgcttc gattgcgccc ctgacaaggc catcacccag 300cagtgcgacg tcccccccaa cagccgcttc gattgcgccc ctgacaaggc catcacccag 300

gaacagtgcg aggcccgcgg ctgttgctac atccctgcaa agcaggggct gcagggagcc 360gaacagtgcg aggcccgcgg ctgttgctac atccctgcaa agcaggggct gcagggagcc 360

cagatggggc agccctggtg cttcttccca cccagctacc ccagctacaa gctggagaac 420cagatggggc agccctggtg cttcttccca cccagctacc ccagctacaa gctggagaac 420

ctgagctcct ctgaaatggg ctacacggcc accctgaccc gtaccacccc caccttcttc 480ctgagctcct ctgaaatggg ctacacggcc accctgaccc gtaccacccc caccttcttc 480

cccaaggaca tcctgaccct gcggctggac gtgatgatgg agactgagaa ccgcctccac 540cccaaggaca tcctgaccct gcggctggac gtgatgatgg agactgagaa ccgcctccac 540

ttcacgatca aagatccagc taacaggcgc tacgaggtgc ccttggagac cccgcatgtc 600ttcacgatca aagatccagc taacaggcgc tacgaggtgc ccttggagac cccgcatgtc 600

cacagccggg caccgtcccc actctacagc gtggagttct ccgaggagcc cttcggggtg 660cacagccggg caccgtcccc actctacagc gtggagttct ccgaggagcc cttcggggtg 660

atcgtgcgcc ggcagctgga cggccgcgtg ctgctgaaca cgacggtggc gcccctgttc 720atcgtgcgcc ggcagctgga cggccgcgtg ctgctgaaca cgacggtggc gcccctgttc 720

tttgcggacc agttccttca gctgtccacc tcgctgccct cgcagtatat cacaggcctc 780tttgcggacc agttccttca gctgtccacc tcgctgccct cgcagtatat cacaggcctc 780

gccgagcacc tcagtcccct gatgctcagc accagctgga ccaggatcac cctgtggaac 840gccgagcacc tcagtcccct gatgctcagc accagctgga ccaggatcac cctgtggaac 840

cgggaccttg cgcccacgcc cggtgcgaac ctctacgggt ctcacccttt ctacctggcg 900cgggaccttg cgcccacgcc cggtgcgaac ctctacgggt ctcacccttt ctacctggcg 900

ctggaggacg gcgggtcggc acacggggtg ttcctgctaa acagcaatgc catggatgtg 960ctggaggacg gcgggtcggc acacggggtg ttcctgctaa acagcaatgc catggatgtg 960

gtcctgcagc cgagccctgc ccttagctgg aggtcgacag gtgggatcct ggatgtctac 1020gtcctgcagc cgagccctgc ccttagctgg aggtcgacag gtgggatcct ggatgtctac 1020

atcttcctgg gcccagagcc caagagcgtg gtgcagcagt acctggacgt tgtgggatac 1080atcttcctgg gcccagcc caagagcgtg gtgcagcagt acctggacgt tgtgggatac 1080

ccgttcatgc cgccatactg gggcctgggc ttccacctgt gccgctgggg ctactcctcc 1140ccgttcatgc cgccatactg gggcctgggc ttccacctgt gccgctgggg ctactcctcc 1140

accgctatca cccgccaggt ggtggagaac atgaccaggg cccacttccc cctggacgtc 1200accgctatca cccgccaggt ggtggagaac atgaccaggg cccacttccc cctggacgtc 1200

cagtggaacg acctggacta catggactcc cggagggact tcacgttcaa caaggatggc 1260cagtggaacg acctggacta catggactcc cggagggact tcacgttcaa caaggatggc 1260

ttccgggact tcccggccat ggtgcaggag ctgcaccagg gcggccggcg ctacatgatg 1320ttccgggact tcccggccat ggtgcaggag ctgcaccagg gcggccggcg ctacatgatg 1320

atcgtggatc ctgccatcag cagctcgggc cctgccggga gctacaggcc ctacgacgag 1380atcgtggatc ctgccatcag cagctcgggc cctgccggga gctacaggcc ctacgacgag 1380

ggtctgcgga ggggggtttt catcaccaac gagaccggcc agccgctgat tgggaaggta 1440ggtctgcgga ggggggtttt catcaccaac gagaccggcc agccgctgat tgggaaggta 1440

tggcccgggt ccactgcctt ccccgacttc accaacccca cagccctggc ctggtgggag 1500tggcccgggt ccactgcctt ccccgacttc accaacccca cagccctggc ctggtggggag 1500

gacatggtgg ctgagttcca tgaccaggtg cccttcgacg gcatgtggat tgacatgaac 1560gacatggtgg ctgagttcca tgaccaggtg cccttcgacg gcatgtggat tgacatgaac 1560

gagccttcca acttcatcag gggctctgag gacggctgcc ccaacaatga gctggagaac 1620gagccttcca acttcatcag gggctctgag gacggctgcc ccaacaatga gctggagaac 1620

ccaccctacg tgcctggggt ggttgggggg accctccagg cggccaccat ctgtgcctcc 1680ccaccctacg tgcctggggt ggttgggggg accctccagg cggccaccat ctgtgcctcc 1680

agccaccagt ttctctccac acactacaac ctgcacaacc tctacggcct gaccgaagcc 1740agccaccagt ttctctccac acactacaac ctgcacaacc tctacggcct gaccgaagcc 1740

atcgcctccc acagggcgct ggtgaaggct cgggggacac gcccatttgt gatctcccgc 1800atcgcctccc acagggcgct ggtgaaggct cgggggacac gcccatttgt gatctcccgc 1800

tcgacctttg ctggccacgg ccgatacgcc ggccactgga cgggggacgt gtggagctcc 1860tcgacctttg ctggccacgg ccgatacgcc ggccactgga cgggggacgt gtggagctcc 1860

tgggagcagc tcgcctcctc cgtgccagaa atcctgcagt ttaacctgct gggggtgcct 1920tggggagcagc tcgcctcctc cgtgccagaa atcctgcagt ttaacctgct gggggtgcct 1920

ctggtcgggg ccgacgtctg cggcttcctg ggcaacacct cagaggagct gtgtgtgcgc 1980ctggtcgggg ccgacgtctg cggcttcctg ggcaacacct cagaggagct gtgtgtgcgc 1980

tggacccagc tgggggcctt ctaccccttc atgcggaacc acaacagcct gctcagtctg 2040tggacccagc tgggggcctt ctaccccttc atgcggaacc acaacagcct gctcagtctg 2040

ccccaggagc cgtacagctt cagcgagccg gcccagcagg ccatgaggaa ggccctcacc 2100ccccaggagc cgtacagctt cagcgagccg gcccagcagg ccatgaggaa ggccctcacc 2100

ctgcgctacg cactcctccc ccacctctac acactgttcc accaggccca cgtcgcgggg 2160ctgcgctacg cactcctccc ccacctctac acactgttcc accaggccca cgtcgcgggg 2160

gagaccgtgg cccggcccct cttcctggag ttccccaagg actctagcac ctggactgtg 2220gagaccgtgg cccggcccct cttcctggag ttccccaagg actctagcac ctggactgtg 2220

gaccaccagc tcctgtgggg ggaggccctg ctcatcaccc cagtgctcca ggccgggaag 2280gaccaccagc tcctgtgggg ggaggccctg ctcatcaccc cagtgctcca ggccgggaag 2280

gccgaagtga ctggctactt ccccttgggc acatggtacg acctgcagac ggtgccagta 2340gccgaagtga ctggctactt ccccttgggc acatggtacg acctgcagac ggtgccagta 2340

gaggcccttg gcagcctccc acccccacct gcagctcccc gtgagccagc catccacagc 2400gaggcccttg gcagcctccc acccccacct gcagctcccc gtgagccagc catccacagc 2400

gaggggcagt gggtgacgct gccggccccc ctggacacca tcaacgtcca cctccgggct 2460gaggggcagt gggtgacgct gccggccccc ctggacacca tcaacgtcca cctccgggct 2460

gggtacatca tccccctgca gggccctggc ctcacaacca cagagtcccg ccagcagccc 2520gggtacatca tccccctgca gggccctggc ctcacaacca cagagtcccg ccagcagccc 2520

atggccctgg ctgtggccct gaccaagggt ggggaggccc gaggggagct gttctgggac 2580atggccctgg ctgtggccct gaccaagggt ggggaggccc gaggggagct gttctgggac 2580

gatggagaga gcctggaagt gctggagcga ggggcctaca cacaggtcat cttcctggcc 2640gatggagaga gcctggaagt gctggagcga ggggcctaca cacaggtcat cttcctggcc 2640

aggaataaca cgatcgtgaa tgagctggta cgtgtgacca gtgagggagc tggcctgcag 2700aggaataaca cgatcgtgaa tgagctggta cgtgtgacca gtgagggagc tggcctgcag 2700

ctgcagaagg tgactgtcct gggcgtggcc acggcgcccc agcaggtcct ctccaacggt 2760ctgcagaagg tgactgtcct gggcgtggcc acggcgcccc agcaggtcct ctccaacggt 2760

gtccctgtct ccaacttcac ctacagcccc gacaccaagg tcctggacat ctgtgtctcg 2820gtccctgtct ccaacttcac ctacagcccc gacaccaagg tcctggacat ctgtgtctcg 2820

ctgttgatgg gagagcagtt tctcgtcagc tggtgttag 2859ctgttgatgg gagagcagtt tctcgtcagc tggtgttag 2859

<210> 9<210> 9

<211> 2778<211> 2778

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAwt w/o sp<223> hGAAwt w/o sp

<400> 9<400> 9

gggcacatcc tactccatga tttcctgctg gttccccgag agctgagtgg ctcctcccca 60gggcacatcc tactccatga tttcctgctg gttccccgag agctgagtgg ctcctcccca 60

gtcctggagg agactcaccc agctcaccag cagggagcca gcagaccagg gccccgggat 120gtcctggagg agactcaccc agctcaccag cagggagcca gcagaccagg gccccggggat 120

gcccaggcac accccgggcg gccgcgagca gtgcccacac agtgcgacgt cccccccaac 180gcccaggcac accccgggcg gccgcgagca gtgcccacac agtgcgacgt cccccccaac 180

agccgcttcg attgcgcccc tgacaaggcc atcacccagg aacagtgcga ggcccgcggc 240agccgcttcg attgcgcccc tgacaaggcc atcacccagg aacagtgcga ggcccgcggc 240

tgttgctaca tccctgcaaa gcaggggctg cagggagccc agatggggca gccctggtgc 300tgttgctaca tccctgcaaa gcaggggctg cagggagccc agatggggca gccctggtgc 300

ttcttcccac ccagctaccc cagctacaag ctggagaacc tgagctcctc tgaaatgggc 360ttcttcccac ccagctaccc cagctacaag ctggagaacc tgagctcctc tgaaatgggc 360

tacacggcca ccctgacccg taccaccccc accttcttcc ccaaggacat cctgaccctg 420tacacggcca ccctgacccg taccaccccc accttcttcc ccaaggacat cctgaccctg 420

cggctggacg tgatgatgga gactgagaac cgcctccact tcacgatcaa agatccagct 480cggctggacg tgatgatgga gactgagaac cgcctccact tcacgatcaa agatccagct 480

aacaggcgct acgaggtgcc cttggagacc ccgcatgtcc acagccgggc accgtcccca 540aacaggcgct acgaggtgcc cttggagacc ccgcatgtcc acagccgggc accgtcccca 540

ctctacagcg tggagttctc cgaggagccc ttcggggtga tcgtgcgccg gcagctggac 600ctctacagcg tggagttctc cgaggagccc ttcggggtga tcgtgcgccg gcagctggac 600

ggccgcgtgc tgctgaacac gacggtggcg cccctgttct ttgcggacca gttccttcag 660ggccgcgtgc tgctgaacac gacggtggcg cccctgttct ttgcggacca gttccttcag 660

ctgtccacct cgctgccctc gcagtatatc acaggcctcg ccgagcacct cagtcccctg 720ctgtccacct cgctgccctc gcagtatatc acaggcctcg ccgagcacct cagtcccctg 720

atgctcagca ccagctggac caggatcacc ctgtggaacc gggaccttgc gcccacgccc 780atgctcagca ccagctggac caggatcacc ctgtggaacc gggaccttgc gcccacgccc 780

ggtgcgaacc tctacgggtc tcaccctttc tacctggcgc tggaggacgg cgggtcggca 840ggtgcgaacc tctacgggtc tcaccctttc tacctggcgc tggaggacgg cgggtcggca 840

cacggggtgt tcctgctaaa cagcaatgcc atggatgtgg tcctgcagcc gagccctgcc 900cacggggtgt tcctgctaaa cagcaatgcc atggatgtgg tcctgcagcc gagccctgcc 900

cttagctgga ggtcgacagg tgggatcctg gatgtctaca tcttcctggg cccagagccc 960cttagctgga ggtcgacagg tgggatcctg gatgtctaca tcttcctggg cccagagccc 960

aagagcgtgg tgcagcagta cctggacgtt gtgggatacc cgttcatgcc gccatactgg 1020aagagcgtgg tgcagcagta cctggacgtt gtgggatacc cgttcatgcc gccatactgg 1020

ggcctgggct tccacctgtg ccgctggggc tactcctcca ccgctatcac ccgccaggtg 1080ggcctgggct tccacctgtg ccgctggggc tactcctcca ccgctatcac ccgccaggtg 1080

gtggagaaca tgaccagggc ccacttcccc ctggacgtcc agtggaacga cctggactac 1140gtggagaaca tgaccagggc ccacttcccc ctggacgtcc agtggaacga cctggactac 1140

atggactccc ggagggactt cacgttcaac aaggatggct tccgggactt cccggccatg 1200atggactccc ggagggactt cacgttcaac aaggatggct tccgggactt cccggccatg 1200

gtgcaggagc tgcaccaggg cggccggcgc tacatgatga tcgtggatcc tgccatcagc 1260gtgcaggagc tgcaccaggg cggccggcgc tacatgatga tcgtggatcc tgccatcagc 1260

agctcgggcc ctgccgggag ctacaggccc tacgacgagg gtctgcggag gggggttttc 1320agctcgggcc ctgccgggag ctacaggccc tacgacgagg gtctgcggag gggggttttc 1320

atcaccaacg agaccggcca gccgctgatt gggaaggtat ggcccgggtc cactgccttc 1380atcaccaacg agaccggcca gccgctgatt gggaaggtat ggcccgggtc cactgccttc 1380

cccgacttca ccaaccccac agccctggcc tggtgggagg acatggtggc tgagttccat 1440cccgacttca ccaaccccac agccctggcc tggtgggagg acatggtggc tgagttccat 1440

gaccaggtgc ccttcgacgg catgtggatt gacatgaacg agccttccaa cttcatcagg 1500gaccaggtgc ccttcgacgg catgtggatt gacatgaacg agccttccaa cttcatcagg 1500

ggctctgagg acggctgccc caacaatgag ctggagaacc caccctacgt gcctggggtg 1560ggctctgagg acggctgccc caacaatgag ctggagaacc caccctacgt gcctggggtg 1560

gttgggggga ccctccaggc ggccaccatc tgtgcctcca gccaccagtt tctctccaca 1620gttgggggga ccctccaggc ggccaccatc tgtgcctcca gccaccagtt tctctccaca 1620

cactacaacc tgcacaacct ctacggcctg accgaagcca tcgcctccca cagggcgctg 1680cactacaacc tgcacaacct ctacggcctg accgaagcca tcgcctccca cagggcgctg 1680

gtgaaggctc gggggacacg cccatttgtg atctcccgct cgacctttgc tggccacggc 1740gtgaaggctc gggggacacg cccatttgtg atctcccgct cgacctttgc tggccacggc 1740

cgatacgccg gccactggac gggggacgtg tggagctcct gggagcagct cgcctcctcc 1800cgatacgccg gccactggac gggggacgtg tggagctcct gggagcagct cgcctcctcc 1800

gtgccagaaa tcctgcagtt taacctgctg ggggtgcctc tggtcggggc cgacgtctgc 1860gtgccagaaa tcctgcagtt taacctgctg ggggtgcctc tggtcggggc cgacgtctgc 1860

ggcttcctgg gcaacacctc agaggagctg tgtgtgcgct ggacccagct gggggccttc 1920ggcttcctgg gcaacacctc agaggagctg tgtgtgcgct ggacccagct gggggccttc 1920

taccccttca tgcggaacca caacagcctg ctcagtctgc cccaggagcc gtacagcttc 1980taccccttca tgcggaacca caacagcctg ctcagtctgc cccaggagcc gtacagcttc 1980

agcgagccgg cccagcaggc catgaggaag gccctcaccc tgcgctacgc actcctcccc 2040agcgagccgg cccagcaggc catgaggaag gccctcaccc tgcgctacgc actcctcccc 2040

cacctctaca cactgttcca ccaggcccac gtcgcggggg agaccgtggc ccggcccctc 2100cacctctaca cactgttcca ccaggcccac gtcgcggggg agaccgtggc ccggcccctc 2100

ttcctggagt tccccaagga ctctagcacc tggactgtgg accaccagct cctgtggggg 2160ttcctggagt tccccaagga ctctagcacc tggactgtgg accaccagct cctgtggggg 2160

gaggccctgc tcatcacccc agtgctccag gccgggaagg ccgaagtgac tggctacttc 2220gaggccctgc tcatcacccc agtgctccag gccgggaagg ccgaagtgac tggctacttc 2220

cccttgggca catggtacga cctgcagacg gtgccagtag aggcccttgg cagcctccca 2280cccttgggca catggtacga cctgcagacg gtgccagtag aggcccttgg cagcctccca 2280

cccccacctg cagctccccg tgagccagcc atccacagcg aggggcagtg ggtgacgctg 2340cccccacctg cagctccccg tgagccagcc atccacagcg aggggcagtg ggtgacgctg 2340

ccggcccccc tggacaccat caacgtccac ctccgggctg ggtacatcat ccccctgcag 2400ccggcccccc tggacaccat caacgtccac ctccgggctg ggtacatcat ccccctgcag 2400

ggccctggcc tcacaaccac agagtcccgc cagcagccca tggccctggc tgtggccctg 2460ggccctggcc tcacaaccac agagtcccgc cagcagccca tggccctggc tgtggccctg 2460

accaagggtg gggaggcccg aggggagctg ttctgggacg atggagagag cctggaagtg 2520accaagggtg gggaggcccg aggggagctg ttctgggacg atggagagag cctggaagtg 2520

ctggagcgag gggcctacac acaggtcatc ttcctggcca ggaataacac gatcgtgaat 2580ctggagcgag gggcctacac acaggtcatc ttcctggcca ggaataacac gatcgtgaat 2580

gagctggtac gtgtgaccag tgagggagct ggcctgcagc tgcagaaggt gactgtcctg 2640gagctggtac gtgtgaccag tgagggagct ggcctgcagc tgcagaaggt gactgtcctg 2640

ggcgtggcca cggcgcccca gcaggtcctc tccaacggtg tccctgtctc caacttcacc 2700ggcgtggcca cggcgcccca gcaggtcctc tccaacggtg tccctgtctc caacttcacc 2700

tacagccccg acaccaaggt cctggacatc tgtgtctcgc tgttgatggg agagcagttt 2760tacagccccg acaccaaggt cctggacatc tgtgtctcgc tgttgatggg agagcagttt 2760

ctcgtcagct ggtgttag 2778ctcgtcagct ggtgttag 2778

<210> 10<210> 10

<211> 2778<211> 2778

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco1 w/o sp<223> hGAAco1 w/o sp

<400> 10<400> 10

ggccatatcc tgctgcacga ctttctacta gtgcccagag agctgagcgg cagctctccc 60ggccatatcc tgctgcacga ctttctacta gtgcccagag agctgagcgg cagctctccc 60

gtgctggaag aaacacaccc tgcccatcag cagggcgcct ctagacctgg acctagagat 120gtgctggaag aaacacaccc tgcccatcag cagggcgcct ctagacctgg acctagagat 120

gcccaggccc accccggcag acctagagct gtgcctaccc agtgtgacgt gccccccaac 180gcccaggccc accccggcag acctagagct gtgcctaccc agtgtgacgt gccccccaac 180

agcagattcg actgcgcccc tgacaaggcc atcacccagg aacagtgcga ggccagaggc 240agcagattcg actgcgcccc tgacaaggcc atcacccagg aacagtgcga ggccagaggc 240

tgctgctaca tccctgccaa gcagggactg cagggcgctc agatgggaca gccctggtgc 300tgctgctaca tccctgccaa gcagggactg cagggcgctc agatgggaca gccctggtgc 300

ttcttcccac cctcctaccc cagctacaag ctggaaaacc tgagcagcag cgagatgggc 360ttcttcccac cctcctaccc cagctacaag ctggaaaacc tgagcagcag cgagatgggc 360

tacaccgcca ccctgaccag aaccaccccc acattcttcc caaaggacat cctgaccctg 420tacaccgcca ccctgaccag aaccaccccc acattcttcc caaaggacat cctgaccctg 420

cggctggacg tgatgatgga aaccgagaac cggctgcact tcaccatcaa ggaccccgcc 480cggctggacg tgatgatgga aaccgagaac cggctgcact tcaccatcaa ggaccccgcc 480

aatcggagat acgaggtgcc cctggaaacc ccccacgtgc actctagagc ccccagccct 540aatcggagat acgaggtgcc cctggaaacc ccccacgtgc actctagagc ccccagccct 540

ctgtacagcg tggaattcag cgaggaaccc ttcggcgtga tcgtgcggag acagctggat 600ctgtacagcg tggaattcag cgaggaaccc ttcggcgtga tcgtgcggag acagctggat 600

ggcagagtgc tgctgaacac caccgtggcc cctctgttct tcgccgacca gttcctgcag 660ggcagagtgc tgctgaacac caccgtggcc cctctgttct tcgccgacca gttcctgcag 660

ctgagcacca gcctgcccag ccagtacatc acaggactgg ccgagcacct gagccccctg 720ctgagcacca gcctgcccag ccagtacatc acaggactgg ccgagcacct gagccccctg 720

atgctgagca catcctggac ccggatcacc ctgtggaaca gggatctggc ccctacccct 780atgctgagca catcctggac ccggatcacc ctgtggaaca gggatctggc ccctacccct 780

ggcgccaatc tgtacggcag ccaccctttc tacctggccc tggaagatgg cggatctgcc 840ggcgccaatc tgtacggcag ccaccctttc tacctggccc tggaagatgg cggatctgcc 840

cacggagtgt ttctgctgaa ctccaacgcc atggacgtgg tgctgcagcc tagccctgcc 900cacggagtgt ttctgctgaa ctccaacgcc atggacgtgg tgctgcagcc tagccctgcc 900

ctgtcttgga gaagcacagg cggcatcctg gatgtgtaca tctttctggg ccccgagccc 960ctgtcttgga gaagcacagg cggcatcctg gatgtgtaca tctttctggg ccccgagccc 960

aagagcgtgg tgcagcagta tctggatgtc gtgggctacc ccttcatgcc cccttactgg 1020aagagcgtgg tgcagcagta tctggatgtc gtgggctacc ccttcatgcc cccttactgg 1020

ggcctgggat tccacctgtg cagatggggc tactccagca ccgccatcac cagacaggtg 1080ggcctggggat tccacctgtg cagatggggc tactccagca ccgccatcac cagacaggtg 1080

gtggaaaaca tgaccagagc ccacttccca ctggatgtgc agtggaacga cctggactac 1140gtggaaaaca tgaccagagc ccacttccca ctggatgtgc agtggaacga cctggactac 1140

atggacagca gacgggactt caccttcaac aaggacggct tccgggactt ccccgccatg 1200atggacagca gacgggactt caccttcaac aaggacggct tccgggactt ccccgccatg 1200

gtgcaggaac tgcatcaggg cggcagacgg tacatgatga tcgtggatcc cgccatcagc 1260gtgcaggaac tgcatcaggg cggcagacgg tacatgatga tcgtggatcc cgccatcagc 1260

tcctctggcc ctgccggctc ttacagaccc tacgacgagg gcctgcggag aggcgtgttc 1320tcctctggcc ctgccggctc ttacagaccc tacgacgagg gcctgcggag aggcgtgttc 1320

atcaccaacg agacaggcca gcccctgatc ggcaaagtgt ggcctggcag cacagccttc 1380atcaccaacg agacaggcca gcccctgatc ggcaaagtgt ggcctggcag cacagccttc 1380

cccgacttca ccaatcctac cgccctggct tggtgggagg acatggtggc cgagttccac 1440cccgacttca ccaatcctac cgccctggct tggtgggagg acatggtggc cgagttccac 1440

gaccaggtgc ccttcgacgg catgtggatc gacatgaacg agcccagcaa cttcatccgg 1500gaccaggtgc ccttcgacgg catgtggatc gacatgaacg agcccagcaa cttcatccgg 1500

ggcagcgagg atggctgccc caacaacgaa ctggaaaatc ccccttacgt gcccggcgtc 1560ggcagcgagg atggctgccc caacaacgaa ctggaaaatc ccccttacgt gcccggcgtc 1560

gtgggcggaa cactgcaggc cgctacaatc tgtgccagca gccaccagtt tctgagcacc 1620gtgggcggaa cactgcaggc cgctacaatc tgtgccagca gccaccagtt tctgagcacc 1620

cactacaacc tgcacaacct gtacggcctg accgaggcca ttgccagcca ccgcgctctc 1680cactacaacc tgcacaacct gtacggcctg accgaggcca ttgccagcca ccgcgctctc 1680

gtgaaagcca gaggcacacg gcccttcgtg atcagcagaa gcacctttgc cggccacggc 1740gtgaaagcca gaggcacacg gcccttcgtg atcagcagaa gcacctttgc cggccacggc 1740

agatacgccg gacattggac tggcgacgtg tggtcctctt gggagcagct ggcctctagc 1800agatacgccg gacattggac tggcgacgtg tggtcctctt gggagcagct ggcctctagc 1800

gtgcccgaga tcctgcagtt caatctgctg ggcgtgccac tcgtgggcgc cgatgtgtgt 1860gtgcccgaga tcctgcagtt caatctgctg ggcgtgccac tcgtgggcgc cgatgtgtgt 1860

ggcttcctgg gcaacacctc cgaggaactg tgtgtgcggt ggacacagct gggcgccttc 1920ggcttcctgg gcaacacctc cgaggaactg tgtgtgcggt ggacacagct gggcgccttc 1920

taccctttca tgagaaacca caacagcctg ctgagcctgc cccaggaacc ctacagcttt 1980taccctttca tgagaaacca caacagcctg ctgagcctgc cccaggaacc ctacagcttt 1980

agcgagcctg cacagcaggc catgcggaag gccctgacac tgagatacgc tctgctgccc 2040agcgagcctg cacagcaggc catgcggaag gccctgacac tgagatacgc tctgctgccc 2040

cacctgtaca ccctgtttca ccaggcccat gtggccggcg agacagtggc cagacctctg 2100cacctgtaca ccctgtttca ccaggcccat gtggccggcg agacagtggc cagacctctg 2100

tttctggaat tccccaagga cagcagcacc tggaccgtgg accatcagct gctgtgggga 2160tttctggaat tccccaagga cagcagcacc tggaccgtgg accatcagct gctgtgggga 2160

gaggctctgc tgattacccc agtgctgcag gcaggcaagg ccgaagtgac cggctacttt 2220gaggctctgc tgattacccc agtgctgcag gcaggcaagg ccgaagtgac cggctacttt 2220

cccctgggca cttggtacga cctgcagacc gtgcctgtgg aagccctggg atctctgcct 2280cccctgggca cttggtacga cctgcagacc gtgcctgtgg aagccctggg atctctgcct 2280

ccacctcctg ccgctcctag agagcctgcc attcactctg agggccagtg ggtcacactg 2340ccacctcctg ccgctcctag agagcctgcc attcactctg agggccagtg ggtcacactg 2340

cctgcccccc tggataccat caacgtgcac ctgagggccg gctacatcat accactgcag 2400cctgcccccc tggataccat caacgtgcac ctgagggccg gctacatcat accactgcag 2400

ggacctggcc tgaccaccac cgagtctaga cagcagccaa tggccctggc cgtggccctg 2460ggacctggcc tgaccaccac cgagtctaga cagcagccaa tggccctggc cgtggccctg 2460

accaaaggcg gagaagctag gggcgagctg ttctgggacg atggcgagag cctggaagtg 2520accaaaggcg gagaagctag gggcgagctg ttctgggacg atggcgagag cctggaagtg 2520

ctggaaagag gcgcctatac ccaagtgatc ttcctggccc ggaacaacac catcgtgaac 2580ctggaaagag gcgcctatac ccaagtgatc ttcctggccc ggaacaacac catcgtgaac 2580

gagctggtgc gcgtgacctc tgaaggcgct ggactgcagc tgcagaaagt gaccgtgctg 2640gagctggtgc gcgtgacctc tgaaggcgct ggactgcagc tgcagaaagt gaccgtgctg 2640

ggagtggcca cagcccctca gcaggtgctg tctaatggcg tgcccgtgtc caacttcacc 2700ggagtggcca cagcccctca gcaggtgctg tctaatggcg tgcccgtgtc caacttcacc 2700

tacagccccg acaccaaggt gctggacatc tgcgtgtcac tgctgatggg agagcagttt 2760tacagccccg acaccaaggt gctggacatc tgcgtgtcac tgctgatggg agagcagttt 2760

ctggtgtcct ggtgctga 2778ctggtgtcct ggtgctga 2778

<210> 11<210> 11

<211> 2778<211> 2778

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco2 w/o sp<223> hGAAco2 w/o sp

<400> 11<400> 11

ggacacatcc tgctgcacga cttcctgttg gtgcctagag agctgagcgg atcatcccca 60ggacacatcc tgctgcacga cttcctgttg gtgcctagag agctgagcgg atcatcccca 60

gtgctggagg agactcatcc tgctcaccaa cagggagctt ccagaccagg accgagagac 120gtgctggagg agactcatcc tgctcaccaa cagggagctt ccagaccagg accgagagac 120

gcccaagccc atcctggtag accaagagct gtgcctaccc aatgcgacgt gccacccaac 180gcccaagccc atcctggtag accaagagct gtgcctaccc aatgcgacgt gccacccaac 180

tcccgattcg actgcgcgcc agataaggct attacccaag agcagtgtga agccagaggt 240tcccgattcg actgcgcgcc agataaggct attacccaag agcagtgtga agccagaggt 240

tgctgctaca tcccagcgaa gcaaggattg caaggcgccc aaatgggaca accttggtgt 300tgctgctaca tcccagcgaa gcaaggattg caaggcgccc aaatgggaca accttggtgt 300

ttcttccccc cttcgtaccc atcatataaa ctcgaaaacc tgtcctcttc ggaaatgggt 360ttcttccccc cttcgtaccc atcatataaa ctcgaaaacc tgtcctcttc ggaaatgggt 360

tatactgcca ccctcaccag aactactcct actttcttcc cgaaagacat cttgaccttg 420tatactgcca ccctcaccag aactactcct actttcttcc cgaaagacat cttgaccttg 420

aggctggacg tgatgatgga gactgaaaac cggctgcatt tcactatcaa agatcctgcc 480aggctggacg tgatgatgga gactgaaaac cggctgcatt tcactatcaa agatcctgcc 480

aatcggcgat acgaggtccc tctggaaacc cctcacgtgc actcacgggc tccttctccg 540aatcggcgat acgaggtccc tctggaaacc cctcacgtgc actcacgggc tccttctccg 540

ctttactccg tcgaattctc tgaggaaccc ttcggagtga tcgttagacg ccagctggat 600ctttactccg tcgaattctc tgaggaaccc ttcggagtga tcgttagacg ccagctggat 600

ggtagagtgc tgttgaacac tactgtggcc ccacttttct tcgctgacca gtttctgcaa 660ggtagagtgc tgttgaacac tactgtggcc ccacttttct tcgctgacca gtttctgcaa 660

ctgtccactt ccctgccatc ccagtacatt actggactcg ccgaacacct gtcgccactg 720ctgtccactt ccctgccatc ccagtacatt actggactcg ccgaacacct gtcgccactg 720

atgctctcga cctcttggac tagaatcact ttgtggaaca gagacttggc ccctactccg 780atgctctcga cctcttggac tagaatcact ttgtggaaca gagacttggc ccctactccg 780

ggagcaaatc tgtacggaag ccaccctttt tacctggcgc tcgaagatgg cggatccgct 840ggagcaaatc tgtacggaag ccaccctttt tacctggcgc tcgaagatgg cggatccgct 840

cacggagtgt tcctgctgaa tagcaacgca atggacgtgg tgctgcaacc ttcccctgca 900cacggagtgt tcctgctgaa tagcaacgca atggacgtgg tgctgcaacc ttcccctgca 900

ctcagttgga gaagtaccgg gggtattctg gacgtgtaca tcttcctcgg accagaaccc 960ctcagttgga gaagtaccgg gggtattctg gacgtgtaca tcttcctcgg accagaaccc 960

aagagcgtgg tgcagcaata tctggacgtg gtcggatacc cttttatgcc tccttactgg 1020aagagcgtgg tgcagcaata tctggacgtg gtcggatacc cttttatgcc tccttatactgg 1020

ggactgggat tccacctttg ccgttggggc tactcatcca ccgccattac cagacaggtg 1080ggactggggat tccacctttg ccgttggggc tactcatcca ccgccattac cagacaggtg 1080

gtggagaata tgaccagagc ccacttccct ctcgacgtgc agtggaacga tctggactat 1140gtggagaata tgaccagagc ccacttccct ctcgacgtgc agtggaacga tctggactat 1140

atggactccc ggagagattt caccttcaac aaggacgggt tccgcgattt tcccgcgatg 1200atggactccc ggagagattt caccttcaac aaggacgggt tccgcgattt tcccgcgatg 1200

gttcaagagc tccaccaggg tggtcgaaga tatatgatga tcgtcgaccc agccatttcg 1260gttcaagagc tccaccagggg tggtcgaaga tatatgatga tcgtcgaccc agccatttcg 1260

agcagcggac ccgctggatc ttatagacct tacgacgaag gccttaggag aggagtgttc 1320agcagcggac ccgctggatc ttatagacct tacgacgaag gccttaggag aggagtgttc 1320

atcacaaacg agactggaca gcctttgatc ggtaaagtgt ggcctggatc aaccgccttt 1380atcacaaacg agactggaca gcctttgatc ggtaaagtgt ggcctggatc aaccgccttt 1380

cctgacttta ccaatcccac tgccttggct tggtgggagg acatggtggc cgaattccac 1440cctgacttta ccaatcccac tgccttggct tggtgggagg acatggtggc cgaattccac 1440

gaccaagtcc cctttgatgg aatgtggatc gatatgaacg aaccaagcaa ttttatcaga 1500gaccaagtcc cctttgatgg aatgtggatc gatatgaacg aaccaagcaa ttttatcaga 1500

ggttccgaag acggttgccc caacaacgaa ctggaaaacc ctccttatgt gcccggagtc 1560ggttccgaag acggttgccc caacaacgaa ctggaaaacc ctccttatgt gcccggagtc 1560

gtgggcggaa cattacaggc cgcgactatt tgcgccagca gccaccaatt cctgtccact 1620gtgggcggaa cattacaggc cgcgactatt tgcgccagca gccaccaatt cctgtccact 1620

cactacaacc tccacaacct ttatggatta accgaagcta ttgcaagtca cagggctctg 1680cactacaacc tccacaacct ttatggatta accgaagcta ttgcaagtca cagggctctg 1680

gtgaaggcta gagggactag gccctttgtg atctcccgat ccacctttgc cggacacggg 1740gtgaaggcta gagggactag gccctttgtg atctcccgat ccacctttgc cggacacggg 1740

agatacgccg gtcactggac tggtgacgtg tggagctcat gggaacaact ggcctcctcc 1800agatacgccg gtcactggac tggtgacgtg tggagctcat gggaacaact ggcctcctcc 1800

gtgccggaaa tcttacagtt caaccttctg ggtgtccctc ttgtcggagc agacgtgtgt 1860gtgccggaaa tcttacagtt caaccttctg ggtgtccctc ttgtcggagc agacgtgtgt 1860

gggtttcttg gtaacacctc cgaggaactg tgtgtgcgct ggactcaact gggtgcattc 1920gggtttcttg gtaacacctc cgaggaactg tgtgtgcgct ggactcaact gggtgcattc 1920

tacccattca tgagaaacca caactccttg ctgtccctgc cacaagagcc ctactcgttc 1980tacccattca tgagaaacca caactccttg ctgtccctgc cacaagagcc ctactcgttc 1980

agcgagcctg cacaacaggc tatgcggaag gcactgaccc tgagatacgc cctgcttcca 2040agcgagcctg cacaacaggc tatgcggaag gcactgaccc tgagatacgc cctgcttcca 2040

cacttataca ctctcttcca tcaagcgcat gtggcaggag aaaccgttgc aaggcctctt 2100cacttataca ctctcttcca tcaagcgcat gtggcaggag aaaccgttgc aaggcctctt 2100

ttccttgaat tccccaagga ttcctcgact tggacggtgg atcatcagct gctgtgggga 2160ttccttgaat tccccaagga ttcctcgact tggacggtgg atcatcagct gctgtgggga 2160

gaagctctgc tgattactcc agtgttgcaa gccggaaaag ctgaggtgac cggatacttt 2220gaagctctgc tgattactcc agtgttgcaa gccggaaaag ctgaggtgac cggatacttt 2220

ccgctgggaa cctggtacga cctccagact gtccctgttg aagcccttgg atcactgcct 2280ccgctgggaa cctggtacga cctccagact gtccctgttg aagcccttgg atcactgcct 2280

ccgcctccgg cagctccacg cgaaccagct atacattccg agggacagtg ggttacatta 2340ccgcctccgg cagctccacg cgaaccagct atacattccg agggacagtg ggttacatta 2340

ccagctcctc tggacacaat caacgtccac ttaagagctg gctacattat ccctctgcaa 2400ccagctcctc tggacacaat caacgtccac ttaagagctg gctacattat ccctctgcaa 2400

ggaccaggac tgactacgac cgagagcaga cagcagccaa tggcactggc tgtggctctg 2460ggaccaggac tgactacgac cgagagcaga cagcagccaa tggcactggc tgtggctctg 2460

accaagggag gggaagctag aggagaactc ttctgggatg atggggagtc ccttgaagtg 2520accaagggag gggaagctag aggagaactc ttctgggatg atggggagtc ccttgaagtg 2520

ctggaaagag gcgcttacac tcaagtcatt ttccttgcac ggaacaacac cattgtgaac 2580ctggaaagag gcgcttacac tcaagtcatt ttccttgcac ggaacaacac cattgtgaac 2580

gaattggtgc gagtgaccag cgaaggagct ggacttcaac tgcagaaggt cactgtgctc 2640gaattggtgc gagtgaccag cgaaggagct ggacttcaac tgcagaaggt cactgtgctc 2640

ggagtggcta ccgctcctca gcaagtgctg tcgaatggag tccccgtgtc aaactttacc 2700ggagtggcta ccgctcctca gcaagtgctg tcgaatggag tccccgtgtc aaactttacc 2700

tactcccctg acactaaggt gctcgacatt tgcgtgtccc tcctgatggg agagcagttc 2760tactcccctg acactaaggt gctcgacatt tgcgtgtccc tcctgatggg agagcagttc 2760

cttgtgtcct ggtgttga 2778cttgtgtcctggtgttga 2778

<210> 12<210> 12

<211> 2754<211> 2754

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco1-Δ-8 w/o sp<223> hGAAco1-Δ-8 w/o sp

<400> 12<400> 12

ctactagtgc ccagagagct gagcggcagc tctcccgtgc tggaagaaac acaccctgcc 60ctactagtgc ccagagagct gagcggcagc tctcccgtgc tggaagaaac acaccctgcc 60

catcagcagg gcgcctctag acctggacct agagatgccc aggcccaccc cggcagacct 120catcagcagg gcgcctctag acctggacct agagatgccc aggcccaccc cggcagacct 120

agagctgtgc ctacccagtg tgacgtgccc cccaacagca gattcgactg cgcccctgac 180agagctgtgc ctacccagtg tgacgtgccc cccaacagca gattcgactg cgcccctgac 180

aaggccatca cccaggaaca gtgcgaggcc agaggctgct gctacatccc tgccaagcag 240aaggccatca cccaggaaca gtgcgaggcc agaggctgct gctacatccc tgccaagcag 240

ggactgcagg gcgctcagat gggacagccc tggtgcttct tcccaccctc ctaccccagc 300ggactgcagg gcgctcagat gggacagccc tggtgcttct tcccaccctc ctaccccagc 300

tacaagctgg aaaacctgag cagcagcgag atgggctaca ccgccaccct gaccagaacc 360tacaagctgg aaaacctgag cagcagcgag atgggctaca ccgccaccct gaccagaacc 360

acccccacat tcttcccaaa ggacatcctg accctgcggc tggacgtgat gatggaaacc 420acccccacat tcttcccaaa ggacatcctg accctgcggc tggacgtgat gatggaaacc 420

gagaaccggc tgcacttcac catcaaggac cccgccaatc ggagatacga ggtgcccctg 480gagaaccggc tgcacttcac catcaaggac cccgccaatc ggagatacga ggtgcccctg 480

gaaacccccc acgtgcactc tagagccccc agccctctgt acagcgtgga attcagcgag 540gaaacccccc acgtgcactc tagagccccc agccctctgt acagcgtgga attcagcgag 540

gaacccttcg gcgtgatcgt gcggagacag ctggatggca gagtgctgct gaacaccacc 600gaacccttcg gcgtgatcgt gcggagacag ctggatggca gagtgctgct gaacaccacc 600

gtggcccctc tgttcttcgc cgaccagttc ctgcagctga gcaccagcct gcccagccag 660gtggcccctc tgttcttcgc cgaccagttc ctgcagctga gcaccagcct gcccagccag 660

tacatcacag gactggccga gcacctgagc cccctgatgc tgagcacatc ctggacccgg 720tacatcacag gactggccga gcacctgagc cccctgatgc tgagcacatc ctggacccgg 720

atcaccctgt ggaacaggga tctggcccct acccctggcg ccaatctgta cggcagccac 780atcaccctgt ggaacaggga tctggcccct acccctggcg ccaatctgta cggcagccac 780

cctttctacc tggccctgga agatggcgga tctgcccacg gagtgtttct gctgaactcc 840cctttctacc tggccctgga agatggcgga tctgcccacg gagtgtttct gctgaactcc 840

aacgccatgg acgtggtgct gcagcctagc cctgccctgt cttggagaag cacaggcggc 900aacgccatgg acgtggtgct gcagcctagc cctgccctgt cttggagaag cacaggcggc 900

atcctggatg tgtacatctt tctgggcccc gagcccaaga gcgtggtgca gcagtatctg 960atcctggatg tgtacatctt tctgggcccc gagcccaaga gcgtggtgca gcagtatctg 960

gatgtcgtgg gctacccctt catgccccct tactggggcc tgggattcca cctgtgcaga 1020gatgtcgtgg gctacccctt catgccccct tactggggcc tgggattcca cctgtgcaga 1020

tggggctact ccagcaccgc catcaccaga caggtggtgg aaaacatgac cagagcccac 1080tggggctact ccagcaccgc catcaccaga caggtggtgg aaaacatgac cagagcccac 1080

ttcccactgg atgtgcagtg gaacgacctg gactacatgg acagcagacg ggacttcacc 1140ttcccactgg atgtgcagtg gaacgacctg gactacatgg acagcagacg ggacttcacc 1140

ttcaacaagg acggcttccg ggacttcccc gccatggtgc aggaactgca tcagggcggc 1200ttcaacaagg acggcttccg ggacttcccc gccatggtgc aggaactgca tcagggcggc 1200

agacggtaca tgatgatcgt ggatcccgcc atcagctcct ctggccctgc cggctcttac 1260agacggtaca tgatgatcgt ggatcccgcc atcagctcct ctggccctgc cggctcttac 1260

agaccctacg acgagggcct gcggagaggc gtgttcatca ccaacgagac aggccagccc 1320agaccctacg acgagggcct gcggagaggc gtgttcatca ccaacgagac aggccagccc 1320

ctgatcggca aagtgtggcc tggcagcaca gccttccccg acttcaccaa tcctaccgcc 1380ctgatcggca aagtgtggcc tggcagcaca gccttccccg acttcaccaa tcctaccgcc 1380

ctggcttggt gggaggacat ggtggccgag ttccacgacc aggtgccctt cgacggcatg 1440ctggcttggt gggaggacat ggtggccgag ttccacgacc aggtgccctt cgacggcatg 1440

tggatcgaca tgaacgagcc cagcaacttc atccggggca gcgaggatgg ctgccccaac 1500tggatcgaca tgaacgagcc cagcaacttc atccggggca gcgaggatgg ctgccccaac 1500

aacgaactgg aaaatccccc ttacgtgccc ggcgtcgtgg gcggaacact gcaggccgct 1560aacgaactgg aaaatccccc ttacgtgccc ggcgtcgtgg gcggaacact gcaggccgct 1560

acaatctgtg ccagcagcca ccagtttctg agcacccact acaacctgca caacctgtac 1620acaatctgtg ccagcagcca ccagtttctg agcacccact acaacctgca caacctgtac 1620

ggcctgaccg aggccattgc cagccaccgc gctctcgtga aagccagagg cacacggccc 1680ggcctgaccg aggccattgc cagccaccgc gctctcgtga aagccagagg cacacggccc 1680

ttcgtgatca gcagaagcac ctttgccggc cacggcagat acgccggaca ttggactggc 1740ttcgtgatca gcagaagcac ctttgccggc cacggcagat acgccggaca ttggactggc 1740

gacgtgtggt cctcttggga gcagctggcc tctagcgtgc ccgagatcct gcagttcaat 1800gacgtgtggt cctcttggga gcagctggcc tctagcgtgc ccgagatcct gcagttcaat 1800

ctgctgggcg tgccactcgt gggcgccgat gtgtgtggct tcctgggcaa cacctccgag 1860ctgctgggcg tgccactcgt gggcgccgat gtgtgtggct tcctgggcaa cacctccgag 1860

gaactgtgtg tgcggtggac acagctgggc gccttctacc ctttcatgag aaaccacaac 1920gaactgtgtg tgcggtggac acagctgggc gccttctacc ctttcatgag aaaccacaac 1920

agcctgctga gcctgcccca ggaaccctac agctttagcg agcctgcaca gcaggccatg 1980agcctgctga gcctgcccca ggaaccctac agctttagcg agcctgcaca gcaggccatg 1980

cggaaggccc tgacactgag atacgctctg ctgccccacc tgtacaccct gtttcaccag 2040cggaaggccc tgacactgag atacgctctg ctgccccacc tgtacaccct gtttcaccag 2040

gcccatgtgg ccggcgagac agtggccaga cctctgtttc tggaattccc caaggacagc 2100gcccatgtgg ccggcgagac agtggccaga cctctgtttc tggaattccc caaggacagc 2100

agcacctgga ccgtggacca tcagctgctg tggggagagg ctctgctgat taccccagtg 2160agcacctgga ccgtggacca tcagctgctg tggggagagg ctctgctgat taccccagtg 2160

ctgcaggcag gcaaggccga agtgaccggc tactttcccc tgggcacttg gtacgacctg 2220ctgcaggcag gcaaggccga agtgaccggc tactttcccc tgggcacttg gtacgacctg 2220

cagaccgtgc ctgtggaagc cctgggatct ctgcctccac ctcctgccgc tcctagagag 2280cagaccgtgc ctgtggaagc cctgggatct ctgcctccac ctcctgccgc tcctagagag 2280

cctgccattc actctgaggg ccagtgggtc acactgcctg cccccctgga taccatcaac 2340cctgccattc actctgaggg ccagtgggtc acactgcctg cccccctgga taccatcaac 2340

gtgcacctga gggccggcta catcatacca ctgcagggac ctggcctgac caccaccgag 2400gtgcacctga gggccggcta catcatacca ctgcagggac ctggcctgac caccacccgag 2400

tctagacagc agccaatggc cctggccgtg gccctgacca aaggcggaga agctaggggc 2460tctagacagc agccaatggc cctggccgtg gccctgacca aaggcggaga agctaggggc 2460

gagctgttct gggacgatgg cgagagcctg gaagtgctgg aaagaggcgc ctatacccaa 2520gagctgttct gggacgatgg cgagagcctg gaagtgctgg aaagaggcgc ctatacccaa 2520

gtgatcttcc tggcccggaa caacaccatc gtgaacgagc tggtgcgcgt gacctctgaa 2580gtgatcttcc tggccccggaa caacaccatc gtgaacgagc tggtgcgcgt gacctctgaa 2580

ggcgctggac tgcagctgca gaaagtgacc gtgctgggag tggccacagc ccctcagcag 2640ggcgctggac tgcagctgca gaaagtgacc gtgctgggag tggccacagc ccctcagcag 2640

gtgctgtcta atggcgtgcc cgtgtccaac ttcacctaca gccccgacac caaggtgctg 2700gtgctgtcta atggcgtgcc cgtgtccaac ttcacctaca gccccgacac caaggtgctg 2700

gacatctgcg tgtcactgct gatgggagag cagtttctgg tgtcctggtg ctga 2754gacatctgcg tgtcactgct gatggggagag cagtttctgg tgtcctggtg ctga 2754

<210> 13<210> 13

<211> 2754<211> 2754

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco2-Δ8 w/o sp<223> hGAAco2-Δ8 w/o sp

<400> 13<400> 13

ctgttggtgc ctagagagct gagcggatca tccccagtgc tggaggagac tcatcctgct 60ctgttggtgc ctagagagct gagcggatca tccccagtgc tggaggagac tcatcctgct 60

caccaacagg gagcttccag accaggaccg agagacgccc aagcccatcc tggtagacca 120caccaacagg gagcttccag accaggaccg agagacgccc aagcccatcc tggtagacca 120

agagctgtgc ctacccaatg cgacgtgcca cccaactccc gattcgactg cgcgccagat 180agagctgtgc ctacccaatg cgacgtgcca cccaactccc gattcgactg cgcgccagat 180

aaggctatta cccaagagca gtgtgaagcc agaggttgct gctacatccc agcgaagcaa 240aaggctatta cccaagagca gtgtgaagcc agaggttgct gctacatccc agcgaagcaa 240

ggattgcaag gcgcccaaat gggacaacct tggtgtttct tccccccttc gtacccatca 300ggattgcaag gcgcccaaat gggacaacct tggtgtttct tccccccttc gtacccatca 300

tataaactcg aaaacctgtc ctcttcggaa atgggttata ctgccaccct caccagaact 360tataaactcg aaaacctgtc ctcttcggaa atgggttata ctgccaccct caccagaact 360

actcctactt tcttcccgaa agacatcttg accttgaggc tggacgtgat gatggagact 420actcctactt tcttcccgaa agacatcttg accttgaggc tggacgtgat gatggagact 420

gaaaaccggc tgcatttcac tatcaaagat cctgccaatc ggcgatacga ggtccctctg 480gaaaaccggc tgcatttcac tatcaaagat cctgccaatc ggcgatacga ggtccctctg 480

gaaacccctc acgtgcactc acgggctcct tctccgcttt actccgtcga attctctgag 540gaaacccctc acgtgcactc acgggctcct tctccgcttt actccgtcga attctctgag 540

gaacccttcg gagtgatcgt tagacgccag ctggatggta gagtgctgtt gaacactact 600gaacccttcg gagtgatcgt tagacgccag ctggatggta gagtgctgtt gaacactact 600

gtggccccac ttttcttcgc tgaccagttt ctgcaactgt ccacttccct gccatcccag 660gtggccccac ttttcttcgc tgaccagttt ctgcaactgt ccacttccct gccatcccag 660

tacattactg gactcgccga acacctgtcg ccactgatgc tctcgacctc ttggactaga 720tacattactg gactcgccga acacctgtcg ccactgatgc tctcgacctc ttggactaga 720

atcactttgt ggaacagaga cttggcccct actccgggag caaatctgta cggaagccac 780atcactttgt ggaacagaga cttggcccct actccgggag caaatctgta cggaagccac 780

cctttttacc tggcgctcga agatggcgga tccgctcacg gagtgttcct gctgaatagc 840cctttttacc tggcgctcga agatggcgga tccgctcacg gagtgttcct gctgaatagc 840

aacgcaatgg acgtggtgct gcaaccttcc cctgcactca gttggagaag taccgggggt 900aacgcaatgg acgtggtgct gcaaccttcc cctgcactca gttggagaag taccgggggt 900

attctggacg tgtacatctt cctcggacca gaacccaaga gcgtggtgca gcaatatctg 960attctggacg tgtacatctt cctcggacca gaacccaaga gcgtggtgca gcaatatctg 960

gacgtggtcg gatacccttt tatgcctcct tactggggac tgggattcca cctttgccgt 1020gacgtggtcg gatacccttt tatgcctcct tactggggac tgggattcca cctttgccgt 1020

tggggctact catccaccgc cattaccaga caggtggtgg agaatatgac cagagcccac 1080tggggctact catccaccgc cattaccaga caggtggtgg agaatatgac cagagcccac 1080

ttccctctcg acgtgcagtg gaacgatctg gactatatgg actcccggag agatttcacc 1140ttccctctcg acgtgcagtg gaacgatctg gactatatgg actcccggag agatttcacc 1140

ttcaacaagg acgggttccg cgattttccc gcgatggttc aagagctcca ccagggtggt 1200ttcaacaagg acgggttccg cgattttccc gcgatggttc aagagctcca ccagggtggt 1200

cgaagatata tgatgatcgt cgacccagcc atttcgagca gcggacccgc tggatcttat 1260cgaagatata tgatgatcgt cgacccagcc atttcgagca gcggacccgc tggatctttat 1260

agaccttacg acgaaggcct taggagagga gtgttcatca caaacgagac tggacagcct 1320agaccttacg acgaaggcct taggagagga gtgttcatca caaacgagac tggacagcct 1320

ttgatcggta aagtgtggcc tggatcaacc gcctttcctg actttaccaa tcccactgcc 1380ttgatcggta aagtgtggcc tggatcaacc gcctttcctg actttaccaa tcccactgcc 1380

ttggcttggt gggaggacat ggtggccgaa ttccacgacc aagtcccctt tgatggaatg 1440ttggcttggt gggaggacat ggtggccgaa ttccacgacc aagtcccctt tgatggaatg 1440

tggatcgata tgaacgaacc aagcaatttt atcagaggtt ccgaagacgg ttgccccaac 1500tggatcgata tgaacgaacc aagcaatttt atcagaggtt ccgaagacgg ttgccccaac 1500

aacgaactgg aaaaccctcc ttatgtgccc ggagtcgtgg gcggaacatt acaggccgcg 1560aacgaactgg aaaaccctcc ttatgtgccc ggagtcgtgg gcggaacatt acaggccgcg 1560

actatttgcg ccagcagcca ccaattcctg tccactcact acaacctcca caacctttat 1620actatttgcg ccagcagcca ccaattcctg tccactcact acaacctcca caacctttat 1620

ggattaaccg aagctattgc aagtcacagg gctctggtga aggctagagg gactaggccc 1680ggattaaccg aagctattgc aagtcacagg gctctggtga aggctagagg gactaggccc 1680

tttgtgatct cccgatccac ctttgccgga cacgggagat acgccggtca ctggactggt 1740tttgtgatct cccgatccac ctttgccgga cacggggagat acgccggtca ctggactggt 1740

gacgtgtgga gctcatggga acaactggcc tcctccgtgc cggaaatctt acagttcaac 1800gacgtgtgga gctcatggga acaactggcc tcctccgtgc cggaaatctt acagttcaac 1800

cttctgggtg tccctcttgt cggagcagac gtgtgtgggt ttcttggtaa cacctccgag 1860cttctgggtg tccctcttgt cggagcagac gtgtgtgggt ttcttggtaa cacctccgag 1860

gaactgtgtg tgcgctggac tcaactgggt gcattctacc cattcatgag aaaccacaac 1920gaactgtgtg tgcgctggac tcaactgggt gcattctacc cattcatgag aaaccacaac 1920

tccttgctgt ccctgccaca agagccctac tcgttcagcg agcctgcaca acaggctatg 1980tccttgctgt ccctgccaca agagccctac tcgttcagcg agcctgcaca acaggctatg 1980

cggaaggcac tgaccctgag atacgccctg cttccacact tatacactct cttccatcaa 2040cggaaggcac tgaccctgag atacgccctg cttccacact tatacactct cttccatcaa 2040

gcgcatgtgg caggagaaac cgttgcaagg cctcttttcc ttgaattccc caaggattcc 2100gcgcatgtgg caggagaaac cgttgcaagg cctcttttcc ttgaattccc caaggattcc 2100

tcgacttgga cggtggatca tcagctgctg tggggagaag ctctgctgat tactccagtg 2160tcgacttgga cggtggatca tcagctgctg tggggagaag ctctgctgat tactccagtg 2160

ttgcaagccg gaaaagctga ggtgaccgga tactttccgc tgggaacctg gtacgacctc 2220ttgcaagccg gaaaagctga ggtgaccgga tactttccgc tgggaacctg gtacgacctc 2220

cagactgtcc ctgttgaagc ccttggatca ctgcctccgc ctccggcagc tccacgcgaa 2280cagactgtcc ctgttgaagc ccttggatca ctgcctccgc ctccggcagc tccacgcgaa 2280

ccagctatac attccgaggg acagtgggtt acattaccag ctcctctgga cacaatcaac 2340ccagctatac attccgaggg acagtgggtt acattaccag ctcctctgga cacaatcaac 2340

gtccacttaa gagctggcta cattatccct ctgcaaggac caggactgac tacgaccgag 2400gtccacttaa gagctggcta cattatccct ctgcaaggac caggactgac tacgaccgag 2400

agcagacagc agccaatggc actggctgtg gctctgacca agggagggga agctagagga 2460agcagacagc agccaatggc actggctgtg gctctgacca agggagggga agctagagga 2460

gaactcttct gggatgatgg ggagtccctt gaagtgctgg aaagaggcgc ttacactcaa 2520gaactcttct gggatgatgg ggagtccctt gaagtgctgg aaagaggcgc ttacactcaa 2520

gtcattttcc ttgcacggaa caacaccatt gtgaacgaat tggtgcgagt gaccagcgaa 2580gtcattttcc ttgcacggaa caacaccatt gtgaacgaat tggtgcgagt gaccagcgaa 2580

ggagctggac ttcaactgca gaaggtcact gtgctcggag tggctaccgc tcctcagcaa 2640ggagctggac ttcaactgca gaaggtcact gtgctcggag tggctaccgc tcctcagcaa 2640

gtgctgtcga atggagtccc cgtgtcaaac tttacctact cccctgacac taaggtgctc 2700gtgctgtcga atggagtccc cgtgtcaaac tttacctact cccctgacac taaggtgctc 2700

gacatttgcg tgtccctcct gatgggagag cagttccttg tgtcctggtg ttga 2754gacatttgcg tgtccctcct gatgggag cagttccttg tgtcctggtg ttga 2754

<210> 14<210> 14

<211> 397<211> 397

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hAAT启动子<223> hAAT promoter

<400> 14<400> 14

gatcttgcta ccagtggaac agccactaag gattctgcag tgagagcaga gggccagcta 60gatcttgcta ccagtggaac agccactaag gattctgcag tgagagcaga gggccagcta 60

agtggtactc tcccagagac tgtctgactc acgccacccc ctccaccttg gacacaggac 120agtggtactc tcccagagac tgtctgactc acgccacccc ctccaccttg gacacaggac 120

gctgtggttt ctgagccagg tacaatgact cctttcggta agtgcagtgg aagctgtaca 180gctgtggttt ctgagccagg tacaatgact cctttcggta agtgcagtgg aagctgtaca 180

ctgcccaggc aaagcgtccg ggcagcgtag gcgggcgact cagatcccag ccagtggact 240ctgcccaggc aaagcgtccg ggcagcgtag gcgggcgact cagatcccag ccagtggact 240

tagcccctgt ttgctcctcc gataactggg gtgaccttgg ttaatattca ccagcagcct 300tagcccctgt ttgctcctcc gataactggg gtgaccttgg ttaatattca ccagcagcct 300

cccccgttgc ccctctggat ccactgctta aatacggacg aggacagggc cctgtctcct 360cccccgttgc ccctctggat ccactgctta aatacggacg aggacagggc cctgtctcct 360

cagcttcagg caccaccact gacctgggac agtgaat 397cagcttcagg caccacccact gacctgggac agtgaat 397

<210> 15<210> 15

<211> 321<211> 321

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> ApoE控制区<223> ApoE control area

<400> 15<400> 15

aggctcagag gcacacagga gtttctgggc tcaccctgcc cccttccaac ccctcagttc 60aggctcagag gcacacagga gtttctgggc tcaccctgcc cccttccaac ccctcagttc 60

ccatcctcca gcagctgttt gtgtgctgcc tctgaagtcc acactgaaca aacttcagcc 120ccatcctcca gcagctgttt gtgtgctgcc tctgaagtcc acactgaaca aacttcagcc 120

tactcatgtc cctaaaatgg gcaaacattg caagcagcaa acagcaaaca cacagccctc 180tactcatgtc cctaaaatgg gcaaacattg caagcagcaa acagcaaaca cacagccctc 180

cctgcctgct gaccttggag ctggggcaga ggtcagagac ctctctgggc ccatgccacc 240cctgcctgct gaccttggag ctggggcaga ggtcagagac ctctctgggc ccatgccacc 240

tccaacatcc actcgacccc ttggaatttc ggtggagagg agcagaggtt gtcctggcgt 300tccaacatcc actcgacccc ttggaatttc ggtggagagg agcagaggtt gtcctggcgt 300

ggtttaggta gtgtgagagg g 321ggtttaggta gtgtgagagg g 321

<210> 16<210> 16

<211> 441<211> 441

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> HBB2内含子<223> HBB2 intron

<400> 16<400> 16

gtacacatat tgaccaaatc agggtaattt tgcatttgta attttaaaaa atgctttctt 60gtacacatat tgaccaaatc agggtaattt tgcatttgta attttaaaaa atgctttctt 60

cttttaatat acttttttgt ttatcttatt tctaatactt tccctaatct ctttctttca 120cttttaatat acttttttgt ttatcttatt tctaatactt tccctaatct ctttctttca 120

gggcaataat gatacaatgt atcatgcctc tttgcaccat tctaaagaat aacagtgata 180gggcaataat gatacaatgt atcatgcctc tttgcaccat tctaaagaat aacagtgata 180

atttctgggt taaggcaata gcaatatttc tgcatataaa tatttctgca tataaattgt 240atttctgggt taaggcaata gcaatatttc tgcatataaa tatttctgca tataaattgt 240

aactgatgta agaggtttca tattgctaat agcagctaca atccagctac cattctgctt 300aactgatgta agaggtttca tattgctaat agcagctaca atccagctac cattctgctt 300

ttattttatg gttgggataa ggctggatta ttctgagtcc aagctaggcc cttttgctaa 360ttattttatg gttggggataa ggctggatta ttctgagtcc aagctaggcc cttttgctaa 360

tcatgttcat acctcttatc ttcctcccac agctcctggg caacgtgctg gtctgtgtgc 420tcatgttcat acctcttatc ttcctcccac agctcctggg caacgtgctg gtctgtgtgc 420

tggcccatca ctttggcaaa g 441tggcccatca ctttggcaaaa g 441

<210> 17<210> 17

<211> 441<211> 441

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 修饰的HBB2内含子<223> Modified HBB2 intron

<400> 17<400> 17

ttattttctg gttgggataa ggctggatta ttctgagtcc aagctaggcc cttttgctaa 360ttattttctg gttgggataa ggctggatta ttctgagtcc aagctaggcc cttttgctaa 360

tcttgttcat acctcttatc ttcctcccac agctcctggg caacctgctg gtctctctgc 420tcttgttcat acctcttatc ttcctcccac agctcctggg caacctgctg gtctctctgc 420

tggcccatca ctttggcaaa g 441tggcccatca ctttggcaaaa g 441

<210> 18<210> 18

<211> 1438<211> 1438

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> FIX内含子<223> FIX intron

<400> 18<400> 18

ggtttgtttc cttttttaaa atacattgag tatgcttgcc ttttagatat agaaatatct 60ggtttgtttc cttttttaaa atacattgag tatgcttgcc ttttagatat agaaatatct 60

gatgctgtct tcttcactaa attttgatta catgatttga cagcaatatt gaagagtcta 120gatgctgtct tcttcactaa attttgatta catgatttga cagcaatatt gaagagtcta 120

acagccagca cgcaggttgg taagtactgg ttctttgtta gctaggtttt cttcttcttc 180acagccagca cgcaggttgg taagtactgg ttctttgtta gctaggtttt cttcttcttc 180

atttttaaaa ctaaatagat cgacaatgct tatgatgcat ttatgtttaa taaacactgt 240atttttaaaa ctaaatagat cgacaatgct tatgatgcat ttatgtttaa taaacactgt 240

tcagttcatg atttggtcat gtaattcctg ttagaaaaca ttcatctcct tggtttaaaa 300tcagttcatg atttggtcat gtaattcctg ttagaaaaca ttcatctcct tggtttaaaa 300

aaattaaaag tgggaaaaca aagaaatagc agaatatagt gaaaaaaaat aaccacatta 360aaattaaaag tgggaaaaca aagaaatagc agaatatagt gaaaaaaaat aaccacatta 360

tttttgtttg gacttaccac tttgaaatca aaatgggaaa caaaagcaca aacaatggcc 420tttttgtttg gacttaccac tttgaaatca aaatgggaaa caaaagcaca aacaatggcc 420

ttatttacac aaaaagtctg attttaagat atatgacatt tcaaggtttc agaagtatgt 480ttatttacac aaaaagtctg attttaagat atatgacatt tcaaggtttc agaagtatgt 480

aatgaggtgt gtctctaatt ttttaaatta tatatcttca atttaaagtt ttagttaaaa 540aatgaggtgt gtctctaatt ttttaaatta tatatcttca atttaaagtt ttagttaaaa 540

cataaagatt aacctttcat tagcaagctg ttagttatca ccaacgcttt tcatggatta 600cataaagatt aacctttcat tagcaagctg ttagttatca ccaacgcttt tcatggatta 600

ggaaaaaatc attttgtctc tatgtcaaac atcttggagt tgatatttgg ggaaacacaa 660ggaaaaaatc attttgtctc tatgtcaaac atcttggagt tgatatttgg ggaaacacaa 660

tactcagttg agttccctag gggagaaaag cacgcttaag aattgacata aagagtagga 720tactcagttg agttccctag gggagaaaag cacgcttaag aattgacata aagagtagga 720

agttagctaa tgcaacatat atcactttgt tttttcacaa ctacagtgac tttatgtatt 780agttagctaa tgcaacatat atcactttgt tttttcacaa ctacagtgac tttatgtatt 780

tcccagagga aggcatacag ggaagaaatt atcccatttg gacaaacagc atgttctcac 840tcccagagga aggcatacag ggaagaaatt atcccatttg gacaaacagc atgttctcac 840

aggaagcatt tatcacactt acttgtcaac tttctagaat caaatctagt agctgacagt 900aggaagcatt tatcacactt acttgtcaac tttctagaat caaatctagt agctgacagt 900

accaggatca ggggtgccaa ccctaagcac ccccagaaag ctgactggcc ctgtggttcc 960accaggatca ggggtgccaa ccctaagcac ccccagaaag ctgactggcc ctgtggttcc 960

cactccagac atgatgtcag ctgtgaaatc gacgtcgctg gaccataatt aggcttctgt 1020cactccagac atgatgtcag ctgtgaaatc gacgtcgctg gaccataatt aggcttctgt 1020

tcttcaggag acatttgttc aaagtcattt gggcaaccat attctgaaaa cagcccagcc 1080tcttcaggag acatttgttc aaagtcattt gggcaaccat attctgaaaa cagcccagcc 1080

agggtgatgg atcactttgc aaagatcctc aatgagctat tttcaagtga tgacaaagtg 1140agggtgatgg atcactttgc aaagatcctc aatgagctat tttcaagtga tgacaaagtg 1140

tgaagttaac cgctcatttg agaactttct ttttcatcca aagtaaattc aaatatgatt 1200tgaagttaac cgctcatttg agaactttct ttttcatcca aagtaaattc aaatatgatt 1200

agaaatctga ccttttatta ctggaattct cttgactaaa agtaaaattg aattttaatt 1260agaaatctga ccttttatta ctggaattct cttgactaaa agtaaaattg aattttaatt 1260

cctaaatctc catgtgtata cagtactgtg ggaacatcac agattttggc tccatgccct 1320cctaaatctc catgtgtata cagtactgtg ggaacatcac agattttggc tccatgccct 1320

aaagagaaat tggctttcag attatttgga ttaaaaacaa agactttctt aagagatgta 1380aaagagaaat tggctttcag attatttgga ttaaaaacaa agactttctt aagagatgta 1380

aaattttcat gatgttttct tttttgctaa aactaaagaa ttattctttt acatttca 1438aaattttcat gatgttttct tttttgctaa aactaaagaa ttatctttt acatttca 1438

<210> 19<210> 19

<211> 1438<211> 1438

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 修饰的FIX内含子<223> Modified FIX intron

<400> 19<400> 19

atttttaaaa ctaaatagat cgacattgct tttgttgcat ttatgtttaa taaacactgt 240atttttaaaa ctaaatagat cgacattgct tttgttgcat ttatgtttaa taaacactgt 240

tttttgtttg gacttaccac tttgaaatca aattgggaaa caaaagcaca aacaatggcc 420tttttgtttg gacttaccac tttgaaatca aattgggaaa caaaagcaca aacaatggcc 420

ggaaaaaatc attttgtctc tttgtcaaac atcttggagt tgatatttgg ggaaacacaa 660ggaaaaaatc attttgtctc tttgtcaaac atcttggagt tgatatttgg ggaaacacaa 660

agttagctat tgcaacatat atcactttgt tttttcacaa ctacagtgac tttttgtatt 780agttagctat tgcaacatat atcactttgt tttttcacaa ctacagtgac tttttgtatt 780

tcccagagga aggcatacag ggaagaaatt atcccatttg gacaaacagc ttgttctcac 840tcccagagga aggcatacag ggaagaaatt atcccatttg gacaaacagc ttgttctcac 840

agggtgttgg atcactttgc aaagatcctc attgagctat tttcaagtgt tgacaaagtg 1140agggtgttgg atcactttgc aaagatcctc attgagctat tttcaagtgt tgacaaagtg 1140

aaattttctt gttgttttct tttttgctaa aactaaagaa ttattctttt acatttca 1438aaattttctt gttgttttct tttttgctaa aactaaagaa ttatctttt acatttca 1438

<210> 20<210> 20

<211> 881<211> 881

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 鸡β-球蛋白内含子<223> Chicken beta-globin intron

<400> 20<400> 20

gcgggagtcg ctgcgttgcc ttcgccccgt gccccgctcc gccgccgcct cgcgccgccc 60gcgggagtcg ctgcgttgcc ttcgccccgt gccccgctcc gccgccgcct cgcgccgccc 60

gccccggctc tgactgaccg cgttactccc acaggtgagc gggcgggacg gcccttctcc 120gccccggctc tgactgaccg cgttatccc acaggtgagc gggcgggacg gcccttctcc 120

tccgggctgt aattagcgct tggtttaatg acggcttgtt tcttttctgt ggctgcgtga 180tccgggctgt aattagcgct tggtttaatg acggcttgtt tcttttctgt ggctgcgtga 180

aagccttgag gggctccggg agggcccttt gtgcgggggg agcggctcgg ggggtgcgtg 240aagccttgag gggctccggg agggcccttt gtgcgggggg agcggctcgg ggggtgcgtg 240

cgtgtgtgtg tgcgtgggga gcgccgcgtg cggctccgcg ctgcccggcg gctgtgagcg 300cgtgtgtgtg tgcgtgggga gcgccgcgtg cggctccgcg ctgcccggcg gctgtgagcg 300

ctgcgggcgc ggcgcggggc tttgtgcgct ccgcagtgtg cgcgagggga gcgcggccgg 360ctgcgggcgc ggcgcggggc tttgtgcgct ccgcagtgtg cgcgagggga gcgcggccgg 360

gggcggtgcc ccgcggtgcg gggggggctg cgaggggaac aaaggctgcg tgcggggtgt 420gggcggtgcc ccgcggtgcg gggggggctg cgaggggaac aaaggctgcg tgcggggtgt 420

gtgcgtgggg gggtgagcag ggggtgtggg cgcgtcggtc gggctgcaac cccccctgca 480gtgcgtgggg gggtgagcag ggggtgtggg cgcgtcggtc gggctgcaac cccccctgca 480

cccccctccc cgagttgctg agcacggccc ggcttcgggt gcggggctcc gtacggggcg 540cccccctccc cgagttgctg agcacggccc ggcttcgggt gcggggctcc gtacggggcg 540

tggcgcgggg ctcgccgtgc cgggcggggg gtggcggcag gtgggggtgc cgggcggggc 600tggcgcgggg ctcgccgtgc cgggcggggg gtggcggcag gtgggggtgc cgggcggggc 600

ggggccgcct cgggccgggg agggctcggg ggaggggcgc ggcggccccc ggagcgccgg 660ggggccgcct cgggccgggg agggctcggg ggaggggcgc ggcggccccc ggagcgccgg 660

cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt gcgagagggc 720cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt gcgagagggc 720

gcagggactt cctttgtccc aaatctgtgc ggagccgaaa tctgggaggc gccgccgcac 780gcagggactt cctttgtccc aaatctgtgc ggagccgaaa tctgggggc gccgccgcac 780

cccctctagc gggcgcgggg cgaagcggtg cggcgccggc aggaaggaaa tgggcgggga 840cccctctagc gggcgcgggg cgaagcggtg cggcgccggc aggaaggaaa tgggcgggga 840

gggccttcgt gcgtcgccgc gccgccgtcc ccttctccct c 881gggccttcgt gcgtcgccgc gccgccgtcc ccttctccct c 881

<210> 21<210> 21

<211> 881<211> 881

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 修饰的鸡β-球蛋白内含子<223> Modified chicken β-globin intron

<400> 21<400> 21

cggctgtcga ggcgcggcga gccgcagcca ttgccttttt tggtaatcgt gcgagagggc 720cggctgtcga ggcgcggcga gccgcagcca ttgccttttt tggtaatcgt gcgagagggc 720

cccctctagc gggcgcgggg cgaagcggtg cggcgccggc aggaaggaat tgggcgggga 840cccctctagc gggcgcgggg cgaagcggtg cggcgccggc aggaaggaat tgggcgggga 840

<210> 22<210> 22

<211> 4318<211> 4318

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 构建物：sp2+hGAAwt-Δ-8<223> Construct: sp2+hGAAwt-Δ-8

<400> 22<400> 22

ggtttaggta gtgtgagagg ggtacccggg gatcttgcta ccagtggaac agccactaag 360ggtttaggta gtgtgagagg ggtacccggg gatcttgcta ccagtggaac agccactaag 360

gattctgcag tgagagcaga gggccagcta agtggtactc tcccagagac tgtctgactc 420gattctgcag tgagagcaga gggccagcta agtggtactc tcccagagac tgtctgactc 420

acgccacccc ctccaccttg gacacaggac gctgtggttt ctgagccagg tacaatgact 480acgccacccc ctccaccttg gacacaggac gctgtggttt ctgagccagg tacaatgact 480

cctttcggta agtgcagtgg aagctgtaca ctgcccaggc aaagcgtccg ggcagcgtag 540cctttcggta agtgcagtgg aagctgtaca ctgcccaggc aaagcgtccg ggcagcgtag 540

gcgggcgact cagatcccag ccagtggact tagcccctgt ttgctcctcc gataactggg 600gcgggcgact cagatcccag ccagtggact tagcccctgt ttgctcctcc gataactggg 600

gtgaccttgg ttaatattca ccagcagcct cccccgttgc ccctctggat ccactgctta 660gtgaccttgg ttaatattca ccagcagcct cccccgttgc ccctctggat ccactgctta 660

aatacggacg aggacagggc cctgtctcct cagcttcagg caccaccact gacctgggac 720aatacggacg aggacagggc cctgtctcct cagcttcagg caccacccact gacctgggac 720

agtgaataga tcctgagaac ttcagggtga gtctatggga cccttgatgt tttctttccc 780agtgaataga tcctgagaac ttcagggtga gtctatggga cccttgatgt tttctttccc 780

cttcttttct atggttaagt tcatgtcata ggaaggggag aagtaacagg gtacacatat 840cttcttttct atggttaagt tcatgtcata ggaaggggag aagtaacagg gtacacatat 840

tgaccaaatc agggtaattt tgcatttgta attttaaaaa atgctttctt cttttaatat 900tgaccaaatc agggtaattt tgcatttgta attttaaaaa atgctttctt cttttaatat 900

acttttttgt ttatcttatt tctaatactt tccctaatct ctttctttca gggcaataat 960acttttttgt ttatcttatt tctaatactt tccctaatct ctttctttca gggcaataat 960

gatacaatgt atcatgcctc tttgcaccat tctaaagaat aacagtgata atttctgggt 1020gatacaatgt atcatgcctc tttgcaccat tctaaagaat aacagtgata atttctgggt 1020

taaggcaata gcaatatttc tgcatataaa tatttctgca tataaattgt aactgatgta 1080taaggcaata gcaatatttc tgcatataaa tatttctgca tataaattgt aactgatgta 1080

agaggtttca tattgctaat agcagctaca atccagctac cattctgctt ttattttctg 1140agaggtttca tattgctaat agcagctaca atccagctac cattctgctt ttattttctg 1140

gttgggataa ggctggatta ttctgagtcc aagctaggcc cttttgctaa tcttgttcat 1200gttgggataa ggctggatta ttctgagtcc aagctaggcc cttttgctaa tcttgttcat 1200

acctcttatc ttcctcccac agctcctggg caacctgctg gtctctctgc tggcccatca 1260acctctttatc ttcctcccac agctcctggg caacctgctg gtctctctgc tggcccatca 1260

ctttggcaaa gcacgcgtgc caccatgccg tcttctgtct cgtggggcat cctcctgctg 1320ctttggcaaa gcacgcgtgc caccatgccg tcttctgtct cgtggggcat cctcctgctg 1320

gcaggcctgt gctgcctggt ccctgtctcc ctggctctgc tggttccccg agagctgagt 1380gcaggcctgt gctgcctggt ccctgtctcc ctggctctgc tggttccccg agagctgagt 1380

ggctcctccc cagtcctgga ggagactcac ccagctcacc agcagggagc cagcagacca 1440ggctcctccc cagtcctgga ggagactcac ccagctcacc agcagggagc cagcagacca 1440

gggccccggg atgcccaggc acaccccggg cggccgcgag cagtgcccac acagtgcgac 1500gggccccggg atgcccaggc acaccccggg cggccgcgag cagtgcccac acagtgcgac 1500

gtccccccca acagccgctt cgattgcgcc cctgacaagg ccatcaccca ggaacagtgc 1560gtccccccca acagccgctt cgattgcgcc cctgacaagg ccatcaccca ggaacagtgc 1560

gaggcccgcg gctgttgcta catccctgca aagcaggggc tgcagggagc ccagatgggg 1620gaggcccgcg gctgttgcta catccctgca aagcaggggc tgcagggagc ccagatgggg 1620

cagccctggt gcttcttccc acccagctac cccagctaca agctggagaa cctgagctcc 1680cagccctggt gcttcttccc acccagctac cccagctaca agctggagaa cctgagctcc 1680

tctgaaatgg gctacacggc caccctgacc cgtaccaccc ccaccttctt ccccaaggac 1740tctgaaatgg gctacacggc caccctgacc cgtaccaccc ccaccttctt ccccaaggac 1740

atcctgaccc tgcggctgga cgtgatgatg gagactgaga accgcctcca cttcacgatc 1800atcctgaccc tgcggctgga cgtgatgatg gagactgaga accgcctcca cttcacgatc 1800

aaagatccag ctaacaggcg ctacgaggtg cccttggaga ccccgcatgt ccacagccgg 1860aaagatccag ctaacaggcg ctacgaggtg cccttggaga ccccgcatgt ccacagccgg 1860

gcaccgtccc cactctacag cgtggagttc tccgaggagc ccttcggggt gatcgtgcgc 1920gcaccgtccc cactctacag cgtggagttc tccgaggagc ccttcggggt gatcgtgcgc 1920

cggcagctgg acggccgcgt gctgctgaac acgacggtgg cgcccctgtt ctttgcggac 1980cggcagctgg acggccgcgt gctgctgaac acgacggtgg cgcccctgtt ctttgcggac 1980

cagttccttc agctgtccac ctcgctgccc tcgcagtata tcacaggcct cgccgagcac 2040cagttccttc agctgtccac ctcgctgccc tcgcagtata tcacaggcct cgccgagcac 2040

ctcagtcccc tgatgctcag caccagctgg accaggatca ccctgtggaa ccgggacctt 2100ctcagtcccc tgatgctcag caccagctgg accaggatca ccctgtggaa ccgggacctt 2100

gcgcccacgc ccggtgcgaa cctctacggg tctcaccctt tctacctggc gctggaggac 2160gcgcccacgc ccggtgcgaa cctctacggg tctcaccctt tctacctggc gctggaggac 2160

ggcgggtcgg cacacggggt gttcctgcta aacagcaatg ccatggatgt ggtcctgcag 2220ggcgggtcgg cacacggggt gttcctgcta aacagcaatg ccatggatgt ggtcctgcag 2220

ccgagccctg cccttagctg gaggtcgaca ggtgggatcc tggatgtcta catcttcctg 2280ccgagccctg cccttagctg gaggtcgaca ggtggggatcc tggatgtcta catcttcctg 2280

ggcccagagc ccaagagcgt ggtgcagcag tacctggacg ttgtgggata cccgttcatg 2340ggcccagc ccaagagcgt ggtgcagcag tacctggacg ttgtgggata cccgttcatg 2340

ccgccatact ggggcctggg cttccacctg tgccgctggg gctactcctc caccgctatc 2400ccgccatact ggggcctggg cttccacctg tgccgctggg gctactcctc caccgctatc 2400

acccgccagg tggtggagaa catgaccagg gcccacttcc ccctggacgt ccagtggaac 2460acccgccagg tggtggagaa catgaccagg gcccacttcc ccctggacgt ccagtggaac 2460

gacctggact acatggactc ccggagggac ttcacgttca acaaggatgg cttccgggac 2520gacctggact acatggactc ccggagggac ttcacgttca acaaggatgg cttccgggac 2520

ttcccggcca tggtgcagga gctgcaccag ggcggccggc gctacatgat gatcgtggat 2580ttcccggcca tggtgcagga gctgcaccag ggcggccggc gctacatgat gatcgtggat 2580

cctgccatca gcagctcggg ccctgccggg agctacaggc cctacgacga gggtctgcgg 2640cctgccatca gcagctcggg ccctgccggg agctacaggc cctacgacga gggtctgcgg 2640

aggggggttt tcatcaccaa cgagaccggc cagccgctga ttgggaaggt atggcccggg 2700aggggggttt tcatcaccaa cgagaccggc cagccgctga ttgggaaggt atggcccggg 2700

tccactgcct tccccgactt caccaacccc acagccctgg cctggtggga ggacatggtg 2760tccactgcct tccccgactt caccaacccc acagccctgg cctggtggga ggacatggtg 2760

gctgagttcc atgaccaggt gcccttcgac ggcatgtgga ttgacatgaa cgagccttcc 2820gctgagttcc atgaccaggt gcccttcgac ggcatgtgga ttgacatgaa cgagccttcc 2820

aacttcatca ggggctctga ggacggctgc cccaacaatg agctggagaa cccaccctac 2880aacttcatca ggggctctga ggacggctgc cccaacaatg agctggagaa cccaccctac 2880

gtgcctgggg tggttggggg gaccctccag gcggccacca tctgtgcctc cagccaccag 2940gtgcctgggg tggttggggg gaccctccag gcggccacca tctgtgcctc cagccaccag 2940

tttctctcca cacactacaa cctgcacaac ctctacggcc tgaccgaagc catcgcctcc 3000tttctctcca cacactacaa cctgcacaac ctctacggcc tgaccgaagc catcgcctcc 3000

cacagggcgc tggtgaaggc tcgggggaca cgcccatttg tgatctcccg ctcgaccttt 3060cacagggcgc tggtgaaggc tcgggggaca cgcccatttg tgatctcccg ctcgaccttt 3060

gctggccacg gccgatacgc cggccactgg acgggggacg tgtggagctc ctgggagcag 3120gctggccacg gccgatacgc cggccactgg acgggggacg tgtggagctc ctgggagcag 3120

ctcgcctcct ccgtgccaga aatcctgcag tttaacctgc tgggggtgcc tctggtcggg 3180ctcgcctcct ccgtgccaga aatcctgcag tttaacctgc tgggggtgcc tctggtcggg 3180

gccgacgtct gcggcttcct gggcaacacc tcagaggagc tgtgtgtgcg ctggacccag 3240gccgacgtct gcggcttcct gggcaacacc tcagaggagc tgtgtgtgcg ctggacccag 3240

ctgggggcct tctacccctt catgcggaac cacaacagcc tgctcagtct gccccaggag 3300ctgggggcct tctacccctt catgcggaac cacaacagcc tgctcagtct gccccaggag 3300

ccgtacagct tcagcgagcc ggcccagcag gccatgagga aggccctcac cctgcgctac 3360ccgtacagct tcagcgagcc ggcccagcag gccatgagga aggccctcac cctgcgctac 3360

gcactcctcc cccacctcta cacactgttc caccaggccc acgtcgcggg ggagaccgtg 3420gcactcctcc cccacctcta cacactgttc caccaggccc acgtcgcggg ggagaccgtg 3420

gcccggcccc tcttcctgga gttccccaag gactctagca cctggactgt ggaccaccag 3480gcccggcccc tcttcctgga gttccccaag gactctagca cctggactgt ggaccaccag 3480

ctcctgtggg gggaggccct gctcatcacc ccagtgctcc aggccgggaa ggccgaagtg 3540ctcctgtggg gggaggccct gctcatcacc ccagtgctcc aggccgggaa ggccgaagtg 3540

actggctact tccccttggg cacatggtac gacctgcaga cggtgccagt agaggccctt 3600actggctact tccccttggg cacatggtac gacctgcaga cggtgccagt agaggccctt 3600

ggcagcctcc cacccccacc tgcagctccc cgtgagccag ccatccacag cgaggggcag 3660ggcagcctcc cacccccacc tgcagctccc cgtgagccag ccatccacag cgaggggcag 3660

tgggtgacgc tgccggcccc cctggacacc atcaacgtcc acctccgggc tgggtacatc 3720tgggtgacgc tgccggcccc cctggacacc atcaacgtcc acctccgggc tgggtacatc 3720

atccccctgc agggccctgg cctcacaacc acagagtccc gccagcagcc catggccctg 3780atccccctgc aggggccctgg cctcacaacc acagagtccc gccagcagcc catggccctg 3780

gctgtggccc tgaccaaggg tggggaggcc cgaggggagc tgttctggga cgatggagag 3840gctgtggccc tgaccaaggg tggggaggcc cgaggggagc tgttctggga cgatggagag 3840

agcctggaag tgctggagcg aggggcctac acacaggtca tcttcctggc caggaataac 3900agcctggaag tgctggagcg aggggcctac acacaggtca tcttcctggc caggaataac 3900

acgatcgtga atgagctggt acgtgtgacc agtgagggag ctggcctgca gctgcagaag 3960acgatcgtga atgagctggt acgtgtgacc agtgaggggag ctggcctgca gctgcagaag 3960

gtgactgtcc tgggcgtggc cacggcgccc cagcaggtcc tctccaacgg tgtccctgtc 4020gtgactgtcc tgggcgtggc cacggcgccc cagcaggtcc tctccaacgg tgtccctgtc 4020

tccaacttca cctacagccc cgacaccaag gtcctggaca tctgtgtctc gctgttgatg 4080tccaacttca cctacagccc cgacaccaag gtcctggaca tctgtgtctc gctgttgatg 4080

ggagagcagt ttctcgtcag ctggtgttag ctcgagagat ctaccggtga attcaccgcg 4140ggagagcagt ttctcgtcag ctggtgttag ctcgagagat ctaccggtga attcaccgcg 4140

ggtttaaact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 4200ggtttaaact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 4200

cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 4260cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 4260

gcattgtctg agtaggtgtc attctattct ggggggtggg gtgggggcta gctctaga 4318gcattgtctg agtaggtgtc attctattct ggggggtggg gtgggggcta gctctaga 4318

<210> 23<210> 23

<211> 4318<211> 4318

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 构建物：sp2+hGAAco1-Δ-8<223> Construct: sp2+hGAAco1-Δ-8

<400> 23<400> 23

ctttggcaaa gcacgcgtgc caccatgcct agctctgtgt cctggggcat tctgctgctg 1320ctttggcaaa gcacgcgtgc caccatgcct agctctgtgt cctggggcat tctgctgctg 1320

gccggcctgt gttgtctggt gcctgtgtct ctggccctac tagtgcccag agagctgagc 1380gccggcctgt gttgtctggt gcctgtgtct ctggccctac tagtgcccag agagctgagc 1380

ggcagctctc ccgtgctgga agaaacacac cctgcccatc agcagggcgc ctctagacct 1440ggcagctctc ccgtgctgga agaaacacac cctgcccatc agcagggcgc ctctagacct 1440

ggacctagag atgcccaggc ccaccccggc agacctagag ctgtgcctac ccagtgtgac 1500ggacctagag atgcccaggc ccaccccggc agacctagag ctgtgcctac ccagtgtgac 1500

gtgcccccca acagcagatt cgactgcgcc cctgacaagg ccatcaccca ggaacagtgc 1560gtgcccccca acagcagatt cgactgcgcc cctgacaagg ccatcaccca ggaacagtgc 1560

gaggccagag gctgctgcta catccctgcc aagcagggac tgcagggcgc tcagatggga 1620gaggccagag gctgctgcta catccctgcc aagcagggac tgcagggcgc tcagatggga 1620

cagccctggt gcttcttccc accctcctac cccagctaca agctggaaaa cctgagcagc 1680cagccctggt gcttcttccc accctcctac cccagctaca agctggaaaa cctgagcagc 1680

agcgagatgg gctacaccgc caccctgacc agaaccaccc ccacattctt cccaaaggac 1740agcgagatgg gctacaccgc caccctgacc agaaccaccc ccacattctt cccaaaggac 1740

atcctgaccc tgcggctgga cgtgatgatg gaaaccgaga accggctgca cttcaccatc 1800atcctgaccc tgcggctgga cgtgatgatg gaaaccgaga accggctgca cttcaccatc 1800

aaggaccccg ccaatcggag atacgaggtg cccctggaaa ccccccacgt gcactctaga 1860aaggaccccg ccaatcggag atacgaggtg cccctggaaa ccccccacgt gcactctaga 1860

gcccccagcc ctctgtacag cgtggaattc agcgaggaac ccttcggcgt gatcgtgcgg 1920gcccccagcc ctctgtacag cgtggaattc agcgaggaac ccttcggcgt gatcgtgcgg 1920

agacagctgg atggcagagt gctgctgaac accaccgtgg cccctctgtt cttcgccgac 1980agacagctgg atggcagagt gctgctgaac accaccgtgg cccctctgtt cttcgccgac 1980

cagttcctgc agctgagcac cagcctgccc agccagtaca tcacaggact ggccgagcac 2040cagttcctgc agctgagcac cagcctgccc agccagtaca tcacaggact ggccgagcac 2040

ctgagccccc tgatgctgag cacatcctgg acccggatca ccctgtggaa cagggatctg 2100ctgagccccc tgatgctgag cacatcctgg acccggatca ccctgtggaa cagggatctg 2100

gcccctaccc ctggcgccaa tctgtacggc agccaccctt tctacctggc cctggaagat 2160gcccctaccc ctggcgccaa tctgtacggc agccaccctt tctacctggc cctggaagat 2160

ggcggatctg cccacggagt gtttctgctg aactccaacg ccatggacgt ggtgctgcag 2220ggcggatctg cccacggagt gtttctgctg aactccaacg ccatggacgt ggtgctgcag 2220

cctagccctg ccctgtcttg gagaagcaca ggcggcatcc tggatgtgta catctttctg 2280cctagccctg ccctgtcttg gagaagcaca ggcggcatcc tggatgtgta catctttctg 2280

ggccccgagc ccaagagcgt ggtgcagcag tatctggatg tcgtgggcta ccccttcatg 2340ggccccgagc ccaagagcgt ggtgcagcag tatctggatg tcgtgggcta ccccttcatg 2340

cccccttact ggggcctggg attccacctg tgcagatggg gctactccag caccgccatc 2400cccccttact ggggcctggg attccacctg tgcagatggg gctactccag caccgccatc 2400

accagacagg tggtggaaaa catgaccaga gcccacttcc cactggatgt gcagtggaac 2460accagacagg tggtggaaaa catgaccaga gcccacttcc cactggatgt gcagtggaac 2460

gacctggact acatggacag cagacgggac ttcaccttca acaaggacgg cttccgggac 2520gacctggact acatggacag cagacgggac ttcaccttca acaaggacgg cttccgggac 2520

ttccccgcca tggtgcagga actgcatcag ggcggcagac ggtacatgat gatcgtggat 2580ttccccgcca tggtgcagga actgcatcag ggcggcagac ggtacatgat gatcgtggat 2580

cccgccatca gctcctctgg ccctgccggc tcttacagac cctacgacga gggcctgcgg 2640cccgccatca gctcctctgg ccctgccggc tcttacagac cctacgacga gggcctgcgg 2640

agaggcgtgt tcatcaccaa cgagacaggc cagcccctga tcggcaaagt gtggcctggc 2700agaggcgtgt tcatcaccaa cgagacaggc cagcccctga tcggcaaagt gtggcctggc 2700

agcacagcct tccccgactt caccaatcct accgccctgg cttggtggga ggacatggtg 2760agcacagcct tccccgactt caccaatcct accgccctgg cttggtggga ggacatggtg 2760

gccgagttcc acgaccaggt gcccttcgac ggcatgtgga tcgacatgaa cgagcccagc 2820gccgagttcc acgaccaggt gcccttcgac ggcatgtgga tcgacatgaa cgagcccagc 2820

aacttcatcc ggggcagcga ggatggctgc cccaacaacg aactggaaaa tcccccttac 2880aacttcatcc ggggcagcga ggatggctgc cccaacaacg aactggaaaa tcccccttac 2880

gtgcccggcg tcgtgggcgg aacactgcag gccgctacaa tctgtgccag cagccaccag 2940gtgcccggcg tcgtgggcgg aacactgcag gccgctacaa tctgtgccag cagccaccag 2940

tttctgagca cccactacaa cctgcacaac ctgtacggcc tgaccgaggc cattgccagc 3000tttctgagca cccactacaa cctgcacaac ctgtacggcc tgaccgaggc cattgccagc 3000

caccgcgctc tcgtgaaagc cagaggcaca cggcccttcg tgatcagcag aagcaccttt 3060caccgcgctc tcgtgaaagc cagaggcaca cggcccttcg tgatcagcag aagcaccttt 3060

gccggccacg gcagatacgc cggacattgg actggcgacg tgtggtcctc ttgggagcag 3120gccggccacg gcagatacgc cggacattgg actggcgacg tgtggtcctc ttggggagcag 3120

ctggcctcta gcgtgcccga gatcctgcag ttcaatctgc tgggcgtgcc actcgtgggc 3180ctggcctcta gcgtgcccga gatcctgcag ttcaatctgc tgggcgtgcc actcgtgggc 3180

gccgatgtgt gtggcttcct gggcaacacc tccgaggaac tgtgtgtgcg gtggacacag 3240gccgatgtgt gtggcttcct gggcaacacc tccgaggaac tgtgtgtgcg gtggacacag 3240

ctgggcgcct tctacccttt catgagaaac cacaacagcc tgctgagcct gccccaggaa 3300ctgggcgcct tctacccttt catgagaaac cacaacagcc tgctgagcct gccccaggaa 3300

ccctacagct ttagcgagcc tgcacagcag gccatgcgga aggccctgac actgagatac 3360ccctacagct ttagcgagcc tgcacagcag gccatgcgga aggccctgac actgagatac 3360

gctctgctgc cccacctgta caccctgttt caccaggccc atgtggccgg cgagacagtg 3420gctctgctgc cccacctgta caccctgttt caccaggccc atgtggccgg cgagacagtg 3420

gccagacctc tgtttctgga attccccaag gacagcagca cctggaccgt ggaccatcag 3480gccagacctc tgtttctgga attccccaag gacagcagca cctggaccgt ggaccatcag 3480

ctgctgtggg gagaggctct gctgattacc ccagtgctgc aggcaggcaa ggccgaagtg 3540ctgctgtggg gagaggctct gctgattacc ccagtgctgc aggcaggcaa ggccgaagtg 3540

accggctact ttcccctggg cacttggtac gacctgcaga ccgtgcctgt ggaagccctg 3600accggctact ttcccctggg cacttggtac gacctgcaga ccgtgcctgt ggaagccctg 3600

ggatctctgc ctccacctcc tgccgctcct agagagcctg ccattcactc tgagggccag 3660ggatctctgc ctccacctcc tgccgctcct agagagcctg ccattcactc tgagggccag 3660

tgggtcacac tgcctgcccc cctggatacc atcaacgtgc acctgagggc cggctacatc 3720tgggtcacac tgcctgcccc cctggatacc atcaacgtgc acctgagggc cggctacatc 3720

ataccactgc agggacctgg cctgaccacc accgagtcta gacagcagcc aatggccctg 3780ataccactgc agggacctgg cctgaccacc accgagtcta gacagcagcc aatggccctg 3780

gccgtggccc tgaccaaagg cggagaagct aggggcgagc tgttctggga cgatggcgag 3840gccgtggccc tgaccaaagg cggagaagct aggggcgagc tgttctggga cgatggcgag 3840

agcctggaag tgctggaaag aggcgcctat acccaagtga tcttcctggc ccggaacaac 3900agcctggaag tgctggaaag aggcgcctat acccaagtga tcttcctggc ccggaacaac 3900

accatcgtga acgagctggt gcgcgtgacc tctgaaggcg ctggactgca gctgcagaaa 3960accatcgtga acgagctggt gcgcgtgacc tctgaaggcg ctggactgca gctgcagaaa 3960

gtgaccgtgc tgggagtggc cacagcccct cagcaggtgc tgtctaatgg cgtgcccgtg 4020gtgaccgtgc tgggagtggc cacagcccct cagcaggtgc tgtctaatgg cgtgcccgtg 4020

tccaacttca cctacagccc cgacaccaag gtgctggaca tctgcgtgtc actgctgatg 4080tccaacttca cctacagccc cgacaccaag gtgctggaca tctgcgtgtc actgctgatg 4080

ggagagcagt ttctggtgtc ctggtgctga ctcgagagat ctaccggtga attcaccgcg 4140ggagagcagt ttctggtgtc ctggtgctga ctcgagagat ctaccggtga attcaccgcg 4140

<210> 24<210> 24

<211> 4318<211> 4318

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 构建物：sp2+hGAAco2-Δ-8<223> Construct: sp2+hGAAco2-Δ-8

<400> 24<400> 24

ctttggcaaa gcacgcgtgc caccatgcca tcgtcagtgt cttggggcat tcttctgctc 1320ctttggcaaa gcacgcgtgc caccatgcca tcgtcagtgt cttggggcat tcttctgctc 1320

gccggattgt gttgcctggt gcctgtctca ttggccctgt tggtgcctag agagctgagc 1380gccggattgt gttgcctggt gcctgtctca ttggccctgt tggtgcctag agagctgagc 1380

ggatcatccc cagtgctgga ggagactcat cctgctcacc aacagggagc ttccagacca 1440ggatcatccc cagtgctgga ggagactcat cctgctcacc aacagggagc ttccagacca 1440

ggaccgagag acgcccaagc ccatcctggt agaccaagag ctgtgcctac ccaatgcgac 1500ggaccgagag acgcccaagc ccatcctggt agaccaagag ctgtgcctac ccaatgcgac 1500

gtgccaccca actcccgatt cgactgcgcg ccagataagg ctattaccca agagcagtgt 1560gtgccaccca actcccgatt cgactgcgcg ccagataagg ctattaccca agagcagtgt 1560

gaagccagag gttgctgcta catcccagcg aagcaaggat tgcaaggcgc ccaaatggga 1620gaagccagag gttgctgcta catcccagcg aagcaaggat tgcaaggcgc ccaaatggga 1620

caaccttggt gtttcttccc cccttcgtac ccatcatata aactcgaaaa cctgtcctct 1680caaccttggt gtttcttccc cccttcgtac ccatcatata aactcgaaaa cctgtcctct 1680

tcggaaatgg gttatactgc caccctcacc agaactactc ctactttctt cccgaaagac 1740tcggaaatgg gttatactgc caccctcacc agaactactc ctactttctt cccgaaagac 1740

atcttgacct tgaggctgga cgtgatgatg gagactgaaa accggctgca tttcactatc 1800atcttgacct tgaggctgga cgtgatgatg gagactgaaa accggctgca tttcactatc 1800

aaagatcctg ccaatcggcg atacgaggtc cctctggaaa cccctcacgt gcactcacgg 1860aaagatcctg ccaatcggcg atacgaggtc cctctggaaa cccctcacgt gcactcacgg 1860

gctccttctc cgctttactc cgtcgaattc tctgaggaac ccttcggagt gatcgttaga 1920gctccttctc cgctttactc cgtcgaattc tctgaggaac ccttcggagt gatcgttaga 1920

cgccagctgg atggtagagt gctgttgaac actactgtgg ccccactttt cttcgctgac 1980cgccagctgg atggtagagt gctgttgaac actactgtgg ccccactttt cttcgctgac 1980

cagtttctgc aactgtccac ttccctgcca tcccagtaca ttactggact cgccgaacac 2040cagtttctgc aactgtccac ttccctgcca tcccagtaca ttactggact cgccgaacac 2040

ctgtcgccac tgatgctctc gacctcttgg actagaatca ctttgtggaa cagagacttg 2100ctgtcgccac tgatgctctc gacctcttgg actagaatca ctttgtggaa cagagacttg 2100

gcccctactc cgggagcaaa tctgtacgga agccaccctt tttacctggc gctcgaagat 2160gcccctactc cgggagcaaa tctgtacgga agccaccctt tttacctggc gctcgaagat 2160

ggcggatccg ctcacggagt gttcctgctg aatagcaacg caatggacgt ggtgctgcaa 2220ggcggatccg ctcacggagt gttcctgctg aatagcaacg caatggacgt ggtgctgcaa 2220

ccttcccctg cactcagttg gagaagtacc gggggtattc tggacgtgta catcttcctc 2280ccttcccctg cactcagttg gagaagtacc gggggtattc tggacgtgta catcttcctc 2280

ggaccagaac ccaagagcgt ggtgcagcaa tatctggacg tggtcggata cccttttatg 2340ggaccagaac ccaagagcgt ggtgcagcaa tatctggacg tggtcggata cccttttatg 2340

cctccttact ggggactggg attccacctt tgccgttggg gctactcatc caccgccatt 2400cctccttact ggggactggg attccacctt tgccgttggg gctactcatc caccgccatt 2400

accagacagg tggtggagaa tatgaccaga gcccacttcc ctctcgacgt gcagtggaac 2460accagacagg tggtggagaa tatgaccaga gcccacttcc ctctcgacgt gcagtggaac 2460

gatctggact atatggactc ccggagagat ttcaccttca acaaggacgg gttccgcgat 2520gatctggact atatggactc ccggagagat ttcaccttca acaaggacgg gttccgcgat 2520

tttcccgcga tggttcaaga gctccaccag ggtggtcgaa gatatatgat gatcgtcgac 2580tttcccgcga tggttcaaga gctccaccag ggtggtcgaa gatatatgat gatcgtcgac 2580

ccagccattt cgagcagcgg acccgctgga tcttatagac cttacgacga aggccttagg 2640ccagccattt cgagcagcgg acccgctgga tcttatagac cttacgacga aggccttagg 2640

agaggagtgt tcatcacaaa cgagactgga cagcctttga tcggtaaagt gtggcctgga 2700agaggagtgt tcatcacaaa cgagactgga cagcctttga tcggtaaagt gtggcctgga 2700

tcaaccgcct ttcctgactt taccaatccc actgccttgg cttggtggga ggacatggtg 2760tcaaccgcct ttcctgactt taccaatccc actgccttgg cttggtggga ggacatggtg 2760

gccgaattcc acgaccaagt cccctttgat ggaatgtgga tcgatatgaa cgaaccaagc 2820gccgaattcc acgaccaagt cccctttgat ggaatgtgga tcgatatgaa cgaaccaagc 2820

aattttatca gaggttccga agacggttgc cccaacaacg aactggaaaa ccctccttat 2880aattttatca gaggttccga agacggttgc cccaacaacg aactggaaaa ccctccttat 2880

gtgcccggag tcgtgggcgg aacattacag gccgcgacta tttgcgccag cagccaccaa 2940gtgcccggag tcgtgggcgg aacattacag gccgcgacta tttgcgccag cagccaccaa 2940

ttcctgtcca ctcactacaa cctccacaac ctttatggat taaccgaagc tattgcaagt 3000ttcctgtcca ctcactacaa cctccacaac ctttatggat taaccgaagc tattgcaagt 3000

cacagggctc tggtgaaggc tagagggact aggccctttg tgatctcccg atccaccttt 3060cacagggctc tggtgaaggc tagagggact aggccctttg tgatctcccg atccaccttt 3060

gccggacacg ggagatacgc cggtcactgg actggtgacg tgtggagctc atgggaacaa 3120gccggacacg ggagatacgc cggtcactgg actggtgacg tgtggagctc atgggaacaa 3120

ctggcctcct ccgtgccgga aatcttacag ttcaaccttc tgggtgtccc tcttgtcgga 3180ctggcctcct ccgtgccgga aatcttacag ttcaaccttc tgggtgtccc tcttgtcgga 3180

gcagacgtgt gtgggtttct tggtaacacc tccgaggaac tgtgtgtgcg ctggactcaa 3240gcagacgtgt gtgggtttct tggtaacacc tccgaggaac tgtgtgtgcg ctggactcaa 3240

ctgggtgcat tctacccatt catgagaaac cacaactcct tgctgtccct gccacaagag 3300ctgggtgcat tctacccatt catgagaaac cacaactcct tgctgtccct gccacaagag 3300

ccctactcgt tcagcgagcc tgcacaacag gctatgcgga aggcactgac cctgagatac 3360ccctactcgt tcagcgagcc tgcacaacag gctatgcgga aggcactgac cctgagatac 3360

gccctgcttc cacacttata cactctcttc catcaagcgc atgtggcagg agaaaccgtt 3420gccctgcttc cacacttata cactctcttc catcaagcgc atgtggcagg agaaaccgtt 3420

gcaaggcctc ttttccttga attccccaag gattcctcga cttggacggt ggatcatcag 3480gcaaggcctc ttttccttga attccccaag gattcctcga cttggacggt ggatcatcag 3480

ctgctgtggg gagaagctct gctgattact ccagtgttgc aagccggaaa agctgaggtg 3540ctgctgtggg gagaagctct gctgattact ccagtgttgc aagccggaaa agctgaggtg 3540

accggatact ttccgctggg aacctggtac gacctccaga ctgtccctgt tgaagccctt 3600accggatact ttccgctggg aacctggtac gacctccaga ctgtccctgt tgaagccctt 3600

ggatcactgc ctccgcctcc ggcagctcca cgcgaaccag ctatacattc cgagggacag 3660ggatcactgc ctccgcctcc ggcagctcca cgcgaaccag ctatacattc cgagggacag 3660

tgggttacat taccagctcc tctggacaca atcaacgtcc acttaagagc tggctacatt 3720tgggttacat taccagctcc tctggacaca atcaacgtcc acttaagagc tggctacatt 3720

atccctctgc aaggaccagg actgactacg accgagagca gacagcagcc aatggcactg 3780atccctctgc aaggaccagg actgactacg accgagagca gacagcagcc aatggcactg 3780

gctgtggctc tgaccaaggg aggggaagct agaggagaac tcttctggga tgatggggag 3840gctgtggctc tgaccaaggg aggggaagct agaggagaac tcttctggga tgatggggag 3840

tcccttgaag tgctggaaag aggcgcttac actcaagtca ttttccttgc acggaacaac 3900tcccttgaag tgctggaaag aggcgcttac actcaagtca ttttccttgc acggaacaac 3900

accattgtga acgaattggt gcgagtgacc agcgaaggag ctggacttca actgcagaag 3960accattgtga acgaattggt gcgagtgacc agcgaaggag ctggacttca actgcagaag 3960

gtcactgtgc tcggagtggc taccgctcct cagcaagtgc tgtcgaatgg agtccccgtg 4020gtcactgtgc tcggagtggc taccgctcct cagcaagtgc tgtcgaatgg agtccccgtg 4020

tcaaacttta cctactcccc tgacactaag gtgctcgaca tttgcgtgtc cctcctgatg 4080tcaaacttta cctactcccc tgacactaag gtgctcgaca tttgcgtgtc cctcctgatg 4080

ggagagcagt tccttgtgtc ctggtgttga ctcgagagat ctaccggtga attcaccgcg 4140ggagagcagt tccttgtgtc ctggtgttga ctcgagagat ctaccggtga attcaccgcg 4140

<210> 25<210> 25

<211> 4300<211> 4300

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 构建物：sp7+hGAAco1-Δ-8<223> Construct: sp7+hGAAco1-Δ-8

<400> 25<400> 25

ctttggcaaa gcacgcgtgc caccatggcc tttctgtggc tgctgagctg ttgggccctg 1320ctttggcaaa gcacgcgtgc caccatggcc tttctgtggc tgctgagctg ttgggccctg 1320

ctgggcacca ccttcggcct actagtgccc agagagctga gcggcagctc tcccgtgctg 1380ctgggcacca ccttcggcct actagtgccc agagagctga gcggcagctc tcccgtgctg 1380

gaagaaacac accctgccca tcagcagggc gcctctagac ctggacctag agatgcccag 1440gaagaaacac accctgccca tcagcagggc gcctctagac ctggacctag agatgcccag 1440

gcccaccccg gcagacctag agctgtgcct acccagtgtg acgtgccccc caacagcaga 1500gcccaccccg gcagacctag agctgtgcct acccagtgtg acgtgccccc caacagcaga 1500

ttcgactgcg cccctgacaa ggccatcacc caggaacagt gcgaggccag aggctgctgc 1560ttcgactgcg cccctgacaa ggccatcacc caggaacagt gcgaggccag aggctgctgc 1560

tacatccctg ccaagcaggg actgcagggc gctcagatgg gacagccctg gtgcttcttc 1620tacatccctg ccaagcaggg actgcagggc gctcagatgg gacagccctg gtgcttcttc 1620

ccaccctcct accccagcta caagctggaa aacctgagca gcagcgagat gggctacacc 1680ccaccctcct accccagcta caagctggaa aacctgagca gcagcgagat gggctacacc 1680

gccaccctga ccagaaccac ccccacattc ttcccaaagg acatcctgac cctgcggctg 1740gccaccctga ccagaaccac ccccacattc ttcccaaagg acatcctgac cctgcggctg 1740

gacgtgatga tggaaaccga gaaccggctg cacttcacca tcaaggaccc cgccaatcgg 1800gacgtgatga tggaaaccga gaaccggctg cacttcacca tcaaggaccc cgccaatcgg 1800

agatacgagg tgcccctgga aaccccccac gtgcactcta gagcccccag ccctctgtac 1860agatacgagg tgcccctgga aaccccccac gtgcactcta gagcccccag ccctctgtac 1860

agcgtggaat tcagcgagga acccttcggc gtgatcgtgc ggagacagct ggatggcaga 1920agcgtggaat tcagcgagga acccttcggc gtgatcgtgc ggagacagct ggatggcaga 1920

gtgctgctga acaccaccgt ggcccctctg ttcttcgccg accagttcct gcagctgagc 1980gtgctgctga acaccaccgt ggcccctctg ttcttcgccg accagttcct gcagctgagc 1980

accagcctgc ccagccagta catcacagga ctggccgagc acctgagccc cctgatgctg 2040accagcctgc ccagccagta catcacagga ctggccgagc acctgagccc cctgatgctg 2040

agcacatcct ggacccggat caccctgtgg aacagggatc tggcccctac ccctggcgcc 2100agcacatcct ggacccggat caccctgtgg aacagggatc tggcccctac ccctggcgcc 2100

aatctgtacg gcagccaccc tttctacctg gccctggaag atggcggatc tgcccacgga 2160aatctgtacg gcagccaccc tttctacctg gccctggaag atggcggatc tgcccacgga 2160

gtgtttctgc tgaactccaa cgccatggac gtggtgctgc agcctagccc tgccctgtct 2220gtgtttctgc tgaactccaa cgccatggac gtggtgctgc agcctagccc tgccctgtct 2220

tggagaagca caggcggcat cctggatgtg tacatctttc tgggccccga gcccaagagc 2280tggagaagca caggcggcat cctggatgtg tacatctttc tgggccccga gcccaagagc 2280

gtggtgcagc agtatctgga tgtcgtgggc taccccttca tgccccctta ctggggcctg 2340gtggtgcagc agtatctgga tgtcgtgggc taccccttca tgcccccctta ctggggcctg 2340

ggattccacc tgtgcagatg gggctactcc agcaccgcca tcaccagaca ggtggtggaa 2400ggattccacc tgtgcagatg gggctactcc agcaccgcca tcaccagaca ggtggtggaa 2400

aacatgacca gagcccactt cccactggat gtgcagtgga acgacctgga ctacatggac 2460aacatgacca gagcccactt cccactggat gtgcagtgga acgacctgga ctacatggac 2460

agcagacggg acttcacctt caacaaggac ggcttccggg acttccccgc catggtgcag 2520agcagacggg acttcacctt caacaaggac ggcttccggg acttccccgc catggtgcag 2520

gaactgcatc agggcggcag acggtacatg atgatcgtgg atcccgccat cagctcctct 2580gaactgcatc agggcggcag acggtacatg atgatcgtgg atcccgccat cagctcctct 2580

ggccctgccg gctcttacag accctacgac gagggcctgc ggagaggcgt gttcatcacc 2640ggccctgccg gctcttacag accctacgac gagggcctgc ggagaggcgt gttcatcacc 2640

aacgagacag gccagcccct gatcggcaaa gtgtggcctg gcagcacagc cttccccgac 2700aacgagacag gccagcccct gatcggcaaa gtgtggcctg gcagcacagc cttccccgac 2700

ttcaccaatc ctaccgccct ggcttggtgg gaggacatgg tggccgagtt ccacgaccag 2760ttcaccaatc ctaccgccct ggcttggtgg gaggacatgg tggccgagtt ccacgaccag 2760

gtgcccttcg acggcatgtg gatcgacatg aacgagccca gcaacttcat ccggggcagc 2820gtgcccttcg acggcatgtg gatcgacatg aacgagccca gcaacttcat ccggggcagc 2820

gaggatggct gccccaacaa cgaactggaa aatccccctt acgtgcccgg cgtcgtgggc 2880gaggatggct gccccaacaa cgaactggaa aatccccctt acgtgcccgg cgtcgtgggc 2880

ggaacactgc aggccgctac aatctgtgcc agcagccacc agtttctgag cacccactac 2940ggaacactgc aggccgctac aatctgtgcc agcagccacc agtttctgag cacccactac 2940

aacctgcaca acctgtacgg cctgaccgag gccattgcca gccaccgcgc tctcgtgaaa 3000aacctgcaca acctgtacgg cctgaccgag gccattgcca gccaccgcgc tctcgtgaaa 3000

gccagaggca cacggccctt cgtgatcagc agaagcacct ttgccggcca cggcagatac 3060gccagaggca cacggccctt cgtgatcagc agaagcacct ttgccggcca cggcagatac 3060

gccggacatt ggactggcga cgtgtggtcc tcttgggagc agctggcctc tagcgtgccc 3120gccggacatt ggactggcga cgtgtggtcc tcttggggagc agctggcctc tagcgtgccc 3120

gagatcctgc agttcaatct gctgggcgtg ccactcgtgg gcgccgatgt gtgtggcttc 3180gagatcctgc agttcaatct gctgggcgtg ccactcgtgg gcgccgatgt gtgtggcttc 3180

ctgggcaaca cctccgagga actgtgtgtg cggtggacac agctgggcgc cttctaccct 3240ctgggcaaca cctccgagga actgtgtgtg cggtggacac agctgggcgc cttctaccct 3240

ttcatgagaa accacaacag cctgctgagc ctgccccagg aaccctacag ctttagcgag 3300ttcatgagaa accacaacag cctgctgagc ctgccccagg aaccctacag ctttagcgag 3300

cctgcacagc aggccatgcg gaaggccctg acactgagat acgctctgct gccccacctg 3360cctgcacagc aggccatgcg gaaggccctg acactgagat acgctctgct gccccacctg 3360

tacaccctgt ttcaccaggc ccatgtggcc ggcgagacag tggccagacc tctgtttctg 3420tacaccctgt ttcaccaggc ccatgtggcc ggcgagacag tggccagacc tctgtttctg 3420

gaattcccca aggacagcag cacctggacc gtggaccatc agctgctgtg gggagaggct 3480gaattcccca aggacagcag cacctggacc gtggaccatc agctgctgtg gggagaggct 3480

ctgctgatta ccccagtgct gcaggcaggc aaggccgaag tgaccggcta ctttcccctg 3540ctgctgatta ccccagtgct gcaggcaggc aaggccgaag tgaccggcta ctttcccctg 3540

ggcacttggt acgacctgca gaccgtgcct gtggaagccc tgggatctct gcctccacct 3600ggcacttggt acgacctgca gaccgtgcct gtggaagccc tgggatctct gcctccacct 3600

cctgccgctc ctagagagcc tgccattcac tctgagggcc agtgggtcac actgcctgcc 3660cctgccgctc ctagagagcc tgccattcac tctgagggcc agtgggtcac actgcctgcc 3660

cccctggata ccatcaacgt gcacctgagg gccggctaca tcataccact gcagggacct 3720cccctggata ccatcaacgt gcacctgagg gccggctaca tcataccact gcagggacct 3720

ggcctgacca ccaccgagtc tagacagcag ccaatggccc tggccgtggc cctgaccaaa 3780ggcctgacca ccaccgagtc tagacagcag ccaatggccc tggccgtggc cctgaccaaa 3780

ggcggagaag ctaggggcga gctgttctgg gacgatggcg agagcctgga agtgctggaa 3840ggcggagaag ctaggggcga gctgttctgg gacgatggcg agagcctgga agtgctggaa 3840

agaggcgcct atacccaagt gatcttcctg gcccggaaca acaccatcgt gaacgagctg 3900agaggcgcct atacccaagt gatcttcctg gcccggaaca acaccatcgt gaacgagctg 3900

gtgcgcgtga cctctgaagg cgctggactg cagctgcaga aagtgaccgt gctgggagtg 3960gtgcgcgtga cctctgaagg cgctggactg cagctgcaga aagtgaccgt gctggggagtg 3960

gccacagccc ctcagcaggt gctgtctaat ggcgtgcccg tgtccaactt cacctacagc 4020gccacagccc ctcagcaggt gctgtctaat ggcgtgcccg tgtccaactt cacctacagc 4020

cccgacacca aggtgctgga catctgcgtg tcactgctga tgggagagca gtttctggtg 4080cccgacacca aggtgctgga catctgcgtg tcactgctga tgggagagca gtttctggtg 4080

tcctggtgct gactcgagag atctaccggt gaattcaccg cgggtttaaa ctgtgccttc 4140tcctggtgct gactcgagag atctaccggt gaattcaccg cgggtttaaa ctgtgccttc 4140

tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc 4200tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc 4200

cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg 4260cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg 4260

tcattctatt ctggggggtg gggtgggggc tagctctaga 4300tcattctatt ctggggggtg gggtgggggc tagctctaga 4300

<210> 26<210> 26

<211> 4198<211> 4198

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco1-Δ-42<223> sp7+hGAAco1-Δ-42

<400> 26<400> 26

ctgggcacca ccttcggcgc ccaccccggc agacctagag ctgtgcctac ccagtgtgac 1380ctgggcacca ccttcggcgc ccaccccggc agacctagag ctgtgcctac ccagtgtgac 1380

gtgcccccca acagcagatt cgactgcgcc cctgacaagg ccatcaccca ggaacagtgc 1440gtgcccccca acagcagatt cgactgcgcc cctgacaagg ccatcaccca ggaacagtgc 1440

gaggccagag gctgctgcta catccctgcc aagcagggac tgcagggcgc tcagatggga 1500gaggccagag gctgctgcta catccctgcc aagcagggac tgcagggcgc tcagatggga 1500

cagccctggt gcttcttccc accctcctac cccagctaca agctggaaaa cctgagcagc 1560cagccctggt gcttcttccc accctcctac cccagctaca agctggaaaa cctgagcagc 1560

agcgagatgg gctacaccgc caccctgacc agaaccaccc ccacattctt cccaaaggac 1620agcgagatgg gctacaccgc caccctgacc agaaccaccc ccacattctt cccaaaggac 1620

atcctgaccc tgcggctgga cgtgatgatg gaaaccgaga accggctgca cttcaccatc 1680atcctgaccc tgcggctgga cgtgatgatg gaaaccgaga accggctgca cttcaccatc 1680

aaggaccccg ccaatcggag atacgaggtg cccctggaaa ccccccacgt gcactctaga 1740aaggaccccg ccaatcggag atacgaggtg cccctggaaa ccccccacgt gcactctaga 1740

gcccccagcc ctctgtacag cgtggaattc agcgaggaac ccttcggcgt gatcgtgcgg 1800gcccccagcc ctctgtacag cgtggaattc agcgaggaac ccttcggcgt gatcgtgcgg 1800

agacagctgg atggcagagt gctgctgaac accaccgtgg cccctctgtt cttcgccgac 1860agacagctgg atggcagagt gctgctgaac accaccgtgg cccctctgtt cttcgccgac 1860

cagttcctgc agctgagcac cagcctgccc agccagtaca tcacaggact ggccgagcac 1920cagttcctgc agctgagcac cagcctgccc agccagtaca tcacaggact ggccgagcac 1920

ctgagccccc tgatgctgag cacatcctgg acccggatca ccctgtggaa cagggatctg 1980ctgagccccc tgatgctgag cacatcctgg acccggatca ccctgtggaa cagggatctg 1980

gcccctaccc ctggcgccaa tctgtacggc agccaccctt tctacctggc cctggaagat 2040gcccctaccc ctggcgccaa tctgtacggc agccaccctt tctacctggc cctggaagat 2040

ggcggatctg cccacggagt gtttctgctg aactccaacg ccatggacgt ggtgctgcag 2100ggcggatctg cccacggagt gtttctgctg aactccaacg ccatggacgt ggtgctgcag 2100

cctagccctg ccctgtcttg gagaagcaca ggcggcatcc tggatgtgta catctttctg 2160cctagccctg ccctgtcttg gagaagcaca ggcggcatcc tggatgtgta catctttctg 2160

ggccccgagc ccaagagcgt ggtgcagcag tatctggatg tcgtgggcta ccccttcatg 2220ggccccgagc ccaagagcgt ggtgcagcag tatctggatg tcgtgggcta ccccttcatg 2220

cccccttact ggggcctggg attccacctg tgcagatggg gctactccag caccgccatc 2280cccccttact ggggcctggg attccacctg tgcagatggg gctactccag caccgccatc 2280

accagacagg tggtggaaaa catgaccaga gcccacttcc cactggatgt gcagtggaac 2340accagacagg tggtggaaaa catgaccaga gcccacttcc cactggatgt gcagtggaac 2340

gacctggact acatggacag cagacgggac ttcaccttca acaaggacgg cttccgggac 2400gacctggact acatggacag cagacgggac ttcaccttca acaaggacgg cttccgggac 2400

ttccccgcca tggtgcagga actgcatcag ggcggcagac ggtacatgat gatcgtggat 2460ttccccgcca tggtgcagga actgcatcag ggcggcagac ggtacatgat gatcgtggat 2460

cccgccatca gctcctctgg ccctgccggc tcttacagac cctacgacga gggcctgcgg 2520cccgccatca gctcctctgg ccctgccggc tcttacagac cctacgacga gggcctgcgg 2520

agaggcgtgt tcatcaccaa cgagacaggc cagcccctga tcggcaaagt gtggcctggc 2580agaggcgtgt tcatcaccaa cgagacaggc cagcccctga tcggcaaagt gtggcctggc 2580

agcacagcct tccccgactt caccaatcct accgccctgg cttggtggga ggacatggtg 2640agcacagcct tccccgactt caccaatcct accgccctgg cttggtggga ggacatggtg 2640

gccgagttcc acgaccaggt gcccttcgac ggcatgtgga tcgacatgaa cgagcccagc 2700gccgagttcc acgaccaggt gcccttcgac ggcatgtgga tcgacatgaa cgagcccagc 2700

aacttcatcc ggggcagcga ggatggctgc cccaacaacg aactggaaaa tcccccttac 2760aacttcatcc ggggcagcga ggatggctgc cccaacaacg aactggaaaa tcccccttac 2760

gtgcccggcg tcgtgggcgg aacactgcag gccgctacaa tctgtgccag cagccaccag 2820gtgcccggcg tcgtgggcgg aacactgcag gccgctacaa tctgtgccag cagccaccag 2820

tttctgagca cccactacaa cctgcacaac ctgtacggcc tgaccgaggc cattgccagc 2880tttctgagca cccactacaa cctgcacaac ctgtacggcc tgaccgaggc cattgccagc 2880

caccgcgctc tcgtgaaagc cagaggcaca cggcccttcg tgatcagcag aagcaccttt 2940caccgcgctc tcgtgaaagc cagaggcaca cggcccttcg tgatcagcag aagcaccttt 2940

gccggccacg gcagatacgc cggacattgg actggcgacg tgtggtcctc ttgggagcag 3000gccggccacg gcagatacgc cggacattgg actggcgacg tgtggtcctc ttggggagcag 3000

ctggcctcta gcgtgcccga gatcctgcag ttcaatctgc tgggcgtgcc actcgtgggc 3060ctggcctcta gcgtgcccga gatcctgcag ttcaatctgc tgggcgtgcc actcgtgggc 3060

gccgatgtgt gtggcttcct gggcaacacc tccgaggaac tgtgtgtgcg gtggacacag 3120gccgatgtgt gtggcttcct gggcaacacc tccgaggaac tgtgtgtgcg gtggacacag 3120

ctgggcgcct tctacccttt catgagaaac cacaacagcc tgctgagcct gccccaggaa 3180ctgggcgcct tctacccttt catgagaaac cacaacagcc tgctgagcct gccccaggaa 3180

ccctacagct ttagcgagcc tgcacagcag gccatgcgga aggccctgac actgagatac 3240ccctacagct ttagcgagcc tgcacagcag gccatgcgga aggccctgac actgagatac 3240

gctctgctgc cccacctgta caccctgttt caccaggccc atgtggccgg cgagacagtg 3300gctctgctgc cccacctgta caccctgttt caccaggccc atgtggccgg cgagacagtg 3300

gccagacctc tgtttctgga attccccaag gacagcagca cctggaccgt ggaccatcag 3360gccagacctc tgtttctgga attccccaag gacagcagca cctggaccgt ggaccatcag 3360

ctgctgtggg gagaggctct gctgattacc ccagtgctgc aggcaggcaa ggccgaagtg 3420ctgctgtggg gagaggctct gctgattacc ccagtgctgc aggcaggcaa ggccgaagtg 3420

accggctact ttcccctggg cacttggtac gacctgcaga ccgtgcctgt ggaagccctg 3480accggctact ttcccctggg cacttggtac gacctgcaga ccgtgcctgt ggaagccctg 3480

ggatctctgc ctccacctcc tgccgctcct agagagcctg ccattcactc tgagggccag 3540ggatctctgc ctccacctcc tgccgctcct agagagcctg ccattcactc tgagggccag 3540

tgggtcacac tgcctgcccc cctggatacc atcaacgtgc acctgagggc cggctacatc 3600tgggtcacac tgcctgcccc cctggatacc atcaacgtgc acctgagggc cggctacatc 3600

ataccactgc agggacctgg cctgaccacc accgagtcta gacagcagcc aatggccctg 3660ataccactgc agggacctgg cctgaccacc accgagtcta gacagcagcc aatggccctg 3660

gccgtggccc tgaccaaagg cggagaagct aggggcgagc tgttctggga cgatggcgag 3720gccgtggccc tgaccaaagg cggagaagct aggggcgagc tgttctggga cgatggcgag 3720

agcctggaag tgctggaaag aggcgcctat acccaagtga tcttcctggc ccggaacaac 3780agcctggaag tgctggaaag aggcgcctat acccaagtga tcttcctggc ccggaacaac 3780

accatcgtga acgagctggt gcgcgtgacc tctgaaggcg ctggactgca gctgcagaaa 3840accatcgtga acgagctggt gcgcgtgacc tctgaaggcg ctggactgca gctgcagaaa 3840

gtgaccgtgc tgggagtggc cacagcccct cagcaggtgc tgtctaatgg cgtgcccgtg 3900gtgaccgtgc tgggagtggc cacagcccct cagcaggtgc tgtctaatgg cgtgcccgtg 3900

tccaacttca cctacagccc cgacaccaag gtgctggaca tctgcgtgtc actgctgatg 3960tccaacttca cctacagccc cgacaccaag gtgctggaca tctgcgtgtc actgctgatg 3960

ggagagcagt ttctggtgtc ctggtgctga ctcgagagat ctaccggtga attcaccgcg 4020ggagagcagt ttctggtgtc ctggtgctga ctcgagagat ctaccggtga attcaccgcg 4020

ggtttaaact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 4080ggtttaaact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 4080

cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 4140cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 4140

gcattgtctg agtaggtgtc attctattct ggggggtggg gtgggggcta gctctaga 4198gcattgtctg agtaggtgtc attctattct gggggtggg gtgggggcta gctctaga 4198

<210> 27<210> 27

<211> 917<211> 917

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAA-Δ-8<223> hGAA-Δ-8

<400> 27<400> 27

Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val Leu Glu GluLeu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val Leu Glu Glu

1 5 10 151 5 10 15

Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly Pro Arg AspThr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly Pro Arg Asp

20 25 30 20 25 30

Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr Gln Cys AspAla Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp

35 40 45 35 40 45

Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile ThrVal Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr

50 55 60 50 55 60

Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys GlnGln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln

65 70 75 8065 70 75 80

Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe Phe Pro ProGly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro

85 90 95 85 90 95

Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser Glu Met GlySer Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly

100 105 110 100 105 110

Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys AspTyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp

115 120 125 115 120 125

Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu Asn Arg LeuIle Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu Asn Arg Leu

130 135 140 130 135 140

His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro LeuHis Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu

145 150 155 160145 150 155 160

Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu Tyr Ser ValGlu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val

165 170 175 165 170 175

Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg Gln Leu AspGlu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg Gln Leu Asp

180 185 190 180 185 190

Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe Phe Ala AspGly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp

195 200 205 195 200 205

Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr GlyGln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly

210 215 220 210 215 220

Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser Trp Thr ArgLeu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser Trp Thr Arg

225 230 235 240225 230 235 240

Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn LeuIle Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu

245 250 255 245 250 255

Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser AlaTyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala

260 265 270 260 265 270

His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val Val Leu GlnHis Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val Val Leu Gln

275 280 285 275 280 285

Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp ValPro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp Val

290 295 300 290 295 300

Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln Gln Tyr LeuTyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu

305 310 315 320305 310 315 320

Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly PheAsp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe

325 330 335 325 330 335

His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln ValHis Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val

340 345 350 340 345 350

Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val Gln Trp AsnVal Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val Gln Trp Asn

355 360 365 355 360 365

Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys AspAsp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp

370 375 380 370 375 380

Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His Gln Gly GlyGly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His Gln Gly Gly

385 390 395 400385 390 395 400

Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser Ser Gly ProArg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser Ser Gly Pro

405 410 415 405 410 415

Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val PheAla Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe

420 425 430 420 425 430

Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val Trp Pro GlyIle Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly

435 440 445 435 440 445

Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp TrpSer Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp

450 455 460 450 455 460

Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe Asp Gly MetGlu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe Asp Gly Met

465 470 475 480465 470 475 480

Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu AspTrp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp

485 490 495 485 490 495

Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly ValGly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val

500 505 510 500 505 510

Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser Ser His GlnVal Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser Ser His Gln

515 520 525 515 520 525

Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr GluPhe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu

530 535 540 530 535 540

Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly Thr Arg ProAla Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro

545 550 555 560545 550 555 560

Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg Tyr Ala GlyPhe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg Tyr Ala Gly

565 570 575 565 570 575

His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu Ala Ser SerHis Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser

580 585 590 580 585 590

Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro Leu Val GlyVal Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro Leu Val Gly

595 600 605 595 600 605

Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys ValAla Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val

610 615 620 610 615 620

Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg Asn His AsnArg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg Asn His Asn

625 630 635 640625 630 635 640

Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro AlaSer Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala

645 650 655 645 650 655

Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu ProGln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro

660 665 670 660 665 670

His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly Glu Thr ValHis Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly Glu Thr Val

675 680 685 675 680 685

Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp ThrAla Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr

690 695 700 690 695 700

Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro ValVal Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val

705 710 715 720705 710 715 720

Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly ThrLeu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr

725 730 735 725 730 735

Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly Ser Leu ProTrp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly Ser Leu Pro

740 745 750 740 745 750

Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser Glu Gly GlnPro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser Glu Gly Gln

755 760 765 755 760 765

Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val His Leu ArgTrp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val His Leu Arg

770 775 780 770 775 780

Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr GluAla Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu

785 790 795 800785 790 795 800

Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr Lys Gly GlySer Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr Lys Gly Gly

805 810 815 805 810 815

Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu ValGlu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val

820 825 830 820 825 830

Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn AsnLeu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn

835 840 845 835 840 845

Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly LeuThr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu

850 855 860 850 855 860

Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gln GlnGln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gln Gln

865 870 875 880865 870 875 880

Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro AspVal Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp

885 890 895 885 890 895

Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly Glu Gln PheThr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly Glu Gln Phe

900 905 910 900 905 910

Leu Val Ser Trp CysLeu Val Ser Trp Cys

915 915

<210> 28<210> 28

<211> 883<211> 883

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAA-Δ-42<223> hGAA-Δ-42

<400> 28<400> 28

Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp Val ProAla His Pro Gly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp Val Pro

1 5 10 151 5 10 15

Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln GluPro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu

20 25 30 20 25 30

Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly LeuGln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu

35 40 45 35 40 45

Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser TyrGln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr

50 55 60 50 55 60

Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr ThrPro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr

65 70 75 8065 70 75 80

Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile LeuAla Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu

85 90 95 85 90 95

Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu Asn Arg Leu His PheThr Leu Arg Leu Asp Val Met Met Glu Thr Glu Asn Arg Leu His Phe

100 105 110 100 105 110

Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu ThrThr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr

115 120 125 115 120 125

Pro His Val His Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu PhePro His Val His Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe

130 135 140 130 135 140

Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg Gln Leu Asp Gly ArgSer Glu Glu Pro Phe Gly Val Ile Val Arg Arg Gln Leu Asp Gly Arg

145 150 155 160145 150 155 160

Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp Gln PheVal Leu Leu Asn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe

165 170 175 165 170 175

Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu AlaLeu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala

180 185 190 180 185 190

Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser Trp Thr Arg Ile ThrGlu His Leu Ser Pro Leu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr

195 200 205 195 200 205

Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr GlyLeu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly

210 215 220 210 215 220

Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala His GlySer His Pro Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala His Gly

225 230 235 240225 230 235 240

Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val Val Leu Gln Pro SerVal Phe Leu Leu Asn Ser Asn Ala Met Asp Val Val Leu Gln Pro Ser

245 250 255 245 250 255

Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr IlePro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile

260 265 270 260 265 270

Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu Asp ValPhe Leu Gly Pro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu Asp Val

275 280 285 275 280 285

Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe His LeuVal Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe His Leu

290 295 300 290 295 300

Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val Val GluCys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val Val Glu

305 310 315 320305 310 315 320

Asn Met Thr Arg Ala His Phe Pro Leu Asp Val Gln Trp Asn Asp LeuAsn Met Thr Arg Ala His Phe Pro Leu Asp Val Gln Trp Asn Asp Leu

325 330 335 325 330 335

Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly PheAsp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe

340 345 350 340 345 350

Arg Asp Phe Pro Ala Met Val Gln Glu Leu His Gln Gly Gly Arg ArgArg Asp Phe Pro Ala Met Val Gln Glu Leu His Gln Gly Gly Arg Arg

355 360 365 355 360 365

Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala GlyTyr Met Met Ile Val Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly

370 375 380 370 375 380

Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe Ile ThrSer Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr

385 390 395 400385 390 395 400

Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly Ser ThrAsn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr

405 410 415 405 410 415

Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu AspAla Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp

420 425 430 420 425 430

Met Val Ala Glu Phe His Asp Gln Val Pro Phe Asp Gly Met Trp IleMet Val Ala Glu Phe His Asp Gln Val Pro Phe Asp Gly Met Trp Ile

435 440 445 435 440 445

Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly CysAsp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys

450 455 460 450 455 460

Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val Val GlyPro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val Val Gly

465 470 475 480465 470 475 480

Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser Ser His Gln Phe LeuGly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser Ser His Gln Phe Leu

485 490 495 485 490 495

Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu Ala IleSer Thr His Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile

500 505 510 500 505 510

Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro Phe ValAla Ser His Arg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro Phe Val

515 520 525 515 520 525

Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg Tyr Ala Gly His TrpIle Ser Arg Ser Thr Phe Ala Gly His Gly Arg Tyr Ala Gly His Trp

530 535 540 530 535 540

Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser Val ProThr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser Val Pro

545 550 555 560545 550 555 560

Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro Leu Val Gly Ala AspGlu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro Leu Val Gly Ala Asp

565 570 575 565 570 575

Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val Arg TrpVal Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp

580 585 590 580 585 590

Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg Asn His Asn Ser LeuThr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg Asn His Asn Ser Leu

595 600 605 595 600 605

Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln GlnLeu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln

610 615 620 610 615 620

Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro His LeuAla Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu

625 630 635 640625 630 635 640

Tyr Thr Leu Phe His Gln Ala His Val Ala Gly Glu Thr Val Ala ArgTyr Thr Leu Phe His Gln Ala His Val Ala Gly Glu Thr Val Ala Arg

645 650 655 645 650 655

Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr Val AspPro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp

660 665 670 660 665 670

His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val Leu GlnHis Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln

675 680 685 675 680 685

Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp TyrAla Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr

690 695 700 690 695 700

Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly Ser Leu Pro Pro ProAsp Leu Gln Thr Val Pro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro

705 710 715 720705 710 715 720

Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser Glu Gly Gln Trp ValPro Ala Ala Pro Arg Glu Pro Ala Ile His Ser Glu Gly Gln Trp Val

725 730 735 725 730 735

Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val His Leu Arg Ala GlyThr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val His Leu Arg Ala Gly

740 745 750 740 745 750

Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser ArgTyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg

755 760 765 755 760 765

Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr Lys Gly Gly Glu AlaGln Gln Pro Met Ala Leu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala

770 775 780 770 775 780

Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu GluArg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu

785 790 795 800785 790 795 800

Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn Thr IleArg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile

805 810 815 805 810 815

Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gln LeuVal Asn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu

820 825 830 820 825 830

Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gln Gln Val LeuGln Lys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gln Gln Val Leu

835 840 845 835 840 845

Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr LysSer Asn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys

850 855 860 850 855 860

Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly Glu Gln Phe Leu ValVal Leu Asp Ile Cys Val Ser Leu Leu Met Gly Glu Gln Phe Leu Val

865 870 875 880865 870 875 880

Ser Trp CysSer Trp Cys

<210> 29<210> 29

<211> 952<211> 952

<212> PRT<212> PRT

<213> 智人<213> Homo sapiens

<400> 29<400> 29

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Ile Glu Ala Leu GlyLeu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Ile Glu Ala Leu Gly

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

850 855 860 850 855 860

865 870 875 880865 870 875 880

885 890 895 885 890 895

900 905 910 900 905 910

915 920 925 915 920 925

930 935 940 930 935 940

Glu Gln Phe Leu Val Ser Trp CysGlu Gln Phe Leu Val Ser Trp Cys

945 950945 950

<210> 30<210> 30

<211> 952<211> 952

<212> PRT<212> PRT

<213> 智人<213> Homo sapiens

<400> 30<400> 30

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

Val Pro Leu Glu Thr Pro Arg Val His Ser Arg Ala Pro Ser Pro LeuVal Pro Leu Glu Thr Pro Arg Val His Ser Arg Ala Pro Ser Pro Leu

195 200 205 195 200 205

Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val His ArgTyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val His Arg

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

850 855 860 850 855 860

865 870 875 880865 870 875 880

885 890 895 885 890 895

900 905 910 900 905 910

915 920 925 915 920 925

930 935 940 930 935 940

Glu Gln Phe Leu Val Ser Trp CysGlu Gln Phe Leu Val Ser Trp Cys

945 950945 950

<210> 31<210> 31

<211> 957<211> 957

<212> PRT<212> PRT

<213> 智人<213> Homo sapiens

<400> 31<400> 31

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

850 855 860 850 855 860

865 870 875 880865 870 875 880

885 890 895 885 890 895

900 905 910 900 905 910

915 920 925 915 920 925

Ser Pro Asp Thr Lys Ala Arg Gly Pro Arg Val Leu Asp Ile Cys ValSer Pro Asp Thr Lys Ala Arg Gly Pro Arg Val Leu Asp Ile Cys Val

930 935 940 930 935 940

945 950 955945 950 955

<210> 32<210> 32

<211> 952<211> 952

<212> PRT<212> PRT

<213> 智人<213> Homo sapiens

<400> 32<400> 32

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

Ser Gly Pro Ala Gly Ser Tyr Arg Leu Tyr Asp Glu Gly Leu Arg ArgSer Gly Pro Ala Gly Ser Tyr Arg Leu Tyr Asp Glu Gly Leu Arg Arg

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

850 855 860 850 855 860

865 870 875 880865 870 875 880

885 890 895 885 890 895

900 905 910 900 905 910

915 920 925 915 920 925

930 935 940 930 935 940

Glu Gln Phe Leu Val Ser Trp CysGlu Gln Phe Leu Val Ser Trp Cys

945 950945 950

<210> 33<210> 33

<211> 925<211> 925

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 变体hGAAwt w/o sp<223> variant hGAAwt w/o sp

<400> 33<400> 33

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro Arg Val His Ser ArgAsn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro Arg Val His Ser Arg

165 170 175 165 170 175

180 185 190 180 185 190

Val Ile Val His Arg Gln Leu Asp Gly Arg Val Leu Leu Asn Thr ThrVal Ile Val His Arg Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

Ile Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg GluIle Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

850 855 860 850 855 860

865 870 875 880865 870 875 880

885 890 895 885 890 895

900 905 910 900 905 910

915 920 925 915 920 925

<210> 34<210> 34

<211> 896<211> 896

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAA-Δ-29<223> hGAA-Δ-29

<400> 34<400> 34

Gln Gln Gly Ala Ser Arg Pro Gly Pro Arg Asp Ala Gln Ala His ProGln Gln Gly Ala Ser Arg Pro Gly Pro Arg Asp Ala Gln Ala His Pro

1 5 10 151 5 10 15

Gly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp Val Pro Pro Asn SerGly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp Val Pro Pro Asn Ser

20 25 30 20 25 30

Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln Cys GluArg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln Cys Glu

35 40 45 35 40 45

Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln Gly AlaAla Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln Gly Ala

50 55 60 50 55 60

Gln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro Ser TyrGln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro Ser Tyr

65 70 75 8065 70 75 80

Lys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala Thr LeuLys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala Thr Leu

85 90 95 85 90 95

Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr Leu ArgThr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr Leu Arg

100 105 110 100 105 110

Leu Asp Val Met Met Glu Thr Glu Asn Arg Leu His Phe Thr Ile LysLeu Asp Val Met Met Glu Thr Glu Asn Arg Leu His Phe Thr Ile Lys

115 120 125 115 120 125

Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro His ValAsp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro His Val

130 135 140 130 135 140

His Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser Glu GluHis Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser Glu Glu

145 150 155 160145 150 155 160

Pro Phe Gly Val Ile Val Arg Arg Gln Leu Asp Gly Arg Val Leu LeuPro Phe Gly Val Ile Val Arg Arg Gln Leu Asp Gly Arg Val Leu Leu

165 170 175 165 170 175

Asn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu Gln LeuAsn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu Gln Leu

180 185 190 180 185 190

Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu His LeuSer Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu His Leu

195 200 205 195 200 205

Ser Pro Leu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu Trp AsnSer Pro Leu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu Trp Asn

210 215 220 210 215 220

Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser His ProArg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser His Pro

225 230 235 240225 230 235 240

Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala His Gly Val Phe LeuPhe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala His Gly Val Phe Leu

245 250 255 245 250 255

Leu Asn Ser Asn Ala Met Asp Val Val Leu Gln Pro Ser Pro Ala LeuLeu Asn Ser Asn Ala Met Asp Val Val Leu Gln Pro Ser Pro Ala Leu

260 265 270 260 265 270

Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe Leu GlySer Trp Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe Leu Gly

275 280 285 275 280 285

Pro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu Asp Val Val Gly TyrPro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu Asp Val Val Gly Tyr

290 295 300 290 295 300

Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys Arg TrpPro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys Arg Trp

305 310 315 320305 310 315 320

Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val Val Glu Asn Met ThrGly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val Val Glu Asn Met Thr

325 330 335 325 330 335

Arg Ala His Phe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp Tyr MetArg Ala His Phe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp Tyr Met

340 345 350 340 345 350

Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg Asp PheAsp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg Asp Phe

355 360 365 355 360 365

Pro Ala Met Val Gln Glu Leu His Gln Gly Gly Arg Arg Tyr Met MetPro Ala Met Val Gln Glu Leu His Gln Gly Gly Arg Arg Tyr Met Met

370 375 380 370 375 380

Ile Val Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser Tyr ArgIle Val Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser Tyr Arg

385 390 395 400385 390 395 400

Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn Glu ThrPro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn Glu Thr

405 410 415 405 410 415

Gly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala Phe ProGly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala Phe Pro

420 425 430 420 425 430

Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met Val AlaAsp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met Val Ala

435 440 445 435 440 445

Glu Phe His Asp Gln Val Pro Phe Asp Gly Met Trp Ile Asp Met AsnGlu Phe His Asp Gln Val Pro Phe Asp Gly Met Trp Ile Asp Met Asn

450 455 460 450 455 460

Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro Asn AsnGlu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro Asn Asn

465 470 475 480465 470 475 480

Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val Val Gly Gly Thr LeuGlu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val Val Gly Gly Thr Leu

485 490 495 485 490 495

Gln Ala Ala Thr Ile Cys Ala Ser Ser His Gln Phe Leu Ser Thr HisGln Ala Ala Thr Ile Cys Ala Ser Ser His Gln Phe Leu Ser Thr His

500 505 510 500 505 510

Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala Ser HisTyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala Ser His

515 520 525 515 520 525

Arg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro Phe Val Ile Ser ArgArg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro Phe Val Ile Ser Arg

530 535 540 530 535 540

Ser Thr Phe Ala Gly His Gly Arg Tyr Ala Gly His Trp Thr Gly AspSer Thr Phe Ala Gly His Gly Arg Tyr Ala Gly His Trp Thr Gly Asp

545 550 555 560545 550 555 560

Val Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser Val Pro Glu Ile LeuVal Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser Val Pro Glu Ile Leu

565 570 575 565 570 575

Gln Phe Asn Leu Leu Gly Val Pro Leu Val Gly Ala Asp Val Cys GlyGln Phe Asn Leu Leu Gly Val Pro Leu Val Gly Ala Asp Val Cys Gly

580 585 590 580 585 590

Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr Gln LeuPhe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr Gln Leu

595 600 605 595 600 605

Gly Ala Phe Tyr Pro Phe Met Arg Asn His Asn Ser Leu Leu Ser LeuGly Ala Phe Tyr Pro Phe Met Arg Asn His Asn Ser Leu Leu Ser Leu

610 615 620 610 615 620

Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala Met ArgPro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala Met Arg

625 630 635 640625 630 635 640

Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr Thr LeuLys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr Thr Leu

645 650 655 645 650 655

Phe His Gln Ala His Val Ala Gly Glu Thr Val Ala Arg Pro Leu PhePhe His Gln Ala His Val Ala Gly Glu Thr Val Ala Arg Pro Leu Phe

660 665 670 660 665 670

Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp His Gln LeuLeu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp His Gln Leu

675 680 685 675 680 685

Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala Gly LysLeu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala Gly Lys

690 695 700 690 695 700

Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp Leu GlnAla Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp Leu Gln

705 710 715 720705 710 715 720

Thr Val Pro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala AlaThr Val Pro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala

725 730 735 725 730 735

Pro Arg Glu Pro Ala Ile His Ser Glu Gly Gln Trp Val Thr Leu ProPro Arg Glu Pro Ala Ile His Ser Glu Gly Gln Trp Val Thr Leu Pro

740 745 750 740 745 750

Ala Pro Leu Asp Thr Ile Asn Val His Leu Arg Ala Gly Tyr Ile IleAla Pro Leu Asp Thr Ile Asn Val His Leu Arg Ala Gly Tyr Ile Ile

755 760 765 755 760 765

Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln Gln ProPro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln Gln Pro

770 775 780 770 775 780

Met Ala Leu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala Arg Gly GluMet Ala Leu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala Arg Gly Glu

785 790 795 800785 790 795 800

Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg Gly AlaLeu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg Gly Ala

805 810 815 805 810 815

Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile Val Asn GluTyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile Val Asn Glu

820 825 830 820 825 830

Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln Lys ValLeu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln Lys Val

835 840 845 835 840 845

Thr Val Leu Gly Val Ala Thr Ala Pro Gln Gln Val Leu Ser Asn GlyThr Val Leu Gly Val Ala Thr Ala Pro Gln Gln Val Leu Ser Asn Gly

850 855 860 850 855 860

Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val Leu AspVal Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val Leu Asp

865 870 875 880865 870 875 880

Ile Cys Val Ser Leu Leu Met Gly Glu Gln Phe Leu Val Ser Trp CysIle Cys Val Ser Leu Leu Met Gly Glu Gln Phe Leu Val Ser Trp Cys

885 890 895 885 890 895

<210> 35<210> 35

<211> 882<211> 882

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAA-Δ-43<223> hGAA-Δ-43

<400> 35<400> 35

His Pro Gly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp Val Pro ProHis Pro Gly Arg Pro Arg Ala Val Pro Thr Gln Cys Asp Val Pro Pro

1 5 10 151 5 10 15

Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu GlnAsn Ser Arg Phe Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln

20 25 30 20 25 30

Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu GlnCys Glu Ala Arg Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln

35 40 45 35 40 45

Gly Ala Gln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr ProGly Ala Gln Met Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro

50 55 60 50 55 60

Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr AlaSer Tyr Lys Leu Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala

65 70 75 8065 70 75 80

Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu ThrThr Leu Thr Arg Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr

85 90 95 85 90 95

Leu Arg Leu Asp Val Met Met Glu Thr Glu Asn Arg Leu His Phe ThrLeu Arg Leu Asp Val Met Met Glu Thr Glu Asn Arg Leu His Phe Thr

100 105 110 100 105 110

Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr ProIle Lys Asp Pro Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro

115 120 125 115 120 125

His Val His Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe SerHis Val His Ser Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser

130 135 140 130 135 140

Glu Glu Pro Phe Gly Val Ile Val Arg Arg Gln Leu Asp Gly Arg ValGlu Glu Pro Phe Gly Val Ile Val Arg Arg Gln Leu Asp Gly Arg Val

145 150 155 160145 150 155 160

Leu Leu Asn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe LeuLeu Leu Asn Thr Thr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu

165 170 175 165 170 175

Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala GluGln Leu Ser Thr Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu

180 185 190 180 185 190

His Leu Ser Pro Leu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr LeuHis Leu Ser Pro Leu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu

195 200 205 195 200 205

Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly SerTrp Asn Arg Asp Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser

210 215 220 210 215 220

His Pro Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala His Gly ValHis Pro Phe Tyr Leu Ala Leu Glu Asp Gly Gly Ser Ala His Gly Val

225 230 235 240225 230 235 240

Phe Leu Leu Asn Ser Asn Ala Met Asp Val Val Leu Gln Pro Ser ProPhe Leu Leu Asn Ser Asn Ala Met Asp Val Val Leu Gln Pro Ser Pro

245 250 255 245 250 255

Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile PheAla Leu Ser Trp Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe

260 265 270 260 265 270

Leu Gly Pro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu Asp Val ValLeu Gly Pro Glu Pro Lys Ser Val Val Gln Gln Tyr Leu Asp Val Val

275 280 285 275 280 285

Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe His Leu CysGly Tyr Pro Phe Met Pro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys

290 295 300 290 295 300

Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val Val Glu AsnArg Trp Gly Tyr Ser Ser Thr Ala Ile Thr Arg Gln Val Val Glu Asn

305 310 315 320305 310 315 320

Met Thr Arg Ala His Phe Pro Leu Asp Val Gln Trp Asn Asp Leu AspMet Thr Arg Ala His Phe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp

325 330 335 325 330 335

Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe ArgTyr Met Asp Ser Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg

340 345 350 340 345 350

Asp Phe Pro Ala Met Val Gln Glu Leu His Gln Gly Gly Arg Arg TyrAsp Phe Pro Ala Met Val Gln Glu Leu His Gln Gly Gly Arg Arg Tyr

355 360 365 355 360 365

Met Met Ile Val Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly SerMet Met Ile Val Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser

370 375 380 370 375 380

Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr AsnTyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn

385 390 395 400385 390 395 400

Glu Thr Gly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr AlaGlu Thr Gly Gln Pro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala

405 410 415 405 410 415

Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp MetPhe Pro Asp Phe Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met

420 425 430 420 425 430

Val Ala Glu Phe His Asp Gln Val Pro Phe Asp Gly Met Trp Ile AspVal Ala Glu Phe His Asp Gln Val Pro Phe Asp Gly Met Trp Ile Asp

435 440 445 435 440 445

Met Asn Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys ProMet Asn Glu Pro Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro

450 455 460 450 455 460

Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val Val Gly GlyAsn Asn Glu Leu Glu Asn Pro Pro Tyr Val Pro Gly Val Val Gly Gly

465 470 475 480465 470 475 480

Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser Ser His Gln Phe Leu SerThr Leu Gln Ala Ala Thr Ile Cys Ala Ser Ser His Gln Phe Leu Ser

485 490 495 485 490 495

Thr His Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile AlaThr His Tyr Asn Leu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala

500 505 510 500 505 510

Ser His Arg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro Phe Val IleSer His Arg Ala Leu Val Lys Ala Arg Gly Thr Arg Pro Phe Val Ile

515 520 525 515 520 525

Ser Arg Ser Thr Phe Ala Gly His Gly Arg Tyr Ala Gly His Trp ThrSer Arg Ser Thr Phe Ala Gly His Gly Arg Tyr Ala Gly His Trp Thr

530 535 540 530 535 540

Gly Asp Val Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser Val Pro GluGly Asp Val Trp Ser Ser Trp Glu Gln Leu Ala Ser Ser Val Pro Glu

545 550 555 560545 550 555 560

Ile Leu Gln Phe Asn Leu Leu Gly Val Pro Leu Val Gly Ala Asp ValIle Leu Gln Phe Asn Leu Leu Gly Val Pro Leu Val Gly Ala Asp Val

565 570 575 565 570 575

Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp ThrCys Gly Phe Leu Gly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr

580 585 590 580 585 590

Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg Asn His Asn Ser Leu LeuGln Leu Gly Ala Phe Tyr Pro Phe Met Arg Asn His Asn Ser Leu Leu

595 600 605 595 600 605

Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln AlaSer Leu Pro Gln Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala

610 615 620 610 615 620

Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu TyrMet Arg Lys Ala Leu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr

625 630 635 640625 630 635 640

Thr Leu Phe His Gln Ala His Val Ala Gly Glu Thr Val Ala Arg ProThr Leu Phe His Gln Ala His Val Ala Gly Glu Thr Val Ala Arg Pro

645 650 655 645 650 655

Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp HisLeu Phe Leu Glu Phe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp His

660 665 670 660 665 670

Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln AlaGln Leu Leu Trp Gly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala

675 680 685 675 680 685

Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr AspGly Lys Ala Glu Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp

690 695 700 690 695 700

Leu Gln Thr Val Pro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro ProLeu Gln Thr Val Pro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro

705 710 715 720705 710 715 720

Ala Ala Pro Arg Glu Pro Ala Ile His Ser Glu Gly Gln Trp Val ThrAla Ala Pro Arg Glu Pro Ala Ile His Ser Glu Gly Gln Trp Val Thr

725 730 735 725 730 735

Leu Pro Ala Pro Leu Asp Thr Ile Asn Val His Leu Arg Ala Gly TyrLeu Pro Ala Pro Leu Asp Thr Ile Asn Val His Leu Arg Ala Gly Tyr

740 745 750 740 745 750

Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg GlnIle Ile Pro Leu Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln

755 760 765 755 760 765

Gln Pro Met Ala Leu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala ArgGln Pro Met Ala Leu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala Arg

770 775 780 770 775 780

Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu ArgGly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg

785 790 795 800785 790 795 800

Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile ValGly Ala Tyr Thr Gln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile Val

805 810 815 805 810 815

Asn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu GlnAsn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln

820 825 830 820 825 830

Lys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gln Gln Val Leu SerLys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gln Gln Val Leu Ser

835 840 845 835 840 845

Asn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys ValAsn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val

850 855 860 850 855 860

Leu Asp Ile Cys Val Ser Leu Leu Met Gly Glu Gln Phe Leu Val SerLeu Asp Ile Cys Val Ser Leu Leu Met Gly Glu Gln Phe Leu Val Ser

865 870 875 880865 870 875 880

Trp Cystc

<210> 36<210> 36

<211> 878<211> 878

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAA-Δ-47<223> hGAA-Δ-47

<400> 36<400> 36

Pro Arg Ala Val Pro Thr Gln Cys Asp Val Pro Pro Asn Ser Arg PhePro Arg Ala Val Pro Thr Gln Cys Asp Val Pro Pro Asn Ser Arg Phe

1 5 10 151 5 10 15

Asp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln Cys Glu Ala ArgAsp Cys Ala Pro Asp Lys Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg

20 25 30 20 25 30

Gly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln Gly Ala Gln MetGly Cys Cys Tyr Ile Pro Ala Lys Gln Gly Leu Gln Gly Ala Gln Met

35 40 45 35 40 45

Gly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro Ser Tyr Lys LeuGly Gln Pro Trp Cys Phe Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu

50 55 60 50 55 60

Glu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala Thr Leu Thr ArgGlu Asn Leu Ser Ser Ser Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg

65 70 75 8065 70 75 80

Thr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr Leu Arg Leu AspThr Thr Pro Thr Phe Phe Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp

85 90 95 85 90 95

Val Met Met Glu Thr Glu Asn Arg Leu His Phe Thr Ile Lys Asp ProVal Met Met Glu Thr Glu Asn Arg Leu His Phe Thr Ile Lys Asp Pro

100 105 110 100 105 110

Ala Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro His Val His SerAla Asn Arg Arg Tyr Glu Val Pro Leu Glu Thr Pro His Val His Ser

115 120 125 115 120 125

Arg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser Glu Glu Pro PheArg Ala Pro Ser Pro Leu Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe

130 135 140 130 135 140

Gly Val Ile Val Arg Arg Gln Leu Asp Gly Arg Val Leu Leu Asn ThrGly Val Ile Val Arg Arg Gln Leu Asp Gly Arg Val Leu Leu Asn Thr

145 150 155 160145 150 155 160

Thr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu Gln Leu Ser ThrThr Val Ala Pro Leu Phe Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr

165 170 175 165 170 175

Ser Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu His Leu Ser ProSer Leu Pro Ser Gln Tyr Ile Thr Gly Leu Ala Glu His Leu Ser Pro

180 185 190 180 185 190

Leu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu Trp Asn Arg AspLeu Met Leu Ser Thr Ser Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp

195 200 205 195 200 205

Leu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser His Pro Phe TyrLeu Ala Pro Thr Pro Gly Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr

210 215 220 210 215 220

Leu Ala Leu Glu Asp Gly Gly Ser Ala His Gly Val Phe Leu Leu AsnLeu Ala Leu Glu Asp Gly Gly Ser Ala His Gly Val Phe Leu Leu Asn

225 230 235 240225 230 235 240

Ser Asn Ala Met Asp Val Val Leu Gln Pro Ser Pro Ala Leu Ser TrpSer Asn Ala Met Asp Val Val Leu Gln Pro Ser Pro Ala Leu Ser Trp

245 250 255 245 250 255

Arg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe Leu Gly Pro GluArg Ser Thr Gly Gly Ile Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu

260 265 270 260 265 270

Pro Lys Ser Val Val Gln Gln Tyr Leu Asp Val Val Gly Tyr Pro PhePro Lys Ser Val Val Gln Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe

275 280 285 275 280 285

Met Pro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys Arg Trp Gly TyrMet Pro Pro Tyr Trp Gly Leu Gly Phe His Leu Cys Arg Trp Gly Tyr

290 295 300 290 295 300

Ser Ser Thr Ala Ile Thr Arg Gln Val Val Glu Asn Met Thr Arg AlaSer Ser Thr Ala Ile Thr Arg Gln Val Val Glu Asn Met Thr Arg Ala

305 310 315 320305 310 315 320

His Phe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp Tyr Met Asp SerHis Phe Pro Leu Asp Val Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser

325 330 335 325 330 335

Arg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg Asp Phe Pro AlaArg Arg Asp Phe Thr Phe Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala

340 345 350 340 345 350

Met Val Gln Glu Leu His Gln Gly Gly Arg Arg Tyr Met Met Ile ValMet Val Gln Glu Leu His Gln Gly Gly Arg Arg Tyr Met Met Ile Val

355 360 365 355 360 365

Asp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser Tyr Arg Pro TyrAsp Pro Ala Ile Ser Ser Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr

370 375 380 370 375 380

Asp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn Glu Thr Gly GlnAsp Glu Gly Leu Arg Arg Gly Val Phe Ile Thr Asn Glu Thr Gly Gln

385 390 395 400385 390 395 400

Pro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala Phe Pro Asp PhePro Leu Ile Gly Lys Val Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe

405 410 415 405 410 415

Thr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met Val Ala Glu PheThr Asn Pro Thr Ala Leu Ala Trp Trp Glu Asp Met Val Ala Glu Phe

420 425 430 420 425 430

His Asp Gln Val Pro Phe Asp Gly Met Trp Ile Asp Met Asn Glu ProHis Asp Gln Val Pro Phe Asp Gly Met Trp Ile Asp Met Asn Glu Pro

435 440 445 435 440 445

Ser Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro Asn Asn Glu LeuSer Asn Phe Ile Arg Gly Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu

450 455 460 450 455 460

Glu Asn Pro Pro Tyr Val Pro Gly Val Val Gly Gly Thr Leu Gln AlaGlu Asn Pro Pro Tyr Val Pro Gly Val Val Gly Gly Thr Leu Gln Ala

465 470 475 480465 470 475 480

Ala Thr Ile Cys Ala Ser Ser His Gln Phe Leu Ser Thr His Tyr AsnAla Thr Ile Cys Ala Ser Ser His Gln Phe Leu Ser Thr His Tyr Asn

485 490 495 485 490 495

Leu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala Ser His Arg AlaLeu His Asn Leu Tyr Gly Leu Thr Glu Ala Ile Ala Ser His Arg Ala

500 505 510 500 505 510

Leu Val Lys Ala Arg Gly Thr Arg Pro Phe Val Ile Ser Arg Ser ThrLeu Val Lys Ala Arg Gly Thr Arg Pro Phe Val Ile Ser Arg Ser Thr

515 520 525 515 520 525

Phe Ala Gly His Gly Arg Tyr Ala Gly His Trp Thr Gly Asp Val TrpPhe Ala Gly His Gly Arg Tyr Ala Gly His Trp Thr Gly Asp Val Trp

530 535 540 530 535 540

Ser Ser Trp Glu Gln Leu Ala Ser Ser Val Pro Glu Ile Leu Gln PheSer Ser Trp Glu Gln Leu Ala Ser Ser Val Pro Glu Ile Leu Gln Phe

545 550 555 560545 550 555 560

Asn Leu Leu Gly Val Pro Leu Val Gly Ala Asp Val Cys Gly Phe LeuAsn Leu Leu Gly Val Pro Leu Val Gly Ala Asp Val Cys Gly Phe Leu

565 570 575 565 570 575

Gly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr Gln Leu Gly AlaGly Asn Thr Ser Glu Glu Leu Cys Val Arg Trp Thr Gln Leu Gly Ala

580 585 590 580 585 590

Phe Tyr Pro Phe Met Arg Asn His Asn Ser Leu Leu Ser Leu Pro GlnPhe Tyr Pro Phe Met Arg Asn His Asn Ser Leu Leu Ser Leu Pro Gln

595 600 605 595 600 605

Glu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala Met Arg Lys AlaGlu Pro Tyr Ser Phe Ser Glu Pro Ala Gln Gln Ala Met Arg Lys Ala

610 615 620 610 615 620

Leu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr Thr Leu Phe HisLeu Thr Leu Arg Tyr Ala Leu Leu Pro His Leu Tyr Thr Leu Phe His

625 630 635 640625 630 635 640

Gln Ala His Val Ala Gly Glu Thr Val Ala Arg Pro Leu Phe Leu GluGln Ala His Val Ala Gly Glu Thr Val Ala Arg Pro Leu Phe Leu Glu

645 650 655 645 650 655

Phe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp His Gln Leu Leu TrpPhe Pro Lys Asp Ser Ser Thr Trp Thr Val Asp His Gln Leu Leu Trp

660 665 670 660 665 670

Gly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala Gly Lys Ala GluGly Glu Ala Leu Leu Ile Thr Pro Val Leu Gln Ala Gly Lys Ala Glu

675 680 685 675 680 685

Val Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp Leu Gln Thr ValVal Thr Gly Tyr Phe Pro Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val

690 695 700 690 695 700

Pro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala Pro ArgPro Val Glu Ala Leu Gly Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg

705 710 715 720705 710 715 720

Glu Pro Ala Ile His Ser Glu Gly Gln Trp Val Thr Leu Pro Ala ProGlu Pro Ala Ile His Ser Glu Gly Gln Trp Val Thr Leu Pro Ala Pro

725 730 735 725 730 735

Leu Asp Thr Ile Asn Val His Leu Arg Ala Gly Tyr Ile Ile Pro LeuLeu Asp Thr Ile Asn Val His Leu Arg Ala Gly Tyr Ile Ile Pro Leu

740 745 750 740 745 750

Gln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln Gln Pro Met AlaGln Gly Pro Gly Leu Thr Thr Thr Glu Ser Arg Gln Gln Pro Met Ala

755 760 765 755 760 765

Leu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala Arg Gly Glu Leu PheLeu Ala Val Ala Leu Thr Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe

770 775 780 770 775 780

Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg Gly Ala Tyr ThrTrp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr

785 790 795 800785 790 795 800

Gln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile Val Asn Glu Leu ValGln Val Ile Phe Leu Ala Arg Asn Asn Thr Ile Val Asn Glu Leu Val

805 810 815 805 810 815

Arg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln Lys Val Thr ValArg Val Thr Ser Glu Gly Ala Gly Leu Gln Leu Gln Lys Val Thr Val

820 825 830 820 825 830

Leu Gly Val Ala Thr Ala Pro Gln Gln Val Leu Ser Asn Gly Val ProLeu Gly Val Ala Thr Ala Pro Gln Gln Val Leu Ser Asn Gly Val Pro

835 840 845 835 840 845

Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val Leu Asp Ile CysVal Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys Val Leu Asp Ile Cys

850 855 860 850 855 860

Val Ser Leu Leu Met Gly Glu Gln Phe Leu Val Ser Trp CysVal Ser Leu Leu Met Gly Glu Gln Phe Leu Val Ser Trp Cys

865 870 875865 870 875

<210> 37<210> 37

<211> 2745<211> 2745

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAwt-Δ-29<223> sp7+hGAAwt-Δ-29

<400> 37<400> 37

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccagcag 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccagcag 60

ggagccagca gaccagggcc ccgggatgcc caggcacacc ccgggcggcc gcgagcagtg 120ggagccagca gaccagggcc ccgggatgcc caggcacacc ccgggcggcc gcgagcagtg 120

cccacacagt gcgacgtccc ccccaacagc cgcttcgatt gcgcccctga caaggccatc 180cccacacagt gcgacgtccc ccccaacagc cgcttcgatt gcgcccctga caaggccatc 180

acccaggaac agtgcgaggc ccgcggctgt tgctacatcc ctgcaaagca ggggctgcag 240acccaggaac agtgcgaggc ccgcggctgt tgctacatcc ctgcaaagca ggggctgcag 240

ggagcccaga tggggcagcc ctggtgcttc ttcccaccca gctaccccag ctacaagctg 300ggagcccaga tggggcagcc ctggtgcttc ttcccaccca gctaccccag ctacaagctg 300

gagaacctga gctcctctga aatgggctac acggccaccc tgacccgtac cacccccacc 360gagaacctga gctcctctga aatgggctac acggccaccc tgacccgtac cacccccacc 360

ttcttcccca aggacatcct gaccctgcgg ctggacgtga tgatggagac tgagaaccgc 420ttcttcccca aggacatcct gaccctgcgg ctggacgtga tgatggagac tgagaaccgc 420

ctccacttca cgatcaaaga tccagctaac aggcgctacg aggtgccctt ggagaccccg 480ctccacttca cgatcaaaga tccagctaac aggcgctacg aggtgccctt ggagaccccg 480

catgtccaca gccgggcacc gtccccactc tacagcgtgg agttctccga ggagcccttc 540catgtccaca gccgggcacc gtccccactc tacagcgtgg agttctccga ggagcccttc 540

ggggtgatcg tgcgccggca gctggacggc cgcgtgctgc tgaacacgac ggtggcgccc 600ggggtgatcg tgcgccggca gctggacggc cgcgtgctgc tgaacacgac ggtggcgccc 600

ctgttctttg cggaccagtt ccttcagctg tccacctcgc tgccctcgca gtatatcaca 660ctgttctttg cggaccagtt ccttcagctg tccacctcgc tgccctcgca gtatatcaca 660

ggcctcgccg agcacctcag tcccctgatg ctcagcacca gctggaccag gatcaccctg 720ggcctcgccg agcacctcag tcccctgatg ctcagcacca gctggaccag gatcaccctg 720

tggaaccggg accttgcgcc cacgcccggt gcgaacctct acgggtctca ccctttctac 780tggaaccggg accttgcgcc cacgcccggt gcgaacctct acgggtctca ccctttctac 780

ctggcgctgg aggacggcgg gtcggcacac ggggtgttcc tgctaaacag caatgccatg 840ctggcgctgg aggacggcgg gtcggcacac ggggtgttcc tgctaaacag caatgccatg 840

gatgtggtcc tgcagccgag ccctgccctt agctggaggt cgacaggtgg gatcctggat 900gatgtggtcc tgcagccgag ccctgccctt agctggaggt cgacaggtgg gatcctggat 900

gtctacatct tcctgggccc agagcccaag agcgtggtgc agcagtacct ggacgttgtg 960gtctacatct tcctgggccc agagcccaag agcgtggtgc agcagtacct ggacgttgtg 960

ggatacccgt tcatgccgcc atactggggc ctgggcttcc acctgtgccg ctggggctac 1020ggatacccgt tcatgccgcc atactggggc ctgggcttcc acctgtgccg ctggggctac 1020

tcctccaccg ctatcacccg ccaggtggtg gagaacatga ccagggccca cttccccctg 1080tcctccaccg ctatcacccg ccaggtggtg gagaacatga ccagggccca cttccccctg 1080

gacgtccagt ggaacgacct ggactacatg gactcccgga gggacttcac gttcaacaag 1140gacgtccagt ggaacgacct ggactacatg gactccccgga gggacttcac gttcaacaag 1140

gatggcttcc gggacttccc ggccatggtg caggagctgc accagggcgg ccggcgctac 1200gatggcttcc gggacttccc ggccatggtg caggagctgc accagggcgg ccggcgctac 1200

atgatgatcg tggatcctgc catcagcagc tcgggccctg ccgggagcta caggccctac 1260atgatgatcg tggatcctgc catcagcagc tcgggccctg ccggggagcta caggccctac 1260

gacgagggtc tgcggagggg ggttttcatc accaacgaga ccggccagcc gctgattggg 1320gacgagggtc tgcggagggg ggttttcatc accaacgaga ccggccagcc gctgattggg 1320

aaggtatggc ccgggtccac tgccttcccc gacttcacca accccacagc cctggcctgg 1380aaggtatggc ccgggtccac tgccttcccc gacttcacca accccacagc cctggcctgg 1380

tgggaggaca tggtggctga gttccatgac caggtgccct tcgacggcat gtggattgac 1440tgggaggaca tggtggctga gttccatgac caggtgccct tcgacggcat gtggattgac 1440

atgaacgagc cttccaactt catcaggggc tctgaggacg gctgccccaa caatgagctg 1500atgaacgagc cttccaactt catcaggggc tctgaggacg gctgccccaa caatgagctg 1500

gagaacccac cctacgtgcc tggggtggtt ggggggaccc tccaggcggc caccatctgt 1560gagaacccac cctacgtgcc tggggtggtt ggggggaccc tccaggcggc caccatctgt 1560

gcctccagcc accagtttct ctccacacac tacaacctgc acaacctcta cggcctgacc 1620gcctccagcc accagtttct ctccacacac tacaacctgc acaacctcta cggcctgacc 1620

gaagccatcg cctcccacag ggcgctggtg aaggctcggg ggacacgccc atttgtgatc 1680gaagccatcg cctccccacag ggcgctggtg aaggctcggg ggacacgccc atttgtgatc 1680

tcccgctcga cctttgctgg ccacggccga tacgccggcc actggacggg ggacgtgtgg 1740tcccgctcga cctttgctgg ccacggccga tacgccggcc actggacggg ggacgtgtgg 1740

agctcctggg agcagctcgc ctcctccgtg ccagaaatcc tgcagtttaa cctgctgggg 1800agctcctggg agcagctcgc ctcctccgtg ccagaaatcc tgcagtttaa cctgctgggg 1800

gtgcctctgg tcggggccga cgtctgcggc ttcctgggca acacctcaga ggagctgtgt 1860gtgcctctgg tcggggccga cgtctgcggc ttcctgggca acacctcaga ggagctgtgt 1860

gtgcgctgga cccagctggg ggccttctac cccttcatgc ggaaccacaa cagcctgctc 1920gtgcgctgga cccagctggg ggccttctac cccttcatgc ggaaccacaa cagcctgctc 1920

agtctgcccc aggagccgta cagcttcagc gagccggccc agcaggccat gaggaaggcc 1980agtctgcccc aggagccgta cagcttcagc gagccggccc agcaggccat gaggaaggcc 1980

ctcaccctgc gctacgcact cctcccccac ctctacacac tgttccacca ggcccacgtc 2040ctcaccctgc gctacgcact cctcccccac ctctacacac tgttccacca ggcccacgtc 2040

gcgggggaga ccgtggcccg gcccctcttc ctggagttcc ccaaggactc tagcacctgg 2100gcgggggaga ccgtggcccg gcccctcttc ctggagttcc ccaaggactc tagcacctgg 2100

actgtggacc accagctcct gtggggggag gccctgctca tcaccccagt gctccaggcc 2160actgtggacc accagctcct gtggggggag gccctgctca tcaccccagt gctccaggcc 2160

gggaaggccg aagtgactgg ctacttcccc ttgggcacat ggtacgacct gcagacggtg 2220gggaaggccg aagtgactgg ctacttcccc ttgggcacat ggtacgacct gcagacggtg 2220

ccagtagagg cccttggcag cctcccaccc ccacctgcag ctccccgtga gccagccatc 2280ccagtagagg cccttggcag cctcccaccc ccacctgcag ctccccgtga gccagccatc 2280

cacagcgagg ggcagtgggt gacgctgccg gcccccctgg acaccatcaa cgtccacctc 2340cacagcgagg ggcagtgggt gacgctgccg gcccccctgg acaccatcaa cgtccacctc 2340

cgggctgggt acatcatccc cctgcagggc cctggcctca caaccacaga gtcccgccag 2400cgggctgggt acatcatccc cctgcagggc cctggcctca caaccacaga gtcccgccag 2400

cagcccatgg ccctggctgt ggccctgacc aagggtgggg aggcccgagg ggagctgttc 2460cagcccatgg ccctggctgt ggccctgacc aagggtgggg aggcccgagg ggagctgttc 2460

tgggacgatg gagagagcct ggaagtgctg gagcgagggg cctacacaca ggtcatcttc 2520tgggacgatg gagagagcct ggaagtgctg gagcgagggg cctacacaca ggtcatcttc 2520

ctggccagga ataacacgat cgtgaatgag ctggtacgtg tgaccagtga gggagctggc 2580ctggccagga ataacacgat cgtgaatgag ctggtacgtg tgaccagtga gggagctggc 2580

ctgcagctgc agaaggtgac tgtcctgggc gtggccacgg cgccccagca ggtcctctcc 2640ctgcagctgc agaaggtgac tgtcctgggc gtggccacgg cgccccagca ggtcctctcc 2640

aacggtgtcc ctgtctccaa cttcacctac agccccgaca ccaaggtcct ggacatctgt 2700aacggtgtcc ctgtctccaa cttcacctac agccccgaca ccaaggtcct ggacatctgt 2700

gtctcgctgt tgatgggaga gcagtttctc gtcagctggt gttag 2745gtctcgctgt tgatgggaga gcagtttctc gtcagctggt gttag 2745

<210> 38<210> 38

<211> 2745<211> 2745

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco1-Δ-29<223> sp7+hGAAco1-Δ-29

<400> 38<400> 38

ggcgcctcta gacctggacc tagagatgcc caggcccacc ccggcagacc tagagctgtg 120ggcgcctcta gacctggacc tagagatgcc caggcccacc ccggcagacc tagagctgtg 120

cctacccagt gtgacgtgcc ccccaacagc agattcgact gcgcccctga caaggccatc 180cctacccagt gtgacgtgcc ccccaacagc agattcgact gcgcccctga caaggccatc 180

acccaggaac agtgcgaggc cagaggctgc tgctacatcc ctgccaagca gggactgcag 240acccaggaac agtgcgaggc cagaggctgc tgctacatcc ctgccaagca gggactgcag 240

ggcgctcaga tgggacagcc ctggtgcttc ttcccaccct cctaccccag ctacaagctg 300ggcgctcaga tgggacagcc ctggtgcttc ttcccaccct cctaccccag ctacaagctg 300

gaaaacctga gcagcagcga gatgggctac accgccaccc tgaccagaac cacccccaca 360gaaaacctga gcagcagcga gatgggctac accgccaccc tgaccagaac cacccccaca 360

ttcttcccaa aggacatcct gaccctgcgg ctggacgtga tgatggaaac cgagaaccgg 420ttcttcccaa aggacatcct gaccctgcgg ctggacgtga tgatggaaac cgagaaccgg 420

ctgcacttca ccatcaagga ccccgccaat cggagatacg aggtgcccct ggaaaccccc 480ctgcacttca ccatcaagga ccccgccaat cggagatacg aggtgcccct ggaaaccccc 480

cacgtgcact ctagagcccc cagccctctg tacagcgtgg aattcagcga ggaacccttc 540cacgtgcact ctagagcccc cagccctctg tacagcgtgg aattcagcga ggaacccttc 540

ggcgtgatcg tgcggagaca gctggatggc agagtgctgc tgaacaccac cgtggcccct 600ggcgtgatcg tgcggagaca gctggatggc agagtgctgc tgaacacccac cgtggcccct 600

ctgttcttcg ccgaccagtt cctgcagctg agcaccagcc tgcccagcca gtacatcaca 660ctgttcttcg ccgaccagtt cctgcagctg agcaccagcc tgcccagcca gtacatcaca 660

ggactggccg agcacctgag ccccctgatg ctgagcacat cctggacccg gatcaccctg 720ggactggccg agcacctgag ccccctgatg ctgagcacat cctggacccg gatcaccctg 720

tggaacaggg atctggcccc tacccctggc gccaatctgt acggcagcca ccctttctac 780tggaacaggg atctggcccc tacccctggc gccaatctgt acggcagcca ccctttctac 780

ctggccctgg aagatggcgg atctgcccac ggagtgtttc tgctgaactc caacgccatg 840ctggccctgg aagatggcgg atctgcccac ggagtgtttc tgctgaactc caacgccatg 840

gacgtggtgc tgcagcctag ccctgccctg tcttggagaa gcacaggcgg catcctggat 900gacgtggtgc tgcagcctag ccctgccctg tcttggagaa gcacaggcgg catcctggat 900

gtgtacatct ttctgggccc cgagcccaag agcgtggtgc agcagtatct ggatgtcgtg 960gtgtacatct ttctgggccc cgagcccaag agcgtggtgc agcagtatct ggatgtcgtg 960

ggctacccct tcatgccccc ttactggggc ctgggattcc acctgtgcag atggggctac 1020ggctacccct tcatgccccc ttactggggc ctgggattcc acctgtgcag atggggctac 1020

tccagcaccg ccatcaccag acaggtggtg gaaaacatga ccagagccca cttcccactg 1080tccagcaccg ccatcaccag acaggtggtg gaaaacatga ccagagccca cttcccactg 1080

gatgtgcagt ggaacgacct ggactacatg gacagcagac gggacttcac cttcaacaag 1140gatgtgcagt ggaacgacct ggactacatg gacagcagac gggacttcac cttcaacaag 1140

gacggcttcc gggacttccc cgccatggtg caggaactgc atcagggcgg cagacggtac 1200gacggcttcc gggacttccc cgccatggtg caggaactgc atcagggcgg cagacggtac 1200

atgatgatcg tggatcccgc catcagctcc tctggccctg ccggctctta cagaccctac 1260atgatgatcg tggatcccgc catcagctcc tctggccctg ccggctctta cagaccctac 1260

gacgagggcc tgcggagagg cgtgttcatc accaacgaga caggccagcc cctgatcggc 1320gacgagggcc tgcggagagg cgtgttcatc accaacgaga caggccagcc cctgatcggc 1320

aaagtgtggc ctggcagcac agccttcccc gacttcacca atcctaccgc cctggcttgg 1380aaagtgtggc ctggcagcac agccttcccc gacttcacca atcctaccgc cctggcttgg 1380

tgggaggaca tggtggccga gttccacgac caggtgccct tcgacggcat gtggatcgac 1440tgggaggaca tggtggccga gttccacgac caggtgccct tcgacggcat gtggatcgac 1440

atgaacgagc ccagcaactt catccggggc agcgaggatg gctgccccaa caacgaactg 1500atgaacgagc ccagcaactt catccggggc agcgaggatg gctgccccaa caacgaactg 1500

gaaaatcccc cttacgtgcc cggcgtcgtg ggcggaacac tgcaggccgc tacaatctgt 1560gaaaatcccc cttacgtgcc cggcgtcgtg ggcggaacac tgcaggccgc tacaatctgt 1560

gccagcagcc accagtttct gagcacccac tacaacctgc acaacctgta cggcctgacc 1620gccagcagcc accagtttct gagcacccac tacaacctgc acaacctgta cggcctgacc 1620

gaggccattg ccagccaccg cgctctcgtg aaagccagag gcacacggcc cttcgtgatc 1680gaggccattg ccagccaccg cgctctcgtg aaagccagag gcacacggcc cttcgtgatc 1680

agcagaagca cctttgccgg ccacggcaga tacgccggac attggactgg cgacgtgtgg 1740agcagaagca cctttgccgg ccacggcaga tacgccggac attggactgg cgacgtgtgg 1740

tcctcttggg agcagctggc ctctagcgtg cccgagatcc tgcagttcaa tctgctgggc 1800tcctcttggg agcagctggc ctctagcgtg cccgagatcc tgcagttcaa tctgctgggc 1800

gtgccactcg tgggcgccga tgtgtgtggc ttcctgggca acacctccga ggaactgtgt 1860gtgccactcg tgggcgccga tgtgtgtggc ttcctgggca acacctccga ggaactgtgt 1860

gtgcggtgga cacagctggg cgccttctac cctttcatga gaaaccacaa cagcctgctg 1920gtgcggtgga cacagctggg cgccttctac cctttcatga gaaaccacaa cagcctgctg 1920

agcctgcccc aggaacccta cagctttagc gagcctgcac agcaggccat gcggaaggcc 1980agcctgcccc aggaacccta cagctttagc gagcctgcac agcaggccat gcggaaggcc 1980

ctgacactga gatacgctct gctgccccac ctgtacaccc tgtttcacca ggcccatgtg 2040ctgacactga gatacgctct gctgccccac ctgtacaccc tgtttcacca ggcccatgtg 2040

gccggcgaga cagtggccag acctctgttt ctggaattcc ccaaggacag cagcacctgg 2100gccggcgaga cagtggccag acctctgttt ctggaattcc ccaaggacag cagcacctgg 2100

accgtggacc atcagctgct gtggggagag gctctgctga ttaccccagt gctgcaggca 2160accgtggacc atcagctgct gtggggagag gctctgctga ttaccccagt gctgcaggca 2160

ggcaaggccg aagtgaccgg ctactttccc ctgggcactt ggtacgacct gcagaccgtg 2220ggcaaggccg aagtgaccgg ctactttccc ctgggcactt ggtacgacct gcagaccgtg 2220

cctgtggaag ccctgggatc tctgcctcca cctcctgccg ctcctagaga gcctgccatt 2280cctgtggaag ccctgggatc tctgcctcca cctcctgccg ctcctagaga gcctgccatt 2280

cactctgagg gccagtgggt cacactgcct gcccccctgg ataccatcaa cgtgcacctg 2340cactctgagg gccagtgggt cacactgcct gcccccctgg ataccatcaa cgtgcacctg 2340

agggccggct acatcatacc actgcaggga cctggcctga ccaccaccga gtctagacag 2400agggccggct acatcatacc actgcaggga cctggcctga ccaccaccga gtctagacag 2400

cagccaatgg ccctggccgt ggccctgacc aaaggcggag aagctagggg cgagctgttc 2460cagccaatgg ccctggccgt ggccctgacc aaaggcggag aagctagggg cgagctgttc 2460

tgggacgatg gcgagagcct ggaagtgctg gaaagaggcg cctataccca agtgatcttc 2520tgggacgatg gcgagagcct ggaagtgctg gaaagaggcg cctataccca agtgatcttc 2520

ctggcccgga acaacaccat cgtgaacgag ctggtgcgcg tgacctctga aggcgctgga 2580ctggccccgga acaacaccat cgtgaacgag ctggtgcgcg tgacctctga aggcgctgga 2580

ctgcagctgc agaaagtgac cgtgctggga gtggccacag cccctcagca ggtgctgtct 2640ctgcagctgc agaaagtgac cgtgctggga gtggccacag cccctcagca ggtgctgtct 2640

aatggcgtgc ccgtgtccaa cttcacctac agccccgaca ccaaggtgct ggacatctgc 2700aatggcgtgc ccgtgtccaa cttcacctac agccccgaca ccaaggtgct ggacatctgc 2700

gtgtcactgc tgatgggaga gcagtttctg gtgtcctggt gctga 2745gtgtcactgc tgatgggaga gcagtttctg gtgtcctggt gctga 2745

<210> 39<210> 39

<211> 2745<211> 2745

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco2-Δ-29<223> sp7+hGAAco2-Δ-29

<400> 39<400> 39

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccaacag 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccaacag 60

ggagcttcca gaccaggacc gagagacgcc caagcccatc ctggtagacc aagagctgtg 120ggagcttcca gaccaggacc gagagacgcc caagcccatc ctggtagacc aagagctgtg 120

cctacccaat gcgacgtgcc acccaactcc cgattcgact gcgcgccaga taaggctatt 180cctacccaat gcgacgtgcc acccaactcc cgattcgact gcgcgccaga taaggctatt 180

acccaagagc agtgtgaagc cagaggttgc tgctacatcc cagcgaagca aggattgcaa 240acccaagagc agtgtgaagc cagaggttgc tgctacatcc cagcgaagca aggattgcaa 240

ggcgcccaaa tgggacaacc ttggtgtttc ttcccccctt cgtacccatc atataaactc 300ggcgcccaaa tgggacaacc ttggtgtttc ttcccccctt cgtacccatc atataaactc 300

gaaaacctgt cctcttcgga aatgggttat actgccaccc tcaccagaac tactcctact 360gaaaacctgt cctcttcgga aatgggttat actgccaccc tcaccagaac tactcctact 360

ttcttcccga aagacatctt gaccttgagg ctggacgtga tgatggagac tgaaaaccgg 420ttcttcccga aagacatctt gaccttgagg ctggacgtga tgatggagac tgaaaaccgg 420

ctgcatttca ctatcaaaga tcctgccaat cggcgatacg aggtccctct ggaaacccct 480ctgcatttca ctatcaaaga tcctgccaat cggcgatacg aggtccctct ggaaacccct 480

cacgtgcact cacgggctcc ttctccgctt tactccgtcg aattctctga ggaacccttc 540cacgtgcact cacgggctcc ttctccgctt tactccgtcg aattctctga ggaacccttc 540

ggagtgatcg ttagacgcca gctggatggt agagtgctgt tgaacactac tgtggcccca 600ggagtgatcg ttagacgcca gctggatggt agagtgctgt tgaacactac tgtggcccca 600

cttttcttcg ctgaccagtt tctgcaactg tccacttccc tgccatccca gtacattact 660cttttcttcg ctgaccagtt tctgcaactg tccacttccc tgccatccca gtacattact 660

ggactcgccg aacacctgtc gccactgatg ctctcgacct cttggactag aatcactttg 720ggactcgccg aacacctgtc gccactgatg ctctcgacct cttggactag aatcactttg 720

tggaacagag acttggcccc tactccggga gcaaatctgt acggaagcca ccctttttac 780tggaacagag acttggcccc tactccggga gcaaatctgt acggaagcca ccctttttac 780

ctggcgctcg aagatggcgg atccgctcac ggagtgttcc tgctgaatag caacgcaatg 840ctggcgctcg aagatggcgg atccgctcac ggagtgttcc tgctgaatag caacgcaatg 840

gacgtggtgc tgcaaccttc ccctgcactc agttggagaa gtaccggggg tattctggac 900gacgtggtgc tgcaaccttc ccctgcactc agttggagaa gtaccggggg tattctggac 900

gtgtacatct tcctcggacc agaacccaag agcgtggtgc agcaatatct ggacgtggtc 960gtgtacatct tcctcggacc agaacccaag agcgtggtgc agcaatatct ggacgtggtc 960

ggataccctt ttatgcctcc ttactgggga ctgggattcc acctttgccg ttggggctac 1020ggataccctt ttatgcctcc ttactgggga ctgggattcc acctttgccg ttggggctac 1020

tcatccaccg ccattaccag acaggtggtg gagaatatga ccagagccca cttccctctc 1080tcatccaccg ccattaccag acaggtggtg gagaatatga ccagagccca cttccctctc 1080

gacgtgcagt ggaacgatct ggactatatg gactcccgga gagatttcac cttcaacaag 1140gacgtgcagt ggaacgatct ggactatatg gactccccgga gagatttcac cttcaacaag 1140

gacgggttcc gcgattttcc cgcgatggtt caagagctcc accagggtgg tcgaagatat 1200gacgggttcc gcgattttcc cgcgatggtt caagagctcc accagggtgg tcgaagatat 1200

atgatgatcg tcgacccagc catttcgagc agcggacccg ctggatctta tagaccttac 1260atgatgatcg tcgacccagc catttcgagc agcggacccg ctggatctta tagaccttac 1260

gacgaaggcc ttaggagagg agtgttcatc acaaacgaga ctggacagcc tttgatcggt 1320gacgaaggcc ttaggagagg agtgttcatc acaaacgaga ctggacagcc tttgatcggt 1320

aaagtgtggc ctggatcaac cgcctttcct gactttacca atcccactgc cttggcttgg 1380aaagtgtggc ctggatcaac cgcctttcct gactttacca atcccactgc cttggcttgg 1380

tgggaggaca tggtggccga attccacgac caagtcccct ttgatggaat gtggatcgat 1440tgggaggaca tggtggccga attccacgac caagtcccctttgatggaat gtggatcgat 1440

atgaacgaac caagcaattt tatcagaggt tccgaagacg gttgccccaa caacgaactg 1500atgaacgaac caagcaattt tatcagaggt tccgaagacg gttgccccaa caacgaactg 1500

gaaaaccctc cttatgtgcc cggagtcgtg ggcggaacat tacaggccgc gactatttgc 1560gaaaaccctc cttatgtgcc cggagtcgtg ggcggaacat tacaggccgc gactatttgc 1560

gccagcagcc accaattcct gtccactcac tacaacctcc acaaccttta tggattaacc 1620gccagcagcc accaattcct gtccactcac tacaacctcc acaaccttta tggattaacc 1620

gaagctattg caagtcacag ggctctggtg aaggctagag ggactaggcc ctttgtgatc 1680gaagctattg caagtcacag ggctctggtg aaggctagag ggactaggcc ctttgtgatc 1680

tcccgatcca cctttgccgg acacgggaga tacgccggtc actggactgg tgacgtgtgg 1740tcccgatcca cctttgccgg acacgggaga tacgccggtc actggactgg tgacgtgtgg 1740

agctcatggg aacaactggc ctcctccgtg ccggaaatct tacagttcaa ccttctgggt 1800agctcatggg aacaactggc ctcctccgtg ccggaaatct tacagttcaa ccttctgggt 1800

gtccctcttg tcggagcaga cgtgtgtggg tttcttggta acacctccga ggaactgtgt 1860gtccctcttg tcggagcaga cgtgtgtggg tttcttggta acacctccga ggaactgtgt 1860

gtgcgctgga ctcaactggg tgcattctac ccattcatga gaaaccacaa ctccttgctg 1920gtgcgctgga ctcaactggg tgcattctac ccattcatga gaaaccacaa ctccttgctg 1920

tccctgccac aagagcccta ctcgttcagc gagcctgcac aacaggctat gcggaaggca 1980tccctgccac aagagcccta ctcgttcagc gagcctgcac aacaggctat gcggaaggca 1980

ctgaccctga gatacgccct gcttccacac ttatacactc tcttccatca agcgcatgtg 2040ctgaccctga gatacgccct gcttccacac ttatacactc tcttccatca agcgcatgtg 2040

gcaggagaaa ccgttgcaag gcctcttttc cttgaattcc ccaaggattc ctcgacttgg 2100gcaggagaaa ccgttgcaag gcctcttttc cttgaattcc ccaaggattc ctcgacttgg 2100

acggtggatc atcagctgct gtggggagaa gctctgctga ttactccagt gttgcaagcc 2160acggtggatc atcagctgct gtggggagaa gctctgctga ttactccagt gttgcaagcc 2160

ggaaaagctg aggtgaccgg atactttccg ctgggaacct ggtacgacct ccagactgtc 2220ggaaaagctg aggtgaccgg atactttccg ctgggaacct ggtacgacct ccagactgtc 2220

cctgttgaag cccttggatc actgcctccg cctccggcag ctccacgcga accagctata 2280cctgttgaag cccttggatc actgcctccg cctccggcag ctccacgcga accagctata 2280

cattccgagg gacagtgggt tacattacca gctcctctgg acacaatcaa cgtccactta 2340cattccgagg gacagtgggt tacattacca gctcctctgg acacaatcaa cgtccactta 2340

agagctggct acattatccc tctgcaagga ccaggactga ctacgaccga gagcagacag 2400agagctggct acattatccc tctgcaagga ccaggactga ctacgaccga gagcagacag 2400

cagccaatgg cactggctgt ggctctgacc aagggagggg aagctagagg agaactcttc 2460cagccaatgg cactggctgt ggctctgacc aagggagggg aagctagagg agaactcttc 2460

tgggatgatg gggagtccct tgaagtgctg gaaagaggcg cttacactca agtcattttc 2520tgggatgatg gggagtccct tgaagtgctg gaaagaggcg cttacactca agtcattttc 2520

cttgcacgga acaacaccat tgtgaacgaa ttggtgcgag tgaccagcga aggagctgga 2580cttgcacgga acaacaccat tgtgaacgaa ttggtgcgag tgaccagcga aggagctgga 2580

cttcaactgc agaaggtcac tgtgctcgga gtggctaccg ctcctcagca agtgctgtcg 2640cttcaactgc agaaggtcac tgtgctcgga gtggctaccg ctcctcagca agtgctgtcg 2640

aatggagtcc ccgtgtcaaa ctttacctac tcccctgaca ctaaggtgct cgacatttgc 2700aatggagtcc ccgtgtcaaa ctttacctac tcccctgaca ctaaggtgct cgacatttgc 2700

gtgtccctcc tgatgggaga gcagttcctt gtgtcctggt gttga 2745gtgtccctcc tgatgggaga gcagttcctt gtgtcctggt gttga 2745

<210> 40<210> 40

<211> 2706<211> 2706

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAwt-Δ-42<223> sp7+hGAAwt-Δ-42

<400> 40<400> 40

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcgcacac 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcgcacac 60

cccgggcggc cgcgagcagt gcccacacag tgcgacgtcc cccccaacag ccgcttcgat 120cccgggcggc cgcgagcagt gcccacacag tgcgacgtcc cccccaacag ccgcttcgat 120

tgcgcccctg acaaggccat cacccaggaa cagtgcgagg cccgcggctg ttgctacatc 180tgcgcccctg acaaggccat cacccaggaa cagtgcgagg cccgcggctg ttgctacatc 180

cctgcaaagc aggggctgca gggagcccag atggggcagc cctggtgctt cttcccaccc 240cctgcaaagc aggggctgca gggagcccag atggggcagc cctggtgctt cttcccaccc 240

agctacccca gctacaagct ggagaacctg agctcctctg aaatgggcta cacggccacc 300agctacccca gctacaagct ggagaacctg agctcctctg aaatgggcta cacggccacc 300

ctgacccgta ccacccccac cttcttcccc aaggacatcc tgaccctgcg gctggacgtg 360ctgacccgta ccacccccac cttcttcccc aaggacatcc tgaccctgcg gctggacgtg 360

atgatggaga ctgagaaccg cctccacttc acgatcaaag atccagctaa caggcgctac 420atgatggaga ctgagaaccg cctccacttc acgatcaaag atccagctaa caggcgctac 420

gaggtgccct tggagacccc gcatgtccac agccgggcac cgtccccact ctacagcgtg 480gaggtgccct tggagacccc gcatgtccac agccgggcac cgtccccact ctacagcgtg 480

gagttctccg aggagccctt cggggtgatc gtgcgccggc agctggacgg ccgcgtgctg 540gagttctccg aggagccctt cggggtgatc gtgcgccggc agctggacgg ccgcgtgctg 540

ctgaacacga cggtggcgcc cctgttcttt gcggaccagt tccttcagct gtccacctcg 600ctgaacacga cggtggcgcc cctgttcttt gcggaccagt tccttcagct gtccacctcg 600

ctgccctcgc agtatatcac aggcctcgcc gagcacctca gtcccctgat gctcagcacc 660ctgccctcgc agtatatcac aggcctcgcc gagcacctca gtcccctgat gctcagcacc 660

agctggacca ggatcaccct gtggaaccgg gaccttgcgc ccacgcccgg tgcgaacctc 720agctggacca ggatcaccct gtggaaccgg gaccttgcgc ccacgcccgg tgcgaacctc 720

tacgggtctc accctttcta cctggcgctg gaggacggcg ggtcggcaca cggggtgttc 780tacgggtctc accctttcta cctggcgctg gaggacggcg ggtcggcaca cggggtgttc 780

ctgctaaaca gcaatgccat ggatgtggtc ctgcagccga gccctgccct tagctggagg 840ctgctaaaca gcaatgccat ggatgtggtc ctgcagccga gccctgccct tagctggagg 840

tcgacaggtg ggatcctgga tgtctacatc ttcctgggcc cagagcccaa gagcgtggtg 900tcgacaggtg ggatcctgga tgtctacatc ttcctgggcc cagagcccaa gagcgtggtg 900

cagcagtacc tggacgttgt gggatacccg ttcatgccgc catactgggg cctgggcttc 960cagcagtacc tggacgttgt gggatacccg ttcatgccgc catactgggg cctgggcttc 960

cacctgtgcc gctggggcta ctcctccacc gctatcaccc gccaggtggt ggagaacatg 1020cacctgtgcc gctggggcta ctcctccacc gctatcaccc gccaggtggt ggagaacatg 1020

accagggccc acttccccct ggacgtccag tggaacgacc tggactacat ggactcccgg 1080accagggccc acttccccct ggacgtccag tggaacgacc tggactacat ggactcccgg 1080

agggacttca cgttcaacaa ggatggcttc cgggacttcc cggccatggt gcaggagctg 1140agggacttca cgttcaacaa ggatggcttc cgggacttcc cggccatggt gcaggagctg 1140

caccagggcg gccggcgcta catgatgatc gtggatcctg ccatcagcag ctcgggccct 1200caccagggcg gccggcgcta catgatgatc gtggatcctg ccatcagcag ctcgggccct 1200

gccgggagct acaggcccta cgacgagggt ctgcggaggg gggttttcat caccaacgag 1260gccgggagct acaggcccta cgacgagggt ctgcggaggg gggttttcat caccaacgag 1260

accggccagc cgctgattgg gaaggtatgg cccgggtcca ctgccttccc cgacttcacc 1320accggccagc cgctgattgg gaaggtatgg cccgggtcca ctgccttccc cgacttcacc 1320

aaccccacag ccctggcctg gtgggaggac atggtggctg agttccatga ccaggtgccc 1380aaccccacag ccctggcctg gtgggaggac atggtggctg agttccatga ccaggtgccc 1380

ttcgacggca tgtggattga catgaacgag ccttccaact tcatcagggg ctctgaggac 1440ttcgacggca tgtggattga catgaacgag ccttccaact tcatcagggg ctctgaggac 1440

ggctgcccca acaatgagct ggagaaccca ccctacgtgc ctggggtggt tggggggacc 1500ggctgcccca acaatgagct ggagaaccca ccctacgtgc ctggggtggt tggggggacc 1500

ctccaggcgg ccaccatctg tgcctccagc caccagtttc tctccacaca ctacaacctg 1560ctccaggcgg ccaccatctg tgcctccagc caccagtttc tctccacaca ctacaacctg 1560

cacaacctct acggcctgac cgaagccatc gcctcccaca gggcgctggt gaaggctcgg 1620cacaacctct acggcctgac cgaagccatc gcctcccaca gggcgctggt gaaggctcgg 1620

gggacacgcc catttgtgat ctcccgctcg acctttgctg gccacggccg atacgccggc 1680gggacacgcc catttgtgat ctcccgctcg acctttgctg gccacggccg atacgccggc 1680

cactggacgg gggacgtgtg gagctcctgg gagcagctcg cctcctccgt gccagaaatc 1740cactggacgg gggacgtgtg gagctcctgg gagcagctcg cctcctccgt gccagaaatc 1740

ctgcagttta acctgctggg ggtgcctctg gtcggggccg acgtctgcgg cttcctgggc 1800ctgcagttta acctgctggg ggtgcctctg gtcggggccg acgtctgcgg cttcctgggc 1800

aacacctcag aggagctgtg tgtgcgctgg acccagctgg gggccttcta ccccttcatg 1860aacacctcag aggagctgtg tgtgcgctgg acccagctgg gggccttcta ccccttcatg 1860

cggaaccaca acagcctgct cagtctgccc caggagccgt acagcttcag cgagccggcc 1920cggaaccaca acagcctgct cagtctgccc caggagccgt acagcttcag cgagccggcc 1920

cagcaggcca tgaggaaggc cctcaccctg cgctacgcac tcctccccca cctctacaca 1980cagcaggcca tgaggaaggc cctcaccctg cgctacgcac tcctccccca cctctacaca 1980

ctgttccacc aggcccacgt cgcgggggag accgtggccc ggcccctctt cctggagttc 2040ctgttccacc aggcccacgt cgcgggggag accgtggccc ggcccctctt cctggagttc 2040

cccaaggact ctagcacctg gactgtggac caccagctcc tgtgggggga ggccctgctc 2100cccaaggact ctagcacctg gactgtggac caccagctcc tgtgggggga ggccctgctc 2100

atcaccccag tgctccaggc cgggaaggcc gaagtgactg gctacttccc cttgggcaca 2160atcaccccag tgctccaggc cgggaaggcc gaagtgactg gctacttccc cttgggcaca 2160

tggtacgacc tgcagacggt gccagtagag gcccttggca gcctcccacc cccacctgca 2220tggtacgacc tgcagacggt gccagtagag gcccttggca gcctcccacc cccacctgca 2220

gctccccgtg agccagccat ccacagcgag gggcagtggg tgacgctgcc ggcccccctg 2280gctccccgtg agccagccat ccacagcgag gggcagtggg tgacgctgcc ggcccccctg 2280

gacaccatca acgtccacct ccgggctggg tacatcatcc ccctgcaggg ccctggcctc 2340gacaccatca acgtccacct ccgggctggg tacatcatcc ccctgcaggg ccctggcctc 2340

acaaccacag agtcccgcca gcagcccatg gccctggctg tggccctgac caagggtggg 2400acaaccacag agtcccgcca gcagcccatg gccctggctg tggccctgac caagggtggg 2400

gaggcccgag gggagctgtt ctgggacgat ggagagagcc tggaagtgct ggagcgaggg 2460gaggcccgag gggagctgtt ctgggacgat ggagagagcc tggaagtgct ggagcgaggg 2460

gcctacacac aggtcatctt cctggccagg aataacacga tcgtgaatga gctggtacgt 2520gcctacacac aggtcatctt cctggccagg aataacacga tcgtgaatga gctggtacgt 2520

gtgaccagtg agggagctgg cctgcagctg cagaaggtga ctgtcctggg cgtggccacg 2580gtgaccagtg agggagctgg cctgcagctg cagaaggtga ctgtcctggg cgtggccacg 2580

gcgccccagc aggtcctctc caacggtgtc cctgtctcca acttcaccta cagccccgac 2640gcgccccagc aggtcctctc caacggtgtc cctgtctcca acttcaccta cagccccgac 2640

accaaggtcc tggacatctg tgtctcgctg ttgatgggag agcagtttct cgtcagctgg 2700accaaggtcc tggacatctg tgtctcgctg ttgatggggag agcagtttct cgtcagctgg 2700

tgttag 2706tgttag 2706

<210> 41<210> 41

<211> 2706<211> 2706

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7-hGAAco2-Δ-42<223> sp7-hGAAco2-Δ-42

<400> 41<400> 41

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcgcccat 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcgcccat 60

cctggtagac caagagctgt gcctacccaa tgcgacgtgc cacccaactc ccgattcgac 120cctggtagac caagagctgt gcctacccaa tgcgacgtgc cacccaactc ccgattcgac 120

tgcgcgccag ataaggctat tacccaagag cagtgtgaag ccagaggttg ctgctacatc 180tgcgcgccag ataaggctat tacccaagag cagtgtgaag ccagaggttg ctgctacatc 180

ccagcgaagc aaggattgca aggcgcccaa atgggacaac cttggtgttt cttcccccct 240ccagcgaagc aaggattgca aggcgcccaa atgggacaac cttggtgttt cttcccccct 240

tcgtacccat catataaact cgaaaacctg tcctcttcgg aaatgggtta tactgccacc 300tcgtacccat catataaact cgaaaacctg tcctcttcgg aaatgggtta tactgccacc 300

ctcaccagaa ctactcctac tttcttcccg aaagacatct tgaccttgag gctggacgtg 360ctcaccagaa ctactcctac tttcttcccg aaagacatct tgaccttgag gctggacgtg 360

atgatggaga ctgaaaaccg gctgcatttc actatcaaag atcctgccaa tcggcgatac 420atgatggaga ctgaaaaccg gctgcatttc actatcaaag atcctgccaa tcggcgatac 420

gaggtccctc tggaaacccc tcacgtgcac tcacgggctc cttctccgct ttactccgtc 480gaggtccctc tggaaacccc tcacgtgcac tcacgggctc cttctccgct ttactccgtc 480

gaattctctg aggaaccctt cggagtgatc gttagacgcc agctggatgg tagagtgctg 540gaattctctg aggaaccctt cggagtgatc gttagacgcc agctggatgg tagagtgctg 540

ttgaacacta ctgtggcccc acttttcttc gctgaccagt ttctgcaact gtccacttcc 600ttgaacacta ctgtggcccc acttttcttc gctgaccagt ttctgcaact gtccacttcc 600

ctgccatccc agtacattac tggactcgcc gaacacctgt cgccactgat gctctcgacc 660ctgccatccc agtacattac tggactcgcc gaacacctgt cgccactgat gctctcgacc 660

tcttggacta gaatcacttt gtggaacaga gacttggccc ctactccggg agcaaatctg 720tcttggacta gaatcacttt gtggaacaga gacttggccc ctactccggg agcaaatctg 720

tacggaagcc acccttttta cctggcgctc gaagatggcg gatccgctca cggagtgttc 780tacggaagcc acccttttta cctggcgctc gaagatggcg gatccgctca cggagtgttc 780

ctgctgaata gcaacgcaat ggacgtggtg ctgcaacctt cccctgcact cagttggaga 840ctgctgaata gcaacgcaat ggacgtggtg ctgcaacctt cccctgcact cagttggaga 840

agtaccgggg gtattctgga cgtgtacatc ttcctcggac cagaacccaa gagcgtggtg 900agtaccgggg gtattctgga cgtgtacatc ttcctcggac cagaacccaa gagcgtggtg 900

cagcaatatc tggacgtggt cggataccct tttatgcctc cttactgggg actgggattc 960cagcaatatc tggacgtggt cggataccct tttatgcctc cttactgggg actgggattc 960

cacctttgcc gttggggcta ctcatccacc gccattacca gacaggtggt ggagaatatg 1020cacctttgcc gttggggcta ctcatccacc gccattacca gacaggtggt ggagaatatg 1020

accagagccc acttccctct cgacgtgcag tggaacgatc tggactatat ggactcccgg 1080accagagccc acttccctct cgacgtgcag tggaacgatc tggactatat ggactcccgg 1080

agagatttca ccttcaacaa ggacgggttc cgcgattttc ccgcgatggt tcaagagctc 1140agagatttca ccttcaacaa ggacgggttc cgcgattttc ccgcgatggt tcaagagctc 1140

caccagggtg gtcgaagata tatgatgatc gtcgacccag ccatttcgag cagcggaccc 1200caccagggtg gtcgaagata tatgatgatc gtcgacccag ccatttcgag cagcggaccc 1200

gctggatctt atagacctta cgacgaaggc cttaggagag gagtgttcat cacaaacgag 1260gctggatctt atagacctta cgacgaaggc cttaggagag gagtgttcat cacaaacgag 1260

actggacagc ctttgatcgg taaagtgtgg cctggatcaa ccgcctttcc tgactttacc 1320actggacagc ctttgatcgg taaagtgtgg cctggatcaa ccgcctttcc tgactttacc 1320

aatcccactg ccttggcttg gtgggaggac atggtggccg aattccacga ccaagtcccc 1380aatcccactg ccttggcttg gtgggaggac atggtggccg aattccacga ccaagtcccc 1380

tttgatggaa tgtggatcga tatgaacgaa ccaagcaatt ttatcagagg ttccgaagac 1440tttgatggaa tgtggatcga tatgaacgaa ccaagcaatt ttatcagagg ttccgaagac 1440

ggttgcccca acaacgaact ggaaaaccct ccttatgtgc ccggagtcgt gggcggaaca 1500ggttgcccca acaacgaact ggaaaaccct ccttatgtgc ccggagtcgt gggcggaaca 1500

ttacaggccg cgactatttg cgccagcagc caccaattcc tgtccactca ctacaacctc 1560ttacaggccg cgactatttg cgccagcagc caccaattcc tgtccactca ctacaacctc 1560

cacaaccttt atggattaac cgaagctatt gcaagtcaca gggctctggt gaaggctaga 1620cacaaccttt atggattaac cgaagctatt gcaagtcaca gggctctggt gaaggctaga 1620

gggactaggc cctttgtgat ctcccgatcc acctttgccg gacacgggag atacgccggt 1680gggactaggc cctttgtgat ctcccgatcc acctttgccg gacacgggag atacgccggt 1680

cactggactg gtgacgtgtg gagctcatgg gaacaactgg cctcctccgt gccggaaatc 1740cactggactg gtgacgtgtg gagctcatgg gaacaactgg cctcctccgt gccggaaatc 1740

ttacagttca accttctggg tgtccctctt gtcggagcag acgtgtgtgg gtttcttggt 1800ttacagttca accttctggg tgtccctctt gtcggagcag acgtgtgtgg gtttcttggt 1800

aacacctccg aggaactgtg tgtgcgctgg actcaactgg gtgcattcta cccattcatg 1860aacacctccg aggaactgtg tgtgcgctgg actcaactgg gtgcattcta cccattcatg 1860

agaaaccaca actccttgct gtccctgcca caagagccct actcgttcag cgagcctgca 1920agaaaccaca actccttgct gtccctgcca caagagccct actcgttcag cgagcctgca 1920

caacaggcta tgcggaaggc actgaccctg agatacgccc tgcttccaca cttatacact 1980caacaggcta tgcggaaggc actgaccctg agatacgccc tgcttccaca cttatacact 1980

ctcttccatc aagcgcatgt ggcaggagaa accgttgcaa ggcctctttt ccttgaattc 2040ctcttccatc aagcgcatgt ggcaggagaa accgttgcaa ggcctctttt ccttgaattc 2040

cccaaggatt cctcgacttg gacggtggat catcagctgc tgtggggaga agctctgctg 2100cccaaggatt cctcgacttg gacggtggat catcagctgc tgtggggaga agctctgctg 2100

attactccag tgttgcaagc cggaaaagct gaggtgaccg gatactttcc gctgggaacc 2160attactccag tgttgcaagc cggaaaagct gaggtgaccg gatactttcc gctgggaacc 2160

tggtacgacc tccagactgt ccctgttgaa gcccttggat cactgcctcc gcctccggca 2220tggtacgacc tccagactgt ccctgttgaa gcccttggat cactgcctcc gcctccggca 2220

gctccacgcg aaccagctat acattccgag ggacagtggg ttacattacc agctcctctg 2280gctccacgcg aaccagctat acattccgag ggacagtggg ttacattacc agctcctctg 2280

gacacaatca acgtccactt aagagctggc tacattatcc ctctgcaagg accaggactg 2340gacacaatca acgtccactt aagagctggc tacattatcc ctctgcaagg accaggactg 2340

actacgaccg agagcagaca gcagccaatg gcactggctg tggctctgac caagggaggg 2400actacgaccg agagcagaca gcagccaatg gcactggctg tggctctgac caagggaggg 2400

gaagctagag gagaactctt ctgggatgat ggggagtccc ttgaagtgct ggaaagaggc 2460gaagctagag gagaactctt ctgggatgat ggggagtccc ttgaagtgct ggaaagaggc 2460

gcttacactc aagtcatttt ccttgcacgg aacaacacca ttgtgaacga attggtgcga 2520gcttacactc aagtcatttt ccttgcacgg aacaacacca ttgtgaacga attggtgcga 2520

gtgaccagcg aaggagctgg acttcaactg cagaaggtca ctgtgctcgg agtggctacc 2580gtgaccagcg aaggagctgg acttcaactg cagaaggtca ctgtgctcgg agtggctacc 2580

gctcctcagc aagtgctgtc gaatggagtc cccgtgtcaa actttaccta ctcccctgac 2640gctcctcagc aagtgctgtc gaatggagtc cccgtgtcaa actttaccta ctcccctgac 2640

actaaggtgc tcgacatttg cgtgtccctc ctgatgggag agcagttcct tgtgtcctgg 2700actaaggtgc tcgacatttg cgtgtccctc ctgatgggag agcagttcct tgtgtcctgg 2700

tgttga 2706tgttga 2706

<210> 42<210> 42

<211> 2703<211> 2703

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7-hGAAwt-Δ-43<223> sp7-hGAAwt-Δ-43

<400> 42<400> 42

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccacccc 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccaccc 60

gggcggccgc gagcagtgcc cacacagtgc gacgtccccc ccaacagccg cttcgattgc 120gggcggccgc gagcagtgcc cacacagtgc gacgtccccc ccaacagccg cttcgattgc 120

gcccctgaca aggccatcac ccaggaacag tgcgaggccc gcggctgttg ctacatccct 180gcccctgaca aggccatcac ccaggaacag tgcgaggccc gcggctgttg ctacatccct 180

gcaaagcagg ggctgcaggg agcccagatg gggcagccct ggtgcttctt cccacccagc 240gcaaagcagg ggctgcaggg agcccagatg gggcagccct ggtgcttctt cccacccagc 240

taccccagct acaagctgga gaacctgagc tcctctgaaa tgggctacac ggccaccctg 300taccccagct acaagctgga gaacctgagc tcctctgaaa tgggctacac ggccaccctg 300

acccgtacca cccccacctt cttccccaag gacatcctga ccctgcggct ggacgtgatg 360acccgtacca cccccacctt cttccccaag gacatcctga ccctgcggct ggacgtgatg 360

atggagactg agaaccgcct ccacttcacg atcaaagatc cagctaacag gcgctacgag 420atggagactg agaaccgcct ccacttcacg atcaaagatc cagctaacag gcgctacgag 420

gtgcccttgg agaccccgca tgtccacagc cgggcaccgt ccccactcta cagcgtggag 480gtgcccttgg agaccccgca tgtccacagc cgggcaccgt ccccactcta cagcgtggag 480

ttctccgagg agcccttcgg ggtgatcgtg cgccggcagc tggacggccg cgtgctgctg 540ttctccgagg agcccttcgg ggtgatcgtg cgccggcagc tggacggccg cgtgctgctg 540

aacacgacgg tggcgcccct gttctttgcg gaccagttcc ttcagctgtc cacctcgctg 600aacacgacgg tggcgcccct gttctttgcg gaccagttcc ttcagctgtc cacctcgctg 600

ccctcgcagt atatcacagg cctcgccgag cacctcagtc ccctgatgct cagcaccagc 660ccctcgcagt atatcacagg cctcgccgag cacctcagtc ccctgatgct cagcaccagc 660

tggaccagga tcaccctgtg gaaccgggac cttgcgccca cgcccggtgc gaacctctac 720tggaccagga tcaccctgtg gaaccgggac cttgcgccca cgcccggtgc gaacctctac 720

gggtctcacc ctttctacct ggcgctggag gacggcgggt cggcacacgg ggtgttcctg 780gggtctcacc ctttctacct ggcgctggag gacggcgggt cggcacacgg ggtgttcctg 780

ctaaacagca atgccatgga tgtggtcctg cagccgagcc ctgcccttag ctggaggtcg 840ctaaacagca atgccatgga tgtggtcctg cagccgagcc ctgcccttag ctggaggtcg 840

acaggtggga tcctggatgt ctacatcttc ctgggcccag agcccaagag cgtggtgcag 900acaggtggga tcctggatgt ctacatcttc ctgggcccag agcccaagag cgtggtgcag 900

cagtacctgg acgttgtggg atacccgttc atgccgccat actggggcct gggcttccac 960cagtacctgg acgttgtggg atacccgttc atgccgccat actggggcct gggcttccac 960

ctgtgccgct ggggctactc ctccaccgct atcacccgcc aggtggtgga gaacatgacc 1020ctgtgccgct ggggctactc ctccaccgct atcacccgcc aggtggtgga gaacatgacc 1020

agggcccact tccccctgga cgtccagtgg aacgacctgg actacatgga ctcccggagg 1080agggcccact tccccctgga cgtccagtgg aacgacctgg actacatgga ctcccggagg 1080

gacttcacgt tcaacaagga tggcttccgg gacttcccgg ccatggtgca ggagctgcac 1140gacttcacgt tcaacaagga tggcttccgg gacttcccgg ccatggtgca ggagctgcac 1140

cagggcggcc ggcgctacat gatgatcgtg gatcctgcca tcagcagctc gggccctgcc 1200cagggcggcc ggcgctacat gatgatcgtg gatcctgcca tcagcagctc gggccctgcc 1200

gggagctaca ggccctacga cgagggtctg cggagggggg ttttcatcac caacgagacc 1260gggagctaca ggccctacga cgagggtctg cggagggggg ttttcatcac caacgagacc 1260

ggccagccgc tgattgggaa ggtatggccc gggtccactg ccttccccga cttcaccaac 1320ggccagccgc tgattgggaa ggtatggccc gggtccactg ccttccccga cttcaccaac 1320

cccacagccc tggcctggtg ggaggacatg gtggctgagt tccatgacca ggtgcccttc 1380cccacagccc tggcctggtg ggaggacatg gtggctgagt tccatgacca ggtgcccttc 1380

gacggcatgt ggattgacat gaacgagcct tccaacttca tcaggggctc tgaggacggc 1440gacggcatgt ggattgacat gaacgagcct tccaacttca tcaggggctc tgaggacggc 1440

tgccccaaca atgagctgga gaacccaccc tacgtgcctg gggtggttgg ggggaccctc 1500tgccccaaca atgagctgga gaacccaccc tacgtgcctg gggtggttgg ggggaccctc 1500

caggcggcca ccatctgtgc ctccagccac cagtttctct ccacacacta caacctgcac 1560caggcggcca ccatctgtgc ctccagccac cagtttctct ccacacacta caacctgcac 1560

aacctctacg gcctgaccga agccatcgcc tcccacaggg cgctggtgaa ggctcggggg 1620aacctctacg gcctgaccga agccatcgcc tcccacaggg cgctggtgaa ggctcggggg 1620

acacgcccat ttgtgatctc ccgctcgacc tttgctggcc acggccgata cgccggccac 1680acacgcccat ttgtgatctc ccgctcgacc tttgctggcc acggccgata cgccggccac 1680

tggacggggg acgtgtggag ctcctgggag cagctcgcct cctccgtgcc agaaatcctg 1740tggacggggg acgtgtggag ctcctggggag cagctcgcct cctccgtgcc agaaatcctg 1740

cagtttaacc tgctgggggt gcctctggtc ggggccgacg tctgcggctt cctgggcaac 1800cagtttaacc tgctgggggt gcctctggtc ggggccgacg tctgcggctt cctgggcaac 1800

acctcagagg agctgtgtgt gcgctggacc cagctggggg ccttctaccc cttcatgcgg 1860acctcagagg agctgtgtgt gcgctggacc cagctggggg ccttctaccc cttcatgcgg 1860

aaccacaaca gcctgctcag tctgccccag gagccgtaca gcttcagcga gccggcccag 1920aaccacaaca gcctgctcag tctgccccag gagccgtaca gcttcagcga gccggcccag 1920

caggccatga ggaaggccct caccctgcgc tacgcactcc tcccccacct ctacacactg 1980caggccatga ggaaggcct caccctgcgc tacgcactcc tcccccacct ctacacactg 1980

ttccaccagg cccacgtcgc gggggagacc gtggcccggc ccctcttcct ggagttcccc 2040ttccaccagg cccacgtcgc gggggagacc gtggcccggc ccctcttcct ggagttcccc 2040

aaggactcta gcacctggac tgtggaccac cagctcctgt ggggggaggc cctgctcatc 2100aaggactcta gcacctggac tgtggaccac cagctcctgt ggggggaggc cctgctcatc 2100

accccagtgc tccaggccgg gaaggccgaa gtgactggct acttcccctt gggcacatgg 2160accccagtgc tccaggccgg gaaggccgaa gtgactggct acttcccctt gggcacatgg 2160

tacgacctgc agacggtgcc agtagaggcc cttggcagcc tcccaccccc acctgcagct 2220tacgacctgc agacggtgcc agtagaggcc cttggcagcc tcccacccccc acctgcagct 2220

ccccgtgagc cagccatcca cagcgagggg cagtgggtga cgctgccggc ccccctggac 2280ccccgtgagc cagccatcca cagcgagggg cagtgggtga cgctgccggc ccccctggac 2280

accatcaacg tccacctccg ggctgggtac atcatccccc tgcagggccc tggcctcaca 2340accatcaacg tccacctccg ggctgggtac atcatccccc tgcagggccc tggcctcaca 2340

accacagagt cccgccagca gcccatggcc ctggctgtgg ccctgaccaa gggtggggag 2400accacagagt cccgccagca gcccatggcc ctggctgtgg ccctgaccaa gggtggggag 2400

gcccgagggg agctgttctg ggacgatgga gagagcctgg aagtgctgga gcgaggggcc 2460gcccgagggg agctgttctg ggacgatgga gagagcctgg aagtgctgga gcgaggggcc 2460

tacacacagg tcatcttcct ggccaggaat aacacgatcg tgaatgagct ggtacgtgtg 2520tacacacagg tcatcttcct ggccaggaat aacacgatcg tgaatgagct ggtacgtgtg 2520

accagtgagg gagctggcct gcagctgcag aaggtgactg tcctgggcgt ggccacggcg 2580accagtgagg gagctggcct gcagctgcag aaggtgactg tcctgggcgt ggccacggcg 2580

ccccagcagg tcctctccaa cggtgtccct gtctccaact tcacctacag ccccgacacc 2640ccccagcagg tcctctccaa cggtgtccct gtctccaact tcacctacag ccccgacacc 2640

aaggtcctgg acatctgtgt ctcgctgttg atgggagagc agtttctcgt cagctggtgt 2700aaggtcctgg acatctgtgt ctcgctgttg atgggagagc agtttctcgt cagctggtgt 2700

tag 2703tag 2703

<210> 43<210> 43

<211> 2703<211> 2703

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco1-Δ-43<223> sp7+hGAAco1-Δ-43

<400> 43<400> 43

ggcagaccta gagctgtgcc tacccagtgt gacgtgcccc ccaacagcag attcgactgc 120ggcagaccta gagctgtgcc tacccagtgt gacgtgcccc ccaacagcag attcgactgc 120

gcccctgaca aggccatcac ccaggaacag tgcgaggcca gaggctgctg ctacatccct 180gcccctgaca aggccatcac ccaggaacag tgcgaggcca gaggctgctg ctacatccct 180

gccaagcagg gactgcaggg cgctcagatg ggacagccct ggtgcttctt cccaccctcc 240gccaagcagg gactgcaggg cgctcagatg ggacagccct ggtgcttctt cccaccctcc 240

taccccagct acaagctgga aaacctgagc agcagcgaga tgggctacac cgccaccctg 300taccccagct acaagctgga aaacctgagc agcagcgaga tgggctacac cgccaccctg 300

accagaacca cccccacatt cttcccaaag gacatcctga ccctgcggct ggacgtgatg 360accagaacca cccccacatt cttcccaaag gacatcctga ccctgcggct ggacgtgatg 360

atggaaaccg agaaccggct gcacttcacc atcaaggacc ccgccaatcg gagatacgag 420atggaaaccg agaaccggct gcacttcacc atcaaggacc ccgccaatcg gagatacgag 420

gtgcccctgg aaacccccca cgtgcactct agagccccca gccctctgta cagcgtggaa 480gtgcccctgg aaacccccca cgtgcactct agagccccca gccctctgta cagcgtggaa 480

ttcagcgagg aacccttcgg cgtgatcgtg cggagacagc tggatggcag agtgctgctg 540ttcagcgagg aacccttcgg cgtgatcgtg cggagacagc tggatggcag agtgctgctg 540

aacaccaccg tggcccctct gttcttcgcc gaccagttcc tgcagctgag caccagcctg 600aacaccaccg tggcccctct gttcttcgcc gaccagttcc tgcagctgag caccagcctg 600

cccagccagt acatcacagg actggccgag cacctgagcc ccctgatgct gagcacatcc 660cccagccagt acatcacagg actggccgag cacctgagcc ccctgatgct gagcacatcc 660

tggacccgga tcaccctgtg gaacagggat ctggccccta cccctggcgc caatctgtac 720tggacccgga tcaccctgtg gaacagggat ctggccccta cccctggcgc caatctgtac 720

ggcagccacc ctttctacct ggccctggaa gatggcggat ctgcccacgg agtgtttctg 780ggcagccacc ctttctacct ggccctggaa gatggcggat ctgcccacgg agtgtttctg 780

ctgaactcca acgccatgga cgtggtgctg cagcctagcc ctgccctgtc ttggagaagc 840ctgaactcca acgccatgga cgtggtgctg cagcctagcc ctgccctgtc ttggagaagc 840

acaggcggca tcctggatgt gtacatcttt ctgggccccg agcccaagag cgtggtgcag 900acaggcggca tcctggatgt gtacatcttt ctgggccccg agcccaagag cgtggtgcag 900

cagtatctgg atgtcgtggg ctaccccttc atgccccctt actggggcct gggattccac 960cagtatctgg atgtcgtggg ctaccccttc atgccccctt actggggcct gggattccac 960

ctgtgcagat ggggctactc cagcaccgcc atcaccagac aggtggtgga aaacatgacc 1020ctgtgcagat ggggctactc cagcaccgcc atcaccagac aggtggtgga aaacatgacc 1020

agagcccact tcccactgga tgtgcagtgg aacgacctgg actacatgga cagcagacgg 1080agagcccact tcccactgga tgtgcagtgg aacgacctgg actacatgga cagcagacgg 1080

gacttcacct tcaacaagga cggcttccgg gacttccccg ccatggtgca ggaactgcat 1140gacttcacct tcaacaagga cggcttccgg gacttccccg ccatggtgca ggaactgcat 1140

cagggcggca gacggtacat gatgatcgtg gatcccgcca tcagctcctc tggccctgcc 1200cagggcggca gacggtacat gatgatcgtg gatcccgcca tcagctcctc tggccctgcc 1200

ggctcttaca gaccctacga cgagggcctg cggagaggcg tgttcatcac caacgagaca 1260ggctcttaca gaccctacga cgagggcctg cggagaggcg tgttcatcac caacgagaca 1260

ggccagcccc tgatcggcaa agtgtggcct ggcagcacag ccttccccga cttcaccaat 1320ggccagcccc tgatcggcaa agtgtggcct ggcagcacag ccttccccga cttcaccaat 1320

cctaccgccc tggcttggtg ggaggacatg gtggccgagt tccacgacca ggtgcccttc 1380cctaccgccc tggcttggtg ggaggacatg gtggccgagt tccacgacca ggtgcccttc 1380

gacggcatgt ggatcgacat gaacgagccc agcaacttca tccggggcag cgaggatggc 1440gacggcatgt ggatcgacat gaacgagccc agcaacttca tccggggcag cgaggatggc 1440

tgccccaaca acgaactgga aaatccccct tacgtgcccg gcgtcgtggg cggaacactg 1500tgccccaaca acgaactgga aaatccccct tacgtgcccg gcgtcgtggg cggaacactg 1500

caggccgcta caatctgtgc cagcagccac cagtttctga gcacccacta caacctgcac 1560caggccgcta caatctgtgc cagcagccac cagtttctga gcacccacta caacctgcac 1560

aacctgtacg gcctgaccga ggccattgcc agccaccgcg ctctcgtgaa agccagaggc 1620aacctgtacg gcctgaccga ggccattgcc agccaccgcg ctctcgtgaa agccagaggc 1620

acacggccct tcgtgatcag cagaagcacc tttgccggcc acggcagata cgccggacat 1680acacggccct tcgtgatcag cagaagcacc tttgccggcc acggcagata cgccggacat 1680

tggactggcg acgtgtggtc ctcttgggag cagctggcct ctagcgtgcc cgagatcctg 1740tggactggcg acgtgtggtc ctcttggggag cagctggcct ctagcgtgcc cgagatcctg 1740

cagttcaatc tgctgggcgt gccactcgtg ggcgccgatg tgtgtggctt cctgggcaac 1800cagttcaatc tgctgggcgt gccactcgtg ggcgccgatg tgtgtggctt cctgggcaac 1800

acctccgagg aactgtgtgt gcggtggaca cagctgggcg ccttctaccc tttcatgaga 1860acctccgagg aactgtgtgt gcggtggaca cagctgggcg ccttctaccc tttcatgaga 1860

aaccacaaca gcctgctgag cctgccccag gaaccctaca gctttagcga gcctgcacag 1920aaccacaaca gcctgctgag cctgccccag gaaccctaca gctttagcga gcctgcacag 1920

caggccatgc ggaaggccct gacactgaga tacgctctgc tgccccacct gtacaccctg 1980caggccatgc ggaaggcct gacactgaga tacgctctgc tgccccacct gtacaccctg 1980

tttcaccagg cccatgtggc cggcgagaca gtggccagac ctctgtttct ggaattcccc 2040tttcaccagg cccatgtggc cggcgagaca gtggccagac ctctgtttct ggaattcccc 2040

aaggacagca gcacctggac cgtggaccat cagctgctgt ggggagaggc tctgctgatt 2100aaggacagca gcacctggac cgtggaccat cagctgctgt ggggagaggc tctgctgatt 2100

accccagtgc tgcaggcagg caaggccgaa gtgaccggct actttcccct gggcacttgg 2160accccagtgc tgcaggcagg caaggccgaa gtgaccggct actttcccct gggcacttgg 2160

tacgacctgc agaccgtgcc tgtggaagcc ctgggatctc tgcctccacc tcctgccgct 2220tacgacctgc agaccgtgcc tgtggaagcc ctgggatctc tgcctccacc tcctgccgct 2220

cctagagagc ctgccattca ctctgagggc cagtgggtca cactgcctgc ccccctggat 2280cctagagagc ctgccattca ctctgagggc cagtgggtca cactgcctgc ccccctggat 2280

accatcaacg tgcacctgag ggccggctac atcataccac tgcagggacc tggcctgacc 2340accatcaacg tgcacctgag ggccggctac atcataccac tgcagggacc tggcctgacc 2340

accaccgagt ctagacagca gccaatggcc ctggccgtgg ccctgaccaa aggcggagaa 2400accaccgagt ctagacagca gccaatggcc ctggccgtgg ccctgaccaa aggcggagaa 2400

gctaggggcg agctgttctg ggacgatggc gagagcctgg aagtgctgga aagaggcgcc 2460gctaggggcg agctgttctg ggacgatggc gagagcctgg aagtgctgga aagaggcgcc 2460

tatacccaag tgatcttcct ggcccggaac aacaccatcg tgaacgagct ggtgcgcgtg 2520tatacccaag tgatcttcct ggcccggaac aacaccatcg tgaacgagct ggtgcgcgtg 2520

acctctgaag gcgctggact gcagctgcag aaagtgaccg tgctgggagt ggccacagcc 2580acctctgaag gcgctggact gcagctgcag aaagtgaccg tgctgggagt ggccacagcc 2580

cctcagcagg tgctgtctaa tggcgtgccc gtgtccaact tcacctacag ccccgacacc 2640cctcagcagg tgctgtctaa tggcgtgccc gtgtccaact tcacctacag ccccgacacc 2640

aaggtgctgg acatctgcgt gtcactgctg atgggagagc agtttctggt gtcctggtgc 2700aaggtgctgg acatctgcgt gtcactgctg atgggagagc agtttctggt gtcctggtgc 2700

tga 2703tga 2703

<210> 44<210> 44

<211> 2703<211> 2703

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco2-Δ-43<223> sp7+hGAAco2-Δ-43

<400> 44<400> 44

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccatcct 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccatcct 60

ggtagaccaa gagctgtgcc tacccaatgc gacgtgccac ccaactcccg attcgactgc 120ggtagaccaa gagctgtgcc tacccaatgc gacgtgccac ccaactcccg attcgactgc 120

gcgccagata aggctattac ccaagagcag tgtgaagcca gaggttgctg ctacatccca 180gcgccagata aggctattac ccaagagcag tgtgaagcca gaggttgctg ctacatccca 180

gcgaagcaag gattgcaagg cgcccaaatg ggacaacctt ggtgtttctt ccccccttcg 240gcgaagcaag gattgcaagg cgcccaaatg ggacaacctt ggtgtttctt ccccccttcg 240

tacccatcat ataaactcga aaacctgtcc tcttcggaaa tgggttatac tgccaccctc 300tacccatcat ataaactcga aaacctgtcc tcttcggaaa tgggttatac tgccaccctc 300

accagaacta ctcctacttt cttcccgaaa gacatcttga ccttgaggct ggacgtgatg 360accagaacta ctcctacttt cttcccgaaa gacatcttga ccttgaggct ggacgtgatg 360

atggagactg aaaaccggct gcatttcact atcaaagatc ctgccaatcg gcgatacgag 420atggagactg aaaaccggct gcatttcact atcaaagatc ctgccaatcg gcgatacgag 420

gtccctctgg aaacccctca cgtgcactca cgggctcctt ctccgcttta ctccgtcgaa 480gtccctctgg aaacccctca cgtgcactca cgggctcctt ctccgcttta ctccgtcgaa 480

ttctctgagg aacccttcgg agtgatcgtt agacgccagc tggatggtag agtgctgttg 540ttctctgagg aacccttcgg agtgatcgtt agacgccagc tggatggtag agtgctgttg 540

aacactactg tggccccact tttcttcgct gaccagtttc tgcaactgtc cacttccctg 600aacactactg tggccccact tttcttcgct gaccagtttc tgcaactgtc cacttccctg 600

ccatcccagt acattactgg actcgccgaa cacctgtcgc cactgatgct ctcgacctct 660ccatcccagt acattactgg actcgccgaa cacctgtcgc cactgatgct ctcgacctct 660

tggactagaa tcactttgtg gaacagagac ttggccccta ctccgggagc aaatctgtac 720tggactagaa tcactttgtg gaacagagac ttggccccta ctccgggagc aaatctgtac 720

ggaagccacc ctttttacct ggcgctcgaa gatggcggat ccgctcacgg agtgttcctg 780ggaagccacc ctttttacct ggcgctcgaa gatggcggat ccgctcacgg agtgttcctg 780

ctgaatagca acgcaatgga cgtggtgctg caaccttccc ctgcactcag ttggagaagt 840ctgaatagca acgcaatgga cgtggtgctg caaccttccc ctgcactcag ttggagaagt 840

accgggggta ttctggacgt gtacatcttc ctcggaccag aacccaagag cgtggtgcag 900accgggggta ttctggacgt gtacatcttc ctcggaccag aacccaagag cgtggtgcag 900

caatatctgg acgtggtcgg ataccctttt atgcctcctt actggggact gggattccac 960caatatctgg acgtggtcgg ataccctttt atgcctcctt actggggact gggattccac 960

ctttgccgtt ggggctactc atccaccgcc attaccagac aggtggtgga gaatatgacc 1020ctttgccgtt ggggctactc atccaccgcc attaccagac aggtggtgga gaatatgacc 1020

agagcccact tccctctcga cgtgcagtgg aacgatctgg actatatgga ctcccggaga 1080agagcccact tccctctcga cgtgcagtgg aacgatctgg actatatgga ctcccggaga 1080

gatttcacct tcaacaagga cgggttccgc gattttcccg cgatggttca agagctccac 1140gatttcacct tcaacaagga cgggttccgc gattttcccg cgatggttca agagctccac 1140

cagggtggtc gaagatatat gatgatcgtc gacccagcca tttcgagcag cggacccgct 1200cagggtggtc gaagatatat gatgatcgtc gacccagcca tttcgagcag cggacccgct 1200

ggatcttata gaccttacga cgaaggcctt aggagaggag tgttcatcac aaacgagact 1260ggatcttata gaccttacga cgaaggcctt aggagaggag tgttcatcac aaacgagact 1260

ggacagcctt tgatcggtaa agtgtggcct ggatcaaccg cctttcctga ctttaccaat 1320ggacagcctt tgatcggtaa agtgtggcct ggatcaaccg cctttcctga ctttaccaat 1320

cccactgcct tggcttggtg ggaggacatg gtggccgaat tccacgacca agtccccttt 1380cccactgcct tggcttggtg ggaggacatg gtggccgaat tccacgacca agtccccttt 1380

gatggaatgt ggatcgatat gaacgaacca agcaatttta tcagaggttc cgaagacggt 1440gatggaatgt ggatcgatat gaacgaacca agcaatttta tcagaggttc cgaagacggt 1440

tgccccaaca acgaactgga aaaccctcct tatgtgcccg gagtcgtggg cggaacatta 1500tgccccaaca acgaactgga aaaccctcct tatgtgcccg gagtcgtggg cggaacatta 1500

caggccgcga ctatttgcgc cagcagccac caattcctgt ccactcacta caacctccac 1560caggccgcga ctatttgcgc cagcagccac caattcctgt ccactcacta caacctccac 1560

aacctttatg gattaaccga agctattgca agtcacaggg ctctggtgaa ggctagaggg 1620aacctttatg gattaaccga agctattgca agtcacaggg ctctggtgaa ggctagaggg 1620

actaggccct ttgtgatctc ccgatccacc tttgccggac acgggagata cgccggtcac 1680actaggccct ttgtgatctc ccgatccacc tttgccggac acggggagata cgccggtcac 1680

tggactggtg acgtgtggag ctcatgggaa caactggcct cctccgtgcc ggaaatctta 1740tggactggtg acgtgtggag ctcatgggaa caactggcct cctccgtgcc ggaaatctta 1740

cagttcaacc ttctgggtgt ccctcttgtc ggagcagacg tgtgtgggtt tcttggtaac 1800cagttcaacc ttctgggtgt ccctcttgtc ggagcagacg tgtgtgggtt tcttggtaac 1800

acctccgagg aactgtgtgt gcgctggact caactgggtg cattctaccc attcatgaga 1860acctccgagg aactgtgtgt gcgctggact caactgggtg cattctaccc attcatgaga 1860

aaccacaact ccttgctgtc cctgccacaa gagccctact cgttcagcga gcctgcacaa 1920aaccacaact ccttgctgtc cctgccacaa gagccctact cgttcagcga gcctgcacaa 1920

caggctatgc ggaaggcact gaccctgaga tacgccctgc ttccacactt atacactctc 1980caggctatgc ggaaggcact gaccctgaga tacgccctgc ttccacactt atacactctc 1980

ttccatcaag cgcatgtggc aggagaaacc gttgcaaggc ctcttttcct tgaattcccc 2040ttccatcaag cgcatgtggc aggagaaacc gttgcaaggc ctcttttcct tgaattcccc 2040

aaggattcct cgacttggac ggtggatcat cagctgctgt ggggagaagc tctgctgatt 2100aaggattcct cgacttggac ggtggatcat cagctgctgt ggggagaagc tctgctgatt 2100

actccagtgt tgcaagccgg aaaagctgag gtgaccggat actttccgct gggaacctgg 2160actccagtgt tgcaagccgg aaaagctgag gtgaccggat actttccgct gggaacctgg 2160

tacgacctcc agactgtccc tgttgaagcc cttggatcac tgcctccgcc tccggcagct 2220tacgacctcc agactgtccc tgttgaagcc cttggatcac tgcctccgcc tccggcagct 2220

ccacgcgaac cagctataca ttccgaggga cagtgggtta cattaccagc tcctctggac 2280ccacgcgaac cagctataca ttccgaggga cagtgggtta cattaccagc tcctctggac 2280

acaatcaacg tccacttaag agctggctac attatccctc tgcaaggacc aggactgact 2340acaatcaacg tccacttaag agctggctac attatccctc tgcaaggacc aggactgact 2340

acgaccgaga gcagacagca gccaatggca ctggctgtgg ctctgaccaa gggaggggaa 2400acgaccgaga gcagacagca gccaatggca ctggctgtgg ctctgaccaa gggaggggaa 2400

gctagaggag aactcttctg ggatgatggg gagtcccttg aagtgctgga aagaggcgct 2460gctagaggag aactcttctg ggatgatggg gagtcccttg aagtgctgga aagaggcgct 2460

tacactcaag tcattttcct tgcacggaac aacaccattg tgaacgaatt ggtgcgagtg 2520tacactcaag tcattttcct tgcacggaac aacaccattg tgaacgaatt ggtgcgagtg 2520

accagcgaag gagctggact tcaactgcag aaggtcactg tgctcggagt ggctaccgct 2580accagcgaag gagctggact tcaactgcag aaggtcactg tgctcggagt ggctaccgct 2580

cctcagcaag tgctgtcgaa tggagtcccc gtgtcaaact ttacctactc ccctgacact 2640cctcagcaag tgctgtcgaa tggagtcccc gtgtcaaact ttacctactc ccctgacact 2640

aaggtgctcg acatttgcgt gtccctcctg atgggagagc agttccttgt gtcctggtgt 2700aaggtgctcg acatttgcgt gtccctcctg atgggagagc agttccttgt gtcctggtgt 2700

tga 2703tga 2703

<210> 45<210> 45

<211> 2691<211> 2691

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAwt-Δ-47<223> sp7+hGAAwt-Δ-47

<400> 45<400> 45

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcccgcga 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcccgcga 60

gcagtgccca cacagtgcga cgtccccccc aacagccgct tcgattgcgc ccctgacaag 120gcagtgccca cacagtgcga cgtccccccc aacagccgct tcgattgcgc ccctgacaag 120

gccatcaccc aggaacagtg cgaggcccgc ggctgttgct acatccctgc aaagcagggg 180gccatcaccc aggaacagtg cgaggcccgc ggctgttgct acatccctgc aaagcagggg 180

ctgcagggag cccagatggg gcagccctgg tgcttcttcc cacccagcta ccccagctac 240ctgcagggag cccagatggg gcagccctgg tgcttcttcc cacccagcta ccccagctac 240

aagctggaga acctgagctc ctctgaaatg ggctacacgg ccaccctgac ccgtaccacc 300aagctggaga acctgagctc ctctgaaatg ggctacacgg ccaccctgac ccgtaccacc 300

cccaccttct tccccaagga catcctgacc ctgcggctgg acgtgatgat ggagactgag 360cccaccttct tccccaagga catcctgacc ctgcggctgg acgtgatgat ggagactgag 360

aaccgcctcc acttcacgat caaagatcca gctaacaggc gctacgaggt gcccttggag 420aaccgcctcc acttcacgat caaagatcca gctaacaggc gctacgaggt gcccttggag 420

accccgcatg tccacagccg ggcaccgtcc ccactctaca gcgtggagtt ctccgaggag 480accccgcatg tccacagccg ggcaccgtcc ccactctaca gcgtggagtt ctccgaggag 480

cccttcgggg tgatcgtgcg ccggcagctg gacggccgcg tgctgctgaa cacgacggtg 540cccttcgggg tgatcgtgcg ccggcagctg gacggccgcg tgctgctgaa cacgacggtg 540

gcgcccctgt tctttgcgga ccagttcctt cagctgtcca cctcgctgcc ctcgcagtat 600gcgcccctgt tctttgcgga ccagttcctt cagctgtcca cctcgctgcc ctcgcagtat 600

atcacaggcc tcgccgagca cctcagtccc ctgatgctca gcaccagctg gaccaggatc 660atcacaggcc tcgccgagca cctcagtccc ctgatgctca gcaccagctg gaccaggatc 660

accctgtgga accgggacct tgcgcccacg cccggtgcga acctctacgg gtctcaccct 720accctgtgga accgggacct tgcgcccacg cccggtgcga acctctacgg gtctcaccct 720

ttctacctgg cgctggagga cggcgggtcg gcacacgggg tgttcctgct aaacagcaat 780ttctacctgg cgctggagga cggcgggtcg gcacacgggg tgttcctgct aaacagcaat 780

gccatggatg tggtcctgca gccgagccct gcccttagct ggaggtcgac aggtgggatc 840gccatggatg tggtcctgca gccgagccct gcccttagct ggaggtcgac aggtggggatc 840

ctggatgtct acatcttcct gggcccagag cccaagagcg tggtgcagca gtacctggac 900ctggatgtct acatcttcct gggcccagag cccaagagcg tggtgcagca gtacctggac 900

gttgtgggat acccgttcat gccgccatac tggggcctgg gcttccacct gtgccgctgg 960gttgtggggat acccgttcat gccgccatac tggggcctgg gcttccacct gtgccgctgg 960

ggctactcct ccaccgctat cacccgccag gtggtggaga acatgaccag ggcccacttc 1020ggctactcct ccaccgctat cacccgccag gtggtggaga acatgaccag ggcccacttc 1020

cccctggacg tccagtggaa cgacctggac tacatggact cccggaggga cttcacgttc 1080cccctggacg tccagtggaa cgacctggac tacatggact cccggaggga cttcacgttc 1080

aacaaggatg gcttccggga cttcccggcc atggtgcagg agctgcacca gggcggccgg 1140aacaaggatg gcttccggga cttcccggcc atggtgcagg agctgcacca gggcggccgg 1140

cgctacatga tgatcgtgga tcctgccatc agcagctcgg gccctgccgg gagctacagg 1200cgctacatga tgatcgtgga tcctgccatc agcagctcgg gccctgccgg gagctacagg 1200

ccctacgacg agggtctgcg gaggggggtt ttcatcacca acgagaccgg ccagccgctg 1260ccctacgacg agggtctgcg gaggggggttttcatcacca acgagaccgg ccagccgctg 1260

attgggaagg tatggcccgg gtccactgcc ttccccgact tcaccaaccc cacagccctg 1320attgggaagg tatggcccgg gtccactgcc ttccccgact tcaccaaccc cacagccctg 1320

gcctggtggg aggacatggt ggctgagttc catgaccagg tgcccttcga cggcatgtgg 1380gcctggtggg aggacatggt ggctgagttc catgaccagg tgcccttcga cggcatgtgg 1380

attgacatga acgagccttc caacttcatc aggggctctg aggacggctg ccccaacaat 1440attgacatga acgagccttc caacttcatc aggggctctg aggacggctg ccccaacaat 1440

gagctggaga acccacccta cgtgcctggg gtggttgggg ggaccctcca ggcggccacc 1500gagctggaga acccacccta cgtgcctggg gtggttgggg ggaccctcca ggcggccacc 1500

atctgtgcct ccagccacca gtttctctcc acacactaca acctgcacaa cctctacggc 1560atctgtgcct ccagccacca gtttctctcc acacactaca acctgcacaa cctctacggc 1560

ctgaccgaag ccatcgcctc ccacagggcg ctggtgaagg ctcgggggac acgcccattt 1620ctgaccgaag ccatcgcctc ccacagggcg ctggtgaagg ctcgggggac acgcccattt 1620

gtgatctccc gctcgacctt tgctggccac ggccgatacg ccggccactg gacgggggac 1680gtgatctccc gctcgacctt tgctggccac ggccgatacg ccggccactg gacgggggac 1680

gtgtggagct cctgggagca gctcgcctcc tccgtgccag aaatcctgca gtttaacctg 1740gtgtggagct cctgggagca gctcgcctcc tccgtgccag aaatcctgca gtttaacctg 1740

ctgggggtgc ctctggtcgg ggccgacgtc tgcggcttcc tgggcaacac ctcagaggag 1800ctgggggtgc ctctggtcgg ggccgacgtc tgcggcttcc tgggcaacac ctcagaggag 1800

ctgtgtgtgc gctggaccca gctgggggcc ttctacccct tcatgcggaa ccacaacagc 1860ctgtgtgtgc gctggaccca gctgggggcc ttctacccct tcatgcggaa ccacaacagc 1860

ctgctcagtc tgccccagga gccgtacagc ttcagcgagc cggcccagca ggccatgagg 1920ctgctcagtc tgccccagga gccgtacagc ttcagcgagc cggcccagca ggccatgagg 1920

aaggccctca ccctgcgcta cgcactcctc ccccacctct acacactgtt ccaccaggcc 1980aaggccctca ccctgcgcta cgcactcctc ccccacctct acacactgtt ccaccaggcc 1980

cacgtcgcgg gggagaccgt ggcccggccc ctcttcctgg agttccccaa ggactctagc 2040cacgtcgcgg gggagaccgt ggcccggccc ctcttcctgg agttccccaa ggactctagc 2040

acctggactg tggaccacca gctcctgtgg ggggaggccc tgctcatcac cccagtgctc 2100acctggactg tggaccacca gctcctgtgg ggggaggccc tgctcatcac cccagtgctc 2100

caggccggga aggccgaagt gactggctac ttccccttgg gcacatggta cgacctgcag 2160caggccggga aggccgaagt gactggctac ttccccttgg gcacatggta cgacctgcag 2160

acggtgccag tagaggccct tggcagcctc ccacccccac ctgcagctcc ccgtgagcca 2220acggtgccag tagaggccct tggcagcctc ccacccccac ctgcagctcc ccgtgagcca 2220

gccatccaca gcgaggggca gtgggtgacg ctgccggccc ccctggacac catcaacgtc 2280gccatccaca gcgaggggca gtgggtgacg ctgccggccc ccctggacac catcaacgtc 2280

cacctccggg ctgggtacat catccccctg cagggccctg gcctcacaac cacagagtcc 2340cacctccggg ctgggtacat catccccctg cagggccctg gcctcacaac cacagagtcc 2340

cgccagcagc ccatggccct ggctgtggcc ctgaccaagg gtggggaggc ccgaggggag 2400cgccagcagc ccatggccct ggctgtggcc ctgaccaagg gtggggaggc ccgaggggag 2400

ctgttctggg acgatggaga gagcctggaa gtgctggagc gaggggccta cacacaggtc 2460ctgttctggg acgatggaga gagcctggaa gtgctggagc gaggggccta cacacaggtc 2460

atcttcctgg ccaggaataa cacgatcgtg aatgagctgg tacgtgtgac cagtgaggga 2520atcttcctgg ccaggaataa cacgatcgtg aatgagctgg tacgtgtgac cagtgaggga 2520

gctggcctgc agctgcagaa ggtgactgtc ctgggcgtgg ccacggcgcc ccagcaggtc 2580gctggcctgc agctgcagaa ggtgactgtc ctgggcgtgg ccacggcgcc ccagcaggtc 2580

ctctccaacg gtgtccctgt ctccaacttc acctacagcc ccgacaccaa ggtcctggac 2640ctctccaacg gtgtccctgt ctccaacttc acctacagcc ccgacaccaa ggtcctggac 2640

atctgtgtct cgctgttgat gggagagcag tttctcgtca gctggtgtta g 2691atctgtgtct cgctgttgat gggagagcag tttctcgtca gctggtgtta g 2691

<210> 46<210> 46

<211> 2691<211> 2691

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco1-Δ-47<223> sp7+hGAAco1-Δ-47

<400> 46<400> 46

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccctaga 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggccctaga 60

gctgtgccta cccagtgtga cgtgcccccc aacagcagat tcgactgcgc ccctgacaag 120gctgtgccta cccagtgtga cgtgcccccc aacagcagat tcgactgcgc ccctgacaag 120

gccatcaccc aggaacagtg cgaggccaga ggctgctgct acatccctgc caagcaggga 180gccatcaccc aggaacagtg cgaggccaga ggctgctgct acatccctgc caagcaggga 180

ctgcagggcg ctcagatggg acagccctgg tgcttcttcc caccctccta ccccagctac 240ctgcagggcg ctcagatggg acagccctgg tgcttcttcc caccctccta ccccagctac 240

aagctggaaa acctgagcag cagcgagatg ggctacaccg ccaccctgac cagaaccacc 300aagctggaaa acctgagcag cagcgagatg ggctacaccg ccaccctgac cagaaccacc 300

cccacattct tcccaaagga catcctgacc ctgcggctgg acgtgatgat ggaaaccgag 360cccacattct tcccaaagga catcctgacc ctgcggctgg acgtgatgat ggaaaccgag 360

aaccggctgc acttcaccat caaggacccc gccaatcgga gatacgaggt gcccctggaa 420aaccggctgc acttcaccat caaggacccc gccaatcgga gatacgaggt gcccctggaa 420

accccccacg tgcactctag agcccccagc cctctgtaca gcgtggaatt cagcgaggaa 480accccccacg tgcactctag agcccccagc cctctgtaca gcgtggaatt cagcgaggaa 480

cccttcggcg tgatcgtgcg gagacagctg gatggcagag tgctgctgaa caccaccgtg 540cccttcggcg tgatcgtgcg gagacagctg gatggcagag tgctgctgaa caccaccgtg 540

gcccctctgt tcttcgccga ccagttcctg cagctgagca ccagcctgcc cagccagtac 600gcccctctgt tcttcgccga ccagttcctg cagctgagca ccagcctgcc cagccagtac 600

atcacaggac tggccgagca cctgagcccc ctgatgctga gcacatcctg gacccggatc 660atcacaggac tggccgagca cctgagcccc ctgatgctga gcacatcctg gacccggatc 660

accctgtgga acagggatct ggcccctacc cctggcgcca atctgtacgg cagccaccct 720accctgtgga acagggatct ggcccctacc cctggcgcca atctgtacgg cagccaccct 720

ttctacctgg ccctggaaga tggcggatct gcccacggag tgtttctgct gaactccaac 780ttctacctgg ccctggaaga tggcggatct gcccacggag tgtttctgct gaactccaac 780

gccatggacg tggtgctgca gcctagccct gccctgtctt ggagaagcac aggcggcatc 840gccatggacg tggtgctgca gcctagccct gccctgtctt ggagaagcac aggcggcatc 840

ctggatgtgt acatctttct gggccccgag cccaagagcg tggtgcagca gtatctggat 900ctggatgtgt acatctttct gggccccgag cccaagagcg tggtgcagca gtatctggat 900

gtcgtgggct accccttcat gcccccttac tggggcctgg gattccacct gtgcagatgg 960gtcgtgggct accccttcat gcccccttac tggggcctgg gattccacct gtgcagatgg 960

ggctactcca gcaccgccat caccagacag gtggtggaaa acatgaccag agcccacttc 1020ggctactcca gcaccgccat caccagacag gtggtggaaa acatgaccag agcccacttc 1020

ccactggatg tgcagtggaa cgacctggac tacatggaca gcagacggga cttcaccttc 1080ccactggatg tgcagtggaa cgacctggac tacatggaca gcagacggga cttcaccttc 1080

aacaaggacg gcttccggga cttccccgcc atggtgcagg aactgcatca gggcggcaga 1140aacaaggacg gcttccggga cttccccgcc atggtgcagg aactgcatca gggcggcaga 1140

cggtacatga tgatcgtgga tcccgccatc agctcctctg gccctgccgg ctcttacaga 1200cggtacatga tgatcgtgga tcccgccatc agctcctctg gccctgccgg ctcttacaga 1200

ccctacgacg agggcctgcg gagaggcgtg ttcatcacca acgagacagg ccagcccctg 1260ccctacgacg agggcctgcg gagaggcgtg ttcatcacca acgagacagg ccagcccctg 1260

atcggcaaag tgtggcctgg cagcacagcc ttccccgact tcaccaatcc taccgccctg 1320atcggcaaag tgtggcctgg cagcacagcc ttccccgact tcaccaatcc taccgccctg 1320

gcttggtggg aggacatggt ggccgagttc cacgaccagg tgcccttcga cggcatgtgg 1380gcttggtggg aggacatggt ggccgagttc cacgaccagg tgcccttcga cggcatgtgg 1380

atcgacatga acgagcccag caacttcatc cggggcagcg aggatggctg ccccaacaac 1440atcgacatga acgagcccag caacttcatc cggggcagcg aggatggctg ccccaacaac 1440

gaactggaaa atccccctta cgtgcccggc gtcgtgggcg gaacactgca ggccgctaca 1500gaactggaaa atccccctta cgtgcccggc gtcgtgggcg gaacactgca ggccgctaca 1500

atctgtgcca gcagccacca gtttctgagc acccactaca acctgcacaa cctgtacggc 1560atctgtgcca gcagccacca gtttctgagc accccactaca acctgcacaa cctgtacggc 1560

ctgaccgagg ccattgccag ccaccgcgct ctcgtgaaag ccagaggcac acggcccttc 1620ctgaccgagg ccattgccag ccaccgcgct ctcgtgaaag ccagaggcac acggcccttc 1620

gtgatcagca gaagcacctt tgccggccac ggcagatacg ccggacattg gactggcgac 1680gtgatcagca gaagcacctt tgccggccac ggcagatacg ccggacattg gactggcgac 1680

gtgtggtcct cttgggagca gctggcctct agcgtgcccg agatcctgca gttcaatctg 1740gtgtggtcct cttgggagca gctggcctct agcgtgcccg agatcctgca gttcaatctg 1740

ctgggcgtgc cactcgtggg cgccgatgtg tgtggcttcc tgggcaacac ctccgaggaa 1800ctgggcgtgc cactcgtggg cgccgatgtg tgtggcttcc tgggcaacac ctccgaggaa 1800

ctgtgtgtgc ggtggacaca gctgggcgcc ttctaccctt tcatgagaaa ccacaacagc 1860ctgtgtgtgc ggtggacaca gctgggcgcc ttctaccctt tcatgagaaa ccacaacagc 1860

ctgctgagcc tgccccagga accctacagc tttagcgagc ctgcacagca ggccatgcgg 1920ctgctgagcc tgccccagga accctacagc tttagcgagc ctgcacagca ggccatgcgg 1920

aaggccctga cactgagata cgctctgctg ccccacctgt acaccctgtt tcaccaggcc 1980aaggccctga cactgagata cgctctgctg ccccacctgt acaccctgtt tcaccaggcc 1980

catgtggccg gcgagacagt ggccagacct ctgtttctgg aattccccaa ggacagcagc 2040catgtggccg gcgagacagt ggccagacct ctgtttctgg aattccccaa ggacagcagc 2040

acctggaccg tggaccatca gctgctgtgg ggagaggctc tgctgattac cccagtgctg 2100acctggaccg tggaccatca gctgctgtgg ggagaggctc tgctgattac cccagtgctg 2100

caggcaggca aggccgaagt gaccggctac tttcccctgg gcacttggta cgacctgcag 2160caggcaggca aggccgaagt gaccggctac tttcccctgg gcacttggta cgacctgcag 2160

accgtgcctg tggaagccct gggatctctg cctccacctc ctgccgctcc tagagagcct 2220accgtgcctg tggaagccct gggatctctg cctccacctc ctgccgctcc tagagagcct 2220

gccattcact ctgagggcca gtgggtcaca ctgcctgccc ccctggatac catcaacgtg 2280gccattcact ctgagggcca gtgggtcaca ctgcctgccc ccctggatac catcaacgtg 2280

cacctgaggg ccggctacat cataccactg cagggacctg gcctgaccac caccgagtct 2340cacctgaggg ccggctacat cataccactg cagggacctg gcctgaccac caccgagtct 2340

agacagcagc caatggccct ggccgtggcc ctgaccaaag gcggagaagc taggggcgag 2400agacagcagc caatggccct ggccgtggcc ctgaccaaag gcggagaagc taggggcgag 2400

ctgttctggg acgatggcga gagcctggaa gtgctggaaa gaggcgccta tacccaagtg 2460ctgttctggg acgatggcga gagcctggaa gtgctggaaa gaggcgccta tacccaagtg 2460

atcttcctgg cccggaacaa caccatcgtg aacgagctgg tgcgcgtgac ctctgaaggc 2520atcttcctgg cccggaacaa caccatcgtg aacgagctgg tgcgcgtgac ctctgaaggc 2520

gctggactgc agctgcagaa agtgaccgtg ctgggagtgg ccacagcccc tcagcaggtg 2580gctggactgc agctgcagaa agtgaccgtg ctggggagtgg ccacagcccc tcagcaggtg 2580

ctgtctaatg gcgtgcccgt gtccaacttc acctacagcc ccgacaccaa ggtgctggac 2640ctgtctaatg gcgtgcccgt gtccaacttc acctacagcc ccgacaccaa ggtgctggac 2640

atctgcgtgt cactgctgat gggagagcag tttctggtgt cctggtgctg a 2691atctgcgtgt cactgctgat gggagagcag tttctggtgt cctggtgctg a 2691

<210> 47<210> 47

<211> 2691<211> 2691

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> sp7+hGAAco2-Δ-47<223> sp7+hGAAco2-Δ-47

<400> 47<400> 47

atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcccaaga 60atggcctttc tgtggctgct gagctgttgg gccctgctgg gcaccacctt cggcccaaga 60

gctgtgccta cccaatgcga cgtgccaccc aactcccgat tcgactgcgc gccagataag 120gctgtgccta cccaatgcga cgtgccaccc aactcccgat tcgactgcgc gccagataag 120

gctattaccc aagagcagtg tgaagccaga ggttgctgct acatcccagc gaagcaagga 180gctattaccc aagagcagtg tgaagccaga ggttgctgct acatcccagc gaagcaagga 180

ttgcaaggcg cccaaatggg acaaccttgg tgtttcttcc ccccttcgta cccatcatat 240ttgcaaggcg cccaaatggg acaaccttgg tgtttcttcc ccccttcgta cccatcatat 240

aaactcgaaa acctgtcctc ttcggaaatg ggttatactg ccaccctcac cagaactact 300aaactcgaaa acctgtcctc ttcggaaatg ggttatactg ccaccctcac cagaactact 300

cctactttct tcccgaaaga catcttgacc ttgaggctgg acgtgatgat ggagactgaa 360cctactttct tcccgaaaga catcttgacc ttgaggctgg acgtgatgat ggagactgaa 360

aaccggctgc atttcactat caaagatcct gccaatcggc gatacgaggt ccctctggaa 420aaccggctgc atttcactat caaagatcct gccaatcggc gatacgaggt ccctctggaa 420

acccctcacg tgcactcacg ggctccttct ccgctttact ccgtcgaatt ctctgaggaa 480acccctcacg tgcactcacg ggctccttct ccgctttact ccgtcgaatt ctctgaggaa 480

cccttcggag tgatcgttag acgccagctg gatggtagag tgctgttgaa cactactgtg 540cccttcggag tgatcgttag acgccagctg gatggtagag tgctgttgaa cactactgtg 540

gccccacttt tcttcgctga ccagtttctg caactgtcca cttccctgcc atcccagtac 600gccccacttt tcttcgctga ccagtttctg caactgtcca cttccctgcc atcccagtac 600

attactggac tcgccgaaca cctgtcgcca ctgatgctct cgacctcttg gactagaatc 660attactggac tcgccgaaca cctgtcgcca ctgatgctct cgacctcttg gactagaatc 660

actttgtgga acagagactt ggcccctact ccgggagcaa atctgtacgg aagccaccct 720actttgtgga acagagactt ggcccctact ccggggagcaa atctgtacgg aagccaccct 720

ttttacctgg cgctcgaaga tggcggatcc gctcacggag tgttcctgct gaatagcaac 780ttttacctgg cgctcgaaga tggcggatcc gctcacggag tgttcctgct gaatagcaac 780

gcaatggacg tggtgctgca accttcccct gcactcagtt ggagaagtac cgggggtatt 840gcaatggacg tggtgctgca accttcccct gcactcagtt ggagaagtac cgggggtatt 840

ctggacgtgt acatcttcct cggaccagaa cccaagagcg tggtgcagca atatctggac 900ctggacgtgt acatcttcct cggaccagaa cccaagagcg tggtgcagca atatctggac 900

gtggtcggat acccttttat gcctccttac tggggactgg gattccacct ttgccgttgg 960gtggtcggat acccttttat gcctccttac tggggactgg gattccacct ttgccgttgg 960

ggctactcat ccaccgccat taccagacag gtggtggaga atatgaccag agcccacttc 1020ggctactcat ccaccgccat taccagacag gtggtggaga atatgaccag agcccacttc 1020

cctctcgacg tgcagtggaa cgatctggac tatatggact cccggagaga tttcaccttc 1080cctctcgacg tgcagtggaa cgatctggac tatatggact cccggagaga tttcaccttc 1080

aacaaggacg ggttccgcga ttttcccgcg atggttcaag agctccacca gggtggtcga 1140aacaaggacg ggttccgcga ttttcccgcg atggttcaag agctccacca gggtggtcga 1140

agatatatga tgatcgtcga cccagccatt tcgagcagcg gacccgctgg atcttataga 1200agatatatga tgatcgtcga cccagccatt tcgagcagcg gacccgctgg atctttataga 1200

ccttacgacg aaggccttag gagaggagtg ttcatcacaa acgagactgg acagcctttg 1260ccttacgacg aaggccttag gagaggagtg ttcatcacaa acgagactgg acagcctttg 1260

atcggtaaag tgtggcctgg atcaaccgcc tttcctgact ttaccaatcc cactgccttg 1320atcggtaaag tgtggcctgg atcaaccgcc tttcctgact ttaccaatcc cactgccttg 1320

gcttggtggg aggacatggt ggccgaattc cacgaccaag tcccctttga tggaatgtgg 1380gcttggtggg aggacatggt ggccgaattc cacgaccaag tcccctttga tggaatgtgg 1380

atcgatatga acgaaccaag caattttatc agaggttccg aagacggttg ccccaacaac 1440atcgatatga acgaaccaag caattttatc agaggttccg aagacggttg ccccaacaac 1440

gaactggaaa accctcctta tgtgcccgga gtcgtgggcg gaacattaca ggccgcgact 1500gaactggaaa accctcctta tgtgccccgga gtcgtgggcg gaacattaca ggccgcgact 1500

atttgcgcca gcagccacca attcctgtcc actcactaca acctccacaa cctttatgga 1560atttgcgcca gcagccacca attcctgtcc actcactaca acctccacaa cctttatgga 1560

ttaaccgaag ctattgcaag tcacagggct ctggtgaagg ctagagggac taggcccttt 1620ttaaccgaag ctattgcaag tcacagggct ctggtgaagg ctagagggac taggcccttt 1620

gtgatctccc gatccacctt tgccggacac gggagatacg ccggtcactg gactggtgac 1680gtgatctccc gatccacctt tgccggacac gggagatacg ccggtcactg gactggtgac 1680

gtgtggagct catgggaaca actggcctcc tccgtgccgg aaatcttaca gttcaacctt 1740gtgtggagct catgggaaca actggcctcc tccgtgccgg aaatcttaca gttcaacctt 1740

ctgggtgtcc ctcttgtcgg agcagacgtg tgtgggtttc ttggtaacac ctccgaggaa 1800ctgggtgtcc ctcttgtcgg agcagacgtg tgtgggtttc ttggtaacac ctccgaggaa 1800

ctgtgtgtgc gctggactca actgggtgca ttctacccat tcatgagaaa ccacaactcc 1860ctgtgtgtgc gctggactca actgggtgca ttctacccat tcatgagaaa ccacaactcc 1860

ttgctgtccc tgccacaaga gccctactcg ttcagcgagc ctgcacaaca ggctatgcgg 1920ttgctgtccc tgccacaaga gccctactcg ttcagcgagc ctgcacaaca ggctatgcgg 1920

aaggcactga ccctgagata cgccctgctt ccacacttat acactctctt ccatcaagcg 1980aaggcactga ccctgagata cgccctgctt ccacacttat acactctctt ccatcaagcg 1980

catgtggcag gagaaaccgt tgcaaggcct cttttccttg aattccccaa ggattcctcg 2040catgtggcag gagaaaccgt tgcaaggcct cttttccttg aattccccaa ggattcctcg 2040

acttggacgg tggatcatca gctgctgtgg ggagaagctc tgctgattac tccagtgttg 2100acttggacgg tggatcatca gctgctgtgg ggagaagctc tgctgattac tccagtgttg 2100

caagccggaa aagctgaggt gaccggatac tttccgctgg gaacctggta cgacctccag 2160caagccggaa aagctgaggt gaccggatac tttccgctgg gaacctggta cgacctccag 2160

actgtccctg ttgaagccct tggatcactg cctccgcctc cggcagctcc acgcgaacca 2220actgtccctg ttgaagccct tggatcactg cctccgcctc cggcagctcc acgcgaacca 2220

gctatacatt ccgagggaca gtgggttaca ttaccagctc ctctggacac aatcaacgtc 2280gctatacatt ccgagggaca gtgggttaca ttaccagctc ctctggacac aatcaacgtc 2280

cacttaagag ctggctacat tatccctctg caaggaccag gactgactac gaccgagagc 2340cacttaagag ctggctacat tatccctctg caaggaccag gactgactac gaccgagagc 2340

agacagcagc caatggcact ggctgtggct ctgaccaagg gaggggaagc tagaggagaa 2400agacagcagc caatggcact ggctgtggct ctgaccaagg gaggggaagc tagaggagaa 2400

ctcttctggg atgatgggga gtcccttgaa gtgctggaaa gaggcgctta cactcaagtc 2460ctcttctggg atgatgggga gtcccttgaa gtgctggaaa gaggcgctta cactcaagtc 2460

attttccttg cacggaacaa caccattgtg aacgaattgg tgcgagtgac cagcgaagga 2520attttccttg cacggaacaa caccattgtg aacgaattgg tgcgagtgac cagcgaagga 2520

gctggacttc aactgcagaa ggtcactgtg ctcggagtgg ctaccgctcc tcagcaagtg 2580gctggacttc aactgcagaa ggtcactgtg ctcggagtgg ctaccgctcc tcagcaagtg 2580

ctgtcgaatg gagtccccgt gtcaaacttt acctactccc ctgacactaa ggtgctcgac 2640ctgtcgaatg gagtccccgt gtcaaacttt acctactccc ctgacactaa ggtgctcgac 2640

atttgcgtgt ccctcctgat gggagagcag ttccttgtgt cctggtgttg a 2691atttgcgtgt ccctcctgat gggagagcag ttccttgtgt cctggtgttg a 2691

<210> 48<210> 48

<211> 2652<211> 2652

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco1-Δ-42<223> hGAAco1-Δ-42

<400> 48<400> 48

gcccaccccg gcagacctag agctgtgcct acccagtgtg acgtgccccc caacagcaga 60gcccaccccg gcagacctag agctgtgcct acccagtgtg acgtgccccc caacagcaga 60

ttcgactgcg cccctgacaa ggccatcacc caggaacagt gcgaggccag aggctgctgc 120ttcgactgcg cccctgacaa ggccatcacc caggaacagt gcgaggccag aggctgctgc 120

tacatccctg ccaagcaggg actgcagggc gctcagatgg gacagccctg gtgcttcttc 180tacatccctg ccaagcaggg actgcagggc gctcagatgg gacagccctg gtgcttcttc 180

ccaccctcct accccagcta caagctggaa aacctgagca gcagcgagat gggctacacc 240ccaccctcct accccagcta caagctggaa aacctgagca gcagcgagat gggctacacc 240

gccaccctga ccagaaccac ccccacattc ttcccaaagg acatcctgac cctgcggctg 300gccaccctga ccagaaccac ccccacattc ttcccaaagg acatcctgac cctgcggctg 300

gacgtgatga tggaaaccga gaaccggctg cacttcacca tcaaggaccc cgccaatcgg 360gacgtgatga tggaaaccga gaaccggctg cacttcacca tcaaggaccc cgccaatcgg 360

agatacgagg tgcccctgga aaccccccac gtgcactcta gagcccccag ccctctgtac 420agatacgagg tgcccctgga aaccccccac gtgcactcta gagcccccag ccctctgtac 420

agcgtggaat tcagcgagga acccttcggc gtgatcgtgc ggagacagct ggatggcaga 480agcgtggaat tcagcgagga acccttcggc gtgatcgtgc ggagacagct ggatggcaga 480

gtgctgctga acaccaccgt ggcccctctg ttcttcgccg accagttcct gcagctgagc 540gtgctgctga acaccaccgt ggcccctctg ttcttcgccg accagttcct gcagctgagc 540

accagcctgc ccagccagta catcacagga ctggccgagc acctgagccc cctgatgctg 600accagcctgc ccagccagta catcacagga ctggccgagc acctgagccc cctgatgctg 600

agcacatcct ggacccggat caccctgtgg aacagggatc tggcccctac ccctggcgcc 660agcacatcct ggacccggat caccctgtgg aacagggatc tggcccctac ccctggcgcc 660

aatctgtacg gcagccaccc tttctacctg gccctggaag atggcggatc tgcccacgga 720aatctgtacg gcagccaccc tttctacctg gccctggaag atggcggatc tgcccacgga 720

gtgtttctgc tgaactccaa cgccatggac gtggtgctgc agcctagccc tgccctgtct 780gtgtttctgc tgaactccaa cgccatggac gtggtgctgc agcctagccc tgccctgtct 780

tggagaagca caggcggcat cctggatgtg tacatctttc tgggccccga gcccaagagc 840tggagaagca caggcggcat cctggatgtg tacatctttc tgggccccga gcccaagagc 840

gtggtgcagc agtatctgga tgtcgtgggc taccccttca tgccccctta ctggggcctg 900gtggtgcagc agtatctgga tgtcgtgggc taccccttca tgcccccctta ctggggcctg 900

ggattccacc tgtgcagatg gggctactcc agcaccgcca tcaccagaca ggtggtggaa 960ggattccacc tgtgcagatg gggctactcc agcaccgcca tcaccagaca ggtggtggaa 960

aacatgacca gagcccactt cccactggat gtgcagtgga acgacctgga ctacatggac 1020aacatgacca gagcccactt cccactggat gtgcagtgga acgacctgga ctacatggac 1020

agcagacggg acttcacctt caacaaggac ggcttccggg acttccccgc catggtgcag 1080agcagacggg acttcacctt caacaaggac ggcttccggg acttccccgc catggtgcag 1080

gaactgcatc agggcggcag acggtacatg atgatcgtgg atcccgccat cagctcctct 1140gaactgcatc agggcggcag acggtacatg atgatcgtgg atcccgccat cagctcctct 1140

ggccctgccg gctcttacag accctacgac gagggcctgc ggagaggcgt gttcatcacc 1200ggccctgccg gctcttacag accctacgac gagggcctgc ggagaggcgt gttcatcacc 1200

aacgagacag gccagcccct gatcggcaaa gtgtggcctg gcagcacagc cttccccgac 1260aacgagacag gccagcccct gatcggcaaa gtgtggcctg gcagcacagc cttccccgac 1260

ttcaccaatc ctaccgccct ggcttggtgg gaggacatgg tggccgagtt ccacgaccag 1320ttcaccaatc ctaccgccct ggcttggtgg gaggacatgg tggccgagtt ccacgaccag 1320

gtgcccttcg acggcatgtg gatcgacatg aacgagccca gcaacttcat ccggggcagc 1380gtgcccttcg acggcatgtg gatcgacatg aacgagccca gcaacttcat ccggggcagc 1380

gaggatggct gccccaacaa cgaactggaa aatccccctt acgtgcccgg cgtcgtgggc 1440gaggatggct gccccaacaa cgaactggaa aatccccctt acgtgcccgg cgtcgtgggc 1440

ggaacactgc aggccgctac aatctgtgcc agcagccacc agtttctgag cacccactac 1500ggaacactgc aggccgctac aatctgtgcc agcagccacc agtttctgag cacccactac 1500

aacctgcaca acctgtacgg cctgaccgag gccattgcca gccaccgcgc tctcgtgaaa 1560aacctgcaca acctgtacgg cctgaccgag gccattgcca gccaccgcgc tctcgtgaaa 1560

gccagaggca cacggccctt cgtgatcagc agaagcacct ttgccggcca cggcagatac 1620gccagaggca cacggccctt cgtgatcagc agaagcacct ttgccggcca cggcagatac 1620

gccggacatt ggactggcga cgtgtggtcc tcttgggagc agctggcctc tagcgtgccc 1680gccggacatt ggactggcga cgtgtggtcc tcttggggagc agctggcctc tagcgtgccc 1680

gagatcctgc agttcaatct gctgggcgtg ccactcgtgg gcgccgatgt gtgtggcttc 1740gagatcctgc agttcaatct gctgggcgtg ccactcgtgg gcgccgatgt gtgtggcttc 1740

ctgggcaaca cctccgagga actgtgtgtg cggtggacac agctgggcgc cttctaccct 1800ctgggcaaca cctccgagga actgtgtgtg cggtggacac agctgggcgc cttctaccct 1800

ttcatgagaa accacaacag cctgctgagc ctgccccagg aaccctacag ctttagcgag 1860ttcatgagaa accacaacag cctgctgagc ctgccccagg aaccctacag ctttagcgag 1860

cctgcacagc aggccatgcg gaaggccctg acactgagat acgctctgct gccccacctg 1920cctgcacagc aggccatgcg gaaggccctg acactgagat acgctctgct gccccacctg 1920

tacaccctgt ttcaccaggc ccatgtggcc ggcgagacag tggccagacc tctgtttctg 1980tacaccctgt ttcaccaggc ccatgtggcc ggcgagacag tggccagacc tctgtttctg 1980

gaattcccca aggacagcag cacctggacc gtggaccatc agctgctgtg gggagaggct 2040gaattcccca aggacagcag cacctggacc gtggaccatc agctgctgtg gggagaggct 2040

ctgctgatta ccccagtgct gcaggcaggc aaggccgaag tgaccggcta ctttcccctg 2100ctgctgatta ccccagtgct gcaggcaggc aaggccgaag tgaccggcta ctttcccctg 2100

ggcacttggt acgacctgca gaccgtgcct gtggaagccc tgggatctct gcctccacct 2160ggcacttggt acgacctgca gaccgtgcct gtggaagccc tgggatctct gcctccacct 2160

cctgccgctc ctagagagcc tgccattcac tctgagggcc agtgggtcac actgcctgcc 2220cctgccgctc ctagagagcc tgccattcac tctgagggcc agtgggtcac actgcctgcc 2220

cccctggata ccatcaacgt gcacctgagg gccggctaca tcataccact gcagggacct 2280cccctggata ccatcaacgt gcacctgagg gccggctaca tcataccact gcagggacct 2280

ggcctgacca ccaccgagtc tagacagcag ccaatggccc tggccgtggc cctgaccaaa 2340ggcctgacca ccaccgagtc tagacagcag ccaatggccc tggccgtggc cctgaccaaa 2340

ggcggagaag ctaggggcga gctgttctgg gacgatggcg agagcctgga agtgctggaa 2400ggcggagaag ctaggggcga gctgttctgg gacgatggcg agagcctgga agtgctggaa 2400

agaggcgcct atacccaagt gatcttcctg gcccggaaca acaccatcgt gaacgagctg 2460agaggcgcct atacccaagt gatcttcctg gcccggaaca acaccatcgt gaacgagctg 2460

gtgcgcgtga cctctgaagg cgctggactg cagctgcaga aagtgaccgt gctgggagtg 2520gtgcgcgtga cctctgaagg cgctggactg cagctgcaga aagtgaccgt gctggggagtg 2520

gccacagccc ctcagcaggt gctgtctaat ggcgtgcccg tgtccaactt cacctacagc 2580gccacagccc ctcagcaggt gctgtctaat ggcgtgcccg tgtccaactt cacctacagc 2580

cccgacacca aggtgctgga catctgcgtg tcactgctga tgggagagca gtttctggtg 2640cccgacacca aggtgctgga catctgcgtg tcactgctga tgggagagca gtttctggtg 2640

tcctggtgct ga 2652tcctggtgct ga 2652

<210> 49<210> 49

<211> 2652<211> 2652

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco2-Δ-42<223> hGAAco2-Δ-42

<400> 49<400> 49

gcccatcctg gtagaccaag agctgtgcct acccaatgcg acgtgccacc caactcccga 60gcccatcctg gtagaccaag agctgtgcct acccaatgcg acgtgccacc caactcccga 60

ttcgactgcg cgccagataa ggctattacc caagagcagt gtgaagccag aggttgctgc 120ttcgactgcg cgccagataa ggctattacc caagagcagt gtgaagccag aggttgctgc 120

tacatcccag cgaagcaagg attgcaaggc gcccaaatgg gacaaccttg gtgtttcttc 180tacatcccag cgaagcaagg attgcaaggc gcccaaatgg gacaaccttg gtgtttcttc 180

cccccttcgt acccatcata taaactcgaa aacctgtcct cttcggaaat gggttatact 240cccccttcgt acccatcata taaactcgaa aacctgtcct cttcggaaat gggttatact 240

gccaccctca ccagaactac tcctactttc ttcccgaaag acatcttgac cttgaggctg 300gccaccctca ccagaactac tcctactttc ttcccgaaag acatcttgac cttgaggctg 300

gacgtgatga tggagactga aaaccggctg catttcacta tcaaagatcc tgccaatcgg 360gacgtgatga tggagactga aaaccggctg catttcacta tcaaagatcc tgccaatcgg 360

cgatacgagg tccctctgga aacccctcac gtgcactcac gggctccttc tccgctttac 420cgatacgagg tccctctgga aacccctcac gtgcactcac gggctccttc tccgctttac 420

tccgtcgaat tctctgagga acccttcgga gtgatcgtta gacgccagct ggatggtaga 480tccgtcgaat tctctgagga acccttcgga gtgatcgtta gacgccagct ggatggtaga 480

gtgctgttga acactactgt ggccccactt ttcttcgctg accagtttct gcaactgtcc 540gtgctgttga acactactgt ggccccactt ttcttcgctg accagtttct gcaactgtcc 540

acttccctgc catcccagta cattactgga ctcgccgaac acctgtcgcc actgatgctc 600acttccctgc catcccagta cattactgga ctcgccgaac acctgtcgcc actgatgctc 600

tcgacctctt ggactagaat cactttgtgg aacagagact tggcccctac tccgggagca 660tcgacctctt ggactagaat cactttgtgg aacagagact tggcccctac tccggggagca 660

aatctgtacg gaagccaccc tttttacctg gcgctcgaag atggcggatc cgctcacgga 720aatctgtacg gaagccaccc tttttacctg gcgctcgaag atggcggatc cgctcacgga 720

gtgttcctgc tgaatagcaa cgcaatggac gtggtgctgc aaccttcccc tgcactcagt 780gtgttcctgc tgaatagcaa cgcaatggac gtggtgctgc aaccttcccc tgcactcagt 780

tggagaagta ccgggggtat tctggacgtg tacatcttcc tcggaccaga acccaagagc 840tggagaagta ccgggggtat tctggacgtg tacatcttcc tcggaccaga acccaagagc 840

gtggtgcagc aatatctgga cgtggtcgga taccctttta tgcctcctta ctggggactg 900gtggtgcagc aatatctgga cgtggtcgga taccctttta tgcctcctta ctggggactg 900

ggattccacc tttgccgttg gggctactca tccaccgcca ttaccagaca ggtggtggag 960ggattccacc tttgccgttg gggctactca tccaccgcca ttaccagaca ggtggtggag 960

aatatgacca gagcccactt ccctctcgac gtgcagtgga acgatctgga ctatatggac 1020aatatgacca gagcccactt ccctctcgac gtgcagtgga acgatctgga ctatatggac 1020

tcccggagag atttcacctt caacaaggac gggttccgcg attttcccgc gatggttcaa 1080tcccggagag atttcacctt caacaaggac gggttccgcg attttcccgc gatggttcaa 1080

gagctccacc agggtggtcg aagatatatg atgatcgtcg acccagccat ttcgagcagc 1140gagctccacc agggtggtcg aagatatatg atgatcgtcg acccagccat ttcgagcagc 1140

ggacccgctg gatcttatag accttacgac gaaggcctta ggagaggagt gttcatcaca 1200ggacccgctg gatcttatag accttacgac gaaggcctta ggagaggagt gttcatcaca 1200

aacgagactg gacagccttt gatcggtaaa gtgtggcctg gatcaaccgc ctttcctgac 1260aacgagactg gacagccttt gatcggtaaa gtgtggcctg gatcaaccgc ctttcctgac 1260

tttaccaatc ccactgcctt ggcttggtgg gaggacatgg tggccgaatt ccacgaccaa 1320tttaccaatc ccactgcctt ggcttggtgg gaggacatgg tggccgaatt ccacgaccaa 1320

gtcccctttg atggaatgtg gatcgatatg aacgaaccaa gcaattttat cagaggttcc 1380gtcccctttg atggaatgtg gatcgatatg aacgaaccaa gcaattttat cagaggttcc 1380

gaagacggtt gccccaacaa cgaactggaa aaccctcctt atgtgcccgg agtcgtgggc 1440gaagacggtt gccccaacaa cgaactggaa aaccctcctt atgtgcccgg agtcgtgggc 1440

ggaacattac aggccgcgac tatttgcgcc agcagccacc aattcctgtc cactcactac 1500ggaacattac aggccgcgac tatttgcgcc agcagccacc aattcctgtc cactcactac 1500

aacctccaca acctttatgg attaaccgaa gctattgcaa gtcacagggc tctggtgaag 1560aacctccaca acctttatgg attaaccgaa gctattgcaa gtcacagggc tctggtgaag 1560

gctagaggga ctaggccctt tgtgatctcc cgatccacct ttgccggaca cgggagatac 1620gctagaggga ctaggccctt tgtgatctcc cgatccacct ttgccggaca cgggatac 1620

gccggtcact ggactggtga cgtgtggagc tcatgggaac aactggcctc ctccgtgccg 1680gccggtcact ggactggtga cgtgtggagc tcatgggaac aactggcctc ctccgtgccg 1680

gaaatcttac agttcaacct tctgggtgtc cctcttgtcg gagcagacgt gtgtgggttt 1740gaaatcttac agttcaacct tctgggtgtc cctcttgtcg gagcagacgt gtgtgggttt 1740

cttggtaaca cctccgagga actgtgtgtg cgctggactc aactgggtgc attctaccca 1800cttggtaaca cctccgagga actgtgtgtg cgctggactc aactgggtgc attctaccca 1800

ttcatgagaa accacaactc cttgctgtcc ctgccacaag agccctactc gttcagcgag 1860ttcatgagaa accacaactc cttgctgtcc ctgccacaag agccctactc gttcagcgag 1860

cctgcacaac aggctatgcg gaaggcactg accctgagat acgccctgct tccacactta 1920cctgcacaac aggctatgcg gaaggcactg accctgagat acgccctgct tccacactta 1920

tacactctct tccatcaagc gcatgtggca ggagaaaccg ttgcaaggcc tcttttcctt 1980tacactctct tccatcaagc gcatgtggca ggagaaaccg ttgcaaggcc tcttttcctt 1980

gaattcccca aggattcctc gacttggacg gtggatcatc agctgctgtg gggagaagct 2040gaattcccca aggattcctc gacttggacg gtggatcatc agctgctgtg gggagaagct 2040

ctgctgatta ctccagtgtt gcaagccgga aaagctgagg tgaccggata ctttccgctg 2100ctgctgatta ctccagtgtt gcaagccgga aaagctgagg tgaccggata ctttccgctg 2100

ggaacctggt acgacctcca gactgtccct gttgaagccc ttggatcact gcctccgcct 2160ggaacctggt acgacctcca gactgtccct gttgaagccc ttggatcact gcctccgcct 2160

ccggcagctc cacgcgaacc agctatacat tccgagggac agtgggttac attaccagct 2220ccggcagctc cacgcgaacc agctatacat tccgagggac agtgggttac attaccagct 2220

cctctggaca caatcaacgt ccacttaaga gctggctaca ttatccctct gcaaggacca 2280cctctggaca caatcaacgt ccacttaaga gctggctaca ttatccctct gcaaggacca 2280

ggactgacta cgaccgagag cagacagcag ccaatggcac tggctgtggc tctgaccaag 2340ggactgacta cgaccgagag cagacagcag ccaatggcac tggctgtggc tctgaccaag 2340

ggaggggaag ctagaggaga actcttctgg gatgatgggg agtcccttga agtgctggaa 2400ggaggggaag ctagaggaga actcttctgg gatgatgggg agtcccttga agtgctggaa 2400

agaggcgctt acactcaagt cattttcctt gcacggaaca acaccattgt gaacgaattg 2460agaggcgctt acactcaagt cattttcctt gcacggaaca acaccattgt gaacgaattg 2460

gtgcgagtga ccagcgaagg agctggactt caactgcaga aggtcactgt gctcggagtg 2520gtgcgagtga ccagcgaagg agctggactt caactgcaga aggtcactgt gctcggagtg 2520

gctaccgctc ctcagcaagt gctgtcgaat ggagtccccg tgtcaaactt tacctactcc 2580gctaccgctc ctcagcaagt gctgtcgaat ggagtccccg tgtcaaactt tacctactcc 2580

cctgacacta aggtgctcga catttgcgtg tccctcctga tgggagagca gttccttgtg 2640cctgacacta aggtgctcga catttgcgtg tccctcctga tgggagagca gttccttgtg 2640

tcctggtgtt ga 2652tcctggtgtt ga 2652

<210> 50<210> 50

<211> 2649<211> 2649

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco1-Δ-43<223> hGAAco1-Δ-43

<400> 50<400> 50

caccccggca gacctagagc tgtgcctacc cagtgtgacg tgccccccaa cagcagattc 60caccccggca gacctagagc tgtgcctacc cagtgtgacg tgcccccccaa cagcagattc 60

gactgcgccc ctgacaaggc catcacccag gaacagtgcg aggccagagg ctgctgctac 120gactgcgccc ctgacaaggc catcacccag gaacagtgcg aggccagagg ctgctgctac 120

atccctgcca agcagggact gcagggcgct cagatgggac agccctggtg cttcttccca 180atccctgcca agcagggact gcagggcgct cagatgggac agccctggtg cttcttccca 180

ccctcctacc ccagctacaa gctggaaaac ctgagcagca gcgagatggg ctacaccgcc 240ccctcctacc ccagctacaa gctggaaaac ctgagcagca gcgagatggg ctacaccgcc 240

accctgacca gaaccacccc cacattcttc ccaaaggaca tcctgaccct gcggctggac 300accctgacca gaaccacccc cacattcttc ccaaaggaca tcctgaccct gcggctggac 300

gtgatgatgg aaaccgagaa ccggctgcac ttcaccatca aggaccccgc caatcggaga 360gtgatgatgg aaaccgagaa ccggctgcac ttcaccatca aggaccccgc caatcggaga 360

tacgaggtgc ccctggaaac cccccacgtg cactctagag cccccagccc tctgtacagc 420tacgaggtgc ccctggaaac cccccacgtg cactctagag cccccagccc tctgtacagc 420

gtggaattca gcgaggaacc cttcggcgtg atcgtgcgga gacagctgga tggcagagtg 480gtggaattca gcgaggaacc cttcggcgtg atcgtgcgga gacagctgga tggcagagtg 480

ctgctgaaca ccaccgtggc ccctctgttc ttcgccgacc agttcctgca gctgagcacc 540ctgctgaaca ccaccgtggc ccctctgttc ttcgccgacc agttcctgca gctgagcacc 540

agcctgccca gccagtacat cacaggactg gccgagcacc tgagccccct gatgctgagc 600agcctgccca gccagtacat cacaggactg gccgagcacc tgagccccct gatgctgagc 600

acatcctgga cccggatcac cctgtggaac agggatctgg cccctacccc tggcgccaat 660acatcctgga cccggatcac cctgtggaac agggatctgg cccctacccc tggcgccaat 660

ctgtacggca gccacccttt ctacctggcc ctggaagatg gcggatctgc ccacggagtg 720ctgtacggca gccacccttt ctacctggcc ctggaagatg gcggatctgc ccacggagtg 720

tttctgctga actccaacgc catggacgtg gtgctgcagc ctagccctgc cctgtcttgg 780tttctgctga actccaacgc catggacgtg gtgctgcagc ctagccctgc cctgtcttgg 780

agaagcacag gcggcatcct ggatgtgtac atctttctgg gccccgagcc caagagcgtg 840agaagcacag gcggcatcct ggatgtgtac atctttctgg gccccgagcc caagagcgtg 840

gtgcagcagt atctggatgt cgtgggctac cccttcatgc ccccttactg gggcctggga 900gtgcagcagt atctggatgt cgtgggctac cccttcatgc ccccttactg gggcctggga 900

ttccacctgt gcagatgggg ctactccagc accgccatca ccagacaggt ggtggaaaac 960ttccacctgt gcagatgggg ctactccagc accgccatca ccagacaggt ggtggaaaac 960

atgaccagag cccacttccc actggatgtg cagtggaacg acctggacta catggacagc 1020atgaccagag cccacttccc actggatgtg cagtggaacg acctggacta catggacagc 1020

agacgggact tcaccttcaa caaggacggc ttccgggact tccccgccat ggtgcaggaa 1080agacgggact tcaccttcaa caaggacggc ttccgggact tccccgccat ggtgcaggaa 1080

ctgcatcagg gcggcagacg gtacatgatg atcgtggatc ccgccatcag ctcctctggc 1140ctgcatcagg gcggcagacg gtacatgatg atcgtggatc ccgccatcag ctcctctggc 1140

cctgccggct cttacagacc ctacgacgag ggcctgcgga gaggcgtgtt catcaccaac 1200cctgccggct cttacagacc ctacgacgag ggcctgcgga gaggcgtgtt catcaccaac 1200

gagacaggcc agcccctgat cggcaaagtg tggcctggca gcacagcctt ccccgacttc 1260gagacaggcc agcccctgat cggcaaagtg tggcctggca gcacagcctt ccccgacttc 1260

accaatccta ccgccctggc ttggtgggag gacatggtgg ccgagttcca cgaccaggtg 1320accaatccta ccgccctggc ttggtgggag gacatggtgg ccgagttcca cgaccaggtg 1320

cccttcgacg gcatgtggat cgacatgaac gagcccagca acttcatccg gggcagcgag 1380cccttcgacg gcatgtggat cgacatgaac gagcccagca acttcatccg gggcagcgag 1380

gatggctgcc ccaacaacga actggaaaat cccccttacg tgcccggcgt cgtgggcgga 1440gatggctgcc ccaacaacga actggaaaat cccccttacg tgcccggcgt cgtgggcgga 1440

acactgcagg ccgctacaat ctgtgccagc agccaccagt ttctgagcac ccactacaac 1500acactgcagg ccgctacaat ctgtgccagc agccaccagt ttctgagcac ccactacaac 1500

ctgcacaacc tgtacggcct gaccgaggcc attgccagcc accgcgctct cgtgaaagcc 1560ctgcacaacc tgtacggcct gaccgaggcc attgccagcc accgcgctct cgtgaaagcc 1560

agaggcacac ggcccttcgt gatcagcaga agcacctttg ccggccacgg cagatacgcc 1620agaggcacac ggcccttcgt gatcagcaga agcacctttg ccggccacgg cagatacgcc 1620

ggacattgga ctggcgacgt gtggtcctct tgggagcagc tggcctctag cgtgcccgag 1680ggacattgga ctggcgacgt gtggtcctct tgggagcagc tggcctctag cgtgcccgag 1680

atcctgcagt tcaatctgct gggcgtgcca ctcgtgggcg ccgatgtgtg tggcttcctg 1740atcctgcagt tcaatctgct gggcgtgcca ctcgtgggcg ccgatgtgtg tggcttcctg 1740

ggcaacacct ccgaggaact gtgtgtgcgg tggacacagc tgggcgcctt ctaccctttc 1800ggcaacacct ccgaggaact gtgtgtgcgg tggacacagc tgggcgcctt ctaccctttc 1800

atgagaaacc acaacagcct gctgagcctg ccccaggaac cctacagctt tagcgagcct 1860atgagaaacc acaacagcct gctgagcctg ccccaggaac cctacagctt tagcgagcct 1860

gcacagcagg ccatgcggaa ggccctgaca ctgagatacg ctctgctgcc ccacctgtac 1920gcacagcagg ccatgcggaa ggccctgaca ctgagatacg ctctgctgcc ccacctgtac 1920

accctgtttc accaggccca tgtggccggc gagacagtgg ccagacctct gtttctggaa 1980accctgtttc accaggccca tgtggccggc gagacagtgg ccagacctct gtttctggaa 1980

ttccccaagg acagcagcac ctggaccgtg gaccatcagc tgctgtgggg agaggctctg 2040ttccccaagg acagcagcac ctggaccgtg gaccatcagc tgctgtgggg agaggctctg 2040

ctgattaccc cagtgctgca ggcaggcaag gccgaagtga ccggctactt tcccctgggc 2100ctgattaccc cagtgctgca ggcaggcaag gccgaagtga ccggctactt tcccctgggc 2100

acttggtacg acctgcagac cgtgcctgtg gaagccctgg gatctctgcc tccacctcct 2160acttggtacg acctgcagac cgtgcctgtg gaagccctgg gatctctgcc tccacctcct 2160

gccgctccta gagagcctgc cattcactct gagggccagt gggtcacact gcctgccccc 2220gccgctccta gagagcctgc cattcactct gagggccagt gggtcacact gcctgccccc 2220

ctggatacca tcaacgtgca cctgagggcc ggctacatca taccactgca gggacctggc 2280ctggatacca tcaacgtgca cctgagggcc ggctacatca taccactgca gggacctggc 2280

ctgaccacca ccgagtctag acagcagcca atggccctgg ccgtggccct gaccaaaggc 2340ctgaccacca ccgagtctag acagcagcca atggccctgg ccgtggccct gaccaaaggc 2340

ggagaagcta ggggcgagct gttctgggac gatggcgaga gcctggaagt gctggaaaga 2400ggagaagcta ggggcgagct gttctgggac gatggcgaga gcctggaagt gctggaaaga 2400

ggcgcctata cccaagtgat cttcctggcc cggaacaaca ccatcgtgaa cgagctggtg 2460ggcgcctata cccaagtgat cttcctggcc cggaacaaca ccatcgtgaa cgagctggtg 2460

cgcgtgacct ctgaaggcgc tggactgcag ctgcagaaag tgaccgtgct gggagtggcc 2520cgcgtgacct ctgaaggcgc tggactgcag ctgcagaaag tgaccgtgct gggagtggcc 2520

acagcccctc agcaggtgct gtctaatggc gtgcccgtgt ccaacttcac ctacagcccc 2580acagcccctc agcaggtgct gtctaatggc gtgcccgtgt ccaacttcac ctacagcccc 2580

gacaccaagg tgctggacat ctgcgtgtca ctgctgatgg gagagcagtt tctggtgtcc 2640gacaccaagg tgctggacat ctgcgtgtca ctgctgatgg gagagcagtt tctggtgtcc 2640

tggtgctga 2649tggtgctga 2649

<210> 51<210> 51

<211> 2649<211> 2649

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco2-Δ-43<223> hGAAco2-Δ-43

<400> 51<400> 51

catcctggta gaccaagagc tgtgcctacc caatgcgacg tgccacccaa ctcccgattc 60catcctggta gaccaagagc tgtgcctacc caatgcgacg tgccacccaa ctcccgattc 60

gactgcgcgc cagataaggc tattacccaa gagcagtgtg aagccagagg ttgctgctac 120gactgcgcgc cagataaggc tattacccaa gagcagtgtg aagccagagg ttgctgctac 120

atcccagcga agcaaggatt gcaaggcgcc caaatgggac aaccttggtg tttcttcccc 180atcccagcga agcaaggatt gcaaggcgcc caaatgggac aaccttggtg tttcttcccc 180

ccttcgtacc catcatataa actcgaaaac ctgtcctctt cggaaatggg ttatactgcc 240ccttcgtacc catcatataa actcgaaaac ctgtcctctt cggaaatggg ttatactgcc 240

accctcacca gaactactcc tactttcttc ccgaaagaca tcttgacctt gaggctggac 300accctcacca gaactactcc tactttcttc ccgaaagaca tcttgacctt gaggctggac 300

gtgatgatgg agactgaaaa ccggctgcat ttcactatca aagatcctgc caatcggcga 360gtgatgatgg agactgaaaa ccggctgcat ttcactatca aagatcctgc caatcggcga 360

tacgaggtcc ctctggaaac ccctcacgtg cactcacggg ctccttctcc gctttactcc 420tacgaggtcc ctctggaaac ccctcacgtg cactcacggg ctccttctcc gctttatcc 420

gtcgaattct ctgaggaacc cttcggagtg atcgttagac gccagctgga tggtagagtg 480gtcgaattct ctgaggaacc cttcggagtg atcgttagac gccagctgga tggtagagtg 480

ctgttgaaca ctactgtggc cccacttttc ttcgctgacc agtttctgca actgtccact 540ctgttgaaca ctactgtggc cccacttttc ttcgctgacc agtttctgca actgtccact 540

tccctgccat cccagtacat tactggactc gccgaacacc tgtcgccact gatgctctcg 600tccctgccat cccagtacat tactggactc gccgaacacc tgtcgccact gatgctctcg 600

acctcttgga ctagaatcac tttgtggaac agagacttgg cccctactcc gggagcaaat 660acctcttgga ctagaatcac tttgtggaac agagacttgg cccctactcc gggagcaaat 660

ctgtacggaa gccacccttt ttacctggcg ctcgaagatg gcggatccgc tcacggagtg 720ctgtacggaa gccacccttt ttacctggcg ctcgaagatg gcggatccgc tcacggagtg 720

ttcctgctga atagcaacgc aatggacgtg gtgctgcaac cttcccctgc actcagttgg 780ttcctgctga atagcaacgc aatggacgtg gtgctgcaac cttcccctgc actcagttgg 780

agaagtaccg ggggtattct ggacgtgtac atcttcctcg gaccagaacc caagagcgtg 840agaagtaccg ggggtattct ggacgtgtac atcttcctcg gaccagaacc caagagcgtg 840

gtgcagcaat atctggacgt ggtcggatac ccttttatgc ctccttactg gggactggga 900gtgcagcaat atctggacgt ggtcggatac ccttttatgc ctccttatactg gggactggga 900

ttccaccttt gccgttgggg ctactcatcc accgccatta ccagacaggt ggtggagaat 960ttccaccttt gccgttgggg ctactcatcc accgccatta ccagacaggt ggtggagaat 960

atgaccagag cccacttccc tctcgacgtg cagtggaacg atctggacta tatggactcc 1020atgaccagag cccacttccc tctcgacgtg cagtggaacg atctggacta tatggactcc 1020

cggagagatt tcaccttcaa caaggacggg ttccgcgatt ttcccgcgat ggttcaagag 1080cggagagatt tcaccttcaa caaggacggg ttccgcgatt ttcccgcgat ggttcaagag 1080

ctccaccagg gtggtcgaag atatatgatg atcgtcgacc cagccatttc gagcagcgga 1140ctccaccagg gtggtcgaag atatatgatg atcgtcgacc cagccatttc gagcagcgga 1140

cccgctggat cttatagacc ttacgacgaa ggccttagga gaggagtgtt catcacaaac 1200cccgctggat cttatagacc ttacgacgaa ggccttagga gaggagtgtt catcacaaac 1200

gagactggac agcctttgat cggtaaagtg tggcctggat caaccgcctt tcctgacttt 1260gagactggac agcctttgat cggtaaagtg tggcctggat caaccgcctt tcctgacttt 1260

accaatccca ctgccttggc ttggtgggag gacatggtgg ccgaattcca cgaccaagtc 1320accaatccca ctgccttggc ttggtgggag gacatggtgg ccgaattcca cgaccaagtc 1320

ccctttgatg gaatgtggat cgatatgaac gaaccaagca attttatcag aggttccgaa 1380ccctttgatg gaatgtggat cgatatgaac gaaccaagca attttatcag aggttccgaa 1380

gacggttgcc ccaacaacga actggaaaac cctccttatg tgcccggagt cgtgggcgga 1440gacggttgcc ccaacaacga actggaaaac cctccttatg tgcccggagt cgtgggcgga 1440

acattacagg ccgcgactat ttgcgccagc agccaccaat tcctgtccac tcactacaac 1500acattacagg ccgcgactat ttgcgccagc agccaccaat tcctgtccac tcactacaac 1500

ctccacaacc tttatggatt aaccgaagct attgcaagtc acagggctct ggtgaaggct 1560ctccacaacc tttatggatt aaccgaagct attgcaagtc acagggctct ggtgaaggct 1560

agagggacta ggccctttgt gatctcccga tccacctttg ccggacacgg gagatacgcc 1620agagggacta ggccctttgt gatctcccga tccacctttg ccggacacgg gagatacgcc 1620

ggtcactgga ctggtgacgt gtggagctca tgggaacaac tggcctcctc cgtgccggaa 1680ggtcactgga ctggtgacgt gtggagctca tgggaacaac tggcctcctc cgtgccggaa 1680

atcttacagt tcaaccttct gggtgtccct cttgtcggag cagacgtgtg tgggtttctt 1740atcttacagt tcaaccttct gggtgtcct cttgtcggag cagacgtgtg tgggtttctt 1740

ggtaacacct ccgaggaact gtgtgtgcgc tggactcaac tgggtgcatt ctacccattc 1800ggtaacacct ccgaggaact gtgtgtgcgc tggactcaac tgggtgcatt ctacccattc 1800

atgagaaacc acaactcctt gctgtccctg ccacaagagc cctactcgtt cagcgagcct 1860atgagaaacc acaactcctt gctgtccctg ccacaagagc cctactcgtt cagcgagcct 1860

gcacaacagg ctatgcggaa ggcactgacc ctgagatacg ccctgcttcc acacttatac 1920gcacaacagg ctatgcggaa ggcactgacc ctgagatacg ccctgcttcc acacttatac 1920

actctcttcc atcaagcgca tgtggcagga gaaaccgttg caaggcctct tttccttgaa 1980actctcttcc atcaagcgca tgtggcagga gaaaccgttg caaggcctct tttccttgaa 1980

ttccccaagg attcctcgac ttggacggtg gatcatcagc tgctgtgggg agaagctctg 2040ttccccaagg attcctcgac ttggacggtg gatcatcagc tgctgtgggg agaagctctg 2040

ctgattactc cagtgttgca agccggaaaa gctgaggtga ccggatactt tccgctggga 2100ctgattactc cagtgttgca agccggaaaa gctgaggtga ccggatactt tccgctggga 2100

acctggtacg acctccagac tgtccctgtt gaagcccttg gatcactgcc tccgcctccg 2160acctggtacg acctccagac tgtccctgtt gaagcccttg gatcactgcc tccgcctccg 2160

gcagctccac gcgaaccagc tatacattcc gagggacagt gggttacatt accagctcct 2220gcagctccac gcgaaccagc tatacattcc gagggacagt gggttacatt accagctcct 2220

ctggacacaa tcaacgtcca cttaagagct ggctacatta tccctctgca aggaccagga 2280ctggacacaa tcaacgtcca cttaagagct ggctacatta tccctctgca aggaccagga 2280

ctgactacga ccgagagcag acagcagcca atggcactgg ctgtggctct gaccaaggga 2340ctgactacga ccgagagcag acagcagcca atggcactgg ctgtggctct gaccaaggga 2340

ggggaagcta gaggagaact cttctgggat gatggggagt cccttgaagt gctggaaaga 2400ggggaagcta gaggagaact cttctggggat gatggggagt cccttgaagt gctggaaaga 2400

ggcgcttaca ctcaagtcat tttccttgca cggaacaaca ccattgtgaa cgaattggtg 2460ggcgcttaca ctcaagtcat tttccttgca cggaacaaca ccattgtgaa cgaattggtg 2460

cgagtgacca gcgaaggagc tggacttcaa ctgcagaagg tcactgtgct cggagtggct 2520cgagtgacca gcgaaggagc tggacttcaa ctgcagaagg tcactgtgct cggagtggct 2520

accgctcctc agcaagtgct gtcgaatgga gtccccgtgt caaactttac ctactcccct 2580accgctcctc agcaagtgct gtcgaatgga gtccccgtgt caaactttac ctactcccct 2580

gacactaagg tgctcgacat ttgcgtgtcc ctcctgatgg gagagcagtt ccttgtgtcc 2640gacactaagg tgctcgacat ttgcgtgtcc ctcctgatgg gagagcagtt ccttgtgtcc 2640

tggtgttga 2649tggtgttga 2649

<210> 52<210> 52

<211> 2637<211> 2637

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco1-Δ-47<223> hGAAco1-Δ-47

<400> 52<400> 52

cctagagctg tgcctaccca gtgtgacgtg ccccccaaca gcagattcga ctgcgcccct 60cctagagctg tgcctaccca gtgtgacgtg ccccccaaca gcagattcga ctgcgcccct 60

gacaaggcca tcacccagga acagtgcgag gccagaggct gctgctacat ccctgccaag 120gacaaggcca tcacccagga acagtgcgag gccagaggct gctgctacat ccctgccaag 120

cagggactgc agggcgctca gatgggacag ccctggtgct tcttcccacc ctcctacccc 180cagggactgc agggcgctca gatggggacag ccctggtgct tcttcccacc ctcctacccc 180

agctacaagc tggaaaacct gagcagcagc gagatgggct acaccgccac cctgaccaga 240agctacaagc tggaaaacct gagcagcagc gagatgggct acaccgccac cctgaccaga 240

accaccccca cattcttccc aaaggacatc ctgaccctgc ggctggacgt gatgatggaa 300accacccccca cattcttccc aaaggacatc ctgaccctgc ggctggacgt gatgatggaa 300

accgagaacc ggctgcactt caccatcaag gaccccgcca atcggagata cgaggtgccc 360accgagaacc ggctgcactt caccatcaag gaccccgcca atcggagata cgaggtgccc 360

ctggaaaccc cccacgtgca ctctagagcc cccagccctc tgtacagcgt ggaattcagc 420ctggaaaccc cccacgtgca ctctagagcc cccagccctc tgtacagcgt ggaattcagc 420

gaggaaccct tcggcgtgat cgtgcggaga cagctggatg gcagagtgct gctgaacacc 480gaggaaccct tcggcgtgat cgtgcggaga cagctggatg gcagagtgct gctgaacacc 480

accgtggccc ctctgttctt cgccgaccag ttcctgcagc tgagcaccag cctgcccagc 540accgtggccc ctctgttctt cgccgaccag ttcctgcagc tgagcaccag cctgcccagc 540

cagtacatca caggactggc cgagcacctg agccccctga tgctgagcac atcctggacc 600cagtacatca caggactggc cgagcacctg agccccctga tgctgagcac atcctggacc 600

cggatcaccc tgtggaacag ggatctggcc cctacccctg gcgccaatct gtacggcagc 660cggatcaccc tgtggaacag ggatctggcc cctacccctg gcgccaatct gtacggcagc 660

caccctttct acctggccct ggaagatggc ggatctgccc acggagtgtt tctgctgaac 720caccctttct acctggccct ggaagatggc ggatctgccc acggagtgtt tctgctgaac 720

tccaacgcca tggacgtggt gctgcagcct agccctgccc tgtcttggag aagcacaggc 780tccaacgcca tggacgtggt gctgcagcct agccctgccc tgtcttggag aagcacaggc 780

ggcatcctgg atgtgtacat ctttctgggc cccgagccca agagcgtggt gcagcagtat 840ggcatcctgg atgtgtacat ctttctgggc cccgagccca agagcgtggt gcagcagtat 840

ctggatgtcg tgggctaccc cttcatgccc ccttactggg gcctgggatt ccacctgtgc 900ctggatgtcg tgggctaccc cttcatgccc ccttactggg gcctgggatt ccacctgtgc 900

agatggggct actccagcac cgccatcacc agacaggtgg tggaaaacat gaccagagcc 960agatggggct actccagcac cgccatcacc agacaggtgg tggaaaacat gaccagagcc 960

cacttcccac tggatgtgca gtggaacgac ctggactaca tggacagcag acgggacttc 1020cacttcccac tggatgtgca gtggaacgac ctggactaca tggacagcag acgggacttc 1020

accttcaaca aggacggctt ccgggacttc cccgccatgg tgcaggaact gcatcagggc 1080accttcaaca aggacggctt ccgggacttc cccgccatgg tgcaggaact gcatcagggc 1080

ggcagacggt acatgatgat cgtggatccc gccatcagct cctctggccc tgccggctct 1140ggcagacggt acatgatgat cgtggatccc gccatcagct cctctggccc tgccggctct 1140

tacagaccct acgacgaggg cctgcggaga ggcgtgttca tcaccaacga gacaggccag 1200tacagaccct acgacgaggg cctgcggaga ggcgtgttca tcaccaacga gacaggccag 1200

cccctgatcg gcaaagtgtg gcctggcagc acagccttcc ccgacttcac caatcctacc 1260cccctgatcg gcaaagtgtg gcctggcagc acagccttcc ccgacttcac caatcctacc 1260

gccctggctt ggtgggagga catggtggcc gagttccacg accaggtgcc cttcgacggc 1320gccctggctt ggtggggagga catggtggcc gagttccacg accaggtgcc cttcgacggc 1320

atgtggatcg acatgaacga gcccagcaac ttcatccggg gcagcgagga tggctgcccc 1380atgtggatcg acatgaacga gcccagcaac ttcatccggg gcagcgagga tggctgcccc 1380

aacaacgaac tggaaaatcc cccttacgtg cccggcgtcg tgggcggaac actgcaggcc 1440aacaacgaac tggaaaatcc cccttacgtg cccggcgtcg tgggcggaac actgcaggcc 1440

gctacaatct gtgccagcag ccaccagttt ctgagcaccc actacaacct gcacaacctg 1500gctacaatct gtgccagcag ccaccagttt ctgagcaccc actacaacct gcacaacctg 1500

tacggcctga ccgaggccat tgccagccac cgcgctctcg tgaaagccag aggcacacgg 1560tacggcctga ccgaggccat tgccagccac cgcgctctcg tgaaagccag aggcacacgg 1560

cccttcgtga tcagcagaag cacctttgcc ggccacggca gatacgccgg acattggact 1620cccttcgtga tcagcagaag cacctttgcc ggccacggca gatacgccgg acattggact 1620

ggcgacgtgt ggtcctcttg ggagcagctg gcctctagcg tgcccgagat cctgcagttc 1680ggcgacgtgt ggtcctcttg ggagcagctg gcctctagcg tgcccgagat cctgcagttc 1680

aatctgctgg gcgtgccact cgtgggcgcc gatgtgtgtg gcttcctggg caacacctcc 1740aatctgctgg gcgtgccact cgtgggcgcc gatgtgtgtg gcttcctggg caacacctcc 1740

gaggaactgt gtgtgcggtg gacacagctg ggcgccttct accctttcat gagaaaccac 1800gaggaactgt gtgtgcggtg gacacagctg ggcgccttct accctttcat gagaaaccac 1800

aacagcctgc tgagcctgcc ccaggaaccc tacagcttta gcgagcctgc acagcaggcc 1860aacagcctgc tgagcctgcc ccaggaaccc tacagcttta gcgagcctgc acagcaggcc 1860

atgcggaagg ccctgacact gagatacgct ctgctgcccc acctgtacac cctgtttcac 1920atgcggaagg ccctgacact gagatacgct ctgctgcccc acctgtacac cctgtttcac 1920

caggcccatg tggccggcga gacagtggcc agacctctgt ttctggaatt ccccaaggac 1980caggcccatg tggccggcga gacagtggcc agacctctgt ttctggaatt ccccaaggac 1980

agcagcacct ggaccgtgga ccatcagctg ctgtggggag aggctctgct gattacccca 2040agcagcacct ggaccgtgga ccatcagctg ctgtggggag aggctctgct gattacccca 2040

gtgctgcagg caggcaaggc cgaagtgacc ggctactttc ccctgggcac ttggtacgac 2100gtgctgcagg caggcaaggc cgaagtgacc ggctactttc ccctgggcac ttggtacgac 2100

ctgcagaccg tgcctgtgga agccctggga tctctgcctc cacctcctgc cgctcctaga 2160ctgcagaccg tgcctgtgga agccctggga tctctgcctc cacctcctgc cgctcctaga 2160

gagcctgcca ttcactctga gggccagtgg gtcacactgc ctgcccccct ggataccatc 2220gagcctgcca ttcactctga gggccagtgg gtcacactgc ctgcccccct ggataccatc 2220

aacgtgcacc tgagggccgg ctacatcata ccactgcagg gacctggcct gaccaccacc 2280aacgtgcacc tgagggccgg ctacatcata ccactgcagg gacctggcct gaccaccacc 2280

gagtctagac agcagccaat ggccctggcc gtggccctga ccaaaggcgg agaagctagg 2340gagtctagac agcagccaat ggccctggcc gtggccctga ccaaaggcgg agaagctagg 2340

ggcgagctgt tctgggacga tggcgagagc ctggaagtgc tggaaagagg cgcctatacc 2400ggcgagctgt tctgggacga tggcgagagc ctggaagtgc tggaaagagg cgcctatacc 2400

caagtgatct tcctggcccg gaacaacacc atcgtgaacg agctggtgcg cgtgacctct 2460caagtgatct tcctggcccg gaacaacacc atcgtgaacg agctggtgcg cgtgacctct 2460

gaaggcgctg gactgcagct gcagaaagtg accgtgctgg gagtggccac agcccctcag 2520gaaggcgctg gactgcagct gcagaaagtg accgtgctgg gagtggccac agcccctcag 2520

caggtgctgt ctaatggcgt gcccgtgtcc aacttcacct acagccccga caccaaggtg 2580caggtgctgt ctaatggcgt gcccgtgtcc aacttcacct acagccccga caccaaggtg 2580

ctggacatct gcgtgtcact gctgatggga gagcagtttc tggtgtcctg gtgctga 2637ctggacatct gcgtgtcact gctgatggga gagcagtttc tggtgtcctg gtgctga 2637

<210> 53<210> 53

<211> 2637<211> 2637

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequence

<220><220>

<223> hGAAco2-Δ-47<223> hGAAco2-Δ-47

<400> 53<400> 53

ccaagagctg tgcctaccca atgcgacgtg ccacccaact cccgattcga ctgcgcgcca 60ccaagagctg tgcctaccca atgcgacgtg ccacccaact cccgattcga ctgcgcgcca 60

gataaggcta ttacccaaga gcagtgtgaa gccagaggtt gctgctacat cccagcgaag 120gataaggcta ttacccaaga gcagtgtgaa gccagaggtt gctgctacat cccagcgaag 120

caaggattgc aaggcgccca aatgggacaa ccttggtgtt tcttcccccc ttcgtaccca 180caaggattgc aaggcgccca aatgggacaa ccttggtgtt tcttcccccc ttcgtaccca 180

tcatataaac tcgaaaacct gtcctcttcg gaaatgggtt atactgccac cctcaccaga 240tcatataaac tcgaaaacct gtcctcttcg gaaatgggtt atactgccac cctcaccaga 240

actactccta ctttcttccc gaaagacatc ttgaccttga ggctggacgt gatgatggag 300actactccta ctttcttccc gaaagacatc ttgaccttga ggctggacgt gatgatggag 300

actgaaaacc ggctgcattt cactatcaaa gatcctgcca atcggcgata cgaggtccct 360actgaaaacc ggctgcattt cactatcaaa gatcctgcca atcggcgata cgaggtccct 360

ctggaaaccc ctcacgtgca ctcacgggct ccttctccgc tttactccgt cgaattctct 420ctggaaaccc ctcacgtgca ctcacgggct ccttctccgc tttactccgt cgaattctct 420

gaggaaccct tcggagtgat cgttagacgc cagctggatg gtagagtgct gttgaacact 480gaggaaccct tcggagtgat cgttagacgc cagctggatg gtagagtgct gttgaacact 480

actgtggccc cacttttctt cgctgaccag tttctgcaac tgtccacttc cctgccatcc 540actgtggccc cacttttctt cgctgaccag tttctgcaac tgtccacttc cctgccatcc 540

cagtacatta ctggactcgc cgaacacctg tcgccactga tgctctcgac ctcttggact 600cagtacatta ctggactcgc cgaacacctg tcgccactga tgctctcgac ctcttggact 600

agaatcactt tgtggaacag agacttggcc cctactccgg gagcaaatct gtacggaagc 660agaatcactt tgtggaacag agacttggcc cctactccgg gagcaaatct gtacggaagc 660

cacccttttt acctggcgct cgaagatggc ggatccgctc acggagtgtt cctgctgaat 720cacccttttt acctggcgct cgaagatggc ggatccgctc acggagtgtt cctgctgaat 720

agcaacgcaa tggacgtggt gctgcaacct tcccctgcac tcagttggag aagtaccggg 780agcaacgcaa tggacgtggt gctgcaacct tcccctgcac tcagttggag aagtaccggg 780

ggtattctgg acgtgtacat cttcctcgga ccagaaccca agagcgtggt gcagcaatat 840ggtattctgg acgtgtacat cttcctcgga ccagaaccca agagcgtggt gcagcaatat 840

ctggacgtgg tcggataccc ttttatgcct ccttactggg gactgggatt ccacctttgc 900ctggacgtgg tcggataccc ttttatgcct ccttactggg gactgggatt ccacctttgc 900

cgttggggct actcatccac cgccattacc agacaggtgg tggagaatat gaccagagcc 960cgttggggct actcatccac cgccattacc agacaggtgg tggagaatat gaccagagcc 960

cacttccctc tcgacgtgca gtggaacgat ctggactata tggactcccg gagagatttc 1020cacttccctc tcgacgtgca gtggaacgat ctggactata tggactcccg gagagatttc 1020

accttcaaca aggacgggtt ccgcgatttt cccgcgatgg ttcaagagct ccaccagggt 1080accttcaaca aggacgggtt ccgcgatttt cccgcgatgg ttcaagagct ccaccagggt 1080

ggtcgaagat atatgatgat cgtcgaccca gccatttcga gcagcggacc cgctggatct 1140ggtcgaagat atatgatgat cgtcgaccca gccatttcga gcagcggacc cgctggatct 1140

tatagacctt acgacgaagg ccttaggaga ggagtgttca tcacaaacga gactggacag 1200tatagacctt acgacgaagg ccttaggaga ggagtgttca tcacaaacga gactggacag 1200

cctttgatcg gtaaagtgtg gcctggatca accgcctttc ctgactttac caatcccact 1260cctttgatcg gtaaagtgtg gcctggatca accgcctttc ctgactttac caatcccact 1260

gccttggctt ggtgggagga catggtggcc gaattccacg accaagtccc ctttgatgga 1320gccttggctt ggtggggagga catggtggcc gaattccacg accaagtccc ctttgatgga 1320

atgtggatcg atatgaacga accaagcaat tttatcagag gttccgaaga cggttgcccc 1380atgtggatcg atatgaacga accaagcaat tttatcagag gttccgaaga cggttgcccc 1380

aacaacgaac tggaaaaccc tccttatgtg cccggagtcg tgggcggaac attacaggcc 1440aacaacgaac tggaaaaccc tccttatgtg cccggagtcg tgggcggaac attacaggcc 1440

gcgactattt gcgccagcag ccaccaattc ctgtccactc actacaacct ccacaacctt 1500gcgactattt gcgccagcag ccaccaattc ctgtccactc actacaacct ccacaacctt 1500

tatggattaa ccgaagctat tgcaagtcac agggctctgg tgaaggctag agggactagg 1560tatggattaa ccgaagctat tgcaagtcac agggctctgg tgaaggctag agggactagg 1560

ccctttgtga tctcccgatc cacctttgcc ggacacggga gatacgccgg tcactggact 1620ccctttgtga tctcccgatc cacctttgcc ggacacggga gatacgccgg tcactggact 1620

ggtgacgtgt ggagctcatg ggaacaactg gcctcctccg tgccggaaat cttacagttc 1680ggtgacgtgt ggagctcatg ggaacaactg gcctcctccg tgccggaaat cttacagttc 1680

aaccttctgg gtgtccctct tgtcggagca gacgtgtgtg ggtttcttgg taacacctcc 1740aaccttctgg gtgtccctct tgtcggagca gacgtgtgtg ggtttcttgg taacacctcc 1740

gaggaactgt gtgtgcgctg gactcaactg ggtgcattct acccattcat gagaaaccac 1800gaggaactgt gtgtgcgctg gactcaactg ggtgcattct acccattcat gagaaaccac 1800

aactccttgc tgtccctgcc acaagagccc tactcgttca gcgagcctgc acaacaggct 1860aactccttgc tgtccctgcc acaagagccc tactcgttca gcgagcctgc acaacaggct 1860

atgcggaagg cactgaccct gagatacgcc ctgcttccac acttatacac tctcttccat 1920atgcggaagg cactgaccct gagatacgcc ctgcttccac acttatacac tctcttccat 1920

caagcgcatg tggcaggaga aaccgttgca aggcctcttt tccttgaatt ccccaaggat 1980caagcgcatg tggcaggaga aaccgttgca aggcctcttt tccttgaatt ccccaaggat 1980

tcctcgactt ggacggtgga tcatcagctg ctgtggggag aagctctgct gattactcca 2040tcctcgactt ggacggtgga tcatcagctg ctgtggggag aagctctgct gattactcca 2040

gtgttgcaag ccggaaaagc tgaggtgacc ggatactttc cgctgggaac ctggtacgac 2100gtgttgcaag ccggaaaagc tgaggtgacc ggatactttc cgctgggaac ctggtacgac 2100

ctccagactg tccctgttga agcccttgga tcactgcctc cgcctccggc agctccacgc 2160ctccagactg tccctgttga agcccttgga tcactgcctc cgcctccggc agctccacgc 2160

gaaccagcta tacattccga gggacagtgg gttacattac cagctcctct ggacacaatc 2220gaaccagcta tacattccga gggacagtgg gttacattac cagctcctct ggacacaatc 2220

aacgtccact taagagctgg ctacattatc cctctgcaag gaccaggact gactacgacc 2280aacgtccact taagagctgg ctacattatc cctctgcaag gaccaggact gactacgacc 2280

gagagcagac agcagccaat ggcactggct gtggctctga ccaagggagg ggaagctaga 2340gagagcagac agcagccaat ggcactggct gtggctctga ccaagggagg ggaagctaga 2340

ggagaactct tctgggatga tggggagtcc cttgaagtgc tggaaagagg cgcttacact 2400ggagaactct tctggggatga tggggagtcc cttgaagtgc tggaaagagg cgcttacact 2400

caagtcattt tccttgcacg gaacaacacc attgtgaacg aattggtgcg agtgaccagc 2460caagtcattt tccttgcacg gaacaacacc attgtgaacg aattggtgcg agtgaccagc 2460

gaaggagctg gacttcaact gcagaaggtc actgtgctcg gagtggctac cgctcctcag 2520gaaggagctg gacttcaact gcagaaggtc actgtgctcg gagtggctac cgctcctcag 2520

caagtgctgt cgaatggagt ccccgtgtca aactttacct actcccctga cactaaggtg 2580caagtgctgt cgaatggagt ccccgtgtca aactttacct actcccctga cactaaggtg 2580

ctcgacattt gcgtgtccct cctgatggga gagcagttcc ttgtgtcctg gtgttga 2637ctcgacattt gcgtgtccct cctgatggga gagcagttcc ttgtgtcctg gtgttga 2637

Claims

1. A recombinant AAV vector comprising an expression cassette comprising a nucleic acid molecule encoding a truncated GAA polypeptide that has 8 deleted at its N-terminal end compared with the parent GAA polypeptide. to 43 consecutive amino acids, wherein the parent GAA polypeptide corresponds to a precursor form of a GAA polypeptide that does not contain a signal peptide, and

Wherein the truncated GAA polypeptide further comprises a signal peptide fused to its N-terminal end.

2. The recombinant AAV vector of claim 1, wherein the truncated GAA polypeptide has 8, 9, 10, 40, 41, 42, or 43 consecutive deletions at its N-terminal end compared to the parent GAA polypeptide. Amino acids.

3. The recombinant AAV vector of claim 1 or 2, wherein the truncated GAA polypeptide has 8, 42 or 43 consecutive amino acids truncated at its N-terminal end compared to the parent GAA polypeptide.

4. The recombinant AAV vector of claim 1 or 2, wherein the parent GAA polypeptide is human GAA.

5. The recombinant AAV vector of claim 1 or 2, wherein the parent GAA polypeptide is human GAA consisting of the amino acid sequence of SEQ ID NO: 1 or SEQ ID NO: 33.

6. The recombinant AAV vector of claim 1 or 2, wherein the parent GAA polypeptide is human GAA consisting of the amino acid sequence of SEQ ID NO: 1.

7. The recombinant AAV vector of claim 1 or 2, wherein the truncated GAA polypeptide consists of the amino acid sequence of SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 34 or SEQ ID NO: 35.

8. The recombinant AAV vector of claim 1 or 2, wherein the fused signal peptide is selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6 and SEQ ID NO: 7 of amino acid sequence.

9. The recombinant AAV vector of claim 1 or 2, wherein the fused signal peptide consists of the amino acid sequence of SEQ ID NO: 3.

10. The recombinant AAV vector of claim 1 or 2, wherein the nucleic acid molecule encoding a truncated GAA polypeptide is optimized to increase expression of the truncated GAA polypeptide in vivo and/or to increase response to the truncated GAA. Immune tolerance of polypeptides, the nucleic acid molecule consisting of the nucleic acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50 or SEQ ID NO: 51 composition.

11. The recombinant AAV vector of claim 1 or 2, wherein said nucleic acid molecule is operably linked to a promoter.

12. The recombinant AAV vector of claim 11, wherein said promoter is a liver-specific promoter.

13. The recombinant AAV vector of claim 12, wherein the promoter is selected from the group consisting of alpha-1 antitrypsin promoter, transthyretin promoter, albumin promoter and thyroxine-binding globulin promoter.

14. The recombinant AAV vector of claim 1 or 2, wherein the expression cassette further comprises an intron.

15. The recombinant AAV vector of claim 14, wherein the intron is selected from the group consisting of human β-globin b2 intron, FIX intron, chicken β-globin intron and SV40 intron.

16. The recombinant AAV vector of claim 14, wherein the intron is the modified human beta globin b2 intron of SEQ ID NO: 17, the modified FIX intron of SEQ ID NO: 19, or the modified FIX intron of SEQ ID NO: 17 :21 modified chicken β-globin intron.

17. The recombinant AAV vector of claim 1 or 2, wherein the expression cassette comprises in the following order: an enhancer; a promoter, an intron; the nucleic acid molecule encoding the truncated GAA polypeptide; and polyadenosine signal.

18. The recombinant AAV vector of claim 1 or 2, wherein said expression cassette comprises in the following order: ApoE control region; hAAT promoter; human beta globin b2 intron; said truncated GAA polypeptide encoding Nucleic acid molecules; and bovine growth hormone polyadenylation signals.

19. The recombinant AAV vector of claim 1 or 2, wherein the expression cassette consists of the nucleotide sequence of any one of SEQ ID NO: 22 to 26.

20. The recombinant AAV vector of claim 1 or 2, wherein the recombinant AAV vector is a single-stranded or double-stranded self-complementary AAV vector.

21. The recombinant AAV vector of claim 1 or 2, wherein the recombinant AAV vector is a vector having a gene selected from the group consisting of AAV1, AAV2, variant AAV2, AAV3, variant AAV3, AAV3B, variant AAV3B, AAV4, AAV5, AAV6, variant AAV capsids of AAV6, AAV7, AAV8, AAV9, AAV10, AAVrh74, AAVdj, AAV-Anc80, AAV-LK03, AAV2i8 and porcine AAV or AAV vectors with chimeric capsids.

22. The recombinant AAV vector of claim 1 or 2, wherein the recombinant AAV vector has an AAV8, AAV9, AAVrh74 or AAV2i8 capsid.

23. The recombinant AAV vector of claim 1 or 2, wherein the recombinant AAV vector has an AAV8, AAV9 or AAVrh74 capsid.

24. The recombinant AAV vector of claim 1 or 2, wherein the recombinant AAV vector has an AAV8 capsid.

25. The recombinant AAV vector of claim 1 or 2, comprising an expression cassette and a modified capsid, the expression cassette comprising a nucleic acid molecule encoding a truncated GAA polypeptide consisting of SEQ ID NO: 27 The amino acid sequence composition, the truncated GAA polypeptide also includes a signal peptide fused to its N-terminal end, the signal peptide consists of the amino acid sequence of SEQ ID NO: 3, and the modified capsid includes AAVrh74 derived from or one or more variant VP capsid proteins selected from RHM4-1, RHM15-1, RHM15-2, RHM15-3/RHM15-5, RHM15-4 and RHM15-6.

26. A cell transformed with the recombinant AAV vector of any one of claims 1 to 25.

27. The cell of claim 26, wherein said cell is a hepatocyte or a muscle cell.

28. A pharmaceutical composition comprising the recombinant AAV vector of any one of claims 1 to 25 or the cell of claim 26 or 27 in a pharmaceutically acceptable carrier.

29. Use of the recombinant AAV vector of any one of claims 1 to 25, the cell of claim 26 or 27 or the pharmaceutical composition of claim 28 for the preparation of a medicament for the treatment of GAA deficiency or glycogen storage sick.

30. The use of claim 29, wherein the glycogen storage disease is selected from the group consisting of GSDI, GSDII, GSDIII, GSDIV, GSDV, GSDVI, GSDVII, GSDVIII and fatal congenital glycogen storage diseases of the heart.

31. The use of claim 30, wherein the glycogen storage disease is GSDII or GSDIII.

32. The use of claim 31, wherein said glycogen storage disease is GSDIII.

33. Use of the recombinant AAV vector of any one of claims 1 to 25, the cell of claim 26 or 27 or the pharmaceutical composition of claim 28 for the preparation of a medicament for the treatment of glycogen storage disease while simultaneously The immunogenicity of the truncated GAA polypeptide is reduced.