CN113355325B

CN113355325B - Preparation method and application of humanized ACE2 gene modified mouse embryonic stem cell model

Info

Publication number: CN113355325B
Application number: CN202110245653.6A
Authority: CN
Inventors: 陈捷凯; 孙骄杨; 吴光明; 吴凯昕; 匡俊企; 全雄志; 郭琳
Original assignee: Guangzhou Institute of Biomedicine and Health of CAS
Current assignee: Guangzhou Institute of Biomedicine and Health of CAS
Priority date: 2020-03-06
Filing date: 2021-03-05
Publication date: 2023-07-14
Anticipated expiration: 2041-03-05
Also published as: CN113355325A

Abstract

The invention relates to the field of biological medicine, in particular to a preparation method and application of a humanized ACE2 gene modified mouse embryonic stem cell model. The invention provides a sgRNA fragment sequence. The targeting vector containing the sgRNA fragment sequence can specifically target ACE2 genes and can be used for preparing humanized ACE2 gene modified mouse embryonic stem cell models. The sgRNA fragment sequence of the invention has better targeting specificity and efficiency of cutting target genes.

Description

Preparation method and application of humanized ACE2 gene-modified mouse embryonic stem cell model

技术领域Technical Field

本发明涉及生物医药领域，尤其涉及一种人源化ACE2基因改造小鼠胚胎干细胞模型制备方法及应用。The present invention relates to the field of biomedicine, and in particular to a method for preparing a humanized ACE2 gene-modified mouse embryonic stem cell model and application thereof.

背景技术Background Art

虽然现阶段已有在Vero E6 cells等细胞模型上感染2019-nCoV进行药物筛选，但细胞模型的单一性远不能比拟动物模型优势。Although 2019-nCoV has been infected in cell models such as Vero E6 cells for drug screening, the singleness of cell models is far from comparable to the advantages of animal models.

实验动物疾病模型对于研究人类疾病发生的病因、发病机制、开发防治技术和治疗药物是不可缺少的研究工具。常见的实验动物包括小鼠、大鼠、豚鼠、地鼠(仓鼠)、兔、犬、猴、猪和鱼等。然而，人类和动物的基因和蛋白质序列还是存在不少差异，许多人类蛋白质不能与动物的同源蛋白质结合而产生生物活性，导致许多临床试验的结果也与动物实验的结果不相符。Experimental animal disease models are indispensable research tools for studying the causes and pathogenesis of human diseases, and developing prevention and treatment technologies and therapeutic drugs. Common experimental animals include mice, rats, guinea pigs, hamsters, rabbits, dogs, monkeys, pigs, and fish. However, there are still many differences in the gene and protein sequences between humans and animals. Many human proteins cannot bind to homologous proteins in animals to produce biological activity, resulting in the results of many clinical trials not being consistent with the results of animal experiments.

随着基因工程技术的不断发展和成熟，用人类细胞或基因替代或置换动物的内源性同类细胞或基因，以建立更接近人类的生物体系或疾病模型，建立人源化实验动物模型(humanized animal model1)，已经为临床上新的治疗方法或手段提供了重要工具。其中基因人源化动物模型，即利用基因遗传操作技术，用人类正常或突变基因替换动物同类基因，可在动物体内建立更接近人类的正常或突变，体系的塞因大源化呦物模型。因大源化动物不但本身具有重要应用价值，如通过基因人源化还可改进和提升细胞人源化小鼠CN107815468 A2/28页模型，更重要的是，由于人类基因片段的存在，动物体内可表达或部分表达含有人类功能的蛋白质，从而大大减少人和动物的临床实验差异，为在动物水平进行药物筛选提供了可能。With the continuous development and maturity of genetic engineering technology, human cells or genes are used to replace or displace the endogenous cells or genes of animals to establish a biological system or disease model closer to humans, and to establish a humanized experimental animal model (humanized animal model1), which has provided an important tool for new clinical treatment methods or means. Among them, the gene humanized animal model, that is, using gene genetic manipulation technology, replaces the same gene of animals with normal or mutant genes of humans, and can establish a normal or mutant system closer to humans in the animal body. Because the humanized animal itself not only has important application value, such as through gene humanization, it can also improve and enhance the cell humanized mouse CN107815468 A2/28 page model, more importantly, due to the presence of human gene fragments, the animal body can express or partially express proteins containing human functions, thereby greatly reducing the clinical experimental differences between humans and animals, and providing the possibility for drug screening at the animal level.

2019-nCoV与SARS冠状病毒一样都以血管紧张素转化酶2(Angiotensin-converting enzyme 2,ACE2)作为感染人类的关键靶点，其中2019-nCoV除感染人类外还可感染多种哺乳动物如猴子、猪、兔子、白鼬、猩猩等，但小鼠和大鼠除外。猴子、猪以及兔子等作为动物模型感染2019-nCoV用作药物筛选而言，其生长周期较长且体型较大难以大批量操作，而相对容易大批量操作的小鼠模型却对2019-nCoV不易感。Like SARS coronavirus, 2019-nCoV uses angiotensin-converting enzyme 2 (ACE2) as a key target for infecting humans. In addition to infecting humans, 2019-nCoV can also infect a variety of mammals such as monkeys, pigs, rabbits, ferrets, and gorillas, but mice and rats are excluded. Monkeys, pigs, and rabbits are used as animal models to infect 2019-nCoV for drug screening, but their growth cycle is long and their body size is large, making it difficult to operate in large quantities. However, the mouse model, which is relatively easy to operate in large quantities, is not susceptible to 2019-nCoV.

发明内容Summary of the invention

由于人源ACE2和鼠源ACE2在基因序列上存在一定差异，有文献报道：病毒感染后，人源ACE2较鼠源ACE2对SARS-CoV的敏感性更强，病理症状更为明显。目前的临床研究急需更好的动物模型。Because there are certain differences in the gene sequences of human ACE2 and mouse ACE2, there are literature reports that after virus infection, human ACE2 is more sensitive to SARS-CoV than mouse ACE2, and the pathological symptoms are more obvious. Current clinical research urgently needs better animal models.

已有hACE2转基因小鼠被证实可感染2019-nCoV并出现肺部组织典型致病特征(https://www.biorxiv.org/content/10.1101/2020.02.07.939389v3#disqus_thread)。但hACE2转基因小鼠为全身性过表达hACE2小鼠模型，并不能模拟ACE2时空组织表达特征，本发明人源化ACE2小鼠模型为原小鼠Ace2位点特异性表达hACE2小鼠模型，可模拟原mACE2表达特征，具有组织特异性，更为严谨的模拟人感染2019-nCoV发病过程。It has been confirmed that hACE2 transgenic mice can be infected with 2019-nCoV and show typical pathogenic characteristics in lung tissue (https://www.biorxiv.org/content/10.1101/2020.02.07.939389v3#disqus_thread). However, hACE2 transgenic mice are systemic hACE2 overexpression mouse models and cannot simulate the spatiotemporal tissue expression characteristics of ACE2. The humanized ACE2 mouse model of the present invention is an original mouse Ace2 site-specific hACE2 mouse model, which can simulate the original mACE2 expression characteristics, has tissue specificity, and more rigorously simulates the pathogenesis of human infection with 2019-nCoV.

所以本公开将通过在小鼠Ace2位点表达人源化ACE2的方法大批量制备2019-nCoV易感小鼠人源化ACE2动物模型来对2019-nCoV进行药物筛选和疾病研究等应用。Therefore, the present invention will prepare a 2019-nCoV susceptible mouse humanized ACE2 animal model in large quantities by expressing humanized ACE2 at the mouse Ace2 site for drug screening and disease research on 2019-nCoV.

本发明的目的之一在于提供一种特异性的核酸片段序列。One of the purposes of the present invention is to provide a specific nucleic acid fragment sequence.

本发明另一个目的在于提供一种特异性的靶向ACE2基因的sgRNA序列。Another object of the present invention is to provide a sgRNA sequence that specifically targets the ACE2 gene.

本发明另一个目的在于提供一种特异性的靶向ACE2基因的靶向载体。Another object of the present invention is to provide a targeting vector that specifically targets the ACE2 gene.

本发明另一个目的在于提供一种ACE2基因人源化细胞株。Another object of the present invention is to provide an ACE2 gene humanized cell line.

本发明另一个目的在于提供一种ACE2基因人源化细胞株的构建方法。Another object of the present invention is to provide a method for constructing an ACE2 gene humanized cell line.

本发明另一个目的在于提供一种基因人源化动物的构建方法。Another object of the present invention is to provide a method for constructing a genetically humanized animal.

本发明还有一个目的在于提供一种人源化ACE2基因改造小鼠胚胎干细胞模型。Another object of the present invention is to provide a humanized ACE2 gene-modified mouse embryonic stem cell model.

一方面，本发明提供了一种核酸片段序列，所述核酸片段序列如SEQ ID NO:1或SEQ ID NO:2所示。本公开的核酸片段序列的靶向特异性和切割目的基因的效率都较好。In one aspect, the present invention provides a nucleic acid fragment sequence, and the nucleic acid fragment sequence is shown in SEQ ID NO: 1 or SEQ ID NO: 2. The nucleic acid fragment sequence disclosed in the present invention has good targeting specificity and efficiency in cutting target genes.

一些实施例中，合成序列如SEQ ID NO：1所示的核酸的上下游引物序列分别如SEQID NO:6和SEQ ID NO:7所示。一些实施例中，合成序列如SEQ ID NO：2所示的核酸的上下游引物序列分别如SEQ ID NO:4和SEQ ID NO:5所示。In some embodiments, the upstream and downstream primer sequences of the nucleic acid synthesized as SEQ ID NO: 1 are shown as SEQ ID NO: 6 and SEQ ID NO: 7, respectively. In some embodiments, the upstream and downstream primer sequences of the nucleic acid synthesized as SEQ ID NO: 2 are shown as SEQ ID NO: 4 and SEQ ID NO: 5, respectively.

一些实施例中，所述核酸为sgRNA。In some embodiments, the nucleic acid is sgRNA.

一些实施例中，所述sgRNA片段序列靶向的基因为动物ACE2基因，用于定点切割ACE2基因。In some embodiments, the gene targeted by the sgRNA fragment sequence is the animal ACE2 gene, which is used for site-specific cutting of the ACE2 gene.

一些实施例中，所述sgRNA片段序列用于构建人源化动物模型。In some embodiments, the sgRNA fragment sequence is used to construct a humanized animal model.

一些实施例中，所述sgRNA片段序列的切割点位于所述的动物ACE2基因1号外显子。In some embodiments, the cutting point of the sgRNA fragment sequence is located in exon 1 of the animal ACE2 gene.

一些实施例中，所述切割点靶点处于所述的动物ACE2基因5’UTR后140bp碱基范围内。In some embodiments, the cleavage point target is within the 140bp base range after the 5'UTR of the animal ACE2 gene.

一些实施例中，所述动物为哺乳动物。In some embodiments, the animal is a mammal.

一些实施例中，所述哺乳动物为啮齿类动物。In some embodiments, the mammal is a rodent.

一些实施例中，所述啮齿类动物为鼠。In some embodiments, the rodent is a mouse.

另一方面，本发明提供了一种靶向载体，包含5’同源臂、人源ACE2基因片段和3’同源臂；所述的靶向载体靶向动物ACE2基因，所述动物ACE2基因含有与权利要求1或2所述的sgRNA片段序列相对应的切割点。所述5’同源臂为与在所述目标基因组基因座处的5’靶序列同源的5’同源臂。On the other hand, the present invention provides a targeting vector comprising a 5' homology arm, a human ACE2 gene fragment and a 3' homology arm; the targeting vector targets an animal ACE2 gene, and the animal ACE2 gene contains a cleavage point corresponding to the sgRNA fragment sequence of claim 1 or 2. The 5' homology arm is a 5' homology arm homologous to the 5' target sequence at the target genomic locus.

一些实施例中，所述靶向载体用于将人源ACE2基因的CDS序列插入到动物基因的启动子和5’UTR区域序列后，利用动物目的基因启动子启动人目的基因表达。In some embodiments, the targeting vector is used to insert the CDS sequence of the human ACE2 gene into the promoter and 5'UTR region sequence of the animal gene, and then use the animal target gene promoter to initiate the expression of the human target gene.

一些实施例中，所述打靶载体还包含3’同源臂，所述3’同源臂为与在所述目标基因组基因座处的3’靶序列同源的3’同源臂。In some embodiments, the targeting vector further comprises a 3' homology arm, wherein the 3' homology arm is a 3' homology arm homologous to the 3' target sequence at the target genomic locus.

一些实施例中，所述5’同源臂序列如SEQ ID NO:13所示，3’同源臂的序列如SEQID NO:14所示。In some embodiments, the 5' homology arm sequence is shown as SEQ ID NO:13, and the 3' homology arm sequence is shown as SEQ ID NO:14.

一些实施例中，所述ACE2基因的CDS序列如SEQ ID NO:10所示。In some embodiments, the CDS sequence of the ACE2 gene is shown in SEQ ID NO:10.

一些实施例中，所述靶向载体还包含如SEQ ID NO:12所示的SV40 polyA序列，所述SV40 polyA序列位于所述的CDS序列之后。In some embodiments, the targeting vector further comprises an SV40 polyA sequence as shown in SEQ ID NO: 12, and the SV40 polyA sequence is located after the CDS sequence.

一些实施例中，所述靶向载体还包含筛选标志PGK-Puro。In some embodiments, the targeting vector further comprises a selection marker PGK-Puro.

一些实施例中，所述动物为鼠。In some embodiments, the animal is a mouse.

一些实施例中，所述靶向载体还包含pX330载体。In some embodiments, the targeting vector further comprises a pX330 vector.

另一方面，本发明提供了一种编码所述的sgRNA片段序列的DNA分子。On the other hand, the present invention provides a DNA molecule encoding the sgRNA fragment sequence.

另一方面，本发明提供了一种细胞，包含所述的sgRNA片段序列或所述的靶向载体或所述的DNA分子。On the other hand, the present invention provides a cell comprising the sgRNA fragment sequence or the targeting vector or the DNA molecule.

另一方面，本发明提供了所述的sgRNA片段序列在制备基因人源化细胞株或人源化动物模型中的应用。On the other hand, the present invention provides the use of the sgRNA fragment sequence in preparing a genetic humanized cell line or a humanized animal model.

一些实施例中，所述基因为ACE2。In some embodiments, the gene is ACE2.

一些实施例中，所述动物为啮齿类动物。In some embodiments, the animal is a rodent.

另一方面，本发明提供了一种人源化动物细胞株的构建方法，所述方法中使用了所述的sgRNA片段序列。On the other hand, the present invention provides a method for constructing a humanized animal cell line, wherein the sgRNA fragment sequence is used.

一些实施例中，所述方法包括将人源目的基因导入动物细胞中，使得目的基因在动物细胞内原位表达人源全长目的基因的CDS。In some embodiments, the method includes introducing a human target gene into an animal cell, so that the target gene expresses the CDS of the human full-length target gene in situ in the animal cell.

一些实施例中，所述目的基因为ACE2。In some embodiments, the target gene is ACE2.

一些实施例中，所述细胞为胚胎干细胞。In some embodiments, the cells are embryonic stem cells.

一些实施例中，所述方法使用了基因编辑技术进行人源化动物细胞模型的构建。In some embodiments, the method uses gene editing technology to construct a humanized animal cell model.

一些实施例中，所述基因编辑技术包括基于胚胎干细胞的DNA同源重组技术、CRISPR/Cas9技术。In some embodiments, the gene editing technology includes DNA homologous recombination technology based on embryonic stem cells and CRISPR/Cas9 technology.

一些实施例中，所述基因编辑技术为基于CRISPR/Cas9的基因编辑技术进行ACE2人源化动物的构建。In some embodiments, the gene editing technology is based on CRISPR/Cas9 gene editing technology to construct ACE2 humanized animals.

一些实施例中，所述构建方法包括以下步骤：(1)构建权利要求3所述的靶向载体；(2)将构建的靶向载体和连接有sgRNA的载体导入动物来源的胚胎干细胞中；(3)将步骤(2)中的胚胎干细胞培养成克隆，即得。In some embodiments, the construction method comprises the following steps: (1) constructing the targeting vector of claim 3; (2) introducing the constructed targeting vector and the vector connected with sgRNA into embryonic stem cells of animal origin; (3) culturing the embryonic stem cells in step (2) into clones.

另一方面，本发明提供了所述的方法制备得到的ACE2基因人源化细胞株。On the other hand, the present invention provides an ACE2 gene humanized cell line prepared by the method described.

一些实施例中，所述细胞株为人源化小鼠胚胎干细胞，经特定分化或制备成小鼠模型后可表达人的全长ACE2蛋白，同时内源ACE2蛋白表达降低或消失。In some embodiments, the cell line is a humanized mouse embryonic stem cell, which can express the full-length human ACE2 protein after specific differentiation or preparation into a mouse model, while the expression of endogenous ACE2 protein is reduced or disappears.

另一方面，本发明提供了一种基因人源化动物模型的构建方法，包括将所述的动物细胞注入动物体内。On the other hand, the present invention provides a method for constructing a genetically humanized animal model, comprising injecting the animal cells into an animal.

一些实施例中，所述基因人源化动物模型的构建方法包括将所述的动物细胞通过四倍体补偿或囊胚注射方法形成动物个体。In some embodiments, the method for constructing the genetically humanized animal model includes forming animal individuals by tetraploid compensation or blastocyst injection of the animal cells.

另一方面，本发明提供了一种人源化鼠或其子孙的组织、体液、细胞、以及它们的破碎物或提取物，所述的人源化鼠是利用基因人源化动物模型的构建方法得到的。On the other hand, the present invention provides a humanized mouse or its offspring's tissue, body fluid, cell, and their fragments or extracts, wherein the humanized mouse is obtained by using the method for constructing a genetic humanized animal model.

利用本发明的方法成功得到了人源化ACE2基因改造小鼠。Humanized ACE2 gene-modified mice were successfully obtained using the method of the present invention.

在一些实施方式中，所述人源化ACE2基因改造小鼠在RNA水平上表达人ACE2并具有组织特异性。In some embodiments, the humanized ACE2 genetically modified mouse expresses human ACE2 at the RNA level and has tissue specificity.

在一些实施方式中，所述人源化ACE2基因改造小鼠在蛋白水平上表达人ACE2并具有组织特异性。In some embodiments, the humanized ACE2 genetically modified mouse expresses human ACE2 at the protein level and has tissue specificity.

在一些实施方式中，本发明研究显示利用本发明的方法获得的人源化ACE2基因改造小鼠对SARS-CoV-2易感。从而成功地获得了对2019-nCoV易感的小鼠，从而能够很好地为2019-nCoV的药物筛选或其他研究提供很好的动物模型。具有很好的现实意义。In some embodiments, the present invention shows that the humanized ACE2 gene-modified mice obtained by the method of the present invention are susceptible to SARS-CoV-2. Thus, mice susceptible to 2019-nCoV are successfully obtained, which can provide a good animal model for drug screening or other research on 2019-nCoV. It has great practical significance.

还有一方面，本发明提供了来源于所述的构建方法得到的人源化动物模型或其后代在制造人类抗体，或者作为药理学、免疫学、微生物学和医学研究的模型系统中的应用，或在生产和利用动物实验疾病模型，用于病原学研究和/或用于开发新的诊断策略和/或治疗策略中的应用，或在筛选、验证、评价或研究ACE2基因功能、ACE2抗体、针对ACE2靶点的药物、药效研究方面的用途。In still another aspect, the present invention provides the use of a humanized animal model or its progeny obtained by the construction method in the manufacture of human antibodies, or as a model system for pharmacology, immunology, microbiology and medical research, or in the production and use of animal experimental disease models for etiology research and/or for the development of new diagnostic strategies and/or treatment strategies, or in the screening, verification, evaluation or research of ACE2 gene function, ACE2 antibodies, drugs targeting ACE2 targets, and efficacy research.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为pX330质粒图谱。Figure 1 is a plasmid map of pX330.

图2为pX330-sgRNA1构建测序结果图。Figure 2 shows the sequencing results of pX330-sgRNA1 construction.

图3为pX330-sgRNA2构建测序结果图。Figure 3 shows the sequencing results of pX330-sgRNA2 construction.

图4为pX330-sgRNA1-3切割效率验证结果。FIG4 is the result of verifying the cutting efficiency of pX330-sgRNA1-3.

图5为pX330-sgRNA3构建测序结果图。Figure 5 is a diagram showing the sequencing results of pX330-sgRNA3 construction.

图6为人源化ACE2打靶策略示意图。Figure 6 is a schematic diagram of the humanized ACE2 targeting strategy.

图7为人源化ACE2打靶载体测序结果信息。Figure 7 shows the sequencing results of the humanized ACE2 targeting vector.

图8为对比例1中的三片段连接的PCR鉴定图。FIG8 is a PCR identification diagram of the three-fragment connection in Comparative Example 1.

图9为人源化ACE2小鼠胚胎干细胞基因型的PCR鉴定结果图。FIG. 9 is a diagram showing the PCR identification results of the genotype of humanized ACE2 mouse embryonic stem cells.

图10为删除PGK-Puro后人源化ACE2小鼠胚胎干细胞基因型的PCR鉴定结果图。Figure 10 is a graph showing the PCR identification results of the humanized ACE2 mouse embryonic stem cell genotype after deleting PGK-Puro.

图11为人源化ACE2基因示意图。Figure 11 is a schematic diagram of the humanized ACE2 gene.

图12为本发明获得的人源化ACE2基因改造小鼠。FIG. 12 shows the humanized ACE2 gene-modified mice obtained by the present invention.

图13显示人源化ACE2基因改造小鼠在RNA水平上表达人ACE2并具有组织特异性。Figure 13 shows that humanized ACE2 gene-modified mice express human ACE2 at the RNA level and have tissue specificity.

图14的组织免疫荧光结果显示人ACE2在蛋白在人源化ACE2基因改造小鼠组织上特异表达。The tissue immunofluorescence results in Figure 14 show that human ACE2 protein is specifically expressed in the tissues of humanized ACE2 gene-modified mice.

图15显示本发明人源化ACE2基因改造小鼠对SARS-CoV-2易感。Figure 15 shows that the humanized ACE2 gene-modified mice of the present invention are susceptible to SARS-CoV-2.

具体实施方式DETAILED DESCRIPTION

以下通过具体的实施例进一步说明本发明的技术方案，具体实施例不代表对本发明保护范围的限制。其他人根据本发明理念所做出的一些非本质的修改和调整仍属于本发明的保护范围。The technical solution of the present invention is further described below by specific embodiments, which do not limit the protection scope of the present invention. Some non-essential modifications and adjustments made by others based on the concept of the present invention still fall within the protection scope of the present invention.

除非另有定义,本文中所使用的所有技术与科学术语的定义与本领域技术人员所熟悉的定义相同。此外,任何与所记载内容相似或均等的方法和材料皆可应用于本发明方法中,具体实施方式中描述了优选的方法和材料。Unless otherwise defined, the definitions of all technical and scientific terms used herein are the same as those familiar to those skilled in the art. In addition, any methods and materials similar or equivalent to those described herein can be applied to the methods of the present invention, and preferred methods and materials are described in the specific embodiments.

文中所用的“一”和“一种”指语法上不定冠词的释义,表示“一个”、“一种”或“多个”、“多种"(即“至少一个”、“至少一种”)。例如“一要素”指一种或种要素。As used herein, "a" and "an" refer to the grammatical indefinite article, meaning "one", "a kind" or "a plurality", "a variety" (i.e., "at least one", "at least one"). For example, "an element" refers to one or a kind of element.

“CDS”是编码序列(Coding sequence)的缩写，“编码序列”是指用于编码基因的多肽产物的任何核苷酸序列。相反，术语“非编码序列”是指不编码基因的多肽产物的任何核苷酸序列。"CDS" is the abbreviation of coding sequence, which refers to any nucleotide sequence that encodes the polypeptide product of a gene. In contrast, the term "non-coding sequence" refers to any nucleotide sequence that does not encode the polypeptide product of a gene.

术语“片段”将被理解为指长度比参考核酸短并且在共同部分包含与参考核酸相同的核苷酸序列的核苷酸序列。如果合适的话，根据本发明的这样的核酸片段可以被包含在更大的多核苷酸中，该片段是该更大的多核苷酸的组成成分。这样的片段包括长度在本发明的核酸的至少6、8、9、10、12、15、18、20、21、22、23、24、25、30、39、40、42、45、48、50、51、54、57、60、63、66、70、75、78、80、90、100、105、120、135、150、200、300、500、720、900、1000或1500个连续核苷酸范围内的寡核苷酸，或可选地由这样的寡核苷酸组成。The term "fragment" will be understood to refer to a nucleotide sequence that is shorter than a reference nucleic acid and that comprises the same nucleotide sequence as the reference nucleic acid in the common portion. If appropriate, such nucleic acid fragment according to the present invention can be included in a larger polynucleotide, and this fragment is a constituent of this larger polynucleotide. Such a fragment comprises an oligonucleotide having a length of at least 6,8,9,10,12,15,18,20,21,22,23,24,25,30,39,40,42,45,48,50,51,54,57,60,63,66,70,75,78,80,90,100,105,120,135,150,200,300,500,720,900,1000 or 1500 continuous nucleotides of nucleic acid of the present invention, or alternatively consists of such an oligonucleotide.

在本说明中,除非上下文另有要求，词语"包含"、"包括"将被理解为是指包括所述的步骤或要素或步骤和要素的集合,但并不排除任何其它的步骤或要素或步骤和要素的集合；即开放式限定。In this specification, unless the context requires otherwise, the words "comprise" and "include" will be understood to mean including the steps or elements or a collection of steps and elements, but not excluding any other steps or elements or a collection of steps and elements; that is, an open limitation.

“相对应”是指:(a)一多核苷酸具有与参考核苷酸序列的全部或者部分基本上相同或互补的核苷酸序列,或者一多核苷酸编码与肽或者蛋白质中的氨基酸序列完全相同的氨基酸序列；或(b)一肽或多肽具有一种氨基酸序列,该氨基酸序列基本上与参考肽或蛋白质中的氨基酸序列相同。"Corresponding to" means: (a) a polynucleotide has a nucleotide sequence that is substantially identical or complementary to all or part of a reference nucleotide sequence, or a polynucleotide encodes an amino acid sequence that is completely identical to the amino acid sequence in a peptide or protein; or (b) a peptide or polypeptide has an amino acid sequence that is substantially identical to the amino acid sequence in a reference peptide or protein.

术语“下游”是指位于参考核苷酸序列3’端的核苷酸序列。特别地，下游核苷酸序列通常涉及转录起始点之后的序列。例如，基因的翻译起始密码子位于转录起始位点的下游。The term "downstream" refers to a nucleotide sequence located at the 3' end of a reference nucleotide sequence. In particular, a downstream nucleotide sequence generally refers to a sequence after the transcription start point. For example, the translation start codon of a gene is located downstream of the transcription start site.

术语“上游”是指位于参考核苷酸序列5’端的核苷酸序列。特别地，上游核苷酸通常涉及位于编码序列或转录起始点的5’侧的序列。例如，大部分的启动子位于转录起始位点的上游。The term "upstream" refers to a nucleotide sequence located at the 5' end of a reference nucleotide sequence. In particular, upstream nucleotides generally refer to sequences located at the 5' side of a coding sequence or transcription start site. For example, most promoters are located upstream of the transcription start site.

“启动子”是指能够控制编码序列或功能性RNA的表达的DNA序列。一般而言，编码序列位于启动子序列的3’端。启动子可以整体源自天然基因，或由源自天然发现的不同启动子的不同元件组成，或甚至包括合成的DNA片段。本领域技术人员应该理解的是，不同的启动子可以指导基因在不同的组织或细胞类型中，或在发育的不同阶段，或应答于不同的环境或生理条件进行表达。导致基因在大多数细胞类型中在大多数的时间被表达的启动子通常称为“组成型启动子”。导致基因在特定的细胞类型中被表达的启动子通常称为“细胞特异性启动子”或“组织特异性启动子”。导致基因在特定的发育或细胞分化阶段被表达的启动子通常称为“发育特异性启动子”或“细胞分化特异性启动子”。在将细胞暴露于诱导启动子的药剂、生物分子、化学品、配体、光或类似物，或用这些物质对细胞进行处理之后，被诱导并导致基因被表达的启动子通常被称为“诱导型启动子”或“调控型启动子”。还应该认识的是，因为在大多数情况下，调控序列的准确界限还没有完全限定，所以不同长度的DNA片段可能具有相同的启动子活性。"Promoter" refers to a DNA sequence that can control the expression of a coding sequence or functional RNA. Generally speaking, the coding sequence is located at the 3' end of the promoter sequence. The promoter can be derived from a natural gene as a whole, or composed of different elements derived from different promoters found in nature, or even include synthetic DNA fragments. It should be understood by those skilled in the art that different promoters can direct genes to be expressed in different tissues or cell types, or at different stages of development, or in response to different environmental or physiological conditions. Promoters that cause genes to be expressed in most cell types at most times are generally referred to as "constitutive promoters". Promoters that cause genes to be expressed in specific cell types are generally referred to as "cell-specific promoters" or "tissue-specific promoters". Promoters that cause genes to be expressed at specific developmental or cell differentiation stages are generally referred to as "developmental specific promoters" or "cell differentiation specific promoters". After cells are exposed to agents, biomolecules, chemicals, ligands, light or the like that induce promoters, or cells are treated with these substances, promoters that are induced and cause genes to be expressed are generally referred to as "inducible promoters" or "regulatory promoters". It should also be recognized that because in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of different lengths may have identical promoter activity.

术语“5’UTR”或“5'非编码序列”或“5'非翻译区(UTR)”是指位于编码序列上游(5')的DNA序列。The term "5'UTR" or "5' non-coding sequence" or "5' untranslated region (UTR)" refers to the DNA sequence located upstream (5') of a coding sequence.

术语“限制性核酸内切酶”和“限制性酶”是指结合并切割双链DNA内的特定核苷酸序列的酶。The terms "restriction endonuclease" and "restriction enzyme" refer to enzymes that bind to and cleave specific nucleotide sequences within double-stranded DNA.

术语“载体”意指核酸分子，其能转移它已连接的核酸分子。一种载体类型是“质粒”，其指环状双链DNA环，其他DNA区段可连接进去。另一类载体是病毒载体，其中其他DNA区段可连接到病毒基因组内。某些载体能在引入了这些载体的宿主细胞中自我复制(如具有细菌的复制起始点的细菌载体以及游离型哺乳动物载体)。其他载体(如非游离型哺乳动物载体)在引入到宿主细胞中后能够整合到宿主细胞的基因组中，因此随宿主基因组复制。此外，某些载体能够指导表达它们操作地连接的基因。The term "vector" means a nucleic acid molecule that is capable of transferring a nucleic acid molecule to which it has been linked. One type of vector is a "plasmid," which refers to a circular double-stranded DNA loop into which additional DNA segments can be linked. Another type of vector is a viral vector, in which additional DNA segments can be linked into the viral genome. Certain vectors are capable of self-replication in a host cell into which they are introduced (e.g., bacterial vectors with bacterial origins of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are capable of integrating into the genome of a host cell upon introduction into the host cell, thereby replicating with the host genome. In addition, certain vectors are capable of directing the expression of genes to which they are operatively linked.

一些载体本发明称为“重组表达载体”(或简称为“表达载体”)是指被设计成使得插入的核酸序列在转化入宿主后能够表达的载体、质粒或媒介。总的来说，重组DNA技术所使用的表达载体经常是质粒形式。本说明书的“质粒”和“载体”交换使用，因为质粒是载体最普遍使用的形式。但是，本发明意包括这种表达载体的其他形式，如病毒载体(如复制缺陷型反转录病毒、腺病毒和腺伴随病毒)，其充当同等作用。Some vectors referred to herein as "recombinant expression vectors" (or simply "expression vectors") refer to vectors, plasmids or vectors designed to allow the inserted nucleic acid sequence to be expressed after transformation into a host. In general, expression vectors used in recombinant DNA technology are often in the form of plasmids. "Plasmid" and "vector" are used interchangeably in this specification because plasmids are the most commonly used form of vectors. However, the present invention is intended to include other forms of such expression vectors, such as viral vectors (such as replication-defective retroviruses, adenoviruses and adeno-associated viruses), which serve the same role.

术语“质粒”是指染色体外元件，它们常常携带不是作为细胞的中心代谢的一部分的基因，并且常常是环状双链DNA分子的形式。这样的元件可以是来自任何来源的自主复制序列、基因组整合序列、噬菌体或核苷酸序列，线性、环状或超螺旋的，单链或双链DNA或RNA，其中许多核苷酸序列已经被连接入或重组入独特的结构中，该结构能够将针对所选的基因产物的启动子片段和DNA序列以及合适的3’端非翻译序列导入细胞。The term "plasmid" refers to an extrachromosomal element that often carries genes that are not part of the central metabolism of the cell and is often in the form of a circular double-stranded DNA molecule. Such an element can be an autonomously replicating sequence, a genome integrating sequence, a phage, or a nucleotide sequence from any source, linear, circular or supercoiled, single-stranded or double-stranded DNA or RNA, in which many nucleotide sequences have been linked or recombined into a unique structure that is capable of introducing a promoter fragment and DNA sequence for a selected gene product and appropriate 3' non-translated sequences into the cell.

“靶向载体”或“打靶载体”是含有邻接所需遗传修饰的、与内源染色体核酸序列“同源的”序列的DNA构建体。所述侧翼同源序列(称为“同源臂”)借助所述同源臂和相应的内源序列之间存在的同源性，指导所述打靶载体定位于所述基因组中的特定染色体位置，并且通过称为“同源重组”的过程引入所述所需的遗传修饰。“靶向载体”和“打靶载体”某些时候可以通用。采用靶向载体以将插入核酸引入大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、小鼠或仓鼠核酸的靶基因座中。所述靶向载体包含所述插入核酸且进一步包含5’同源臂和3’同源臂，其侧接所述插入核酸。侧接所述插入核酸的同源臂对应于在大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、小鼠或仓鼠核酸的靶基因座内的区域。为了便于提及，在靶基因组基因座内的相应同源基因组区在本文中称为“靶位点”。例如，靶向载体可包含由与第一靶位点和第二靶位点互补的第一同源臂和第二同源臂侧接的第一插入核酸。因而，所述靶向载体由此有助于经由在细胞的基因组内的同源臂和互补靶位点之间发生的同源重组事件将插入核酸整合到大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮动物、小鼠或仓鼠核酸的靶基因座中。"Targeting vector" or "targeting vector" is a DNA construct containing a sequence "homologous" to an endogenous chromosomal nucleic acid sequence adjacent to the desired genetic modification. The flanking homologous sequences (called "homology arms") guide the targeting vector to locate at a specific chromosomal position in the genome by means of the homology between the homology arms and the corresponding endogenous sequences, and introduce the desired genetic modification by a process called "homologous recombination." "Targeting vector" and "targeting vector" can be used interchangeably at some times. A targeting vector is used to introduce an insert nucleic acid into a target locus of a rat, eukaryotic, non-rat eukaryotic, mammal, non-human mammal, human, rodent, non-rat rodent, mouse or hamster nucleic acid. The targeting vector comprises the insert nucleic acid and further comprises a 5' homology arm and a 3' homology arm, which flanks the insert nucleic acid. The homology arms flanking the insert nucleic acid correspond to regions within the target locus of a rat, eukaryotic, non-rat eukaryotic, mammal, non-human mammal, human, rodent, non-rat rodent, mouse or hamster nucleic acid. For ease of reference, the corresponding homologous genomic region within the target genomic locus is referred to herein as a "target site". For example, a targeting vector may comprise a first insert nucleic acid flanked by a first homology arm complementary to the first target site and a second homology arm complementary to the first target site. Thus, the targeting vector thereby facilitates integration of the insert nucleic acid into the target locus of a rat, eukaryotic, non-rat eukaryotic, mammalian, non-human mammalian, human, rodent, non-rat rodent, mouse or hamster nucleic acid via a homologous recombination event occurring between the homology arms and the complementary target sites within the genome of the cell.

在一个实施方案中，所述大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、小鼠或仓鼠核酸的靶基因座包含与5’同源臂互补的第一核酸序列和与3’同源臂互补的第二核酸序列。In one embodiment, the target locus of the rat, eukaryotic, non-rat eukaryotic, mammalian, non-human mammalian, human, rodent, non-rat rodent, mouse or hamster nucleic acid comprises a first nucleic acid sequence that is complementary to the 5' homology arm and a second nucleic acid sequence that is complementary to the 3' homology arm.

载体可以通过本领域已知的方法导入期望的宿主细胞，例如转染、电穿孔、微注射、转导、细胞融合、DEAE葡聚糖、磷酸钙沉淀、脂转染(溶酶体融合)、使用基因枪或DNA载体转运蛋白(参见例如Wu et al.,1992,J.Biol.Chem.267:963-967；Wu和Wu,1988,J.Biol.Chem.263:14621-14624；和Hartmut etal.,1990年3月15日提交的加拿大专利申请2,012,311)。The vector can be introduced into the desired host cell by methods known in the art, such as transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621-14624; and Hartmut et al., Canadian Patent Application No. 2,012,311 filed March 15, 1990).

术语“转染”是指细胞摄取外源或异源RNA或DNA。当外源或异源RNA或DNA已被引入细胞内时，该细胞便被这样的RNA或DNA“转染”。当转染的RNA或DNA影响表型变化时，该细胞便被外源或异源RNA或DNA“转化”。转化RNA或DNA可以被整合入(共价连接入)构成细胞基因组的染色体DNA。The term "transfection" refers to the uptake of exogenous or heterologous RNA or DNA by a cell. When exogenous or heterologous RNA or DNA has been introduced into a cell, the cell is "transfected" by such RNA or DNA. When the transfected RNA or DNA affects a phenotypic change, the cell is "transformed" by the exogenous or heterologous RNA or DNA. The transforming RNA or DNA can be integrated into (covalently linked into) the chromosomal DNA that makes up the genome of the cell.

术语“同源性”或“同源的”指序列例如核苷酸或氨基酸序列的两条序列经最佳比对和比较具有至少约75％的核苷酸或氨基酸、至少约80％的核苷酸或氨基酸、至少约90-95％的核苷酸或氨基酸，例如超过97％的核苷酸或氨基酸是相同的。本领域技术人员将理解，对于最佳基因靶向而言，靶向构建体应含有与内源性DNA序列同源的臂(即“同源臂”)；因此，能够在靶向构建体与被靶向的内源性序列之间发生同源重组。The term "homology" or "homologous" refers to two sequences, such as nucleotide or amino acid sequences, that are optimally aligned and compared with at least about 75% of the nucleotides or amino acids, at least about 80% of the nucleotides or amino acids, at least about 90-95% of the nucleotides or amino acids, such as more than 97% of the nucleotides or amino acids being identical. Those skilled in the art will appreciate that for optimal gene targeting, the targeting construct should contain arms that are homologous to endogenous DNA sequences (i.e., "homology arms"); therefore, homologous recombination can occur between the targeting construct and the targeted endogenous sequence.

如本文中所用，在两个区彼此共有足够水平的序列同一性时同源臂和靶位点(即同源基因组区)彼此互补，从而充当用于同源重组反应的底物。“同源性”是指DNA序列与相应或“互补”序列相同或共有序列同一性。在给定靶位点和在靶向载体上发现的相应同源臂之间的序列同一性可为允许同源重组发生的任何程度的序列同一性。例如，靶向载体的同源臂(或其片段)与靶位点(或其片段)共有的序列同一性的量可为至少51％、53％、57％、60％、65％、70％、75％、80％、83％、85％、87％、89％、91％、93％、95％、97％、98％、99％或100％序列同一性，因此所述序列经历同源重组。此外，在同源臂和互补靶位点之间同源的互补区可具有足以促进在裂解的识别位点处同源重组的任何长度。因此同源臂与在细胞的基因组内的相应靶位点具有足以进行同源重组的同源性。为了便于提及，同源臂在本文中作为5’同源臂和3’同源臂提到。该术语涉及在靶向载体中同源臂与插入核酸的相对位置。As used herein, homology arms and target sites (i.e., homologous genomic regions) are complementary to each other when two regions have a sufficient level of sequence identity, thereby serving as a substrate for homologous recombination reactions." homology " refers to that a DNA sequence is identical or has a common sequence identity to a corresponding or "complementary" sequence. The sequence identity between a given target site and the corresponding homology arms found on a targeting vector can be any degree of sequence identity that allows homologous recombination to occur. For example, the amount of the sequence identity shared by the homology arms (or its fragment) of the targeting vector and the target site (or its fragment) can be at least 51%, 53%, 57%, 60%, 65%, 70%, 75%, 80%, 83%, 85%, 87%, 89%, 91%, 93%, 95%, 97%, 98%, 99% or 100% sequence identity, so the sequence undergoes homologous recombination. In addition, the complementary region of homology between homology arms and complementary target sites can have any length that is enough to promote homologous recombination at the recognition site of cracking. Therefore, the homology arms have enough homology to carry out homologous recombination with the corresponding target site in the genome of the cell. For ease of reference, homology arms are mentioned in this article as 5' homology arms and 3' homology arms. The term relates to the relative position of homology arms and the inserted nucleic acid in the targeting vector.

一些实施例中，所述靶向载体的同源臂可具有足以促进与相应靶位点的同源重组事件的任何长度，包括例如至少5-10kb、5-15kb、10-20kb、20-30kb、30-40kb、40-50kb、50-60kb、60-70kb、70-80kb、80-90kb、90-100kb、100-110kb、110-120kb、120-130kb、130-140kb、140-150kb、150-160kb、160-170kb、170-180kb、180-190kb、190-200kb长或更长。如下文进一步详细概述，靶向载体可采用更大长度的靶向臂。在一个特定的实施方案中5’同源臂和3’同源臂的总和为至少10kb或5’同源臂和3’同源臂的总和为至少约16kb-约100kb或约30kb-约100kb，在其它实施方案中，所述ACE2的5’同源臂和3’同源臂的总和的尺寸为约10kb-约150kb、约10kb-约100kb、约10kb-约75kb、约20kb-约150kb、约20kb-约100kb、约20kb-约75kb、约30kb-约150kb、约30kb-约100kb、约30kb-约75kb、约40kb-约150kb、约40kb-约100kb、约40kb-约75kb、约50kb-约150kb、约50kb-约100kb、或约50kb-约75kb、约10kb-约30kb、约20kb-约40kb、约40kb-约60kb、约60kb-约80kb、约80kb-约100kb、约100kb-约120kb或约120kb-约150kb。In some embodiments, the homology arms of the targeting vector can have any length sufficient to promote homologous recombination events with corresponding target sites, including, for example, at least 5-10 kb, 5-15 kb, 10-20 kb, 20-30 kb, 30-40 kb, 40-50 kb, 50-60 kb, 60-70 kb, 70-80 kb, 80-90 kb, 90-100 kb, 100-110 kb, 110-120 kb, 120-130 kb, 130-140 kb, 140-150 kb, 150-160 kb, 160-170 kb, 170-180 kb, 180-190 kb, 190-200 kb long or longer. As further detailed below, targeting vectors can employ targeting arms of greater length. In a specific embodiment, the sum of the 5' homology arm and the 3' homology arm is at least 10 kb, or the sum of the 5' homology arm and the 3' homology arm is at least about 16 kb to about 100 kb or about 30 kb to about 100 kb. In other embodiments, the size of the sum of the 5' homology arm and the 3' homology arm of ACE2 is about 10 kb to about 150 kb, about 10 kb to about 100 kb, about 10 kb to about 75 kb, about 20 kb to about 150 kb, about 20 kb to about 100 kb, about 20 kb to about 75 kb, about 30 kb to about 150 kb, about 20 kb to about 100 kb, about 20 kb to about 75 kb, about 30 kb to about 15 kb, about 10 kb to about 30 kb, about 20 kb to about 40 kb, about 40 kb to about 60 kb, about 60 kb to about 80 kb, about 80 kb to about 100 kb, about 100 kb to about 120 kb, or about 120 kb to about 150 kb.

本文中的某些实施例涉及人源化基因编辑哺乳动物，所述人源化基因编辑哺乳动物的基因组包括编码人的全长ACE2蛋白的多聚核糖核酸。例如，所述多聚核糖核酸可操作地连接至启动子多聚核糖核酸。在一些实施例中，人源化基因编辑哺乳动物不表达全部的或部分的编码人源化基因编辑哺乳动物的内源性ACE2蛋白的多聚核糖核酸，并且所述编码人ACE2蛋白的多聚核糖核酸包括人ACE2蛋白基因的修饰。Certain embodiments herein relate to humanized gene-edited mammals, the genome of which includes a polyribonucleic acid encoding a full-length ACE2 protein of a human. For example, the polyribonucleic acid is operably linked to a promoter polyribonucleic acid. In some embodiments, the humanized gene-edited mammal does not express all or part of the polyribonucleic acid encoding the endogenous ACE2 protein of the humanized gene-edited mammal, and the polyribonucleic acid encoding the human ACE2 protein includes a modification of the human ACE2 protein gene.

一些实施例中，所述细胞为多潜能细胞、非多潜能细胞、哺乳动物细胞、人类细胞、非人类哺乳动物细胞、啮齿动物细胞、小鼠细胞、仓鼠细胞、非人类多潜能细胞、人类多潜能细胞、啮齿动物多潜能细胞或成纤维细胞或肺细胞。In some embodiments, the cell is a pluripotent cell, a non-pluripotent cell, a mammalian cell, a human cell, a non-human mammalian cell, a rodent cell, a mouse cell, a hamster cell, a non-human pluripotent cell, a human pluripotent cell, a rodent pluripotent cell, or a fibroblast or a lung cell.

在一些上述方法中，所述细胞为原代细胞或永生化细胞。在一些上述方法中,所述啮齿动物多潜能细胞为小鼠或大鼠胚胎干(ES)细胞。In some of the above methods, the cell is a primary cell or an immortalized cell. In some of the above methods, the rodent pluripotent cell is a mouse or rat embryonic stem (ES) cell.

在一些上述方法中，所述动物细胞或所述人类细胞为原代细胞或永生化细胞。In some of the above methods, the animal cell or the human cell is a primary cell or an immortalized cell.

在一些上述方法中，所述动物细胞或所述人类细胞为多潜能细胞。在一些上述方法中，所述动物多潜能细胞为小鼠胚胎干(ES)细胞。在一些上述方法中，所述人类多潜能细胞为人类胚胎干(ES)细胞、人类成人干细胞、发育受限的人类祖细胞或人类诱导的多潜能干(iPS)细胞。In some of the above methods, the animal cell or the human cell is a pluripotent cell. In some of the above methods, the animal pluripotent cell is a mouse embryonic stem (ES) cell. In some of the above methods, the human pluripotent cell is a human embryonic stem (ES) cell, a human adult stem cell, a developmentally restricted human progenitor cell or a human induced pluripotent stem (iPS) cell.

一些实施例中，本文中的某些实施例提供人源化基因编辑的细胞，尤其是还提供分离的人类和非人类全能或多潜能干细胞，尤其是小鼠胚胎干细胞，其能够在一种或多种体外连续基因修饰之后维持多潜能性且能够经由种系将所述靶向基因修饰传递到后代。In some embodiments, certain embodiments herein provide humanized gene-edited cells, and in particular, also provide isolated human and non-human totipotent or pluripotent stem cells, in particular mouse embryonic stem cells, which are capable of maintaining pluripotency after one or more continuous gene modifications in vitro and are capable of transmitting the targeted gene modifications to offspring via the germline.

本文使用的术语“胚胎干细胞”或"ES细胞”包括在引入胚胎中后能够促进发育胚胎的任何组织的源自胚胎的全能或多潜能细胞。本文使用的术语“多潜能细胞”包括具有发育成多于一种类型的分化细胞的能力的未分化细胞。术语“非多潜能细胞”包括不是多潜能细胞的细胞。The term "embryonic stem cell" or "ES cell" used herein includes an embryonic omnipotence or multipotency cell that can promote any tissue of the developing embryo after being introduced into the embryo. The term "pluripotent cell" used herein includes an undifferentiated cell with the ability to develop into more than one type of differentiated cell. The term "non-pluripotent cell" includes cells that are not pluripotent cells.

在一些上述方法中，所述靶向基因编辑同时包括在所述目标基因组基因座处缺失内源核酸序列或在所述目标基因组基因座处插入所述核酸。In some of the above methods, the targeted gene editing simultaneously comprises deleting an endogenous nucleic acid sequence at the target genomic locus or inserting the nucleic acid at the target genomic locus.

一些实施例中，所述基因修饰或基因编辑包括对细胞(例如，真核细胞、非大鼠真核细胞、哺乳动物细胞、类细胞、非人类哺乳动物细胞、多潜能细胞、非多潜能细胞、非人类多潜能细胞、人类多潜能细胞、人类ES细胞、人类成人干细胞、发育受限的人类祖细胞、人类iPS细胞、人类细胞、啮齿动物细胞、非大鼠啮齿动物细胞、大鼠细胞、小鼠细胞、仓鼠细胞、成纤维细胞或中国仓鼠卵巢(CHO)细胞)独立地实施的两种或多种修饰。第一修饰可通过电穿孔或本领域已知的任何其他方法实现。随后，采用合适的第二核酸构建体对同一细胞基因组进行第二修饰。所述第三修饰可通过第二电穿孔或本领域已知的任何其它方法实现。在各种实施方案中，在同一细胞的第一基因修饰和第二基因修饰之后,可使用例如连续电穿孔或本领域已知的任何其它合适方法(连续地)实现第三基因修饰、第四基因修饰、第五基因修饰、第六基因修饰等连修饰(一种基因修饰跟着另一基因修饰)。In some embodiments, the gene modification or gene editing includes two or more modifications independently implemented to cells (e.g., eukaryotic cells, non-rat eukaryotic cells, mammalian cells, cell-like cells, non-human mammalian cells, pluripotent cells, non-pluripotent cells, non-human pluripotent cells, human pluripotent cells, human ES cells, human adult stem cells, developmentally restricted human progenitor cells, human iPS cells, human cells, rodent cells, non-rat rodent cells, rat cells, mouse cells, hamster cells, fibroblasts or Chinese hamster ovary (CHO) cells). The first modification can be achieved by electroporation or any other method known in the art. Subsequently, the same cell genome is modified for the second time using a suitable second nucleic acid construct. The third modification can be achieved by a second electroporation or any other method known in the art. In various embodiments, after the first gene modification and the second gene modification of the same cell, the third gene modification, the fourth gene modification, the fifth gene modification, the sixth gene modification, etc. can be achieved (continuously) using, for example, continuous electroporation or any other suitable method known in the art.

一些实施例中，本发明是打靶载体同源重组方法进行基因的编辑，将外源核酸插入内源基因组中。In some embodiments, the present invention uses a targeting vector homologous recombination method to perform gene editing and insert exogenous nucleic acid into an endogenous genome.

一些实施例中，所述插入核酸包含插入同源或直系同源人类核酸序列或用其替换真核、非大鼠真核、哺乳动物、人类或非人类哺乳动物核酸序列。In some embodiments, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence or replacing a eukaryotic, non-rat eukaryotic, mammalian, human or non-human mammalian nucleic acid sequence with it.

一些实施例中，所述给定插入的多核苷酸可来自任何生物体，包括例如啮齿动物、非大鼠啮齿动物、大鼠、小鼠、仓鼠、哺乳动物、非人类哺乳动物、真核生物、非大鼠真核生物、人类、农业动物或家养动物。In some embodiments, the given inserted polynucleotide can be from any organism, including, for example, a rodent, a non-rat rodent, a rat, a mouse, a hamster, a mammal, a non-human mammal, a eukaryote, a non-rat eukaryote, a human, an agricultural animal, or a domesticated animal.

在特定的实施方案中，所述插入核酸可包含来自大鼠的核酸，其可包含基因组DNA的片段、CDNA、调控区或其任何部分或组合。在其它实施方案中，所述插入核酸可包括来自真核生物、非大鼠真核生物、哺乳动物、人类、非人类哺乳动物、啮齿动物、非大鼠啮齿动物、人类、大鼠、小鼠、仓鼠、兔、猪、牛、鹿、绵羊、山羊、小鸡、猫、狗、白融、灵长类动物(例如，狨猴、恒河猴)、家养哺乳动物或农业哺乳动物或任何其它目标生物体的核酸。如在本文中更详细地概述，在各种方法和组合物中采用的插入核酸可引起目标靶基因座的“人源化”。In a specific embodiment, the insertion nucleic acid may include a nucleic acid from a rat, which may include a fragment of a genomic DNA, a cDNA, a regulatory region, or any portion or combination thereof. In other embodiments, the insertion nucleic acid may include a nucleic acid from a eukaryote, a non-rat eukaryote, a mammal, a human, a non-human mammal, a rodent, a non-rat rodent, a human, a rat, a mouse, a hamster, a rabbit, a pig, a cow, a deer, a sheep, a goat, a chicken, a cat, a dog, a white melt, a primate (e.g., a marmoset, a rhesus monkey), a domestic mammal, or an agricultural mammal, or any other target organism. As outlined in more detail herein, the insertion nucleic acid used in various methods and compositions may cause "humanization" of the target locus.

在一个实施方案中，所述基因修饰为添加核酸序列。在一个实施方案中，所述插入核酸包含在编码序列中的基因修饰。在一个实施方案中，所述基因修饰包括编码序列的缺失突变。在一个实施方案中，所述基因修饰包括两种内源编码序列的融合。在一个实施方案中，所述插入核酸包含插入同源或直系同源人类核酸序列或用其替换真核、非大鼠真核、哺乳动物、人类或非人类哺乳动物核酸序列。在一个实施方案中，所述插入核酸包含在包含相应小鼠DNA序列的内源小鼠基因编码区插入同源或直系同源人类核酸序列或用其替换小鼠DNA序列。在一个实施方案中，所述插入核酸包含在包含相应小鼠DNA序列的内源小鼠基因编码区插入同源或直系同源人类核酸序列或用其替换小鼠DNA序列。在一个实施方案中，利用打靶载体，针对小鼠Ace2位点，在紧接mAce2的EXON1 CDS起始ATG位点插入hACE2序列。In one embodiment, the genetic modification is to add a nucleic acid sequence. In one embodiment, the insert nucleic acid comprises a genetic modification in a coding sequence. In one embodiment, the genetic modification comprises a deletion mutation of a coding sequence. In one embodiment, the genetic modification comprises a fusion of two endogenous coding sequences. In one embodiment, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence or replacing a eukaryotic, non-rat eukaryotic, mammalian, human or non-human mammalian nucleic acid sequence with it. In one embodiment, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence into an endogenous mouse gene coding region comprising a corresponding mouse DNA sequence or replacing a mouse DNA sequence with it. In one embodiment, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence into an endogenous mouse gene coding region comprising a corresponding mouse DNA sequence or replacing a mouse DNA sequence with it. In one embodiment, a targeting vector is used to insert a hACE2 sequence at the start ATG site of the EXON1 CDS immediately following mAce2 for the mouse Ace2 site.

在一个实施方案中，所述靶向载体的核酸序列可包含在整合到基因组中时将生成哺乳动物、人类或非人类哺乳动物ACE2基因座的区域的基因修饰的多核苷酸，其中在ACE2基因座处的基因修饰引起ACE2活性减小、ACE2活性增加或ACE2活性调整。在一个实施方案中，产生ACE2基因被完全替换。In one embodiment, the nucleic acid sequence of the targeting vector may include a polynucleotide that will generate a genetic modification of the region of the ACE2 locus of a mammal, a human or a non-human mammal when integrated into the genome, wherein the genetic modification at the ACE2 locus causes a decrease in ACE2 activity, an increase in ACE2 activity or an adjustment in ACE2 activity. In one embodiment, the ACE2 gene is completely replaced.

在一个实施方案中，所述插入核酸可包含调控元件，包括例如启动子、增强子或转录。In one embodiment, the insert nucleic acid may comprise a regulatory element, including, for example, a promoter, enhancer or transcription factor.

一些实施例中，给定插入的多核苷酸和/或哺乳动物、人类细胞或非人类哺乳动物基因座的相应替换区可为编码区、内含子、外显子、未转译区、调控区、启动子或增强子或其任何组合。In some embodiments, a given inserted polynucleotide and/or the corresponding replaced region of a mammalian, human cell, or non-human mammalian locus may be a coding region, an intron, an exon, an untranslated region, a regulatory region, a promoter, or an enhancer, or any combination thereof.

本文提供允许将一种或多种目标多核苷酸靶向整合到靶基因座中的方法，如上概述，引入序列，由此产生的基因编辑细胞。“引入”以使得序列进入细胞内部的方式而将序列呈递到细胞中(多肽或多核苷酸)。Provided herein are methods that allow for targeted integration of one or more polynucleotides of interest into a target locus, as outlined above, introduction of sequences, and gene-edited cells produced thereby."Introduction" is the presentation of a sequence into a cell (polypeptide or polynucleotide) in such a way that the sequence enters the interior of the cell.

来自任何生物体的任何细胞都可用于本文提供的方法中。在特定的实施方案中，所述细胞来自真核生物、非大鼠真核生物、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、大鼠、小鼠或仓鼠。在特定的实施方案中，所述细胞为真核细胞、非大鼠真核细胞、多潜能细胞、非多潜能细胞、非人类多潜能细胞、非人类哺乳动物细胞、人类多潜能细胞、人类ES细胞、人类成人干细胞、发育受限的人类祖细胞、人类诱导的多潜能细胞(iPS)细胞、哺乳动物细胞、人类细胞、成纤维细胞、啮齿动物细胞、非大鼠啮齿动物细胞、大鼠细胞、小鼠细胞、小鼠ES细胞、仓鼠细胞或CHO细胞。Any cell from any organism can be used in the method provided herein.In a specific embodiment, the cell is from eukaryotic organism, non-rat eukaryotic organism, mammal, non-human mammal, mankind, rodent, non-rat rodent, rat, mouse or hamster.In a specific embodiment, the cell is eukaryotic cell, non-rat eukaryotic cell, pluripotent cell, non-pluripotent cell, non-human pluripotent cell, non-human mammal cell, human pluripotent cell, human ES cell, human adult stem cell, development-restricted human progenitor cell, human induced pluripotent cell (iPS) cell, mammalian cell, human cell, fibroblast, rodent cell, non-rat rodent cell, rat cell, mouse cell, mouse ES cell, hamster cell or CHO cell.

在一些实施方案中，在所述方法中采用的细胞具有稳定地并入其基因座中的DNA构建体。“稳定地并入”或“稳定地引入”是指将多核苷酸引入细胞中，因此核苷酸序列整合到细胞的基因组中且能够由其子代遗传。In some embodiments, the cell employed in the method has the DNA construct stably incorporated into its locus."Stably incorporated" or "stably introduced" refers to the introduction of a polynucleotide into a cell so that the nucleotide sequence is integrated into the genome of the cell and can be inherited by its progeny.

在一个实施方案中，向细胞中引入一种或多种多核苷酸通过电穿孔、胞浆内注射、病毒感染、腺病毒、慢病毒、反转录病毒、转染、脂质介导的转染介导，或者经由NucleofectionTM介导。In one embodiment, introduction of one or more polynucleotides into a cell is mediated by electroporation, intracytoplasmic injection, viral infection, adenovirus, lentivirus, retrovirus, transfection, lipid-mediated transfection, or via Nucleofection™.

在一个实施例中，所述表达构建体与所述被引入的核酸一起引入。In one embodiment, the expression construct is introduced together with the introduced nucleic acid.

在一个实施方案中，向所述细胞中引入所述一种或多种多核苷酸可经一段时间执行多次。在一个实施方案中，向所述细胞中引入所述一种或多种多核苷酸可经一段时间执行至少两次，经一段时间执行至少三次，经一段时间执行至少四次，经一段时间执行至少五次，经一段时间执行至少六次，经一段时间执行至少七次，经一段时间执行至少八次，经一段时间执行至少九次，经一段时间执行至少十次，经一段时间执行至少十一次，经一段时间执行至少十二次，经一段时间执行至少十三次，经一段时间执行至少十四次，经一段时间执行至少十五次，经一段时间执行至少十六次，经一段时间执行至少十七次，经一段时间执行至少十八次，经一段时间执行至少十九次或经一段时间执行至少二十次。In one embodiment, the introduction of the one or more polynucleotides into the cell can be performed multiple times over a period of time. In one embodiment, the introduction of the one or more polynucleotides into the cell can be performed at least twice over a period of time, at least three times over a period of time, at least four times over a period of time, at least five times over a period of time, at least six times over a period of time, at least seven times over a period of time, at least eight times over a period of time, at least nine times over a period of time, at least ten times over a period of time, at least eleven times over a period of time, at least twelve times over a period of time, at least thirteen times over a period of time, at least fourteen times over a period of time, at least fifteen times over a period of time, at least sixteen times over a period of time, at least seventeen times over a period of time, at least eighteen times over a period of time, at least nineteen times over a period of time, or at least twenty times over a period of time.

在一个实施方案中，所述靶向载体(含有引入核酸)与表达载体(含有sgRNA)同时引入细胞中。In one embodiment, the targeting vector (containing the introduced nucleic acid) and the expression vector (containing the sgRNA) are introduced into the cell simultaneously.

在一个实施例中，进一步提供用于制造人源化非人类动物的方法，其包括：(a)用包含插入核酸的靶向载体修饰多潜能细胞的基因组以形成供体细胞，所述插入核酸包含人类核酸序列；(b)将所述供体细胞引入宿主胚胎中；和(c)在代孕母体中孕育所述宿主胚胎，其中所述代孕母体生成包含所述人类核酸序列的子代。在一个实施方案中，所述供体细胞引入处于胚泡段或处于前桑椹胚阶段(即，4-细胞阶段或8-细胞阶段)的宿主胚胎中。在更进一步的实施方案中，所述基因修饰能够经由种系传递。In one embodiment, a method for making a humanized non-human animal is further provided, comprising: (a) modifying the genome of pluripotent cells to form donor cells with a targeting vector comprising an insertion nucleic acid, wherein the insertion nucleic acid comprises a human nucleic acid sequence; (b) the donor cell is introduced into a host embryo; and (c) the host embryo is bred in a surrogate mother, wherein the surrogate mother generates a progeny comprising the human nucleic acid sequence. In one embodiment, the donor cell is introduced into a host embryo in a blastocyst segment or in a pre-morula stage (i.e., 4-cell stage or 8-cell stage). In a further embodiment, the genetic modification can be transmitted via germline.

在一个特定实施例中，提供了用于制造人源化小鼠的方法，所述方法包括：(a)将包含ACE2基因片段的靶向载体与连接有sgRNA的表达载体引入小鼠胚胎细胞中，形成基因编辑的供体细胞；(b)将所述供体细胞引入小鼠胚胎中；和(c)在代孕母体中孕育所述小鼠胚胎，其中所述代孕母体生成包含所述人类ACE2序列的子代。In a specific embodiment, a method for making a humanized mouse is provided, the method comprising: (a) introducing a targeting vector comprising an ACE2 gene fragment and an expression vector connected to an sgRNA into mouse embryonic cells to form gene-edited donor cells; (b) introducing the donor cells into mouse embryos; and (c) gestating the mouse embryos in a surrogate mother, wherein the surrogate mother generates offspring comprising the human ACE2 sequence.

实施例1 Ace2基因sgRNA 1及pX330-sgRNA质粒构建Example 1 Construction of Ace2 gene sgRNA 1 and pX330-sgRNA plasmid

合成识别靶位点的sgRNA1序列。Synthesize the sgRNA1 sequence that recognizes the target site.

sgRNA1序列(SEQ ID NO:1)：5’-tactgctcagtccctcaccgagg-3’sgRNA1 sequence (SEQ ID NO:1): 5’-tactgctcagtccctcaccgagg-3’

针对sgRNA位点引入BbsI酶切位点合成sgRNA的上下游退火引物进行后续退火实验。合成sgRNA1的上下游单链引物序列如下：The upstream and downstream annealing primers for synthesizing sgRNA by introducing BbsI restriction site into sgRNA site were used for subsequent annealing experiments. The upstream and downstream single-stranded primer sequences for synthesizing sgRNA1 are as follows:

上游：5’-caccgtactgctcagtccctcaccg-3’(SEQ ID NO:6)Upstream: 5'-caccgtactgctcagtccctcaccg-3' (SEQ ID NO: 6)

下游：5’-aaaccggtgagggactgagcagtac-3’(SEQ ID NO:7)Downstream: 5’-aaaccggtgagggactgagcagtac-3’(SEQ ID NO:7)

pX330质粒来源：pX330载体图谱，参见图1。该质粒骨架来源淼灵质粒平台，货号P0123。Source of pX330 plasmid: See Figure 1 for the map of pX330 vector. The plasmid backbone is from Miaoling Plasmid Platform, catalog number P0123.

将上述sgRNA退火引物经退火后分别连接至pX330质粒(质粒先用BbsI线性化)，获得表达载体pX330-sgRNA1。The above sgRNA annealing primers were ligated to the pX330 plasmid (the plasmid was linearized with BbsI first) after annealing to obtain the expression vector pX330-sgRNA1.

具体的连接反应体系如表1所示。The specific ligation reaction system is shown in Table 1.

表1连接反应体系Table 1 Ligation reaction system

sgRNA退火产物sgRNA annealing product 1μL(0.5μM)1μL(0.5μM) pX330-sgRNA载体pX330-sgRNA vector 1μL(20ng)1μL(20ng) T4 DNALigaseT4 DNALigase 1μL(5U)1μL(5U) 10×T4 DNALigase buffer10×T4 DNA Ligase buffer 1μL1μL H₂O _H2O 补至10μLMake up to 10 μL

反应条件为：16℃连接30min以上，转化至30μL TOP10感受态细胞中，然后取200μL涂布于Amp抗性的平板，37℃培养至少12小时后挑选2个克隆接种含有Amp抗性的LB培养基(5mL)中，37℃，250rpm摇培至少12小时。The reaction conditions are as follows: connect at 16°C for more than 30 min, transform into 30 μL TOP10 competent cells, then take 200 μL and spread on the Amp-resistant plate, culture at 37°C for at least 12 hours, select 2 clones and inoculate into LB medium (5 mL) containing Amp resistance, and culture at 37°C and 250 rpm for at least 12 hours.

随机挑选的克隆送测序公司进行测序验证，测序结果如图2所示，选择连接正确的表达载体pX330-sgRNA1进行后续实验。The randomly selected clones were sent to a sequencing company for sequencing verification. The sequencing results are shown in Figure 2. The correctly connected expression vector pX330-sgRNA1 was selected for subsequent experiments.

实施例2 Ace2基因sgRNA2及pX330-sgRNA2质粒构建Example 2 Construction of Ace2 gene sgRNA2 and pX330-sgRNA2 plasmid

合成识别靶位点的sgRNA2序列。Synthesize the sgRNA2 sequence that recognizes the target site.

sgRNA2-序列(SEQ ID NO:2)：5’-cttggcattttcctcggtgaggg-3’sgRNA2-Sequence (SEQ ID NO:2): 5’-cttggcattttcctcggtgaggg-3’

针对sgRNA位点引入BbsI酶切位点合成sgRNA的上下游退火引物进行后续退火实验。合成sgRNA2的上下游单链引物序列如下：The upstream and downstream annealing primers for synthesizing sgRNA by introducing BbsI restriction site into sgRNA site were used for subsequent annealing experiments. The upstream and downstream single-stranded primer sequences for synthesizing sgRNA2 are as follows:

上游：5’-caccgcttggcattttcctcggtga-3’(SEQ ID NO:4)Upstream: 5’-caccgcttggcattttcctcggtga-3’(SEQ ID NO:4)

下游：5’-aaactcaccgaggaaaatgccaagc-3’(SEQ ID NO:5)Downstream: 5'-aaactcaccgaggaaaatgccaagc-3' (SEQ ID NO: 5)

将上述sgRNA退火引物经退火后分别连接至pX330质粒(质粒先用BbsI线性化)，获得表达载体pX330-sgRNA2。The above sgRNA annealing primers were ligated to the pX330 plasmid (the plasmid was linearized with BbsI) after annealing to obtain the expression vector pX330-sgRNA2.

具体的连接反应体系如表2所示。The specific ligation reaction system is shown in Table 2.

表2连接反应体系Table 2 Ligation reaction system

sgRNA退火产物sgRNA annealing product 1μL(0.5μM)1μL(0.5μM) pX330-sgRNA载体pX330-sgRNA vector 1μL(20ng)1μL(20ng) T4 DNA LigaseT4 DNA Ligase 1μL(5U)1μL(5U) 10×T4 DNA Ligase buffer10×T4 DNA Ligase buffer 1μL1μL H₂O _H2O 补至10μLMake up to 10 μL

随机挑选的克隆送测序公司进行测序验证，测序结果如图3所示，选择连接正确的表达载体pX330-sgRNA2进行后续实验。The randomly selected clones were sent to a sequencing company for sequencing verification. The sequencing results are shown in Figure 3. The correctly connected expression vector pX330-sgRNA2 was selected for subsequent experiments.

实施例3 pX330-sgRNA切割效率鉴定Example 3 Identification of pX330-sgRNA Cutting Efficiency

分别将2μg实施例1、实施例2制备的pX330-sgRNA质粒利用脂质体3000(Lipofectamine 3000，invitrogen，货号L3000001)方法转染进小鼠胚胎干细胞，具体转染操作参考Lipofectamine 3000试剂操作说明。收取转染两天后的小鼠胚胎干细胞，用细胞基因组提取试剂盒进行基因组提取(天根，DP304-02)。随后在基因组切割位点两侧设计PCR上下游引物，上游引物(5arm-sgF：ggttttgatttggccataaaatgttagc(SEQ ID NO:10))和下游引物(3arm-sgR：attcccaggtccagtttcacctaag(SEQ ID NO:11))对提取所得基因组进行PCR反应(使用诺唯赞Phanta Max Super-FidelityDNAPolymerase)。然后利用T7核酸内切酶Ⅰ(T7EⅠ)(Biolabs，M0302L)进行pX330-sgRNA切割效率验证，原理T7核酸内切酶Ⅰ可识别并切割不完全配对DNA。2 μg of pX330-sgRNA plasmid prepared in Example 1 and Example 2 were transfected into mouse embryonic stem cells using Lipofectamine 3000 (Invitrogen, Cat. No. L3000001). For specific transfection operations, refer to the Lipofectamine 3000 reagent operation instructions. Mouse embryonic stem cells were collected two days after transfection, and genome extraction was performed using a cell genome extraction kit (Tiangen, DP304-02). Subsequently, PCR upstream and downstream primers were designed on both sides of the genome cleavage site, and the upstream primer (5arm-sgF: ggttttgatttggccataaaatgttagc (SEQ ID NO: 10)) and the downstream primer (3arm-sgR: attcccaggtccagtttcacctaag (SEQ ID NO: 11)) were subjected to PCR reaction (using Novozyme Phanta Max Super-Fidelity DNA Polymerase) on the extracted genome. Then, T7 endonuclease I (T7EⅠ) (Biolabs, M0302L) was used to verify the cutting efficiency of pX330-sgRNA. The principle is that T7 endonuclease I can recognize and cut incompletely paired DNA.

T7 EⅠ实验具体操作步骤：Specific steps of T7 EⅠ experiment:

将上述从各转染pX330-sgRNA1和pX330-sgRNA2的细胞提取基因组，通过5arm-sgF+3arm-sgR扩增得到的PCR产物进行回收(天根，DP214-02)，取1μg进行退火反应，体系如下所示：The genomes were extracted from the cells transfected with pX330-sgRNA1 and pX330-sgRNA2, and the PCR products obtained by 5arm-sgF+3arm-sgR amplification were recovered (Tian Gen, DP214-02), and 1 μg was taken for annealing reaction. The system is as follows:

表3Table 3

PCR产物PCR products 1μg1 μg Buffer 2Buffer 2 2μL2μL H₂O _H2O 补至20μLMake up to 20 μL 反应条件Reaction conditions 水浴锅煮沸10min后自然冷却Boil in a water bath for 10 minutes and then cool naturally

然后往退火产物中加入1μL T7核酸内切酶Ⅰ于37℃反应30min，后直接跑胶验证。跑胶结果如图4所示。实施例1、实施例2的sgRNA1和sgRNA2其切割条带亮度较强(sgRNA1可切割成约376bp和581bp两条带，sgRNA2可切割成约371bp和586bp两条带)，具有较高切割效率。Then, 1 μL of T7 endonuclease I was added to the annealed product and reacted at 37°C for 30 min, and then directly run on the gel for verification. The gel running results are shown in Figure 4. The cleavage bands of sgRNA1 and sgRNA2 in Example 1 and Example 2 were brighter (sgRNA1 can be cleaved into two bands of about 376 bp and 581 bp, and sgRNA2 can be cleaved into two bands of about 371 bp and 586 bp), and had a higher cleavage efficiency.

由于切割效率主要通过图显示，为了显现实施例1、2的方案的效果，图4中同时列举了另一sgRNA(指代为sgRNA3)的切割效率。Since the cutting efficiency is mainly displayed through the graph, in order to show the effects of the schemes of Examples 1 and 2, the cutting efficiency of another sgRNA (referred to as sgRNA3) is also listed in FIG4 .

所列举的sgRNRA3序列(SEQ ID NO:3)：5’-caagtgaactttgataagacagg-3’The listed sgRNRA3 sequence (SEQ ID NO:3): 5'-caagtgaactttgataagacagg-3'

合成sgRNA3的上下游单链引物序列如下：The upstream and downstream single-stranded primer sequences for synthesizing sgRNA3 are as follows:

上游：5’-caccgcaagtgaactttgataagac-3’(SEQ ID NO:8)Upstream: 5’-caccgcaagtgaactttgataagac-3’(SEQ ID NO:8)

下游：5’-aaacgtcttatcaaagttcacttgc-3’(SEQ ID NO:9)Downstream: 5'-aaacgtcttatcaaagttcacttgc-3' (SEQ ID NO: 9)

其余步骤如同实施例1。随机挑选克隆送测序公司进行测序验证，测序结果如图5所示，选择连接正确的表达载体pX330-sgRNA3进行后续实验。The remaining steps were the same as in Example 1. Randomly selected clones were sent to a sequencing company for sequencing verification. The sequencing results are shown in FIG5 . The correctly connected expression vector pX330-sgRNA3 was selected for subsequent experiments.

由于其他的sgRNA及pX330-sgRNA质粒构建非本发明实施例的重点，因此不作过多赘述和列举。Since the construction of other sgRNAs and pX330-sgRNA plasmids is not the focus of the embodiments of the present invention, they are not described or listed in detail.

实施例4打靶载体的设计Example 4 Design of Targeting Vector

将人ACE2基因(Gene ID：ID:59272)CDS蛋白编码序列(基于NCBI登录号为NM_001371415.1→NP_001358344.1的转录本，其CDS序列如hACE2-CDS SEQ ID NO:12所示，蛋白序列如hACE2-protein SEQ ID NO:13所示)插入到鼠Ace2的启动子和5’UTR区域序列后，利用鼠Ace2启动子启动人ACE2基因表达。同时，插入的人ACE2 CDS序列后加上SV40 polyA序列信号(SV40-polyA序列如SEQ ID NO:14所示)终止信号加强人ACE2 mRNA转录停止。After inserting the CDS protein coding sequence of the human ACE2 gene (Gene ID: ID: 59272) (based on the transcript with NCBI accession number NM_001371415.1→NP_001358344.1, its CDS sequence is shown in hACE2-CDS SEQ ID NO: 12, and the protein sequence is shown in hACE2-protein SEQ ID NO: 13) into the promoter and 5'UTR region sequence of mouse Ace2, the mouse Ace2 promoter was used to start the expression of human ACE2 gene. At the same time, the inserted human ACE2 CDS sequence was followed by an SV40 polyA sequence signal (SV40-polyA sequence is shown in SEQ ID NO: 14) termination signal to strengthen the termination of human ACE2 mRNA transcription.

hACE2-CDS序列如下(SEQ ID NO:12)：The hACE2-CDS sequence is as follows (SEQ ID NO: 12):

atgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaaatgg tctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttca gcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcgg caagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggatt attggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtgggg tagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgg gatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccataacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgaca gctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagct gttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaa aaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgtt ccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggag caaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgg gatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggc taatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcagg atgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatggggagtgatagtggttggcattgtcatcctgatcttcactggggatcagagatcggaagaagaaaaaata aagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttag

hACE2-protein序列如下(SEQ ID NO:13)：The hACE2-protein sequence is as follows (SEQ ID NO: 13):

SV40 polyA序列(SEQ ID NO:14)：SV40 polyA sequence (SEQ ID NO: 14):

aacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatctta

根据序列设计，发明人进一步的设计了如图6所示的打靶方案和包含5’同源臂(5arm)、人ACE2基因片段、3’同源臂(3arm)的载体。其中5’同源臂为NCBI登录号为AC091606.8的第125683-126652位核苷酸(5’同源臂序列如SEQ ID NO:15所示)，3’同源臂为NCBI登录号为AC091606.8的第126796-127766位核苷酸(3’同源臂序列如SEQ ID NO:16所示)。同时，PGK-嘌呤霉素异种基因包也被插入载体用做筛选需要，嘌呤霉素抗性基因(PGK-puro，SEQ ID NO:17)表达框两端设计有一对同向的frt位点(SEQ ID NO:18)，因此可以利用FLP重组酶去除从而解决转基因引起的安全问题。构建的载体被进行完全序列验证。在靶向之前，通过AgeI，所述载体被线性化(打靶载体示意图和打靶策略示意图如图6所示)。According to the sequence design, the inventor further designed a targeting scheme as shown in Figure 6 and a vector comprising a 5' homology arm (5arm), a human ACE2 gene fragment, and a 3' homology arm (3arm). Wherein the 5' homology arm is the 125683-126652 nucleotides of the NCBI accession number AC091606.8 (the 5' homology arm sequence is shown in SEQ ID NO: 15), and the 3' homology arm is the 126796-127766 nucleotides of the NCBI accession number AC091606.8 (the 3' homology arm sequence is shown in SEQ ID NO: 16). At the same time, the PGK-puromycin heterologous gene package is also inserted into the vector for screening needs, and a pair of frt sites (SEQ ID NO: 18) are designed at both ends of the puromycin resistance gene (PGK-puro, SEQ ID NO: 17) expression frame, so FLP recombinase can be used to remove and solve the safety problems caused by transgenics. The constructed vector is fully sequence verified. Prior to targeting, the vector was linearized by AgeI (a schematic diagram of the targeting vector and a schematic diagram of the targeting strategy are shown in FIG6 ).

5’同源臂序列(SEQ ID NO:15)：5' homology arm sequence (SEQ ID NO: 15):

ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccatta attttaagtgaatgcttgcaaaagctc acttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttaggattcttaaaaagaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaagg ctacagagaggaactgtctaaaaacaccaagaaagagagaaagga gagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggtt ttctaggattaatgaataacatgtcttt atttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagatttttactc tagggaaagttgctcagtggatgggatcttggcgcacggggaaag

3’同源臂序列(SEQ ID NO:16)：3' homology arm sequence (SEQ ID NO: 16):

GaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtcttGaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaa aattgttatgaatgaa ttatttgcccagattccaaagatgagtgaaaatgtttaataaaagttgccatcactattctcattatttggtatgtaaagcattcatggaaatgttctaagtcgttatgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaagg cagcagcttagtaattaaggtaaccatggcttaggt gaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggt ggaaaattgagcaactatttaagaaaagactcttt tttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagactttgacaatg tagttgccaaatgagttgcttttaactgctctaatagtttggtctt

PGK-puro序列(SEQ ID NO:17)：PGK-puro sequence (SEQ ID NO: 17):

GggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaGggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatccccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgcccc gcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaa tggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcg ctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccg cgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtgg cggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagc aacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctcccc ttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctga

Frt序列(SEQ ID NO:18)：gaagttcctattctctagaaagtataggaacttcFrt sequence (SEQ ID NO:18): gaagttcctattctctgaaagtataggaacttc

载体构建过程如下：The vector construction process is as follows:

设计扩增5’同源臂的搭桥片段的上游引物(5arm-pcrF：tcgcacacattccacatccaccggtccctatggagtggagaagagtctta(SEQ ID NO:19))和与其匹配的下游引物(5arm-pcrR：gaaggagccaggaagagcttgacatctttccccgtgcgccaagatcc(SEQ ID NO:20))；Design the upstream primer (5arm-pcrF: tcgcacacattccacatccaccggtccctatggagtggagaagagtctta (SEQ ID NO: 19)) and the matching downstream primer (5arm-pcrR: gaaggagccaggaagagcttgacatctttccccgtgcgccaagatcc (SEQ ID NO: 20)) for amplifying the bridging fragment of the 5' homology arm;

设计扩增人ACE2 CDS的搭桥片段的上游引物(hACE2-F：ggatcttggcgcacggggaaagatgtcaagctcttcctggctccttc(SEQ ID NO:21))和与其匹配的下游引物(hACE2-R：cattataagctgcaataaacaagttctaaaaggaggtctgaacatcatc(SEQ ID NO:22))；Design an upstream primer (hACE2-F: ggatcttggcgcacggggaaagatgtcaagctcttcctggctccttc (SEQ ID NO: 21)) and a matching downstream primer (hACE2-R: cattataagctgcaataaacaagttctaaaaggaggtctgaacatcatc (SEQ ID NO: 22)) for amplifying the bridging fragment of human ACE2 CDS;

设计扩增SV40 polyA的搭桥片段的上游引物(SV40-F：gatgatgttcagacctccttttagaacttgtttattgcagcttataatg(SEQ ID NO:23))和与其匹配的下游引物(SV40-R：AGAGAATAGGAACTTCGCACGCGTtaagatacattgatgagtttggac(SEQ ID NO:24))；Design an upstream primer (SV40-F: gatgatgttcagacctccttttagaacttgtttattgcagcttataatg (SEQ ID NO: 23)) and a matching downstream primer (SV40-R: AGAGAATAGGAACTTCGCACGCGTtaagatacattgatgagtttggac (SEQ ID NO: 24)) for amplifying the bridge fragment of SV40 polyA;

设计扩增3’同源臂片段的上游引物(3arm-pcrF：tacgaagttatGtcgacgcGGCGCGCCgaattataatactaacattactg(SEQ ID NO:25))和与其匹配的下游引物(3arm-pcrR：tatgaccatgattacgccaagcttaagaccaaactattagagcagttaaaagc(SEQ ID NO:26))。5’同源臂与3’同源臂的扩增模板为C57BL6/J小鼠基因组，人ACE2的扩增模板为人肺细胞cDNA。PCR反应体系(使用诺唯赞Phanta Max Super-FidelityDNAPolymerase)和条件见表4：The upstream primer (3arm-pcrF: tacgaagttatGtcgacgcGGCGCGCCgaattataatactaacattactg (SEQ ID NO: 25)) and the matching downstream primer (3arm-pcrR: tatgaccatgattacgccaagcttaagaccaaactattagagcagttaaaagc (SEQ ID NO: 26)) for amplifying the 3' homology arm fragment were designed. The amplification templates of the 5' homology arm and the 3' homology arm were the C57BL6/J mouse genome, and the amplification template of human ACE2 was human lung cell cDNA. The PCR reaction system (using Novozyme Phanta Max Super-Fidelity DNA Polymerase) and conditions are shown in Table 4:

表4 PCR反应体系(50μL)Table 4 PCR reaction system (50 μL)

如是，将PCR扩增得到的产物片段5’同源臂、人ACE2 CDS、SV40 polyA使用搭桥PCR(overlap PCR)搭配Touch down PCR方法PCR成连续片段5arm-hACE-SV40；以及PCR扩增得到的3’同源臂回收后直接用于构建同源重组打靶载体，构建过程如下：If so, the 5' homology arm of the product fragment obtained by PCR amplification, human ACE2 CDS, and SV40 polyA are PCR-amplified into a continuous fragment 5arm-hACE-SV40 using the overlap PCR (overlap PCR) and touch down PCR method; and the 3' homology arm obtained by PCR amplification is directly used to construct a homologous recombination targeting vector after recovery. The construction process is as follows:

1.将5arm-hACE-SV40片段进行AgeI+MluI双酶切；3’同源臂片段进行AscI+HindIII双酶切后分别通过酶切连接的方法连接上去，从而得打靶载体。2.通过酶切鉴定后将获得的阳性打靶载体送去测序鉴定，经测序公司验证序列正确(如图7)，从而成功获得人源化ACE2打靶载体，序列如下：1. The 5arm-hACE-SV40 fragment was double-digested with AgeI+MluI; the 3' homology arm fragment was double-digested with AscI+HindIII and then connected by enzyme digestion and ligation to obtain a targeting vector. 2. After enzyme digestion identification, the obtained positive targeting vector was sent for sequencing identification. The sequencing company verified that the sequence was correct (as shown in Figure 7), thus successfully obtaining a humanized ACE2 targeting vector, the sequence is as follows:

整个打靶载体序列(SEQ ID NO:27)：The entire targeting vector sequence (SEQ ID NO: 27):

tcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagcagattgtactgagagtgcaccatatgtaccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatccaccggtccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccaccccaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtcttaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtctaagaaaccattattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtctcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagcagattgtactgagagtgcaccatat gtaccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgct acacaagtggcctctggcctcgcacacattccacatccaccggtccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagt gctgggattaaaggcatggaccaccactgcctgccccattctctccattaatttta agtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgttttataggattcttaaaaagaaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacaga gggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaagga gagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggtt ttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatca ttctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaaactcatgaagagattttactctagggaaagttgctcagtgg atgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaac tgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccact acaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaat gggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcag ctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaa gagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatt tgcttggtgatatgtggggtagattttggacaaatctgtactc tttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccaca gcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacga cttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacaccctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctg ctcaaacaagcactcacgattgttgggactctgccatttacttacatgtta gagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagc actttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatct caaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcat caaagtgaggataagcctaaaatcagctcttggagataaagcata tgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactga agttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacag cctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaaatccttatgcctccatcgatattagcaaaggaga aaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgttt attgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaaaagcattttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgc gctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacat tccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcggg taggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggt ccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgcc acccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccaca ccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggcc gagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggtt cctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgccc gaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccacccca acttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacga agttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcc caaaagatggtaagttcttgaggctacccagggggttattgattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattc caaagatgagtgaaaatgtttaataaagttgccatcacta ttctcattatttggtatgtaaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtcttccgagaactacaaatgacatattaactgaaaaatgcaactggggtttaactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcat tgacacagagctctgaggaatttccaaaggtcacagaagaaaagctat aattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaactttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactctttttttttcttcttccagcaatgctttccttcaaaacggtag cttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattca tagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtcttaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccg ctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagct aactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagct cactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaagg ccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctg tccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggt cgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctac actagaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagtt ggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctag atccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagtt accaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaag tggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagcta gagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgt cagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaag atgcttttctgtgactggtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgat gtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagca aaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtcta agaaaccattattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtc

对比例1Comparative Example 1

将PCR扩增得到的产物片段5’同源臂、人ACE2 CDS、SV40 polyA使用搭桥PCR(overlap PCR)方法PCR成连续片段5arm-hACE-SV40，PCR反应条件如表5所示，其余条件同实施例4。The product fragment 5' homology arm, human ACE2 CDS, and SV40 polyA obtained by PCR amplification were PCR-amplified into a continuous fragment 5arm-hACE-SV40 using the overlap PCR method. The PCR reaction conditions are shown in Table 5, and the other conditions are the same as Example 4.

表5 PCR反应体系(50μL)Table 5 PCR reaction system (50 μL)

结果如图8所示，并未能扩增出3片段连接片段。The results are shown in FIG8 , and no 3-fragment junction fragment could be amplified.

实施例5人源化ACE2小鼠胚胎干细胞的获得Example 5 Obtaining humanized ACE2 mouse embryonic stem cells

其中一个实施方式中，人源化ACE2小鼠胚胎干细胞的获得可按照如下方法进行：In one embodiment, obtaining humanized ACE2 mouse embryonic stem cells can be performed as follows:

C57BL6/J小鼠胚胎干细胞(所述的胚胎干细胞来源于已确立的胚胎干细胞系)从液氮冻存细胞库中复苏，特别使用代数为p10以内的小鼠胚胎干细胞，在6cm培养皿中培养生长3天。C57BL6/J mouse embryonic stem cells (the embryonic stem cells are derived from an established embryonic stem cell line) were revived from a liquid nitrogen frozen cell bank, and mouse embryonic stem cells within the generation of p10 were used, and cultured and grown in a 6 cm culture dish for 3 days.

通过配套使用Nucleofector^TMIIs/2b电转仪以及Mouse ES Cell

Kit(Lonza，VPH-1001)小鼠胚胎干细胞电转试剂盒，使用A023程序，约2×10⁶个细胞被电穿孔，并且所述电转过程含有3μg线性化打靶载体以及1μg pX330-sgRNA1的100μL电转缓冲液中进行。所述转染细胞被种到3个6孔板的孔中，然后恢复36小时。恢复后，1μg/mL嘌呤霉素(默克公司)被加入到细胞培养基中。By using the Nucleofector ^TM IIs/2b electroporator and Mouse ES Cell

Kit (Lonza, VPH-1001) mouse embryonic stem cell electroporation kit, using the A023 program, about 2×10 ⁶ cells were electroporated, and the electroporation process was carried out in 100 μL electroporation buffer containing 3 μg of linearized targeting vector and 1 μg of pX330-sgRNA1. The transfected cells were seeded into three wells of a 6-well plate and then recovered for 36 hours. After recovery, 1 μg/mL puromycin (Merck) was added to the cell culture medium.

经过3-4天的筛选后，通过使用玻璃拉针吸取单克隆的方式挑取并培养嘌呤霉素抗性小鼠胚胎干细胞克隆进96孔板中培养。第二天将单克隆通过胰酶消化并分成两份培养，其中一份细胞在10μL的NP 40裂解缓冲液中于56℃下单独地裂解60分钟，随后在95℃下裂解10分钟。After 3-4 days of screening, puromycin-resistant mouse embryonic stem cell clones were picked and cultured in 96-well plates by using a glass needle to draw single clones. The next day, the single clones were trypsinized and divided into two cultures, one of which was lysed separately in 10 μL of NP 40 lysis buffer at 56°C for 60 minutes, followed by lysis at 95°C for 10 minutes.

其中，小鼠胚胎干细胞培养具体步骤：提前一天准备饲养层feeder细胞，按照需求铺在不同的孔板中，过夜培养形成单层。其中饲养层feeder细胞为经丝裂霉素C(MMC)法处理制备的小鼠胚胎成纤维细胞。使用含有LIF和2i(chir99021和pD0325901inhibitor)的mES培养基培养，每天更换新鲜培养基，可根据细胞的长势适当增加培养基，一般3天传一代，传代时用0.25％胰酶消化，按照6厘米盘30万、6孔板10万每孔左右的密度种植。Among them, the specific steps of mouse embryonic stem cell culture are: prepare the feeder layer feeder cells one day in advance, spread them in different well plates as needed, and culture them overnight to form a monolayer. The feeder layer feeder cells are mouse embryonic fibroblasts prepared by mitomycin C (MMC) method. Use mES culture medium containing LIF and 2i (chir99021 and pD0325901inhibitor) for culture, replace fresh culture medium every day, and increase the culture medium appropriately according to the growth of cells. Generally, one generation is passed every 3 days. During the passaging, use 0.25% trypsin to digest and plant at a density of about 300,000 per well in a 6-cm dish and 100,000 per well in a 6-well plate.

mES+LIF+2i培养基组成为：Knockout DMEM(1×,gibco)+15％ FBS(FRONTBIOMEDICAL,0.22μm Millipore滤器过滤)+GlutaMAX(100×,gibco)+NEAA(100×,gibco)+P/S(P:50units,S:50mg/ml,Hyclone)+β-mercaptoethanol(gibco，使用浓度0.1mM)+LIF(1000units/ml,Millipore)+CHIR99021(GSK3β抑制剂，使用浓度3μM)+PD0325901(MEK抑制剂，使用浓度1μM)。The composition of mES+LIF+2i culture medium is: Knockout DMEM (1×, gibco) + 15% FBS (FRONTBIOMEDICAL, filtered through 0.22μm Millipore filter) + GlutaMAX (100×, gibco) + NEAA (100×, gibco) + P/S (P: 50 units, S: 50 mg/ml, Hyclone) + β-mercaptoethanol (gibco, used at a concentration of 0.1 mM) + LIF (1000 units/ml, Millipore) + CHIR99021 (GSK3β inhibitor, used at a concentration of 3μM) + PD0325901 (MEK inhibitor, used at a concentration of 1μM).

NP40裂解液的组成为：10mL TE(20mM Tris pH8.0，150mM NaCl，2mM EDTA)+0.5％NP40+10μL蛋白酶K(10mg/mL)。The composition of NP40 lysis buffer is: 10 mL TE (20 mM Tris pH 8.0, 150 mM NaCl, 2 mM EDTA) + 0.5% NP40 + 10 μL proteinase K (10 mg/mL).

上述方法中的pX330-sgRNA1替换成pX330-sgRNA2或pX330-sgRNA3，其他步骤同pX330-sgRNA1。In the above method, pX330-sgRNA1 was replaced with pX330-sgRNA2 or pX330-sgRNA3, and the other steps were the same as pX330-sgRNA1.

pX330-sgRNA1、pX330-sgRNA2和pX330-sgRNA3搭配打靶载体打靶效果如表6所示。The targeting effects of pX330-sgRNA1, pX330-sgRNA2 and pX330-sgRNA3 combined with targeting vectors are shown in Table 6.

表6.sgRNA1-3搭配打靶载体打靶效果(存活克隆数)Table 6. Targeting effect of sgRNA1-3 combined with targeting vector (number of surviving clones)

pX330-sgRNA1pX330-sgRNA1 pX330-sgRNA2pX330-sgRNA2 pX330-sgRNA3pX330-sgRNA3 231231 187187 4343

实施例6人源化ACE2小鼠胚胎干细胞基因型鉴定Example 6 Identification of humanized ACE2 mouse embryonic stem cell genotype

将实施例5最后得到的细胞裂解物用作基因型鉴定PCR筛选模板。通过使用PhantaMax Super-FidelityDNAPolymerase试剂(诺唯赞公司)，根据制造商的说明书实施PCR筛选。PCR分析用于定向插入小鼠Ace2位点的具有5’同源臂和3’同源臂的HDR。由于ACE2基因位于X染色体上，使用的小鼠胚胎干细胞为XY雄性，所以不存在双等位基因靶向发生。The cell lysate obtained at the end of Example 5 was used as a genotype identification PCR screening template. PCR screening was performed according to the manufacturer's instructions using PhantaMax Super-Fidelity DNA Polymerase reagent (Novozyme). PCR analysis was used for directional insertion of HDR with 5' homology arms and 3' homology arms in the mouse Ace2 site. Since the ACE2 gene is located on the X chromosome, the mouse embryonic stem cells used are XY males, so there is no double allele targeting.

鉴定定向插入引物的上游位于嘌呤霉素抗性内部(Puro-F：aacctccccttctacgagc(SEQ ID NO:28))，与其配对的引物下游位于3’同源臂的外侧(3arm-outR：tacagccaggatctggatgtcagc(SEQ ID NO:29))。如果重组载体插入位置正确，则应出现一条长度应为1504bp的条带。此时小鼠胚胎干细胞Ace2位点基因组序列替换为SEQ ID NO:30：The upstream of the primer for identification of directional insertion is located inside the puromycin resistance (Puro-F: aacctccccttctacgagc (SEQ ID NO: 28)), and the downstream of the primer paired with it is located outside the 3' homology arm (3arm-outR: tacagccaggatctggatgtcagc (SEQ ID NO: 29)). If the insertion position of the recombinant vector is correct, a band with a length of 1504 bp should appear. At this time, the genomic sequence of the mouse embryonic stem cell Ace2 site is replaced with SEQ ID NO: 30:

ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccaccccaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:30)ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaatt ttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggag gcagaggaaagcagatctcttgagtttgaggctagcctagtctacagaggggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttct gttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttactt aaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgtctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagtt gctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttc ttggaattaacaccaatattactgaagagaatgtccaaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacac aattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctg aggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgag ggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttac tgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgtgcaca aaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatt taaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaaaagtggtgggagatgaagcgagatagttggggtggtgga acctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaa atgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaagaattcttttgtggggatggagtaccgactggagtccatatgca gaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattcttt ttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggac ctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatggggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaa aggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctctaga aagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttgg tggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctcctt cgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagccca agctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactacccggccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggct cgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaagga gcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgac gtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccaccccaacttgttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaaataaagcatttttttcactgcattctagttgtgtgg tttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggtt attgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgtta tgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaatgcaactggggtttact gaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaa gcagcatattttgtttgacaagtgcaaggacttagaactttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggct gtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:30)

PCR鉴定结果见图9，共鉴定了14株克隆，其中标注*号为阳性克隆，同时PCR产物经测序结果匹配一致。此PCR结果阳性的小鼠胚胎干细胞克隆为成功编辑的人源化小鼠胚胎干细胞模型，但其中还保留有PGK-Puro筛选标志，由于PGK-Puro两端具有一对同向的frt位点，因此可以利用FLP重组酶去除从而解决转基因引起的安全问题。The PCR identification results are shown in Figure 9. A total of 14 clones were identified, of which the ones marked with * are positive clones, and the PCR products are consistent with the sequencing results. The mouse embryonic stem cell clone with positive PCR results is a successfully edited humanized mouse embryonic stem cell model, but it still retains the PGK-Puro screening marker. Since there is a pair of frt sites in the same direction at both ends of PGK-Puro, it can be removed by FLP recombinase to solve the safety problem caused by transgenics.

PGK-Puro筛选标志的删除方法如下：The PGK-Puro screening marker is removed as follows:

将上述PCR结果阳性的小鼠胚胎干细胞继续培养至6cm盘，通过配套使用Nucleofector^TMIIs/2b电转仪以及Mouse ES Cell

Kit(Lonza，VPH-1001)小鼠胚胎干细胞电转试剂盒，使用A023程序，约2×10⁶个细胞被电穿孔，并且所述电转过程含有pPGK-FLPo质粒(Addgene，13793)的100μL电转缓冲液中进行。所述转染细胞被种到6个12孔板的孔中，3天后，采取玻璃拉针吸取单克隆的方式挑取小鼠胚胎干细胞克隆进96孔板中培养，第二天将单克隆消化传代一分为二，一份进行嘌呤霉素筛选，一份正常培养。若成功删除PGK-Puro抗性筛选标志，则嘌呤霉素筛选后细胞不耐受死亡。嘌呤霉素不耐受相对应的克隆可使用上游引物(hACE2-F：tgatagtggttggcattgt(SEQ ID NO:31))和下游引物(3arm-outR：tacagccaggatctggatgtcagc(SEQ ID NO:29))进行基因组基因型PCR鉴定。若成功删除PGK-Puro筛选标志，则PCR产物长度为1532bp，删除PGK-Puro前的PCR产物长度为2863bp。基因型鉴定结果如图9所示，跑胶条带在1500bp处的为成功删除PGK-puro后的克隆，条带在3000bp处的为删除前克隆。嘌呤霉素不耐受相对应的克隆则为最终成功编辑为人源化ACE2打靶的小鼠胚胎干细胞，也就是小鼠胚胎干细胞最终获得如图10所示的人源化ACE2基因。最终删除PGK-puro后小鼠胚胎干细胞Ace2位点基因组序列替换为SEQ ID NO:32，如下所示：The mouse embryonic stem cells with positive PCR results were further cultured in 6 cm dishes and electroporated using the Nucleofector ^TM IIs/2b electroporator and Mouse ES Cell

Kit (Lonza, VPH-1001) mouse embryonic stem cell electroporation kit, using the A023 program, about 2×10 ⁶ cells were electroporated, and the electroporation process was carried out in 100 μL electroporation buffer containing pPGK-FLPo plasmid (Addgene, 13793). The transfected cells were seeded into 6 wells of 12-well plates. After 3 days, the mouse embryonic stem cell clones were picked up by glass needle aspiration and cultured in 96-well plates. The next day, the monoclonal digestion and passage were divided into two, one for puromycin screening and one for normal culture. If the PGK-Puro resistance screening marker is successfully deleted, the cells will die due to intolerance after puromycin screening. The clones corresponding to puromycin intolerance can be identified by genomic genotype PCR using upstream primers (hACE2-F: tgatagtggttggcattgt (SEQ ID NO: 31)) and downstream primers (3arm-outR: tacagccaggatctggatgtcagc (SEQ ID NO: 29)). If the PGK-Puro screening marker is successfully deleted, the length of the PCR product is 1532bp, and the length of the PCR product before deleting PGK-Puro is 2863bp. The genotype identification results are shown in Figure 9. The band at 1500bp is the clone after the successful deletion of PGK-puro, and the band at 3000bp is the clone before deletion. The clone corresponding to puromycin intolerance is the mouse embryonic stem cell that was finally successfully edited to target humanized ACE2, that is, the mouse embryonic stem cell finally obtained the humanized ACE2 gene as shown in Figure 10. After the final deletion of PGK-puro, the genomic sequence of the mouse embryonic stem cell Ace2 site was replaced with SEQ ID NO:32, as shown below:

ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:32)ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccatta attttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgcct tcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggaga gagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccc tttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctc caggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctct tctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacattttt ggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtcc aaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactgg aaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagt ttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctt tgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgc ctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttatgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgt tggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatccca cagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacaccctaagcattta aaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaag cactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgttctaatgattactcattcattc gatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggcct ctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactgg agtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatga atggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagtt gaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaa cacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatggggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacact gatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttca caaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctgaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaat gcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaact ggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcat ggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgaga actacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaaatctcagcctactctgggaaagcagcatattttgtttga caagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagtta agtggtggaaaattgagcaactatttaagaaaagactctttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccag ttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:32)

实施例7人源化ACE2基因改造小鼠的形成Example 7 Formation of Humanized ACE2 Gene-Transformed Mice

1、腹腔注射7.5单位PMSG到4-10周龄B6C3F1雌鼠，48小时后注射hCG并与CD1种公鼠合笼，次日早上检查雌鼠阴栓并将有阴栓的雌鼠挑出，记录雌鼠相应的受精时间。1. Intraperitoneally inject 7.5 units of PMSG into 4-10 week old B6C3F1 female mice. 48 hours later, inject hCG and place them in a cage with CD1 male mice. The next morning, check the female mice for vaginal plugs and select the female mice with vaginal plugs. Record the corresponding fertilization time of the female mice.

2、下一日将孕鼠进行颈椎脱臼法安乐死，70％酒精消毒小鼠腹部，用辅料镊子和眼科剪刀剪开腹部皮肤和肌肉层，打开腹腔。用镊子抓住一个子宫角的上部，用剪刀在靠近输卵管的膜上开一个小口，将输卵管和卵巢连接处剪断，将输卵管和附带的子宫移至35mm培养皿中，用镊子固定好输卵管伞口端，用装填好M2培养液(Hogan,B.(1994).Manipulating the mouse embryo:a laboratory manual,2nd edn(Cold SpringHarbor,NY,Cold Spring Harbor Laboratory Press).)的冲洗针轻轻插入伞口，用0.1mL的M2培养液冲洗输卵管，用移卵管收集冲出的胚胎并用M2清洗3次，收集E1.5的小鼠2-细胞胚胎。2. The next day, the pregnant mice were euthanized by cervical dislocation, the abdomen of the mice was disinfected with 70% alcohol, and the abdominal skin and muscle layer were cut open with auxiliary forceps and ophthalmic scissors to open the abdominal cavity. The upper part of one uterine horn was grasped with forceps, and a small incision was made on the membrane near the fallopian tube with scissors, and the connection between the fallopian tube and the ovary was cut off. The fallopian tube and the attached uterus were moved to a 35 mm culture dish, and the fimbria of the fallopian tube was fixed with forceps. The flushing needle filled with M2 culture medium (Hogan, B. (1994). Manipulating the mouse embryo: a laboratory manual, 2nd edn (Cold Spring Harbor, NY, Cold Spring Harbor Laboratory Press).) was gently inserted into the fimbria, and the fallopian tube was flushed with 0.1 mL of M2 culture medium. The embryos flushed out were collected with an oviduct transfer tube and washed 3 times with M2 to collect E1.5 mouse 2-cell embryos.

3、对收集到的小鼠胚胎放入含0.1mM MgSO4,0.1mM CaCl₂和0.3％牛血清白蛋白的0.3M甘露醇(Sigma-Aldrich Inc.,St.Louis,MO)中，用Cellfusion CF-150/B电融合仪和250-um融合槽(BLS Ltd.,Budapest,Hungary)进行60V 50微秒直流电融合，获得4倍体胚胎；放入KSOM培养基(Summers,M.C.,McGinnis,L.K.,Lawitts,J.A.,Raffin,M.,andBiggers,J.D.(2000).IVF of mouse ova in a simplex optimized mediumsupplemented with amino acids.Hum Reprod 15,1791-1801.)，于CO2培养箱中培养24小时后用酸性台氏液(Sigma-Aldrich，T1788)去掉透明带，与胚胎干细胞(即本申请制备的人源化ACE2小鼠胚胎干细胞)聚集，形成嵌合体胚胎(Nagy,A.,Rossant,J.,Nagy,R.,Abramow-Newerly,W.,and Roder,J.C.(1993).Derivation of completely cellculture-derived mice from early-passage embryonic stem cells.Proc Natl AcadSci U S A 90,8424-8428.)。3. The collected mouse embryos were placed in 0.3 M mannitol (Sigma-Aldrich Inc., St. Louis, MO) containing 0.1 mM MgSO4, _0.1 mM CaCl2 and 0.3% bovine serum albumin, and fused with 60 V 50 microseconds direct current using a Cellfusion CF-150/B electrofusion instrument and a 250-um fusion tank (BLS Ltd., Budapest, Hungary) to obtain quadruploid embryos; placed in KSOM medium (Summers, MC, McGinnis, LK, Lawtts, JA, Raffin, M., and Biggers, JD (2000). IVF of mouse ova in a simplex optimized medium supplemented with amino acids. Hum Reprod 15, 1791-1801.), after culturing in a CO2 incubator for 24 hours, the zona pellucida was removed with acidic Tyrode's solution (Sigma-Aldrich, T1788), and the cells were aggregated with embryonic stem cells (i.e., humanized ACE2 mouse embryonic stem cells prepared in the present application) to form chimeric embryos (Nagy, A., Rossant, J., Nagy, R., Abramow-Newerly, W., and Roder, JC (1993). Derivation of completely cell culture-derived mice from early-passage embryonic stem cells. Proc Natl Acad Sci USA 90, 8424-8428.).

4、于CO₂培养箱中培养过夜，移植到E2.5天的假孕鼠子宫，17天后在将代孕鼠颈椎脱臼法安乐死后进行剖腹手术，存活和呼吸的新生鼠放入代乳鼠笼内，21天后断奶，获得完全来自胚胎干细胞的转基因小鼠，即得人源化ACE2基因改造小鼠(图12)。4. Cultured overnight in a _CO2 incubator, transplanted into the uterus of a pseudo-pregnant mouse on day E2.5, and euthanized by cervical dislocation after 17 days, and then performed a laparotomy. The surrogate mice that were alive and breathing were placed in a suckling mouse cage and weaned after 21 days to obtain transgenic mice completely derived from embryonic stem cells, i.e., humanized ACE2 gene-modified mice (Figure 12).

相应的培养基配方如表7所示。The corresponding culture medium formula is shown in Table 7.

表7培养基配方Table 7 Culture medium formula

实施例8人源化ACE2基因改造小鼠的hACE2表达验证Example 8 Verification of hACE2 expression in humanized ACE2 gene-modified mice

取一只野生型小鼠以及一只人源化ACE2基因改造小鼠(实施例7制备得到的)的小肠、肺以及肾脏样品，用于RNA提取以及qPCR验证hACE2基因表达情况。组织样品加入Trizol试剂裂解。每毫升的Trizol裂解液中加入200μl氯仿，剧烈震荡后静置至分层，重复操作3次后，4℃，14000rpm离心15min，取上清400μl至预冷的400μl异丙醇中，颠倒混匀，冰上静置5min，4℃，14000pm离心10min，在EP管底部可见白色沉淀，为RNA。Small intestine, lung and kidney samples of a wild-type mouse and a humanized ACE2 gene-modified mouse (prepared in Example 7) were taken for RNA extraction and qPCR verification of hACE2 gene expression. Tissue samples were lysed by adding Trizol reagent. 200 μl of chloroform was added to each milliliter of Trizol lysate, and the mixture was allowed to stand until stratification after vigorous shaking. After repeating the operation 3 times, centrifuged at 4°C, 14000rpm for 15min, and 400 μl of the supernatant was added to 400 μl of pre-cooled isopropanol, inverted and mixed, and allowed to stand on ice for 5min. Centrifuged at 4°C, 14000pm for 10min, a white precipitate was visible at the bottom of the EP tube, which was RNA.

弃上清，加入预冷的70％乙醇溶液，洗涤一次，离心，去掉乙醇，将RNA晾干至无色透明，按照沉淀多少，加入适量RNase-free H2O，60℃水浴至完全溶解。分别取1μl，用10mMTris-HCl(pH 7.5)以及H2O稀释50倍测定纯度及浓度，OD260/280的值为2.0左右为宜。提完的RNA存放于-80℃，可长期保存。Discard the supernatant, add pre-cooled 70% ethanol solution, wash once, centrifuge, remove ethanol, dry RNA until colorless and transparent, add appropriate amount of RNase-free H2O according to the amount of precipitation, and bathe in 60℃ water until completely dissolved. Take 1μl respectively, dilute 50 times with 10mMTris-HCl (pH 7.5) and H2O to determine the purity and concentration, and the OD260/280 value is about 2.0. The extracted RNA is stored at -80℃ for long-term storage.

取2μg RNA进行反转，体系如表8所示。反转得到的cDNA样品稀释30倍后，进一步用来做荧光定量PCR检测基因表达情况，hACE2检测qPCR引物如下(hACE2-qF：GGTCTTCAGTGCTCTCAG(SEQ ID NO:33)；hACE2-qR：GCATTCTTGTGGATTATCTGG(SEQ ID NO:34))，并可见人源化ACE2基因改造小鼠在RNA水平上表达人ACE2并具有组织特异性(图13)。2 μg RNA was taken for inversion, and the system is shown in Table 8. After the cDNA sample obtained by inversion was diluted 30 times, it was further used for fluorescence quantitative PCR to detect gene expression. The hACE2 detection qPCR primers are as follows (hACE2-qF: GGTCTTCAGTGCTCTCAG (SEQ ID NO: 33); hACE2-qR: GCATTCTTGTGGATTATCTGG (SEQ ID NO: 34)), and it can be seen that the humanized ACE2 gene-modified mice express human ACE2 at the RNA level and have tissue specificity (Figure 13).

表8Table 8

另外再取人源化ACE2基因改造小鼠的小肠、肺以及睾丸组织进行免疫荧光组织蛋白水平验证人ACE2表达情况。将小鼠组织在4％PFA中于4℃固定过夜。第二天，将组织在PBS中漂洗3次，并在4℃在PBS中的30％蔗糖中冷冻保护过夜。然后将组织在30％蔗糖/PBS和O.C.T.按1：1混合物中孵育2-4小时。冷冻嵌入培养基(Sakura，目录号4583)。接下来，将组织从蔗糖/OCT混合物中转移到低温模具中，并用O.C.T.填充。将包埋的组织在干冰上冷冻，然后保存在-80℃直至低温恒温器切片。使用低温恒温器将冷冻的类器官组织切成20μm切片，并收集在superfrost Ultra Plus载玻片上。将切片干燥过夜，然后用于免疫荧光。在室温下，将4％PFA直接后固定在载玻片上10分钟，然后在PBS中洗涤3x10次。使用疏水性PAP笔勾勒出组织区域的轮廓。使用含0.05％叠氮化钠的5％BSA/0.3％TX100的PBS封闭，并在室温下孵育30分钟。然后将组织与抗ACE2抗体(ET1611-58，Huabio)在4℃孵育过夜。PBS洗涤后，二抗(A11004，Invitrogen)孵育1小时，然后DAPI 2分钟。最后，将盖玻片安装在载玻片上进行观察，可从组织免疫荧光结果中看出人ACE2在蛋白表达水平上在人源化ACE2基因改造小鼠组织上特异表达(图14)。In addition, the small intestine, lung and testicular tissues of humanized ACE2 gene-modified mice were taken for immunofluorescence tissue protein level verification of human ACE2 expression. Mouse tissues were fixed in 4% PFA at 4°C overnight. The next day, the tissues were rinsed 3 times in PBS and cryoprotected in 30% sucrose in PBS at 4°C overnight. The tissues were then incubated in a 1:1 mixture of 30% sucrose/PBS and O.C.T. for 2-4 hours. Frozen embedding medium (Sakura, catalog number 4583). Next, the tissues were transferred from the sucrose/OCT mixture to a cryogenic mold and filled with O.C.T. The embedded tissues were frozen on dry ice and then stored at -80°C until cryostat sectioning. The frozen organoid tissues were cut into 20 μm slices using a cryostat and collected on superfrost Ultra Plus slides. The sections were dried overnight and then used for immunofluorescence. At room temperature, 4% PFA was directly post-fixed on the slide for 10 minutes, and then washed 3x10 times in PBS. The outline of the tissue area was outlined using a hydrophobic PAP pen. Blocked with 5% BSA/0.3% TX100 in PBS containing 0.05% sodium azide and incubated at room temperature for 30 minutes. The tissue was then incubated with anti-ACE2 antibody (ET1611-58, Huabio) at 4°C overnight. After washing with PBS, the secondary antibody (A11004, Invitrogen) was incubated for 1 hour, and then DAPI for 2 minutes. Finally, the coverslip was mounted on a slide for observation, and it can be seen from the tissue immunofluorescence results that human ACE2 is specifically expressed in humanized ACE2 genetically modified mouse tissues at the protein expression level (Figure 14).

实施例9 SARS-CoV-2感染人源化ACE2基因改造小鼠Example 9 SARS-CoV-2 infection of humanized ACE2 genetically modified mice

人源化ACE2基因改造小鼠通过滴鼻吸入30μl DMEM，内含2×10⁶TCID₅₀病毒载量的SARS-CoV-2，对照组则滴入30μl DMEM。分别在D1、D3、D5、D6、D7以及D9取感染小鼠的肺、气道和小肠组织用Trizol裂解并进行RNA提取以及荧光定量PCR检测SARS-CoV-2病毒滴度，RNA提取方法同上述hACE2表达检测实施例一致。提取得到的组织RNA将使用

Probe One-step qRT-PCR Kit(Toyobo)试剂盒对SARS-CoV-2的核蛋白N基因进行检测。荧光定量PCR检测所用到的引物如下：正向引物：5’-GGGGAACTTCTCCTGCTAGAAT-3’(SEQ IDNO:35)；反向引物:5’-CAGACATTTTGCTCTCAAGCTG-3’(SEQ ID NO:36)，TaqMan探针序列为5’-FAM-TTGCTGCTGCTTGACAGATT-TAMRA-3’(SEQ ID NO:37)，结果可见人源化ACE2基因改造小鼠经SARS-CoV-2感染后肺、气道及小肠在D1、D3病毒载量达到顶峰，随之下降，表明本发明人源化ACE2基因改造小鼠对SARS-CoV-2易感(图15)，而对照组小鼠并未检测到病毒。图15中的折线表示平均值。The humanized ACE2 gene-modified mice inhaled 30 μl of DMEM containing 2×10 ⁶ TCID ₅₀ viral load of SARS-CoV-2 through nasal drops, while the control group was dripped with 30 μl of DMEM. The lung, airway and small intestinal tissues of the infected mice were taken on D1, D3, D5, D6, D7 and D9, and RNA was extracted and fluorescent quantitative PCR was used to detect the SARS-CoV-2 virus titer. The RNA extraction method was consistent with the above-mentioned hACE2 expression detection example. The extracted tissue RNA was used

The Probe One-step qRT-PCR Kit (Toyobo) was used to detect the nucleoprotein N gene of SARS-CoV-2. The primers used for the fluorescence quantitative PCR test are as follows: forward primer: 5'-GGGGAACTTCTCCTGCTAGAAT-3' (SEQ ID NO: 35); reverse primer: 5'-CAGACATTTTGCTCTCAAGCTG-3' (SEQ ID NO: 36), and the TaqMan probe sequence is 5'-FAM-TTGCTGCTGCTTGACAGATT-TAMRA-3' (SEQ ID NO: 37). The results show that the humanized ACE2 gene-modified mice were infected with SARS-CoV-2. The lung, airway and small intestine reached a peak viral load on D1 and D3, and then decreased, indicating that the humanized ACE2 gene-modified mice of the present invention are susceptible to SARS-CoV-2 (Figure 15), while the control group mice did not detect the virus. The broken line in Figure 15 represents the average value.

序列表Sequence Listing

<110> 中国科学院广州生物医药与健康研究院<110> Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences

<120> 人源化ACE2基因改造小鼠胚胎干细胞模型制备方法及应用<120> Preparation method and application of humanized ACE2 gene-modified mouse embryonic stem cell model

<150> 202010152182.X<150> 202010152182.X

<151> 2020-03-06<151> 2020-03-06

<160> 37<160> 37

<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0

<210> 1<210> 1

<211> 23<211> 23

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> sgRNA1序列<223> sgRNA1 sequence

<400> 1<400> 1

tactgctcag tccctcaccg agg 23tactgctcag tccctcaccg agg 23

<210> 2<210> 2

<211> 23<211> 23

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> sgRNA2序列<223> sgRNA2 sequence

<400> 2<400> 2

cttggcattt tcctcggtga ggg 23cttggcattt tcctcggtga ggg 23

<210> 3<210> 3

<211> 23<211> 23

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> sgRNRA3序列<223> sgRNRA3 sequence

<400> 3<400> 3

caagtgaact ttgataagac agg 23caagtgaactttgataagac agg 23

<210> 4<210> 4

<211> 25<211> 25

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 合成sgRNA2的上游单链引物序列<223> Synthesize the upstream single-stranded primer sequence of sgRNA2

<400> 4<400> 4

caccgcttgg cattttcctc ggtga 25caccgcttgg cattttcctc ggtga 25

<210> 5<210> 5

<211> 25<211> 25

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 合成sgRNA2的下游单链引物序列<223> Synthesize the downstream single-stranded primer sequence of sgRNA2

<400> 5<400> 5

aaactcaccg aggaaaatgc caagc 25aaactcaccg aggaaaatgc caagc 25

<210> 6<210> 6

<211> 25<211> 25

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 合成sgRNA1的上游单链引物序列<223> Synthesize the upstream single-stranded primer sequence of sgRNA1

<400> 6<400> 6

caccgtactg ctcagtccct caccg 25caccgtactg ctcagtccct caccg 25

<210> 7<210> 7

<211> 25<211> 25

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 合成sgRNA1的下游单链引物序列<223> Synthesize the downstream single-stranded primer sequence of sgRNA1

<400> 7<400> 7

aaaccggtga gggactgagc agtac 25aaaccggtga gggactgagc agtac 25

<210> 8<210> 8

<211> 25<211> 25

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 合成sgRNA3的上游单链引物序列<223> Synthesize the upstream single-stranded primer sequence of sgRNA3

<400> 8<400> 8

caccgcaagt gaactttgat aagac 25caccgcaagt gaactttgat aagac 25

<210> 9<210> 9

<211> 25<211> 25

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 合成sgRNA3的下游单链引物序列<223> Synthesize the downstream single-stranded primer sequence of sgRNA3

<400> 9<400> 9

aaacgtctta tcaaagttca cttgc 25aaacgtctta tcaaagttca cttgc 25

<210> 10<210> 10

<211> 28<211> 28

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> 5arm-sgF<223> 5arm-sgF

<400> 10<400> 10

ggttttgatt tggccataaa atgttagc 28ggttttgatt tggccataaa atgttagc 28

<210> 11<210> 11

<211> 25<211> 25

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> 3arm-sgR<223> 3arm-sgR

<400> 11<400> 11

attcccaggt ccagtttcac ctaag 25attcccaggt ccagtttcac ctaag 25

<210> 12<210> 12

<211> 2418<211> 2418

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> hACE2-CDS序列<223> hACE2-CDS sequence

<400> 12<400> 12

atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc tcagtccacc 60atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc tcagtccacc 60

attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga agacctgttc 120attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga agacctgttc 120

tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga gaatgtccaa 180tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga gaatgtccaa 180

aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc cacacttgcc 240aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc cacacttgcc 240

caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct gcaggctctt 300caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct gcaggctctt 300

cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa cacaattcta 360cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa cacaattcta 360

aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa tccacaagaa 420aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa tccacaagaa 420

tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga ctacaatgag 480tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga ctacaatgag 480

aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag gccattatat 540aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag gccattatat 540

gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga ggactatggg 600gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga ggactatggg 600

gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta cagccgcggc 660gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta cagccgcggc 660

cagttgattg aagatgtgga acataccttt gaagagatta aaccattata tgaacatctt 720cagttgattg aagatgtgga acataccttt gaagagatta aaccattata tgaacatctt 720

catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag tccaattgga 780catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag tccaattgga 780

tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa tctgtactct 840tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa tctgtactct 840

ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat ggtggaccag 900ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat ggtggaccag 900

gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc tgttggtctt 960gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc tgttggtctt 960

cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg aaatgttcag 1020cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg aaatgttcag 1020

aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag gatccttatg 1080aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag gatccttatg 1080

tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg gcatatccag 1140tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg gcatatccag 1140

tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa tgaaggattc 1200tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa tgaaggattc 1200

catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca tttaaaatcc 1260catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca tttaaaatcc 1260

attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa cttcctgctc 1320attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa cttcctgctc 1320

aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga gaagtggagg 1380aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga gaagtggagg 1380

tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg gtgggagatg 1440tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg gtgggatg 1440

aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata ctgtgacccc 1500aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata ctgtgacccc 1500

gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac aaggaccctt 1560gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac aaggaccctt 1560

taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg ccctctgcac 1620taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg ccctctgcac 1620

aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat gctgaggctt 1680aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat gctgaggctt 1680

ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa gaacatgaat 1740ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa gaacatgaat 1740

gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga ccagaacaag 1800gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga ccagaacaag 1800

aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag catcaaagtg 1860aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag catcaaagtg 1860

aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga caatgaaatg 1920aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga caatgaaatg 1920

tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa agtaaaaaat 1980tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa agtaaaaaat 1980

cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc aagaatctcc 2040cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc aagaatctcc 2040

tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag aactgaagtt 2100tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag aactgaagtt 2100

gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct gaatgacaac 2160gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct gaatgacaac 2160

agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc ccctgtttcc 2220agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc ccctgtttcc 2220

atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat tgtcatcctg 2280atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat tgtcatcctg 2280

atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg agaaaatcct 2340atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg agaaaatcct 2340

tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa cactgatgat 2400tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa cactgatgat 2400

gttcagacct ccttttag 2418gttcagacctccttttag 2418

<210> 13<210> 13

<211> 805<211> 805

<212> PRT<212> PRT

<213> Natural sequence<213> Natural sequence

<220><220>

<223> hACE2-protein序列<223> hACE2-protein sequence

<400> 13<400> 13

Met Ser Ser Ser Ser Trp Leu Leu Leu Ser Leu Val Ala Val Thr AlaMet Ser Ser Ser Ser Trp Leu Leu Leu Ser Leu Val Ala Val Thr Ala

1 5 10 151 5 10 15

Ala Gln Ser Thr Ile Glu Glu Gln Ala Lys Thr Phe Leu Asp Lys PheAla Gln Ser Thr Ile Glu Glu Gln Ala Lys Thr Phe Leu Asp Lys Phe

20 25 3020 25 30

Asn His Glu Ala Glu Asp Leu Phe Tyr Gln Ser Ser Leu Ala Ser TrpAsn His Glu Ala Glu Asp Leu Phe Tyr Gln Ser Ser Leu Ala Ser Trp

35 40 4535 40 45

Asn Tyr Asn Thr Asn Ile Thr Glu Glu Asn Val Gln Asn Met Asn AsnAsn Tyr Asn Thr Asn Ile Thr Glu Glu Asn Val Gln Asn Met Asn Asn

50 55 6050 55 60

Ala Gly Asp Lys Trp Ser Ala Phe Leu Lys Glu Gln Ser Thr Leu AlaAla Gly Asp Lys Trp Ser Ala Phe Leu Lys Glu Gln Ser Thr Leu Ala

65 70 75 8065 70 75 80

Gln Met Tyr Pro Leu Gln Glu Ile Gln Asn Leu Thr Val Lys Leu GlnGln Met Tyr Pro Leu Gln Glu Ile Gln Asn Leu Thr Val Lys Leu Gln

85 90 9585 90 95

Leu Gln Ala Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp LysLeu Gln Ala Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp Lys

100 105 110100 105 110

Ser Lys Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr SerSer Lys Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser

115 120 125115 120 125

Thr Gly Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu LeuThr Gly Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu

130 135 140130 135 140

Glu Pro Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn GluGlu Pro Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu

145 150 155 160145 150 155 160

Arg Leu Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln LeuArg Leu Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu

165 170 175165 170 175

Arg Pro Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala ArgArg Pro Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg

180 185 190180 185 190

Ala Asn His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr GluAla Asn His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu

195 200 205195 200 205

Val Asn Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile GluVal Asn Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu

210 215 220210 215 220

Asp Val Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His LeuAsp Val Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu

225 230 235 240225 230 235 240

His Ala Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr IleHis Ala Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile

245 250 255245 250 255

Ser Pro Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp GlySer Pro Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly

260 265 270260 265 270

Arg Phe Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln LysArg Phe Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys

275 280 285275 280 285

Pro Asn Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp AlaPro Asn Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala

290 295 300290 295 300

Gln Arg Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly LeuGln Arg Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu

305 310 315 320305 310 315 320

Pro Asn Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Asp ProPro Asn Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Asp Pro

325 330 335325 330 335

Gly Asn Val Gln Lys Ala Val Cys His Pro Thr Ala Trp Asp Leu GlyGly Asn Val Gln Lys Ala Val Cys His Pro Thr Ala Trp Asp Leu Gly

340 345 350340 345 350

Lys Gly Asp Phe Arg Ile Leu Met Cys Thr Lys Val Thr Met Asp AspLys Gly Asp Phe Arg Ile Leu Met Cys Thr Lys Val Thr Met Asp Asp

355 360 365355 360 365

Phe Leu Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met AlaPhe Leu Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala

370 375 380370 375 380

Tyr Ala Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly PheTyr Ala Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe

385 390 395 400385 390 395 400

His Glu Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro LysHis Glu Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys

405 410 415405 410 415

His Leu Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp AsnHis Leu Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn

420 425 430420 425 430

Glu Thr Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val GlyGlu Thr Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly

435 440 445435 440 445

Thr Leu Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val PheThr Leu Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe

450 455 460450 455 460

Lys Gly Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu MetLys Gly Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met

465 470 475 480465 470 475 480

Lys Arg Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu ThrLys Arg Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr

485 490 495485 490 495

Tyr Cys Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser PheTyr Cys Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe

500 505 510500 505 510

Ile Arg Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu AlaIle Arg Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala

515 520 525515 520 525

Leu Cys Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp IleLeu Cys Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile

530 535 540530 535 540

Ser Asn Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg LeuSer Asn Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu

545 550 555 560545 550 555 560

Gly Lys Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly AlaGly Lys Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala

565 570 575565 570 575

Lys Asn Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu PheLys Asn Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe

580 585 590580 585 590

Thr Trp Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser ThrThr Trp Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr

595 600 605595 600 605

Asp Trp Ser Pro Tyr Ala Asp Gln Ser Ile Lys Val Arg Ile Ser LeuAsp Trp Ser Pro Tyr Ala Asp Gln Ser Ile Lys Val Arg Ile Ser Leu

610 615 620610 615 620

Lys Ser Ala Leu Gly Asp Lys Ala Tyr Glu Trp Asn Asp Asn Glu MetLys Ser Ala Leu Gly Asp Lys Ala Tyr Glu Trp Asn Asp Asn Glu Met

625 630 635 640625 630 635 640

Tyr Leu Phe Arg Ser Ser Val Ala Tyr Ala Met Arg Gln Tyr Phe LeuTyr Leu Phe Arg Ser Ser Val Ala Tyr Ala Met Arg Gln Tyr Phe Leu

645 650 655645 650 655

Lys Val Lys Asn Gln Met Ile Leu Phe Gly Glu Glu Asp Val Arg ValLys Val Lys Asn Gln Met Ile Leu Phe Gly Glu Glu Asp Val Arg Val

660 665 670660 665 670

Ala Asn Leu Lys Pro Arg Ile Ser Phe Asn Phe Phe Val Thr Ala ProAla Asn Leu Lys Pro Arg Ile Ser Phe Asn Phe Phe Val Thr Ala Pro

675 680 685675 680 685

Lys Asn Val Ser Asp Ile Ile Pro Arg Thr Glu Val Glu Lys Ala IleLys Asn Val Ser Asp Ile Ile Pro Arg Thr Glu Val Glu Lys Ala Ile

690 695 700690 695 700

Arg Met Ser Arg Ser Arg Ile Asn Asp Ala Phe Arg Leu Asn Asp AsnArg Met Ser Arg Ser Arg Ile Asn Asp Ala Phe Arg Leu Asn Asp Asn

705 710 715 720705 710 715 720

Ser Leu Glu Phe Leu Gly Ile Gln Pro Thr Leu Gly Pro Pro Asn GlnSer Leu Glu Phe Leu Gly Ile Gln Pro Thr Leu Gly Pro Pro Asn Gln

725 730 735725 730 735

Pro Pro Val Ser Ile Trp Leu Ile Val Phe Gly Val Val Met Gly ValPro Pro Val Ser Ile Trp Leu Ile Val Phe Gly Val Val Met Gly Val

740 745 750740 745 750

Ile Val Val Gly Ile Val Ile Leu Ile Phe Thr Gly Ile Arg Asp ArgIle Val Val Gly Ile Val Ile Leu Ile Phe Thr Gly Ile Arg Asp Arg

755 760 765755 760 765

Lys Lys Lys Asn Lys Ala Arg Ser Gly Glu Asn Pro Tyr Ala Ser IleLys Lys Lys Asn Lys Ala Arg Ser Gly Glu Asn Pro Tyr Ala Ser Ile

770 775 780770 775 780

Asp Ile Ser Lys Gly Glu Asn Asn Pro Gly Phe Gln Asn Thr Asp AspAsp Ile Ser Lys Gly Glu Asn Asn Pro Gly Phe Gln Asn Thr Asp Asp

785 790 795 800785 790 795 800

Val Gln Thr Ser PheVal Gln Thr Ser Phe

805805

<210> 14<210> 14

<211> 122<211> 122

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> SV40 polyA序列<223> SV40 polyA sequence

<400> 14<400> 14

aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 60aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 60

aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 120aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 120

ta 122ta 122

<210> 15<210> 15

<211> 970<211> 970

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> 5’同源臂序列<223> 5' homology arm sequence

<400> 15<400> 15

ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60

ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120

gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180

tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240

tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300

attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360

cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420

gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480

aaggagagag ggagaggatg gatagcttat tgatagaatt gtcagaaaag gctataagtt 540aaggagagag ggagaggatg gatagctttat tgatagaatt gtcagaaaag gctataagtt 540

ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600

accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660

gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720

tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780

ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840

tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900

aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960

acggggaaag 970acggggaaag 970

<210> 16<210> 16

<211> 971<211> 971

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> 3’同源臂序列<223> 3’ homology arm sequence

<400> 16<400> 16

gaattataat actaacatta ctgaagaaaa tgcccaaaag atggtaagtt cttgaggcta 60gaattataat actaacatta ctgaagaaaa tgcccaaaag atggtaagtt cttgaggcta 60

cccagggggt tattgattgc ttcttaaaga tcagaattac tgcctataaa actggataag 120cccagggggt tattgattgc ttcttaaaga tcagaattac tgcctataaa actggataag 120

gaaatcatag agatctctca agtgtgagga tgagtgactg cctctgtagc tctgatccta 180gaaatcatag agatctctca agtgtgagga tgagtgactg cctctgtagc tctgatccta 180

gtctcccaga tggctaaatt caattgacct tagagttcat ctggaaaatt gttatgaatg 240gtctcccaga tggctaaatt caattgacct tagagttcat ctggaaaatt gttatgaatg 240

aattatttgc ccagattcca aagatgagtg aaaatgttta ataaagttgc catcactatt 300aattatttgc ccagattcca aagatgagtg aaaatgttta ataaagttgc catcactatt 300

ctcattatat ttggtatgta aagcattcat ggaaatgttc taagtcgtta ttgagccaat 360ctcattatat ttggtatgta aagcattcat ggaaatgttc taagtcgtta ttgagccaat 360

aattttcttt agcttataat gccaacaggt ctatccgaga actacaaatg acatattaac 420aattttcttt agcttataat gccaacaggt ctatccgaga actacaaatg acatattaac 420

tgaaaaatgc aactggggtt tactgaaggc agcagcttag taattaaggt aaccatggct 480tgaaaaatgc aactggggtt tactgaaggc agcagcttag taattaaggt aaccatggct 480

taggtgaaac tggacctggg aattccttct ttcattgaca cagagctctg aggaatttcc 540taggtgaaac tggacctggg aattccttct ttcattgaca cagagctctg aggaatttcc 540

aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa aaaatctcag cctactctgg 600aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa aaaatctcag cctactctgg 600

gaaagcagca tattttgttt gacaagtgca aggacttaga actttttttt ttctcactga 660gaaagcagca tattttgttt gacaagtgca aggacttaga actttttttt ttctcactga 660

tcctgaagtg ccttttaagt atagttaagt ggtggaaaat tgagcaacta tttaagaaaa 720tcctgaagtg ccttttaagt atagttaagt ggtggaaaat tgagcaacta tttaagaaaa 720

gactcttttt tttcttcttc cagcaatgct ttccttcaaa acggtagctt caaaacttcc 780gactcttttt tttcttcttc cagcaatgct ttccttcaaa acggtagctt caaaacttcc 780

tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt gccattcata gaacagagtg 840tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt gccattcata gaacagagtg 840

ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc tcccagtttt ctaaaattta 900ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc tcccagtttt ctaaaattta 900

agaaaccaca gagactttga caatgtagtt gccaaatgag ttgcttttaa ctgctctaat 960agaaaccaca gagactttga caatgtagtt gccaaatgag ttgcttttaa ctgctctaat 960

agtttggtct t 971agtttggtct t 971

<210> 17<210> 17

<211> 1124<211> 1124

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> PGK-puro序列<223> PGK-puro sequence

<400> 17<400> 17

gggtagggga ggcgcttttc ccaaggcagt ctggagcatg cgctttagca gccccgctgg 60gggtagggga ggcgcttttc ccaaggcagt ctggagcatg cgctttagca gccccgctgg 60

gcacttggcg ctacacaagt ggcctctggc ctcgcacaca ttccacatcc cccggtaggc 120gcacttggcg ctacacaagt ggcctctggc ctcgcacaca ttccacatcc cccggtaggc 120

gccaaccggc tccgttcttt ggtggcccct tcgcgccacc ttctactcct cccctagtca 180gccaaccggc tccgttcttt ggtggcccct tcgcgccacc ttctactcct cccctagtca 180

ggaagttccc ccccgccccg cagctcgcgt cgtgcaggac gtgacaaatg gaagtagcac 240ggaagttccc ccccgccccg cagctcgcgt cgtgcaggac gtgacaaatg gaagtagcac 240

gtctcactag tctcgtgcag atggacagca ccgctgagca atggaagcgg gtaggccttt 300gtctcactag tctcgtgcag atggacagca ccgctgagca atggaagcgg gtaggccttt 300

ggggcagcgg ccaatagcag ctttgctcct tcgctttctg ggctcagagg ctgggaaggg 360ggggcagcgg ccaatagcag ctttgctcct tcgctttctg ggctcagagg ctgggaaggg 360

gtgggtccgg gggcgggctc aggggcgggc tcaggggcgg ggcgggcgcc cgaaggtcct 420gtgggtccgg gggcgggctc aggggcgggc tcaggggcgg ggcgggcgcc cgaaggtcct 420

ccggaggccc ggcattctgc acgcttcaaa agcgcacgtc tgccgcgctg ttctcctctt 480ccggaggccc ggcattctgc acgcttcaaa agcgcacgtc tgccgcgctg ttctcctctt 480

cctcatctcc gggcctttcg acctgcagcc caagctagct taccatgacc gagtacaagc 540cctcatctcc gggcctttcg acctgcagcc caagctagct taccatgacc gagtacaagc 540

ccacggtgcg cctcgccacc cgcgacgacg tccccagggc cgtacgcacc ctcgccgccg 600ccacggtgcg cctcgccacc cgcgacgacg tccccagggc cgtacgcacc ctcgccgccg 600

cgttcgccga ctaccccgcc acgcgccaca ccgtcgatcc ggaccgccac atcgagcggg 660cgttcgccga ctaccccgcc acgcgccaca ccgtcgatcc ggaccgccac atcgagcggg 660

tcaccgagct gcaagaactc ttcctcacgc gcgtcgggct cgacatcggc aaggtgtggg 720tcaccgagct gcaagaactc ttcctcacgc gcgtcgggct cgacatcggc aaggtgtggg 720

tcgcggacga cggcgccgcg gtggcggtct ggaccacgcc ggagagcgtc gaagcggggg 780tcgcggacga cggcgccgcg gtggcggtct ggaccacgcc ggagagcgtc gaagcggggg 780

cggtgttcgc cgagatcggc ccgcgcatgg ccgagttgag cggttcccgg ctggccgcgc 840cggtgttcgc cgagatcggc ccgcgcatgg ccgagttgag cggttcccgg ctggccgcgc 840

agcaacagat ggaaggcctc ctggcgccgc accggcccaa ggagcccgcg tggttcctgg 900agcaacagat ggaaggcctc ctggcgccgc accggcccaa ggagcccgcg tggttcctgg 900

ccaccgtcgg cgtctcgccc gaccaccagg gcaagggtct gggcagcgcc gtcgtgctcc 960ccaccgtcgg cgtctcgccc gaccaccagg gcaagggtct gggcagcgcc gtcgtgctcc 960

ccggagtgga ggcggccgag cgcgccgggg tgcccgcctt cctggagacc tccgcgcccc 1020ccggagtgga ggcggccgag cgcgccgggg tgcccgcctt cctggagacc tccgcgcccc 1020

gcaacctccc cttctacgag cggctcggct tcaccgtcac cgccgacgtc gaggtgcccg 1080gcaacctccc cttctacgag cggctcggct tcaccgtcac cgccgacgtc gaggtgcccg 1080

aaggaccgcg cacctggtgc atgacccgca agcccggtgc ctga 1124aaggaccgcg cacctggtgc atgacccgca agcccggtgc ctga 1124

<210> 18<210> 18

<211> 34<211> 34

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> Frt序列<223> Frt sequence

<400> 18<400> 18

gaagttccta ttctctagaa agtataggaa cttc 34gaagttccta ttctctagaa agtataggaa cttc 34

<210> 19<210> 19

<211> 50<211> 50

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 5arm-pcrF<223> 5arm-pcrF

<400> 19<400> 19

tcgcacacat tccacatcca ccggtcccta tggagtggag aagagtctta 50tcgcacacat tccacatcca ccggtcccta tggagtggag aagagtctta 50

<210> 20<210> 20

<211> 47<211> 47

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 5arm-pcrR<223> 5arm-pcrR

<400> 20<400> 20

gaaggagcca ggaagagctt gacatctttc cccgtgcgcc aagatcc 47gaaggagcca ggaagagctt gacatctttc cccgtgcgcc aagatcc 47

<210> 21<210> 21

<211> 47<211> 47

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> hACE2-F<223> hACE2-F

<400> 21<400> 21

ggatcttggc gcacggggaa agatgtcaag ctcttcctgg ctccttc 47ggatcttggc gcacggggaa agatgtcaag ctcttcctgg ctccttc 47

<210> 22<210> 22

<211> 49<211> 49

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> hACE2-R<223> hACE2-R

<400> 22<400> 22

cattataagc tgcaataaac aagttctaaa aggaggtctg aacatcatc 49cattataagc tgcaataaac aagttctaaa aggaggtctg aacatcatc 49

<210> 23<210> 23

<211> 49<211> 49

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> SV40-F<223> SV40-F

<400> 23<400> 23

gatgatgttc agacctcctt ttagaacttg tttattgcag cttataatg 49gatgatgttc agacctcctt ttagaacttg tttattgcag cttataatg 49

<210> 24<210> 24

<211> 48<211> 48

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> SV40-R<223> SV40-R

<400> 24<400> 24

agagaatagg aacttcgcac gcgttaagat acattgatga gtttggac 48agagaatagg aacttcgcac gcgttaagat acattgatga gtttggac 48

<210> 25<210> 25

<211> 50<211> 50

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 3arm-pcrF<223> 3arm-pcrF

<400> 25<400> 25

tacgaagtta tgtcgacgcg gcgcgccgaa ttataatact aacattactg 50tacgaagtta tgtcgacgcg gcgcgccgaa ttataatact aacattactg 50

<210> 26<210> 26

<211> 53<211> 53

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 3arm-pcrR<223> 3arm-pcrR

<400> 26<400> 26

tatgaccatg attacgccaa gcttaagacc aaactattag agcagttaaa agc 53tatgaccatg attacgccaa gcttaagacc aaactattag agcagttaaa agc 53

<210> 27<210> 27

<211> 8470<211> 8470

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 打靶载体序列<223> Targeting vector sequence

<400> 27<400> 27

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgta ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag 240accatatgta ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag 240

cagccccgct gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat 300cagccccgct gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat 300

ccaccggtcc ctatggagtg gagaagagtc ttataatttt ttaaatgggc agagaaatga 360ccaccggtcc ctatggagtg gagaagagtc ttataatttt ttaaatgggc agagaaatga 360

atttattttt aatttttaga gacagggttt ctttgtatag ctctagctgt ctttgattgg 420atttattttt aatttttaga gacagggttt ctttgtatag ctctagctgt ctttgattgg 420

tagacaaagc tgtcctcaaa ctcagagatc ttccttcctt tgtctcctga gtgctgggat 480tagacaaagc tgtcctcaaa ctcagagatc ttccttcctt tgtctcctga gtgctgggat 480

taaaggcatg gaccaccact gccctgcccc attctctcca ttaattttaa gtgaatgctt 540taaaggcatg gaccaccact gccctgcccc attctctcca ttaattttaa gtgaatgctt 540

gcaaaagctc acttctttgg tgaacagctt cctttacaaa taagtacctt tgccttcgtt 600gcaaaagctc acttctttgg tgaacagctt cctttacaaa taagtacctt tgccttcgtt 600

tttataggat tcttaaaaag aaaaaaaaga ttcagccagg tggttgtggt gcacaccttt 660tttataggat tcttaaaaag aaaaaaaaga ttcagccagg tggttgtggt gcacaccttt 660

aatcccagca gtcaggaggc agaggaaagc agatctcttg agtttgaggc tagcctagtc 720aatcccagca gtcaggaggc agaggaaagc agatctcttg agtttgaggc tagcctagtc 720

tacagaggga gttccaggac agccaaggct acagagagga actgtctaaa aacaccaaga 780tacagaggga gttccaggac agccaaggct acagagagga actgtctaaa aacaccaaga 780

aagagagaaa ggagagaggg agaggatgga tagcttattg atagaattgt cagaaaaggc 840aagagagaaa ggagagagggg agaggatgga tagcttattg atagaattgt cagaaaaggc 840

tataagttcc aatatgtgtc ccatgatttc taagtctagc cctttctgtt atagtaaaat 900tataagttcc aatatgtgtc ccatgatttc taagtctagc cctttctgtt atagtaaaat 900

catagtacac cctcctcctc cagtgtatct ttaacagctt ttaaggaaca tattaactaa 960catagtacac cctcctcctc cagtgtatct ttaacagctt ttaaggaaca tattaactaa 960

atgtccaggt tttgatttgg ccataaaatg ttagcaaagc taaggttttc taggattaat 1020atgtccaggt tttgatttgg ccataaaatg ttagcaaagc taaggttttc taggattaat 1020

gaataacatg tctttattta gtttacttaa aaaaatcatt ctaaaatatc tgtttacata 1080gaataacatg tctttattta gtttacttaa aaaaatcatt ctaaaatatc tgtttacata 1080

tctgtcctct ccaggattaa cttcatattg gtccagcagc ttgtttactg ttctcttctg 1140tctgtcctct ccaggattaa cttcatattg gtccagcagc ttgtttactg ttctcttctg 1140

tttcttcttc tgcttttttt ttcttctctt ctcagtgccc aacccaagtt caaaggctga 1200tttcttcttc tgcttttttt ttcttctctt ctcagtgccc aacccaagtt caaaggctga 1200

tgagagagaa aaactcatga agagatttta ctctagggaa agttgctcag tggatgggat 1260tgagagagaa aaactcatga agagatttta ctctagggaa agttgctcag tggatgggat 1260

cttggcgcac ggggaaagat gtcaagctct tcctggctcc ttctcagcct tgttgctgta 1320cttggcgcac ggggaaagat gtcaagctct tcctggctcc ttctcagcct tgttgctgta 1320

actgctgctc agtccaccat tgaggaacag gccaagacat ttttggacaa gtttaaccac 1380actgctgctc agtccaccat tgaggaacag gccaagacat ttttggacaa gtttaaccac 1380

gaagccgaag acctgttcta tcaaagttca cttgcttctt ggaattataa caccaatatt 1440gaagccgaag acctgttcta tcaaagttca cttgcttctt ggaattataa caccaatatt 1440

actgaagaga atgtccaaaa catgaataat gctggggaca aatggtctgc ctttttaaag 1500actgaagaga atgtccaaaa catgaataat gctggggaca aatggtctgc ctttttaaag 1500

gaacagtcca cacttgccca aatgtatcca ctacaagaaa ttcagaatct cacagtcaag 1560gaacagtcca cacttgccca aatgtatcca ctacaagaaa ttcagaatct cacagtcaag 1560

cttcagctgc aggctcttca gcaaaatggg tcttcagtgc tctcagaaga caagagcaaa 1620cttcagctgc aggctcttca gcaaaatggg tcttcagtgc tctcagaaga caagagcaaa 1620

cggttgaaca caattctaaa tacaatgagc accatctaca gtactggaaa agtttgtaac 1680cggttgaaca caattctaaa tacaatgagc accatctaca gtactggaaa agtttgtaac 1680

ccagataatc cacaagaatg cttattactt gaaccaggtt tgaatgaaat aatggcaaac 1740ccagataatc cacaagaatg cttattactt gaaccaggtt tgaatgaaat aatggcaaac 1740

agtttagact acaatgagag gctctgggct tgggaaagct ggagatctga ggtcggcaag 1800agtttagact acaatgagag gctctgggct tgggaaagct ggagatctga ggtcggcaag 1800

cagctgaggc cattatatga agagtatgtg gtcttgaaaa atgagatggc aagagcaaat 1860cagctgaggc cattatatga agagtatgtg gtcttgaaaa atgagatggc aagagcaaat 1860

cattatgagg actatgggga ttattggaga ggagactatg aagtaaatgg ggtagatggc 1920cattatgagg actatgggga ttatggaga ggagactatg aagtaaatgg ggtagatggc 1920

tatgactaca gccgcggcca gttgattgaa gatgtggaac atacctttga agagattaaa 1980tatgactaca gccgcggcca gttgattgaa gatgtggaac atacctttga agagattaaa 1980

ccattatatg aacatcttca tgcctatgtg agggcaaagt tgatgaatgc ctatccttcc 2040ccattatatg aacatcttca tgcctatgtg agggcaaagt tgatgaatgc ctatccttcc 2040

tatatcagtc caattggatg cctccctgct catttgcttg gtgatatgtg gggtagattt 2100tatatcagtc caattggatg cctccctgct catttgcttg gtgatatgtg gggtagattt 2100

tggacaaatc tgtactcttt gacagttccc tttggacaga aaccaaacat agatgttact 2160tggacaaatc tgtactcttt gacagttccc tttggacaga aaccaaacat agatgttatact 2160

gatgcaatgg tggaccaggc ctgggatgca cagagaatat tcaaggaggc cgagaagttc 2220gatgcaatgg tggaccaggc ctgggatgca cagagaatat tcaaggaggc cgagaagttc 2220

tttgtatctg ttggtcttcc taatatgact caaggattct gggaaaattc catgctaacg 2280tttgtatctg ttggtcttcc taatatgact caaggattct gggaaaattc catgctaacg 2280

gacccaggaa atgttcagaa agcagtctgc catcccacag cttgggacct ggggaagggc 2340gacccaggaa atgttcagaa agcagtctgc catcccacag cttggggacct ggggaagggc 2340

gacttcagga tccttatgtg cacaaaggtg acaatggacg acttcctgac agctcatcat 2400gacttcagga tccttatgtg cacaaaggtg acaatggacg acttcctgac agctcatcat 2400

gagatggggc atatccagta tgatatggca tatgctgcac aaccttttct gctaagaaat 2460gagatggggc atatccagta tgatatggca tatgctgcac aaccttttct gctaagaaat 2460

ggagctaatg aaggattcca tgaagctgtt ggggaaatca tgtcactttc tgcagccaca 2520ggagctaatg aaggattcca tgaagctgtt ggggaaatca tgtcactttc tgcagccaca 2520

cctaagcatt taaaatccat tggtcttctg tcacccgatt ttcaagaaga caatgaaaca 2580cctaagcatt taaaatccat tggtcttctg tcacccgatt ttcaagaaga caatgaaaca 2580

gaaataaact tcctgctcaa acaagcactc acgattgttg ggactctgcc atttacttac 2640gaaataaact tcctgctcaa acaagcactc acgattgttg ggactctgcc atttacttac 2640

atgttagaga agtggaggtg gatggtcttt aaaggggaaa ttcccaaaga ccagtggatg 2700atgttagaga agtggaggtg gatggtcttt aaaggggaaa ttcccaaaga ccagtggatg 2700

aaaaagtggt gggagatgaa gcgagagata gttggggtgg tggaacctgt gccccatgat 2760aaaaagtggt gggagatgaa gcgagagata gttggggtgg tggaacctgt gccccatgat 2760

gaaacatact gtgaccccgc atctctgttc catgtttcta atgattactc attcattcga 2820gaaacatact gtgaccccgc atctctgttc catgtttcta atgattactc attcattcga 2820

tattacacaa ggacccttta ccaattccag tttcaagaag cactttgtca agcagctaaa 2880tattacacaa ggacccttta ccaattccag tttcaagaag cactttgtca agcagctaaa 2880

catgaaggcc ctctgcacaa atgtgacatc tcaaactcta cagaagctgg acagaaactg 2940catgaaggcc ctctgcacaa atgtgacatc tcaaactcta cagaagctgg acagaaactg 2940

ttcaatatgc tgaggcttgg aaaatcagaa ccctggaccc tagcattgga aaatgttgta 3000ttcaatatgc tgaggcttgg aaaatcagaa ccctggaccc tagcattgga aaatgttgta 3000

ggagcaaaga acatgaatgt aaggccactg ctcaactact ttgagccctt atttacctgg 3060ggagcaaaga acatgaatgt aaggccactg ctcaactact ttgagccctt atttacctgg 3060

ctgaaagacc agaacaagaa ttcttttgtg ggatggagta ccgactggag tccatatgca 3120ctgaaagacc agaacaagaa ttcttttgtg ggatggagta ccgactggag tccatatgca 3120

gaccaaagca tcaaagtgag gataagccta aaatcagctc ttggagataa agcatatgaa 3180gaccaaagca tcaaagtgag gataagccta aaatcagctc ttggagataa agcatatgaa 3180

tggaacgaca atgaaatgta cctgttccga tcatctgttg catatgctat gaggcagtac 3240tggaacgaca atgaaatgta cctgttccga tcatctgttg catatgctat gaggcagtac 3240

tttttaaaag taaaaaatca gatgattctt tttggggagg aggatgtgcg agtggctaat 3300tttttaaaag taaaaaatca gatgattctt tttggggagg aggatgtgcg agtggctaat 3300

ttgaaaccaa gaatctcctt taatttcttt gtcactgcac ctaaaaatgt gtctgatatc 3360ttgaaaccaa gaatctcctt taatttcttt gtcactgcac ctaaaaatgt gtctgatatc 3360

attcctagaa ctgaagttga aaaggccatc aggatgtccc ggagccgtat caatgatgct 3420attcctagaa ctgaagttga aaaggccatc aggatgtccc ggagccgtat caatgatgct 3420

ttccgtctga atgacaacag cctagagttt ctggggatac agccaacact tggacctcct 3480ttccgtctga atgacaacag cctagagttt ctggggatac agccaacact tggacctcct 3480

aaccagcccc ctgtttccat atggctgatt gtttttggag ttgtgatggg agtgatagtg 3540aaccagcccc ctgtttccat atggctgatt gtttttggag ttgtgatggg agtgatagtg 3540

gttggcattg tcatcctgat cttcactggg atcagagatc ggaagaagaa aaataaagca 3600gttggcattg tcatcctgat cttcactggg atcagagatc ggaagaagaa aaataaagca 3600

agaagtggag aaaatcctta tgcctccatc gatattagca aaggagaaaa taatccagga 3660agaagtggag aaaatcctta tgcctccatc gatattagca aaggagaaaa taatccagga 3660

ttccaaaaca ctgatgatgt tcagacctcc ttttagaact tgtttattgc agcttataat 3720ttccaaaaca ctgatgatgt tcagacctcc ttttagaact tgtttatattgc agcttataat 3720

ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat 3780ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat 3780

tctagttgtg gtttgtccaa actcatcaat gtatcttaac gcgtgcgaag ttcctattct 3840tctagttgtg gtttgtccaa actcatcaat gtatcttaac gcgtgcgaag ttcctattct 3840

ctagaaagta taggaacttc atcgataccg ggtaggggag gcgcttttcc caaggcagtc 3900ctagaaagta taggaacttc atcgataccg ggtagggggag gcgcttttcc caaggcagtc 3900

tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg gcctctggcc 3960tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg gcctctggcc 3960

tcgcacacat tccacatccc ccggtaggcg ccaaccggct ccgttctttg gtggcccctt 4020tcgcacacat tccacatccc ccggtaggcg ccaaccggct ccgttctttg gtggcccctt 4020

cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc agctcgcgtc 4080cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc agctcgcgtc 4080

gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga tggacagcac 4140gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga tggacagcac 4140

cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc tttgctcctt 4200cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc tttgctcctt 4200

cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca ggggcgggct 4260cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca ggggcgggct 4260

caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca cgcttcaaaa 4320caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca cgcttcaaaa 4320

gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga cctgcagccc 4380gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga cctgcagccc 4380

aagctagctt accatgaccg agtacaagcc cacggtgcgc ctcgccaccc gcgacgacgt 4440aagctagctt accatgaccg agtacaagcc cacggtgcgc ctcgccaccc gcgacgacgt 4440

ccccagggcc gtacgcaccc tcgccgccgc gttcgccgac taccccgcca cgcgccacac 4500ccccagggcc gtacgcaccc tcgccgccgc gttcgccgac taccccgcca cgcgccacac 4500

cgtcgatccg gaccgccaca tcgagcgggt caccgagctg caagaactct tcctcacgcg 4560cgtcgatccg gaccgccaca tcgagcgggt caccgagctg caagaactct tcctcacgcg 4560

cgtcgggctc gacatcggca aggtgtgggt cgcggacgac ggcgccgcgg tggcggtctg 4620cgtcgggctc gacatcggca aggtgtgggt cgcggacgac ggcgccgcgg tggcggtctg 4620

gaccacgccg gagagcgtcg aagcgggggc ggtgttcgcc gagatcggcc cgcgcatggc 4680gaccacgccg gagagcgtcg aagcgggggc ggtgttcgcc gagatcggcc cgcgcatggc 4680

cgagttgagc ggttcccggc tggccgcgca gcaacagatg gaaggcctcc tggcgccgca 4740cgagttgagc ggttcccggc tggccgcgca gcaacagatg gaaggcctcc tggcgccgca 4740

ccggcccaag gagcccgcgt ggttcctggc caccgtcggc gtctcgcccg accaccaggg 4800ccggcccaag gagcccgcgt ggttcctggc caccgtcggc gtctcgcccg accaccaggg 4800

caagggtctg ggcagcgccg tcgtgctccc cggagtggag gcggccgagc gcgccggggt 4860caagggtctg ggcagcgccg tcgtgctccc cggagtggag gcggccgagc gcgccggggt 4860

gcccgccttc ctggagacct ccgcgccccg caacctcccc ttctacgagc ggctcggctt 4920gcccgccttc ctggagacct ccgcgccccg caacctcccc ttctacgagc ggctcggctt 4920

caccgtcacc gccgacgtcg aggtgcccga aggaccgcgc acctggtgca tgacccgcaa 4980caccgtcacc gccgacgtcg aggtgcccga aggaccgcgc acctggtgca tgacccgcaa 4980

gcccggtgcc tgaggtacct ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 5040gcccggtgcc tgaggtacct ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 5040

cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 5100cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 5100

tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catcgatgaa 5160tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catcgatgaa 5160

gttcctattc tctagaaagt ataggaactt ctaacctccc gggtgacaga taacttcgta 5220gttcctattc tctagaaagt ataggaactt ctaacctccc gggtgacaga taacttcgta 5220

taatgtatgc tatacgaagt tatgtcgacg cggcgcgccg aattataata ctaacattac 5280taatgtatgc tatacgaagt tatgtcgacg cggcgcgccg aattataata ctaacattac 5280

tgaagaaaat gcccaaaaga tggtaagttc ttgaggctac ccagggggtt attgattgct 5340tgaagaaaat gcccaaaaga tggtaagttc ttgaggctac ccagggggtt attgattgct 5340

tcttaaagat cagaattact gcctataaaa ctggataagg aaatcataga gatctctcaa 5400tcttaaagat cagaattact gcctataaaa ctggataagg aaatcataga gatctctcaa 5400

gtgtgaggat gagtgactgc ctctgtagct ctgatcctag tctcccagat ggctaaattc 5460gtgtgaggat gagtgactgc ctctgtagct ctgatcctag tctcccagat ggctaaattc 5460

aattgacctt agagttcatc tggaaaattg ttatgaatga attatttgcc cagattccaa 5520aattgacctt agagttcatc tggaaaattg ttatgaatga attatttgcc cagattccaa 5520

agatgagtga aaatgtttaa taaagttgcc atcactattc tcattatatt tggtatgtaa 5580agatgagtga aaatgtttaa taaagttgcc atcactattc tcattatatt tggtatgtaa 5580

agcattcatg gaaatgttct aagtcgttat tgagccaata attttcttta gcttataatg 5640agcattcatg gaaatgttct aagtcgttat tgagccaata attttcttta gcttataatg 5640

ccaacaggtc tatccgagaa ctacaaatga catattaact gaaaaatgca actggggttt 5700ccaacaggtc tatccgagaa ctacaaatga catattaact gaaaaatgca actggggttt 5700

actgaaggca gcagcttagt aattaaggta accatggctt aggtgaaact ggacctggga 5760actgaaggca gcagcttagt aattaaggta accatggctt aggtgaaact ggacctggga 5760

attccttctt tcattgacac agagctctga ggaatttcca aaggtcacag aagaaaagct 5820attccttctt tcattgacac agagctctga ggaatttcca aaggtcacag aagaaaagct 5820

ataattaaac tagtcccaaa aaatctcagc ctactctggg aaagcagcat attttgtttg 5880ataattaaac tagtcccaaa aaatctcagc ctactctggg aaagcagcat attttgtttg 5880

acaagtgcaa ggacttagaa cttttttttt tctcactgat cctgaagtgc cttttaagta 5940acaagtgcaa ggacttagaa cttttttttt tctcactgat cctgaagtgc cttttaagta 5940

tagttaagtg gtggaaaatt gagcaactat ttaagaaaag actctttttt ttcttcttcc 6000tagttaagtg gtggaaaatt gagcaactat ttaagaaaag actctttttt ttcttcttcc 6000

agcaatgctt tccttcaaaa cggtagcttc aaaacttcct gtcttttaaa tgatcagggg 6060agcaatgctt tccttcaaaa cggtagcttc aaaacttcct gtcttttaaa tgatcagggg 6060

gctgtgtgtt taaattattg ccattcatag aacagagtgg gtctgaggat gcctgtttcc 6120gctgtgtgtt taaattattg ccattcatag aacagagtgg gtctgaggat gcctgtttcc 6120

tttgaaattc tatgccccct cccagttttc taaaatttaa gaaaccacag agactttgac 6180tttgaaattc tatgccccct cccagttttc taaaatttaa gaaaccacag agactttgac 6180

aatgtagttg ccaaatgagt tgcttttaac tgctctaata gtttggtctt aagcttggcg 6240aatgtagttg ccaaatgagt tgcttttaac tgctctaata gtttggtctt aagcttggcg 6240

taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat tccacacaac 6300taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat tccacacaac 6300

atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 6360atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 6360

ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 6420ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 6420

taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 6480taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 6480

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 6540tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 6540

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6600aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6600

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6660aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6660

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6720ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6720

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6780acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6780

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6840ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6840

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6900tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6900

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6960tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6960

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 7020gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 7020

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 7080agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 7080

tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 7140tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 7140

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 7200agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 7200

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 7260tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 7260

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 7320acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 7320

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 7380tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 7380

agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 7440agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 7440

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 7500tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 7500

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 7560acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 7560

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 7620tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 7620

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 7680ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 7680

agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 7740agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 7740

tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 7800tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 7800

acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 7860acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 7860

agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 7920agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 7920

actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 7980actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 7980

tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 8040tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 8040

gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 8100gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 8100

ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 8160ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 8160

tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 8220tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 8220

aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 8280aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 8280

tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 8340tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 8340

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 8400tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 8400

gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 8460gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 8460

ccctttcgtc 8470ccctttcgtc 8470

<210> 28<210> 28

<211> 19<211> 19

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> Puro-F<223> Puro-F

<400> 28<400> 28

aacctcccct tctacgagc 19aacctccccttctacgagc 19

<210> 29<210> 29

<211> 24<211> 24

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> 3arm-outR<223> 3arm-outR

<400> 29<400> 29

tacagccagg atctggatgt cagc 24tacagccagg atctggatgt cagc 24

<210> 30<210> 30

<211> 5922<211> 5922

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 鼠胚胎干细胞Ace2位点基因组替换序列<223> Genomic replacement sequence of Ace2 locus in mouse embryonic stem cells

<400> 30<400> 30

acggggaaag atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc 1020acggggaaag atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc 1020

tcagtccacc attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga 1080tcagtccacc attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga 1080

agacctgttc tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga 1140agacctgttc tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga 1140

gaatgtccaa aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc 1200gaatgtccaa aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc 1200

cacacttgcc caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct 1260cacacttgcc caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct 1260

gcaggctctt cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa 1320gcaggctctt cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa 1320

cacaattcta aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa 1380cacaattcta aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa 1380

tccacaagaa tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga 1440tccacaagaa tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga 1440

ctacaatgag aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag 1500ctacaatgag aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag 1500

gccattatat gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga 1560gccattat gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga 1560

ggactatggg gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta 1620ggactatggg gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta 1620

cagccgcggc cagttgattg aagatgtgga acataccttt gaagagatta aaccattata 1680cagccgcggc cagttgattg aagatgtgga acataccttt gaagagatta aaccattata 1680

tgaacatctt catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag 1740tgaacatctt catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag 1740

tccaattgga tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa 1800tccaattgga tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa 1800

tctgtactct ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat 1860tctgtactct ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat 1860

ggtggaccag gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc 1920ggtggaccag gcctggggatg cacagagaat attcaaggag gccgagaagt tctttgtatc 1920

tgttggtctt cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg 1980tgttggtctt cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg 1980

aaatgttcag aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag 2040aaatgttcag aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag 2040

gatccttatg tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg 2100gatccttatg tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg 2100

gcatatccag tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa 2160gcatatccag tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa 2160

tgaaggattc catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca 2220tgaaggattc catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca 2220

tttaaaatcc attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa 2280tttaaaatcc attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa 2280

cttcctgctc aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga 2340cttcctgctc aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga 2340

gaagtggagg tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg 2400gaagtggagg tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg 2400

gtgggagatg aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata 2460gtgggagatg aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata 2460

ctgtgacccc gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac 2520ctgtgacccc gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac 2520

aaggaccctt taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg 2580aaggaccctt taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg 2580

ccctctgcac aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat 2640ccctctgcac aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat 2640

gctgaggctt ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa 2700gctgaggctt ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa 2700

gaacatgaat gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga 2760gaacatgaat gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga 2760

ccagaacaag aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag 2820ccagaacaag aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag 2820

catcaaagtg aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga 2880catcaaagtg aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga 2880

caatgaaatg tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa 2940caatgaaatg tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa 2940

agtaaaaaat cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc 3000agtaaaaaat cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc 3000

aagaatctcc tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag 3060aagaatctcc tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag 3060

aactgaagtt gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct 3120aactgaagtt gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct 3120

gaatgacaac agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc 3180gaatgacaac agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc 3180

ccctgtttcc atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat 3240ccctgtttcc atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat 3240

tgtcatcctg atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg 3300tgtcatcctg atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg 3300

agaaaatcct tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa 3360agaaaatcct tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa 3360

cactgatgat gttcagacct ccttttagaa cttgtttatt gcagcttata atggttacaa 3420cactgatgat gttcagacct ccttttagaa cttgtttatt gcagcttata atggttacaa 3420

ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480

tggtttgtcc aaactcatca atgtatctta acgcgtgcga agttcctatt ctctagaaag 3540tggtttgtcc aaactcatca atgtatctta acgcgtgcga agttcctatt ctctagaaag 3540

tataggaact tcatcgatac cgggtagggg aggcgctttt cccaaggcag tctggagcat 3600tataggaact tcatcgatac cgggtagggg aggcgctttt cccaaggcag tctggagcat 3600

gcgctttagc agccccgctg ggcacttggc gctacacaag tggcctctgg cctcgcacac 3660gcgctttagc agccccgctg ggcacttggc gctacacaag tggcctctgg cctcgcacac 3660

attccacatc ccccggtagg cgccaaccgg ctccgttctt tggtggcccc ttcgcgccac 3720attccacatc ccccggtagg cgccaaccgg ctccgttctt tggtggcccc ttcgcgccac 3720

cttctactcc tcccctagtc aggaagttcc cccccgcccc gcagctcgcg tcgtgcagga 3780cttctactcc tcccctagtc aggaagttcc cccccgcccc gcagctcgcg tcgtgcagga 3780

cgtgacaaat ggaagtagca cgtctcacta gtctcgtgca gatggacagc accgctgagc 3840cgtgacaaat ggaagtagca cgtctcacta gtctcgtgca gatggacagc accgctgagc 3840

aatggaagcg ggtaggcctt tggggcagcg gccaatagca gctttgctcc ttcgctttct 3900aatggaagcg ggtaggcctt tggggcagcg gccaatagca gctttgctcc ttcgctttct 3900

gggctcagag gctgggaagg ggtgggtccg ggggcgggct caggggcggg ctcaggggcg 3960gggctcagag gctgggaagg ggtgggtccg ggggcgggct caggggcggg ctcaggggcg 3960

gggcgggcgc ccgaaggtcc tccggaggcc cggcattctg cacgcttcaa aagcgcacgt 4020gggcgggcgc ccgaaggtcc tccggaggcc cggcattctg cacgcttcaa aagcgcacgt 4020

ctgccgcgct gttctcctct tcctcatctc cgggcctttc gacctgcagc ccaagctagc 4080ctgccgcgct gttctcctct tcctcatctc cgggcctttc gacctgcagc ccaagctagc 4080

ttaccatgac cgagtacaag cccacggtgc gcctcgccac ccgcgacgac gtccccaggg 4140ttaccatgac cgagtacaag cccacggtgc gcctcgccac ccgcgacgac gtccccaggg 4140

ccgtacgcac cctcgccgcc gcgttcgccg actaccccgc cacgcgccac accgtcgatc 4200ccgtacgcac cctcgccgcc gcgttcgccg actaccccgc cacgcgccac accgtcgatc 4200

cggaccgcca catcgagcgg gtcaccgagc tgcaagaact cttcctcacg cgcgtcgggc 4260cggaccgcca catcgagcgg gtcaccgagc tgcaagaact cttcctcacg cgcgtcgggc 4260

tcgacatcgg caaggtgtgg gtcgcggacg acggcgccgc ggtggcggtc tggaccacgc 4320tcgacatcgg caaggtgtgg gtcgcggacg acggcgccgc ggtggcggtc tggaccacgc 4320

cggagagcgt cgaagcgggg gcggtgttcg ccgagatcgg cccgcgcatg gccgagttga 4380cggagagcgt cgaagcgggg gcggtgttcg ccgagatcgg cccgcgcatg gccgagttga 4380

gcggttcccg gctggccgcg cagcaacaga tggaaggcct cctggcgccg caccggccca 4440gcggttcccg gctggccgcg cagcaacaga tggaaggcct cctggcgccg caccggccca 4440

aggagcccgc gtggttcctg gccaccgtcg gcgtctcgcc cgaccaccag ggcaagggtc 4500aggagcccgc gtggttcctg gccaccgtcg gcgtctcgcc cgaccaccag ggcaagggtc 4500

tgggcagcgc cgtcgtgctc cccggagtgg aggcggccga gcgcgccggg gtgcccgcct 4560tgggcagcgc cgtcgtgctc cccggagtgg aggcggccga gcgcgccggg gtgcccgcct 4560

tcctggagac ctccgcgccc cgcaacctcc ccttctacga gcggctcggc ttcaccgtca 4620tcctggagac ctccgcgccc cgcaacctcc ccttctacga gcggctcggc ttcaccgtca 4620

ccgccgacgt cgaggtgccc gaaggaccgc gcacctggtg catgacccgc aagcccggtg 4680ccgccgacgt cgaggtgccc gaaggaccgc gcacctggtg catgacccgc aagcccggtg 4680

cctgaggtac ctctcatgct ggagttcttc gcccacccca acttgtttat tgcagcttat 4740cctgaggtac ctctcatgct ggagttcttc gcccaccccca acttgtttat tgcagctttat 4740

aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 4800aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatttttttcactg 4800

cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatcgatg aagttcctat 4860cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatcgatg aagttcctat 4860

tctctagaaa gtataggaac ttctaacctc ccgggtgaca gataacttcg tataatgtat 4920tctctagaaa gtataggaac ttctaacctc ccgggtgaca gataacttcg tataatgtat 4920

gctatacgaa gttatgtcga cgcggcgcgc cgaattataa tactaacatt actgaagaaa 4980gctatacgaa gttatgtcga cgcggcgcgc cgaattataa tactaacatt actgaagaaa 4980

atgcccaaaa gatggtaagt tcttgaggct acccaggggg ttattgattg cttcttaaag 5040atgcccaaaa gatggtaagt tcttgaggct acccaggggg ttattgattg cttcttaaag 5040

atcagaatta ctgcctataa aactggataa ggaaatcata gagatctctc aagtgtgagg 5100atcagaatta ctgcctataa aactggataa ggaaatcata gagatctctc aagtgtgagg 5100

atgagtgact gcctctgtag ctctgatcct agtctcccag atggctaaat tcaattgacc 5160atgagtgact gcctctgtag ctctgatcct agtctcccag atggctaaat tcaattgacc 5160

ttagagttca tctggaaaat tgttatgaat gaattatttg cccagattcc aaagatgagt 5220ttagagttca tctggaaaat tgttatgaat gaattatttg cccagattcc aaagatgagt 5220

gaaaatgttt aataaagttg ccatcactat tctcattata tttggtatgt aaagcattca 5280gaaaatgttt aataaagttg ccatcactat tctcattata tttggtatgt aaagcattca 5280

tggaaatgtt ctaagtcgtt attgagccaa taattttctt tagcttataa tgccaacagg 5340tggaaatgtt ctaagtcgtt attgagccaa taattttctt tagcttataa tgccaacagg 5340

tctatccgag aactacaaat gacatattaa ctgaaaaatg caactggggt ttactgaagg 5400tctatccgag aactacaaat gacatattaa ctgaaaaatg caactggggt ttactgaagg 5400

cagcagctta gtaattaagg taaccatggc ttaggtgaaa ctggacctgg gaattccttc 5460cagcagctta gtaattaagg taaccatggc ttaggtgaaa ctggacctgg gaattccttc 5460

tttcattgac acagagctct gaggaatttc caaaggtcac agaagaaaag ctataattaa 5520tttcattgac acagagctct gaggaatttc caaaggtcac agaagaaaag ctataattaa 5520

actagtccca aaaaatctca gcctactctg ggaaagcagc atattttgtt tgacaagtgc 5580actagtccca aaaaatctca gcctactctg ggaaagcagc atattttgtt tgacaagtgc 5580

aaggacttag aacttttttt tttctcactg atcctgaagt gccttttaag tatagttaag 5640aaggacttag aacttttttt tttctcactg atcctgaagt gccttttaag tatagttaag 5640

tggtggaaaa ttgagcaact atttaagaaa agactctttt ttttcttctt ccagcaatgc 5700tggtggaaaa ttgagcaact atttaagaaa agactctttt ttttcttctt ccagcaatgc 5700

tttccttcaa aacggtagct tcaaaacttc ctgtctttta aatgatcagg gggctgtgtg 5760tttccttcaa aacggtagct tcaaaacttc ctgtctttta aatgatcagg gggctgtgtg 5760

tttaaattat tgccattcat agaacagagt gggtctgagg atgcctgttt cctttgaaat 5820tttaaattat tgccattcat agaacagagt gggtctgagg atgcctgttt cctttgaaat 5820

tctatgcccc ctcccagttt tctaaaattt aagaaaccac agagactttg acaatgtagt 5880tctatgcccc ctcccagttt tctaaaattt aagaaaccac agagactttg acaatgtagt 5880

tgccaaatga gttgctttta actgctctaa tagtttggtc tt 5922tgccaaatga gttgctttta actgctctaa tagtttggtc tt 5922

<210> 31<210> 31

<211> 19<211> 19

<212> DNA<212> DNA

<213> Natural sequence<213> Natural sequence

<220><220>

<223> hACE2-F<223> hACE2-F

<400> 31<400> 31

tgatagtggt tggcattgt 19tgatagtggt tggcattgt 19

<210> 32<210> 32

<211> 4591<211> 4591

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 最终删除PGK-puro后小鼠胚胎干细胞Ace2位点基因组替换序列<223> Genomic replacement sequence of the Ace2 locus in mouse embryonic stem cells after final deletion of PGK-puro

<400> 32<400> 32

tataggaact tctaacctcc cgggtgacag ataacttcgt ataatgtatg ctatacgaag 3600tataggaact tctaacctcc cgggtgacag ataacttcgt ataatgtatg ctatacgaag 3600

ttatgtcgac gcggcgcgcc gaattataat actaacatta ctgaagaaaa tgcccaaaag 3660ttatgtcgac gcggcgcgcc gaattataat actaacatta ctgaagaaaa tgcccaaaag 3660

atggtaagtt cttgaggcta cccagggggt tattgattgc ttcttaaaga tcagaattac 3720atggtaagtt cttgaggcta cccagggggt tattgattgc ttcttaaaga tcagaattac 3720

tgcctataaa actggataag gaaatcatag agatctctca agtgtgagga tgagtgactg 3780tgcctataaa actggataag gaaatcatag agatctctca agtgtgagga tgagtgactg 3780

cctctgtagc tctgatccta gtctcccaga tggctaaatt caattgacct tagagttcat 3840cctctgtagc tctgatccta gtctcccaga tggctaaatt caattgacct tagagttcat 3840

ctggaaaatt gttatgaatg aattatttgc ccagattcca aagatgagtg aaaatgttta 3900ctggaaaatt gttatgaatg aattatttgc ccagattcca aagatgagtg aaaatgttta 3900

ataaagttgc catcactatt ctcattatat ttggtatgta aagcattcat ggaaatgttc 3960ataaagttgc catcactatt ctcattatat ttggtatgta aagcattcat ggaaatgttc 3960

taagtcgtta ttgagccaat aattttcttt agcttataat gccaacaggt ctatccgaga 4020taagtcgtta ttgagccaat aattttcttt agcttataat gccaacaggt ctatccgaga 4020

actacaaatg acatattaac tgaaaaatgc aactggggtt tactgaaggc agcagcttag 4080actacaaatg acatattaac tgaaaaatgc aactggggtt tactgaaggc agcagcttag 4080

taattaaggt aaccatggct taggtgaaac tggacctggg aattccttct ttcattgaca 4140taattaaggt aaccatggct taggtgaaac tggacctggg aattccttct ttcattgaca 4140

cagagctctg aggaatttcc aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa 4200cagagctctg aggaatttcc aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa 4200

aaaatctcag cctactctgg gaaagcagca tattttgttt gacaagtgca aggacttaga 4260aaaatctcag cctactctgg gaaagcagca tattttgttt gacaagtgca aggacttaga 4260

actttttttt ttctcactga tcctgaagtg ccttttaagt atagttaagt ggtggaaaat 4320actttttttt ttctcactga tcctgaagtg ccttttaagt atagttaagt ggtggaaaat 4320

tgagcaacta tttaagaaaa gactcttttt tttcttcttc cagcaatgct ttccttcaaa 4380tgagcaacta tttaagaaaa gactcttttt tttcttcttc cagcaatgct ttccttcaaa 4380

acggtagctt caaaacttcc tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt 4440acggtagctt caaaacttcc tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt 4440

gccattcata gaacagagtg ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc 4500gccattcata gaacagagtg ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc 4500

tcccagtttt ctaaaattta agaaaccaca gagactttga caatgtagtt gccaaatgag 4560tcccagtttt ctaaaattta agaaaccaca gagactttga caatgtagtt gccaaatgag 4560

ttgcttttaa ctgctctaat agtttggtct t 4591ttgcttttaa ctgctctaat agtttggtct t 4591

<210> 33<210> 33

<211> 18<211> 18

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> hACE2-qF<223> hACE2-qF

<400> 33<400> 33

ggtcttcagt gctctcag 18ggtcttcagt gctctcag 18

<210> 34<210> 34

<211> 21<211> 21

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> hACE2-qR<223> hACE2-qR

<400> 34<400> 34

gcattcttgt ggattatctg g 21gcattcttgt ggattatctg g 21

<210> 35<210> 35

<211> 22<211> 22

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 正向引物<223> Forward primer

<400> 35<400> 35

ggggaacttc tcctgctaga at 22ggggaacttc tcctgctaga at 22

<210> 36<210> 36

<211> 22<211> 22

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> 反向引物<223> Reverse primer

<400> 36<400> 36

cagacatttt gctctcaagc tg 22cagacatttt gctctcaagc tg 22

<210> 37<210> 37

<211> 20<211> 20

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<220><220>

<223> TaqMan 探针<223> TaqMan Probes

<400> 37<400> 37

ttgctgctgc ttgacagatt 20ttgctgctgc ttgacagatt 20

Claims

1. The sgRNA is characterized in that the sequence of the sgRNA is shown as SEQ ID NO. 1 or SEQ ID NO. 2.

2. The sgRNA of claim 1, wherein the sequence of the primers upstream and downstream of the synthesis of the sgRNA of SEQ ID NO. 2 is shown as SEQ ID NO. 4 and SEQ ID NO. 5, respectively.

3. The sgRNA of claim 1, wherein the sequences of the primers upstream and downstream of the synthesis of the sgRNA of SEQ ID No. 1 are shown in SEQ ID No. 6 and SEQ ID No. 7, respectively.

4. The sgRNA of claim 1, wherein the sgRNA-targeted gene is an animal ACE2 gene for site-directed cleavage of the ACE2 gene.

5. The sgRNA of claim 1, wherein the sgRNA is used to construct a humanized animal model.

6. The sgRNA of claim 1, wherein the target point of the sgRNA is located on exon 1 of the ACE2 gene of the animal.

7. The sgRNA of claim 6, wherein the target is within 140bp of the 5' utr of the ACE2 gene of the animal.

8. The sgRNA of any one of claims 4 to 7, wherein the animal is a mammal.

9. The sgRNA of claim 8, wherein the mammal is a rodent.

10. The sgRNA of claim 9, wherein the rodent is a mouse.

11. A carrier assembly, the carrier assembly comprising: (1) A vector comprising the sgRNA of any one of claims 1-10; (2) A targeting vector comprising a 5 'homology arm, a human ACE2 gene fragment, an SV40 polyA sequence, and a 3' homology arm; the targeting vector targets an animal ACE2 gene, the animal ACE2 gene comprising a target corresponding to the sgRNA of any one of claims 1 to 10.

12. The vector combination of claim 11, wherein the 5' homology arm is a 5' homology arm homologous to a 5' target sequence at a genomic locus of interest.

13. The vector combination of claim 12, wherein the targeting vector is used to initiate expression of a human gene of interest using an animal gene promoter after insertion of the CDS sequence of the human gene into the promoter and 5' utr region sequences of the animal gene.

14. The vector combination of claim 12, wherein the 3' homology arm is a 3' homology arm that is homologous to a 3' target sequence at a genomic locus of interest.

15. The vector combination of claim 14, wherein the targeting vector further comprises the screening marker PGK-Puro.

16. The carrier combination of claim 11, wherein the animal is a mammal.

17. The vector combination of claim 16, wherein the mammal is a rodent.

18. The vector combination of claim 13, wherein the animal is a mouse.

19. A DNA molecule encoding the sgRNA of any one of claims 1-10.

20. A cell comprising the sgRNA of any one of claims 1-10 or the targeting vector of any one of claims 11-18 or the DNA molecule of claim 19, said cell being incapable of developing into an animal subject, said cell excluding germ cells.

21. Use of the sgRNA of any one of claims 1 to 10 for the preparation of a genetically humanized cell line or a genetically humanized animal model, said animal being a mouse.

22. The use of claim 21, wherein the gene is ACE2.

23. A method for constructing a humanized animal cell line, wherein the sgRNA of any one of claims 1 to 10 is used in the method, the method comprising the steps of: (1) Constructing a targeting vector according to any one of the claims 11-18; (2) Introducing the constructed targeting vector and the vector with the sgRNA of any one of claims 1 to 10 attached thereto into embryonic stem cells of animal origin; (3) Culturing the embryonic stem cells in the step (2) into clones to obtain the embryonic stem cells; the animals were mice.

24. The method of claim 23, comprising introducing a human gene of interest into the animal cell such that the gene of interest expresses the CDS of the full length human gene of interest in situ within the animal cell.

25. The method of claim 24, wherein the gene of interest is ACE2.

26. The method of claim 23, wherein the method uses gene editing techniques for the construction of the humanized animal cell model.

27. The method of claim 26, wherein the gene editing technique comprises an embryonic stem cell-based DNA homologous recombination technique, a CRISPR/Cas9 technique.

28. The ACE2 gene humanized animal cell line produced by the method of any one of claims 23-27, which is incapable of developing into an animal subject, said cells not comprising germ cells.

29. A method of constructing a genetically humanized animal model comprising injecting the animal cell of claim 28 into an animal.

30. The method of claim 29, comprising forming the animal cells into an animal subject by tetraploid compensation or blastocyst injection.

31. A tissue, body fluid, cell or a fragment thereof or an extract thereof of a humanized mouse, or a tissue, body fluid, cell or a fragment thereof or an extract thereof of a humanized offspring of a humanized mouse, wherein the humanized mouse is constructed using the method of any one of claims 29-30, wherein the cell or tissue is incapable of developing into an animal subject, and wherein the cell does not comprise germ cells.

32. Use of a humanized animal model or a progeny thereof derived from the construction method of any one of claims 29-30 for the manufacture of human antibodies, or as a model system for pharmacological, immunological, microbiological and medical research, or for the production of animal experimental disease models for etiology research and/or for the development of new diagnostic and/or therapeutic strategies, or for screening, validating, evaluating or studying ACE2 gene function, ACE2 antibodies, drugs against ACE2 targets, pharmacodynamic research.