CN112079903B

CN112079903B - Mutant of mismatching binding protein and coding gene thereof

Info

Publication number: CN112079903B
Application number: CN201910518166.5A
Authority: CN
Inventors: 张佳; 柴百惠; 徐健
Original assignee: Qingdao Institute of Bioenergy and Bioprocess Technology of CAS
Current assignee: Qingdao Institute of Bioenergy and Bioprocess Technology of CAS
Priority date: 2019-06-14
Filing date: 2019-06-14
Publication date: 2022-11-18
Anticipated expiration: 2039-06-14
Also published as: CN112079903A

Abstract

The invention provides a mismatch binding protein eMuts, the amino acid sequence of which comprises the amino acid sequence shown in SEQ ID NO. 52 and has at least one mutation selected from the group consisting of: 1) Serine at positions 120 and 151 were mutated to cysteine; 2) Leucine at position 157 and glycine at position 233 were mutated to cysteine; 3) Glutamic acid at position 451 and valine at position 465 are mutated to cysteine; 4) The 609 th methionine and the 723 th threonine are mutated into cysteine; or 5) serine at position 120, leucine at position 157, glutamic acid at position 451, and methionine at position 609 are all mutated to cysteine. The mismatch binding protein provided by the invention has higher thermal stability, and can stably and specifically identify and bind to DNA containing an error sequence, thereby improving the accuracy of the DNA. The invention also provides host cells comprising a gene encoding the mismatch binding protein and methods of producing the mismatch binding protein using the same.

Description

A mutant of mismatch binding protein and its coding gene

技术领域technical field

本发明涉及生物技术领域。具体地说，本发明涉及错配结合蛋白(简称eMutS)的突变体及其应用。The present invention relates to the field of biotechnology. Specifically, the present invention relates to a mutant of mismatch binding protein (abbreviated as eMutS) and its application.

背景技术Background technique

随着合成生物学的兴起，我们进入了创造具有新基因功能甚至全基因组的基因的时代。DNA合成具有广泛的应用，包括基因合成，代谢通路合成，甚至整个基因组的合成(Gibson et al.,2008；Gibson et al.,2010)。从头基因合成，作为合成生物学的重要技术手段，主要涉及通过基于PCR或DNA连接酶的连接反应将短链寡核苷酸(oligouncletoides，oligos)组装成长链DNA(Jingdong et al.,2004)，在合成生物学和生物技术领域发挥着越来越重要的作用(Bayer and Smolke,2005；Carr and Church,2009；Gibson et al.,2008；Wang et al.,2009)。目前，在多孔二氧化硅珠粒上合成的寡核苷酸通常用作合成反应的起始寡核苷酸，其相对昂贵且具有低的合成通量。与传统方法相比，利用微流体合成技术在固相平面(微流控芯片)上平行合成寡核苷酸，不仅降低了寡核苷酸的成本，而且大大提高了合成的通量(Gao et al.,2001；Quan et al.,2011；Richmond et al.,2004；Sriram etal.,2010；Xiaochuan et al.,2004)。然而，微芯片合成反应的每个步骤的效率低于常规方法的效率，并且具有相对较高的错误率，导致微芯片合成具有较低的合成质量和较短的寡核苷酸长度(Tian et al.,2009)。合成基因中的错误来源于寡核苷酸化学合成过程中引入的错误碱基，聚合酶的延伸以及寡核苷酸之间的错误组装(Cline et al.,1996)。由于合成错误的存在，使用克隆测序方法从多个克隆中选择具有正确序列的克隆通常是不可避免的，并且该部分的成本通常占据基因合成总成本的较大部分。因此，提高合成DNA的保真度是大规模应用基因从头合成技术的迫切要求。酶纠错方法是改善错误的经济有效的方法。通常，这些酶特异性识别并结合含有错误配对序列的DNA。因此，DNA双链通过变性退火将错误的单链和正确的单链随机组合，使得错误合成的碱基形成错配，然后使用错配结合蛋白(MutS)或错配切除酶(例如，resolvase)删除合成错误。Lubock等人系统地比较了六种不同的纠错酶，并得出结论：MutS最适合增加完美组装的数量(最多25.2倍)(Lubock et al.,2017)。With the rise of synthetic biology, we have entered the era of creating genes with new gene functions and even whole genomes. DNA synthesis has a wide range of applications, including gene synthesis, metabolic pathway synthesis, and even whole genome synthesis (Gibson et al., 2008; Gibson et al., 2010). De novo gene synthesis, as an important technique in synthetic biology, mainly involves the assembly of short-chain oligonucleotides (oligouncletoides, oligos) into long-chain DNA by ligation reactions based on PCR or DNA ligase (Jingdong et al., 2004), Playing an increasingly important role in synthetic biology and biotechnology (Bayer and Smolke, 2005; Carr and Church, 2009; Gibson et al., 2008; Wang et al., 2009). Currently, oligonucleotides synthesized on porous silica beads are generally used as starting oligonucleotides for synthesis reactions, which are relatively expensive and have low synthesis throughput. Compared with traditional methods, the use of microfluidic synthesis technology to synthesize oligonucleotides in parallel on a solid phase plane (microfluidic chip) not only reduces the cost of oligonucleotides, but also greatly improves the throughput of synthesis (Gao et al. al., 2001; Quan et al., 2011; Richmond et al., 2004; Sriram et al., 2010; Xiaochuan et al., 2004). However, the efficiency of each step of the microchip synthesis reaction is lower than that of conventional methods, and has a relatively high error rate, resulting in microchip synthesis with lower synthesis quality and shorter oligonucleotide length (Tian et al. al., 2009). Errors in synthetic genes originate from erroneous bases introduced during chemical oligonucleotide synthesis, polymerase extension, and misassembly between oligonucleotides (Cline et al., 1996). Due to the existence of synthesis errors, it is usually unavoidable to select a clone with the correct sequence from multiple clones using clone sequencing methods, and the cost of this part usually occupies a larger part of the total cost of gene synthesis. Therefore, improving the fidelity of synthetic DNA is an urgent requirement for large-scale application of de novo gene synthesis technology. Enzymatic error correction methods are a cost-effective way to improve errors. Typically, these enzymes specifically recognize and bind DNA that contains mismatched sequences. Therefore, the DNA duplex is randomly combined with the wrong single strand and the correct single strand through denaturing annealing, so that the wrongly synthesized bases form a mismatch, and then use a mismatch binding protein (MutS) or a mismatch excision enzyme (for example, resolvase) Removed synthesis errors. Lubock et al. systematically compared six different error-correcting enzymes and concluded that MutS was the most suitable for increasing the number of perfect assemblies (up to 25.2-fold) (Lubock et al., 2017).

真核生物和原核生物中普遍存在的错配修复系统(MMR)系统确保了遗传物质的精确和稳定复制。错配结合蛋白MutS是MMR的重要组分，其识别并结合许多不同的DNA错配(And and Jinksrobertson,2000；Iyer et al.,2006；Modrich,1991)。尽管蛋白质结合不同错配的能力不同，但它可以结合几乎所有单碱基错配以及1-4碱基插入或缺失突变(Whitehouse et al.,1997)。源自大肠杆菌(EcoMutS)的MutS为约97kDa，并且与G：T，A：C，A：A，G：G的结合效率高于其他错配(Brown et al.,2001)。MutS以同源二聚体的形式不对称地结合错配的DNA。The Mismatch Repair (MMR) system ubiquitous in eukaryotes and prokaryotes ensures precise and stable replication of genetic material. The mismatch binding protein MutS is an important component of MMR, which recognizes and binds many different DNA mismatches (And and Jinksrobertson, 2000; Iyer et al., 2006; Modrich, 1991). Although the ability of the protein to bind different mismatches varies, it can bind almost all single-base mismatches as well as 1-4 base insertion or deletion mutations (Whitehouse et al., 1997). MutS derived from Escherichia coli (EcoMutS) is about 97 kDa and binds G:T, A:C, A:A, G:G more efficiently than other mismatches (Brown et al., 2001). MutS asymmetrically binds mismatched DNA as a homodimer.

据报道，MutS用于校正寡核苷酸组装的基因片段(Binkowski et al.,2005；Carret al.,2004)。来自水生栖热菌(TaqMutS)的MutS通常用于错误校正，因为它比EcoMutS更具热稳定性并且对完全互补的DNA具有弱的结合能力。然而，TaqMutS与错配碱基的结合能力远弱于EcoMutS，后者直接降低了错误修正效率(Brown et al.,2001；Cho et al.,2007)。最近，建立了用于校正芯片合成的寡核苷酸文库及其组装基因中的合成错误的高通量错误校正方法(Wan et al.,2017；Wen et al.,2014)。这种使用方法，可以将具有错配结合能力的蛋白EcoMutS和TaqMutS固定化在纤维素柱上，这种方案低成本，高通量。然而，EcoMutS的热稳定性较差限制了该错误校正系统在基因合成中的大规模应用。MutS has been reported to be used to correct oligonucleotide-assembled gene fragments (Binkowski et al., 2005; Carret al., 2004). MutS from Thermus aquaticus (TaqMutS) is commonly used for error correction because it is more thermostable than EcoMutS and has weak binding capacity for fully complementary DNA. However, the binding ability of TaqMutS to mismatched bases is much weaker than that of EcoMutS, which directly reduces the efficiency of error correction (Brown et al., 2001; Cho et al., 2007). Recently, high-throughput error correction methods for correcting synthetic errors in ChIP-synthesized oligonucleotide libraries and their assembled genes were established (Wan et al., 2017; Wen et al., 2014). Using this method, the proteins EcoMutS and TaqMutS with mismatch binding ability can be immobilized on the cellulose column, which is low cost and high throughput. However, the poor thermal stability of EcoMutS limits the large-scale application of this error correction system in gene synthesis.

因此，综上所述，本领域急需具有较高热稳定性的错配结合蛋白突变体。Therefore, in summary, there is an urgent need in the art for mismatch binding protein mutants with higher thermal stability.

发明内容Contents of the invention

本发明的目的在于提供一种错配结合蛋白的突变体及此类突变体的应用。The purpose of the present invention is to provide a mutant of mismatch binding protein and the application of the mutant.

在第一方面，本发明提供一种错配结合蛋白，所述错配结合蛋白：In a first aspect, the invention provides a mismatch binding protein that:

a.包含如SEQ ID NO:52所示氨基酸序列且具有选自以下至少一组的突变：a. comprising an amino acid sequence as shown in SEQ ID NO:52 and having a mutation selected from at least one of the following groups:

1)第120位和第151位的丝氨酸均突变为半胱氨酸，1) Both serine at position 120 and position 151 are mutated to cysteine,

2)第157位的亮氨酸和第233位的甘氨酸均突变为半胱氨酸，2) Leucine at position 157 and glycine at position 233 are both mutated to cysteine,

3)第451位的谷氨酸和第465位的缬氨酸均突变为半胱氨酸，3) Both glutamic acid at position 451 and valine at position 465 are mutated into cysteine,

4)第609位的蛋氨酸和第723位的苏氨酸均突变为半胱氨酸，或4) both methionine at position 609 and threonine at position 723 are mutated to cysteine, or

5)第120位的丝氨酸、第157位的亮氨酸、第451位的谷氨酸和第609位的蛋氨酸均突变为半胱氨酸；或5) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine; or

b.具有a)所限定的序列经过1-20个氨基酸残基的取代、缺失或添加而形成的序列，且基本具有a)所限定的错配结合蛋白功能的由a)衍生的错配结合蛋白。在优选的实施方式中，所述错配结合蛋白来源于埃希氏菌属细菌，优选地，来源于大肠杆菌。b. The sequence defined in a) is formed by substitution, deletion or addition of 1-20 amino acid residues, and the mismatch binding protein derived from a) basically has the function of a mismatch binding protein as defined in a) protein. In a preferred embodiment, the mismatch binding protein is derived from bacteria of the genus Escherichia, preferably from Escherichia coli.

在优选的实施方式中，所述错配结合蛋白：In a preferred embodiment, the mismatch binding protein:

a.具有如SEQ ID NO:52所示氨基酸序列且具有选自以下至少一组的突变:a. have an amino acid sequence as shown in SEQ ID NO:52 and have a mutation selected from at least one group of the following:

4)第609位的蛋氨酸和第723位的苏氨酸突变均为半胱氨酸，或4) both the methionine at position 609 and the threonine at position 723 mutations are cysteine, or

b.具有a)所限定的序列经过一个或几个氨基酸残基，优选1-20个、更优选1-15个、更优选1-10个、更优选1-3个、最优选1个氨基酸残基的取代、缺失或添加而形成的序列，且基本具有a)所限定的错配结合蛋白功能的由a)衍生的错配结合蛋白。b. have the sequence defined in a) through one or several amino acid residues, preferably 1-20, more preferably 1-15, more preferably 1-10, more preferably 1-3, most preferably 1 amino acid residues A mismatch binding protein derived from a) that is a sequence formed by substitution, deletion or addition of residues, and basically has the function of the mismatch binding protein defined in a).

a.其氨基酸序列如SEQ ID NO:54、56、58、60或62所示；或a. its amino acid sequence is shown in SEQ ID NO: 54, 56, 58, 60 or 62; or

b.包含a)所限定的序列经过一个或几个氨基酸残基，优选1-20个、更优选1-15个、更优选1-10个、更优选1-3个、最优选1个氨基酸残基的取代、缺失或添加而形成的序列，且基本具有a)所限定的错配结合蛋白功能的由a)衍生的错配结合蛋白。b. comprising the sequence defined in a) through one or several amino acid residues, preferably 1-20, more preferably 1-15, more preferably 1-10, more preferably 1-3, most preferably 1 amino acid residues A mismatch binding protein derived from a) that is a sequence formed by substitution, deletion or addition of residues, and basically has the function of the mismatch binding protein defined in a).

在优选的实施方式中，所述错配结合蛋白的氨基酸序列如SEQ ID NO:54、56、58、60或62所示。In a preferred embodiment, the amino acid sequence of the mismatch binding protein is shown in SEQ ID NO: 54, 56, 58, 60 or 62.

在一优选例中，所述的错配结合蛋白在4℃条件下，保存90天至少保留50％以上的活性；优选地，60％-70％以上的活性；更优选地，70％-80％以上的活性，最优选，90％以上的活性。In a preferred example, the mismatch binding protein retains at least 50% of its activity when stored at 4°C for 90 days; preferably, 60%-70% of its activity; more preferably, 70%-80% % or more activity, most preferably, more than 90% activity.

在一优选例中，所述的错配结合蛋白利用固定化柱子纯化方式(MICC错配纠正系统)(Wen et al.,2014)，将合成基因的错误率降低到2/kb，优选地，降低到1/kb，最优选地，降低到0.56/kb。In a preferred example, the mismatch binding protein is purified using an immobilized column (MICC mismatch correction system) (Wen et al., 2014), which reduces the error rate of the synthetic gene to 2/kb, preferably, Reduced to 1/kb, most preferably, to 0.56/kb.

本文所用的术语“对应于”具有本领域普通技术人员通常理解的意义。具体地说，“对应于”表示两条序列经同源性或序列相同性比对后，一条序列与另一条序列中的指定位置相对应的位置。The term "corresponding to" used herein has a meaning commonly understood by those of ordinary skill in the art. Specifically, "corresponding to" means that after two sequences are aligned for homology or sequence identity, one sequence corresponds to the specified position in the other sequence.

在第二方面，本发明提供一种编码本发明第一方面所述蛋白的基因。In a second aspect, the present invention provides a gene encoding the protein described in the first aspect of the present invention.

在一优选例中，所述基因的核苷酸序列如SEQ ID NO:53、55、57、59或61所示。In a preferred example, the nucleotide sequence of the gene is shown in SEQ ID NO: 53, 55, 57, 59 or 61.

在第三方面，本发明提供包含本发明第二方面所述编码基因的载体。In a third aspect, the present invention provides a vector comprising the coding gene of the second aspect of the present invention.

在第四方面，本发明提供一种宿主细胞，所述宿主细胞含有本发明第二方面所述编码基因。In the fourth aspect, the present invention provides a host cell containing the coding gene of the second aspect of the present invention.

在优选的实施方式中，所述宿主细胞来自埃希氏菌属(Escherichia)、棒状杆菌属(Corynebacterium)、短杆菌属(Brevibacterium sp.)、芽孢杆菌属(Bacillus)、沙雷氏菌属(Serratia)或弧菌属(Vibrio)。In a preferred embodiment, the host cell is from the genus Escherichia, Corynebacterium, Brevibacterium sp., Bacillus, Serratia ( Serratia) or Vibrio.

在优选的实施方式中，所述宿主细胞是大肠杆菌(E.Coli)。In a preferred embodiment, the host cell is Escherichia coli (E. Coli).

在一优选例中，所述宿主细胞染色体上整合有本发明第二方面所述编码基因或含有本发明第三方面所述载体。In a preferred example, the host cell is chromosomally integrated with the coding gene of the second aspect of the present invention or contains the vector of the third aspect of the present invention.

在优选的实施方式中，所述宿主细胞表达所述错配结合蛋白。In a preferred embodiment, said host cell expresses said mismatch binding protein.

在第五方面，本发明提供一种制备错配结合蛋白的方法，所述方法包括以下步骤：In a fifth aspect, the present invention provides a method of preparing a mismatch binding protein, said method comprising the steps of:

a.培养本发明第四方面所述的宿主细胞，使之产生所述错配结合蛋白；和a. cultivating the host cell according to the fourth aspect of the present invention to produce the mismatch binding protein; and

b.从培养液中分离所述错配结合蛋白。b. Isolating the mismatch binding protein from the culture medium.

在优选的实施方式中，所述方法培养温度在16-45℃，更优选地，培养温度在16℃实施。In a preferred embodiment, the cultivation temperature of the method is 16-45°C, more preferably, the cultivation temperature is implemented at 16°C.

在第六方面，本发明提供本发明第一方面所述错配结合蛋白在识别并结合错配DNA的应用。In the sixth aspect, the present invention provides the use of the mismatch binding protein described in the first aspect of the present invention in recognizing and binding to mismatched DNA.

在一优选例中，本发明第一方面所述错配结合蛋白以自由液态用于识别并结合错配DNA或者与EGFP(Enhance green fluorescent protein，绿色荧光蛋白)、CBM(Cellulosebinding module，纤维素结构域)联合使用从而固定化错配结合蛋白于柱子上用于识别并结合错配DNA。优选地，本发明第一方面所述错配结合蛋白与EGFP、CBM联合使用从而固定化错配结合蛋白于柱子上热稳定性更高。In a preferred example, the mismatch binding protein described in the first aspect of the present invention is used in a free liquid state to recognize and bind to mismatched DNA or to combine with EGFP (Enhance green fluorescent protein, green fluorescent protein), CBM (Cellulosebinding module, cellulose structure) domain) to immobilize the mismatch binding protein on the column for recognition and binding to mismatched DNA. Preferably, the mismatch binding protein described in the first aspect of the present invention is used in combination with EGFP and CBM so that the immobilized mismatch binding protein has higher thermal stability on the column.

在第七方面，本发明提供制备本发明第一方面所述错配结合蛋白的方法，所述方法包括以下步骤：In the seventh aspect, the present invention provides a method for preparing the mismatch binding protein according to the first aspect of the present invention, the method comprising the following steps:

a.改造SEQ ID NO:52所示氨基酸序列的编码序列，使得编码的氨基酸序列中对应于SEQ ID NO:52所示氨基酸序列具有选自以下至少一组突变：a. Transform the coding sequence of the amino acid sequence shown in SEQ ID NO:52, so that the amino acid sequence corresponding to SEQ ID NO:52 in the encoded amino acid sequence has at least one group of mutations selected from the following:

5)第120位的丝氨酸、第157位的亮氨酸、第451位的谷氨酸和第609位的蛋氨酸均突变为半胱氨酸；5) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine;

b.将a)得到的编码序列直接转染合适的宿主细胞或经载体引入合适的宿主细胞；b. directly transfecting the coding sequence obtained in a) into a suitable host cell or introducing a vector into a suitable host cell;

c.培养b)得到的宿主细胞；c. cultivating the host cell obtained in b);

d.从步骤c)得到的培养体系中分离所述宿主细胞产生的错配结合蛋白；和d. isolating the mismatch binding protein produced by the host cell from the culture system obtained in step c); and

e.测定所述错配结合蛋白的酶活以及热稳定性。e. Determining the enzyme activity and thermal stability of the mismatch binding protein.

在第八方面，本发明提供改造野生型错配结合蛋白使之提高热稳定性的方法，所述方法包括以下步骤：In an eighth aspect, the present invention provides a method for modifying a wild-type mismatch binding protein to improve thermal stability, the method comprising the following steps:

a.将野生型错配结合蛋白的氨基酸序列与SEQ ID NO:52所示氨基酸序列作比对；和a. aligning the amino acid sequence of the wild-type mismatch binding protein with the amino acid sequence shown in SEQ ID NO:52; and

b.改造所述野生型错配结合蛋白的编码序列，使得编码的氨基酸序列中对应于SEQ ID NO:52所示氨基酸序列具有选自以下至少一组的突变：b. transforming the coding sequence of the wild-type mismatch binding protein, so that the amino acid sequence corresponding to SEQ ID NO:52 has a mutation selected from at least one of the following groups in the encoded amino acid sequence:

c.将b得到的编码序列直接转染合适的宿主细胞或经载体引入合适的宿主细胞；c. Directly transfecting the coding sequence obtained in b into a suitable host cell or introducing a vector into a suitable host cell;

d.培养c得到的宿主细胞；d. culturing the host cells obtained in c;

e.从步骤d得到的培养体系中分离所述宿主细胞产生的错配结合蛋白；和e. isolating the mismatch binding protein produced by the host cell from the culture system obtained in step d; and

f.测定所述错配结合蛋白突变体的酶活以及热稳定性。f. Determining the enzyme activity and thermal stability of the mismatch binding protein mutant.

应理解，在本发明范围内中，本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合，从而构成新的或优选的技术方案。限于篇幅，在此不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described in the following (such as embodiments) can be combined with each other to form new or preferred technical solutions. Due to space limitations, we will not repeat them here.

附图说明Description of drawings

图1展示了本发明中质粒的构建图。Figure 1 shows the construction diagram of the plasmid in the present invention.

图2比较了本发明的eMutS突变体和野生型eMutS在4℃下的热稳定性。Figure 2 compares the thermostability at 4°C between the eMutS mutant of the present invention and the wild-type eMutS.

A.利用软件“ImageJ”处理的结果。A. Results processed with the software "ImageJ".

根据DNA琼脂糖凝胶上DNA的亮度结果，利用“ImageJ”软件对所有样品给出一个RawIntDen值，RawIntDen比例是指样品的RawIntDen值/对照eMutS的RawIntDen值，如果RawIntDen比例低于0.3，则表示蛋白有活性；如果RawIntDen比例高于0.3，则表示蛋白失去活性。According to the DNA brightness results on the DNA agarose gel, use the "ImageJ" software to give a RawIntDen value for all samples, and the RawIntDen ratio refers to the RawIntDen value of the sample/the RawIntDen value of the control eMutS, if the RawIntDen ratio is lower than 0.3, it means The protein is active; if the RawIntDen ratio is higher than 0.3, it means that the protein is inactive.

B.不同的eMutS蛋白突变体的热稳定性。B. Thermostability of different eMutS protein mutants.

eMutS是野生型对照，eMutS-E38C-G70C、eMutS-S120C-S151C、eMutS-L157C-G233C、eMutS-E177C-G195C、eMutS-Y474C-R500C、eMutS-E451C-V465C、eMutS-T581C-K644C、eMutS-H585C-Q626C、eMutS-I597C-H760C、eMutS-M609C-T723C和eMutS-S120C-L157C-E451C-M609C都是不同的eMutS蛋白突变体样品。eMutS is the wild-type control, eMutS-E38C-G70C, eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-E177C-G195C, eMutS-Y474C-R500C, eMutS-E451C-V465C, eMutS-T581C-MutS-K644C, H585C-Q626C, eMutS-I597C-H760C, eMutS-M609C-T723C and eMutS-S120C-L157C-E451C-M609C are all different eMutS protein mutant samples.

具体实施方式Detailed ways

发明人经过广泛而深入的研究，出乎意料地发现对大肠杆菌来源的错配结合蛋白进行定点突变，增加二硫键，获得的错配结合蛋白突变体在4℃下保存90天仍能保持大于90％的活性。在此基础上完成了本发明。After extensive and in-depth research, the inventor unexpectedly found that the mismatch binding protein derived from Escherichia coli was subjected to site-directed mutation to increase the disulfide bond, and the obtained mismatch binding protein mutant could still maintain the protein after being stored at 4°C for 90 days. Greater than 90% activity. The present invention has been accomplished on this basis.

本发明的应用与优点：Application and advantage of the present invention:

1.本发明提供的各种错配结合蛋白高效识别并结合错配的DNA；1. Various mismatch binding proteins provided by the invention efficiently recognize and bind mismatched DNA;

2.本发明提供的各种错配结合蛋白比未经改造的野生型错配结合蛋白具有较高的热稳定性，在工业上的应用前景广阔。2. The various mismatch binding proteins provided by the present invention have higher thermal stability than unmodified wild-type mismatch binding proteins, and have broad prospects for industrial application.

下面结合具体实施例，进一步阐述本发明。应理解，这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法，通常按照常规条件如Sambrook等人，分子克隆：实验室手册(New York:Cold Spring Harbor LaboratoryPress,1989)中所述的条件，或按照制造厂商所建议的条件。Below in conjunction with specific embodiment, further illustrate the present invention. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. The experimental methods not indicating specific conditions in the following examples are usually according to conventional conditions such as Sambrook et al., molecular cloning: the conditions described in the laboratory manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the manufacturer's instructions suggested conditions.

试剂和菌株：本发明中的所有试剂均是市场购买的试剂级以上的试剂。其中，Tryptone，Yeast extract，NaCl，质粒提取试剂盒，胶回收试剂盒以及所有的限制性内切酶均来源于上海生工生物工程公司。PrimeSTAR HS DNA聚合酶，Solution I连接酶以及pMD18-T载体购自于大连宝生物公司。大肠杆菌Escherichia coli XL10-gold菌株作为DNA操作时使用的宿主菌(美国加利福利亚Stratagene公司)，包含100μg/ml氨苄青霉素的Luria-Bertani(LB)培养基用作培养E.coli。含有SEQ ID NO:51所示序列的野生型eMutS基因的质粒pEcoMutS-CBM3-EGFP(Wen et al.,2014)由中国科学技术大学教授惠赠，目前保存在本发明人所在的实验室。LB培养基(5g/l酵母提取物,10g/l胰蛋白胨,5g/l Nacl)用于诱导表达培养基。Reagents and bacterial strains: All reagents in the present invention are commercially purchased reagent grade reagents. Among them, Tryptone, Yeast extract, NaCl, plasmid extraction kit, gel recovery kit and all restriction endonucleases are from Shanghai Sangon Bioengineering Company. PrimeSTAR HS DNA polymerase, Solution I ligase and pMD18-T vector were purchased from Dalian Bao Biological Company. Escherichia coli XL10-gold strain was used as the host bacteria (Stratagene Company, California, USA) used for DNA manipulation, and Luria-Bertani (LB) medium containing 100 μg/ml ampicillin was used for culturing E. coli. The plasmid pEcoMutS-CBM3-EGFP (Wen et al., 2014) containing the wild-type eMutS gene sequence shown in SEQ ID NO: 51 was donated by a professor at the University of Science and Technology of China, and is currently kept in the laboratory of the inventor. LB medium (5g/l yeast extract, 10g/l tryptone, 5g/l Nacl) was used to induce expression medium.

实施例1菌株的制备：The preparation of embodiment 1 bacterial strain:

1.构建本发明中的各种质粒载体：1. Construct various plasmid vectors among the present invention:

1).提取pEcoMutS-CBM3-EGFP(中国科学技术大学教授惠赠)的质粒，并将提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物E38C-G70C-F1(SEQ ID No.01)和E38C-G70C-R1(SEQ ID No.02)引物进行PCR，扩增产物利用DpnI酶切处理后获得一个位点突变的质粒pZJQIBEBT001a。然后以质粒pZJQIBEBT001a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物E38C-G70C-F2(SEQ ID No.03)和E38C-G70C-R2(SEQ ID No.04)进行PCR扩增出pEcoMutS-E38C-G70C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT001。1). Extract the plasmid of pEcoMutS-CBM3-EGFP (a gift from the professor of University of Science and Technology of China), and use the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primer E38C-G70C-F1 (SEQ ID No.01) and E38C-G70C-R1 (SEQ ID No.02) primers were used for PCR, and the amplified product was digested with DpnI to obtain a plasmid pZJQIBEBT001a with a site mutation. Then using the plasmid pZJQIBEBT001a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers E38C-G70C-F2 (SEQ ID No.03) and E38C-G70C-R2 (SEQ ID No.04) to carry out PCR amplification The pEcoMutS-E38C-G70C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT001.

具体操作步骤为：The specific operation steps are:

首先，提取质粒，其提取步骤为：First, extract the plasmid, the extraction steps are:

a.将2ml含相应抗生素的LB培养基加入到容量为15ml的试管中，接种一单菌落，用棉塞封口，在37℃剧烈震荡(250rpm)培养过夜。a. Add 2ml of LB medium containing corresponding antibiotics into a test tube with a capacity of 15ml, inoculate a single colony, seal it with a cotton plug, and culture overnight at 37°C with vigorous shaking (250rpm).

b.将1.5ml培养物转入离心管中，用台式离心机在4℃条件下离心30秒(12,000×g)。b. Transfer 1.5ml of the culture into a centrifuge tube and centrifuge at 4°C for 30 seconds (12,000×g) in a tabletop centrifuge.

c.弃上清液，用滤纸吸干离心管中的液体培养基，将细菌沉淀物悬浮于200m lSTET缓冲液，用涡旋混合器充分混匀。c. Discard the supernatant, blot the liquid medium in the centrifuge tube with filter paper, suspend the bacterial sediment in 200 ml STET buffer, and mix well with a vortex mixer.

d.加入4m l新鲜配制的溶菌酶液，混匀，置室温下5min。d. Add 4ml of freshly prepared lysozyme solution, mix well, and place at room temperature for 5 minutes.

e.用漂子架住离心管，置于沸水浴中，精确记时45秒，取出后立刻离心10min。e. Hold the centrifuge tube with a float, place it in a boiling water bath, accurately time it for 45 seconds, and centrifuge it for 10 minutes immediately after taking it out.

f.用无菌牙签挑取离心管中的沉淀物弃去，离心管的上清液中加入8m l 5％CTAB，用混合器混匀后，高速离心5min，弃上清液，用滤纸吸干离心管中的液体。f. Use a sterile toothpick to pick up the sediment in the centrifuge tube and discard it. Add 8ml 5% CTAB to the supernatant of the centrifuge tube. After mixing with a mixer, centrifuge at high speed for 5min. Discard the supernatant and absorb it with filter paper. Dry the liquid in the centrifuge tube.

g.加入1.2M氯化钠300m l，充分溶解沉淀物，再加入750m l的冷乙醇，充分混匀后，离心15min，弃上清液。g. Add 300ml of 1.2M sodium chloride to fully dissolve the precipitate, then add 750ml of cold ethanol, mix thoroughly, centrifuge for 15min, and discard the supernatant.

h.取1ml 70％冷乙醇，缓慢淋洗离心管内壁，离心5min。弃上清液，用滤纸吸干管壁上的液体，置室温中使核酸沉淀自然干燥5-10min。h. Take 1ml of 70% cold ethanol, slowly rinse the inner wall of the centrifuge tube, and centrifuge for 5min. Discard the supernatant, blot the liquid on the tube wall with filter paper, and let the nucleic acid precipitate dry naturally at room temperature for 5-10 minutes.

i.沉淀物溶于50m l TE缓冲液中，混合器混匀。即成质粒制备液，制备的质粒供下一步实验使用。i. Dissolve the precipitate in 50ml TE buffer solution and mix with a mixer. The plasmid preparation solution is ready, and the prepared plasmid is used for the next experiment.

突变体质粒获取具体操作步骤为：The specific steps for obtaining mutant plasmids are as follows:

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增pEcoMutS-E38C-CBM3-EGFP全质粒的PCR体系：(1) PCR system for amplifying pEcoMutS-E38C-CBM3-EGFP full plasmid using pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-E38C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT001a。(2) The entire plasmid of the amplified product pEcoMutS-E38C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT001a.

①pEcoMutS-E38C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-E38C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT001a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT001a (Fig. 1).

(3)以pZJQIBEBT001a质粒为模板扩增pEcoMutS-E38C-G70C-CBM3-EGFP全质粒的PCR体系：(3) A PCR system for amplifying the pEcoMutS-E38C-G70C-CBM3-EGFP full plasmid using the pZJQIBEBT001a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-E38C-G70C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT001。(4) The entire plasmid of the amplified product pEcoMutS-E38C-G70C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT001.

①pEcoMutS-E38C-G70C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-E38C-G70C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT001(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT001 ( FIG. 1 ).

2).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物S120C-S151C-F1(SEQ ID No.05)和S120C-S151C-R1(SEQ ID No.06)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT002a。然后以质粒pZJQIBEBT002a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物S120C-S151C-F2(SEQ ID No.07)和S120C-S151C-R2(SEQ ID No.08)进行PCR扩增出pEcoMutS-S120C-S151C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT002。2). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers S120C-S151C-F1 (SEQ ID No.05) and S120C-S151C-R1 (SEQ ID No. 06) PCR was performed, and the amplified product was digested with Dpn I to obtain a site-mutated plasmid pZJQIBEBT002a. Then using the plasmid pZJQIBEBT002a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers S120C-S151C-F2 (SEQ ID No.07) and S120C-S151C-R2 (SEQ ID No.08) to carry out PCR amplification The pEcoMutS-S120C-S151C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT002.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增pEcoMutS-S120C-CBM3-EGFP全质粒的PCR体系：(1) PCR system for amplifying pEcoMutS-S120C-CBM3-EGFP full plasmid using pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-S120C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT002a。(2) The entire plasmid of the amplified product pEcoMutS-S120C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT002a.

①pEcoMutS-S120C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-S120C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT002a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT002a ( FIG. 1 ).

(3)以pZJQIBEBT002a质粒为模板扩增pEcoMutS-S120C-S151C-CBM3-EGFP全质粒的PCR体系：(3) A PCR system for amplifying the pEcoMutS-S120C-S151C-CBM3-EGFP full plasmid using the pZJQIBEBT002a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-S120C-S151C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT002。(4) Digest the entire plasmid of the amplified product pEcoMutS-S120C-S151C-CBM3-EGFP with Dpn I, and recover the product to obtain the second point-mutated plasmid pZJQIBEBT002.

①pEcoMutS-S120C-S151C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-S120C-S151C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT002(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT002 ( FIG. 1 ).

3).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物L157C-G233C-F1(SEQ ID No.09)和L157C-G233C-R1(SEQ ID No.10)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT003a。然后以质粒pZJQIBEBT003a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物L157C-G233C-F2(SEQ ID No.11)和L157C-G233C-R2(SEQ ID No.12)进行PCR扩增出pEcoMutS-L157C-G233C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT003。3). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers L157C-G233C-F1 (SEQ ID No.09) and L157C-G233C-R1 (SEQ ID No. 10) PCR was performed, and the amplified product was digested with Dpn I to obtain a site-mutated plasmid pZJQIBEBT003a. Then using the plasmid pZJQIBEBT003a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers L157C-G233C-F2 (SEQ ID No.11) and L157C-G233C-R2 (SEQ ID No.12) to carry out PCR amplification The pEcoMutS-L157C-G233C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT003.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增pEcoMutS-L157C-CBM3-EGFP全质粒的PCR体系：(1) The PCR system for amplifying the pEcoMutS-L157C-CBM3-EGFP full plasmid using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-L157C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT003a。(2) The entire plasmid of the amplified product pEcoMutS-L157C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT003a.

①pEcoMutS-L157C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-L157C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT003a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT003a (Fig. 1).

(3)以pZJQIBEBT003a质粒为模板扩增全质粒pEcoMutS-L157C-G233C-CBM3-EGFP的PCR体系：(3) The PCR system for amplifying the whole plasmid pEcoMutS-L157C-G233C-CBM3-EGFP using the pZJQIBEBT003a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-L157C-G233C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT003。(4) The entire plasmid of the amplified product pEcoMutS-L157C-G233C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT003.

①pEcoMutS-L157C-G233C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-L157C-G233C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT003(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT003 ( FIG. 1 ).

4).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物E177C-G195C-F1(SEQ ID No.13)和E177C-G195C-R1(SEQ ID No.14)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT004a。然后以质粒pZJQIBEBT004a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物E177C-G195C-F2(SEQ ID No.15)和E177C-G195C-R2(SEQ ID No.16)进行PCR扩增出pEcoMutS-E177C-G195C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT004。4). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers E177C-G195C-F1 (SEQ ID No.13) and E177C-G195C-R1 (SEQ ID No. 14) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT004a with a site mutation. Then using the plasmid pZJQIBEBT004a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers E177C-G195C-F2 (SEQ ID No.15) and E177C-G195C-R2 (SEQ ID No.16) to carry out PCR amplification The pEcoMutS-E177C-G195C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT004.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-E177C-CBM3-EGFP的PCR体系：(1) The PCR system for amplifying the whole plasmid pEcoMutS-E177C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-E177C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT004a。(2) The entire plasmid of the amplified product pEcoMutS-E177C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT004a.

①pEcoMutS-E177C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-E177C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT004a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT004a (Fig. 1).

(3)以pZJQIBEBT004a质粒为模板扩增全质粒pEcoMutS-E177C-G195C-CBM3-EGFP的PCR体系：(3) A PCR system for amplifying the entire plasmid pEcoMutS-E177C-G195C-CBM3-EGFP using the pZJQIBEBT004a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-E177C-G195C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT004。(4) The entire plasmid of the amplified product pEcoMutS-E177C-G195C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT004.

①pEcoMutS-E177C-G195C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-E177C-G195C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT004(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT004 ( FIG. 1 ).

5).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物E451C-V465C-F1(SEQ ID No.17)和E451C-V465C-R1(SEQ ID No.18)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT005a。然后以质粒pZJQIBEBT005a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物E451C-V465C-F2(SEQ ID No.19)和E451C-V465C-R2(SEQ ID No.20)进行PCR扩增出pEcoMutS-E451C-V465C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT005。5). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers E451C-V465C-F1 (SEQ ID No.17) and E451C-V465C-R1 (SEQ ID No. 18) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT005a with a site mutation. Then using the plasmid pZJQIBEBT005a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers E451C-V465C-F2 (SEQ ID No.19) and E451C-V465C-R2 (SEQ ID No.20) to carry out PCR amplification The whole pEcoMutS-E451C-V465C-CBM3-EGFP plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT005.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-E451C-CBM3-EGFP的PCR体系：(1) The PCR system for amplifying the whole plasmid pEcoMutS-E451C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-E451C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT005a。(2) The entire plasmid of the amplified product pEcoMutS-E451C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT005a.

①pEcoMutS-E451C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-E451C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT005a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT005a (Fig. 1).

(3)以pZJQIBEBT005a质粒为模板扩增全质粒pEcoMutS-E451C-V465C-CBM3-EGFP的PCR体系：(3) The PCR system for amplifying the whole plasmid pEcoMutS-E451C-V465C-CBM3-EGFP using the pZJQIBEBT005a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-E451C-V465C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT005。(4) Digest the entire plasmid of the amplified product pEcoMutS-E451C-V465C-CBM3-EGFP with Dpn I, and recover the product to obtain the second point-mutated plasmid pZJQIBEBT005.

①pEcoMutS-E451C-V465C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-E451C-V465C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT005(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT005 ( FIG. 1 ).

6).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物Y474C-R500C-F1(SEQ ID No.21)和Y474C-R500C-R1(SEQ ID No.22)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT006a。然后以质粒pZJQIBEBT006a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物Y474C-R500C-F2(SEQ ID No.23)和Y474C-R500C-R2(SEQ ID No.24)进行PCR扩增出pEcoMutS-Y474C-R500C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT006。6). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers Y474C-R500C-F1 (SEQ ID No.21) and Y474C-R500C-R1 (SEQ ID No. 22) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT006a with a site mutation. Then using the plasmid pZJQIBEBT006a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers Y474C-R500C-F2 (SEQ ID No.23) and Y474C-R500C-R2 (SEQ ID No.24) to carry out PCR amplification The pEcoMutS-Y474C-R500C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT006.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-Y474C-CBM3-EGFP的PCR体系：(1) PCR system for amplifying the whole plasmid pEcoMutS-Y474C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-Y474C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT006a。(2) The entire plasmid of the amplified product pEcoMutS-Y474C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT006a.

①pEcoMutS-Y474C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-Y474C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT006a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT006a ( FIG. 1 ).

(3)以pZJQIBEBT006a质粒为模板扩增全质粒pEcoMutS-Y474C-R500C-CBM3-EGFP的PCR体系：(3) PCR system for amplifying the whole plasmid pEcoMutS-Y474C-R500C-CBM3-EGFP using the pZJQIBEBT006a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-Y474C-R500C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT006。(4) Digest the entire plasmid of the amplified product pEcoMutS-Y474C-R500C-CBM3-EGFP with Dpn I, and recover the product to obtain the second point-mutated plasmid pZJQIBEBT006.

①pEcoMutS-Y474C-R500C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-Y474C-R500C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT006(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT006 ( FIG. 1 ).

7).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物T581C-K644C-F1(SEQ ID No.25)和T581C-K644C-R1(SEQ ID No.26)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT007a。然后以质粒pZJQIBEBT007a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物T581C-K644C-F2(SEQ ID No.27)和T581C-K644C-R2(SEQ ID No.28)进行PCR扩增出pEcoMutS-T581C-K644C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT007。7). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers T581C-K644C-F1 (SEQ ID No.25) and T581C-K644C-R1 (SEQ ID No. 26) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT007a with a site mutation. Then using the plasmid pZJQIBEBT007a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers T581C-K644C-F2 (SEQ ID No.27) and T581C-K644C-R2 (SEQ ID No.28) to carry out PCR amplification The whole pEcoMutS-T581C-K644C-CBM3-EGFP plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT007.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-T581C-CBM3-EGFP的PCR体系：(1) The PCR system for amplifying the whole plasmid pEcoMutS-T581C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-T581C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT007a。(2) Digest the entire plasmid of the amplified product pEcoMutS-T581C-CBM3-EGFP with Dpn I, and recover the product to obtain the first point-mutated plasmid pZJQIBEBT007a.

①pEcoMutS-T581C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-T581C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT007a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT007a (Fig. 1).

(3)以pZJQIBEBT007a质粒为模板扩增全质粒pEcoMutS-T581C-K644C-CBM3-EGFP的PCR体系：(3) PCR system for amplifying the whole plasmid pEcoMutS-T581C-K644C-CBM3-EGFP using the pZJQIBEBT007a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-T581C-K644C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT007。(4) The entire plasmid of the amplified product pEcoMutS-T581C-K644C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT007.

①pEcoMutS-T581C-K644C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-T581C-K644C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT007(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT007 ( FIG. 1 ).

8).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物H585C-Q626C-F1(SEQ ID No.29)和H585C-Q626C-R1(SEQ ID No.30)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT008a。然后以质粒pZJQIBEBT008a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物H585C-Q626C-F2(SEQ ID No.31)和H585C-Q626C-R2(SEQ ID No.32)进行PCR扩增出pEcoMutS-H585C-Q626C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT008。8). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers H585C-Q626C-F1 (SEQ ID No.29) and H585C-Q626C-R1 (SEQ ID No. 30) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT008a with a site mutation. Then using the plasmid pZJQIBEBT008a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers H585C-Q626C-F2 (SEQ ID No.31) and H585C-Q626C-R2 (SEQ ID No.32) to carry out PCR amplification The pEcoMutS-H585C-Q626C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT008.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-H585C-CBM3-EGFP的PCR体系：(1) PCR system for amplifying the whole plasmid pEcoMutS-H585C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-H585C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT008a。(2) The entire plasmid of the amplified product pEcoMutS-H585C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT008a.

①pEcoMutS-H585C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-H585C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT008a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT008a (Fig. 1).

(3)以pZJQIBEBT008a质粒为模板扩增全质粒pEcoMutS-H585C-Q626C-CBM3-EGFP的PCR体系：(3) The PCR system for amplifying the whole plasmid pEcoMutS-H585C-Q626C-CBM3-EGFP using the pZJQIBEBT008a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-H585C-Q626C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT008。(4) The entire plasmid of the amplified product pEcoMutS-H585C-Q626C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT008.

①pEcoMutS-H585C-Q626C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-H585C-Q626C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT008(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT008 ( FIG. 1 ).

9).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物I597C-H760C-F1(SEQ ID No.33)和I597C-H760C-R1(SEQ ID No.34)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT009a。然后以质粒pZJQIBEBT009a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物I597C-H760C-F2(SEQ ID No.35)和I597C-H760C-R2(SEQ ID No.36)进行PCR扩增出pEcoMutS-I597C-H760C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT009。9). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers I597C-H760C-F1 (SEQ ID No.33) and I597C-H760C-R1 (SEQ ID No. 34) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT009a with a site mutation. Then using the plasmid pZJQIBEBT009a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers I597C-H760C-F2 (SEQ ID No.35) and I597C-H760C-R2 (SEQ ID No.36) to carry out PCR amplification The pEcoMutS-I597C-H760C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, named pZJQIBEBT009.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-I597C-CBM3-EGFP的PCR体系：(1) The PCR system for amplifying the whole plasmid pEcoMutS-I597C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-I597C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT009a。(2) The entire plasmid of the amplified product pEcoMutS-I597C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT009a.

①pEcoMutS-I597C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-I597C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT009a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT009a ( FIG. 1 ).

(3)以pZJQIBEBT009a质粒为模板扩增全质粒pEcoMutS-I597C-H760C-CBM3-EGFP的PCR体系：(3) The PCR system for amplifying the whole plasmid pEcoMutS-I597C-H760C-CBM3-EGFP using the pZJQIBEBT009a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-I597C-H760C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT009。(4) The entire plasmid of the amplified product pEcoMutS-I597C-H760C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT009.

①pEcoMutS-I597C-H760C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-I597C-H760C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT009(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT009 ( FIG. 1 ).

10).以提取的pEcoMutS-CBM3-EGFP质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物M609C-T723C-F1(SEQ ID No.37)和M609C-T723C-R1(SEQ ID No.38)进行PCR，扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT010a。然后以质粒pZJQIBEBT010a为模版，用使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物M609C-T723C-F2(SEQ ID No.39)和M609C-T723C-R2(SEQ ID No.40)进行PCR扩增出pEcoMutS-M609C-T723C-CBM3-EGFP全质粒，用Dpn I酶切处理后获得二个位点突变的质粒，命名为pZJQIBEBT010。10). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers M609C-T723C-F1 (SEQ ID No.37) and M609C-T723C-R1 (SEQ ID No. 38) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT010a with a site mutation. Then, using the plasmid pZJQIBEBT010a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers M609C-T723C-F2 (SEQ ID No.39) and M609C-T723C-R2 (SEQ ID No.40) to carry out PCR amplification The pEcoMutS-M609C-T723C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, named pZJQIBEBT010.

(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-M609C-CBM3-EGFP的PCR体系：(1) The PCR system for amplifying the whole plasmid pEcoMutS-M609C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:

PCR程序PCR program

(2)将扩增产物pEcoMutS-M609C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第一个点突变的质粒pZJQIBEBT010a。(2) The entire plasmid of the amplified product pEcoMutS-M609C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT010a.

①pEcoMutS-M609C-CBM3-EGFP全质粒的酶切体系：①Enzyme digestion system of pEcoMutS-M609C-CBM3-EGFP whole plasmid:

②将获得的进行了定点突变的质粒，命名为pZJQIBEBT010a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT010a ( FIG. 1 ).

(3)以pZJQIBEBT010a质粒为模板扩增全质粒pEcoMutS-M609C-T723C-CBM3-EGFP的PCR体系：(3) The PCR system for amplifying the whole plasmid pEcoMutS-M609C-T723C-CBM3-EGFP using the pZJQIBEBT010a plasmid as a template:

PCR程序PCR program

(4)将扩增产物pEcoMutS-M609C-T723C-CBM3-EGFP全质粒利用Dpn I进行酶切，回收产物即可获得第二个点突变的质粒pZJQIBEBT010。(4) The entire plasmid of the amplified product pEcoMutS-M609C-T723C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT010.

①pEcoMutS-M609C-T723C-CBM3-EGFP全质粒的酶切体系：① Enzyme digestion system of pEcoMutS-M609C-T723C-CBM3-EGFP whole plasmid:

②将获得的进行了第二个位点定点突变的质粒，命名为pZJQIBEBT010(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT010 ( FIG. 1 ).

10).分别以提取的pEcoMutS-CBM3-EGFP质粒，pZJQIBEBT002质粒，pZJQIBEBT003质粒，pZJQIBEBT006质粒，pZJQIBEBT010质粒作为模板，使用KOD Plus Neo DNA聚合酶(TOYOBO)，引物eMutS-NheI-F(SEQ ID No.41)和eMutS-340-R(SEQ ID No.42)，eMutS-310-F(SEQ ID No.43)和eMutS-S120C-460-R(SEQ ID No.44)，eMutS-430-F(SEQ ID No.45)和eMutS-800-R(SEQ ID No.46)，eMutS-770-F(SEQ ID No.47)和eMutS-1500-R(SEQ IDNo.48)，eMutS-1470-F(SEQ ID No.49)和eMutS-SacI-R(SEQ ID No.50)进行PCR，获得产物eMutS-1-340，S120C-310-460，L157C-430-800，E451C-770-1500，M609C-1470-末尾。然后将扩增产物利用KOD Plus Neo DNA聚合酶(TOYOBO)，引物eMutS-NheI-F(SEQ ID No.41)和eMutS-SacI-R(SEQ ID No.50)进行融合PCR，获得产物eMutS-S120C-L157C-E451C-M609C。利用NheI和SacI分别酶切片段eMutS-S120C-L157C-E451C-M609C和质粒pEcoMutS-CBM3-EGFP，连接，转化后获得八个位点突变的质粒，命名为pZJQIBEBT011(图1)。10). Taking the extracted pEcoMutS-CBM3-EGFP plasmid, pZJQIBEBT002 plasmid, pZJQIBEBT003 plasmid, pZJQIBEBT006 plasmid, and pZJQIBEBT010 plasmid as templates respectively, using KOD Plus Neo DNA polymerase (TOYOBO), primer eMutS-NheI-F (SEQ ID No. 41) and eMutS-340-R (SEQ ID No.42), eMutS-310-F (SEQ ID No.43) and eMutS-S120C-460-R (SEQ ID No.44), eMutS-430-F ( SEQ ID No.45) and eMutS-800-R (SEQ ID No.46), eMutS-770-F (SEQ ID No.47) and eMutS-1500-R (SEQ ID No.48), eMutS-1470-F (SEQ ID No.49) and eMutS-SacI-R (SEQ ID No.50) carry out PCR, obtain product eMutS-1-340, S120C-310-460, L157C-430-800, E451C-770-1500, M609C -1470- end. The amplified product is then utilized for fusion PCR with KOD Plus Neo DNA polymerase (TOYOBO), primers eMutS-NheI-F (SEQ ID No.41) and eMutS-SacI-R (SEQ ID No.50), to obtain the product eMutS- S120C-L157C-E451C-M609C. The fragment eMutS-S120C-L157C-E451C-M609C and the plasmid pEcoMutS-CBM3-EGFP were respectively digested with NheI and SacI, ligated, and a plasmid with eight mutations was obtained after transformation, which was named pZJQIBEBT011 (Figure 1).

(1)以pEcoMutS-CBM3-EGFP质粒为模板，eMutS-NheI-F(SEQ ID No.41)和eMutS-340-R(SEQ ID No.42)为引物，扩增eMutS-1-340片段的PCR体系：(1) Using the pEcoMutS-CBM3-EGFP plasmid as a template, eMutS-NheI-F (SEQ ID No.41) and eMutS-340-R (SEQ ID No.42) as primers, amplify the eMutS-1-340 fragment PCR system:

PCR程序PCR program

(2)以pZJQIBEBT002质粒为模板，eMutS-310-F(SEQ ID No.43)和eMutS-S120C-460-R(SEQ ID No.44)为引物，扩增eMutS-1-340片段的PCR体系：(2) Using the pZJQIBEBT002 plasmid as a template, eMutS-310-F (SEQ ID No.43) and eMutS-S120C-460-R (SEQ ID No.44) as primers, a PCR system for amplifying the eMutS-1-340 fragment :

PCR程序PCR program

(3)以pZJQIBEBT003质粒为模板，eMutS-430-F(SEQ ID No.45)和eMutS-800-R(SEQ ID No.46)为引物，扩增L157C-430-800片段的PCR体系：(3) Using the pZJQIBEBT003 plasmid as a template, eMutS-430-F (SEQ ID No.45) and eMutS-800-R (SEQ ID No.46) as primers, a PCR system for amplifying the L157C-430-800 fragment:

PCR程序PCR program

(4)以pZJQIBEBT006质粒为模板，eMutS-770-F(SEQ ID No.47)和eMutS-1500-R(SEQ ID No.48)为引物，扩增E451C-770-1500片段的PCR体系：(4) Using the pZJQIBEBT006 plasmid as a template, eMutS-770-F (SEQ ID No.47) and eMutS-1500-R (SEQ ID No.48) as primers, a PCR system for amplifying the E451C-770-1500 fragment:

PCR程序PCR program

(5)以pZJQIBEBT010质粒为模板，eMutS-1470-F(SEQ ID No.49)和eMutS-SacI-R(SEQ ID No.50)为引物，扩增M609C-1470-末尾片段的PCR体系：(5) Using the pZJQIBEBT010 plasmid as a template, eMutS-1470-F (SEQ ID No.49) and eMutS-SacI-R (SEQ ID No.50) as primers, the PCR system for amplifying the M609C-1470-end fragment:

PCR程序PCR program

(6)然后将扩增产物利用KOD Plus Neo DNA聚合酶(TOYOBO)，引物eMutS-NheI-F(SEQ ID No.41)和eMutS-SacI-R(SEQ ID No.50)进行融合PCR，获得产物eMutS-S120C-L157C-E451C-M609C的PCR体系：(6) Then the amplified product was carried out fusion PCR using KOD Plus Neo DNA polymerase (TOYOBO), primers eMutS-NheI-F (SEQ ID No.41) and eMutS-SacI-R (SEQ ID No.50), to obtain The PCR system of the product eMutS-S120C-L157C-E451C-M609C:

PCR程序PCR program

(7)利用NheI和SacI分别酶切片段eMutS-S120C-L157C-E451C-M609C和质粒pEcoMutS-CBM3-EGFP，连接，转化后获得八个位点突变的质粒，命名为pZJQIBEBT011(图1)。(7) The fragment eMutS-S120C-L157C-E451C-M609C and the plasmid pEcoMutS-CBM3-EGFP were respectively digested with NheI and SacI, ligated, and a plasmid with eight mutations was obtained after transformation, which was named pZJQIBEBT011 (Figure 1).

①片段eMutS-S120C-L157C-E451C-M609C的酶切体系：① Enzyme digestion system for fragment eMutS-S120C-L157C-E451C-M609C:

②质粒pEcoMutS-CBM3-EGFP的酶切体系：② Enzyme digestion system of plasmid pEcoMutS-CBM3-EGFP:

③将酶切产物利用Solution I(TAKATA)进行连接，将获得的含有8个位点定点突变的质粒，命名为pZJQIBEBT011(图1)。③ The digested products were ligated using Solution I (TAKATA), and the obtained plasmid containing 8 site-directed mutations was named pZJQIBEBT011 (Figure 1).

2.将构建的载体转入表达载体大肠杆菌BL21中：2. Transform the constructed vector into the expression vector Escherichia coli BL21:

分别将进行定点突变了的10个质粒转化到宿主菌株大肠杆菌BL21的制备The preparation of the host strain Escherichia coli BL21 by transforming the 10 plasmids that have undergone site-directed mutation

1)大肠杆菌化学转化步骤:1) Escherichia coli chemical transformation steps:

①.从-70℃超低温冰柜中取出一管(100ul)感受态菌，立即用手指加温融化后插入冰上，冰浴5～10min。。①. Take out a tube (100ul) of competent bacteria from the ultra-low temperature freezer at -70°C, heat it with your fingers immediately to melt it, insert it on ice, and bathe in ice for 5-10 minutes. .

②.加入5ul连接好的质粒混合液(DNA含量不超过100ng)，轻轻震荡后放置冰上20min。②. Add 5ul of the ligated plasmid mixture (DNA content does not exceed 100ng), shake gently and place on ice for 20min.

③.轻轻摇匀后插入42℃水浴中45s进行热休克，然后迅速放回冰中，静置3～5min。③. Gently shake well and insert into a 42°C water bath for 45 seconds for heat shock, then quickly put it back into the ice, and let it stand for 3-5 minutes.

④.在超净工作台中向上述各管中分别加入500ul LB培养基(不含抗菌素)轻轻混匀，然后固定到摇床的弹簧架上37℃震荡1h。④. Add 500ul LB medium (without antibiotics) to each of the above-mentioned tubes in the ultra-clean workbench and mix gently, then fix it on the spring stand of the shaker at 37°C and shake for 1h.

⑤.在超净工作台中取上述转化混合液100-300ul，分别滴到含合适抗菌素的固体LB平板培养皿中，用酒精灯烧过的玻璃涂布棒涂布均匀。⑤. Take 100-300 ul of the above-mentioned transformation mixture in the ultra-clean workbench, drop them into solid LB plate culture dishes containing appropriate antibiotics, and spread evenly with a glass coating rod burned by an alcohol lamp.

⑥.在涂好的培养皿上做上标记，先放置在37℃恒温培养箱中30～60min直到表面的液体都渗透到培养基里后，再倒置过来放入37℃恒温培养箱过夜。⑥. Mark the coated culture dish, place it in a 37°C constant temperature incubator for 30-60 minutes until the liquid on the surface penetrates into the medium, then turn it upside down and put it in a 37°C constant temperature incubator overnight.

⑦.挑取板上克隆在液体LB中培养，提取质粒，并通过PCR鉴定转化结果。⑦. Pick the clones on the plate and culture them in liquid LB, extract the plasmids, and identify the transformation results by PCR.

2)构建各种eMutS蛋白突变体表达菌株的具体过程:2) The specific process of constructing various eMutS protein mutant expression strains:

将含有E38C位和G70C位突变的质粒pZJQIBEBT001，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT001；The plasmid pZJQIBEBT001 containing mutations at E38C and G70C was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT001;

将含有S120C位和S151C位突变的质粒pZJQIBEBT002，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT002；The plasmid pZJQIBEBT002 containing the S120C and S151C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT002;

将含有L157C位和G233C位突变的质粒pZJQIBEBT003，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT003；The plasmid pZJQIBEBT003 containing the L157C and G233C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT003;

将含有E177C位和G195C位突变的质粒pZJQIBEBT004，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT004；The plasmid pZJQIBEBT004 containing the E177C and G195C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT004;

将含有E451C位和V465C位突变的质粒pZJQIBEBT005，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT005；The plasmid pZJQIBEBT005 containing mutations at E451C and V465C was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT005;

将含有Y474C位和R500C位突变的质粒pZJQIBEBT006，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT006；The plasmid pZJQIBEBT006 containing Y474C and R500C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT006;

将含有T581C位和K644C位突变的质粒pZJQIBEBT007，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT007；The plasmid pZJQIBEBT007 containing T581C and K644C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT007;

将含有H585C位和Q626C位突变的质粒pZJQIBEBT008，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT008；The plasmid pZJQIBEBT008 containing the H585C and Q626C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT008;

将含有I597C位和H760C位突变的质粒pZJQIBEBT009，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT009；The plasmid pZJQIBEBT009 containing the I597C and H760C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT009;

将含有M609C位和T723C位突变的质粒pZJQIBEBT010，转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能，并新增eMutS的功能。在含有Amp抗性的培养基(配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l)上筛选阳性克隆，所得到的转化菌株命名为：EZJQIBEBT010；The plasmid pZJQIBEBT010 containing the M609C and T723C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT010;

3)提取质粒，通过PCR鉴定大肠杆菌转化的阳性菌株。3) extract the plasmid, and identify the positive strain transformed by Escherichia coli by PCR.

⑴大肠杆菌质粒提取步骤:(1) Escherichia coli plasmid extraction steps:

①.收菌：将过夜培养(37℃，12-16小时)的菌液于室温≧10,000g离心1-2分钟，彻底弃除上清。①. Bacteria collection: Centrifuge the overnight culture (37°C, 12-16 hours) at room temperature≧10,000g for 1-2 minutes, and discard the supernatant completely.

②.重悬：加入250μl含RNase A的细胞悬浮液(S1)，充分混悬震荡或用枪头反复抽打使细菌彻底分散悬浮。②. Resuspension: Add 250 μl of RNase A-containing cell suspension (S1), fully suspend and shake or beat repeatedly with a pipette tip to completely disperse and suspend the bacteria.

③.裂解：加入250μl细胞裂解液(S2)，轻轻上下颠倒混合5次，室温静置1-5分钟，待细菌充分裂解，溶液变半透明。③. Lysis: Add 250 μl of cell lysate (S2), mix gently up and down 5 times, and let stand at room temperature for 1-5 minutes. After the bacteria are fully lysed, the solution becomes translucent.

④.中和：加入350μl中和缓冲液(S3)，轻轻上下颠倒混合5次，充分混匀，避免剧烈震荡。室温下≧12,000g离心10分钟。④. Neutralization: Add 350 μl neutralization buffer (S3), mix gently up and down 5 times, mix well, avoid violent shock. Centrifuge at ≧12,000g for 10 minutes at room temperature.

⑤.DNA结合：小心吸取上清，转移到插入收集管的离心吸附柱内，室温下≧12,000g离心1分钟，弃除收集管中的废液，将离心吸附柱重新插回收集管中。⑤. DNA binding: Carefully aspirate the supernatant, transfer it to a centrifugal adsorption column inserted into a collection tube, centrifuge at ≧12,000g for 1 minute at room temperature, discard the waste liquid in the collection tube, and reinsert the centrifugal adsorption column into the collection tube.

⑥.清洗：加入500μl漂洗液(WB,请确认已加入乙醇！)于离心吸附柱中，室温下≧12,000g离心30秒，弃除收集管中的废液，将离心吸附柱重新插回收集管中。⑥. Cleaning: Add 500μl of washing solution (WB, please confirm that ethanol has been added!) to the centrifugal adsorption column, centrifuge at ≧12,000g for 30 seconds at room temperature, discard the waste liquid in the collection tube, and insert the centrifugal adsorption column back to collect tube.

⑦.清洗：加入500μl漂洗液(WB,请确认已加入乙醇！)于离心吸附柱中，室温下≧12,000g离心30秒，弃除收集管中的废液，将离心吸附柱重新插回收集管中。将离心吸附柱开盖再次离心2分钟，彻底除去残余漂洗液。。⑦. Cleaning: Add 500 μl of washing solution (WB, please confirm that ethanol has been added!) to the centrifugal adsorption column, centrifuge at ≧12,000g for 30 seconds at room temperature, discard the waste liquid in the collection tube, and insert the centrifugal adsorption column back to collect tube. Uncover the centrifugal adsorption column and centrifuge again for 2 minutes to completely remove the residual rinse solution. .

⑧.洗脱：小心取出离心吸附柱，将其套入一个新的1.5ml灭菌离心管中。向硅胶吸附膜的中央加入100μl洗脱缓冲液(EB)，室温放置1分钟后，≧12,000g离心1分钟收集质粒DNA。⑧. Elution: Carefully take out the centrifugal adsorption column and insert it into a new 1.5ml sterilized centrifuge tube. Add 100 μl of elution buffer (EB) to the center of the silica gel adsorption membrane, and after standing at room temperature for 1 minute, centrifuge at ≧12,000 g for 1 minute to collect plasmid DNA.

⑨.储存：弃除离心吸附柱，纯化的质粒可直接用于后续反应或于-20℃长期保存。⑨. Storage: Discard the centrifugal adsorption column, and the purified plasmid can be used directly for subsequent reactions or stored at -20°C for a long time.

⑵鉴定大肠杆菌转化的阳性菌株的PCR体系：(2) PCR system for identifying positive strains transformed by E. coli:

①包含eMutS的质粒的PCR体系：①Plasmid PCR system containing eMutS:

PCR程序PCR program

以通用引物M13作为引物，以质粒为模板，PCR扩增后，能特异性的扩增出基因条带的菌株，即为阳性菌株，并进行下一步实验。其中基因的大小为2586bp。The general primer M13 is used as a primer, and the plasmid is used as a template. After PCR amplification, the strain that can specifically amplify the gene band is a positive strain, and the next step of the experiment is carried out. The size of the gene is 2586bp.

实施例2eMutS蛋白突变体的热稳定性The thermostability of embodiment 2eMutS protein mutant

该实施例用于比较不同eMutS突变体的菌株表达的蛋白酶活稳定性的高低。结果表明EZJQIBEBT011菌株表达的蛋白eMutS-S120C-L157C-E451C-M609C活性稳定性最高。This example is used to compare the stability of protease activity expressed by strains of different eMutS mutants. The results showed that the protein eMutS-S120C-L157C-E451C-M609C expressed by EZJQIBEBT011 strain had the highest activity and stability.

1.在LB培养基平板上复苏菌株。对照株：eMutS。实验株：eMutS-E38C-G70C、eMutS-Y474C-R500C、eMutS-T581C-K644C、eMutS-I597C-H760C、eMutS-S120C-S151C、eMutS-L157C-G233C、eMutS-E451C-V465C、eMutS-M609C-T723C、eMutS-E177C-G195C、eMutS-H585C-Q626C、eMutS-S120C-L157C-E451C-M609C。37℃培养1天。1. Recover strains on LB medium plates. Control strain: eMutS. Experimental strains: eMutS-E38C-G70C, eMutS-Y474C-R500C, eMutS-T581C-K644C, eMutS-I597C-H760C, eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-E451C-V465C, eMutS-T79CutS-M70 , eMutS-E177C-G195C, eMutS-H585C-Q626C, eMutS-S120C-L157C-E451C-M609C. Incubate at 37°C for 1 day.

2.分别挑取单克隆，接于5ml液体LB培养基。37℃，250rpm，过夜。2. Pick out single clones respectively, and connect them to 5ml liquid LB medium. 37°C, 250rpm, overnight.

3.配制36瓶100mL的YPD培养基分装于1L锥形三角瓶。配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l。灭菌待用。3. Prepare 36 bottles of 100mL YPD medium and distribute them in 1L Erlenmeyer flasks. Recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l. Sterilized and ready to use.

4.取适量过夜培养物接入200ml LB培养基中，接种比例为1:1000,37℃,250rpm培养。4. Take an appropriate amount of overnight culture and inoculate it into 200ml LB medium at an inoculation ratio of 1:1000, culture at 37°C and 250rpm.

5.OD600＝0.8左右时，收菌，破碎菌体，纯化蛋白，测酶活。5. When OD600=0.8 or so, harvest the bacteria, crush the bacteria, purify the protein, and measure the enzyme activity.

6.从图2可知，突变体eMutS-E38C-G70C和eMutS-Y474C-R500C失去酶活。突变体eMutS-T581C-K644C和eMutS-I597C-H760C酶活可以持续21天，与对照组相似。突变体eMutS-S120C-S151C、eMutS-L157C-G233C、eMutS-E451C-V465C和eMutS-M609C-T723C，eMutS-S120C-L157C-E451C-M609C比对照组酶活持续时间要长，而且，突变体eMutS-S120C-L157C-E451C-M609C的酶活可以持续长达90天。6. It can be seen from Figure 2 that the mutants eMutS-E38C-G70C and eMutS-Y474C-R500C lost enzyme activity. The enzyme activity of mutants eMutS-T581C-K644C and eMutS-I597C-H760C can last for 21 days, which is similar to that of the control group. Mutants eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-E451C-V465C and eMutS-M609C-T723C, eMutS-S120C-L157C-E451C-M609C were longer than the control group, and the mutant eMutS -S120C-L157C-E451C-M609C can last for up to 90 days.

实施例3eMutS蛋白突变体结合错配DNA的纠错能力Example 3 Error Correction Ability of eMutS Protein Mutants in Binding to Mismatched DNA

该实施例用于验证两个酶活稳定性最强的eMutS突变体的菌株表达的蛋白酶的纠错能力。结果表明稳定性提高的蛋白酶的纠错能力仍然保持。其中，eMutS-S120C-L157C-E451C-M609C纠错效果最好，甚至优于突变前菌株。This example is used to verify the error correction ability of the protease expressed by the strains of the two eMutS mutants with the strongest enzyme activity stability. The results showed that the error-correcting ability of proteases with improved stability was still maintained. Among them, eMutS-S120C-L157C-E451C-M609C has the best error correction effect, even better than the strain before mutation.

1.在LB培养基平板上复苏菌株。对照株：eMutS。实验株：eMutS-S120C-S151C,eMutS-L157C-G233C，eMutS-S120C-L157C-E451C-M609C。37℃培养1天。1. Recover strains on LB medium plates. Control strain: eMutS. Experimental strains: eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-S120C-L157C-E451C-M609C. Incubate at 37°C for 1 day.

3.配制12瓶100mL的YPD培养基分装于1L锥形三角瓶。配方：胰蛋白胨10g/l，酵母提取物5g/l，NaCl 10g/l。灭菌待用。3. Prepare 12 bottles of 100mL YPD medium and distribute them in 1L Erlenmeyer flasks. Recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l. Sterilized and ready to use.

5.OD600＝0.8左右时，收菌，破碎菌体，纯化蛋白，用于DNA纠错实验。5. When OD600=0.8 or so, harvest the bacteria, crush the bacteria, purify the protein, and use it for the DNA error correction experiment.

6.在应用实验中，所有eMutS蛋白都与MICC系统一起使用，这些在过去的研究中有报道。利用稳定性提高后的纠错系统来纠错用MCp-oligos合成的XR和Cas9同源基因。如表1所示，寡聚物的错误率约为1.2％。使用MICC去除错误显着降低了合成XR和Cas9同源基因的错误率。实验表明，二硫键构建使eMutS蛋白稳定性提高，同时保持原有纠错能力。其中，eMutS-S120C-L157C-E451C-M609C不仅热稳定性大幅度提高，而且纠错效果最好，甚至优于突变前菌株。6. In applied experiments, all eMutS proteins were used with the MICC system, which were reported in past studies. Error correction of XR and Cas9 homologs synthesized with MCp-oligos using the error correction system with improved stability. As shown in Table 1, the oligomer error rate is about 1.2%. Removing errors using MICC significantly reduced the error rate for synthesizing XR and Cas9 homologs. Experiments show that the construction of disulfide bonds improves the stability of eMutS protein while maintaining the original error correction ability. Among them, eMutS-S120C-L157C-E451C-M609C not only greatly improved thermal stability, but also had the best error correction effect, even better than the strain before mutation.

表1. eMutS突变体进行DNA纠错的结果Table 1. Results of DNA error correction in eMutS mutants

应该理解，尽管参考其示例性的实施方案，已经对本发明进行具体地显示和描述，但是本领域的普通技术人员应该理解，在不背离由权利要求所定义的本发明的精神和范围的条件下，可以在其中进行各种形式和细节的变化，可以进行各种实施方案的任意组合。It should be understood that while the invention has been particularly shown and described with reference to exemplary embodiments thereof, those skilled in the art will appreciate that, without departing from the spirit and scope of the invention as defined by the appended claims, , various changes in form and details can be made therein, and any combination of various embodiments can be made.

序列表sequence listing

<110> 中国科学院青岛生物能源与过程研究所<110> Qingdao Institute of Bioenergy and Process Technology, Chinese Academy of Sciences

<120> 一种错配结合蛋白的突变体及其编码基因<120> A mutant of a mismatch binding protein and its coding gene

<130> P2019-0992<130> P2019-0992

<160> 62<160> 62

<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0

<210> 1<210> 1

<211> 36<211> 36

<212> DNA<212>DNA

<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)

<400> 1<400> 1

ggatgggtga tttttattgc ctgttttatg acgacg 36ggatgggtga tttttattgc ctgttttatg acgacg 36

<210> 2<210> 2

<211> 36<211> 36

<212> DNA<212>DNA

<400> 2<400> 2

cgtcgtcata aaacaggcaa taaaaatcac ccatcc 36cgtcgtcata aaacaggcaa taaaaatcac ccatcc 36

<210> 3<210> 3

<211> 35<211> 35

<212> DNA<212>DNA

<400> 3<400> 3

gccgatcccg atggcgtgca ttccctacca tgcgg 35gccgatcccg atggcgtgca ttccctacca tgcgg 35

<210> 4<210> 4

<211> 35<211> 35

<212> DNA<212>DNA

<400> 4<400> 4

ccgcatggta gggaatgcac gccatcggga tcggc 35ccgcatggta gggaatgcac gccatcggga tcggc 35

<210> 5<210> 5

<211> 35<211> 35

<212> DNA<212>DNA

<400> 5<400> 5

tacgccaggc accatctgcg atgaagccct gttgc 35tacgccaggc accatctgcg atgaagccct gttgc 35

<210> 6<210> 6

<211> 35<211> 35

<212> DNA<212>DNA

<400> 6<400> 6

gcaacagggc ttcatcgcag atggtgcctg gcgta 35gcaacagggc ttcatcgcag atggtgcctg gcgta 35

<210> 7<210> 7

<211> 33<211> 33

<212> DNA<212>DNA

<400> 7<400> 7

gcgacgctgg atatctgctc cgggcgtttt cgc 33gcgacgctgg atatctgctc cgggcgtttt cgc 33

<210> 8<210> 8

<211> 33<211> 33

<212> DNA<212>DNA

<400> 8<400> 8

gcgaaaacgc ccggagcaga tatccagcgt cgc 33gcgaaaacgc ccggagcaga tatccagcgt cgc 33

<210> 9<210> 9

<211> 35<211> 35

<212> DNA<212>DNA

<400> 9<400> 9

ttccgggcgt tttcgctgca gcgaaccggc tgacc 35ttccgggcgt tttcgctgca gcgaaccggc tgacc 35

<210> 10<210> 10

<211> 35<211> 35

<212> DNA<212>DNA

<400> 10<400> 10

ggtcagccgg ttcgctgcag cgaaaacgcc cggaa 35ggtcagccgg ttcgctgcag cgaaaacgcc cggaa 35

<210> 11<210> 11

<211> 36<211> 36

<212> DNA<212>DNA

<400> 11<400> 11

cgagaacgcg ccgcgctgcc tttgtgctgc cggttg 36cgagaacgcg ccgcgctgcc tttgtgctgc cggttg 36

<210> 12<210> 12

<211> 36<211> 36

<212> DNA<212>DNA

<400> 12<400> 12

caaccggcag cacaaaggca gcgcggcgcg ttctcg 36caaccggcag cacaaaggca gcgcggcgcg ttctcg 36

<210> 13<210> 13

<211> 35<211> 35

<212> DNA<212>DNA

<400> 13<400> 13

acgcactaat cctgcgtgcc tgctgtatgc agaag 35acgcactaat cctgcgtgcc tgctgtatgc agaag 35

<210> 14<210> 14

<211> 35<211> 35

<212> DNA<212>DNA

<400> 14<400> 14

cttctgcata cagcaggcac gcaggattag tgcgt 35cttctgcata cagcaggcac gcaggattag tgcgt 35

<210> 15<210> 15

<211> 33<211> 33

<212> DNA<212>DNA

<400> 15<400> 15

attgaaggcc gtcgctgcct gcgccgtcgc ccg 33attgaaggcc gtcgctgcct gcgccgtcgc ccg 33

<210> 16<210> 16

<211> 33<211> 33

<212> DNA<212>DNA

<400> 16<400> 16

cgggcgacgg cgcaggcagc gacggccttc aat 33cgggcgacgg cgcaggcagc gacggccttc aat 33

<210> 17<210> 17

<211> 34<211> 34

<212> DNA<212>DNA

<400> 17<400> 17

tatctggagc gtctgtgcgt ccgcgagcgt gaac 34tatctggagc gtctgtgcgt ccgcgagcgt gaac 34

<210> 18<210> 18

<211> 34<211> 34

<212> DNA<212>DNA

<400> 18<400> 18

gttcacgctc gcggacgcac agacgctcca gata 34gttcacgctc gcggacgcac agacgctcca gata 34

<210> 19<210> 19

<211> 33<211> 33

<212> DNA<212>DNA

<400> 19<400> 19

ctggacacgc tgaaatgcgg ctttaatgcg gtg 33ctggacacgc tgaaatgcgg ctttaatgcg gtg 33

<210> 20<210> 20

<211> 33<211> 33

<212> DNA<212>DNA

<400> 20<400> 20

caccgcatta aagccgcatt tcagcgtgtc cag 33caccgcatta aagccgcatt tcagcgtgtc cag 33

<210> 21<210> 21

<211> 33<211> 33

<212> DNA<212>DNA

<400> 21<400> 21

gcggtgcacg gctactgcat tcaaatcagc cgt 33gcggtgcacg gctactgcat tcaaatcagc cgt 33

<210> 22<210> 22

<211> 33<211> 33

<212> DNA<212>DNA

<400> 22<400> 22

acggctgatt tgaatgcagt agccgtgcac cgc 33acggctgatt tgaatgcagt agccgtgcac cgc 33

<210> 23<210> 23

<211> 33<211> 33

<212> DNA<212>DNA

<400> 23<400> 23

ctgaaaaacg ccgagtgcta catcattcca gag 33ctgaaaaacg ccgagtgcta catcattcca gag 33

<210> 24<210> 24

<211> 33<211> 33

<212> DNA<212>DNA

<400> 24<400> 24

ctctggaatg atgtagcact cggcgttttt cag 33ctctggaatg atgtagcact cggcgttttt cag 33

<210> 25<210> 25

<211> 33<211> 33

<212> DNA<212>DNA

<400> 25<400> 25

ccgggcattc gcatttgcga aggtcgccat ccg 33ccgggcattc gcatttgcga aggtcgccat ccg 33

<210> 26<210> 26

<211> 33<211> 33

<212> DNA<212>DNA

<400> 26<400> 26

cggatggcga ccttcgcaaa tgcgaatgcc cgg 33cggatggcga ccttcgcaaa tgcgaatgcc cgg 33

<210> 27<210> 27

<211> 33<211> 33

<212> DNA<212>DNA

<400> 27<400> 27

tatgtaccgg cacaatgcgt cgagattgga cct 33tatgtaccgg cacaatgcgt cgagattgga cct 33

<210> 28<210> 28

<211> 33<211> 33

<212> DNA<212>DNA

<400> 28<400> 28

aggtccaatc tcgacgcatt gtgccggtac ata 33aggtccaatc tcgacgcatt gtgccggtac ata 33

<210> 29<210> 29

<211> 33<211> 33

<212> DNA<212>DNA

<400> 29<400> 29

attaccgaag gtcgctgccc ggtagttgaa caa 33attaccgaag gtcgctgccc ggtagttgaa caa 33

<210> 30<210> 30

<211> 33<211> 33

<212> DNA<212>DNA

<400> 30<400> 30

ttgttcaact accgggcagc gaccttcggt aat 33ttgttcaact accgggcagc gaccttcggt aat 33

<210> 31<210> 31

<211> 33<211> 33

<212> DNA<212>DNA

<400> 31<400> 31

agtacctata tgcgctgcac cgcactgatt gcg 33agtacctata tgcgctgcac cgcactgatt gcg 33

<210> 32<210> 32

<211> 33<211> 33

<212> DNA<212>DNA

<400> 32<400> 32

cgcaatcagt gcggtgcagc gcatataggt act 33cgcaatcagt gcggtgcagc gcatataggt act 33

<210> 33<210> 33

<211> 33<211> 33

<212> DNA<212>DNA

<400> 33<400> 33

ctgaatgagc cattttgcgc caacccgctg aat 33ctgaatgagc cattttgcgc caacccgctg aat 33

<210> 34<210> 34

<211> 33<211> 33

<212> DNA<212> DNA

<400> 34<400> 34

attcagcggg ttggcgcaaa atggctcatt cag 33attcagcggg ttggcgcaaa atggctcatt cag 33

<210> 35<210> 35

<211> 33<211> 33

<212> DNA<212> DNA

<400> 35<400> 35

accattgcct ttatgtgcag cgtgcaggat ggc 33accattgcct ttatgtgcag cgtgcaggat ggc 33

<210> 36<210> 36

<211> 33<211> 33

<212> DNA<212>DNA

<400> 36<400> 36

gccatcctgc acgctgcaca taaaggcaat ggt 33gccatcctgc acgctgcaca taaaggcaat ggt 33

<210> 37<210> 37

<211> 34<211> 34

<212> DNA<212>DNA

<400> 37<400> 37

tcgccgcagc gccgctgctt gatcatcacc ggtc 34tcgccgcagc gccgctgctt gatcatcacc ggtc 34

<210> 38<210> 38

<211> 34<211> 34

<212> DNA<212>DNA

<400> 38<400> 38

gaccggtgat gatcaagcag cggcgctgcg gcga 34gaccggtgat gatcaagcag cggcgctgcg gcga 34

<210> 39<210> 39

<211> 36<211> 36

<212> DNA<212>DNA

<400> 39<400> 39

taagattaag gcattgtgct tatttgctac ccacta 36taagattaag gcattgtgct tattgctac cacta 36

<210> 40<210> 40

<211> 36<211> 36

<212> DNA<212>DNA

<400> 40<400> 40

tagtgggtag caaataagca caatgcctta atctta 36tagtgggtag caaataagca caatgcctta atctta 36

<210> 41<210> 41

<211> 31<211> 31

<212> DNA<212>DNA

<400> 41<400> 41

ctagctagca tgagtgcaat agaaaatttc g 31ctagctagca tgagtgcaat agaaaatttc g 31

<210> 42<210> 42

<211> 20<211> 20

<212> DNA<212>DNA

<400> 42<400> 42

cgatacgcac aactttgcgc 20cgatacgcac aactttgcgc 20

<210> 43<210> 43

<211> 20<211> 20

<212> DNA<212>DNA

<400> 43<400> 43

gtccggttga gcgcaaagtt 20gtccggttga gcgcaaagtt 20

<210> 44<210> 44

<211> 20<211> 20

<212> DNA<212>DNA

<400> 44<400> 44

gcccggagca gatatccagc 20gcccggagca gatatccagc 20

<210> 45<210> 45

<211> 20<211> 20

<212> DNA<212>DNA

<400> 45<400> 45

gctacgcgac gctggatatc 20gctacgcgac gctggatatc 20

<210> 46<210> 46

<211> 20<211> 20

<212> DNA<212>DNA

<400> 46<400> 46

atgctgtcct gctcacgttc 20atgctgtcct gctcacgttc 20

<210> 47<210> 47

<211> 20<211> 20

<212> DNA<212>DNA

<400> 47<400> 47

catcaccatg gaacgtgagc 20catcaccatg gaacgtgagc 20

<210> 48<210> 48

<211> 20<211> 20

<212> DNA<212>DNA

<400> 48<400> 48

gcgctcggcg tttttcagcg 20gcgctcggcg tttttcagcg 20

<210> 49<210> 49

<211> 22<211> 22

<212> DNA<212>DNA

<400> 49<400> 49

cgtcgccaga cgctgaaaaa cg 22cgtcgccaga cgctgaaaaa cg 22

<210> 50<210> 50

<211> 25<211> 25

<212> DNA<212>DNA

<400> 50<400> 50

cgagctccac caggctcttc aagcg 25cgagctccac caggctcttc aagcg 25

<210> 51<210> 51

<211> 2562<211> 2562

<212> DNA<212>DNA

<213> 大肠杆菌(Escherichia coli)<213> Escherichia coli

<400> 51<400> 51

atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60

aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120

tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180

tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240

gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300

accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatcagc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatcagc 360

gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420

aaaggtttcg gctacgcgac gctggatatc agttccgggc gttttcgcct gagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc agttccgggc gttttcgcct gagcgaaccg 480

gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540

gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600

ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660

gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720

ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780

gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840

cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900

ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960

ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020

gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080

cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140

cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200

gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260

gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320

gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380

gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aagttggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440

caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500

tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560

ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620

gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680

cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740

accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800

ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860

agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920

ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980

gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040

ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100

acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160

gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220

ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280

agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340

gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400

ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460

gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520

caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562

<210> 52<210> 52

<211> 853<211> 853

<212> PRT<212> PRT

<213> 大肠杆菌(Escherichia coli)<213> Escherichia coli

<400> 52<400> 52

Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln

1 5 10 151 5 10 15

Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg

20 25 30 20 25 30

Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser

35 40 45 35 40 45

Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu

50 55 60 50 55 60

Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu

65 70 75 8065 70 75 80

Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile

85 90 95 85 90 95

Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg

100 105 110 100 105 110

Ile Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu Arg

115 120 125 115 120 125

Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly

130 135 140 130 135 140

Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu ProTyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu Pro

145 150 155 160145 150 155 160

Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala

165 170 175 165 170 175

Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly

180 185 190 180 185 190

Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr

195 200 205 195 200 205

Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly

210 215 220 210 215 220

Phe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys Leu

225 230 235 240225 230 235 240

Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg

245 250 255 245 250 255

Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala

260 265 270 260 265 270

Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu

275 280 285 275 280 285

Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser

290 295 300 290 295 300

Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val

305 310 315 320305 310 315 320

Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala

325 330 335 325 330 335

Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu

340 345 350 340 345 350

Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met

355 360 365 355 360 365

Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr

370 375 380 370 375 380

Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe

385 390 395 400385 390 395 400

Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro

405 410 415 405 410 415

Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu

420 425 430 420 425 430

Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu

435 440 445 435 440 445

Arg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys

450 455 460 450 455 460

Val Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyVal Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly

465 470 475 480465 470 475 480

Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys

485 490 495 485 490 495

Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys

500 505 510 500 505 510

Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr

515 520 525 515 520 525

Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln

530 535 540 530 535 540

Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu

545 550 555 560545 550 555 560

Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro

565 570 575 565 570 575

Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu

580 585 590 580 585 590

Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg

595 600 605 595 600 605

Met Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetMet Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met

610 615 620 610 615 620

Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val

625 630 635 640625 630 635 640

Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg

645 650 655 645 650 655

Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val

660 665 670 660 665 670

Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser

675 680 685 675 680 685

Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly

690 695 700 690 695 700

Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys

705 710 715 720705 710 715 720

Ala Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro

725 730 735 725 730 735

Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His

740 745 750 740 745 750

Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser

755 760 765 755 760 765

Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu

770 775 780 770 775 780

Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser

785 790 795 800785 790 795 800

Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu

805 810 815 805 810 815

Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu

820 825 830 820 825 830

Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg

835 840 845 835 840 845

Leu Lys Ser Leu ValLeu Lys Ser Leu Val

850 850

<210> 53<210> 53

<211> 2562<211> 2562

<212> DNA<212>DNA

<400> 53<400> 53

accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360

aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480

<210> 54<210> 54

<211> 853<211> 853

<212> PRT<212> PRT

<400> 54<400> 54

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

Ile Val Thr Pro Gly Thr Ile Cys Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Cys Asp Glu Ala Leu Leu Gln Glu Arg

115 120 125 115 120 125

130 135 140 130 135 140

Tyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Leu Ser Glu ProTyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Leu Ser Glu Pro

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

Leu Lys Ser Leu ValLeu Lys Ser Leu Val

850 850

<210> 55<210> 55

<211> 2562<211> 2562

<212> DNA<212>DNA

<400> 55<400> 55

aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgctg cagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgctg cagcgaaccg 480

gatctggtcg gttttggcgt cgagaacgcg ccgcgctgcc tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgctgcc tttgtgctgc cggttgtctg 720

<210> 56<210> 56

<211> 853<211> 853

<212> PRT<212> PRT

<400> 56<400> 56

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Cys Ser Glu ProTyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Cys Ser Glu Pro

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

Phe Gly Val Glu Asn Ala Pro Arg Cys Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Cys Leu Cys Ala Ala Gly Cys Leu

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

Leu Lys Ser Leu ValLeu Lys Ser Leu Val

850 850

<210> 57<210> 57

<211> 2562<211> 2562

<212> DNA<212>DNA

<400> 57<400> 57

gacggcgcga ccgattatct ggagcgtctg tgcgtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg tgcgtccgcg agcgtgaacg taccggcctg 1380

gacacgctga aatgcggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aatgcggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440

<210> 58<210> 58

<211> 853<211> 853

<212> PRT<212> PRT

<400> 58<400> 58

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

Arg Leu Cys Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Cys Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys

450 455 460 450 455 460

Cys Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyCys Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

Leu Lys Ser Leu ValLeu Lys Ser Leu Val

850 850

<210> 59<210> 59

<211> 2562<211> 2562

<212> DNA<212>DNA

<400> 59<400> 59

ctgaatctgt cgccgcagcg ccgctgcttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgctgcttg atcatcaccg gtccgaacat gggcggtaaa 1860

gcattgtgct tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgtgct tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220

<210> 60<210> 60

<211> 853<211> 853

<212> PRT<212> PRT

<400> 60<400> 60

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

Cys Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetCys Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

Ala Leu Cys Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Cys Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

Leu Lys Ser Leu ValLeu Lys Ser Leu Val

850 850

<210> 61<210> 61

<211> 2562<211> 2562

<212> DNA<212>DNA

<400> 61<400> 61

<210> 62<210> 62

<211> 853<211> 853

<212> PRT<212> PRT

<400> 62<400> 62

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

Tyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Cys Ser Glu ProTyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Cys Ser Glu Pro

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

740 745 750 740 745 750

755 760 765 755 760 765

770 775 780 770 775 780

785 790 795 800785 790 795 800

805 810 815 805 810 815

820 825 830 820 825 830

835 840 845 835 840 845

Leu Lys Ser Leu ValLeu Lys Ser Leu Val

850 850

Claims

1. A mismatch binding protein, characterized in that, the amino acid sequence of the mismatch binding protein is selected from the group:

1) The amino acid sequence obtained by mutating the 120th and 151st serines in the amino acid sequence shown in SEQ ID NO:52 to cysteine;

2) The amino acid sequence obtained by mutating the 157th leucine and the 233rd glycine into cysteine in the amino acid sequence shown in SEQ ID NO:52; or

3) The serine at the 120th position, the leucine at the 157th position, the glutamic acid at the 451st position and the methionine at the 609th position in the amino acid sequence shown in SEQ ID NO:52 are all mutated into cysteine, and the obtained amino acid sequence.

2. The mismatch binding protein according to claim 1, wherein the amino acid sequence of the mismatch binding protein is as shown in SEQ ID NO:54, SEQ ID NO:56 or SEQ ID NO:62.

3. A gene encoding the mismatch binding protein of claim 1.

4. The coding gene according to claim 3, wherein the sequence of the gene is shown in SEQ ID NO:53, SEQ ID NO:55 or SEQ ID NO:61.

5. A vector comprising the coding gene of claim 3.

6. A host cell, characterized in that the host cell contains the coding gene of claim 3.

7. Use of the host cell according to claim 6 in the production of a mismatch binding protein.

8. A method for preparing a mismatch binding protein, characterized in that the method comprises the following steps:

a. culturing the host cell of claim 6 to produce said mismatch binding protein; and

b. Isolating the mismatch binding protein from the culture medium.

9. The use of the mismatch binding protein of claim 1 in identifying and binding DNA mismatches.

10. The application according to claim 9, wherein the mismatch binding protein is used in a free liquid state to identify and bind mismatched DNA, or is used in combination with EGFP and CBM to immobilize the mismatch binding protein on the column Used to recognize and bind mismatched DNA.

11. A method for transforming a wild-type mismatch binding protein to improve thermal stability, characterized in that the method comprises the following steps:

a. comparing the amino acid sequence of the wild-type mismatch binding protein with the amino acid sequence shown in SEQ ID NO:52;

b. Transform the coding sequence of the wild-type mismatch binding protein so that the mutation corresponding to the amino acid sequence shown in SEQ ID NO:52 in the coded amino acid sequence is selected from one of the following groups:

1) Both serine at position 120 and position 151 are mutated to cysteine,

2) Both leucine at position 157 and glycine at position 233 are mutated to cysteine, or

3) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine;

c. Directly transfecting the coding sequence obtained in b into a suitable host cell or introducing a vector into a suitable host cell;

d. culturing the host cells obtained in c;

e. isolating the mismatch binding protein produced by the host cell from the culture system obtained in step d; and

f. Determining the thermal stability of the mismatch binding protein.