[go: up one dir, main page]

CN110938610B - A kind of transposase mutant, fusion protein, its preparation method and application - Google Patents

A kind of transposase mutant, fusion protein, its preparation method and application Download PDF

Info

Publication number
CN110938610B
CN110938610B CN201911295765.1A CN201911295765A CN110938610B CN 110938610 B CN110938610 B CN 110938610B CN 201911295765 A CN201911295765 A CN 201911295765A CN 110938610 B CN110938610 B CN 110938610B
Authority
CN
China
Prior art keywords
leu
ala
glu
ser
arg
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911295765.1A
Other languages
Chinese (zh)
Other versions
CN110938610A (en
Inventor
张志毅
章瑞程
贺婷
杜翔斐
钟淑瑶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Fapon Biotech Co Ltd
Original Assignee
Guangdong Fapon Biotech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Fapon Biotech Co Ltd filed Critical Guangdong Fapon Biotech Co Ltd
Priority to CN201911295765.1A priority Critical patent/CN110938610B/en
Publication of CN110938610A publication Critical patent/CN110938610A/en
Application granted granted Critical
Publication of CN110938610B publication Critical patent/CN110938610B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/70Fusion polypeptide containing domain for protein-protein interaction
    • C07K2319/705Fusion polypeptide containing domain for protein-protein interaction containing a protein-A fusion

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Biomedical Technology (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Wood Science & Technology (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Analytical Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Cell Biology (AREA)
  • Food Science & Technology (AREA)
  • Biophysics (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Plant Pathology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Peptides Or Proteins (AREA)

Abstract

本发明公开了一种转座酶突变体、融合蛋白、其制备方法和应用,涉及生物技术领域,所述转座酶突变体相较于野生型转座酶具有至少一种突变形式,所述野生型转座酶的氨基酸序列如SEQ ID No.1所示,该转座酶突变体的稳定性高,能长期于‑20℃的条件下储存,可以显著降低储存成本,使转座酶或其构建的融合蛋白的使用更方便,简化实验流程。

Figure 201911295765

The invention discloses a transposase mutant, a fusion protein, a preparation method and an application thereof, and relates to the field of biotechnology. The transposase mutant has at least one mutation form compared with a wild-type transposase, and the amino acid sequence of the wild-type transposase is shown in SEQ ID No. 1. The transposase mutant has high stability and can be stored at -20°C for a long time, which can significantly reduce the storage cost, make the use of the transposase or the fusion protein constructed thereof more convenient, and simplify the experimental process.

Figure 201911295765

Description

一种转座酶突变体、融合蛋白、其制备方法和应用A kind of transposase mutant, fusion protein, its preparation method and application

技术领域technical field

本发明涉及生物技术领域,具体而言,涉及一种转座酶突变体、融合蛋白、其制备方法和应用。The present invention relates to the field of biotechnology, in particular, to a transposase mutant, a fusion protein, a preparation method and application thereof.

背景技术Background technique

随着体外转座相关技术的大量应用,新型转座酶的开发和相关应用也越来越多,用转座酶介导的各种新型技术来替代旧技术的案例层出不穷。With the extensive application of in vitro transposition-related technologies, the development and related applications of new transposases are also increasing, and there are endless cases of replacing old technologies with various new technologies mediated by transposases.

转座酶不仅可以应用于一般的二代测序建库过程,而且在多组学研究中的应用也越来越广泛。但现有的转座酶蛋白存在一些缺陷,其蛋白稳定性较差,于-20℃存放时,容易出现活性显著降低或丧失的情况,需要相对苛刻的保存条件,通常将其与经过专门设计序列的核酸共孵育形成复合体来提高其稳定性,这种使用条件上的限制通常提高了实验本身的成本,并且增加了实验整体流程的复杂程度。Transposases can not only be used in the general next-generation sequencing library construction process, but also more and more widely used in multi-omics research. However, the existing transposase proteins have some defects, and their protein stability is poor. When stored at -20 °C, the activity is likely to be significantly reduced or lost, and relatively harsh storage conditions are required. The nucleic acid co-incubation of the sequence increases the stability of the complex to form a complex. This limitation on the use conditions usually increases the cost of the experiment itself and increases the complexity of the overall experimental process.

鉴于此,特提出本发明。In view of this, the present invention is proposed.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于提供转座酶突变体、融合蛋白、分离的核酸、载体、宿主细胞、融合蛋白的制备方法、用于构建测序文库的试剂盒、转座酶突变体或融合蛋白在DNA建库中的应用以及融合蛋白在染色质共沉淀中的应用。The object of the present invention is to provide transposase mutants, fusion proteins, isolated nucleic acids, vectors, host cells, methods for preparing fusion proteins, kits for constructing sequencing libraries, transposase mutants or fusion proteins in DNA construction Applications in libraries and fusion proteins in chromatin co-precipitation.

本发明是这样实现的:The present invention is realized in this way:

第一方面,实施例提供一种转座酶突变体,所述转座酶突变体相较于野生型转座酶具有如下突变中的至少一种:R27T、M56G、Y41R、E190A、K120Q以及E146N;In a first aspect, the embodiments provide a transposase mutant having at least one of the following mutations compared to the wild-type transposase: R27T, M56G, Y41R, E190A, K120Q and E146N ;

所述野生型转座酶的氨基酸序列如SEQ ID No.1所示。The amino acid sequence of the wild-type transposase is shown in SEQ ID No.1.

第二方面,实施例提供一种融合蛋白,其包括有上述转座酶突变体。In a second aspect, the embodiment provides a fusion protein comprising the above-mentioned transposase mutant.

第三方面,实施例提供一种分离的核酸,其编码如前述实施例所述的转座酶突变体或融合蛋白。In a third aspect, the embodiments provide an isolated nucleic acid encoding the transposase mutant or fusion protein as described in the preceding embodiments.

第四方面,实施例提供一种载体,其含有如前述实施例所述的分离的核酸。In a fourth aspect, the embodiments provide a vector comprising the isolated nucleic acid as described in the preceding embodiments.

第五方面,实施例提供一种宿主细胞,其含有如前述实施例所述的载体。In a fifth aspect, the embodiments provide a host cell comprising the vector described in the preceding embodiments.

第六方面,实施例提供制备转座酶突变体或融合蛋白的方法,其包括培养如前述实施例所述的宿主细胞。In a sixth aspect, the Examples provide a method of making a transposase mutant or fusion protein, comprising culturing the host cell as described in the preceding examples.

第七方面,实施例提供一种用于构建测序文库的试剂盒,其包括有如前述实施例所述的转座酶突变体或融合蛋白。In a seventh aspect, the embodiment provides a kit for constructing a sequencing library, which includes the transposase mutant or fusion protein as described in the preceding embodiments.

第八方面,实施例提供一种前述实施例所述的转座酶突变体或融合蛋白在DNA建库中的应用。In an eighth aspect, the embodiment provides an application of the transposase mutant or fusion protein described in the preceding embodiment in DNA library construction.

第九方面,实施例提供前述实施例所述的转座酶突变体或融合蛋白在分析蛋白与染色质的相互作用中的应用。In a ninth aspect, the examples provide applications of the transposase mutants or fusion proteins described in the preceding examples in analyzing the interaction between proteins and chromatin.

本发明具有以下有益效果:The present invention has the following beneficial effects:

本发明实施例提供的一种转座酶突变体,该转座酶突变体的稳定性高,能够长期于-20℃的条件下储存,可以显著降低转座酶的储存成本,使转座酶或其构建的融合蛋白的使用更方便,简化实验流程。The embodiment of the present invention provides a transposase mutant, the transposase mutant has high stability, can be stored at -20°C for a long time, can significantly reduce the storage cost of the transposase, and make the transposase The use of the fusion protein or the constructed fusion protein is more convenient and the experimental process is simplified.

附图说明Description of drawings

为了更清楚地说明本发明实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本发明的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings used in the embodiments. It should be understood that the following drawings only show some embodiments of the present invention, and therefore do not It should be regarded as a limitation of the scope, and for those of ordinary skill in the art, other related drawings can also be obtained according to these drawings without any creative effort.

图1为本发明验证例1中转座酶1与野生型转座酶的稳定性检测结果;Fig. 1 is the stability detection result of transposase 1 and wild-type transposase in verification example 1 of the present invention;

图2为本发明验证例3中不同融合蛋白和多种酶标抗体结合的效果。FIG. 2 shows the binding effect of different fusion proteins and various enzyme-labeled antibodies in Validation Example 3 of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将对本发明实施例中的技术方案进行清楚、完整地描述。实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,均为可以通过市售购买获得的常规产品。In order to make the objectives, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be described clearly and completely below. If the specific conditions are not indicated in the examples, it is carried out according to the conventional conditions or the conditions suggested by the manufacturer. The reagents or instruments used without the manufacturer's indication are conventional products that can be purchased from the market.

本发明通过对野生型Tn5序列进行功能、蛋白结构和关键位点进行分析,并设计了大量突变进行稳定性改造,最终获得如下具有稳定性的转座酶突变体,并且该转座酶突变体与蛋白A构建成融合蛋白后也不影响其稳定性。The present invention analyzes the function, protein structure and key sites of the wild-type Tn5 sequence, and designs a large number of mutations for stability transformation, and finally obtains the following stable transposase mutant, and the transposase mutant It does not affect its stability after it is constructed into a fusion protein with protein A.

本发明实施例提供了一种转座酶突变体,所述转座酶突变体相较于野生型转座酶具有如下突变中的至少一种或多种的组合:R27T、M56G、Y41R、E190A、K120Q以及E146N;所述野生型转座酶的氨基酸序列如SEQ ID No.1所示。该转座酶突变体的稳定性高,能显著降低储存成本,简化实验流程。The embodiment of the present invention provides a transposase mutant, the transposase mutant has at least one or a combination of the following mutations compared to the wild-type transposase: R27T, M56G, Y41R, E190A , K120Q and E146N; the amino acid sequence of the wild-type transposase is shown in SEQ ID No.1. The transposase mutant has high stability, can significantly reduce storage costs, and simplify experimental procedures.

需要说明的是,上述“如下突变中的至少一种或多种的组合”选自:上述6种突变中的任意1种突变、任意2种突变、任意3种突变、任意4种突变、任意5种突变以及以上6种突变的组合。It should be noted that the above-mentioned “combination of at least one or more of the following mutations” is selected from: any one of the six mutations, any two mutations, any three mutations, any four mutations, any 5 mutations and combinations of the above 6 mutations.

在可选实施方式中,所述转座酶突变体相较于所述野生型转座酶具有如下(1)~(3)所示突变组合中的任意一种;In an optional embodiment, the transposase mutant has any one of the following combinations of mutations (1) to (3) compared to the wild-type transposase;

(1)R27T和M56G;(1) R27T and M56G;

(2)Y41R和M56G;(2) Y41R and M56G;

(3)K120Q和E146N。(3) K120Q and E146N.

在可选实施方式中,所述转座酶突变体相较于野生型转座酶还具有突变E54K和/或K120L。需要说明的是“还具有突变E54K和/或K120L”是指:在转座酶突变体相较于野生型转座酶具有上述6种突变中的任意1种突变、任意2种突变、任意3种突变、任意4种突变、任意5种突变以及以上6种突变的组合的基础上,还具有突变E54K和/或K120L。In alternative embodiments, the transposase mutant also has mutations E54K and/or K120L compared to the wild-type transposase. It should be noted that "it also has mutations E54K and/or K120L" means that the transposase mutant has any one of the above six mutations, any two mutations, any three On the basis of any kind of mutation, any 4 kinds of mutations, any 5 kinds of mutations and the combination of the above 6 kinds of mutations, it also has mutations E54K and/or K120L.

在可选实施方式中,所述转座酶突变体相较于野生型转座酶具有如下(4)~(5)所示突变组合中的任意一种:In an optional embodiment, the transposase mutant has any one of the following combinations of mutations (4) to (5) compared to the wild-type transposase:

(4)Y41R、E54K和M56G;(4) Y41R, E54K and M56G;

(5)K120L和E190A。(5) K120L and E190A.

相对于其他突变组合,具有上述(1)~(5)中任意突变的转座酶突变体的稳定性得到进一步提高。需要说明的是,“转座酶突变体相较于野生型转座酶具有(1)所述突变组合”是指,转座酶突变体的氨基酸序列为:野生型转座酶的氨基酸序列上第27位的精氨酸突变为苏氨酸的同时,野生型转座酶的氨基酸序列上第56位的蛋氨酸突变为甘氨酸,(2)~(5)以此类推。Compared with other mutation combinations, the stability of the transposase mutant having any of the above-mentioned mutations (1) to (5) is further improved. It should be noted that "the transposase mutant has (1) the combination of mutations described in the wild-type transposase" means that the amino acid sequence of the transposase mutant is: on the amino acid sequence of the wild-type transposase When the 27th arginine is mutated to threonine, the 56th methionine in the amino acid sequence of the wild-type transposase is mutated to glycine, and so on.

本发明实施例提供了一种融合蛋白,其包括前述任一实施方式所述的转座酶突变体。The embodiment of the present invention provides a fusion protein comprising the transposase mutant described in any one of the foregoing embodiments.

在可选实施方式中,所述融合蛋白还包括有用于结合免疫球蛋白Fc段的结合蛋白,所述结合蛋白与所述转座酶突变体相连。In an alternative embodiment, the fusion protein further comprises a binding protein for binding to the Fc fragment of an immunoglobulin, and the binding protein is linked to the transposase mutant.

在可选实施方式中,所述结合蛋白包括金黄色葡萄球菌A蛋白(Protein A,PA)或其变体、链球菌G蛋白(Protein G)或其变体和链球菌L蛋白(Protein L)或其变体中的至少一种。In alternative embodiments, the binding proteins include Staphylococcus aureus Protein A (Protein A, PA) or a variant thereof, Streptococcus Protein G (Protein G) or a variant thereof, and Streptococcus Protein L (Protein L) or at least one of its variants.

在可选实施方式中,所述结合蛋白包括金黄色葡萄球菌A蛋白或其变体。In alternative embodiments, the binding protein comprises Staphylococcus aureus protein A or a variant thereof.

在可选实施方式中,所述结合蛋白的氨基酸序列如SEQ ID No.2或3所示。In an alternative embodiment, the amino acid sequence of the binding protein is shown in SEQ ID No. 2 or 3.

在可选实施方式中,所述结合蛋白的氨基酸序列位于所述融合蛋白的氨基酸序列的N端,所述转座酶突变体的氨基酸序列位于所述融合蛋白的氨基酸序列的C端。In an alternative embodiment, the amino acid sequence of the binding protein is located at the N-terminus of the amino acid sequence of the fusion protein, and the amino acid sequence of the transposase mutant is located at the C-terminus of the amino acid sequence of the fusion protein.

在可选实施例方式中,所述结合蛋白与所述转座酶突变体通过连接肽连接,融合蛋白的结构为:结合蛋白-连接肽-转座酶突变体。In an optional embodiment, the binding protein and the transposase mutant are linked through a linking peptide, and the fusion protein has the structure: binding protein-linking peptide-transposase mutant.

在可选实施例方式中,所述连接肽为刚性连接肽或柔性连接肽。In alternative embodiments, the linker peptide is a rigid linker peptide or a flexible linker peptide.

在可选实施例方式中,所述连接肽为柔性连接肽。In an alternative embodiment, the linker peptide is a flexible linker peptide.

在可选实施例方式中,所述连接肽的长度为6~59个氨基酸。In an alternative embodiment, the length of the linking peptide is 6-59 amino acids.

在可选实施例方式中,所述连接肽的长度为20~35个氨基酸。In an alternative embodiment, the linker peptide is 20-35 amino acids in length.

在可选实施方式中,所述连接肽的序列如SEQ ID No.4所示。In an alternative embodiment, the sequence of the linking peptide is shown in SEQ ID No.4.

本发明实施例提供了一种分离的核酸,其编码如前述任一实施方式所述的转座酶突变体或如前述任一实施方式所述的融合蛋白。The embodiments of the present invention provide an isolated nucleic acid encoding the transposase mutant according to any of the foregoing embodiments or the fusion protein according to any of the foregoing embodiments.

本发明实施例提供了一种载体,其含有如前述实施例所述的分离的核酸。Embodiments of the present invention provide a vector containing the isolated nucleic acid as described in the preceding embodiments.

本发明实施例提供了一种宿主细胞,其含有前述实施例所述的载体。The embodiment of the present invention provides a host cell, which contains the vector described in the previous embodiment.

本发明实施例提供了一种制备前述任一实施例所述的转座酶突变体或融合蛋白的方法,其包括培养如前述任一实施例所述的宿主细胞。The embodiment of the present invention provides a method for preparing the transposase mutant or fusion protein described in any of the foregoing embodiments, which comprises culturing the host cell described in any of the foregoing embodiments.

本发明实施例提供了一种用于构建测序文库的试剂盒,其包括如前述任一实施方式所述的转座酶突变体或如前述任一实施方式所述的融合蛋白。The embodiments of the present invention provide a kit for constructing a sequencing library, which includes the transposase mutant described in any of the foregoing embodiments or the fusion protein described in any of the foregoing embodiments.

在可选实施方式中,所述试剂盒还包括Tn5转座酶缓冲液和/或PCR反应液。In an optional embodiment, the kit further includes Tn5 transposase buffer and/or PCR reaction solution.

本发明实施例提供了如前述任一实施方式所述的转座酶突变体或如前述任一实施方式所述的融合蛋白在DNA建库中的应用。The embodiment of the present invention provides the application of the transposase mutant according to any of the foregoing embodiments or the fusion protein according to any of the foregoing embodiments in DNA library construction.

在可选实施方式中,所述应用包括将所述转座酶突变体或所述融合蛋白中的转座酶突变体进行接头(Adapter)包埋,形成Tn5转座体,将得到的转座体与目的基因混合共孵育,激活转座体以片段化目的基因,得到两端带有接头的DNA文库。In an optional embodiment, the application includes embedding the transposase mutant or the transposase mutant in the fusion protein with an adapter to form a Tn5 transposome, and the obtained transposase The transposome is mixed and incubated with the target gene, and the transposome is activated to fragment the target gene to obtain a DNA library with adapters at both ends.

在可选实施方式中,还包括采用PCR技术对片段后的目的基因进行PCR扩增。In an optional embodiment, it also includes using PCR technology to amplify the target gene after the fragment by PCR.

本发明实施例提供了如前述任一实施方式所述的转座酶突变体或如前述任一实施方式所述的融合蛋白在分析蛋白与染色质的相互作用中的应用。The embodiment of the present invention provides the application of the transposase mutant according to any of the foregoing embodiments or the fusion protein according to any of the foregoing embodiments in analyzing the interaction between proteins and chromatin.

在可选实施方式中,所述应用包括孵育含有待分析的目标蛋白、目标染色质、以及所述转座酶突变体的混合物体系。In an alternative embodiment, the application comprises incubating a mixture containing the target protein to be analyzed, the target chromatin, and the transposase mutant.

在可选实施方式中,所述混合物体系还包括目标蛋白的抗体,所述转座酶为结合有用于结合免疫球蛋白Fc段的结合蛋白的所述融合蛋白,所述应用包括先将目标蛋白、目标染色质与目标蛋白的抗体结合,再加入融合蛋白共孵育。In an optional embodiment, the mixture system further comprises an antibody to a target protein, the transposase is the fusion protein bound with a binding protein for binding to the Fc segment of an immunoglobulin, and the application comprises firstly combining the target protein , The target chromatin is combined with the antibody of the target protein, and then the fusion protein is added for co-incubation.

在混合体系中,目标蛋白与目标染色质结合,目标蛋白的抗体与目标蛋白结合,融合蛋白中的结合蛋白与目标蛋白的抗体的Fc段结合,融合蛋白中的转座酶突变体激活后,用于切割目标染色质,得到片段化的目标序列,用于测序。In the mixed system, the target protein binds to the target chromatin, the antibody of the target protein binds to the target protein, the binding protein in the fusion protein binds to the Fc segment of the antibody of the target protein, and after the transposase mutant in the fusion protein is activated, It is used to cut the target chromatin to obtain the fragmented target sequence for sequencing.

在可选实施方式中,采用金属离子激活转座酶突变体,金属离子优选为钙离子或镁离子。In an alternative embodiment, the transposase mutant is activated using a metal ion, preferably calcium or magnesium.

以下结合实施例对本发明的特征和性能作进一步的详细描述。The features and performances of the present invention will be further described in detail below in conjunction with the embodiments.

实施例1Example 1

本实施例提供了5种转座酶突变体,具体如下。This example provides five transposase mutants, as follows.

转座酶1的氨基酸序列如SEQ ID No.5所示,核酸序列如SEQ ID No.6所示;The amino acid sequence of transposase 1 is shown in SEQ ID No.5, and the nucleic acid sequence is shown in SEQ ID No.6;

转座酶2的氨基酸序列如SEQ ID No.7所示,核酸序列如SEQ ID No.8所示;The amino acid sequence of transposase 2 is shown in SEQ ID No.7, and the nucleic acid sequence is shown in SEQ ID No.8;

转座酶3的氨基酸序列如SEQ ID No.9所示,核酸序列如SEQ ID No.10所示;The amino acid sequence of transposase 3 is shown in SEQ ID No.9, and the nucleic acid sequence is shown in SEQ ID No.10;

转座酶4的氨基酸序列如SEQ ID No.11所示,核酸序列如SEQ ID No.12所示;The amino acid sequence of transposase 4 is shown in SEQ ID No.11, and the nucleic acid sequence is shown in SEQ ID No.12;

转座酶5的氨基酸序列如SEQ ID No.13所示,核酸序列如SEQ ID No.14所示。The amino acid sequence of transposase 5 is shown in SEQ ID No.13, and the nucleic acid sequence is shown in SEQ ID No.14.

实施例2Example 2

本实施例提供一种融合蛋白,该融合蛋白包括有实施例1提供的转座酶突变体、用于结合免疫球蛋白Fc段的结合蛋白以及连接肽,结合蛋白与转座酶的氨基酸序列通过连接肽连接,该融合蛋白的结构为:结合蛋白-连接肽-转座酶突变体。This example provides a fusion protein, which includes the transposase mutant provided in Example 1, a binding protein for binding to the Fc segment of an immunoglobulin, and a linking peptide. The amino acid sequence of the binding protein and the transposase passes through The connecting peptide is connected, and the structure of the fusion protein is: binding protein-connecting peptide-transposase mutant.

其中,结合蛋白为蛋白A(PA),其包括有PA1和PA2。Among them, the binding protein is protein A (PA), which includes PA1 and PA2.

具体地,PA1的序列如SEQ ID No.2所示,PA2的序列如SEQ ID No.3所示,连接肽的序列如SEQ ID No.4所示。Specifically, the sequence of PA1 is shown in SEQ ID No. 2, the sequence of PA2 is shown in SEQ ID No. 3, and the sequence of the connecting peptide is shown in SEQ ID No. 4.

融合蛋白的制备方法如下。The preparation method of the fusion protein is as follows.

将上述转座酶突变体(实施例1中的转座酶1~5)和野生型转座酶Tn5,分别与PA1和PA2进行融合蛋白构建,所有序列通过人工基因合成,并将上述序列的片段分别连接至质粒载体上,用于构建表达载体,分别获得融合蛋白1~12。The above-mentioned transposase mutants (transposases 1 to 5 in Example 1) and wild-type transposase Tn5 were respectively constructed as fusion proteins with PA1 and PA2. All sequences were synthesized by artificial genes, and the above sequences were synthesized. The fragments were respectively connected to plasmid vectors for constructing expression vectors, and fusion proteins 1-12 were obtained respectively.

融合蛋白1:PA1-连接肽-转座酶1;Fusion protein 1: PA1-linking peptide-transposase 1;

融合蛋白2:PA1-连接肽-转座酶2;Fusion protein 2: PA1-connecting peptide-transposase 2;

融合蛋白3:PA1-连接肽-转座酶3;Fusion protein 3: PA1-linking peptide-transposase 3;

融合蛋白4:PA1-连接肽-转座酶4;Fusion protein 4: PA1-linking peptide-transposase 4;

融合蛋白5:PA1-连接肽-转座酶5;Fusion protein 5: PA1-connecting peptide-transposase 5;

融合蛋白6:PA1-连接肽-野生型转座酶Tn5;Fusion protein 6: PA1-connecting peptide-wild-type transposase Tn5;

融合蛋白7:PA2-连接肽-转座酶1;Fusion protein 7: PA2-linking peptide-transposase 1;

融合蛋白8:PA2-连接肽-转座酶2;Fusion protein 8: PA2-linking peptide-transposase 2;

融合蛋白9:PA2-连接肽-转座酶3;Fusion protein 9: PA2-linking peptide-transposase 3;

融合蛋白10:PA2-连接肽-转座酶4;Fusion protein 10: PA2-linking peptide-transposase 4;

融合蛋白11:PA2-连接肽-转座酶5;Fusion protein 11: PA2-linking peptide-transposase 5;

融合蛋白12:PA2-连接肽-野生型转座酶Tn5。Fusion protein 12: PA2-linking peptide-wild-type transposase Tn5.

将所构建的表达载体转化到感受态大肠杆菌表达菌株内,将表达菌株添加至1mLLB培养基中,并在37℃培养30min,培养后取200μl菌液涂布于含卡那霉素的LB固体平板上,置于37℃恒温箱中过夜培养16h,并挑选白色菌落的单克隆菌株接种于3ml含有卡那霉素的液体LB培养基中,置于恒温摇床中以37℃过夜培养16h。The constructed expression vector was transformed into a competent Escherichia coli expression strain, the expression strain was added to 1 mL of LB medium, and cultured at 37°C for 30 min. After culturing, 200 μl of bacterial solution was taken and coated on LB solid containing kanamycin. On the plate, placed in a 37 °C incubator overnight for 16 h, and the monoclonal strains of white colonies were selected and inoculated into 3 ml of liquid LB medium containing kanamycin, and placed in a constant temperature shaker for 16 h overnight at 37 °C.

次日取培养后的菌液,在500ml的三角培养瓶中按照1:100的比例接入到200ml含有卡那霉素的液体LB培养基中,并在37℃恒温摇床中培养,在OD600达到0.6左右时,加入终浓度为0.25mM的IPTG进行诱导,并在18℃过夜培养16h。对于发酵得到的表达目的蛋白的基因工程菌菌液,分别取500ml使用低温高速离心机进行离心(4℃,12000rpm,5min),弃去上清,对收集到的菌体分别加入50ml裂解缓冲液(50mM HEPES-KOH at PH 7.5,800mM NaCl,1mM EDTA,0.25%Tween-20)重悬菌体,对菌体进行超声破碎(30s超声破碎,10次),在相同条件下再次离心,收集离心得到的上清。对上清进行纯化以得到纯度高于90%的目的蛋白(融合蛋白)。The next day, the cultured bacterial liquid was taken and placed in a 500ml triangular flask at a ratio of 1:100 to 200ml of liquid LB medium containing kanamycin, and cultured in a constant temperature shaker at 37°C at OD600 When it reached about 0.6, IPTG with a final concentration of 0.25 mM was added for induction, and cultured at 18°C overnight for 16 h. For the genetically engineered bacteria liquid expressing the target protein obtained by fermentation, take 500 ml of centrifuge (4°C, 12000 rpm, 5 min) with a low-temperature high-speed centrifuge, discard the supernatant, and add 50 ml of lysis buffer to the collected cells. (50mM HEPES-KOH at PH 7.5, 800mM NaCl, 1mM EDTA, 0.25% Tween-20) to resuspend the cells, sonicate the cells (30s sonication, 10 times), centrifuge again under the same conditions, collect the centrifuge obtained supernatant. The supernatant was purified to obtain the target protein (fusion protein) with a purity higher than 90%.

验证例1Verification Example 1

验证实施例1提供的转座酶突变体的稳定性。The stability of the transposase mutant provided in Example 1 was verified.

分别取3×100μl实施例1提供的转座酶1~5和野生型转座酶至1.5mL的EP管中,对照设置8组试验例,试验例1~8的转座酶突变体相对于野生型转座酶而言依次具有以下突变:R27T、M56G、Y41R、E54K、K120L、E190A、K120Q以及E146N;Take 3 × 100 μl of transposases 1 to 5 and wild-type transposase provided in Example 1 into a 1.5 mL EP tube respectively, and set up 8 groups of test examples as a control. For the wild-type transposase, it has the following mutations in order: R27T, M56G, Y41R, E54K, K120L, E190A, K120Q and E146N;

然后将转座酶1~5、野生型转座酶以及试验例1~8提供的转座酶突变体分别置于25℃、4℃、-20℃环境中持续7天,以进行目的蛋白的稳定性检测。Then, transposases 1 to 5, wild-type transposase, and transposase mutants provided in test examples 1 to 8 were placed at 25 °C, 4 °C, and -20 °C for 7 days, respectively, to carry out the target protein. Stability check.

当目的蛋白在各考核温度条件下放置7天后,对其进行DNA片段化活性的检测,检测方法如下。When the target protein was placed for 7 days under the conditions of each examination temperature, the DNA fragmentation activity was detected, and the detection method was as follows.

制备Adapter MixPrepare Adapter Mix

将100μM的Primer M分别与100μM Primer T1、100μM Primer T2等体积混匀后退火,然后将M-T1和M-T2的退火产物等体积混合均匀,所得混合物即为Adapter Mix。Primer序列如下:Mix 100 μM Primer M with equal volumes of 100 μM Primer T1 and 100 μM Primer T2, respectively, and then anneal them, then mix the annealed products of M-T1 and M-T2 in equal volumes, and the resulting mixture is the Adapter Mix. The Primer sequence is as follows:

Primer M:5’-pCTGTCTCTTATACACATCT-3’(p指磷酸化修饰);Primer M: 5'-pCTGTCTCTTATACACATCT-3' (p refers to phosphorylation modification);

Primer T1:5’-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-3’;Primer T1: 5'-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-3';

Primer T2:5’-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-3’;Primer T2: 5'-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-3';

制备转座体Preparation of transposomes

将30μM样品(转座酶1~5以及野生型转座酶)分别与60μM退火的Adapter Mix在Coupling buffer(50mM Tris-Ac at PH8.2,400mM Nacl,0.1mM EDTA,0.25%Tween-20,25%甘油)中混匀置于室温孵育1h,得到转座体。30 μM samples (transposase 1-5 and wild-type transposase) were respectively mixed with 60 μM annealed Adapter Mix in Coupling buffer (50 mM Tris-Ac at PH8.2, 400 mM NaCl, 0.1 mM EDTA, 0.25% Tween-20, 25% glycerol) was mixed and incubated at room temperature for 1 h to obtain transposomes.

DNA片段化DNA fragmentation

取1μl上述转座体,加入到含有2×Tnp buffer(50mM TAPS at PH7.9,5mM MgCl2,2mM海藻糖、0.1%NP-40)以及200ng人基因组的体系中,并补加水至20μl,然后在热循环仪中以55℃孵育10min(Lid 99℃),取反应产物10μl,通过琼脂糖凝胶电泳检测片段化产物的分布范围,结果请参照图1和表1。Take 1 μl of the above transposome, add it to the system containing 2×Tnp buffer (50mM TAPS at PH7.9, 5mM MgCl 2 , 2mM trehalose, 0.1% NP-40) and 200ng human genome, and add water to 20 μl, Then incubate at 55 °C for 10 min in a thermal cycler (Lid 99 °C), take 10 μl of the reaction product, and detect the distribution range of the fragmented products by agarose gel electrophoresis. Please refer to Figure 1 and Table 1 for the results.

表1转座酶的稳定性结果Table 1 Transposase stability results

Figure BDA0002320484200000061
Figure BDA0002320484200000061

Figure BDA0002320484200000071
Figure BDA0002320484200000071

备注:1、+++表示片段化程度高,条带弥散主要在100-500kb,无2000kb以上的片段;Remarks: 1. +++ indicates a high degree of fragmentation, the band dispersion is mainly 100-500kb, and there are no fragments above 2000kb;

2、++表示片段化程度高,条带弥散主要在500-1000kb,少量2000kb以上片段;2. ++ indicates a high degree of fragmentation, the band dispersion is mainly 500-1000kb, and a small amount of fragments above 2000kb;

3、+表示片段化程度低,条带主要弥散在2000kb以上;3. + means that the degree of fragmentation is low, and the bands are mainly scattered above 2000kb;

4、-表示基本无法片段化,条带未呈现弥散状。4. - means that it is basically impossible to fragment, and the bands are not diffused.

图1为转座酶1与野生型转座酶的稳定性检测结果,其中,1~3为野生型转座酶分别在-20℃、-4℃、25℃的条件下储存7天的检测结果,4~6为转座酶1分别在-20℃、-4℃、25℃的条件下储存7天的结果。Figure 1 shows the stability test results of transposase 1 and wild-type transposase, wherein 1 to 3 are the detection of wild-type transposase stored at -20°C, -4°C, and 25°C for 7 days, respectively. As a result, 4 to 6 are the results of storing transposase 1 under conditions of -20°C, -4°C, and 25°C for 7 days, respectively.

由表1可知,试验例中的单突变K120Q、E146N、E190A、Y41R、M56G以及R27T相对于野生型转座酶而言,稳定性得到了提高;It can be seen from Table 1 that the stability of the single mutations K120Q, E146N, E190A, Y41R, M56G and R27T in the test example has been improved compared to the wild-type transposase;

结合表1和图1,本发明实施例1提供的转座酶突变体的稳定性显著优于野生型和试验例1~8中的转座酶突变体(单突变),其可以于-20℃的条件下进行长期储存,甚至可以在4℃条件下进行短期储存,避免了存放于-80℃条件下导致的取用不便的情况,更无需将其构建为转座体进行储存,极大地提高了使用和体系调整的便利性。Combined with Table 1 and Figure 1, the stability of the transposase mutant provided in Example 1 of the present invention is significantly better than that of the wild type and the transposase mutants (single mutation) in Test Examples 1 to 8, which can be in -20 Long-term storage at ℃, and even short-term storage at 4℃, avoids the inconvenience caused by storage at -80℃, and there is no need to construct it as a transposome for storage, which greatly improves the efficiency of storage. Improve the convenience of use and system adjustment.

验证例2Verification example 2

对实施例2制备得到的融合蛋白进行稳定性检测。The stability of the fusion protein prepared in Example 2 was tested.

检测方法同验证例1。The detection method is the same as the verification example 1.

检测结果请参照表2。Please refer to Table 2 for the test results.

表2稳定性评价结果Table 2 Stability evaluation results

Figure BDA0002320484200000072
Figure BDA0002320484200000072

Figure BDA0002320484200000081
Figure BDA0002320484200000081

由表2可知,由转座酶突变体构建的融合蛋白的稳定性显著高于由野生型转座酶构建的融合蛋白的稳定性。It can be seen from Table 2 that the stability of the fusion protein constructed by the transposase mutant is significantly higher than that of the fusion protein constructed by the wild-type transposase.

验证例3Verification example 3

验证实施例2制备得到的融合蛋白结合抗体的能力。The ability of the fusion protein prepared in Example 2 to bind to the antibody was verified.

使用酶联免疫反应(ELISA)原理检测不同融合蛋白与多种酶标抗体(IgG)结合的能力,具体检测流程如下。The enzyme-linked immune reaction (ELISA) principle was used to detect the ability of different fusion proteins to bind to various enzyme-labeled antibodies (IgG). The specific detection process is as follows.

用稀释液将实施例2提供的融合蛋白1~12均稀释至统一浓度(0.8nmol/ml即0.6mg/ml)。将稀释好的融合蛋白各100μl加入酶标板的对应孔中,每个样品均设置24个平行组,同时有3个反应孔对应一种酶标抗体以保证数据准确性,同时设置阴性对照并以ddH2O代替,封板后放置37℃孵育30min;移去孔中液体并用洗液洗涤4次并彻底去除孔中残留的洗液,然后向每个孔中加入250μl封闭液,封板后置于37℃孵育1h,用稀释液将多种酶标抗体(鼠、兔、犬、羊、牛、马、人、猪)稀释至(0.4nmol/ml),分别向对应的反应孔中加入250μl封板后置于37℃孵育30min;移去孔中液体并用洗液洗涤4次并彻底去除孔中残留的洗液,然后向每个孔中加入100μl显色液,封板后置于室温(20-25℃)避光孵育15min,然后向每孔中加入50μl终止液,终止后立即用酶标仪测定OD450数值。The fusion proteins 1-12 provided in Example 2 were all diluted to a uniform concentration (0.8 nmol/ml, ie 0.6 mg/ml) with a diluent. Add 100 μl of the diluted fusion protein to the corresponding wells of the ELISA plate. Each sample is set up with 24 parallel groups. At the same time, 3 reaction wells correspond to one enzyme-labeled antibody to ensure the accuracy of the data. Replace with ddH 2 O, place the plate to incubate at 37°C for 30 min after sealing; remove the liquid in the wells and wash with washing solution 4 times to completely remove the residual washing solution in the wells, then add 250 μl blocking solution to each well, and then seal the plate. Incubate at 37°C for 1h, dilute various enzyme-labeled antibodies (mouse, rabbit, dog, sheep, cattle, horse, human, pig) to (0.4nmol/ml) with diluent, and add them to the corresponding reaction wells respectively. 250 μl of the plate was sealed and incubated at 37°C for 30 min; the liquid in the well was removed and washed 4 times with washing solution and the residual washing solution in the well was completely removed, then 100 μl of chromogenic solution was added to each well, and the plate was placed at room temperature after sealing. (20-25°C) Incubate in the dark for 15 min, then add 50 μl of stop solution to each well, and measure the OD450 value with a microplate reader immediately after termination.

以OD450数值为纵坐标,以多种酶标抗体类型为横坐标,可以看出不同融合蛋白和多种酶标抗体结合的效果,结果请参见图2。Taking the value of OD450 as the ordinate and the types of various enzyme-labeled antibodies as the abscissa, it can be seen that the binding effect of different fusion proteins and various enzyme-labeled antibodies is shown in Figure 2.

由图2可知,融合蛋白7~12对于不同种类的抗体均能有效结合,可见PA2构建的融合蛋白对于不同物种来源的抗体结合能力没有明显的偏倚,其应用场景相对而言更加广泛。It can be seen from Figure 2 that fusion proteins 7-12 can effectively bind to different types of antibodies. It can be seen that the fusion proteins constructed by PA2 have no obvious bias in the binding ability of antibodies from different species, and their application scenarios are relatively wider.

验证例4Verification Example 4

采用实施例2提供的融合蛋白进行蛋白-染色质互作的分析(ChIP-Seq技术),其包括以下步骤。The protein-chromatin interaction analysis (ChIP-Seq technology) was performed using the fusion protein provided in Example 2, which included the following steps.

制备Adapter MixPrepare Adapter Mix

将100μM的Primer M分别与100μM Primer T1、100μM Primer T2等体积混匀后退火,然后将M-T1和M-T2的退火产物等体积混合均匀,所得混合物即为Adapter Mix,Primer序列如下:Mix 100 μM Primer M with equal volumes of 100 μM Primer T1 and 100 μM Primer T2, respectively, and then anneal them. Then mix the annealed products of M-T1 and M-T2 in equal volumes. The resulting mixture is the Adapter Mix. The Primer sequence is as follows:

Primer M:5’-pCTGTCTCTTATACACATCT-3’(p指磷酸化修饰);Primer M: 5'-pCTGTCTCTTATACACATCT-3' (p refers to phosphorylation modification);

Primer T1:5’-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-3’;Primer T1: 5'-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-3';

Primer T2:5’-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-3’。Primer T2: 5'-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-3'.

制备转座体Preparation of transposomes

将30μM目的蛋白(融合蛋白1~5)分别与60μM混合均匀的Adapter Mix在Couplingbuffer(50mM Tris-Ac at PH8.2,400mM Nacl,0.1mM EDTA,0.25%Tween-20,25%甘油)中混匀置于室温孵育1h,得到转座体。Mix 30 μM target protein (fusion protein 1-5) with 60 μM Adapter Mix in Couplingbuffer (50 mM Tris-Ac at PH8.2, 400 mM NaCl, 0.1 mM EDTA, 0.25% Tween-20, 25% glycerol). Incubate at room temperature for 1 h to obtain transposomes.

DNA片段化DNA fragmentation

取1μl上述转座体,加入到含有2×Tnp buffer(50mM TAPS at PH7.9,5mM MgCl2,2mM海藻糖、0.1%NP-40)、50ng人基因组的体系中,并补加水至20μl,然后在热循环仪中孵育10min(Lid 99℃),向反应后的体系中加入1μl消化缓冲液,然后在热循环仪中以55℃孵育5min(Lid 99℃)进行终止。Take 1 μl of the above transposome, add it to a system containing 2×Tnp buffer (50mM TAPS at PH7.9, 5mM MgCl 2 , 2mM trehalose, 0.1% NP-40), 50ng human genome, and add water to 20 μl, Then incubate in a thermocycler for 10 min (Lid 99°C), add 1 μl of digestion buffer to the reaction system, and then incubate at 55°C in a thermocycler for 5 min (Lid 99°C) to terminate.

随后将反应体系补水至50μl并用1×FP Beads进行纯化,使用FP NM008试剂盒对纯化产物进行扩增,并对扩增产物以0.6×/0.15×的FP Beads进行双轮片段分选,得到最终的文库,对文库用illumina Hiseq测序,测序策略为PE150。测序数据对照见表3。The reaction system was then filled with water to 50 μl and purified with 1×FP Beads. The purified product was amplified using the FP NM008 kit, and the amplified product was subjected to two rounds of fragment sorting with 0.6×/0.15× FP Beads to obtain the final The library was sequenced with illumina Hiseq, and the sequencing strategy was PE150. The sequencing data comparison is shown in Table 3.

表3测序数据对照结果Table 3 Sequencing data comparison results

Figure BDA0002320484200000091
Figure BDA0002320484200000091

由表3可知,本发明实施例提供的融合蛋白均能够用于建库测序,其中,从覆盖度和有效数据率分析可以发现,融合蛋白1和3的效果较优。It can be seen from Table 3 that the fusion proteins provided in the embodiments of the present invention can all be used for library construction and sequencing. Among them, from the analysis of coverage and effective data rate, it can be found that fusion proteins 1 and 3 have better effects.

以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included within the protection scope of the present invention.

SEQUENCE LISTINGSEQUENCE LISTING

<110> 广东菲鹏生物有限公司<110> Guangdong Feipeng Biological Co., Ltd.

<120> 一种转座酶突变体、融合蛋白、其制备方法和应用<120> A kind of transposase mutant, fusion protein, its preparation method and application

<160> 14<160> 14

<170> PatentIn version 3.5<170> PatentIn version 3.5

<210> 1<210> 1

<211> 476<211> 476

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 1<400> 1

Met Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser ValMet Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val

1 5 10 151 5 10 15

Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu ValPhe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu Val

20 25 30 20 25 30

Asn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr IleAsn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr Ile

35 40 45 35 40 45

Ser Ser Glu Gly Ser Glu Ala Met Gln Glu Gly Ala Tyr Arg Phe IleSer Ser Glu Gly Ser Glu Ala Met Gln Glu Gly Ala Tyr Arg Phe Ile

50 55 60 50 55 60

Arg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala MetArg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala Met

65 70 75 8065 70 75 80

Gln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile GluGln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile Glu

85 90 95 85 90 95

Asp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu GlyAsp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu Gly

100 105 110 100 105 110

Lys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His SerLys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His Ser

115 120 125 115 120 125

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu HisVal Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His

130 135 140 130 135 140

Gln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu LysGln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys

145 150 155 160145 150 155 160

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg MetGlu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met

165 170 175 165 170 175

Gly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala AspGly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala Asp

180 185 190 180 185 190

Ile His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe ValIle His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe Val

195 200 205 195 200 205

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr LeuVal Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu

210 215 220 210 215 220

Tyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile SerTyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile Ser

225 230 235 240225 230 235 240

Ile Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn ArgIle Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg

245 250 255 245 250 255

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr LeuPro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr Leu

260 265 270 260 265 270

Lys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile AsnLys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile Asn

275 280 285 275 280 285

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser GluPro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Leu Thr Ser Glu

290 295 300 290 295 300

Pro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr ThrPro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr Thr

305 310 315 320305 310 315 320

His Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly AlaHis Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala

325 330 335 325 330 335

Gly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg MetGly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met

340 345 350 340 345 350

Val Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg GluVal Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg Glu

355 360 365 355 360 365

Ser Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys GluSer Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys Glu

370 375 380 370 375 380

Ala Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro AspAla Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro Asp

385 390 395 400385 390 395 400

Glu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg LysGlu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys

405 410 415 405 410 415

Glu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg LeuGlu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg Leu

420 425 430 420 425 430

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly AlaGly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly Ala

435 440 445 435 440 445

Leu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe LeuLeu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe Leu

450 455 460 450 455 460

Ala Ala Lys Asp Leu Met Ala Gln Gly Ile Lys IleAla Ala Lys Asp Leu Met Ala Gln Gly Ile Lys Ile

465 470 475465 470 475

<210> 2<210> 2

<211> 58<211> 58

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 2<400> 2

Val Asp Asn Lys Phe Asn Lys Glu Gln Gln Asn Ala Phe Tyr Glu IleVal Asp Asn Lys Phe Asn Lys Glu Gln Gln Asn Ala Phe Tyr Glu Ile

1 5 10 151 5 10 15

Leu His Leu Pro Asn Leu Asn Glu Glu Gln Arg Asn Ala Phe Ile GlnLeu His Leu Pro Asn Leu Asn Glu Glu Gln Arg Asn Ala Phe Ile Gln

20 25 30 20 25 30

Ser Leu Lys Asp Asp Pro Ser Gln Ser Ala Asn Leu Leu Ala Glu AlaSer Leu Lys Asp Asp Pro Ser Gln Ser Ala Asn Leu Leu Ala Glu Ala

35 40 45 35 40 45

Lys Lys Leu Asn Asp Ala Gln Ala Pro LysLys Lys Leu Asn Asp Ala Gln Ala Pro Lys

50 55 50 55

<210> 3<210> 3

<211> 57<211> 57

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 3<400> 3

Ala Ser Gln Arg Phe Asn Lys Asp Gln Gln Leu Ala Phe Tyr Glu IleAla Ser Gln Arg Phe Asn Lys Asp Gln Gln Leu Ala Phe Tyr Glu Ile

1 5 10 151 5 10 15

Ile Gln Pro Pro Asn Leu Glu Cys Thr Gln Arg Asn Gly Phe Ile GlnIle Gln Pro Pro Asn Leu Glu Cys Thr Gln Arg Asn Gly Phe Ile Gln

20 25 30 20 25 30

Ser Leu Lys Asp Asp Phe Ser Ser Ser Gly Asn Ile Ile Val Glu AlaSer Leu Lys Asp Asp Phe Ser Ser Ser Ser Gly Asn Ile Ile Val Glu Ala

35 40 45 35 40 45

Lys Lys Ile Asn Ser Thr Gln Ala ProLys Lys Ile Asn Ser Thr Gln Ala Pro

50 55 50 55

<210> 4<210> 4

<211> 27<211> 27

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 4<400> 4

Asp Asp Asp Lys Glu Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly SerAsp Asp Asp Lys Glu Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser

1 5 10 151 5 10 15

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser HisGly Gly Gly Gly Ser Gly Gly Gly Gly Ser His

20 25 20 25

<210> 5<210> 5

<211> 476<211> 476

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 5<400> 5

Met Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser ValMet Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val

1 5 10 151 5 10 15

Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Thr Thr Ala Arg Leu ValPhe Ser Ser Ala Ala Leu Gly Asp Pro Arg Thr Thr Ala Arg Leu Val

20 25 30 20 25 30

Asn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr IleAsn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr Ile

35 40 45 35 40 45

Ser Ser Glu Gly Ser Glu Ala Gly Gln Glu Gly Ala Tyr Arg Phe IleSer Ser Glu Gly Ser Glu Ala Gly Gln Glu Gly Ala Tyr Arg Phe Ile

50 55 60 50 55 60

Arg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala MetArg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala Met

65 70 75 8065 70 75 80

Gln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile GluGln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile Glu

85 90 95 85 90 95

Asp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu GlyAsp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu Gly

100 105 110 100 105 110

Lys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His SerLys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His Ser

115 120 125 115 120 125

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu HisVal Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His

130 135 140 130 135 140

Gln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu LysGln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys

145 150 155 160145 150 155 160

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg MetGlu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met

165 170 175 165 170 175

Gly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala AspGly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala Asp

180 185 190 180 185 190

Ile His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe ValIle His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe Val

195 200 205 195 200 205

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr LeuVal Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu

210 215 220 210 215 220

Tyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile SerTyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile Ser

225 230 235 240225 230 235 240

Ile Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn ArgIle Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg

245 250 255 245 250 255

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr LeuPro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr Leu

260 265 270 260 265 270

Lys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile AsnLys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile Asn

275 280 285 275 280 285

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser GluPro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Leu Thr Ser Glu

290 295 300 290 295 300

Pro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr ThrPro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr Thr

305 310 315 320305 310 315 320

His Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly AlaHis Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala

325 330 335 325 330 335

Gly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg MetGly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met

340 345 350 340 345 350

Val Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg GluVal Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg Glu

355 360 365 355 360 365

Ser Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys GluSer Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys Glu

370 375 380 370 375 380

Ala Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro AspAla Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro Asp

385 390 395 400385 390 395 400

Glu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg LysGlu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys

405 410 415 405 410 415

Glu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg LeuGlu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg Leu

420 425 430 420 425 430

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly AlaGly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly Ala

435 440 445 435 440 445

Leu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe LeuLeu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe Leu

450 455 460 450 455 460

Ala Ala Lys Asp Leu Met Ala Gln Gly Ile Lys IleAla Ala Lys Asp Leu Met Ala Gln Gly Ile Lys Ile

465 470 475465 470 475

<210> 6<210> 6

<211> 1428<211> 1428

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 6<400> 6

atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60

gctctgggtg acccgcgtac caccgctcgt ctggttaacg ttgctgctca gctggctaaa 120gctctgggtg acccgcgtac caccgctcgt ctggttaacg ttgctgctca gctggctaaa 120

tactctggta aatctatcac catctcttct gaaggttctg aagctggtca ggaaggtgct 180tactctggta aatctatcac catctcttct gaaggttctg aagctggtca ggaaggtgct 180

taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240

cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300

ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacaaa 360ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacaaa 360

tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420

ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480

gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540

tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600

ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660

ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720

atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780

gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840

gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900

ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960

caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020

cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080

gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140

ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200

gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260

tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320

accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380

gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428

<210> 7<210> 7

<211> 476<211> 476

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 7<400> 7

Met Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser ValMet Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val

1 5 10 151 5 10 15

Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu ValPhe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu Val

20 25 30 20 25 30

Asn Val Ala Ala Gln Leu Ala Lys Arg Ser Gly Lys Ser Ile Thr IleAsn Val Ala Ala Gln Leu Ala Lys Arg Ser Gly Lys Ser Ile Thr Ile

35 40 45 35 40 45

Ser Ser Glu Gly Ser Glu Ala Gly Gln Glu Gly Ala Tyr Arg Phe IleSer Ser Glu Gly Ser Glu Ala Gly Gln Glu Gly Ala Tyr Arg Phe Ile

50 55 60 50 55 60

Arg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala MetArg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala Met

65 70 75 8065 70 75 80

Gln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile GluGln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile Glu

85 90 95 85 90 95

Asp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu GlyAsp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu Gly

100 105 110 100 105 110

Lys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His SerLys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His Ser

115 120 125 115 120 125

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu HisVal Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His

130 135 140 130 135 140

Gln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu LysGln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys

145 150 155 160145 150 155 160

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg MetGlu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met

165 170 175 165 170 175

Gly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala AspGly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala Asp

180 185 190 180 185 190

Ile His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe ValIle His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe Val

195 200 205 195 200 205

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr LeuVal Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu

210 215 220 210 215 220

Tyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile SerTyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile Ser

225 230 235 240225 230 235 240

Ile Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn ArgIle Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg

245 250 255 245 250 255

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr LeuPro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr Leu

260 265 270 260 265 270

Lys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile AsnLys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile Asn

275 280 285 275 280 285

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser GluPro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Leu Thr Ser Glu

290 295 300 290 295 300

Pro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr ThrPro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr Thr

305 310 315 320305 310 315 320

His Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly AlaHis Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala

325 330 335 325 330 335

Gly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg MetGly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met

340 345 350 340 345 350

Val Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg GluVal Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg Glu

355 360 365 355 360 365

Ser Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys GluSer Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys Glu

370 375 380 370 375 380

Ala Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro AspAla Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro Asp

385 390 395 400385 390 395 400

Glu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg LysGlu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys

405 410 415 405 410 415

Glu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg LeuGlu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg Leu

420 425 430 420 425 430

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly AlaGly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly Ala

435 440 445 435 440 445

Leu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe LeuLeu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe Leu

450 455 460 450 455 460

Ala Ala Lys Asp Leu Met Ala Gln Gly Ile Lys IleAla Ala Lys Asp Leu Met Ala Gln Gly Ile Lys Ile

465 470 475465 470 475

<210> 8<210> 8

<211> 1428<211> 1428

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 8<400> 8

atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60

gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120

cgttctggta aatctatcac catctcttct gaaggttctg aagctggtca ggaaggtgct 180cgttctggta aatctatcac catctcttct gaaggttctg aagctggtca ggaaggtgct 180

taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240

cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300

ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacaaa 360ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacaaa 360

tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420

ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480

gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540

tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600

ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660

ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720

atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780

gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840

gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900

ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960

caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020

cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080

gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140

ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200

gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260

tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320

accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380

gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428

<210> 9<210> 9

<211> 476<211> 476

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 9<400> 9

Met Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser ValMet Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val

1 5 10 151 5 10 15

Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu ValPhe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu Val

20 25 30 20 25 30

Asn Val Ala Ala Gln Leu Ala Lys Arg Ser Gly Lys Ser Ile Thr IleAsn Val Ala Ala Gln Leu Ala Lys Arg Ser Gly Lys Ser Ile Thr Ile

35 40 45 35 40 45

Ser Ser Glu Gly Ser Lys Ala Gly Gln Glu Gly Ala Tyr Arg Phe IleSer Ser Glu Gly Ser Lys Ala Gly Gln Glu Gly Ala Tyr Arg Phe Ile

50 55 60 50 55 60

Arg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala MetArg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala Met

65 70 75 8065 70 75 80

Gln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile GluGln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile Glu

85 90 95 85 90 95

Asp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu GlyAsp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu Gly

100 105 110 100 105 110

Lys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His SerLys Leu Gly Ser Ile Gln Asp Lys Ser Arg Gly Trp Trp Val His Ser

115 120 125 115 120 125

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu HisVal Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His

130 135 140 130 135 140

Gln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu LysGln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys

145 150 155 160145 150 155 160

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg MetGlu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met

165 170 175 165 170 175

Gly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala AspGly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala Asp

180 185 190 180 185 190

Ile His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe ValIle His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe Val

195 200 205 195 200 205

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr LeuVal Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu

210 215 220 210 215 220

Tyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile SerTyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile Ser

225 230 235 240225 230 235 240

Ile Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn ArgIle Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg

245 250 255 245 250 255

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr LeuPro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr Leu

260 265 270 260 265 270

Lys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile AsnLys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile Asn

275 280 285 275 280 285

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser GluPro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Leu Thr Ser Glu

290 295 300 290 295 300

Pro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr ThrPro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr Thr

305 310 315 320305 310 315 320

His Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly AlaHis Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala

325 330 335 325 330 335

Gly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg MetGly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met

340 345 350 340 345 350

Val Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg GluVal Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg Glu

355 360 365 355 360 365

Ser Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys GluSer Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys Glu

370 375 380 370 375 380

Ala Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro AspAla Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro Asp

385 390 395 400385 390 395 400

Glu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg LysGlu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys

405 410 415 405 410 415

Glu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg LeuGlu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg Leu

420 425 430 420 425 430

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly AlaGly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly Ala

435 440 445 435 440 445

Leu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe LeuLeu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe Leu

450 455 460 450 455 460

Ala Ala Lys Asp Leu Met Ala Gln Gly Ile Lys IleAla Ala Lys Asp Leu Met Ala Gln Gly Ile Lys Ile

465 470 475465 470 475

<210> 10<210> 10

<211> 1428<211> 1428

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 10<400> 10

atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60

gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120

cgttctggta aatctatcac catctcttct gaaggttcta aagctggtca ggaaggtgct 180cgttctggta aatctatcac catctcttct gaaggttcta aagctggtca ggaaggtgct 180

taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240

cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300

ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacaaa 360ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacaaa 360

tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420

ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480

gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540

tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600

ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660

ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720

atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780

gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840

gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900

ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960

caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020

cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080

gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140

ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200

gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260

tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320

accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380

gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428

<210> 11<210> 11

<211> 476<211> 476

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 11<400> 11

Met Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser ValMet Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val

1 5 10 151 5 10 15

Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu ValPhe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu Val

20 25 30 20 25 30

Asn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr IleAsn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr Ile

35 40 45 35 40 45

Ser Ser Glu Gly Ser Glu Ala Met Gln Glu Gly Ala Tyr Arg Phe IleSer Ser Glu Gly Ser Glu Ala Met Gln Glu Gly Ala Tyr Arg Phe Ile

50 55 60 50 55 60

Arg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala MetArg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala Met

65 70 75 8065 70 75 80

Gln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile GluGln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile Glu

85 90 95 85 90 95

Asp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu GlyAsp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu Gly

100 105 110 100 105 110

Lys Leu Gly Ser Ile Gln Asp Leu Ser Arg Gly Trp Trp Val His SerLys Leu Gly Ser Ile Gln Asp Leu Ser Arg Gly Trp Trp Val His Ser

115 120 125 115 120 125

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu HisVal Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His

130 135 140 130 135 140

Gln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu LysGln Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys

145 150 155 160145 150 155 160

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg MetGlu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met

165 170 175 165 170 175

Gly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Ala Ala AspGly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Ala Ala Asp

180 185 190 180 185 190

Ile His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe ValIle His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe Val

195 200 205 195 200 205

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr LeuVal Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu

210 215 220 210 215 220

Tyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile SerTyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile Ser

225 230 235 240225 230 235 240

Ile Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn ArgIle Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg

245 250 255 245 250 255

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr LeuPro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr Leu

260 265 270 260 265 270

Lys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile AsnLys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile Asn

275 280 285 275 280 285

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser GluPro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Leu Thr Ser Glu

290 295 300 290 295 300

Pro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr ThrPro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr Thr

305 310 315 320305 310 315 320

His Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly AlaHis Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala

325 330 335 325 330 335

Gly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg MetGly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met

340 345 350 340 345 350

Val Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg GluVal Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg Glu

355 360 365 355 360 365

Ser Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys GluSer Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys Glu

370 375 380 370 375 380

Ala Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro AspAla Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro Asp

385 390 395 400385 390 395 400

Glu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg LysGlu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys

405 410 415 405 410 415

Glu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg LeuGlu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg Leu

420 425 430 420 425 430

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly AlaGly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly Ala

435 440 445 435 440 445

Leu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe LeuLeu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe Leu

450 455 460 450 455 460

Ala Ala Lys Asp Leu Met Ala Gln Gly Ile Lys IleAla Ala Lys Asp Leu Met Ala Gln Gly Ile Lys Ile

465 470 475465 470 475

<210> 12<210> 12

<211> 1428<211> 1428

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 12<400> 12

atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60

gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120

tactctggta aatctatcac catctcttct gaaggttctg aagctatgca ggaaggtgct 180tactctggta aatctatcac catctcttct gaaggttctg aagctatgca ggaaggtgct 180

taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240

cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300

ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacctg 360ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggacctg 360

tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420

ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480ggtctgctgc accaggaatg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480

gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540

tctaacgtta tcgctgtttg cgaccgtgct gctgacatcc acgcttacct gcaggacaaa 600tctaacgtta tcgctgtttg cgaccgtgct gctgacatcc acgcttacct gcaggacaaa 600

ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660

ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720

atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780

gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840

gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900

ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960

caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020

cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080

gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140

ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200

gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260

tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320

accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380

gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428

<210> 13<210> 13

<211> 476<211> 476

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequences

<400> 13<400> 13

Met Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser ValMet Ile Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val

1 5 10 151 5 10 15

Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu ValPhe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu Val

20 25 30 20 25 30

Asn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr IleAsn Val Ala Ala Gln Leu Ala Lys Tyr Ser Gly Lys Ser Ile Thr Ile

35 40 45 35 40 45

Ser Ser Glu Gly Ser Glu Ala Met Gln Glu Gly Ala Tyr Arg Phe IleSer Ser Glu Gly Ser Glu Ala Met Gln Glu Gly Ala Tyr Arg Phe Ile

50 55 60 50 55 60

Arg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala MetArg Asn Pro Asn Val Ser Ala Glu Ala Ile Arg Lys Ala Gly Ala Met

65 70 75 8065 70 75 80

Gln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile GluGln Thr Val Lys Leu Ala Gln Glu Phe Pro Glu Leu Leu Ala Ile Glu

85 90 95 85 90 95

Asp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu GlyAsp Thr Thr Ser Leu Ser Tyr Arg His Gln Val Ala Glu Glu Leu Gly

100 105 110 100 105 110

Lys Leu Gly Ser Ile Gln Asp Gln Ser Arg Gly Trp Trp Val His SerLys Leu Gly Ser Ile Gln Asp Gln Ser Arg Gly Trp Trp Val His Ser

115 120 125 115 120 125

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu HisVal Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His

130 135 140 130 135 140

Gln Asn Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu LysGln Asn Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys

145 150 155 160145 150 155 160

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg MetGlu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met

165 170 175 165 170 175

Gly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala AspGly Ser Met Met Ser Asn Val Ile Ala Val Cys Asp Arg Glu Ala Asp

180 185 190 180 185 190

Ile His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe ValIle His Ala Tyr Leu Gln Asp Lys Leu Ala His Asn Glu Arg Phe Val

195 200 205 195 200 205

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr LeuVal Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu

210 215 220 210 215 220

Tyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile SerTyr Asp His Leu Lys Asn Gln Pro Glu Leu Gly Gly Tyr Gln Ile Ser

225 230 235 240225 230 235 240

Ile Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn ArgIle Pro Gln Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg

245 250 255 245 250 255

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr LeuPro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg Ile Thr Leu

260 265 270 260 265 270

Lys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile AsnLys Gln Gly Asn Ile Thr Leu Asn Ala Val Leu Ala Glu Glu Ile Asn

275 280 285 275 280 285

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser GluPro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Leu Thr Ser Glu

290 295 300 290 295 300

Pro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr ThrPro Val Glu Ser Leu Ala Gln Ala Leu Arg Val Ile Asp Ile Tyr Thr

305 310 315 320305 310 315 320

His Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly AlaHis Arg Trp Arg Ile Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala

325 330 335 325 330 335

Gly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg MetGly Ala Glu Arg Gln Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met

340 345 350 340 345 350

Val Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg GluVal Ser Ile Leu Ser Phe Val Ala Val Arg Leu Leu Gln Leu Arg Glu

355 360 365 355 360 365

Ser Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys GluSer Phe Thr Leu Pro Gln Ala Leu Arg Ala Gln Gly Leu Leu Lys Glu

370 375 380 370 375 380

Ala Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro AspAla Glu His Val Glu Ser Gln Ser Ala Glu Thr Val Leu Thr Pro Asp

385 390 395 400385 390 395 400

Glu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg LysGlu Cys Gln Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys

405 410 415 405 410 415

Glu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg LeuGlu Lys Ala Gly Ser Leu Gln Trp Ala Tyr Met Ala Ile Ala Arg Leu

420 425 430 420 425 430

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly AlaGly Gly Phe Met Asp Ser Lys Arg Thr Gly Ile Ala Ser Trp Gly Ala

435 440 445 435 440 445

Leu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe LeuLeu Trp Glu Gly Trp Glu Ala Leu Gln Ser Lys Leu Asp Gly Phe Leu

450 455 460 450 455 460

Ala Ala Lys Asp Leu Met Ala Gln Gly Ile Lys IleAla Ala Lys Asp Leu Met Ala Gln Gly Ile Lys Ile

465 470 475465 470 475

<210> 14<210> 14

<211> 1428<211> 1428

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 14<400> 14

atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60atgatcacct ctgctctgca ccgtgctgct gactgggcta aatctgtttt ctcttctgct 60

gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120gctctgggtg acccgcgtcg taccgctcgt ctggttaacg ttgctgctca gctggctaaa 120

tactctggta aatctatcac catctcttct gaaggttctg aagctatgca ggaaggtgct 180tactctggta aatctatcac catctcttct gaaggttctg aagctatgca ggaaggtgct 180

taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240taccgtttca tccgtaaccc gaacgtttct gctgaagcta tccgtaaagc tggtgctatg 240

cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300cagaccgtta aactggctca ggaatttccg gaactgctgg ctatcgaaga caccacctct 300

ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggaccag 360ctgtcttacc gtcaccaggt tgctgaagaa ctgggtaaac tgggttctat ccaggaccag 360

tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420tctcgtggtt ggtgggttca ctctgttctg ctgctggaag ctaccacctt ccgtaccgtt 420

ggtctgctgc accagaactg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480ggtctgctgc accagaactg gtggatgcgt ccggacgacc cggctgacgc tgacgaaaaa 480

gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540gaatctggta aatggctggc tgctgctgct acctctcgtc tgcgtatggg ttctatgatg 540

tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600tctaacgtta tcgctgtttg cgaccgtgaa gctgacatcc acgcttacct gcaggacaaa 600

ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660ctggctcaca acgaacgttt cgttgttcgt tctaaacacc cgcgtaaaga cgttgaatct 660

ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720ggtctgtacc tgtacgacca cctgaaaaac cagccggaac tgggtggtta ccagatctct 720

atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780atcccgcaga aaggtgttgt tgacaaacgt ggtaaacgta aaaaccgtcc ggctcgtaaa 780

gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840gcttctctgt ctctgcgttc tggtcgtatc accctgaaac agggtaacat caccctgaac 840

gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900gctgttctgg ctgaagaaat caacccgccg aaaggtgaaa ccccgctgaa atggctgctg 900

ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960ctgacctctg aaccggttga atctctggct caggctctgc gtgttatcga catctacacc 960

caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020caccgttggc gtatcgaaga atttcacaaa gcttggaaaa ccggtgctgg tgctgaacgt 1020

cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080cagcgtatgg aagaaccgga caacctggaa cgtatggttt ctatcctgtc tttcgttgct 1080

gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140gttcgtctgc tgcagctgcg tgaatctttc accctgccgc aggctctgcg tgctcagggt 1140

ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200ctgctgaaag aagctgaaca cgttgaatct cagtctgctg aaaccgttct gaccccggac 1200

gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260gaatgccagc tgctgggtta cctggacaaa ggtaaacgta aacgtaaaga aaaagctggt 1260

tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320tctctgcagt gggcttacat ggctatcgct cgtctgggtg gtttcatgga ctctaaacgt 1320

accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380accggtatcg cttcttgggg tgctctgtgg gaaggttggg aagctctgca gtcgaagctg 1380

gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428gacggcttcc tggctgctaa ggacctgatg gctcagggta tcaaaatc 1428

Claims (21)

1.一种转座酶突变体,其特征在于,所述转座酶突变体相较于野生型转座酶具有R27T突变;所述野生型转座酶的氨基酸序列如SEQ ID No.1所示。1. a transposase mutant, it is characterized in that, described transposase mutant has R27T mutation compared with wild-type transposase; The aminoacid sequence of described wild-type transposase is as shown in SEQ ID No.1 Show. 2.根据权利要求1所述的转座酶突变体,其特征在于,所述转座酶突变体相较于所述野生型转座酶具有如下(1)所示的突变组合;2. The transposase mutant according to claim 1, wherein the transposase mutant has a mutation combination shown in (1) below compared to the wild-type transposase; (1)R27T和M56G。(1) R27T and M56G. 3.一种融合蛋白,其特征在于,其包括权利要求1或2所述的转座酶突变体;3. a fusion protein is characterized in that, it comprises the described transposase mutant of claim 1 or 2; 所述融合蛋白还包括用于结合免疫球蛋白Fc段的结合蛋白,所述结合蛋白与所述转座酶突变体相连。The fusion protein also includes a binding protein for binding to the Fc fragment of an immunoglobulin, the binding protein being linked to the transposase mutant. 4.根据权利要求3所述的融合蛋白,其特征在于,所述结合蛋白包括金黄色葡萄球菌A蛋白、链球菌G蛋白和链球菌L蛋白中的至少一种。4. The fusion protein according to claim 3, wherein the binding protein comprises at least one of Staphylococcus aureus protein A, Streptococcus protein G and Streptococcus L protein. 5.根据权利要求3所述的融合蛋白,其特征在于,所述结合蛋白的氨基酸序列如SEQ IDNo.2或3所示。5 . The fusion protein according to claim 3 , wherein the amino acid sequence of the binding protein is shown in SEQ ID No. 2 or 3. 6 . 6.根据权利要求3所述的融合蛋白,其特征在于,所述结合蛋白位于所述融合蛋白的氨基酸序列的N端,所述转座酶突变体位于所述融合蛋白的氨基酸序列的C端。6. The fusion protein according to claim 3, wherein the binding protein is located at the N-terminus of the amino acid sequence of the fusion protein, and the transposase mutant is located at the C-terminus of the amino acid sequence of the fusion protein . 7.根据权利要求3所述的融合蛋白,其特征在于,所述结合蛋白与所述转座酶突变体通过连接肽连接。7. The fusion protein of claim 3, wherein the binding protein is linked to the transposase mutant through a linker peptide. 8.根据权利要求7所述的融合蛋白,其特征在于,所述连接肽为刚性连接肽或柔性连接肽。8. The fusion protein according to claim 7, wherein the connecting peptide is a rigid connecting peptide or a flexible connecting peptide. 9.根据权利要求7所述的融合蛋白,其特征在于,所述连接肽为柔性连接肽。9. The fusion protein of claim 7, wherein the connecting peptide is a flexible connecting peptide. 10.根据权利要求7所述的融合蛋白,其特征在于,所述连接肽的长度为6~59个氨基酸。10 . The fusion protein according to claim 7 , wherein the length of the connecting peptide is 6-59 amino acids. 11 . 11.根据权利要求7所述的融合蛋白,其特征在于,所述连接肽的长度为20~35个氨基酸。11. The fusion protein according to claim 7, wherein the length of the connecting peptide is 20-35 amino acids. 12.根据权利要求7所述的融合蛋白,其特征在于,所述连接肽的序列如SEQ ID No.4所示。12 . The fusion protein according to claim 7 , wherein the sequence of the connecting peptide is shown in SEQ ID No. 4. 13 . 13.一种分离的核酸,其特征在于,其编码如权利要求1或2所述的转座酶突变体或如权利要求3~12任一项所述的融合蛋白。13. An isolated nucleic acid, characterized in that it encodes the transposase mutant according to claim 1 or 2 or the fusion protein according to any one of claims 3-12. 14.一种载体,其特征在于,其含有如权利要求13所述的分离的核酸。14. A vector comprising the isolated nucleic acid of claim 13. 15.一种宿主细胞,其特征在于,其含有权利要求14所述的载体。15. A host cell comprising the vector of claim 14. 16.制备如权利要求1或2所述的转座酶突变体或如权利要求3~12任一项所述的融合蛋白的方法,其特征在于,其包括培养如权利要求15所述的宿主细胞。16. The method for preparing the transposase mutant according to claim 1 or 2 or the fusion protein according to any one of claims 3-12, characterized in that it comprises culturing the host according to claim 15 cell. 17.一种用于构建测序文库的试剂盒,其特征在于,其包括如权利要求1或2所述的转座酶突变体或如权利要求3~12任一项所述的融合蛋白。17. A kit for constructing a sequencing library, characterized in that it comprises the transposase mutant according to claim 1 or 2 or the fusion protein according to any one of claims 3-12. 18.如权利要求1或2所述的转座酶突变体或如权利要求3~12任一项所述的融合蛋白在DNA建库中的应用。18. Use of the transposase mutant according to claim 1 or 2 or the fusion protein according to any one of claims 3 to 12 in DNA library construction. 19.如权利要求1或2所述的转座酶突变体或如权利要求3~12任一项所述的融合蛋白在分析蛋白与染色质的相互作用中的应用。19. Use of the transposase mutant according to claim 1 or 2 or the fusion protein according to any one of claims 3 to 12 in analyzing the interaction of proteins with chromatin. 20.根据权利要求19所述的应用,其特征在于,所述应用包括孵育含有待分析的目标蛋白、目标染色质、以及所述转座酶突变体的混合物体系。20. The application of claim 19, wherein the application comprises incubating a mixture system containing the target protein to be analyzed, the target chromatin, and the transposase mutant. 21.根据权利要求20所述的应用,其特征在于,所述混合物体系还包括目标蛋白的抗体。21. The application according to claim 20, wherein the mixture system further comprises an antibody to a target protein.
CN201911295765.1A 2019-12-16 2019-12-16 A kind of transposase mutant, fusion protein, its preparation method and application Active CN110938610B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911295765.1A CN110938610B (en) 2019-12-16 2019-12-16 A kind of transposase mutant, fusion protein, its preparation method and application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911295765.1A CN110938610B (en) 2019-12-16 2019-12-16 A kind of transposase mutant, fusion protein, its preparation method and application

Publications (2)

Publication Number Publication Date
CN110938610A CN110938610A (en) 2020-03-31
CN110938610B true CN110938610B (en) 2020-12-01

Family

ID=69911526

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911295765.1A Active CN110938610B (en) 2019-12-16 2019-12-16 A kind of transposase mutant, fusion protein, its preparation method and application

Country Status (1)

Country Link
CN (1) CN110938610B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116103368B (en) * 2022-07-29 2025-07-15 南方科技大学 A method for constructing a 3'mRNA sequencing library

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102796728A (en) * 2011-05-23 2012-11-28 安捷伦科技有限公司 Methods and compositions for DNA fragmentation and tagging by transposases
CN108219006A (en) * 2018-03-27 2018-06-29 上海欣百诺生物科技有限公司 A kind of fusion protein of transposase-antibody binding proteins and its preparation and application

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2015247779B2 (en) * 2014-04-15 2021-06-24 Illumina, Inc. Modified transposases for improved insertion sequence bias and increased DNA input tolerance

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102796728A (en) * 2011-05-23 2012-11-28 安捷伦科技有限公司 Methods and compositions for DNA fragmentation and tagging by transposases
CN108219006A (en) * 2018-03-27 2018-06-29 上海欣百诺生物科技有限公司 A kind of fusion protein of transposase-antibody binding proteins and its preparation and application

Also Published As

Publication number Publication date
CN110938610A (en) 2020-03-31

Similar Documents

Publication Publication Date Title
CN110372799A (en) A kind of fusion protein and its application for the preparation of the unicellular library ChIP-seq
US20230134868A1 (en) Biomagnetic microsphere and preparation method therefor and use thereof
CN109929839A (en) Detatching single base gene editing system and its application
JP2017000154A (en) Peptide library
CN114057861B (en) A bio-PROTAC artificial protein targeting UBE2C
CN110938610B (en) A kind of transposase mutant, fusion protein, its preparation method and application
CN104844691B (en) Sox2 protein peptides aptamer and its identification
CN114957415A (en) Streptavidin mutant and application thereof, product, gene, recombinant plasmid and genetically engineered bacterium
EP2190989B1 (en) Method for manufacturing a modified peptide
CN117304326B (en) A monoclonal antibody against MEIOB and its application
CN109957003B (en) Stable SAA mutant and application thereof in disease detection
CN109880840B (en) In vivo biotinylation labeling system for recombinant protein escherichia coli
EP2516702B1 (en) Protein display
EP1981978B1 (en) Affinity polypeptide for purification of recombinant proteins
CN103388001B (en) A kind of method containing thrombin cleavage site GST membranous type expression vector and transfection positive cell
CN109096394B (en) Nano antibody of B subunit of anti-staphylococcal protein A, nucleic acid molecule and application
US20200363410A1 (en) A process for immobilizing polypeptides
CN117003850A (en) Methylation enriched protein and encoding gene, preparation method and application thereof
CN110845596A (en) Mutant xSUMO and its related products
KR102826762B1 (en) Biomagnetic microspheres and their manufacturing method and use
US20240026343A1 (en) Method for screening dimerized cyclic peptide
CN120210151A (en) Mutant MuA transposase and application thereof
Wonga et al. A cell-free workflow for detecting and characterizing RiPP recognition element-precursor peptide interactions.
WO2025129707A1 (en) Dna polymerase mutant and use thereof
CN120099096A (en) A Cre reporter system based on single domain antibody labeling and its application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: A transposase mutant, fusion protein, preparation method and application

Granted publication date: 20201201

Pledgee: China Co. truction Bank Corp Dongguan branch

Pledgor: GUANGDONG FAPON BIOTECH Co.,Ltd.

Registration number: Y2025980000271

PE01 Entry into force of the registration of the contract for pledge of patent right