CN103451181B - A resistance expression cassette for efficiently constructing marker-free recombinant mycobacteria - Google Patents

Info

Publication number
CN103451181B
Authority
CN
China
Prior art keywords
resistance
gene
plasmid
seq
hyg
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310386264.0A
Other languages
Chinese (zh)
Other versions
CN103451181A (en
Inventor
张天宇
杨峰
邹文英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Institute of Biomedicine and Health of CAS
Original Assignee
Guangzhou Institute of Biomedicine and Health of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Institute of Biomedicine and Health of CAS
Priority to CN201310386264.0A
Publication of CN103451181A
Application granted
Publication of CN103451181B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The invention discloses a resistance expression cassette for efficiently constructing marker-free recombinant mycobacteria, that is, recombinant mycobacteria that carry no antibiotic resistance marker. The cassette contains a streamlined hygromycin resistance gene (SEQ ID NO: 1) flanked by dif1 and dif2 sequences. The streamlined Hyg gene is shortened from the original 1.7 kb to about 1 kb. Complex structures such as the transcription terminator and the long native promoter have been removed, which facilitates PCR amplification, and there are fewer internal restriction sites (common restriction sites have been removed by site-directed mutagenesis). Hyg also carries its own short artificial promoter, is expressed in both Escherichia coli and mycobacteria, and does not affect the transcription or translation of downstream genes when inserted in a defined orientation. Because dif sequences flank the streamlined Hyg gene, the cassette is excised automatically in mycobacteria. After the Hyg gene is lost, the disrupted gene can still express a shortened small peptide without affecting translation of downstream genes, which effectively avoids polar effects.

Description

A resistance expression cassette for efficiently constructing marker-free recombinant mycobacteria

Technical field

The invention belongs to the field of genetic engineering and specifically relates to a resistance expression cassette for efficiently constructing marker-free recombinant mycobacteria, and to its applications.

Background

Mycobacterium tuberculosis (Mtb) is the pathogen that causes tuberculosis. It can invade every organ of the body, although pulmonary tuberculosis is the most common form. Tuberculosis remains an important infectious disease, and research on its diagnosis, prevention, and treatment has progressed slowly. All three areas depend on studying the function of Mtb genes, so genetic modification of Mtb plays a pivotal role in Mtb research. Mtb grows slowly and is difficult to manipulate genetically. It requires an expensive negative-pressure biosafety level 3 laboratory, and long culture times take up space and increase the risk of contamination. Together these factors make Mtb research costly in time, money, and labor. Anti-Mtb drug development is one example: it is expensive and slow. In nearly half a century, only one drug with a new mechanism of action has reached the market, at the end of 2012, and it is potentially toxic. One of the core problems in current anti-Mtb drug and vaccine research is therefore to establish genetically engineered strains effectively, quickly, and cheaply.

At present, only the kanamycin resistance gene and the hygromycin resistance gene are reasonably effective for genetic manipulation of mycobacteria. If a mycobacterial mutant strain already carries a resistance gene, later genetic manipulation becomes inconvenient. For example, gene knockout followed by genetic complementation requires at least two resistance genes, and so does recombineering. In addition, mycobacteria sometimes need to express several antigen proteins, which takes multiple rounds of genetic manipulation. Generating recombinant strains free of resistance markers is therefore particularly important for subsequent work.

Gene knockout can replace a target gene with a resistance gene, but knockout often also relies on phage packaging. This technique packages the upstream sequence of the target gene, the resistance gene, and the downstream sequence of the target gene (the "three fragments") together into a phage to form a phagemid. The total length of the three fragments is limited: if it is too long, the construct cannot be packaged and no phagemid for gene knockout is obtained. Because packaging capacity is limited, a short resistance gene leaves room for longer upstream and downstream sequences, or even for additional elements, and so increases the length of useful DNA that can be packaged. After knockout, one of the most effective ways to verify the result is PCR. However, the transcription terminator of the Hyg gene used previously has a complex secondary structure that makes PCR difficult.

Some inventions add sequences such as Res to both sides of Hyg, but such sequences require a heterologous protein to be expressed artificially to recognize and act on them before Hyg can be excised. In practice, a plasmid carrying Res-Hyg-Res is first introduced into mycobacteria. After mycobacteria with Res-Hyg-Res inserted into the genome have been selected, a second plasmid that expresses the resolving protein is introduced, and a further round of screening yields mycobacteria that have lost Hyg. Many experiments then require yet another round of screening for strains that have lost the plasmid expressing the resolving protein. The procedure is therefore cumbersome, time-consuming, and laborious, and in some similar systems the efficiency of resistance gene loss in mycobacteria is extremely low.

Flanking Hyg with dif sequences to facilitate excision of the Hyg gene has been reported only in recent years. Its advantage is that no heterologous protein needs to be expressed to excise the resistance gene, because mycobacteria themselves encode the required proteins, which avoids much unnecessary work. The system is also highly efficient and broadly compatible (for example, the dif sequence of slow-growing Mycobacterium tuberculosis also works in fast-growing Mycobacterium smegmatis). However, in studies that use the dif system, the Hyg resistance gene is long (1.7 kb), has a complex secondary structure at its 3' end, contains common restriction sites internally, and has few usable restriction sites at its ends, and the resulting resistance expression cassette is not "universal".

Therefore, a shorter universal resistance expression cassette that excises itself automatically in mycobacteria, is easy to manipulate genetically, carries multiple cloning sites, and produces no polar effect would greatly simplify genetic manipulation of many mycobacteria. Because the resistance gene introduced by such a cassette is excised automatically, subsequent manipulations are not limited by the choice of resistance gene. Mutant strains without resistance genes also make more kinds of genetic study possible, so the cassette would be a powerful tool for genetic engineering of mycobacteria.

Summary of the invention

One object of the invention is to provide a streamlined hygromycin resistance gene, Hyg, that carries its own artificial promoter.

Another object of the invention is to provide a resistance expression cassette for efficiently constructing marker-free recombinant mycobacteria.

Another object of the invention is to provide a recombinant plasmid for efficiently constructing marker-free recombinant mycobacteria.

The technical solution adopted by the invention is as follows:

A streamlined, modified hygromycin resistance gene whose nucleotide sequence is shown in SEQ ID NO: 1.

A resistance expression cassette for efficiently constructing marker-free recombinant mycobacteria, containing the streamlined, modified hygromycin resistance gene and the dif1 and dif2 sequences located at its two ends, characterized in that the nucleotide sequence of the streamlined, modified hygromycin resistance gene is shown in SEQ ID NO: 1, and the dif1 and dif2 sequences are shown in SEQ ID NO: 7 and SEQ ID NO: 8, respectively. This cassette is named the "dif-ΩHYG-dif resistance expression cassette".

Preferably, multiple restriction sites are also added at both ends of the resistance expression cassette.

The nucleotide sequence of the resistance expression cassette is shown in SEQ ID NO: 5 or SEQ ID NO: 6.

A recombinant plasmid containing the dif-ΩHYG-dif resistance expression cassette described above. Preferably, the nucleotide sequence of the recombinant plasmid is shown in SEQ ID NO: 2.

A recombinant plasmid for efficiently constructing marker-free recombinant mycobacteria, containing: a promoter, a phage integration site, an integrase gene, the dif-ΩHYG-dif resistance expression cassette, and an origin of replication.

In the clockwise direction, the phage integration site, the integrase gene, and the dif-ΩHYG-dif resistance expression cassette are linked in that order.

The promoter is the LacZ promoter or the BLA promoter, and the origin of replication is an Escherichia coli origin of replication.

Preferably, the nucleotide sequence of this recombinant plasmid is shown in SEQ ID NO: 4.

The beneficial effects of the invention are:

(1) The streamlined Hyg gene of the invention is shortened from the original 1.7 kb to about 1 kb. Complex structures such as the transcription terminator and the long native promoter have been removed, which facilitates PCR amplification, and there are fewer internal restriction sites (common restriction sites have been removed by site-directed mutagenesis). Hyg also carries its own short artificial promoter, is expressed in both Escherichia coli and mycobacteria, and does not affect the transcription or translation of downstream genes when inserted in a defined orientation.

(2) The resistance expression cassette of the invention has dif sequences at both ends of the streamlined Hyg gene and is excised automatically in mycobacteria. After the Hyg gene is lost, the disrupted gene can still express a shortened small peptide without affecting translation of downstream genes, which effectively avoids polar effects.

(3) Multiple restriction sites are also added at both ends of the resistance expression cassette, so the cassette can be inserted into a specific sequence and other sequences can be added on either side in a defined orientation.

Brief description of the drawings

Figure 1 is a schematic diagram of the structure of plasmid pUCDHmKE;

Figure 2 is a restriction site map of plasmid pUCDHmKE;

Figure 3 is a flowchart of the construction of plasmid pUCDHmKE;

Figure 4 is a schematic diagram of the structure of the integrating plasmid pMH94DHmKE;

Figure 5 is a flowchart of the construction of the integrating plasmid pMH94DHmKE;

Figure 6 is a schematic diagram of the structure of the integrating plasmid pblDHCiGn;

Figure 7 is a flowchart of the construction of plasmid pblDHCiGn;

Figure 8 is a schematic comparison of the dif-ΩHYG-dif expression cassette with the fragments containing the hygromycin (HYG) resistance gene (Hyg) commonly used in earlier mycobacterial genetic manipulation (P1, P2, P3: putative Hyg promoters; Pa: putative artificial Hyg promoter, which covers dif1 in the dif-ΩHYG-dif expression cassette. Common restriction sites: E1, EcoRI (removed in the dif-ΩHYG-dif expression cassette); Nt, NotI; Sp, SpeI; B, BamHI; E5, EcoRV; Nd, NdeI; H, HindIII; Xo, XhoI; C, ClaI; Xb, XbaI; S, SmaI; Nc, NcoI; P, PstI; Nh, NheI; K, KpnI);

Figure 9 analyzes the sequences left in the genome after cassette excision and the polypeptides they would encode. Panels (A) and (B) show HindIII-XhoI-dif-XhoI-HindIII from plasmid pUCDHmKE on the dif plus strand (A) and minus strand (B); this is the sequence left in the genome after excision of dif-ΩHYG-dif expression cassette V1. Panel (C) shows SpeI-BamHI-EcoRV-EcoRI-NdeI-HindIII-XhoI-dif-XhoI-HindIII-NcoI-PstI-NheI (present in pblDHC1n) on the dif minus strand; this is the sequence left in the genome after excision of dif-ΩHYG-dif expression cassette V2. Amino acids encoded in the three reading frames are shown as blue capital letters, underlining marks the amino acids encoded by the dif sequence in reading frames that read through, and an asterisk (*) marks a stop codon.
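The read-through analysis described for Figure 9 amounts to translating the residual sequence in three reading frames and checking each frame for stop codons. The following is a minimal sketch of that idea, not the authors' analysis. The dif sequences (SEQ ID NO: 7 and SEQ ID NO: 8) are not reproduced in this text, so the input here is only the HindIII and XhoI recognition sequences (AAGCTT, CTCGAG) written back to back, which is an assumption about how they abut. The function names `translate` and `three_frames` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, frame=0):
    """Translate seq starting at offset frame (0, 1, or 2); '*' marks a stop codon."""
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


def three_frames(seq):
    """Return the translations of seq in forward frames 0, 1, and 2."""
    return [translate(seq, f) for f in range(3)]


# Assumed stand-in for part of the residual sequence: HindIII + XhoI sites.
linker = "AAGCTT" + "CTCGAG"
for frame, peptide in enumerate(three_frames(linker)):
    status = "stop codon present" if "*" in peptide else "reads through"
    print(frame, peptide, status)
```

For this 12-base stand-in, none of the three forward frames contains a stop codon. Running the same functions on the full residual sequence, including dif, would be needed to reproduce the analysis in the figure.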

Detailed description

The invention is further described below with reference to examples, but is not limited to them.

The molecular biology techniques used in the following examples include PCR amplification, plasmid extraction, plasmid transformation, DNA fragment ligation, restriction digestion, and gel electrophoresis. Unless otherwise stated, they were carried out by conventional methods, as described in Molecular Cloning: A Laboratory Manual, 3rd edition (Sambrook J, Russell DW, Janssen K, Argentine J; translated by Huang Peitang et al., 2002, Beijing: Science Press), or under the conditions recommended by the manufacturer.

The Pfu polymerase, dNTPs, and related reagents used for PCR in the following examples were purchased from Shanghai Sangon Biotechnology Co., Ltd. The antibiotic hygromycin was purchased from Roche. Competent E. coli DH5α cells were purchased from Guangzhou Dongsheng Biotechnology Co., Ltd. (catalog no. C1042). All DNA ligations used the T4 DNA ligation kit from Takara Bio (model D6020A). The plasmid DNA extraction kit (BSC01M1), DNA recovery kit (BSC02M1), and PCR purification kit (BSC03M1) were purchased from Bioer (博日生物公司).

The PCR primers and DNA fragments used in the following examples were synthesized by Shanghai Jierui Bioengineering Co., Ltd.

Example 1 Construction of plasmid pUCDHmKE

A schematic diagram of plasmid pUCDHmKE is shown in Figure 1 and its restriction site map in Figure 2. The plasmid contains the dif-ΩHYG-dif resistance expression cassette described above. As Figure 1 shows, pUCDHmKE contains, in clockwise order: the E. coli origin of replication (pMB1 ori); the dif-ΩHYG-dif resistance expression cassette; and the promoter P(BLA), which drives the ampicillin resistance gene Amp (that is, bla). The Amp gene can be used for plasmid selection.

The dif-ΩHYG-dif resistance expression cassette contains the streamlined, modified hygromycin resistance gene (Hyg) shown in SEQ ID NO: 1. Base pairs 24 to 73 of this gene form the promoter sequence (Pr), which drives expression of the Hyg gene. The dif sequences joined to the two ends of the Hyg gene sequence are shown in SEQ ID NO: 7 and SEQ ID NO: 8.

In addition, a HindIII site, an XhoI site, and an XbaI site are attached in that order at each end of the dif-ΩHYG-dif resistance expression cassette. These three restriction sites can be used to cut the cassette out of the plasmid.

The construction workflow for plasmid pUCDHmKE is shown in Figure 3. It uses plasmid pNBV1 (a gift from the William Bishai laboratory, Johns Hopkins University, USA; the plasmid map is shown in Figure 3 and the construction method is described in Reference 1). Plasmid pNBV1 contains the original hygromycin resistance gene (Hyg*), which is about 1.7 kb long.

The specific method is as follows:

1. Construction of plasmid pUCDis

Plasmid pUC19 (a commercial plasmid) was digested with restriction endonucleases KpnI and HindIII, and the 2.6 kb fragment was recovered with a purification and recovery kit.

DNA fragments dif1 and dif2 (SEQ ID NO: 7 and SEQ ID NO: 8) were synthesized by Shanghai Jierui Bioengineering Co., Ltd. The synthesized dif1 carries HindIII and XbaI sites at its ends, and dif2 carries XbaI and KpnI sites. dif1, dif2, and the purified pUC19 fragment (KpnI/HindIII double digest) were therefore joined in a three-fragment ligation and transformed into competent E. coli DH5α. Positive clones were selected on LB agar plates containing ampicillin. Single colonies were picked and grown in LB liquid medium, plasmid was extracted, and the correct clone, identified by restriction digestion, was designated plasmid pUCDis.

2. Construction of plasmid pUCDHmKE

Amplification of the Hyg fragment: plasmid pNBV1 was used as the template with primers Hygf2 (5'-TGTCTAGACCGGCCGTGCGGAATTAA-3') (SEQ ID NO: 9) and Hygr727 (5'-ATTCTAGATCAGGCGCCGGGGGCGGTGTC-3') (SEQ ID NO: 10). This primer pair amplifies a Hyg fragment of about 1.1 kb from pNBV1, markedly shorter than the original 1.7 kb. The streamlined Hyg gene is still expressed and still functions as a resistance gene, conferring resistance with an efficiency comparable to that of the Hyg* gene in the original plasmid.
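Both primers carry an XbaI recognition sequence (TCTAGA) near the 5' end, which is what allows the amplified fragment to be cut with XbaI in the next step. The sketch below is an illustrative check, not part of the patent's method: it splits each primer at the first TCTAGA into the bases before the site, the site itself, and the remaining 3' region. The helper name `split_primer` is hypothetical.

```python
XBAI_SITE = "TCTAGA"  # XbaI recognition sequence

# Primer sequences as given in the text (SEQ ID NO: 9 and SEQ ID NO: 10).
PRIMERS = {
    "Hygf2": "TGTCTAGACCGGCCGTGCGGAATTAA",
    "Hygr727": "ATTCTAGATCAGGCGCCGGGGGCGGTGTC",
}


def split_primer(primer, site=XBAI_SITE):
    """Split a primer into (5' leading bases, site, 3' remainder) at the first site."""
    primer = primer.upper()
    pos = primer.find(site)
    if pos < 0:
        raise ValueError("site not found in primer")
    return primer[:pos], site, primer[pos + len(site):]


for name, seq in PRIMERS.items():
    lead, site, rest = split_primer(seq)
    print(f"{name}: lead={lead} site={site} remainder={rest}")
```

In both primers the site is preceded by two extra bases, and the remainder is the part expected to anneal to the pNBV1 template.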

Both ends of the amplified fragment carry XbaI sites. The amplified fragment was digested with XbaI to give the Hyg fragment (XbaI-cut), about 1.1 kb in size.

Plasmid pUCDis from the previous step was digested with XbaI, and the digestion product was purified to recover the 4.0 kb fragment, designated the pUCDI fragment (XbaI-cut).

The pUCDI fragment (XbaI-cut) and the Hyg fragment (XbaI-cut) were ligated and transformed into competent E. coli DH5α. Positive clones were selected on LB agar plates containing hygromycin. Single colonies were picked and grown in LB liquid medium, plasmid was extracted, and the correct clone, identified by restriction digestion, was designated plasmid pUCDH. The KpnI and EcoRI sites in this plasmid were then removed by mutagenesis to give plasmid pUCDHmKE, whose sequence is shown in SEQ ID NO: 2. Base pairs 426 to 1553 form the 1128 bp dif-ΩHYG-dif resistance expression cassette, which is flanked at both ends by a HindIII site (aagctt) and an XhoI site (ctcgag). The Hyg fragment within the dif-ΩHYG-dif resistance expression cassette is shown in SEQ ID NO: 1. This Hyg fragment lacks complex structures such as the transcription terminator and the long native promoter, which facilitates PCR amplification, and it has fewer internal restriction sites (common restriction sites have been removed by site-directed mutagenesis). Hyg also carries its own short artificial promoter, is expressed in both Escherichia coli and mycobacteria, and does not affect the transcription or translation of downstream genes when inserted in a defined orientation.
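The cassette coordinates quoted above (positions 426 to 1553, length 1128 bp) are consistent with 1-based, inclusive numbering. The short sketch below shows that arithmetic and the matching Python slice. It assumes inclusive 1-based coordinates, which the text does not state explicitly, and the function names are hypothetical.

```python
def span_length(start, end):
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def to_slice(start, end):
    """Convert 1-based inclusive coordinates to a 0-based, end-exclusive Python slice."""
    return slice(start - 1, end)


cassette_start, cassette_end = 426, 1553  # positions quoted for SEQ ID NO: 2
print(span_length(cassette_start, cassette_end))
```

With the plasmid sequence held in a string, indexing it with `to_slice(426, 1553)` would extract the cassette under the same assumption.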

Example 2 Construction and application of the integrating plasmid pMH94DHmKE

A schematic diagram of the integrating plasmid pMH94DHmKE is shown in Figure 4. In clockwise order, pMH94DHmKE contains: the LacZ promoter (usable for blue-white screening and for driving expression of the plasmid backbone that follows); the phage integration site attP; the integrase gene Int (which expresses the integrase that integrates the plasmid into the mycobacterial genome through the attP site); the dif-ΩHYG-dif resistance expression cassette; the E. coli origin of replication (oriE); and the ampicillin resistance gene Amp (for plasmid selection). A HindIII site and an XhoI site are attached in that order at each end of the dif-ΩHYG-dif resistance expression cassette and can be used to cut the cassette out of the plasmid.

The construction workflow for the integrating plasmid pMH94DHmKE is shown in Figure 5. Plasmid pMH94, used in this workflow, was a gift from the William Bishai laboratory, Johns Hopkins University, USA. Its plasmid map is shown in Figure 5 and its construction method is described in Reference 2.

The procedure is as follows:

1. Plasmid pMH94 and plasmid pUCDHmKE from Example 1 were each digested with HindIII. The fragment of about 6.0 kb from pMH94 and the fragment of about 1.1 kb from pUCDHmKE were recovered, ligated, and transformed into competent E. coli DH5α. Positive clones were selected on LB agar plates containing hygromycin. Single colonies were picked and grown in LB liquid medium, plasmid was extracted, and the correct clone, identified by restriction digestion, was designated plasmid pMH94DHmKE.

2. Plasmid pMH94DHmKE was introduced by electroporation into Mycobacterium tuberculosis (H37Rv, the standard tuberculosis strain, a gift from Guangzhou Chest Hospital) and into Mycobacterium smegmatis (ATCC 700044) to obtain the corresponding transformants.

The transformants were incubated at 37 °C for 12 hours to let the bacteria recover fully and express resistance. The culture was mixed by pipetting with a filter tip, dispensed into small tubes, and centrifuged at 10,000 rpm for 1 minute to pellet the transformed bacteria. The supernatant was discarded, the pellet was resuspended in 0.6 mL of 7H9 medium, and 500 μL per plate was spread on 7H11 plates containing the appropriate antibiotic (150 μg/mL HYG). A 100-fold dilution was plated in parallel at 500 μL per plate. Plates were incubated at 37 °C in the dark.

M. tuberculosis or M. smegmatis carrying plasmid pMH94DHmKE was obtained on plates containing hygromycin (HYG), which shows that the dif-ΩHYG-dif resistance expression cassette of the invention is expressed successfully in mycobacteria.

The M. tuberculosis or M. smegmatis transformants carrying pMH94DHmKE were subcultured in liquid medium. After 3 days of culture (M. smegmatis) or 14 days (M. tuberculosis), the culture was diluted and plated. About 80% of the colonies on the resulting plates had lost Hyg resistance, that is, they could not grow on plates containing hygromycin (HYG). This shows that the dif-ΩHYG-dif resistance expression cassette of this patent is excised automatically in mycobacteria, and with high efficiency.
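A loss frequency such as the roughly 80% reported above can be computed from colony counts. The sketch below assumes that individual colonies from the non-selective plate were each tested for growth on hygromycin; the text does not describe how colonies were scored, and the counts used here are hypothetical values chosen only to illustrate the calculation.

```python
def loss_frequency(colonies_tested, colonies_sensitive):
    """Fraction of tested colonies that no longer grow on hygromycin."""
    if colonies_tested <= 0:
        raise ValueError("colonies_tested must be positive")
    if not 0 <= colonies_sensitive <= colonies_tested:
        raise ValueError("colonies_sensitive must be between 0 and colonies_tested")
    return colonies_sensitive / colonies_tested


# Hypothetical counts, not data from the patent.
tested, sensitive = 50, 40
print(f"Hyg loss frequency: {loss_frequency(tested, sensitive):.0%}")
```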

实施例3 整合质粒pblDHCiGn的构建及其应用Example 3 Construction and application of integrated plasmid pblDHCiGn

整合质粒pblDHCiGn中,抗性表达盒以两侧带有双酶切位点的形式存在,此质粒的构建是为了证明在此种形式下,抗性表达盒依然有效。质粒结构见图6,按顺时针方向,其依次含有:启动子P(BLA)(用于启动后面Amp的表达)、氨苄青霉素抗性基因Amp(可用于质粒筛选)、大肠杆菌复制起点(OriE)、噬菌体整合位点(attP)、整合酶基因Int(可表达出整合酶,使质粒通过attP位点整合入分枝杆菌基因组)、dif-ΩHYG-dif 抗性表达盒、增强的绿色荧光蛋白基因eGFPdif-ΩHYG-dif 抗性表达盒两端连接不同的酶切位点。 In the integrated plasmid pblDHCiGn, the resistance expression cassette exists in the form of double restriction sites on both sides. The construction of this plasmid is to prove that the resistance expression cassette is still effective in this form. The plasmid structure is shown in Figure 6. In a clockwise direction, it contains: promoter P(BLA) (used to initiate the expression of the following Amp ), ampicillin resistance gene Amp (usable for plasmid screening), Escherichia coli origin of replication ( OriE ), phage integration site ( attP ), integrase gene Int (can express integrase, so that the plasmid can be integrated into the mycobacterium genome through the attP site), dif -ΩHYG -dif resistance expression cassette, enhanced green fluorescent protein Gene eGFP . The two ends of the dif- ΩHYG - dif resistance expression cassette were connected with different enzyme cutting sites.

The construction scheme of plasmid pblDHCiGn is shown in Figure 7. The plasmid pblueINT used in this scheme was a kind gift from Professor Eric Nuermberger of Johns Hopkins University, USA; its plasmid map is shown in Figure 7, its construction is described in Reference 3, and its sequence is shown in SEQ ID NO:3.

The specific construction steps are as follows:

1. Construction of plasmid pblDHC1n: plasmid pblueINT was digested with the restriction endonucleases KpnI and BamHI, and the 2.9 kb functional fragment A was recovered. Using plasmid pUCDHmKE as the template, primers Hygcf (5'-GCTGGTACCGCTAGCGCTGCAGCCATGGCAAGCTTCTCGAGTAAG-3') (SEQ ID NO:11) and Hygcr (5'-GCAGGATCCGATATCGAATTCCATATGCCCAAGCTTCTCGAGACT-3') (SEQ ID NO:12) were used to amplify the 1.2 kb dif-ΩHYG-dif fragment (carrying KpnI and BamHI restriction sites at its two ends, respectively). After digestion, this fragment was ligated with functional fragment A to give the 4.1 kb plasmid pblDHC1n.

2. Construction of plasmid pblDHCGn: plasmid pblDHC1n was digested with the restriction endonucleases NcoI and KpnI, and the 4.1 kb functional fragment B was recovered. Using plasmid pEGFP-N1 (a commercial plasmid) as the template, primers GYF-f (5'-GGTCCATGGTGAGCAAGGGCGAGG-3') (SEQ ID NO:13) and GYF-r (5'-GCGGGTACCTTACTTGTACAGCTCGTCCATGC-3') (SEQ ID NO:14) were used to amplify the 734 bp eGFP fragment. After digestion, this fragment was ligated with functional fragment B to give plasmid pblDHCGn.

3. Construction of plasmid pblDHCiGn: plasmid pblDHCGn was digested with the restriction endonucleases SpeI and EcoRI, and the 4.7 kb functional fragment C was recovered. Plasmid pblueINT was digested with the restriction endonucleases XbaI and EcoRI, and the 2.1 kb Int:attP fragment was recovered and ligated with functional fragment C to give plasmid pblDHCiGn, whose sequence is shown in SEQ ID NO:4.
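The restriction sites relied on in the three steps above can be checked directly against the primer sequences. The following is a minimal sketch under stated assumptions: the primer sequences are copied from the text, the recognition sequences are the standard ones for the named enzymes, and the helper names `find_sites` and `five_prime_overhang` are hypothetical. The overhang helper only applies to palindromic sites cut one base from the 5' end, which holds for SpeI and XbaI.

```python
# Standard recognition sequences of the enzymes named in steps 1 to 3.
SITES = {
    "KpnI": "GGTACC",
    "BamHI": "GGATCC",
    "NcoI": "CCATGG",
    "SpeI": "ACTAGT",
    "XbaI": "TCTAGA",
    "EcoRI": "GAATTC",
}

# Primer sequences as given in the text (SEQ ID NO:11 to 14).
PRIMERS = {
    "Hygcf": "GCTGGTACCGCTAGCGCTGCAGCCATGGCAAGCTTCTCGAGTAAG",
    "Hygcr": "GCAGGATCCGATATCGAATTCCATATGCCCAAGCTTCTCGAGACT",
    "GYF-f": "GGTCCATGGTGAGCAAGGGCGAGG",
    "GYF-r": "GCGGGTACCTTACTTGTACAGCTCGTCCATGC",
}


def find_sites(seq):
    """Map enzyme name to the 0-based positions of its site in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, site in SITES.items():
        positions = [
            i for i in range(len(seq) - len(site) + 1)
            if seq.startswith(site, i)
        ]
        if positions:
            hits[enzyme] = positions
    return hits


def five_prime_overhang(site, cut=1):
    """Overhang of a palindromic site cut `cut` bases from each 5' end."""
    return site[cut:len(site) - cut]


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
    # SpeI and XbaI ends can be ligated if their overhangs match.
    print(five_prime_overhang(SITES["SpeI"]) == five_prime_overhang(SITES["XbaI"]))
```

The overhang comparison illustrates why a SpeI-cut vector end can be joined to an XbaI-cut insert end in step 3, while the shared EcoRI site fixes the orientation of the insert.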

    The constructed plasmid pblDHCiGn was transformed into Mycobacterium tuberculosis or Mycobacterium smegmatis by electroporation to obtain the corresponding transformants, in which expression of eGFP could be observed. This shows that, after double restriction sites are added to both ends of the dif-ΩHYG-dif resistance expression cassette, the Hyg promoter drives expression not only of the Hyg resistance gene but also of the gene located downstream of the Hyg resistance gene, beyond the dif sequence and the restriction sites; here green fluorescent protein serves as the reporter.

    Next, the Mycobacterium tuberculosis or Mycobacterium smegmatis transformants carrying plasmid pblDHCiGn were subcultured in liquid medium. After 3 days of culture (for M. smegmatis) or 14 days (for M. tuberculosis), the cultures were diluted and plated. About 80% of the resulting colonies had lost HYG resistance, i.e. they could not grow on plates containing hygromycin (HYG). This again demonstrates that the dif-ΩHYG-dif resistance expression cassette of the present invention is excised automatically in mycobacteria, and with high efficiency. Moreover, green fluorescence could still be observed in the transformants that had lost HYG resistance. This shows that automatic excision of the resistance expression cassette does not affect expression of the gene downstream of its insertion site; that is, the downstream gene is expressed whether or not the resistance expression cassette designed in the present invention is present.
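    The loss frequency reported above is a simple ratio of colony counts. The following is a minimal sketch of that arithmetic, assuming each colony from the non-selective plate is also tested on a hygromycin plate; the function name `loss_frequency` and the colony counts are hypothetical and are not data from this patent.

```python
def loss_frequency(total_colonies, resistant_colonies):
    """Fraction of tested colonies that no longer grow on hygromycin."""
    if total_colonies <= 0:
        raise ValueError("total_colonies must be positive")
    if not 0 <= resistant_colonies <= total_colonies:
        raise ValueError("resistant_colonies must lie between 0 and total_colonies")
    return (total_colonies - resistant_colonies) / total_colonies


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    tested, still_resistant = 100, 20
    print(f"{loss_frequency(tested, still_resistant):.0%} of colonies lost resistance")
```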

References:

[1] Howard NS, Gomez JE, Ko C, Bishai WR. Color selection with a hygromycin-resistance-based Escherichia coli-mycobacterial shuttle vector. Gene 1995;166:181-182.

[2] Lee MH, Pascopella L, Jacobs WR Jr, Hatfull GF. Site-specific integration of mycobacteriophage L5: integration-proficient vectors for Mycobacterium smegmatis, Mycobacterium tuberculosis, and bacille Calmette-Guerin. Proc Natl Acad Sci U S A 1991;88:3111-3115.

[3] Zhang T, Li SY, Nuermberger EL. Autoluminescent Mycobacterium tuberculosis for rapid, real-time, non-invasive assessment of drug and vaccine efficacy. PLoS ONE 2012;7(1):e29774.

<110>  Guangzhou Institute of Biomedicine and Health, Chinese Academy of Sciences

<120>  A resistance expression cassette for efficient construction of recombinant mycobacteria free of resistance markers

<130>

<150>  2013102245916

<151>  2013-06-06

<160>  14

<170>  PatentIn version 3.5

<210>  1

<211>  1049

<212>  DNA

<213>  Artificial sequence

<400>  1

ccggccgtgc ggaattaagc cggcccgtac cctgtgaata gaggtccgct gtgacacaag       60

aatccctgtt acttctcgac cgtattgatt cggatgattc ctacgcgagc ctgcggaacg      120

accaggagtt ctgggagccg ctggcccgcc gagccctgga ggagctcggg ctgccggtgc      180

cgccggtgct gcgggtgccc ggcgagagca ccaaccccgt actggtcggc gagcccggcc      240

cggtgatcaa gctgttcggc gagcactggt gcggtccgga gagcctcgcg tcggagtcgg      300

aggcgtacgc ggtcctggcg gacgccccgg ttccggtgcc ccgcctcctc ggccgcggcg      360

agctgcggcc cggcaccgga gcctggccgt ggccctacct ggtgatgagc cggatgaccg      420

gcaccacctg gcggtccgcg atggacggca cgaccgaccg gaacgcgctg ctcgccctgg      480

cccgcgaact cggccgggtg ctcggacggc tgcacagggt gccgctgacc gggaacaccg      540

tgctcacccc ccattccgag gtcttcccgg aactgctgcg ggaacgccgc gcggcgaccg      600

tcgaggacca ccgcgggtgg ggctacctct cgccccggct gctggaccgc ctggaggact      660

ggctgccgga cgtggacacg ctgctggccg gccgcgaacc ccggttcgtc cacggcgacc      720

tgcacgggac caacatcttc gtggacctgg ccgcgaccga ggtcaccggg atcgtcgact      780

tcaccgacgt ctatgcggga gactcccgct acagcctggt gcaactgcat ctcaacgcct      840

tccggggcga ccgcgagatc ctggccgcgc tgctcgacgg ggcgcagtgg aagcggaccg      900

aggacttcgc ccgcgaactg ctcgccttca ccttcctgca cgacttcgag gtgttcgagg      960

agaccccgct ggatctctcc ggcttcaccg atccggagga actggcgcag ttcctctggg     1020

ggccgccgga caccgccccc ggcgcctga                                       1049
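Each sequence row in this listing ends with a running base count, so a copied or extracted listing can be checked for dropped or duplicated bases. The following is a minimal sketch of such a check, assuming rows consist of whitespace-separated base groups followed by the cumulative count; the function name `parse_listing` is hypothetical, and the two sample rows are the first two rows of SEQ ID NO:1.

```python
def parse_listing(rows):
    """Join sequence rows of the form '<base groups> <running count>'.

    Raises ValueError when a row's running count disagrees with the
    number of bases read so far.
    """
    pieces = []
    total = 0
    for row in rows:
        parts = row.split()
        if not parts:
            continue  # skip blank lines between rows
        *groups, count = parts
        bases = "".join(groups)
        total += len(bases)
        if total != int(count):
            raise ValueError(f"row ends at {count} but {total} bases were read")
        pieces.append(bases)
    return "".join(pieces)


if __name__ == "__main__":
    rows = [
        "ccggccgtgc ggaattaagc cggcccgtac cctgtgaata gaggtccgct gtgacacaag       60",
        "",
        "aatccctgtt acttctcgac cgtattgatt cggatgattc ctacgcgagc ctgcggaacg      120",
    ]
    print(len(parse_listing(rows)))
```

The length of the joined sequence can then be compared with the value given under <211> for the same entry.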

<210>  2

<211>  3799

<212>  DNA

<213>  Artificial sequence

<400>  2

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60 tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120 cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180 ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240 accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300 attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360 tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt accaagcttc      420 tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt accaagcttc 420

tcgagacttg acataatgtc gcttatcggc ttaatcgatc tagaccggcc gtgcggaatt      480 tcgagacttg acataatgtc gcttatcggc ttaatcgatc tagaccggcc gtgcggaatt 480

aagccggccc gtaccctgtg aatagaggtc cgctgtgaca caagaatccc tgttacttct      540 aagccggccc gtaccctgtg aatagaggtc cgctgtgaca caagaatccc tgttacttct 540

cgaccgtatt gattcggatg attcctacgc gagcctgcgg aacgaccagg agttctggga      600 cgaccgtatt gattcggatg attcctacgc gagcctgcgg aacgaccagg agttctggga 600

gccgctggcc cgccgagccc tggaggagct cgggctgccg gtgccgccgg tgctgcgggt      660 gccgctggcc cgccgagccc tggaggagct cgggctgccg gtgccgccgg tgctgcgggt 660

gcccggcgag agcaccaacc ccgtactggt cggcgagccc ggcccggtga tcaagctgtt      720 gcccggcgag agcaccaacc ccgtactggt cggcgagccc ggcccggtga tcaagctgtt 720

cggcgagcac tggtgcggtc cggagagcct cgcgtcggag tcggaggcgt acgcggtcct      780 cggcgagcac tggtgcggtc cggagagcct cgcgtcggag tcggaggcgt acgcggtcct 780

ggcggacgcc ccggttccgg tgccccgcct cctcggccgc ggcgagctgc ggcccggcac      840 ggcggacgcc ccggttccgg tgccccgcct cctcggccgc ggcgagctgc ggcccggcac 840

cggagcctgg ccgtggccct acctggtgat gagccggatg accggcacca cctggcggtc      900 cggagcctgg ccgtggccct acctggtgat gagccggatg accggcacca cctggcggtc 900

cgcgatggac ggcacgaccg accggaacgc gctgctcgcc ctggcccgcg aactcggccg      960 cgcgatggac ggcacgaccg accggaacgc gctgctcgcc ctggcccgcg aactcggccg 960

ggtgctcgga cggctgcaca gggtgccgct gaccgggaac accgtgctca ccccccattc     1020 ggtgctcgga cggctgcaca gggtgccgct gaccgggaac accgtgctca ccccccattc 1020

cgaggtcttc ccggaactgc tgcgggaacg ccgcgcggcg accgtcgagg accaccgcgg     1080 cgaggtcttc ccggaactgc tgcgggaacg ccgcgcggcg accgtcgagg accaccgcgg 1080

gtggggctac ctctcgcccc ggctgctgga ccgcctggag gactggctgc cggacgtgga     1140 gtggggctac ctctcgcccc ggctgctgga ccgcctggag gactggctgc cggacgtgga 1140

cacgctgctg gccggccgcg aaccccggtt cgtccacggc gacctgcacg ggaccaacat     1200 cacgctgctg gccggccgcg aaccccggtt cgtccacggc gacctgcacg ggaccaacat 1200

cttcgtggac ctggccgcga ccgaggtcac cgggatcgtc gacttcaccg acgtctatgc     1260 cttcgtggac ctggccgcga ccgaggtcac cgggatcgtc gacttcaccg acgtctatgc 1260

gggagactcc cgctacagcc tggtgcaact gcatctcaac gccttccggg gcgaccgcga     1320 gggagactcc cgctacagcc tggtgcaact gcatctcaac gccttccggg gcgaccgcga 1320

gatcctggcc gcgctgctcg acggggcgca gtggaagcgg accgaggact tcgcccgcga     1380 gatcctggcc gcgctgctcg acggggcgca gtggaagcgg accgaggact tcgcccgcga 1380

actgctcgcc ttcaccttcc tgcacgactt cgaggtgttc gaggagaccc cgctggatct     1440 actgctcgcc ttcaccttcc tgcacgactt cgaggtgttc gaggagaccc cgctggatct 1440

ctccggcttc accgatccgg aggaactggc gcagttcctc tgggggccgc cggacaccgc     1500 ctccggcttc accgatccgg aggaactggc gcagttcctc tgggggccgc cggacaccgc 1500

ccccggcgcc tgatctagac ccgggacttg acataatgtc gcttatcggc ttactcgaga     1560 ccccggcgcc tgatctagac ccgggacttg acataatgtc gcttatcggc ttactcgaga 1560

agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt     1620 agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt 1620

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc     1680 ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc 1680

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc     1740 taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 1740

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct     1800 cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct 1800

tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca     1860 tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca 1860

gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac     1920 gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac 1920

atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt     1980 atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 1980

ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg     2040 ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 2040

cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc     2100 cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 2100

tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc     2160 tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 2160

gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc     2220 gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 2220

aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac     2280 aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 2280

tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt     2340 tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 2340

aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct     2400 aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 2400

aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc     2460 aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc 2460

ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt     2520 ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 2520

ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg     2580 ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 2580

atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc     2640 atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 2640

atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa     2700 atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 2700

tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag     2760 tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 2760

gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg     2820 gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg 2820

tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga     2880 tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga 2880

gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag     2940

cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa     3000 cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa 3000

gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc     3060 gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc 3060

atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca     3120 atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca 3120

aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg     3180 aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg 3180

atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat     3240 atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat 3240

aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc     3300 aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc 3300

aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg     3360 aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg 3360

gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg     3420 gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg 3420

gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt     3480

gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca     3540 gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca 3540

ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata     3600 ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata 3600

ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac     3660 ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac 3660

atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa     3720 atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa 3720

gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt     3780 gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt 3780

atcacgaggc cctttcgtc                                                  3799 atcacgaggc cctttcgtc 3799

<210>  3

<211>  5064

<212>  DNA

<213>  Artificial sequence

<400>  3

ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60 ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180 gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc 180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240 caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc 240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300 ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag 300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420 agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac 420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480 cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg 480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540 caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg 540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600 gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg 600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctcca      660 taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctcca 660

ccgcggtggc ggccgctcta gaactagtgg atcccccggg ctgcaggaat tcgatatcaa      720 ccgcggtggc ggccgctcta gaactagtgg atcccccggg ctgcaggaat tcgatatcaa 720

gcttgcatgc ctgcaggtcg accaagacga tcaccgggct cgtcggagct ggaggcgcag      780 gcttgcatgc ctgcaggtcg accaagacga tcaccgggct cgtcggagct ggaggcgcag 780

cgggagctgg ctctgtattc tcaggcaagg ccggaggccc tggaggaaac accacggcgt      840 cgggagctgg ctctgtattc tcaggcaagg ccggaggccc tggaggaaac accacggcgt 840

ccgctgtcgg atggtcaggt ttgaccgcaa ccggcggtcc cggaggctct gtgatcgaca      900 ccgctgtcgg atggtcaggt ttgaccgcaa ccggcggtcc cggaggctct gtgatcgaca 900

tcctcagcgt cgccggaaag tcgcctggag atcggaccta caacgaccag ctctacatag      960 tcctcagcgt cgccggaaag tcgcctggag atcggaccta caacgaccag ctctacatag 960

gcggcgcaca acagaactca gctggcggga acggcaatgc tcctggcggc ggcggggctg     1020 gcggcgcaca acagaactca gctggcggga acggcaatgc tcctggcggc ggcggggctg 1020

gtgcccaggt ctccgcacag agcggcggtg ctggcgctcg cggccaggcg tggttcttcg     1080 gtgcccaggt ctccgcacag agcggcggtg ctggcgctcg cggccaggcg tggttcttcg 1080

cgtactgaca agaaaccccc ctctttagga ctcagtgtcc ttgggagggg ggctttttgc     1140 cgtactgaca agaaaccccc ctctttagga ctcagtgtcc ttgggagggg ggctttttgc 1140

gtttcaggag gtcttggcca gcttggacat cgcctcagcg atagcctcgt cgcgggcctc     1200 gtttcaggag gtcttggcca gcttggacat cgcctcagcg atagcctcgt cgcgggcctc 1200

agacgccatc tggtacttca tcgccatcct aggagtcgtg tgaccgagac gggccatcag     1260 agacgccatc tggtacttca tcgccatcct aggagtcgtg tgaccgagac gggccatcag 1260

ctccttggtc gtcgcacctg cctgagcggc gaacgtagcg ccgacagcgc ggaggtcgtg     1320 ctccttggtc gtcgcacctg cctgagcggc gaacgtagcg ccgacagcgc ggaggtcgtg 1320

gatgcggagt tccggccgac cgatcttggc gtagccacgc ttcagcgact tggtgaacgc     1380 gatgcggagt tccggccgac cgatcttggc gtagccacgc ttcagcgact tggtgaacgc 1380

ggacttcgac agccggttgc cctgcgtcgt ggtcaccagg aatgcctcgg ggcccttgtt     1440 ggacttcgac agccggttgc cctgcgtcgt ggtcaccagg aatgcctcgg ggcccttgtt 1440

catcttcgta cggtccttca tgtgcgctcg gatcatctcc gcgacgtgag gcggaaccgt     1500 catcttcgta cggtccttca tgtgcgctcg gatcatctcc gcgacgtgag gcggaaccgt 1500

cacaggacgc ttcgaccgga cggtcttggc gttgccaacg acgatcttgt tccccacgcg     1560 cacaggacgc ttcgaccgga cggtcttggc gttgccaacg acgatcttgt tccccacgcg 1560

ggaagcgcca cggcgcaccc ggagcttcat cgtcatgccg tcgtccacga tgtccttgcg     1620 ggaagcgcca cggcgcaccc ggagcttcat cgtcatgccg tcgtccacga tgtccttgcg 1620

gcgaagctcg atcagctctc cgaaccggag gctcgtccac gccaggatgt atgccgcgat     1680 gcgaagctcg atcagctctc cgaaccggag gctcgtccac gccaggatgt atgccgcgat 1680

ccggtagtgc tcgaagatct cagcggcgac gatgtccagc tcctcaggcg tcagcgcctc     1740 ccggtagtgc tcgaagatct cagcggcgac gatgtccagc tcctcaggcg tcagcgcctc 1740

tacgtcgcgc tcatcggctg ccttctgctc gatccggcac gggttctctg cgatcagctt     1800 tacgtcgcgc tcatcggctg ccttctgctc gatccggcac gggttctctg cgatcagctt 1800

gtcctcgacc gctgtgttca tcaccgcccg gaggacgttg taggcatgcc ggcgggcagt     1860 gtcctcgacc gctgtgttca tcaccgcccg gaggacgttg taggcatgcc ggcgggcagt 1860

cgggtgcttc ctacccatcc cggcccacca cgcacgcacc agagctggcg tcatctctgt     1920 cgggtgcttc ctacccatcc cggcccacca cgcacgcacc agagctggcg tcatctctgt 1920

gaccgccact tcacctagca ccgggtagat gcggcgctcc gcgtgcccgc tgtacagatc     1980 gaccgccact tcacctagca ccgggtagat gcggcgctcc gcgtgcccgc tgtacagatc 1980

cctggtgccg tctgcgaggt cgcgctccac gagccacttc cgggtgtact cctccagcgt     2040 cctggtgccg tctgcgaggt cgcgctccac gagccacttc cgggtgtact cctccagcgt 2040

gatggcgctg gcggctgcct tcttcgcccg gtcctgtgga ggggtccagg tctccatctc     2100 gatggcgctg gcggctgcct tcttcgcccg gtcctgtgga ggggtccagg tctccatctc 2100

gatgagccgc ttctcgcccg cgagccaggc ttcggcgtcc atcttgttgt cgtaggtctg     2160 gatgagccgc ttctcgcccg cgagccaggc ttcggcgtcc atcttgttgt cgtaggtctg 2160

cagcgcgtag tacctcacac cgtcctgcgg gttgacgtat gaggcttgga tcctcccgct     2220 cagcgcgtag tacctcacac cgtcctgcgg gttgacgtat gaggcttgga tcctcccgct 2220

gcgctgagtc ttcagcgatc cccatccgcg acgtgccaac taggtctcct ctcgtcgtga     2280 gcgctgagtc ttcagcgatc cccatccgcg acgtgccaac taggtctcct ctcgtcgtga 2280

acaaggctac cgggttgcaa ctcctgtgca actctcaggc ttcaacgcgc ttctacgacc     2340 acaaggctac cgggttgcaa ctcctgtgca actctcaggc ttcaacgcgc ttctacgacc 2340

tgcaatttct ttccacttag aggatgcagc cgagaggggg taaaaaccta tcttgaccgg     2400

cccatatgtg gtcggcagac acccattctt ccaaactagc tacgcgggtt cgattcccgt     2460

cgcccgctcc gctggtcaga gggtgttttc gccctctggc catttttctt tccaggggtc     2520

tgcaactctt gtgcgactct tctgacctgg gcatacgcgg ttgcaacgca tccctgatct     2580 tgcaactctt gtgcgactct tctgacctgg gcatacgcgg ttgcaacgca tccctgatct 2580

ggctactttc gatgctgaca aacgaataga gccccccgcc tgcgcgaaca gacgaggggc     2640 ggctactttc gatgctgaca aacgaataga gccccccgcc tgcgcgaaca gacgaggggc 2640

attcacacca gattggagct ggtgcagtga agagaataga ccgggacaag gttgcaccgg     2700 attcacacca gattggagct ggtgcagtga agagaataga ccgggacaag gttgcaccgg 2700

gagttgcagc ggtcggaacc ctcgccgtcg gcgggctggc gttcgccctg tcgttcacgg     2760 gagttgcagc ggtcggaacc ctcgccgtcg gcgggctggc gttcgccctg tcgttcacgg 2760

ctctcagcga gctggctgcg gccaacgggg tggcccaagc agagatggtg cccttggtgg     2820 ctctcagcga gctggctgcg gccaacgggg tggcccaagc agagatggtg cccttggtgg 2820

tcgactctag aggatccccg acctcgaggg ggggcccggt acccagcttt tgttcccttt     2880 tcgactctag aggatccccg acctcgaggg ggggcccggt acccagcttt tgttcccttt 2880

agtgagggtt aattgcgcgc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt     2940 agtgagggtt aattgcgcgc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt 2940

gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg     3000 gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg 3000

gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt     3060 gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 3060

cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt     3120 cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 3120

tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc     3180 tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 3180

tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg     3240 tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 3240

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg     3300 ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 3300

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac     3360 ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 3360

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg     3420 gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 3420

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct     3480 gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 3480

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg     3540 ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 3540

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct     3600 tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 3600

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac     3660 gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 3660

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt     3720 tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 3720

tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc     3780 tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 3780

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca     3840 tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 3840

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat     3900 ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 3900

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac     3960 ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 3960

gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt     4020 gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 4020

aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc     4080 aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 4080

aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg     4140 aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 4140

cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg     4200 cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 4200

ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc     4260 ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 4260

cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta     4320 cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 4320

ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg     4380 ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 4380

ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct     4440 ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 4440

ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta     4500 ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 4500

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg     4560 gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 4560

ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga     4620 ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 4620

ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt     4680 ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 4680

gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca     4740 gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 4740

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt     4800 ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 4800

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt     4860 cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 4860

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga     4920 ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 4920

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt     4980 aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 4980

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc     5040 gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 5040

gcacatttcc ccgaaaagtg ccac                                            5064 gcacatttcc ccgaaaagtg ccac 5064

<210>  4

<211>  6898

<212>  DNA

<213>  Artificial sequence

<400>  4

gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt       60

caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa      120

ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt      180

gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt      240

tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt      300

ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg      360

tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga      420

atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa      480

gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga      540

caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa      600

ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca      660

ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta      720

ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac      780

ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc      840

gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag      900

ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga      960

taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt     1020

agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata     1080

atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag     1140

aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa     1200

caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt     1260

ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc     1320

cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa     1380

tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa     1440

gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc     1500

ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa     1560

gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa     1620

caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg     1680

ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc     1740

tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg     1800

ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg     1860

agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg     1920

aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat     1980

gcagctggca cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg     2040

tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt     2100

tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg     2160

ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg     2220

gccgctctag aactagagtc gaccaccaag ggcaccatct ctgcttgggc caccccgttg     2280

gccgcagcca gctcgctgag agccgtgaac gacagggcga acgccagccc gccgacggcg     2340

agggttccga ccgctgcaac tcccggtgca accttgtccc ggtctattct cttcactgca     2400

ccagctccaa tctggtgtga atgcccctcg tctgttcgcg caggcggggg gctctattcg     2460

tttgtcagca tcgaaagtag ccagatcagg gatgcgttgc aaccgcgtat gcccaggtca     2520

gaagagtcgc acaagagttg cagacccctg gaaagaaaaa tggccagagg gcgaaaacac     2580

cctctgacca gcggagcggg cgacgggaat cgaacccgcg tagctagttt ggaagaatgg     2640

gtgtctgccg accacatatg ggccggtcaa gataggtttt taccccctct cggctgcatc     2700

ctctaagtgg aaagaaattg caggtcgtag aagcgcgttg aagcctgaga gttgcacagg     2760

agttgcaacc cggtagcctt gttcacgacg agaggagacc tagttggcac gtcgcggatg     2820

gggatcgctg aagactcagc gcagcgggag gatccaagcc tcatacgtca acccgcagga     2880

cggtgtgagg tactacgcgc tgcagaccta cgacaacaag atggacgccg aagcctggct     2940

cgcgggcgag aagcggctca tcgagatgga gacctggacc cctccacagg accgggcgaa     3000

gaaggcagcc gccagcgcca tcacgctgga ggagtacacc cggaagtggc tcgtggagcg     3060

cgacctcgca gacggcacca gggatctgta cagcgggcac gcggagcgcc gcatctaccc     3120

ggtgctaggt gaagtggcgg tcacagagat gacgccagct ctggtgcgtg cgtggtgggc     3180

cgggatgggt aggaagcacc cgactgcccg ccggcatgcc tacaacgtcc tccgggcggt     3240

gatgaacaca gcggtcgagg acaagctgat cgcagagaac ccgtgccgga tcgagcagaa     3300

ggcagccgat gagcgcgacg tagaggcgct gacgcctgag gagctggaca tcgtcgccgc     3360

tgagatcttc gagcactacc ggatcgcggc atacatcctg gcgtggacga gcctccggtt     3420

cggagagctg atcgagcttc gccgcaagga catcgtggac gacggcatga cgatgaagct     3480

ccgggtgcgc cgtggcgctt cccgcgtggg gaacaagatc gtcgttggca acgccaagac     3540

cgtccggtcg aagcgtcctg tgacggttcc gcctcacgtc gcggagatga tccgagcgca     3600

catgaaggac cgtacgaaga tgaacaaggg ccccgaggca ttcctggtga ccacgacgca     3660

gggcaaccgg ctgtcgaagt ccgcgttcac caagtcgctg aagcgtggct acgccaagat     3720

cggtcggccg gaactccgca tccacgacct ccgcgctgtc ggcgctacgt tcgccgctca     3780

ggcaggtgcg acgaccaagg agctgatggc ccgtctcggt cacacgactc ctaggatggc     3840

gatgaagtac cagatggcgt ctgaggcccg cgacgaggct atcgctgagg cgatgtccaa     3900

gctggccaag acctcctgaa acgcaaaaag cccccctccc aaggacactg agtcctaaag     3960

aggggggttt cttgtcagta cgcgaagaac cacgcctggc cgcgagcgcc agcaccgccg     4020

ctctgtgcgg agacctgggc accagccccg ccgccgccag gagcattgcc gttcccgcca     4080

gctgagttct gttgtgcgcc gcctatgtag agctggtcgt tgtaggtccg atctccaggc     4140

gactttccgg cgacgctgag gatgtcgatc acagagcctc cgggaccgcc ggttgcggtc     4200

aaacctgacc atccgacagc ggacgccgtg gtgtttcctc cagggcctcc ggccttgcct     4260

gagaatacag agccagctcc cgctgcgcct ccagctccga cgagcccggt gatcgtcttg     4320

gtcgacctgc aggcatgcaa gcttgatatc gaattccata tgcccaagct tctcgagact     4380

tgacataatg tcgcttatcg gcttaatcga tctagaccgg ccgtgcggaa ttaagccggc     4440

ccgtaccctg tgaatagagg tccgctgtga cacaagaatc cctgttactt ctcgaccgta     4500

ttgattcgga tgattcctac gcgagcctgc ggaacgacca ggagttctgg gagccgctgg     4560

cccgccgagc cctggaggag ctcgggctgc cggtgccgcc ggtgctgcgg gtgcccggcg     4620

agagcaccaa ccccgtactg gtcggcgagc ccggcccggt gatcaagctg ttcggcgagc     4680

actggtgcgg tccggagagc ctcgcgtcgg agtcggaggc gtacgcggtc ctggcggacg     4740

ccccggttcc ggtgccccgc ctcctcggcc gcggcgagct gcggcccggc accggagcct     4800

ggccgtggcc ctacctggtg atgagccgga tgaccggcac cacctggcgg tccgcgatgg     4860

acggcacgac cgaccggaac gcgctgctcg ccctggcccg cgaactcggc cgggtgctcg     4920

gacggctgca cagggtgccg ctgaccggga acaccgtgct caccccccat tccgaggtct     4980

tcccggaact gctgcgggaa cgccgcgcgg cgaccgtcga ggaccaccgc gggtggggct     5040

acctctcgcc ccggctgctg gaccgcctgg aggactggct gccggacgtg gacacgctgc     5100

tggccggccg cgaaccccgg ttcgtccacg gcgacctgca cgggaccaac atcttcgtgg     5160

acctggccgc gaccgaggtc accgggatcg tcgacttcac cgacgtctat gcgggagact     5220

cccgctacag cctggtgcaa ctgcatctca acgccttccg gggcgaccgc gagatcctgg     5280

ccgcgctgct cgacggggcg cagtggaagc ggaccgagga cttcgcccgc gaactgctcg     5340

ccttcacctt cctgcacgac ttcgaggtgt tcgaggagac cccgctggat ctctccggct     5400

tcaccgatcc ggaggaactg gcgcagttcc tctgggggcc gccggacacc gcccccggcg     5460

cctgatctag acccgggact tgacataatg tcgcttatcg gcttactcga gaagcttgcc     5520

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac     5580

ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac     5640

ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc     5700

ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag     5760

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc     5820

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg     5880

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac     5940

aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac     6000

ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc     6060

gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac     6120

tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc     6180

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa     6240

ggtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca     6300

acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc     6360

tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg     6420

cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt     6480

ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt     6540

cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct     6600

ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgattaggg     6660

tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga     6720

gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc     6780

ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga     6840

gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta caatttag       6898


<210> 5

<211> 1152

<212> DNA

<213> Artificial sequence

<400> 5

aagcttctcg agacttgaca taatgtcgct tatcggctta atcgatctag accggccgtg       60

cggaattaag ccggcccgta ccctgtgaat agaggtccgc tgtgacacaa gaatccctgt      120

tacttctcga ccgtattgat tcggatgatt cctacgcgag cctgcggaac gaccaggagt      180

tctgggagcc gctggcccgc cgagccctgg aggagctcgg gctgccggtg ccgccggtgc      240

tgcgggtgcc cggcgagagc accaaccccg tactggtcgg cgagcccggc ccggtgatca      300

agctgttcgg cgagcactgg tgcggtccgg agagcctcgc gtcggagtcg gaggcgtacg      360

cggtcctggc ggacgccccg gttccggtgc cccgcctcct cggccgcggc gagctgcggc      420

ccggcaccgg agcctggccg tggccctacc tggtgatgag ccggatgacc ggcaccacct      480

ggcggtccgc gatggacggc acgaccgacc ggaacgcgct gctcgccctg gcccgcgaac      540

tcggccgggt gctcggacgg ctgcacaggg tgccgctgac cgggaacacc gtgctcaccc      600

cccattccga ggtcttcccg gaactgctgc gggaacgccg cgcggcgacc gtcgaggacc      660

accgcgggtg gggctacctc tcgccccggc tgctggaccg cctggaggac tggctgccgg      720

acgtggacac gctgctggcc ggccgcgaac cccggttcgt ccacggcgac ctgcacggga      780

ccaacatctt cgtggacctg gccgcgaccg aggtcaccgg gatcgtcgac ttcaccgacg      840

tctatgcggg agactcccgc tacagcctgg tgcaactgca tctcaacgcc ttccggggcg      900

accgcgagat cctggccgcg ctgctcgacg gggcgcagtg gaagcggacc gaggacttcg      960

cccgcgaact gctcgccttc accttcctgc acgacttcga ggtgttcgag gagaccccgc     1020

tggatctctc cggcttcacc gatccggagg aactggcgca gttcctctgg gggccgccgg     1080

acaccgcccc cggcgcctga tctagacccg ggacttgaca taatgtcgct tatcggctta     1140

ctcgagaagc tt                                                         1152


<210> 6

<211> 1226

<212> DNA

<213> Artificial sequence

<400> 6

gcggccgctc tagaactagt ggatccgata tcgaattcca tatgcccaag cttctcgaga       60

cttgacataa tgtcgcttat cggcttaatc gatctagacc ggccgtgcgg aattaagccg      120

gcccgtaccc tgtgaataga ggtccgctgt gacacaagaa tccctgttac ttctcgaccg      180

tattgattcg gatgattcct acgcgagcct gcggaacgac caggagttct gggagccgct      240

ggcccgccga gccctggagg agctcgggct gccggtgccg ccggtgctgc gggtgcccgg      300

cgagagcacc aaccccgtac tggtcggcga gcccggcccg gtgatcaagc tgttcggcga      360

gcactggtgc ggtccggaga gcctcgcgtc ggagtcggag gcgtacgcgg tcctggcgga      420

cgccccggtt ccggtgcccc gcctcctcgg ccgcggcgag ctgcggcccg gcaccggagc      480

ctggccgtgg ccctacctgg tgatgagccg gatgaccggc accacctggc ggtccgcgat      540

ggacggcacg accgaccgga acgcgctgct cgccctggcc cgcgaactcg gccgggtgct      600

cggacggctg cacagggtgc cgctgaccgg gaacaccgtg ctcacccccc attccgaggt      660

cttcccggaa ctgctgcggg aacgccgcgc ggcgaccgtc gaggaccacc gcgggtgggg      720

ctacctctcg ccccggctgc tggaccgcct ggaggactgg ctgccggacg tggacacgct      780

gctggccggc cgcgaacccc ggttcgtcca cggcgacctg cacgggacca acatcttcgt      840

ggacctggcc gcgaccgagg tcaccgggat cgtcgacttc accgacgtct atgcgggaga      900

ctcccgctac agcctggtgc aactgcatct caacgccttc cggggcgacc gcgagatcct      960

ggccgcgctg ctcgacgggg cgcagtggaa gcggaccgag gacttcgccc gcgaactgct     1020

cgccttcacc ttcctgcacg acttcgaggt gttcgaggag accccgctgg atctctccgg     1080

cttcaccgat ccggaggaac tggcgcagtt cctctggggg ccgccggaca ccgcccccgg     1140

cgcctgatct agacccggga cttgacataa tgtcgcttat cggcttactc gagaagcttg     1200

gccatggctg cagcgctagc ggtacc                                          1226


<210> 7

<211> 51

<212> DNA

<213> Artificial sequence

<400> 7

agcttctcga gtaagccgat aagcgacatt atgtcaagtc ccgggtctag a                51


<210> 8

<211> 51

<212> DNA

<213> Artificial sequence

<400> 8

atcgattaag ccgataagcg acattatgtc aagtctcgag aagcttggta c                51


<210> 9

<211> 26

<212> DNA

<213> Artificial sequence

<400> 9

tgtctagacc ggccgtgcgg aattaa                                            26


<210> 10

<211> 29

<212> DNA

<213> Artificial sequence

<400> 10

attctagatc aggcgccggg ggcggtgtc                                         29


<210> 11

<211> 45

<212> DNA

<213> Artificial sequence

<400> 11

gctggtaccg ctagcgctgc agccatggca agcttctcga gtaag                       45


<210> 12

<211> 45

<212> DNA

<213> Artificial sequence

<400> 12

gcaggatccg atatcgaatt ccatatgccc aagcttctcg agact                       45


<210> 13

<211> 24

<212> DNA

<213> Artificial sequence

<400> 13

ggtccatggt gagcaagggc gagg                                              24


<210> 14

<211> 32

<212> DNA

<213> Artificial sequence

<400> 14

gcgggtacct tacttgtaca gctcgtccat gc                                     32

Claims (10)

1. A simplified and improved hygromycin gene, the nucleotide sequence of which is shown in SEQ ID NO: 1.
2. A resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers, comprising the simplified hygromycin gene and dif1 and dif2 sequences located at the two ends of the hygromycin gene, characterized in that the nucleotide sequence of the simplified hygromycin gene is shown in SEQ ID NO: 1, and the dif1 and dif2 sequences are shown in SEQ ID NO: 7 and SEQ ID NO: 8, respectively.
3. The resistance expression cassette according to claim 2, characterized in that multiple restriction enzyme sites are additionally added at both ends of the expression cassette.
4. The resistance expression cassette according to claim 3, characterized in that the nucleotide sequence of the resistance expression cassette is shown in SEQ ID NO: 5 or SEQ ID NO: 6.
5. A recombinant plasmid containing the resistance expression cassette according to any one of claims 2 to 4.
6. The recombinant plasmid according to claim 5, characterized in that the nucleotide sequence of the plasmid is shown in SEQ ID NO: 2.
7. A recombinant plasmid for efficiently constructing recombinant mycobacteria without resistance markers, comprising: a promoter, a phage integration site, an integrase gene, the resistance expression cassette according to any one of claims 2 to 4, and an origin of replication.
8. The recombinant plasmid according to claim 7, characterized in that it contains, in clockwise order, the phage integration site, the integrase gene, and the resistance expression cassette according to any one of claims 2 to 4.
9. The recombinant plasmid according to claim 7 or 8, characterized in that the promoter is the LacZ promoter or the BLA promoter, and the origin of replication is an Escherichia coli origin of replication.
10. The recombinant plasmid according to claim 7 or 8, characterized in that the nucleotide sequence of the recombinant plasmid is shown in SEQ ID NO: 4.
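
The cassette layout recited in claims 2 to 4 (a hygromycin gene flanked by dif1 and dif2) can be spot-checked against the sequence listing. The Python sketch below is illustrative only and is not part of the patent: the helper names `clean` and `revcomp` are hypothetical, and `CORE` is a 28-nt stretch that SEQ ID NO: 7 and SEQ ID NO: 8 appear to share, identified here by inspection rather than stated in the claims. Only the first row and the last two rows of SEQ ID NO: 5 are included, which is enough to look for the flanking sites.

```python
import re

def clean(*rows):
    """Join sequence-listing rows, dropping spaces and position numbers."""
    return re.sub(r"[^acgt]", "", "".join(rows).lower())

def revcomp(seq):
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(str.maketrans("acgt", "tgca"))[::-1]

# dif1 and dif2 as listed under SEQ ID NO: 7 and SEQ ID NO: 8.
SEQ7 = clean("agcttctcga gtaagccgat aagcgacatt atgtcaagtc ccgggtctag a 51")
SEQ8 = clean("atcgattaag ccgataagcg acattatgtc aagtctcgag aagcttggta c 51")

# First row and last two rows of SEQ ID NO: 5 (the cassette).
SEQ5_HEAD = clean(
    "aagcttctcg agacttgaca taatgtcgct tatcggctta atcgatctag accggccgtg 60"
)
SEQ5_TAIL = clean(
    "acaccgcccc cggcgcctga tctagacccg ggacttgaca taatgtcgct tatcggctta 1140",
    "ctcgagaagc tt 1152",
)

# Assumption: stretch common to SEQ7 and SEQ8, picked by inspection.
CORE = "taagccgataagcgacattatgtcaagt"
assert CORE in SEQ7 and CORE in SEQ8

# In the excerpts used here, the shared stretch shows up on the opposite strand.
SITE = revcomp(CORE)
head_pos = SEQ5_HEAD.find(SITE)  # offset within the first row, -1 if absent
tail_pos = SEQ5_TAIL.find(SITE)  # offset within the last two rows, -1 if absent
print(len(SEQ7), len(SEQ8), head_pos, tail_pos)
```

A non-negative offset at both ends is consistent with one copy of the shared stretch on each side of the resistance gene. The same check could be run on the full 1152-nt sequence to confirm that no additional copies occur inside the cassette.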
CN201310386264.0A 2013-06-06 2013-08-29 Resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers Expired - Fee Related CN103451181B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310386264.0A CN103451181B (en) 2013-06-06 2013-08-29 Resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201310224591.6 2013-06-06
CN2013102245916 2013-06-06
CN 201310224591 CN103320433A (en) 2013-06-06 2013-06-06 Resistance expression cassette used for highly efficiently constructing recombinant mycobacteria with no resistance marker
CN201310386264.0A CN103451181B (en) 2013-06-06 2013-08-29 Resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers

Publications (2)

Publication Number Publication Date
CN103451181A CN103451181A (en) 2013-12-18
CN103451181B true CN103451181B (en) 2015-09-23

Family

ID=49189477

Family Applications (2)

Application Number Title Priority Date Filing Date
CN 201310224591 Withdrawn CN103320433A (en) 2013-06-06 2013-06-06 Resistance expression cassette used for highly efficiently constructing recombinant mycobacteria with no resistance marker
CN201310386264.0A Expired - Fee Related CN103451181B (en) 2013-06-06 2013-08-29 Resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN 201310224591 Withdrawn CN103320433A (en) 2013-06-06 2013-06-06 Resistance expression cassette used for highly efficiently constructing recombinant mycobacteria with no resistance marker

Country Status (1)

Country Link
CN (2) CN103320433A (en)

Also Published As

Publication number Publication date
CN103451181A (en) 2013-12-18
CN103320433A (en) 2013-09-25

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20150923

Termination date: 20170829