CN103451181B - A resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers - Google Patents
- Publication number
- CN103451181B (application number CN201310386264.0A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Abstract
The invention discloses a resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers. The cassette contains a streamlined hygromycin resistance gene (SEQ ID NO: 1) flanked by dif1 and dif2 sequences. The streamlined Hyg gene is shortened from the original 1.7 kb to about 1 kb. The transcription terminator, the long native promoter, and other complex structures have been removed, which facilitates PCR amplification, and there are fewer internal restriction sites (common restriction sites have been removed by site-directed mutagenesis). Hyg also carries its own short artificial promoter, is expressed in both Escherichia coli and mycobacteria, and does not affect transcription or translation of downstream genes when inserted in a defined orientation. Because dif sequences flank the streamlined Hyg gene, the cassette is excised automatically in mycobacteria. After Hyg is lost, the disrupted gene can still express a shortened small peptide without affecting translation of downstream genes, which effectively avoids polar effects.
Description
Technical Field
The invention belongs to the field of genetic engineering and relates specifically to a resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers, and to its use.
Background Art
Mycobacterium tuberculosis (Mtb) is the pathogen that causes tuberculosis. It can invade any organ of the body, but pulmonary tuberculosis is the most common form. Tuberculosis remains a major infectious disease, and research on its diagnosis, prevention, and treatment has progressed slowly. All three areas depend on understanding the function of Mtb genes, so genetic modification of Mtb plays a pivotal role in Mtb research. Mtb grows slowly and is difficult to manipulate genetically. It requires an expensive negative-pressure biosafety level 3 laboratory, and long culture times take up space and invite contamination, so Mtb research is costly in time, money, and labor. Anti-Mtb drug development is one example: it is expensive and slow. In nearly half a century, only one drug with a new mechanism of action had reached the market, at the end of 2012, and it carries potential toxicity. One of the core problems in current research on anti-Mtb drugs and vaccines is therefore how to build genetically engineered strains effectively, quickly, and cheaply.
At present, only the kanamycin and hygromycin resistance genes work reasonably well as selection markers for genetic manipulation of mycobacteria. If a mutant strain under study already carries a resistance gene, later genetic manipulation becomes inconvenient. For example, gene knockout followed by genetic complementation requires at least two resistance genes, and so does recombineering. In addition, mycobacteria sometimes need to express several antigen proteins, which takes several rounds of genetic manipulation, so producing recombinant strains without resistance markers is especially important for follow-up work.
Gene knockout can replace a target gene with a resistance gene, but knockout often relies on phage packaging. The sequence upstream of the target gene, the resistance gene, and the sequence downstream of the target gene (the "three fragments") must be packaged together into a phage to form a phasmid. The total length of the three fragments is limited: if it is too long, the construct cannot be packaged and no knockout phasmid is obtained. Because packaging capacity is limited, a shorter resistance gene leaves room for longer upstream and downstream sequences, or even for additional elements, which increases the length of useful DNA that can be packaged. After a knockout, PCR is one of the most effective ways to verify it. However, the transcription terminator of the conventional Hyg gene has a complex secondary structure that makes PCR difficult.
Some inventions flank Hyg with Res or similar sequences, but such sequences require a foreign protein to be expressed artificially to recognize and act on them before Hyg is excised. In practice, a plasmid carrying Res-Hyg-Res is first introduced into mycobacteria. Once clones with Res-Hyg-Res inserted in the genome have been selected, a second plasmid expressing the resolvase is introduced, and a further round of screening yields mycobacteria that have lost Hyg. Many experiments then need yet another round of screening for strains that have lost the resolvase plasmid. The procedure is therefore tedious and time-consuming, and in some similar systems resistance gene loss in mycobacteria is extremely inefficient.
Flanking Hyg with dif sequences to facilitate its excision has been reported only in recent years. Its advantage is that no foreign protein needs to be expressed to excise the resistance gene, because mycobacteria themselves encode the required proteins, which avoids much unnecessary work. The system is also highly efficient and broadly compatible (for example, the dif sequence of slow-growing Mycobacterium tuberculosis also works in fast-growing Mycobacterium smegmatis). However, the Hyg resistance gene used in published dif-based studies is long (1.7 kb), has a complex secondary structure at its 3' end, contains common restriction sites internally, and offers few usable restriction sites at its ends, and the resulting resistance expression cassettes are not "universal".
Therefore, if a shorter, easily manipulated, universal resistance expression cassette could be built that excises itself in mycobacteria, carries multiple cloning sites, and causes no polar effect, it would greatly simplify genetic manipulation of many mycobacteria. Because the resistance gene introduced by the cassette is excised automatically, later manipulations are not limited by the choice of resistance gene, and mutants without a resistance gene make a wider range of genetic studies feasible. Such a cassette would be a powerful tool for genetic engineering research on mycobacteria.
Summary of the Invention
One object of the invention is to provide a streamlined hygromycin resistance gene, Hyg, that carries its own artificial promoter.
Another object of the invention is to provide a resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers.
A further object of the invention is to provide a recombinant plasmid for efficiently constructing recombinant mycobacteria without resistance markers.
The technical solution adopted by the invention is as follows:
A streamlined, modified hygromycin resistance gene whose nucleotide sequence is shown in SEQ ID NO: 1.
A resistance expression cassette for efficiently constructing recombinant mycobacteria without resistance markers. It contains the streamlined, modified hygromycin resistance gene flanked by dif1 and dif2 sequences, and is characterized in that the nucleotide sequence of the streamlined, modified hygromycin resistance gene is shown in SEQ ID NO: 1 and the dif1 and dif2 sequences are shown in SEQ ID NO: 7 and SEQ ID NO: 8, respectively. This cassette is named the "dif-ΩHYG-dif resistance expression cassette".
Preferably, multiple restriction sites are also added at both ends of the resistance expression cassette.
The nucleotide sequence of the resistance expression cassette is shown in SEQ ID NO: 5 or SEQ ID NO: 6.
A recombinant plasmid containing the dif-ΩHYG-dif resistance expression cassette described above. Preferably, the nucleotide sequence of the recombinant plasmid is shown in SEQ ID NO: 2.
A recombinant plasmid for efficiently constructing recombinant mycobacteria without resistance markers, containing: a promoter, a phage integration site, an integrase gene, the dif-ΩHYG-dif resistance expression cassette, and an origin of replication.
In the clockwise direction, the phage integration site, the integrase gene, and the dif-ΩHYG-dif resistance expression cassette are joined in that order.
The promoter is the LacZ promoter or the BLA promoter, and the origin of replication is an Escherichia coli origin of replication.
Preferably, the nucleotide sequence of the recombinant plasmid is shown in SEQ ID NO: 4.
The beneficial effects of the invention are:
(1) The streamlined Hyg gene is shortened from the original 1.7 kb to about 1 kb. The transcription terminator, the long native promoter, and other complex structures have been removed, which facilitates PCR amplification, and there are fewer internal restriction sites (common restriction sites have been removed by site-directed mutagenesis). Hyg also carries its own short artificial promoter, is expressed in both Escherichia coli and mycobacteria, and does not affect transcription or translation of downstream genes when inserted in a defined orientation.
(2) The resistance expression cassette has dif sequences at both ends of the streamlined Hyg gene and is excised automatically in mycobacteria. After Hyg is lost, the disrupted gene can still express a shortened small peptide without affecting translation of downstream genes, which effectively avoids polar effects.
(3) Multiple restriction sites are added at both ends of the resistance expression cassette, so the cassette can be inserted into a specific sequence and other sequences can be added on either side in a defined orientation.
Brief Description of the Drawings
Figure 1 is a schematic diagram of the structure of plasmid pUCDHmKE;
Figure 2 is a restriction site map of plasmid pUCDHmKE;
Figure 3 is a flow chart of the construction of plasmid pUCDHmKE;
Figure 4 is a schematic diagram of the structure of the integrative plasmid pMH94DHmKE;
Figure 5 is a flow chart of the construction of the integrative plasmid pMH94DHmKE;
Figure 6 is a schematic diagram of the structure of the integrative plasmid pblDHCiGn;
Figure 7 is a flow chart of the construction of plasmid pblDHCiGn;
Figure 8 is a simplified comparison of the dif-ΩHYG-dif expression cassette with the fragments containing the hygromycin (HYG) resistance gene (Hyg) that have commonly been used in genetic manipulation of mycobacteria. P1, P2, P3: putative Hyg promoters. Pa: putative artificial Hyg promoter, which overlaps dif1 in the dif-ΩHYG-dif expression cassette. Common restriction sites: E1, EcoRI (removed in our dif-ΩHYG-dif expression cassette); Nt, NotI; Sp, SpeI; B, BamHI; E5, EcoRV; Nd, NdeI; H, HindIII; Xo, XhoI; C, ClaI; Xb, XbaI; S, SmaI; Nc, NcoI; P, PstI; Nh, NheI; K, KpnI;
Figure 9 is an analysis of the sequences left in the genome after the cassette is excised and of the polypeptides they encode. (A) and (B): the HindIII-XhoI-dif-XhoI-HindIII sequence present in plasmid pUCDHmKE, read on the dif plus strand (A) and minus strand (B); this is the sequence left in the genome after excision of dif-ΩHYG-dif expression cassette V1. (C): the SpeI-BamHI-EcoRV-EcoRI-NdeI-HindIII-XhoI-dif-XhoI-HindIII-NcoI-PstI-NheI sequence (present in pblDHC1n), read on the dif minus strand; this is the sequence left in the genome after excision of dif-ΩHYG-dif expression cassette V2. Amino acids encoded in the three reading frames are shown in blue capital letters, underlining marks the amino acids encoded by the dif sequence in the reading frame that reads through, and an asterisk (*) marks a stop codon.
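The reading-frame analysis described for Figure 9 can be illustrated with a short Python sketch. It translates a sequence in its three forward frames using the standard genetic code. The input here is only the HindIII (aagctt) and XhoI (ctcgag) recognition sites joined together, not the actual residual sequence shown in the figure, and the function names `translate` and `three_frames` are illustrative and do not come from the patent.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def three_frames(dna):
    """Return the translations of the three forward reading frames."""
    return [translate(dna[offset:]) for offset in range(3)]


# Placeholder input: HindIII site followed by XhoI site (assumption, not the real scar).
scar_like = "aagctt" + "ctcgag"
print(three_frames(scar_like))  # ['KLLE', 'SFS', 'ASR']
```

A frame whose translation contains no `*` reads through the inserted sequence, which is the property the figure uses to argue that downstream translation is not interrupted.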
Detailed Description
The invention is further described below with reference to examples, but is not limited to them.
The molecular biology techniques used in the following examples include PCR amplification, plasmid extraction, plasmid transformation, DNA fragment ligation, restriction digestion, and gel electrophoresis. Unless otherwise stated, they follow conventional methods; see Molecular Cloning: A Laboratory Manual, 3rd edition (Sambrook J, Russell DW, Janssen K, Argentine J; Chinese translation by Huang Peitang et al., 2002, Beijing: Science Press), or the conditions recommended by the manufacturer.
The pfu polymerase, dNTPs, and related reagents used for PCR in the following examples were purchased from Sangon Biotech (Shanghai). The antibiotic hygromycin was purchased from Roche. Competent E. coli DH5α cells were purchased from Guangzhou Dongsheng Biotech Co., Ltd. (catalog number C1042). All DNA ligation reactions used the T4 DNA ligation kit from Takara Bio (model D6020A). The plasmid DNA extraction kit (BSC01M1), DNA recovery kit (BSC02M1), and PCR purification kit (BSC03M1) were purchased from Bioer Technology (博日生物).
The PCR primers and DNA fragments used in the following examples were synthesized by Shanghai Jierui Bioengineering Co., Ltd.
Example 1 Construction of plasmid pUCDHmKE
Figure 1 shows the structure of plasmid pUCDHmKE and Figure 2 shows its restriction site map. The plasmid contains the dif-ΩHYG-dif resistance expression cassette described above. As Figure 1 shows, pUCDHmKE contains, in clockwise order: the E. coli origin of replication (pMB1 ori), the dif-ΩHYG-dif resistance expression cassette, and the promoter P(BLA), which drives the ampicillin resistance gene Amp (bla). Amp can be used for plasmid selection.
The dif-ΩHYG-dif resistance expression cassette contains the streamlined, modified hygromycin resistance gene (Hyg) shown in SEQ ID NO: 1. Positions 24 to 73 of this gene are the promoter sequence (Pr), which drives expression of Hyg. The dif sequences joined to the two ends of the Hyg gene are shown in SEQ ID NO: 7 and SEQ ID NO: 8.
In addition, HindIII, XhoI, and XbaI restriction sites are attached in turn at each end of the dif-ΩHYG-dif resistance expression cassette. These three sites can be used to cut the cassette out of the plasmid.
Figure 3 shows the construction flow for plasmid pUCDHmKE. The procedure uses plasmid pNBV1 (a gift from William Bishai's laboratory at Johns Hopkins University, USA; its map is shown in Figure 3 and its construction is described in reference 1). pNBV1 contains the original hygromycin resistance gene (Hyg*), about 1.7 kb long.
The specific method is as follows:
1. Construction of plasmid pUCDis
Digest plasmid pUC19 (a commercial plasmid) with the restriction endonucleases KpnI and HindIII, and recover the 2.6 kb fragment with a purification and recovery kit;
Synthesize the DNA fragments dif1 and dif2 (SEQ ID NO: 7 and SEQ ID NO: 8; synthesized by Shanghai Jierui Bioengineering Co., Ltd.). The synthesized dif1 carries HindIII and XbaI sites at its two ends, and dif2 carries XbaI and KpnI sites. dif1, dif2, and the purified pUC19 fragment (KpnI/HindIII double digest) are therefore joined in a three-fragment ligation and transformed into competent E. coli DH5α cells. Positive clones are selected on LB agar plates containing ampicillin. Single colonies are picked and grown in LB liquid medium, the plasmid is extracted, and the clone confirmed by restriction digestion is plasmid pUCDis.
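The logic of the three-fragment ligation can be sketched in Python by treating each piece as a pair of labelled ends and asking which circular orders have matching ends at every junction. This is a simplified model under stated assumptions: ends are represented only by the name of the enzyme that produced them, and self-ligation of identical ends (for example two dif1 pieces joining through XbaI) is ignored. The names `PIECES` and `circular_assemblies` are hypothetical.

```python
from itertools import permutations

# (name, left end, right end); an end is labelled by the enzyme that cut it.
PIECES = [
    ("pUC19 backbone", "KpnI", "HindIII"),
    ("dif1", "HindIII", "XbaI"),
    ("dif2", "XbaI", "KpnI"),
]


def circular_assemblies(pieces):
    """List every circular order (first piece fixed) in which all adjacent ends match."""
    first, rest = pieces[0], pieces[1:]
    results = []
    for order in permutations(rest):
        chain = [first, *order]
        n = len(chain)
        # The right end of each piece must match the left end of the next one,
        # wrapping around from the last piece back to the first.
        if all(chain[i][2] == chain[(i + 1) % n][1] for i in range(n)):
            results.append([name for name, _, _ in chain])
    return results


print(circular_assemblies(PIECES))  # [['pUC19 backbone', 'dif1', 'dif2']]
```

Under this model only one order closes the circle, which is why the two dif fragments end up in a fixed arrangement with a single XbaI junction between them.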
2. 质粒pUCDHmKE的构建 2. Construction of plasmid pUCDHmKE
扩增片段Hyg:本步骤以质粒pNBV1为模板,用引物Hygf2(5’-TG TCTAGACCGGCCGTGCGGAATTAA-3’)(SEQ ID NO:9)和Hygr727(5’-ATTCTAGATCAGGCGCCGGGGGCGGTGTC-3’)(SEQ ID NO:10)进行扩增。这对引物可从质粒pNBV1中扩增出约1.1kb的Hyg片段,比原来的1.7kb明显精简了,且精简后的Hyg基因依然可以表达并行使抗性基因的作用,其效率与原始质粒中Hyg*基因表达抗性的效率相当。 Amplified fragment Hyg : In this step, plasmid pNBV1 is used as a template, and primers Hygf2 (5'-TG TCTAGACCGGCCGTGCGGAATTAA-3') (SEQ ID NO:9) and Hygr727 (5'-ATTCTAGATCAGGCGCCGGGGGCGGTGTC-3') (SEQ ID NO:10 ) for amplification. This pair of primers can amplify a Hyg fragment of about 1.1kb from the plasmid pNBV1, which is significantly more streamlined than the original 1.7kb, and the streamlined Hyg gene can still express and function as a resistance gene, and its efficiency is the same as that in the original plasmid. The Hyg * genes expressed resistance with comparable efficiency.
Both ends of the amplified fragment carry XbaI restriction sites. The amplified fragment was digested with XbaI to give the Hyg fragment (XbaI-cut), about 1.1 kb in size.
Plasmid pUCDis constructed in the previous step was digested with XbaI, and the digestion product was purified to recover the 4.0 kb fragment, designated the pUCDI fragment (XbaI-cut).
The pUCDI fragment (XbaI-cut) and the Hyg fragment (XbaI-cut) were ligated and transformed into competent E. coli DH5α. Positive clones were selected on LB agar plates containing hygromycin. Single colonies were picked and grown in liquid LB medium, the plasmid was extracted, and the clone confirmed as correct by restriction digestion was designated plasmid pUCDH. The KpnI and EcoRI sites in this plasmid were then mutated to remove both restriction sites, giving plasmid pUCDHmKE, whose sequence is shown in SEQ ID NO:2. Positions 426 to 1553 of this sequence form the 1128 bp dif-ΩHYG-dif resistance expression cassette, which is flanked at its two ends by the HindIII restriction site aagctt and the XhoI restriction site ctcgag. The Hyg fragment contained in the dif-ΩHYG-dif resistance expression cassette is shown in SEQ ID NO:1. This Hyg fragment lacks the transcription terminator, the long native promoter, and other complex structures, which makes it easy to amplify by PCR, and it has fewer internal restriction sites (common restriction sites were removed by site-directed mutagenesis). In addition, Hyg carries its own short artificial promoter, is expressed in both E. coli and mycobacteria, and, when inserted directionally, does not interfere with transcription or translation of downstream genes.
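The cassette coordinates quoted above can be checked against the sequence listing with a short Python sketch. It uses three 60-nt rows copied from SEQ ID NO:2 (rows ending at positions 420, 480, and 1560, with spaces removed); the helper name `site_start` is hypothetical, and the assumption that the cassette runs from the base after the upstream XhoI site to the base before the downstream XhoI site is an interpretation of the text.

```python
# Rows copied from SEQ ID NO:2 (pUCDHmKE), spaces removed.
ROW_361 = "tttcccagtcacgacgttgtaaaacgacggccagtgaattcgagctcggtaccaagcttc"   # 361-420
ROW_421 = "tcgagacttgacataatgtcgcttatcggcttaatcgatctagaccggccgtgcggaatt"   # 421-480
ROW_1501 = "ccccggcgcctgatctagacccgggacttgacataatgtcgcttatcggcttactcgaga"  # 1501-1560

XHOI = "ctcgag"  # XhoI recognition sequence

def site_start(region, first_position, site):
    """1-based position of the first occurrence of `site` in `region`,
    where `first_position` is the 1-based coordinate of region[0]."""
    return first_position + region.index(site)

upstream_xhoi = site_start(ROW_361 + ROW_421, 361, XHOI)
downstream_xhoi = site_start(ROW_1501, 1501, XHOI)

cassette_start = upstream_xhoi + len(XHOI)  # first base after the upstream XhoI site
cassette_end = downstream_xhoi - 1          # last base before the downstream XhoI site
cassette_length = cassette_end - cassette_start + 1  # inclusive 1-based coordinates

if __name__ == "__main__":
    print(cassette_start, cassette_end, cassette_length)
```

With inclusive 1-based coordinates the length is end minus start plus one, which is how positions 426 to 1553 give 1128 bp.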
Example 2 Construction and application of the integrating plasmid pMH94DHmKE
The structure of the integrating plasmid pMH94DHmKE is shown in Figure 4. Clockwise, plasmid pMH94DHmKE contains: the LacZ promoter (usable for blue-white screening and for driving expression of the plasmid backbone downstream of it), the phage integration site attP, the integrase gene Int (expresses integrase, which integrates the plasmid into the mycobacterial genome through the attP site), the dif-ΩHYG-dif resistance expression cassette, the E. coli origin of replication (oriE), and the ampicillin resistance gene Amp (for plasmid selection). Each end of the dif-ΩHYG-dif resistance expression cassette is joined, in order, to a HindIII restriction site and an XhoI restriction site, which can be used to excise the cassette from the plasmid.
The construction scheme for the integrating plasmid pMH94DHmKE is shown in Figure 5. Plasmid pMH94 used in this scheme was a gift from the laboratory of William Bishai at Johns Hopkins University, USA. Its map is shown in Figure 5, and its construction is described in Reference 2.
The specific procedure is as follows:
1. Plasmid pMH94 and plasmid pUCDHmKE constructed in Example 1 were each digested with HindIII. The fragment of about 6.0 kb from the pMH94 digest and the fragment of about 1.1 kb from the pUCDHmKE digest were recovered, ligated, and transformed into competent E. coli DH5α. Positive clones were selected on LB agar plates containing hygromycin. Single colonies were picked and grown in liquid LB medium, the plasmid was extracted, and the clone confirmed as correct by restriction digestion was designated plasmid pMH94DHmKE.
2. Plasmid pMH94DHmKE was introduced by electroporation into Mycobacterium tuberculosis (standard strain H37Rv, a gift from Guangzhou Chest Hospital) and into Mycobacterium smegmatis (ATCC 700044) to obtain the corresponding transformants.
The transformants were incubated at 37 °C for 12 hours to let the bacteria recover fully and express resistance. The culture was mixed by pipetting with a filter tip, dispensed into small tubes, and centrifuged at 10000 rpm for 1 minute to pellet the transformed cells. The supernatant was discarded, the pellet was resuspended in 0.6 mL of 7H9 medium, and 500 μL per plate was spread on 7H11 plates containing the corresponding antibiotic (150 μg/mL HYG). A 100-fold dilution was plated in parallel at 500 μL per plate. The plates were incubated at 37 °C protected from light.
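Plating an undiluted and a 100-fold diluted sample side by side makes it possible to estimate the density of transformants from whichever plate gives countable colonies. The Python sketch below shows the usual back-calculation; it is not described in the patent, the function name `cfu_per_ml` is hypothetical, and the colony count in the example is invented purely for illustration.

```python
def cfu_per_ml(colonies, plated_volume_ml, dilution_factor=1):
    """Estimate colony-forming units per mL of the undiluted suspension.

    colonies          -- colonies counted on the plate
    plated_volume_ml  -- volume spread on the plate, in mL
    dilution_factor   -- fold dilution applied before plating (1 = undiluted)
    """
    if plated_volume_ml <= 0 or dilution_factor <= 0:
        raise ValueError("volume and dilution factor must be positive")
    return colonies * dilution_factor / plated_volume_ml

if __name__ == "__main__":
    # Hypothetical count: 42 colonies on the 100-fold diluted plate, 0.5 mL plated.
    print(cfu_per_ml(42, 0.5, dilution_factor=100))
```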
M. tuberculosis or M. smegmatis carrying plasmid pMH94DHmKE was obtained on plates containing hygromycin (HYG). This step demonstrates that the dif-ΩHYG-dif resistance expression cassette of the invention is successfully expressed in mycobacteria.
The M. tuberculosis or M. smegmatis transformants carrying plasmid pMH94DHmKE were subcultured in liquid medium. After 3 days (for M. smegmatis) or 14 days (for M. tuberculosis) of culture, the culture was diluted and plated. About 80% of the colonies on the resulting plates had lost Hyg resistance, that is, they could not grow on plates containing hygromycin (HYG). This step demonstrates that the dif-ΩHYG-dif resistance expression cassette of this patent is excised automatically in mycobacteria, and with high efficiency.
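A figure such as "about 80%" comes from counting colonies that fail to grow on hygromycin. The Python sketch below shows one way to compute that fraction together with a Wilson score interval, which indicates how precise the estimate is for a given number of colonies. The patent does not report raw counts or any interval, so the counts here are hypothetical and the function names are illustrative.

```python
import math

def loss_fraction(sensitive, total):
    """Fraction of tested colonies that lost hygromycin resistance."""
    if total <= 0 or not 0 <= sensitive <= total:
        raise ValueError("need 0 <= sensitive <= total and total > 0")
    return sensitive / total

def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for about 95%)."""
    p = loss_fraction(successes, total)
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half, center + half

if __name__ == "__main__":
    # Hypothetical counts: 80 of 100 colonies fail to grow on HYG plates.
    low, high = wilson_interval(80, 100)
    print(f"{loss_fraction(80, 100):.2f} ({low:.3f} to {high:.3f})")
```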
Example 3 Construction and application of the integrating plasmid pblDHCiGn
In the integrating plasmid pblDHCiGn, the resistance expression cassette is flanked on both sides by double restriction sites. The plasmid was constructed to show that the cassette remains functional in this form. The plasmid structure is shown in Figure 6. Clockwise, it contains: the promoter P(BLA) (drives expression of the downstream Amp gene), the ampicillin resistance gene Amp (for plasmid selection), the E. coli origin of replication (OriE), the phage integration site (attP), the integrase gene Int (expresses integrase, which integrates the plasmid into the mycobacterial genome through the attP site), the dif-ΩHYG-dif resistance expression cassette, and the enhanced green fluorescent protein gene eGFP. Different restriction sites are joined to the two ends of the dif-ΩHYG-dif resistance expression cassette.
The construction scheme for plasmid pblDHCiGn is shown in Figure 7. Plasmid pblueINT used in this scheme was a gift from Professor Eric Nuermberger of Johns Hopkins University, USA. Its map is shown in Figure 7, its construction is described in Reference 3, and its sequence is shown in SEQ ID NO:3.
The specific construction steps are as follows:
1. Construction of plasmid pblDHC1n: plasmid pblueINT was digested with the restriction endonucleases KpnI and BamHI, and the 2.9 kb functional fragment A was recovered. Using plasmid pUCDHmKE as the template, primers Hygcf (5'-GCTGGTACCGCTAGCGCTGCAGCCATGGCAAGCTTCTCGAGTAAG-3') (SEQ ID NO:11) and Hygcr (5'-GCAGGATCCGATATCGAATTCCATATGCCCAAGCTTCTCGAGACT-3') (SEQ ID NO:12) were used to amplify a 1.2 kb dif-ΩHYG-dif fragment carrying a KpnI site at one end and a BamHI site at the other. After digestion, this fragment was ligated to functional fragment A to give the 4.1 kb plasmid pblDHC1n.
2. Construction of plasmid pblDHCGn: plasmid pblDHC1n was digested with the restriction endonucleases NcoI and KpnI, and the 4.1 kb functional fragment B was recovered. Using plasmid pEGFP-N1 (a commercial plasmid) as the template, primers GYF-f (5'-GGTCCATGGTGAGCAAGGGCGAGG-3') (SEQ ID NO:13) and GYF-r (5'-GCGGGTACCTTACTTGTACAGCTCGTCCATGC-3') (SEQ ID NO:14) were used to amplify a 734 bp eGFP fragment, which after digestion was ligated to functional fragment B to give plasmid pblDHCGn.
3. Construction of plasmid pblDHCiGn: plasmid pblDHCGn was digested with the restriction endonucleases SpeI and EcoRI, and the 4.7 kb functional fragment C was recovered. Plasmid pblueINT was digested with the restriction endonucleases XbaI and EcoRI, and the 2.1 kb Int:attP fragment was recovered and ligated to functional fragment C to give plasmid pblDHCiGn, whose sequence is shown in SEQ ID NO:4.
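The cloning steps above depend on restriction sites built into the 5' ends of the primers. The Python sketch below scans the four primers from steps 1 and 2 for the recognition sequences of the enzymes named in this example. The recognition sequences are the standard ones for these enzymes and are supplied here rather than taken from the patent; the function name `find_sites` is hypothetical.

```python
# Standard recognition sequences for the enzymes named in this example.
SITES = {
    "KpnI": "GGTACC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "XhoI": "CTCGAG",
    "NcoI": "CCATGG",
    "EcoRI": "GAATTC",
    "XbaI": "TCTAGA",
    "SpeI": "ACTAGT",
}

PRIMERS = {
    "Hygcf": "GCTGGTACCGCTAGCGCTGCAGCCATGGCAAGCTTCTCGAGTAAG",  # SEQ ID NO:11
    "Hygcr": "GCAGGATCCGATATCGAATTCCATATGCCCAAGCTTCTCGAGACT",  # SEQ ID NO:12
    "GYF-f": "GGTCCATGGTGAGCAAGGGCGAGG",                       # SEQ ID NO:13
    "GYF-r": "GCGGGTACCTTACTTGTACAGCTCGTCCATGC",               # SEQ ID NO:14
}

def find_sites(seq):
    """Return enzyme names whose sites occur in seq, ordered 5' to 3'."""
    seq = seq.upper()
    hits = []
    for name, site in SITES.items():
        start = seq.find(site)
        while start != -1:
            hits.append((start, name))
            start = seq.find(site, start + 1)
    return [name for _, name in sorted(hits)]

if __name__ == "__main__":
    for primer_name, primer_seq in PRIMERS.items():
        print(primer_name, find_sites(primer_seq))
```

The outermost site on each primer is the one used for cloning in the step where it appears (KpnI and BamHI in step 1, NcoI and KpnI in step 2); the additional sites in Hygcf and Hygcr are what place double restriction sites on either side of the cassette.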
The constructed plasmid pblDHCiGn was introduced into M. tuberculosis or M. smegmatis by electroporation to obtain the corresponding transformants, in which eGFP expression could be observed. This shows that, with double restriction sites added at both ends of the dif-ΩHYG-dif resistance expression cassette, the Hyg promoter drives expression not only of the Hyg resistance gene but also of the gene located downstream of the Hyg resistance gene, beyond the dif sequence and the restriction sites. Green fluorescent protein serves as the reporter here.
Next, the M. tuberculosis or M. smegmatis transformants carrying plasmid pblDHCiGn were subcultured in liquid medium. After 3 days (for M. smegmatis) or 14 days (for M. tuberculosis) of culture, the culture was diluted and plated. About 80% of the colonies on the resulting plates had lost HYG resistance, that is, they could not grow on plates containing hygromycin (HYG). This again demonstrates that the dif-ΩHYG-dif resistance expression cassette of the invention is excised automatically in mycobacteria, and with high efficiency. Moreover, green fluorescence could still be observed in transformants that had lost HYG resistance. This shows that automatic excision of the resistance expression cassette does not affect expression of the gene downstream of its insertion site; in other words, the resistance expression cassette designed in the invention does not affect downstream gene expression whether it is present or absent.
References:
[1] Howard NS, Gomez JE, Ko C, Bishai WR. Color selection with a hygromycin-resistance-based Escherichia coli-mycobacterial shuttle vector. Gene 1995;166:181-182.
[2] Lee MH, Pascopella L, Jacobs WR Jr, Hatfull GF. Site-specific integration of mycobacteriophage L5: integration-proficient vectors for Mycobacterium smegmatis, Mycobacterium tuberculosis, and bacille Calmette-Guerin. Proc Natl Acad Sci U S A 1991;88:3111-3115.
[3] Zhang T, Li SY, Nuermberger EL. Autoluminescent Mycobacterium tuberculosis for rapid, real-time, non-invasive assessment of drug and vaccine efficacy. PLoS ONE 2012;7(1):e29774.
<110> Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences
<120> A resistance expression cassette for efficient construction of recombinant mycobacteria without resistance markers
<130>
<150> 2013102245916
<151> 2013-06-06
<160> 14
<170> PatentIn version 3.5
<210> 1
<211> 1049
<212> DNA
<213> Artificial sequence
<400> 1
ccggccgtgc ggaattaagc cggcccgtac cctgtgaata gaggtccgct gtgacacaag 60
aatccctgtt acttctcgac cgtattgatt cggatgattc ctacgcgagc ctgcggaacg 120
accaggagtt ctgggagccg ctggcccgcc gagccctgga ggagctcggg ctgccggtgc 180
cgccggtgct gcgggtgccc ggcgagagca ccaaccccgt actggtcggc gagcccggcc 240
cggtgatcaa gctgttcggc gagcactggt gcggtccgga gagcctcgcg tcggagtcgg 300
aggcgtacgc ggtcctggcg gacgccccgg ttccggtgcc ccgcctcctc ggccgcggcg 360
agctgcggcc cggcaccgga gcctggccgt ggccctacct ggtgatgagc cggatgaccg 420
gcaccacctg gcggtccgcg atggacggca cgaccgaccg gaacgcgctg ctcgccctgg 480
cccgcgaact cggccgggtg ctcggacggc tgcacagggt gccgctgacc gggaacaccg 540
tgctcacccc ccattccgag gtcttcccgg aactgctgcg ggaacgccgc gcggcgaccg 600
tcgaggacca ccgcgggtgg ggctacctct cgccccggct gctggaccgc ctggaggact 660
ggctgccgga cgtggacacg ctgctggccg gccgcgaacc ccggttcgtc cacggcgacc 720
tgcacgggac caacatcttc gtggacctgg ccgcgaccga ggtcaccggg atcgtcgact 780
tcaccgacgt ctatgcggga gactcccgct acagcctggt gcaactgcat ctcaacgcct 840
tccggggcga ccgcgagatc ctggccgcgc tgctcgacgg ggcgcagtgg aagcggaccg 900
aggacttcgc ccgcgaactg ctcgccttca ccttcctgca cgacttcgag gtgttcgagg 960
agaccccgct ggatctctcc ggcttcaccg atccggagga actggcgcag ttcctctggg 1020
ggccgccgga caccgccccc ggcgcctga 1049
<210> 2
<211> 3799
<212> DNA
<213> Artificial sequence
<400> 2
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt accaagcttc 420
tcgagacttg acataatgtc gcttatcggc ttaatcgatc tagaccggcc gtgcggaatt 480
aagccggccc gtaccctgtg aatagaggtc cgctgtgaca caagaatccc tgttacttct 540
cgaccgtatt gattcggatg attcctacgc gagcctgcgg aacgaccagg agttctggga 600
gccgctggcc cgccgagccc tggaggagct cgggctgccg gtgccgccgg tgctgcgggt 660
gcccggcgag agcaccaacc ccgtactggt cggcgagccc ggcccggtga tcaagctgtt 720
cggcgagcac tggtgcggtc cggagagcct cgcgtcggag tcggaggcgt acgcggtcct 780
ggcggacgcc ccggttccgg tgccccgcct cctcggccgc ggcgagctgc ggcccggcac 840
cggagcctgg ccgtggccct acctggtgat gagccggatg accggcacca cctggcggtc 900
cgcgatggac ggcacgaccg accggaacgc gctgctcgcc ctggcccgcg aactcggccg 960
ggtgctcgga cggctgcaca gggtgccgct gaccgggaac accgtgctca ccccccattc 1020
cgaggtcttc ccggaactgc tgcgggaacg ccgcgcggcg accgtcgagg accaccgcgg 1080
gtggggctac ctctcgcccc ggctgctgga ccgcctggag gactggctgc cggacgtgga 1140
cacgctgctg gccggccgcg aaccccggtt cgtccacggc gacctgcacg ggaccaacat 1200
cttcgtggac ctggccgcga ccgaggtcac cgggatcgtc gacttcaccg acgtctatgc 1260
gggagactcc cgctacagcc tggtgcaact gcatctcaac gccttccggg gcgaccgcga 1320
gatcctggcc gcgctgctcg acggggcgca gtggaagcgg accgaggact tcgcccgcga 1380
actgctcgcc ttcaccttcc tgcacgactt cgaggtgttc gaggagaccc cgctggatct 1440
ctccggcttc accgatccgg aggaactggc gcagttcctc tgggggccgc cggacaccgc 1500
ccccggcgcc tgatctagac ccgggacttg acataatgtc gcttatcggc ttactcgaga 1560
agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt 1620
ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc 1680
taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 1740
cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct 1800
tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca 1860
gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac 1920
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 1980
ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 2040
cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 2100
tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 2160
gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 2220
aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 2280
tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 2340
aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 2400
aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc 2460
ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 2520
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 2580
atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 2640
atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 2700
tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 2760
gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg 2820
tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga 2880
gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag 2940
cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa 3000
gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc 3060
atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca 3120
aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg 3180
atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat 3240
aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc 3300
aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg 3360
gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg 3420
gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt 3480
gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca 3540
ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata 3600
ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac 3660
atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa 3720
gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt 3780
atcacgaggc cctttcgtc 3799
<210> 3
<211> 5064
<212> DNA
<213> Artificial sequence
<400> 3
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc 180
caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc 240
ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag 300
cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa 360
agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac 420
cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg 480
caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg 540
gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg 600
taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctcca 660
ccgcggtggc ggccgctcta gaactagtgg atcccccggg ctgcaggaat tcgatatcaa 720
gcttgcatgc ctgcaggtcg accaagacga tcaccgggct cgtcggagct ggaggcgcag 780
cgggagctgg ctctgtattc tcaggcaagg ccggaggccc tggaggaaac accacggcgt 840
ccgctgtcgg atggtcaggt ttgaccgcaa ccggcggtcc cggaggctct gtgatcgaca 900
tcctcagcgt cgccggaaag tcgcctggag atcggaccta caacgaccag ctctacatag 960
gcggcgcaca acagaactca gctggcggga acggcaatgc tcctggcggc ggcggggctg 1020
gtgcccaggt ctccgcacag agcggcggtg ctggcgctcg cggccaggcg tggttcttcg 1080
cgtactgaca agaaaccccc ctctttagga ctcagtgtcc ttgggagggg ggctttttgc 1140
gtttcaggag gtcttggcca gcttggacat cgcctcagcg atagcctcgt cgcgggcctc 1200
agacgccatc tggtacttca tcgccatcct aggagtcgtg tgaccgagac gggccatcag 1260
ctccttggtc gtcgcacctg cctgagcggc gaacgtagcg ccgacagcgc ggaggtcgtg 1320
gatgcggagt tccggccgac cgatcttggc gtagccacgc ttcagcgact tggtgaacgc 1380
ggacttcgac agccggttgc cctgcgtcgt ggtcaccagg aatgcctcgg ggcccttgtt 1440
catcttcgta cggtccttca tgtgcgctcg gatcatctcc gcgacgtgag gcggaaccgt 1500
cacaggacgc ttcgaccgga cggtcttggc gttgccaacg acgatcttgt tccccacgcg 1560
ggaagcgcca cggcgcaccc ggagcttcat cgtcatgccg tcgtccacga tgtccttgcg 1620
gcgaagctcg atcagctctc cgaaccggag gctcgtccac gccaggatgt atgccgcgat 1680
ccggtagtgc tcgaagatct cagcggcgac gatgtccagc tcctcaggcg tcagcgcctc 1740
tacgtcgcgc tcatcggctg ccttctgctc gatccggcac gggttctctg cgatcagctt 1800
gtcctcgacc gctgtgttca tcaccgcccg gaggacgttg taggcatgcc ggcgggcagt 1860
cgggtgcttc ctacccatcc cggcccacca cgcacgcacc agagctggcg tcatctctgt 1920
gaccgccact tcacctagca ccgggtagat gcggcgctcc gcgtgcccgc tgtacagatc 1980
cctggtgccg tctgcgaggt cgcgctccac gagccacttc cgggtgtact cctccagcgt 2040
gatggcgctg gcggctgcct tcttcgcccg gtcctgtgga ggggtccagg tctccatctc 2100
gatgagccgc ttctcgcccg cgagccaggc ttcggcgtcc atcttgttgt cgtaggtctg 2160
cagcgcgtag tacctcacac cgtcctgcgg gttgacgtat gaggcttgga tcctcccgct 2220
gcgctgagtc ttcagcgatc cccatccgcg acgtgccaac taggtctcct ctcgtcgtga 2280
acaaggctac cgggttgcaa ctcctgtgca actctcaggc ttcaacgcgc ttctacgacc 2340
tgcaatttct ttccacttag aggatgcagc cgagaggggg taaaaaccta tcttgaccgg 2400
cccatatgtg gtcggcagac acccattctt ccaaactagc tacgcgggtt cgattcccgt 2460
cgcccgctcc gctggtcaga gggtgttttc gccctctggc catttttctt tccaggggtc 2520
tgcaactctt gtgcgactct tctgacctgg gcatacgcgg ttgcaacgca tccctgatct 2580
ggctactttc gatgctgaca aacgaataga gccccccgcc tgcgcgaaca gacgaggggc 2640 ggctactttc gatgctgaca aacgaataga gccccccgcc tgcgcgaaca gacgaggggc 2640
attcacacca gattggagct ggtgcagtga agagaataga ccgggacaag gttgcaccgg 2700 attcacacca gattggagct ggtgcagtga agagaataga ccgggacaag gttgcaccgg 2700
gagttgcagc ggtcggaacc ctcgccgtcg gcgggctggc gttcgccctg tcgttcacgg 2760 gagttgcagc ggtcggaacc ctcgccgtcg gcgggctggc gttcgccctg tcgttcacgg 2760
ctctcagcga gctggctgcg gccaacgggg tggcccaagc agagatggtg cccttggtgg 2820 ctctcagcga gctggctgcg gccaacgggg tggcccaagc agagatggtg cccttggtgg 2820
tcgactctag aggatccccg acctcgaggg ggggcccggt acccagcttt tgttcccttt 2880 tcgactctag aggatccccg acctcgaggg ggggcccggt acccagcttt tgttcccttt 2880
agtgagggtt aattgcgcgc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt 2940 agtgagggtt aattgcgcgc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt 2940
gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg 3000 gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg 3000
gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 3060 gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 3060
cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 3120 cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 3120
tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 3180 tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 3180
tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 3240 tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 3240
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 3300 ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 3300
ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 3360 ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 3360
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 3420 gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 3420
gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 3480 gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 3480
ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 3540 ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 3540
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 3600 tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 3600
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 3660 gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 3660
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 3720 tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 3720
tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 3780 tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 3780
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 3840 tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 3840
ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 3900 ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 3900
ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 3960 ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 3960
gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 4020 gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 4020
aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 4080 aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 4080
aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 4140 aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 4140
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 4200 cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 4200
ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 4260 ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 4260
cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 4320 cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 4320
ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 4380 ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 4380
ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 4440 ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 4440
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 4500 ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 4500
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 4560 gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 4560
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 4620 ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 4620
ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 4680 ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 4680
gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 4740 gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 4740
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 4800 ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 4800
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 4860 cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 4860
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 4920 ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 4920
aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 4980 aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 4980
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 5040 gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 5040
gcacatttcc ccgaaaagtg ccac 5064 gcacatttcc ccgaaaagtg ccac 5064
<210> 4 <210> 4
<211> 6898 <211> 6898
<212> DNA <212> DNA
<213> Artificial sequence
<400> 4 <400> 4
gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt 60 gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt 60
caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa 120 caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa 120
ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt 180 ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt 180
gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt 240 gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt 240
tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt 300 tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt 300
ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg 360 ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg 360
tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga 420 tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga 420
atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa 480 atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa 480
gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga 540 gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga 540
caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa 600 caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa 600
ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca 660 ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca 660
ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta 720 ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta 720
ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac 780 ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac 780
ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc 840 ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc 840
gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag 900 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag 900
ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga 960 ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga 960
taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt 1020 taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt 1020
agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata 1080 agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata 1080
atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag 1140 atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag 1140
aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa 1200 aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa 1200
caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt 1260 caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt 1260
ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc 1320 ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc 1320
cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa 1380 cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa 1380
tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa 1440 tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa 1440
gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc 1500 gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc 1500
ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa 1560 ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa 1560
gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa 1620 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa 1620
caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg 1680 caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg 1680
ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc 1740 ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc 1740
tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg 1800 tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg 1800
ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg 1860 ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg 1860
agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg 1920 agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg 1920
aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat 1980 aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat 1980
gcagctggca cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg 2040 gcagctggca cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg 2040
tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 2100 tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 2100
tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 2160 tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 2160
ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 2220 ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 2220
gccgctctag aactagagtc gaccaccaag ggcaccatct ctgcttgggc caccccgttg 2280 gccgctctag aactagagtc gaccaccaag ggcaccatct ctgcttgggc caccccgttg 2280
gccgcagcca gctcgctgag agccgtgaac gacagggcga acgccagccc gccgacggcg 2340 gccgcagcca gctcgctgag agccgtgaac gacagggcga acgccagccc gccgacggcg 2340
agggttccga ccgctgcaac tcccggtgca accttgtccc ggtctattct cttcactgca 2400 agggttccga ccgctgcaac tcccggtgca accttgtccc ggtctattct cttcactgca 2400
ccagctccaa tctggtgtga atgcccctcg tctgttcgcg caggcggggg gctctattcg 2460 ccagctccaa tctggtgtga atgcccctcg tctgttcgcg caggcggggg gctctattcg 2460
tttgtcagca tcgaaagtag ccagatcagg gatgcgttgc aaccgcgtat gcccaggtca 2520 tttgtcagca tcgaaagtag ccagatcagg gatgcgttgc aaccgcgtat gcccaggtca 2520
gaagagtcgc acaagagttg cagacccctg gaaagaaaaa tggccagagg gcgaaaacac 2580 gaagagtcgc acaagagttg cagacccctg gaaagaaaaa tggccagagg gcgaaaacac 2580
cctctgacca gcggagcggg cgacgggaat cgaacccgcg tagctagttt ggaagaatgg 2640 cctctgacca gcggagcggg cgacgggaat cgaacccgcg tagctagttt ggaagaatgg 2640
gtgtctgccg accacatatg ggccggtcaa gataggtttt taccccctct cggctgcatc 2700 gtgtctgccg accacatatg ggccggtcaa gataggtttt taccccctct cggctgcatc 2700
ctctaagtgg aaagaaattg caggtcgtag aagcgcgttg aagcctgaga gttgcacagg 2760 ctctaagtgg aaagaaattg caggtcgtag aagcgcgttg aagcctgaga gttgcacagg 2760
agttgcaacc cggtagcctt gttcacgacg agaggagacc tagttggcac gtcgcggatg 2820 agttgcaacc cggtagcctt gttcacgacg agaggagacc tagttggcac gtcgcggatg 2820
gggatcgctg aagactcagc gcagcgggag gatccaagcc tcatacgtca acccgcagga 2880 gggatcgctg aagactcagc gcagcgggag gatccaagcc tcatacgtca acccgcagga 2880
cggtgtgagg tactacgcgc tgcagaccta cgacaacaag atggacgccg aagcctggct 2940 cggtgtgagg tactacgcgc tgcagaccta cgacaacaag atggacgccg aagcctggct 2940
cgcgggcgag aagcggctca tcgagatgga gacctggacc cctccacagg accgggcgaa 3000 cgcgggcgag aagcggctca tcgagatgga gacctggacc cctccacagg accgggcgaa 3000
gaaggcagcc gccagcgcca tcacgctgga ggagtacacc cggaagtggc tcgtggagcg 3060 gaaggcagcc gccagcgcca tcacgctgga ggagtacacc cggaagtggc tcgtggagcg 3060
cgacctcgca gacggcacca gggatctgta cagcgggcac gcggagcgcc gcatctaccc 3120 cgacctcgca gacggcacca gggatctgta cagcgggcac gcggagcgcc gcatctaccc 3120
ggtgctaggt gaagtggcgg tcacagagat gacgccagct ctggtgcgtg cgtggtgggc 3180 ggtgctaggt gaagtggcgg tcacagagat gacgccagct ctggtgcgtg cgtggtgggc 3180
cgggatgggt aggaagcacc cgactgcccg ccggcatgcc tacaacgtcc tccgggcggt 3240 cgggatgggt aggaagcacc cgactgcccg ccggcatgcc tacaacgtcc tccgggcggt 3240
gatgaacaca gcggtcgagg acaagctgat cgcagagaac ccgtgccgga tcgagcagaa 3300 gatgaacaca gcggtcgagg acaagctgat cgcagagaac ccgtgccgga tcgagcagaa 3300
ggcagccgat gagcgcgacg tagaggcgct gacgcctgag gagctggaca tcgtcgccgc 3360 ggcagccgat gagcgcgacg tagaggcgct gacgcctgag gagctggaca tcgtcgccgc 3360
tgagatcttc gagcactacc ggatcgcggc atacatcctg gcgtggacga gcctccggtt 3420 tgagatcttc gagcactacc ggatcgcggc atacatcctg gcgtggacga gcctccggtt 3420
cggagagctg atcgagcttc gccgcaagga catcgtggac gacggcatga cgatgaagct 3480 cggagagctg atcgagcttc gccgcaagga catcgtggac gacggcatga cgatgaagct 3480
ccgggtgcgc cgtggcgctt cccgcgtggg gaacaagatc gtcgttggca acgccaagac 3540 ccgggtgcgc cgtggcgctt cccgcgtggg gaacaagatc gtcgttggca acgccaagac 3540
cgtccggtcg aagcgtcctg tgacggttcc gcctcacgtc gcggagatga tccgagcgca 3600 cgtccggtcg aagcgtcctg tgacggttcc gcctcacgtc gcggagatga tccgagcgca 3600
catgaaggac cgtacgaaga tgaacaaggg ccccgaggca ttcctggtga ccacgacgca 3660 catgaaggac cgtacgaaga tgaacaaggg ccccgaggca ttcctggtga ccacgacgca 3660
gggcaaccgg ctgtcgaagt ccgcgttcac caagtcgctg aagcgtggct acgccaagat 3720 gggcaaccgg ctgtcgaagt ccgcgttcac caagtcgctg aagcgtggct acgccaagat 3720
cggtcggccg gaactccgca tccacgacct ccgcgctgtc ggcgctacgt tcgccgctca 3780 cggtcggccg gaactccgca tccacgacct ccgcgctgtc ggcgctacgt tcgccgctca 3780
ggcaggtgcg acgaccaagg agctgatggc ccgtctcggt cacacgactc ctaggatggc 3840 ggcaggtgcg acgaccaagg agctgatggc ccgtctcggt cacacgactc ctaggatggc 3840
gatgaagtac cagatggcgt ctgaggcccg cgacgaggct atcgctgagg cgatgtccaa 3900 gatgaagtac cagatggcgt ctgaggcccg cgacgaggct atcgctgagg cgatgtccaa 3900
gctggccaag acctcctgaa acgcaaaaag cccccctccc aaggacactg agtcctaaag 3960 gctggccaag acctcctgaa acgcaaaaag cccccctccc aaggacactg agtcctaaag 3960
aggggggttt cttgtcagta cgcgaagaac cacgcctggc cgcgagcgcc agcaccgccg 4020 aggggggttt cttgtcagta cgcgaagaac cacgcctggc cgcgagcgcc agcaccgccg 4020
ctctgtgcgg agacctgggc accagccccg ccgccgccag gagcattgcc gttcccgcca 4080 ctctgtgcgg agacctgggc accagccccg ccgccgccag gagcattgcc gttcccgcca 4080
gctgagttct gttgtgcgcc gcctatgtag agctggtcgt tgtaggtccg atctccaggc 4140 gctgagttct gttgtgcgcc gcctatgtag agctggtcgt tgtaggtccg atctccaggc 4140
gactttccgg cgacgctgag gatgtcgatc acagagcctc cgggaccgcc ggttgcggtc 4200 gactttccgg cgacgctgag gatgtcgatc acagagcctc cgggaccgcc ggttgcggtc 4200
aaacctgacc atccgacagc ggacgccgtg gtgtttcctc cagggcctcc ggccttgcct 4260 aaacctgacc atccgacagc ggacgccgtg gtgtttcctc cagggcctcc ggccttgcct 4260
gagaatacag agccagctcc cgctgcgcct ccagctccga cgagcccggt gatcgtcttg 4320 gagaatacag agccagctcc cgctgcgcct ccagctccga cgagcccggt gatcgtcttg 4320
gtcgacctgc aggcatgcaa gcttgatatc gaattccata tgcccaagct tctcgagact 4380 gtcgacctgc aggcatgcaa gcttgatatc gaattccata tgcccaagct tctcgagact 4380
tgacataatg tcgcttatcg gcttaatcga tctagaccgg ccgtgcggaa ttaagccggc 4440 tgacataatg tcgcttatcg gcttaatcga tctagaccgg ccgtgcggaa ttaagccggc 4440
ccgtaccctg tgaatagagg tccgctgtga cacaagaatc cctgttactt ctcgaccgta 4500 ccgtaccctg tgaatagagg tccgctgtga cacaagaatc cctgttactt ctcgaccgta 4500
ttgattcgga tgattcctac gcgagcctgc ggaacgacca ggagttctgg gagccgctgg 4560 ttgattcgga tgattcctac gcgagcctgc ggaacgacca ggagttctgg gagccgctgg 4560
cccgccgagc cctggaggag ctcgggctgc cggtgccgcc ggtgctgcgg gtgcccggcg 4620 cccgccgagc cctggaggag ctcgggctgc cggtgccgcc ggtgctgcgg gtgcccggcg 4620
agagcaccaa ccccgtactg gtcggcgagc ccggcccggt gatcaagctg ttcggcgagc 4680 agagcaccaa ccccgtactg gtcggcgagc ccggcccggt gatcaagctg ttcggcgagc 4680
actggtgcgg tccggagagc ctcgcgtcgg agtcggaggc gtacgcggtc ctggcggacg 4740 actggtgcgg tccggagagc ctcgcgtcgg agtcggaggc gtacgcggtc ctggcggacg 4740
ccccggttcc ggtgccccgc ctcctcggcc gcggcgagct gcggcccggc accggagcct 4800 ccccggttcc ggtgccccgc ctcctcggcc gcggcgagct gcggcccggc accggagcct 4800
ggccgtggcc ctacctggtg atgagccgga tgaccggcac cacctggcgg tccgcgatgg 4860 ggccgtggcc ctacctggtg atgagccgga tgaccggcac cacctggcgg tccgcgatgg 4860
acggcacgac cgaccggaac gcgctgctcg ccctggcccg cgaactcggc cgggtgctcg 4920 acggcacgac cgaccggaac gcgctgctcg ccctggcccg cgaactcggc cgggtgctcg 4920
gacggctgca cagggtgccg ctgaccggga acaccgtgct caccccccat tccgaggtct 4980 gacggctgca cagggtgccg ctgaccggga acaccgtgct caccccccat tccgaggtct 4980
tcccggaact gctgcgggaa cgccgcgcgg cgaccgtcga ggaccaccgc gggtggggct 5040 tcccggaact gctgcgggaa cgccgcgcgg cgaccgtcga ggaccaccgc gggtggggct 5040
acctctcgcc ccggctgctg gaccgcctgg aggactggct gccggacgtg gacacgctgc 5100 acctctcgcc ccggctgctg gaccgcctgg aggactggct gccggacgtg gacacgctgc 5100
tggccggccg cgaaccccgg ttcgtccacg gcgacctgca cgggaccaac atcttcgtgg 5160 tggccggccg cgaaccccgg ttcgtccacg gcgacctgca cgggaccaac atcttcgtgg 5160
acctggccgc gaccgaggtc accgggatcg tcgacttcac cgacgtctat gcgggagact 5220 acctggccgc gaccgaggtc accgggatcg tcgacttcac cgacgtctat gcgggagact 5220
cccgctacag cctggtgcaa ctgcatctca acgccttccg gggcgaccgc gagatcctgg 5280 cccgctacag cctggtgcaa ctgcatctca acgccttccg gggcgaccgc gagatcctgg 5280
ccgcgctgct cgacggggcg cagtggaagc ggaccgagga cttcgcccgc gaactgctcg 5340 ccgcgctgct cgacggggcg cagtggaagc ggaccgagga cttcgcccgc gaactgctcg 5340
ccttcacctt cctgcacgac ttcgaggtgt tcgaggagac cccgctggat ctctccggct 5400 ccttcacctt cctgcacgac ttcgaggtgt tcgaggagac cccgctggat ctctccggct 5400
tcaccgatcc ggaggaactg gcgcagttcc tctgggggcc gccggacacc gcccccggcg 5460 tcaccgatcc ggaggaactg gcgcagttcc tctgggggcc gccggacacc gcccccggcg 5460
cctgatctag acccgggact tgacataatg tcgcttatcg gcttactcga gaagcttgcc 5520 cctgatctag acccgggact tgacataatg tcgcttatcg gcttactcga gaagcttgcc 5520
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 5580 atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 5580
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 5640 ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 5640
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 5700 ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 5700
ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag 5760 ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag 5760
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 5820 cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 5820
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 5880 ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 5880
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 5940 gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 5940
aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 6000 aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 6000
ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 6060 ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 6060
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 6120 gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 6120
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 6180 tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 6180
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 6240 ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 6240
ggtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 6300 ggtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 6300
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 6360 acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 6360
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 6420 tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 6420
cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt 6480 cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt 6480
ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt 6540 ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt 6540
cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct 6600 cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct 6600
ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgattaggg 6660 ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgattaggg 6660
tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga 6720 tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga 6720
gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc 6780 gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc 6780
ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga 6840 ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga 6840
gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta caatttag 6898 gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta caatttag 6898
<210> 5 <210> 5
<211> 1152 <211> 1152
<212> DNA <212> DNA
<213> Artificial sequence
<400> 5 <400> 5
aagcttctcg agacttgaca taatgtcgct tatcggctta atcgatctag accggccgtg 60 aagcttctcg agacttgaca taatgtcgct tatcggctta atcgatctag accggccgtg 60
cggaattaag ccggcccgta ccctgtgaat agaggtccgc tgtgacacaa gaatccctgt 120 cggaattaag ccggcccgta ccctgtgaat agaggtccgc tgtgacacaa gaatccctgt 120
tacttctcga ccgtattgat tcggatgatt cctacgcgag cctgcggaac gaccaggagt 180 tacttctcga ccgtattgat tcggatgatt cctacgcgag cctgcggaac gaccaggagt 180
tctgggagcc gctggcccgc cgagccctgg aggagctcgg gctgccggtg ccgccggtgc 240 tctgggagcc gctggcccgc cgagccctgg aggagctcgg gctgccggtg ccgccggtgc 240
tgcgggtgcc cggcgagagc accaaccccg tactggtcgg cgagcccggc ccggtgatca 300 tgcgggtgcc cggcgagagc accaaccccg tactggtcgg cgagcccggc ccggtgatca 300
agctgttcgg cgagcactgg tgcggtccgg agagcctcgc gtcggagtcg gaggcgtacg 360 agctgttcgg cgagcactgg tgcggtccgg agagcctcgc gtcggagtcg gaggcgtacg 360
cggtcctggc ggacgccccg gttccggtgc cccgcctcct cggccgcggc gagctgcggc 420 cggtcctggc ggacgccccg gttccggtgc cccgcctcct cggccgcggc gagctgcggc 420
ccggcaccgg agcctggccg tggccctacc tggtgatgag ccggatgacc ggcaccacct 480 ccggcaccgg agcctggccg tggccctacc tggtgatgag ccggatgacc ggcaccacct 480
ggcggtccgc gatggacggc acgaccgacc ggaacgcgct gctcgccctg gcccgcgaac 540 ggcggtccgc gatggacggc acgaccgacc ggaacgcgct gctcgccctg gcccgcgaac 540
tcggccgggt gctcggacgg ctgcacaggg tgccgctgac cgggaacacc gtgctcaccc 600 tcggccgggt gctcggacgg ctgcacaggg tgccgctgac cgggaacacc gtgctcaccc 600
cccattccga ggtcttcccg gaactgctgc gggaacgccg cgcggcgacc gtcgaggacc 660 cccattccga ggtcttcccg gaactgctgc gggaacgccg cgcggcgacc gtcgaggacc 660
accgcgggtg gggctacctc tcgccccggc tgctggaccg cctggaggac tggctgccgg 720 accgcgggtg gggctacctc tcgccccggc tgctggaccg cctggaggac tggctgccgg 720
acgtggacac gctgctggcc ggccgcgaac cccggttcgt ccacggcgac ctgcacggga 780 acgtggacac gctgctggcc ggccgcgaac cccggttcgt ccacggcgac ctgcacggga 780
ccaacatctt cgtggacctg gccgcgaccg aggtcaccgg gatcgtcgac ttcaccgacg 840 ccaacatctt cgtggacctg gccgcgaccg aggtcaccgg gatcgtcgac ttcaccgacg 840
tctatgcggg agactcccgc tacagcctgg tgcaactgca tctcaacgcc ttccggggcg 900 tctatgcggg agactcccgc tacagcctgg tgcaactgca tctcaacgcc ttccggggcg 900
accgcgagat cctggccgcg ctgctcgacg gggcgcagtg gaagcggacc gaggacttcg 960 accgcgagat cctggccgcg ctgctcgacg gggcgcagtg gaagcggacc gaggacttcg 960
cccgcgaact gctcgccttc accttcctgc acgacttcga ggtgttcgag gagaccccgc 1020 cccgcgaact gctcgccttc accttcctgc acgacttcga ggtgttcgag gagaccccgc 1020
tggatctctc cggcttcacc gatccggagg aactggcgca gttcctctgg gggccgccgg 1080 tggatctctc cggcttcacc gatccggagg aactggcgca gttcctctgg gggccgccgg 1080
acaccgcccc cggcgcctga tctagacccg ggacttgaca taatgtcgct tatcggctta 1140 acaccgcccc cggcgcctga tctagacccg ggacttgaca taatgtcgct tatcggctta 1140
ctcgagaagc tt 1152 ctcgagaagc tt 1152
<210> 6 <210> 6
<211> 1226 <211> 1226
<212> DNA <212> DNA
<213> Artificial sequence
<400> 6 <400> 6
gcggccgctc tagaactagt ggatccgata tcgaattcca tatgcccaag cttctcgaga 60 gcggccgctc tagaactagt ggatccgata tcgaattcca tatgcccaag cttctcgaga 60
cttgacataa tgtcgcttat cggcttaatc gatctagacc ggccgtgcgg aattaagccg 120 cttgacataa tgtcgcttat cggcttaatc gatctagacc ggccgtgcgg aattaagccg 120
gcccgtaccc tgtgaataga ggtccgctgt gacacaagaa tccctgttac ttctcgaccg 180 gcccgtaccc tgtgaataga ggtccgctgt gacacaagaa tccctgttac ttctcgaccg 180
tattgattcg gatgattcct acgcgagcct gcggaacgac caggagttct gggagccgct 240 tattgattcg gatgattcct acgcgagcct gcggaacgac caggagttct gggagccgct 240
ggcccgccga gccctggagg agctcgggct gccggtgccg ccggtgctgc gggtgcccgg 300 ggcccgccga gccctggagg agctcgggct gccggtgccg ccggtgctgc gggtgcccgg 300
cgagagcacc aaccccgtac tggtcggcga gcccggcccg gtgatcaagc tgttcggcga 360 cgagagcacc aaccccgtac tggtcggcga gcccggcccg gtgatcaagc tgttcggcga 360
gcactggtgc ggtccggaga gcctcgcgtc ggagtcggag gcgtacgcgg tcctggcgga 420 gcactggtgc ggtccggaga gcctcgcgtc ggagtcggag gcgtacgcgg tcctggcgga 420
cgccccggtt ccggtgcccc gcctcctcgg ccgcggcgag ctgcggcccg gcaccggagc 480 cgccccggtt ccggtgcccc gcctcctcgg ccgcggcgag ctgcggcccg gcaccggagc 480
ctggccgtgg ccctacctgg tgatgagccg gatgaccggc accacctggc ggtccgcgat 540 ctggccgtgg ccctacctgg tgatgagccg gatgaccggc accacctggc ggtccgcgat 540
ggacggcacg accgaccgga acgcgctgct cgccctggcc cgcgaactcg gccgggtgct 600 ggacggcacg accgaccgga acgcgctgct cgccctggcc cgcgaactcg gccgggtgct 600
cggacggctg cacagggtgc cgctgaccgg gaacaccgtg ctcacccccc attccgaggt 660 cggacggctg cacagggtgc cgctgaccgg gaacaccgtg ctcacccccc attccgaggt 660
cttcccggaa ctgctgcggg aacgccgcgc ggcgaccgtc gaggaccacc gcgggtgggg 720 cttcccggaa ctgctgcggg aacgccgcgc ggcgaccgtc gaggaccacc gcgggtgggg 720
ctacctctcg ccccggctgc tggaccgcct ggaggactgg ctgccggacg tggacacgct 780 ctacctctcg ccccggctgc tggaccgcct ggaggactgg ctgccggacg tggacacgct 780
gctggccggc cgcgaacccc ggttcgtcca cggcgacctg cacgggacca acatcttcgt 840 gctggccggc cgcgaacccc ggttcgtcca cggcgacctg cacgggacca acatcttcgt 840
ggacctggcc gcgaccgagg tcaccgggat cgtcgacttc accgacgtct atgcgggaga 900 ggacctggcc gcgaccgagg tcaccgggat cgtcgacttc accgacgtct atgcgggaga 900
ctcccgctac agcctggtgc aactgcatct caacgccttc cggggcgacc gcgagatcct 960 ctcccgctac agcctggtgc aactgcatct caacgccttc cggggcgacc gcgagatcct 960
ggccgcgctg ctcgacgggg cgcagtggaa gcggaccgag gacttcgccc gcgaactgct 1020 ggccgcgctg ctcgacgggg cgcagtggaa gcggaccgag gacttcgccc gcgaactgct 1020
cgccttcacc ttcctgcacg acttcgaggt gttcgaggag accccgctgg atctctccgg 1080 cgccttcacc ttcctgcacg acttcgaggt gttcgaggag accccgctgg atctctccgg 1080
cttcaccgat ccggaggaac tggcgcagtt cctctggggg ccgccggaca ccgcccccgg 1140 cttcaccgat ccggaggaac tggcgcagtt cctctggggg ccgccggaca ccgcccccgg 1140
cgcctgatct agacccggga cttgacataa tgtcgcttat cggcttactc gagaagcttg 1200 cgcctgatct agacccggga cttgacataa tgtcgcttat cggcttactc gagaagcttg 1200
gccatggctg cagcgctagc ggtacc 1226 gccatggctg cagcgctagc ggtacc 1226
<210> 7 <210> 7
<211> 51 <211> 51
<212> DNA <212> DNA
<213> Artificial sequence
<400> 7 <400> 7
agcttctcga gtaagccgat aagcgacatt atgtcaagtc ccgggtctag a 51 agcttctcga gtaagccgat aagcgacatt atgtcaagtc ccgggtctag a 51
<210> 8 <210> 8
<211> 51 <211> 51
<212> DNA <212> DNA
<213> Artificial sequence
<400> 8 <400> 8
atcgattaag ccgataagcg acattatgtc aagtctcgag aagcttggta c 51 atcgattaag ccgataagcg acattatgtc aagtctcgag aagcttggta c 51
<210> 9 <210> 9
<211> 26 <211> 26
<212> DNA <212> DNA
<213> Artificial sequence
<400> 9 <400> 9
tgtctagacc ggccgtgcgg aattaa 26 tgtctagacc ggccgtgcgg aattaa 26
<210> 10 <210> 10
<211> 29 <211> 29
<212> DNA <212> DNA
<213> Artificial sequence
<400> 10 <400> 10
attctagatc aggcgccggg ggcggtgtc 29 attctagatc aggcgccggg ggcggtgtc 29
<210> 11 <210> 11
<211> 45 <211> 45
<212> DNA <212> DNA
<213> Artificial sequence
<400> 11 <400> 11
gctggtaccg ctagcgctgc agccatggca agcttctcga gtaag 45 gctggtaccg ctagcgctgc agccatggca agcttctcga gtaag 45
<210> 12 <210> 12
<211> 45 <211> 45
<212> DNA <212> DNA
<213> Artificial sequence
<400> 12 <400> 12
gcaggatccg atatcgaatt ccatatgccc aagcttctcg agact 45 gcaggatccg atatcgaatt ccatatgccc aagcttctcg agact 45
<210> 13 <210> 13
<211> 24 <211> 24
<212> DNA <212> DNA
<213> Artificial sequence
<400> 13 <400> 13
ggtccatggt gagcaagggc gagg 24 ggtccatggt gagcaagggc gagg 24
<210> 14 <210> 14
<211> 32 <211> 32
<212> DNA <212> DNA
<213> Artificial sequence
<400> 14 <400> 14
gcgggtacct tacttgtaca gctcgtccat gc 32 gcgggtacct tacttgtaca gctcgtccat gc 32
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310386264.0A CN103451181B (en) | 2013-06-06 | 2013-08-29 | A kind of resistance expression's box for efficiently building non-resistant mark recombinant mycobacterium |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310224591.6 | 2013-06-06 | ||
CN2013102245916 | 2013-06-06 | ||
CN 201310224591 CN103320433A (en) | 2013-06-06 | 2013-06-06 | Resistance expression cassette used for highly efficiently constructing recombinant mycobacteria with no resistance marker |
CN201310386264.0A CN103451181B (en) | 2013-06-06 | 2013-08-29 | A kind of resistance expression's box for efficiently building non-resistant mark recombinant mycobacterium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103451181A (en) | 2013-12-18 |
CN103451181B (en) | 2015-09-23 |
Family
ID=49189477
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201310224591 Withdrawn CN103320433A (en) | 2013-06-06 | 2013-06-06 | Resistance expression cassette used for highly efficiently constructing recombinant mycobacteria with no resistance marker |
CN201310386264.0A Expired - Fee Related CN103451181B (en) | 2013-06-06 | 2013-08-29 | A kind of resistance expression's box for efficiently building non-resistant mark recombinant mycobacterium |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201310224591 Withdrawn CN103320433A (en) | 2013-06-06 | 2013-06-06 | Resistance expression cassette used for highly efficiently constructing recombinant mycobacteria with no resistance marker |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN103320433A (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106636167A (en) * | 2016-10-17 | 2017-05-10 | Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences | Thiostrepton-gentamicin resistance gene system as well as resistance expression box and recombinant plasmid containing same |
CN111378679B (en) * | 2020-03-20 | 2023-10-03 | Suzhou Genewiz Biotechnology Co., Ltd. | Gene expression assembly, cloning vector constructed by same and application of cloning vector |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PL1947926T3 (en) * | 2005-10-28 | 2015-08-31 | Dow Agrosciences Llc | Novel herbicide resistance genes |
US20110047635A1 (en) * | 2006-08-28 | 2011-02-24 | University of Hawaii | Methods and compositions for transposon-mediated transgenesis |
HUE026199T2 (en) * | 2006-12-07 | 2016-05-30 | Dow Agrosciences Llc | Novel selectable marker genes |
JP4547643B1 (en) * | 2009-06-05 | 2010-09-22 | 東洋紡績株式会社 | Expression vector optimized for cloning |
Also Published As
Publication number | Publication date |
---|---|
CN103451181A (en) | 2013-12-18 |
CN103320433A (en) | 2013-09-25 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
AU2020264412B2 (en) | DNA-binding protein using PPR motif, and use thereof | |
DK2670846T3 (en) | METHODS FOR THE DEVELOPMENT OF TERPEN SYNTHASE VARIETIES | |
DK2663645T3 (en) | Yeast strains modified for the production of ETHANOL FROM GLYCEROL | |
KR20140146616A (en) | Acetate supplemention of medium for butanologens | |
IL236992A (en) | Genetically modified cyanobacteria producing ethanol | |
US20030211597A1 (en) | Expression of core-glycosylated HCV envelope proteins in yeast | |
CN111465689B (en) | CAS9 variants and methods of use | |
CN112501269B (en) | A method for rapid identification of high-affinity TCR antigen cross-reactivity | |
KR20220134001A (en) | Method for the preparation of closed linear DNA | |
JP2024037797A (en) | Use of infectious nucleic acids to treat cancer | |
KR20210105382A (en) | RNA encoding protein | |
CN103451181B (en) | A kind of resistance expression's box for efficiently building non-resistant mark recombinant mycobacterium | |
Gust et al. | PCR targeting system in Streptomyces coelicolor A3 (2) | |
KR102055215B1 (en) | Porcine circovirus type 2 capsid protein and method of preparing a pharmaceutical composition comprising the same | |
US6803230B2 (en) | Phagemid vectors | |
CN111315212B (en) | Genome edited birds | |
CN114836461B (en) | Recombinant plasmid expressing collagenase, yeast strain, fermentation medium and fermentation culture method thereof | |
KR20160012153A (en) | Polypeptides with permease activity | |
KR102416059B1 (en) | Over-expression of a fatty acid transporter gene and of genes encoding enzymes of the beta-oxidation pathway for higher production of riboflavin via fermentation of eremothecium | |
CN109652325B (en) | Saccharomyces cerevisiae industrial strain for delta integration and secretory expression of cellulase and application | |
WO2020043869A2 (en) | Methods and compositions for producing a virus | |
CN116457465A (en) | Methods and compositions for genome modification | |
KR20230169221A (en) | Non-viral homology-mediated end joining | |
KR20230112625A (en) | Compositions and methods for vaccination against Neisseria gonorrhea | |
KR102794718B1 (en) | Recombinant vector comprising hybrid signal sequence, and secretary preparation method of human insulin-like growth factor-1 using the same |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 2015-09-23; termination date: 2017-08-29 |