CN114836461B - Recombinant plasmid expressing collagenase, yeast strain, fermentation medium and fermentation culture method thereof - Google Patents
- Publication number
- CN114836461B (application CN202210605225.4A)
- Authority
- CN
- China
- Prior art keywords
- collagenase
- leader
- sequence
- saccharomyces cerevisiae
- fermentation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/14—Fungi; Culture media therefor
- C12N1/16—Yeasts; Culture media therefor
- C12N1/18—Baker's yeast; Brewer's yeast
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/06—Preparation of peptides or proteins produced by the hydrolysis of a peptide bond, e.g. hydrolysate products
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/24—Metalloendopeptidases (3.4.24)
- C12Y304/24003—Microbial collagenase (3.4.24.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/645—Fungi ; Processes using fungi
- C12R2001/85—Saccharomyces
- C12R2001/865—Saccharomyces cerevisiae
Abstract
The invention discloses a recombinant plasmid expressing collagenase, a yeast strain, and its fermentation medium, fermentation culture method and use. The recombinant plasmid is built by inserting the different modules of an expression cassette into an expression plasmid: the α-factor signal peptide, the ColG or ColH pro peptide, and the functional region. Transforming the recombinant plasmid into a starting yeast strain yields Saccharomyces cerevisiae that secretes collagenase. The invention thus constructs an engineered Saccharomyces cerevisiae strain that secretes collagenase, and improves its secretion capacity by optimizing the composition of the yeast culture medium and the signal peptide element in the collagenase sequence. The resulting recombinant collagenase effectively degrades gelatin and collagen, which is of practical significance for the application of collagenase and the efficient use of collagen resources.
Description
Technical Field
The invention belongs to the field of biotechnology and specifically relates to a recombinant plasmid expressing collagenase, a yeast strain, and its fermentation medium, fermentation culture method and use.
Background Art
The processing of livestock, poultry and aquatic products generates large quantities of by-products. The horns, hooves, skins, bones and scales of these animals are rich in collagen.
Enzymatic extraction is an important way to improve the utilization of these by-products. Compared with non-biological processes such as acid or alkali treatment, it yields higher-quality bioactive extracts and is more environmentally friendly. Because the triple-helical structure of collagen is mechanically strong, ordinary proteases cannot readily hydrolyze it directly, and chemical treatment or high temperature is usually needed to assist hydrolysis.
The enzymes currently used in industry to prepare collagen oligopeptides are mainly papain, alkaline protease and trypsin, but their specificity for collagen is poor.
Collagenase is a metalloprotease specific for collagen. It hydrolyzes collagen under physiological conditions, which makes bioactive collagen peptides easier to obtain and improves both the degree and the quality of collagen hydrolysis under mild processing conditions.
Collagenase from Clostridium histolyticum is the most widely studied. It is currently the accepted reference enzyme for identifying and characterizing new collagenases, and it is also a commercially available microbial collagenase. However, it is mainly purified from the fermentation supernatant of Clostridium histolyticum, whose composition is complex; the protein purification steps are laborious, and the activity of collagenase preparations varies considerably between batches. In addition, Clostridium histolyticum is a pathogen, so the use of its secreted collagenase is restricted in fields with high safety requirements, such as food.
Summary of the Invention
The purpose of the invention is to provide a recombinant plasmid expressing collagenase, a yeast strain, and its fermentation medium, fermentation culture method and use, in order to address the limited range of existing collagenase sources. Introducing the collagenase gene into Saccharomyces cerevisiae cells and optimizing the composition of the yeast fermentation medium achieves secretory expression of collagenase in a food-grade safe host. Optimizing the signal peptide element in the collagenase sequence further raises the level of collagenase secretion. Through this optimization of the medium and the signal peptide element, the invention obtains a Saccharomyces cerevisiae strain that efficiently secretes collagenase. The resulting recombinant collagenase can hydrolyze gelatin and collagen under physiological conditions, and the fermentation supernatant containing it can be used directly to process substrates.
The purpose of the invention is achieved by the following technical solutions:
A recombinant plasmid expressing collagenase is obtained by inserting into an expression plasmid either (a) the α-factor leader sequence (leader) together with the pro peptide sequence (pro peptide) and enzyme functional region sequence (Chain) of Clostridium histolyticum collagenase ColG, or (b) the α-factor leader sequence (leader) and spacer sequence (spacer) together with the pro peptide sequence (pro peptide) and enzyme functional region sequence (Chain) of Clostridium histolyticum collagenase ColH;
The α-factor leader sequence (leader) is shown in SEQ ID No.1;
The ColG pro peptide and enzyme functional region sequence (pro peptide-Chain) is shown in SEQ ID No.2;
The α-factor leader and spacer sequence (leader-spacer) is shown in SEQ ID No.3;
The ColH pro peptide and enzyme functional region sequence (pro peptide-Chain) is shown in SEQ ID No.4;
The collagenase expression plasmid constructed in the invention does not rely solely on the native collagenase signal peptide to mediate secretion. Instead, collagenase secretion yields were compared across different combinations of expression cassette modules (signal peptide, pro peptide, functional region), and the optimized combination achieves efficient secretion in Saccharomyces cerevisiae cells.
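The two module combinations described above can be written down as a small data structure. The following Python sketch is illustrative only: the class and function names are hypothetical, and the upstream-to-downstream order (α-factor element first, then the collagenase pro peptide-Chain) is an assumption based on the usual layout of a secretion cassette, not something the text states explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    name: str    # role of the module in the expression cassette
    seq_id: str  # sequence label as given in the patent text


# Assumed order: secretion element first, then the collagenase sequence.
CASSETTES = {
    "ColG": (
        Module("alpha-factor leader", "SEQ ID No.1"),
        Module("ColG pro peptide-Chain", "SEQ ID No.2"),
    ),
    "ColH": (
        Module("alpha-factor leader-spacer", "SEQ ID No.3"),
        Module("ColH pro peptide-Chain", "SEQ ID No.4"),
    ),
}


def describe(enzyme: str) -> str:
    """Return a one-line summary of the cassette for 'ColG' or 'ColH'."""
    return " -> ".join(f"{m.name} ({m.seq_id})" for m in CASSETTES[enzyme])
```

Calling `describe("ColH")` makes the one structural difference between the two constructs visible: the ColH cassette carries the spacer after the α-factor leader, while the ColG cassette does not.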
The expression plasmid is CPOTud, p426GPD, p426TEF, p426ADH, p426CYC, p416GPD, p4216TEF, p4126ADH, p416CYC, p425GPD, p425TEF, p425ADH, p425CYC, p415GPD, p4215TEF, p4125ADH, p415CYC, p424GPD, p424TEF, p424ADH, p424CYC, p414GPD, p4214TEF, p4124ADH, p414CYC, p423GPD, p423TEF, p423ADH, p423CYC, p413GPD, p4213TEF, p4123ADH, p413CYC, pSP-GM1, pRS303, pRS304, pRS305, pRS306, pRS413, pRS414, pRS415, pRS416, pRS423, pRS424, pRS425 or pRS426.
A Saccharomyces cerevisiae strain that secretes collagenase is obtained by transforming the above recombinant plasmid into a starting yeast strain;
The starting yeast strain is Saccharomyces cerevisiae B184M, CEN.PK 530.1C, CEN.PK 113.5D, BY4742, BY4741, CEN.PK2-1D, CEN.PK2-1C or IMX581.
A fermentation medium suitable for the above Saccharomyces cerevisiae is YPD medium supplemented with Ca²⁺ and/or Zn²⁺;
The Ca²⁺ concentration is 10-25 mM, a range over which collagenase is secreted at relatively high levels; because higher metal ion concentrations may cause precipitation in the fermentation broth, 10 mM is preferred;
The Zn²⁺ concentration is 0.4-1 mM, preferably 0.6 mM.
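The stated concentration windows can be captured in a simple lookup, which is handy when planning a supplementation series. This is a minimal sketch: the dictionary and function names are hypothetical, and only the numeric ranges and preferred values come from the text.

```python
# Concentration ranges (mM) and preferred values as stated in the text.
ION_RANGES_MM = {
    "Ca2+": {"min": 10.0, "max": 25.0, "preferred": 10.0},
    "Zn2+": {"min": 0.4, "max": 1.0, "preferred": 0.6},
}


def check_supplement(ion: str, conc_mm: float) -> bool:
    """Return True if conc_mm lies inside the stated range for the ion."""
    limits = ION_RANGES_MM[ion]
    return limits["min"] <= conc_mm <= limits["max"]
```

For example, `check_supplement("Zn2+", 1.5)` returns `False`, since 1.5 mM lies above the 1 mM upper bound given for Zn²⁺.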
In the fermentation culture method for the above Saccharomyces cerevisiae, the strain is inoculated into the above fermentation medium and cultured;
The culture is preferably carried out at 30-37 °C for at least 96 h, after which the recombinant collagenase can be collected.
The recombinant collagenase produced by fermentation of the Saccharomyces cerevisiae of the invention effectively degrades gelatin and collagen, so the above recombinant plasmid and Saccharomyces cerevisiae can be used to produce collagenase and to degrade gelatin and collagen.
Compared with the prior art, the invention has the following advantages and effects:
The invention constructs an engineered Saccharomyces cerevisiae strain that secretes collagenase, and improves its secretion capacity by optimizing the composition of the yeast culture medium and the signal peptide element in the collagenase sequence. The resulting recombinant collagenase effectively degrades gelatin and collagen, which is of practical significance for the application of collagenase and the efficient use of collagen resources.
Brief Description of the Drawings
Figure 1 shows maps of the collagenase expression plasmids pCP_G01 and pCP_H01.
Figure 2 shows enzyme activity and OD600 values at different Ca²⁺ concentrations.
Figure 3 shows enzyme activity and OD600 values at different Zn²⁺ concentrations.
Figure 4 shows enzyme activity and OD600 values at different Zn²⁺ concentrations.
Figure 5 is a schematic of the plasmid construction.
Figure 6 shows enzyme activity and OD600 values of different strains.
Figure 7 shows the results of plate hydrolysis experiments with different strains.
Figure 8 shows enzyme activity and OD600 values of different strains.
Figure 9 shows the results of plate hydrolysis experiments with different strains.
Figure 10 shows the degradation of gelatin by collagenase.
Figure 11 shows the degradation of collagen by collagenase.
Figure 12 shows the morphology of collagen after 24 h of collagenase treatment.
Figure 13 shows the yield of collagenase secreted by the strains.
具体实施方式Detailed ways
The present invention is described in further detail below with reference to the examples and drawings, but the embodiments of the invention are not limited thereto.
Unless otherwise specified, the materials, reagents, etc. used in the following examples are commercially available.
Experimental methods for which no specific conditions are given in the following examples are generally carried out under conventional conditions, such as those described in "Molecular Cloning Experiment Guide" (Beijing: Science Press, 2017) and "Yeast Genetics Methods Experiment Guide" (Beijing: Science Press, 2016).
To aid understanding of the invention, the specific examples below use Saccharomyces cerevisiae B184M (PNAS, 2015, 112: E4689–E4696) as the starting strain and the plasmid CPOTud (Biotechnology and Bioengineering, 2012, 109(5): 1259-1268) as the expression plasmid.
The culture media used in the following examples are as follows:
LB medium: 10 g/L peptone, 5 g/L yeast extract, 10 g/L NaCl;
LB/Amp medium: 10 g/L peptone, 5 g/L yeast extract, 10 g/L NaCl; after sterilization, cool to about 50°C and add 100 μg/mL ampicillin (filter-sterilized);
YPD medium: 20 g/L peptone, 10 g/L yeast extract, 20 g/L glucose (sterilized separately, then added);
Modified YPD medium: 20 g/L peptone, 10 g/L yeast extract, 20 g/L glucose (sterilized separately, then added), 10 mM CaCl2 and 0.6 mM ZnCl2;
YPE medium: 20 g/L peptone, 10 g/L yeast extract, 0.5 g/L glucose (sterilized separately, then added), 10 mL/L absolute ethanol (filter-sterilized);
Solid media are prepared by adding 20 g/L agar powder to the corresponding liquid medium.
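As a rough aid for preparing the modified YPD medium, the molar supplement concentrations above can be converted to weighed masses. The sketch below assumes the anhydrous salts (the source does not state which hydrate form was used) and uses standard molar masses; the function and variable names are illustrative only.

```python
# Molar masses in g/mol for the ANHYDROUS salts (assumption: the source
# does not say whether anhydrous or hydrated CaCl2/ZnCl2 was used).
MOLAR_MASS_G_PER_MOL = {"CaCl2": 110.98, "ZnCl2": 136.29}

# Supplement concentrations of the modified YPD medium, in mM (from the source).
SUPPLEMENT_MM = {"CaCl2": 10.0, "ZnCl2": 0.6}


def grams_needed(salt: str, conc_mM: float, volume_L: float) -> float:
    """Mass of salt (g) required to reach conc_mM in volume_L of medium."""
    moles = conc_mM / 1000.0 * volume_L
    return moles * MOLAR_MASS_G_PER_MOL[salt]


def supplement_masses(volume_L: float = 1.0) -> dict:
    """Masses (g) of both supplements for the given medium volume."""
    return {
        salt: grams_needed(salt, conc, volume_L)
        for salt, conc in SUPPLEMENT_MM.items()
    }


if __name__ == "__main__":
    for salt, grams in supplement_masses(1.0).items():
        print(f"{salt}: {grams:.4f} g per litre")
```

If a hydrated salt is used instead, only the molar mass entry needs to change.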
In the following examples, collagenase activity is assayed as follows:
(1) Yeast strains carrying a collagenase expression plasmid (BG01, BH01) are activated on YPD plates; a single colony is picked with a 1 μL inoculation loop, transferred to 2.5 mL of medium, and cultured at 30°C and 200 rpm for 96 h.
(2) Determination of biomass. The culture is diluted by a suitable factor and its absorbance at 600 nm is measured to give the OD600 value.
(3) Collagenase activity is determined by the ninhydrin method (Anal Biochem, 2013, 437: 46-48). The culture is centrifuged to obtain the fermentation supernatant. To 20 μL of supernatant, add 480 μL of a 2 mg/mL gelatin solution prepared in reaction buffer (50 mM Tris·Cl, 5 mM CaCl2, 1 μM ZnCl2, pH 7.5), react at 37°C for 30 min, then add 500 μL of stop solution (12% (w/v) PEG6000, 25 mM EDTA) to terminate the reaction. Mix 10 μL of the reaction mixture with 90 μL of deionized water, add 500 μL of ninhydrin color reagent (equal volumes of 200 mM citrate buffer, pH 5.0, containing 0.16% SnCl2, and a 50 g/L ninhydrin-DMSO solution, freshly prepared before use), heat in an 80°C water bath for 10 min, cool, add 600 μL of deionized water, mix well, and measure the absorbance at 570 nm. One unit (U) of enzyme activity is defined as the release, from collagen, of the equivalent of 1 μg of glycine per minute per 100 μL of enzyme solution at 37°C and pH 7.5.
(4) Qualitative characterization of enzyme activity by the plate clear-zone method. The solid medium is supplemented with 1% skimmed milk powder. Strains are cultured on the plate at 30°C for 4 days and then transferred to 37°C for 2-3 days; the size of the hydrolysis zone formed around each colony is examined and photographed.
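The arithmetic implied by steps (2) and (3) can be sketched as below. This is one reading of the unit definition, not the authors' own calculation: it assumes the glycine-equivalent content of the 10 μL aliquot has already been obtained from a glycine standard curve (which the source does not describe), scales it to the full 1000 μL stopped reaction, divides by the 30 min reaction time, and normalizes from the 20 μL of supernatant used to 100 μL of enzyme solution. All function and parameter names are hypothetical.

```python
def od600(reading: float, dilution_factor: float) -> float:
    """OD600 of the undiluted culture from a reading on a diluted sample."""
    return reading * dilution_factor


def collagenase_units(
    glycine_ug_in_aliquot: float,
    reaction_total_uL: float = 1000.0,  # 20 uL supernatant + 480 uL gelatin + 500 uL stop solution
    aliquot_uL: float = 10.0,           # volume taken into the ninhydrin reaction
    supernatant_uL: float = 20.0,       # enzyme solution used per reaction
    minutes: float = 30.0,              # reaction time at 37 C
) -> float:
    """Activity in U, i.e. ug glycine equivalents per minute per 100 uL enzyme.

    glycine_ug_in_aliquot is assumed to come from a glycine standard curve
    read at 570 nm, after subtracting an appropriate blank.
    """
    total_glycine_ug = glycine_ug_in_aliquot * reaction_total_uL / aliquot_uL
    ug_per_min = total_glycine_ug / minutes
    return ug_per_min * 100.0 / supernatant_uL


if __name__ == "__main__":
    # Purely illustrative input value, not a measurement from the source.
    print(collagenase_units(0.6))
```

Keeping the volumes as parameters makes it easy to see how the result changes if the assay is scaled differently.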
The primer sequences used in the following examples are shown in Table 1:
Table 1: Primer sequences
# Underlining marks restriction sites
Example 1. Acquisition of the genes encoding Clostridium histolyticum collagenases
Based on the amino acid sequences of the Clostridium histolyticum collagenases ColG (Q9X721) and ColH (Q46085) published in the UniProt database, DNA coding sequences codon-optimized for Saccharomyces cerevisiae were synthesized by Nanjing GenScript Biotechnology Co., Ltd., yielding plasmids pUC57-Mini_G (SEQ ID No. 5) and pUC57-Mini_H (SEQ ID No. 6), each containing pre peptide-pro peptide-chain.
Example 2. Construction of engineered Saccharomyces cerevisiae strains expressing collagenases ColG and ColH
(1) CPOTud was digested with Kpn I and Nhe I and the vector backbone was recovered.
(2) Using primer pairs AF/ARproG and AF/ARproH with pAlphaAmyCPOT (Biotechnology and Bioengineering, 2012, 109: 1259-1268) as the template, the signal peptide (α-factor leader+spacer) (SEQ ID No. 3) carrying G or H homology arms was amplified. Using primer pairs proGF/GR and proHF/HR with pUC57-Mini_G and pUC57-Mini_H as templates, the ColG and ColH fragments (SEQ ID No. 2 and SEQ ID No. 4) were amplified. The corresponding fragments (ColG or ColH and its signal peptide) were fused by PCR using primers AF/GR or AF/HR to give inserts G01 and H01.
(3) The fused fragments were digested with Kpn I and Nhe I, recovered, and ligated to the vector with T4 DNA Ligase to give plasmids pCP_G01 (SEQ ID No. 7) and pCP_H01 (SEQ ID No. 8).
(4) The ligation products were transformed into competent E. coli cells by a conventional procedure. After the plates were incubated overnight at 37°C, colonies were picked into PCR mixtures containing identification primers G1/G2 or H1/H2 for colony PCR.
(5) Correct clones were inoculated into LB/Amp medium and cultured at 37°C and 250 rpm for 12-16 h; plasmids were extracted, checked by restriction digestion, and sequenced to confirm that plasmids pCP_G01 (SEQ ID No. 7) and pCP_H01 (SEQ ID No. 8) had been constructed successfully. The ColG expression cassette in plasmid pCP_G01 is α-factor leader+spacer-colG pro peptide-colG chain; the ColH expression cassette in plasmid pCP_H01 is α-factor leader+spacer-colH pro peptide-colH chain (maps shown in Figure 1).
(6) Plasmids pCP_G01 and pCP_H01 and the empty control plasmid CPOTud were transformed into Saccharomyces cerevisiae B184M by the conventional lithium acetate method, giving strains BG01, BH01 and B0.
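To make the difference between the two secretion signals concrete, the sketch below translates the final codons of SEQ ID No. 3 (α-factor leader+spacer, used in this example) and of SEQ ID No. 1 (α-factor leader) with the standard genetic code. The two nucleotide strings are copied from the last line of each entry in the sequence listing; the helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third base varying fastest).
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace and case are ignored."""
    seq = "".join(dna.split()).lower()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Last 27 nt of SEQ ID No. 1 and last 33 nt of SEQ ID No. 3.
SEQ1_TAIL = "tctttggata aaagagaggc tgaagct"
SEQ3_TAIL = "tctttggata aaagagaaga aggtgaacca aaa"

if __name__ == "__main__":
    print("SEQ ID No. 1 tail:", translate(SEQ1_TAIL))
    print("SEQ ID No. 3 tail:", translate(SEQ3_TAIL))
```

Both tails share the residues up to Lys-Arg and differ only in the residues that follow, which is where the leader and leader+spacer variants diverge.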
Example 3. Optimization of culture conditions for the collagenase-producing strains
Because ColG and ColH are metalloproteases, they require Ca2+ and Zn2+ as metal cofactors.
Medium optimization was carried out with strain BG01 as the representative. Conventional YPD medium was supplemented separately with Ca2+ (CaCl2) at 2.5 mM, 5 mM, 10 mM, 15 mM, 20 mM or 25 mM, or with Zn2+ (ZnCl2) at 0.1 mM, 0.2 mM, 0.4 mM, 0.6 mM, 0.8 mM or 1 mM; after 96 h of fermentation the enzyme activity was measured. As shown in Figures 2 and 3, the single-factor experiments indicate that adding Ca2+ or Zn2+ significantly increases collagenase production. High enzyme activity was reached at all Ca2+ concentrations between 10 mM and 25 mM. To avoid possible precipitation at higher Ca2+ concentrations, 10 mM Ca2+ was chosen as the supplement concentration. For Zn2+, 0.6 mM was chosen.
Based on the ColG optimization results, YPD medium containing 10 mM Ca2+ was further supplemented with Zn2+ (ZnCl2) at 0.2 mM, 0.4 mM, 0.6 mM, 0.8 mM or 1 mM, inoculated with strain BH01, and fermented for 96 h before the enzyme activity was measured. As shown in Figure 4, enzyme activity was highest at 0.6 mM Zn2+.
In summary, 10 mM Ca2+ and 0.6 mM Zn2+ are the optimal metal ion supplement concentrations.
Example 4. Optimization of element matching and assembly in the Saccharomyces cerevisiae expression cassette
The construction framework for the series of element-matching plasmids is shown in Figure 5. The construction procedure is the same as in Example 2, with the corresponding primer pairs used to amplify and fuse the different fragments.
Using primer pair AF/ARWproG with pAlphaAmyCPOT as the template, the signal peptide (α-factor leader) (SEQ ID No. 1) carrying a G homology arm was amplified; using primer pair WproGF/GR with pUC57-Mini_G as the template, the ColG fragment was amplified; the two fragments (ColG and its signal peptide) were fused by PCR using primers AF/GR to give insert G02.
Using primer pair PproG/GR with pUC57-Mini_G as the template, ColG carrying an α-factor pre homology arm was amplified; the resulting fragment was amplified with primer pair AP/GR to give insert G03.
Using primer pair AF/ARG with pAlphaAmyCPOT as the template, the signal peptide (α-factor leader+spacer) carrying a G homology arm was amplified; using primer pair AGF/GR with pUC57-Mini_G as the template, the ColG fragment was amplified; the two fragments (ColG and its signal peptide) were fused by PCR using primers AF/GR to give insert G04.
Using primer pair AF/ARWG with pAlphaAmyCPOT as the template, the signal peptide (α-factor leader, consistent with the α-factor leader-colG chain cassette listed for pCP_G05) carrying a G homology arm was amplified; using primer pair WGF/GR with pUC57-Mini_G as the template, the ColG fragment was amplified; the two fragments (ColG and its signal peptide) were fused by PCR using primers AF/GR to give insert G05.
Using primer pair PG/GR with pUC57-Mini_G as the template, ColG carrying an α-factor pre homology arm was amplified; the resulting fragment was amplified with primer pair AP/GR to give insert G06.
Using primer pair GF/GR with pUC57-Mini_G as the template, insert G07 was amplified.
Using primer pair AF/ARWproH with pAlphaAmyCPOT as the template, the signal peptide (α-factor leader) carrying an H homology arm was amplified; using primer pair WproHF/HR with pUC57-Mini_H as the template, the ColH fragment was amplified; the two fragments (ColH and its signal peptide) were fused by PCR using primers AF/HR to give insert H02.
Using primer pair AF/ARH with pAlphaAmyCPOT as the template, the signal peptide (α-factor leader+spacer) carrying an H homology arm was amplified; using primer pair AHF/HR with pUC57-Mini_H as the template, the ColH fragment was amplified; the two fragments (ColH and its signal peptide) were fused by PCR using primers AF/HR to give insert H03.
Using primer pair AF/ARWH with pAlphaAmyCPOT as the template, the signal peptide (α-factor leader, consistent with the α-factor leader-colH chain cassette listed for pCP_H04) carrying an H homology arm was amplified; using primer pair WHF/HR with pUC57-Mini_H as the template, the ColH fragment was amplified; the two fragments (ColH and its signal peptide) were fused by PCR using primers AF/HR to give insert H04.
Using primer pair HF/HR with pUC57-Mini_H as the template, insert H05 was amplified.
Fragments G02, G03, G04, G05, G06, G07, H02, H03, H04 and H05 were then inserted into plasmid CPOTud by restriction digestion and ligation; some fragments were instead joined to the plasmid with Seamless Cloning master mix (see the instructions for 世联博研 kit B632219). This gave the expression plasmids pCP_G02, pCP_G03, pCP_G04, pCP_G05, pCP_G06, pCP_G07, pCP_H02, pCP_H03, pCP_H04 and pCP_H05.
The resulting plasmids and the expression cassettes they contain are:
pCP_G01 (containing α-factor leader+spacer-colG pro peptide-colG chain)
pCP_G02 (containing α-factor leader-colG pro peptide-colG chain)
pCP_G03 (containing α-factor pre-colG pro peptide-colG chain)
pCP_G04 (containing α-factor leader+spacer-colG chain)
pCP_G05 (containing α-factor leader-colG chain)
pCP_G06 (containing α-factor pre-colG chain)
pCP_G07 (containing colG pre peptide-colG pro peptide-colG chain)
pCP_H01 (containing α-factor leader+spacer-colH pro peptide-colH chain)
pCP_H02 (containing α-factor leader-colH pro peptide-colH chain)
pCP_H03 (containing α-factor leader+spacer-colH chain)
pCP_H04 (containing α-factor leader-colH chain)
pCP_H05 (containing colH pre peptide-colH pro peptide-colH chain)
After sequencing confirmed the constructs, they were transformed into B184M to give the corresponding strains BG01, BG02, BG03, BG04, BG05, BG06, BG07, BH01, BH02, BH03, BH04 and BH05.
The resulting strains were cultured and assayed for enzyme activity. Figures 6 and 7 show that, among the ColG strains, BG02 has the highest enzyme activity and the most pronounced hydrolysis zone on plates. Figures 8 and 9 show that, among the ColH strains, BH01 has the highest enzyme activity and the most pronounced hydrolysis zone on plates. Strain BG02 carries plasmid pCP_G02 (SEQ ID No. 9), whose expression cassette is α-factor leader-colG pro peptide-colG chain; strain BH01 carries plasmid pCP_H01 (SEQ ID No. 8), whose expression cassette is α-factor leader+spacer-colH pro peptide-colH chain.
Strains BG02 and BH01 were used in the subsequent experiments.
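The twelve cassette designs compared in this example can be written down as a small lookup table, which makes the design space (secretion signal × presence of the pro peptide) easy to inspect. The sketch below simply restates the plasmid list and the two selected constructs from this example; the data structure and function names are illustrative and not part of the source.

```python
# Each entry: plasmid name -> (secretion signal, whether the collagenase
# pro peptide is kept). Content restated from the plasmid list in Example 4.
CASSETTES = {
    "pCP_G01": ("α-factor leader+spacer", True),
    "pCP_G02": ("α-factor leader", True),
    "pCP_G03": ("α-factor pre", True),
    "pCP_G04": ("α-factor leader+spacer", False),
    "pCP_G05": ("α-factor leader", False),
    "pCP_G06": ("α-factor pre", False),
    "pCP_G07": ("colG pre peptide", True),
    "pCP_H01": ("α-factor leader+spacer", True),
    "pCP_H02": ("α-factor leader", True),
    "pCP_H03": ("α-factor leader+spacer", False),
    "pCP_H04": ("α-factor leader", False),
    "pCP_H05": ("colH pre peptide", True),
}

# Constructs reported as giving the highest activity for each enzyme.
SELECTED = {"ColG": "pCP_G02", "ColH": "pCP_H01"}


def describe(plasmid: str) -> str:
    """Rebuild the cassette string, e.g. 'α-factor leader-colG chain'."""
    signal, has_pro_peptide = CASSETTES[plasmid]
    enzyme = "col" + plasmid[4]  # 'G' or 'H' taken from the plasmid name
    parts = [signal]
    if has_pro_peptide:
        parts.append(f"{enzyme} pro peptide")
    parts.append(f"{enzyme} chain")
    return "-".join(parts)


if __name__ == "__main__":
    for enzyme, plasmid in SELECTED.items():
        print(enzyme, plasmid, describe(plasmid))
```

Both selected constructs retain the collagenase pro peptide and differ only in whether the spacer follows the α-factor leader.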
Example 5. Degradation of different substrates by the recombinant collagenases
(1) Degradation of gelatin. Modified YPD medium was supplemented with 1% gelatin. Strains B0, BG02 and BH01 were each inoculated into 2.5 mL of modified YPD+gelatin medium and fermented at 30°C and 200 rpm for 96 h. After fermentation, 500 μL of broth was centrifuged to remove the cells. To 20 μL of fermentation supernatant, 5 μL of 5× SDS-PAGE protein loading buffer was added; the sample was boiled at 100°C for 10 min, loaded, and electrophoresed until the bromophenol blue reached the bottom of the precast gel. After electrophoresis the gel was stained for 30 min, destained to remove excess dye, and photographed with a gel imager. As shown in Figure 10, in the lanes for the blank control and the control strain B0 the gelatin was not degraded and the whole lane was stained, whereas in the lanes for strains BG02 and BH01 the gelatin was clearly hydrolyzed by the secreted collagenase.
(2) Degradation of insoluble bovine bone collagen. Strains B0, BG02 and BH01 were inoculated into 40 mL of modified YPD medium and fermented at 30°C and 200 rpm for 96 h. After fermentation, 20 mL of broth was centrifuged to remove the cells. The supernatant was concentrated in a 50 kDa ultrafiltration centrifuge tube and then diluted to a suitable concentration with 50 mM Tris·Cl. Insoluble bovine bone collagen (2 mg) was added, the mixture was left to stand at 37°C, and the dissolution of the collagen was observed and photographed. The G+H group refers to BG02 and BH01 mixed at 1:1. As Figure 11 shows, compared with the control group, the collagen fibers swelled markedly after treatment with recombinant collagenase, and the two recombinant collagenases showed a synergistic effect. Over time the collagen fibers were enzymatically degraded and gradually disappeared. Collagen fibers treated for 24 h were air-dried on a silicon wafer, mounted on conductive adhesive, sputter-coated with gold, and examined for morphology and structure with an ultra-high-resolution cold field emission scanning electron microscope (SU8220, Hitachi). As shown in Figure 12, compared with the dense, intact fiber structure of the control group, fibers treated with recombinant collagenase ColG appeared loose, and the surface of fibers treated with recombinant collagenase ColH also changed markedly. These observations confirm at the microstructural level that the recombinant collagenases have good collagen-hydrolyzing activity.
Example 6. Production of recombinant collagenase by shake-flask fermentation of the engineered strains
Strains B0, BG02 and BH01 were streaked on YPD plates for activation; a single colony was picked with a 1 μL inoculation loop and transferred to 2.5 mL of modified YPD medium and cultured overnight to give the seed culture. A suitable amount of seed culture was transferred to 40 mL of fresh modified YPD medium to give an initial OD600 of about 0.005, and shake-flask fermentation was carried out at 30°C and 200 rpm with a liquid fill volume of 10%; samples were taken at suitable intervals for enzyme activity assays. The yields of recombinant collagenase from shake-flask fermentation of the engineered strains are shown in Figure 13: the yields of collagenases ColG and ColH reached 6.8×10^4 U/L and 5.5×10^4 U/L, respectively. These results show that the engineered Saccharomyces cerevisiae strains constructed in the present invention efficiently secrete recombinant collagenase, can be used for processing and utilizing collagen resources, and have broad prospects for industrial application.
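The inoculation step in this example amounts to a simple dilution calculation. The sketch below estimates the seed volume needed to start a 40 mL culture at OD600 of about 0.005, and the nominal flask size implied by a 10% fill volume. It assumes that OD600 scales linearly on dilution, that the added seed volume is negligible relative to 40 mL, and that the 10% figure is the ratio of liquid volume to flask volume; the seed OD600 used in the demo is a hypothetical value, since the source does not report one.

```python
def inoculum_volume_mL(
    seed_od600: float,
    target_od600: float = 0.005,
    culture_volume_mL: float = 40.0,
) -> float:
    """Seed culture volume (mL) giving target_od600 in culture_volume_mL.

    Uses C1*V1 = C2*V2 and ignores the small volume added by the seed.
    """
    if seed_od600 <= target_od600:
        raise ValueError("seed culture must be denser than the target OD600")
    return target_od600 * culture_volume_mL / seed_od600


def flask_volume_mL(culture_volume_mL: float = 40.0,
                    fill_fraction: float = 0.10) -> float:
    """Nominal flask volume implied by the liquid fill fraction."""
    return culture_volume_mL / fill_fraction


if __name__ == "__main__":
    seed_od = 10.0  # hypothetical overnight seed OD600, for illustration only
    print(f"inoculum: {inoculum_volume_mL(seed_od) * 1000:.1f} uL")
    print(f"flask size: {flask_volume_mL():.0f} mL")
```

With such a small inoculum, an intermediate dilution of the seed culture may be more practical to pipette accurately.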
The above examples are preferred embodiments of the present invention, but the embodiments of the invention are not limited by them. Any other change, modification, substitution, combination or simplification made without departing from the spirit and principle of the invention is an equivalent replacement and falls within the scope of protection of the invention.
Sequence Listing
<110> South China University of Technology
<120> Recombinant plasmid expressing collagenase, yeast strain, fermentation medium and fermentation culture method thereof
<160> 9
<170> SIPOSequenceListing 1.0
<210> 1
<211> 267
<212> DNA
<213> Artificial Sequence
<400> 1
atgagatttc catctatttt tactgctgtt ttgtttgctg cttcttctgc tttggctgct 60
ccagttaata ctactactga agatgaaact gctcaaattc cagctgaagc tgttattggt 120
tattctgatt tggagggtga ctttgatgtt gctgttttgc cattttctaa ctctactaac 180
aacggtttgc tattcatcaa cactactatc gcttctatcg ctgctaaaga agaaggtgtt 240
tctttggata aaagagaggc tgaagct 267
<210> 2
<211> 3219
<212> DNA
<213> Artificial Sequence
<400> 2
aagccaatcg aaaacactaa cgacaccagc atcaagaacg ttgaaaagtt gagaaacgcc 60
ccaaacgaag aaaactctaa gaaagttgaa gactctaaga atgataaggt tgaacacgtt 120
aagaacattg aagaagctaa ggtcgaacaa gttgctccag aagtcaagtc taaaagtacc 180
ttgagatctg cttctatagc taatacaaat tccgaaaagt acgacttcga atatcttaat 240
ggtttgagtt acaccgaatt gactaacttg atcaagaaca tcaagtggaa ccaaatcaac 300
ggtttattca actactccac cggttcgcaa aaatttttcg gtgacaagaa cagagtccaa 360
gccatcatca acgctctaca agaatctggt agaacttaca ctgccaatga catgaagggt 420
attgaaactt tcaccgaagt tttgagagcc ggtttttact tgggctatta caacgatggt 480
ctctcctacc tgaacgaccg taatttccaa gacaagtgta tcccagccat gattgctatt 540
caaaagaacc caaacttcaa gttaggtact gctgttcaag acgaagttat cacatctttg 600
ggtaagttga ttggtaacgc ctctgctaac gctgaagttg ttaataactg tgtcccagtt 660
ttgaagcaat tcagagaaaa cttaaaccaa tacgctccag actacgtcaa aggtactgcc 720
gtaaatgaat tgatcaaggg tattgaattt gacttttccg gtgctgctta cgaaaaggac 780
gtcaagacca tgccatggta cggtaagatt gacccattca tcaatgaatt aaaggccttg 840
ggtctctacg gtaacatcac ttctgccact gaatgggctt cagatgttgg tatctactac 900
ttgtctaagt tcggtttgta ctcaaccaat cgtaacgaca ttgttcaatc cttagaaaag 960
gctgttgata tgtacaaata tggtaagatc gctttcgttg ctatggaaag aatcacctgg 1020
gattacgatg gtatcggttc caacgggaag aaggtcgacc acgataagtt cttggacgat 1080
gcagaaaagc attatttgcc aaagacctat actttcgaca acggaacctt catcatcaga 1140
gctggagata aggtttctga agaaaagatt aagagattgt actgggcttc ccgtgaagtt 1200
aagtctcaat tccacagagt tgttggtaac gataaggctt tggaagtggg taacgctgac 1260
gatgtcttga ccatgaaaat cttcaactct ccagaagaat acaagttcaa caccaacatt 1320
aacggtgtct ccactgataa cggtggtttg tacatcgaac caagaggtac tttctacact 1380
tacgaaagaa ccccacaaca atctattttc tccttggaag aactgttcag acacgaatac 1440
acccactact tgcaagctag atacttggtt gacggtttgt ggggtcaagg tcctttctac 1500
gaaaaaaaca gattgacttg gttcgatgaa ggtactgctg aatttttcgc tggttccacc 1560
agaacttctg gtgtgttgcc aagaaagtcc atcttgggtt accttgccaa ggataaggtc 1620
gatcatagat actccttgaa gaaaactttg aactctggtt acgatgactc tgactggatg 1680
ttctacaact acggttttgc tgttgctcac tacttatacg aaaaggacat gccaactttt 1740
attaagatga acaaggccat cttgaacact gatgttaagt cttacgacga aatcattaaa 1800
aaactgtctg acgacgctaa caagaacact gaataccaaa accacatcca agaactcgct 1860
gataagtacc aaggtgccgg tataccattg gtgtctgatg actacctgaa ggaccatggt 1920
tacaagaagg cttccgaagt ttactcagaa atctccaagg ctgcgtcttt aactaacact 1980
tctgttaccg ctgaaaagtc tcaatacttc aacacattca ctttgagggg tacttacact 2040
ggtgaaacct ctaagggtga atttaaggac tgggacgaaa tgtctaagaa gttggacggt 2100
actttggagt ctttagctaa gaactcttgg tctggttaca agactttgac cgcttacttc 2160
accaactaca gagtcacttc tgataacaaa gtccaatatg atgttgtctt ccacggtgtt 2220
ttgactgaca acgccgatat ttctaacaac aaggctccta ttgctaaggt caccggtcca 2280
tctactggtg ctgttggtag aaacattgaa ttttctggta aggattccaa ggacgaagat 2340
ggcaagattg tctcttatga ctgggatttc ggtgatggtg ctacctccag aggtaagaac 2400
tctgtccacg cttacaagaa ggctggtact tacaacgtca ctttgaaggt caccgatgac 2460
aagggtgcta cggctactga aagtttcact attgaaatta aaaacgaaga cactaccacc 2520
ccaatcacca aggaaatgga accaaacgat gacattaagg aagccaacgg cccgattgtc 2580
gaaggcgtca cggtcaaagg tgacttgaac ggttctgatg acgctgacac tttctacttt 2640
gatgtcaagg aagacggtga cgttaccatc gaattgccat actctggttc ctctaacttt 2700
acctggttag tctacaagga aggtgacgac caaaaccaca tcgcctccgg tatcgataaa 2760
aacaactcca aggttggtac tttcaagtcc actaagggtc gtcattacgt tttcatctac 2820
aagcacgact ctgcttccaa catttcatac tccttgaaca tcaagggttt gggtaacgaa 2880
aagttgaaag agaaggaaaa caatgactcc tctgacaagg caaccgttat tccaaatttc 2940
aataccacca tgcaaggttc tctattaggg gacgattcta gagactacta ctccttcgaa 3000
gtcaaggaag aaggtgaagt taacattgaa ttggataaaa aggatgaatt tggtgttact 3060
tggaccttgc acccagaaag caacatcaac gacagaatta cctacggtca agtcgatggt 3120
aacaaggtat ccaacaaggt caaattgaga ccaggtaagt actacttgtt ggtttacaag 3180
tacagtggtt ccggtaacta cgaattgcgg gttaacaag 3219
<210> 3
<211> 273
<212> DNA
<213> Artificial Sequence
<400> 3
atgagatttc catctatttt tactgctgtt ttgtttgctg cttcttctgc tttggctgct 60
ccagttaata ctactactga agatgaaact gctcaaattc cagctgaagc tgttattggt 120
tattctgatt tggagggtga ctttgatgtt gctgttttgc cattttctaa ctctactaac 180
aacggtttgc tattcatcaa cactactatc gcttctatcg ctgctaaaga agaaggtgtt 240
tctttggata aaagagaaga aggtgaacca aaa 273
<210> 4
<211> 2973
<212> DNA
<213> Artificial Sequence
<400> 4
gctgttgaca agaacaatgc cactgctgcc gtccaaaacg aatccaagcg ttacactgtt 60
tcttacttaa agaccttgaa ctattacgac ttggtcgatc tattggttaa gaccgaaatc 120
gaaaacttac cagacttgtt tcaatactct tccgacgcta aggaatttta cggtaacaag 180
actagaatgt cctttatcat ggacgaaatt ggtcgtagag ctccacaata cactgaaatt 240
gaccacaagg gtataccaac cctcgttgaa gtcgtcagag ccgggttcta cttgggtttt 300
cacaacaaag aattgaatga aatcaacaag agatctttca aagaaagagt tatcccgtct 360
atcttggcta tccaaaagaa cccaaacttc aaattaggta ctgaagttca agacaagatt 420
gtctctgcta ccggtctatt ggctggtaac gaaaccgctc caccagaagt tgttaacaac 480
ttcaccccaa tcttgcaaga ttgtatcaag aacatcgaca gatacgcctt ggacgacttg 540
aaaagtaaag cattgttcaa cgttttggcc gccccaacct acgacattac tgaatacctc 600
agagctacca aggaaaagcc agaaaacact ccatggtacg gtaagatcga tggtttcatc 660
aacgaattga agaagctcgc tttgtacggt aaaattaacg acaacaactc ttggattatc 720
gataacggta tttaccacat tgctccattg ggtaaattgc actctaacaa caaaatcggt 780
atcgaaactc taactgaagt tatgaaggtc tacccatact tgtccatgca acacttgcaa 840
tccgctgacc aaatcaagag acactacgac tctaaggatg ccgaaggtaa caagatccca 900
ttagataagt tcaagaagga aggcaaggaa aagtactgcc caaagaccta cacttttgat 960
gatggtaagg ttatcatcaa ggccggtgct cgtgtcgaag aagaaaaggt caaaagatta 1020
tactgggctt ccaaggaagt caacagtcaa ttcttcagag tttatggtat tgacaagcca 1080
ttggaagaag gtaacccaga cgacatcctt accatggtta tttacaactc tccagaagaa 1140
tacaaactta actccgtctt gtacggttac gataccaaca acggtggtat gtacattgaa 1200
ccagaaggta cttttttcac ttacgaaaga gaagctcaag aatctacgta cactttggaa 1260
gaattattca gacacgaata cacccattac ttgcaaggta gatacgctgt cccaggtcaa 1320
tggggtcgta ccaagttgta tgataacgac agattgacct ggtacgaaga aggtggtgct 1380
gaactgtttg ctggttctac tagaacttcc ggcatcttgc caagaaagtc tattgtctct 1440
aacattcaca acaccactag aaacaacaga tacaagctgt ctgacactgt tcattcgaag 1500
tacggtgcct ctttcgaatt ttacaactac gcttgtatgt tcatggacta catgtacaat 1560
aaggacatgg gtattttgaa caagttgaat gatttggcta agaacaacga cgttgacggt 1620
tacgacaatt acatcagaga tttgtcatct aattacgctt tgaacgataa gtaccaagat 1680
cacatgcaag aacgtattga caactatgaa aacttgaccg ttccattcgt tgctgacgac 1740
tacttagtta gacacgctta caagaaccca aacgaaattt actccgaaat ctccgaagtt 1800
gccaagttga aggatgctaa gtctgaagtt aagaaatctc aatacttctc aactttcact 1860
ttgcgtggta gttacaccgg aggtgcttcc aagggtaagc tggaagatca aaaagctatg 1920
aacaagttca ttgacgactc tttgaagaag ttggacacct attcctggtc tggttacaag 1980
actcttactg cttattttac caactacaag gtcgattctt ctaacagagt tacttacgat 2040
gtcgttttcc atggttactt gccaaacgaa ggtgattcca agaacagctt gccatacggt 2100
aagattaacg gtacttacaa gggtactgag aaggaaaaga ttaagttctc ctccgaaggt 2160
tctttcgacc cagacggaaa gatagtttcc tacgaatggg acttcggtga tggtaacaag 2220
tccaacgaag agaaccctga acacagttac gataaggtcg gtacttacac tgtcaagttg 2280
aaggttaccg atgataaggg tgaaagctct gtttctacta ccactgctga aatcaaagat 2340
ttgtctgaaa acaagttgcc agtcatctat atgcacgttc ctaaatcggg tgctttaaac 2400
caaaaggttg tcttctacgg taagggtact tacgacccag acggttctat cgctggctac 2460
caatgggatt tcggtgacgg ttctgacttt tcctctgaac agaacccatc tcacgtctac 2520
accaagaaag gtgaatacac tgtgaccttg agagtcatgg attcttccgg tcaaatgtct 2580
gaaaagacca tgaaaatcaa gatcactgat ccagtatacc ctattggtac tgaaaaggaa 2640
ccaaacaact ctaaggaaac agcctccggt ccaatcgttc cgggtattcc agtctccggt 2700
actatcgaaa acacctctga tcaagactat ttctacttcg atgttattac tccaggtgaa 2760
gtcaagatcg acatcaacaa gctaggttac ggtggtgcta catgggttgt ttacgatgaa 2820
aacaataacg ctgtctccta cgctaccgat gacggtcaaa atttgtctgg taaattcaag 2880
gctgacaagc ctgggagata ctacattcat ttgtacatgt tcaatggttc ttacatgcca 2940
tacagaatta acattgaagg ttctgttggt aga 2973
<210> 5
<211> 5216
<212> DNA
<213> Artificial Sequence
<400> 5
ccaatgataa aggtaccaac aaaatgaaga agaacatctt gaagattttg atggactctt 60
actccaagga atccaaaatc caaactgtta gaagagttac ttctgtttct ttgttggctg 120
tctacttgac tatgaacacc tcaagtctag tgttggctaa gccaatcgaa aacactaacg 180
acaccagcat caagaacgtt gaaaagttga gaaacgcccc aaacgaagaa aactctaaga 240
aagttgaaga ctctaagaat gataaggttg aacacgttaa gaacattgaa gaagctaagg 300
tcgaacaagt tgctccagaa gtcaagtcta aaagtacctt gagatctgct tctatagcta 360
atacaaattc cgaaaagtac gacttcgaat atcttaatgg tttgagttac accgaattga 420
ctaacttgat caagaacatc aagtggaacc aaatcaacgg tttattcaac tactccaccg 480
gttcgcaaaa atttttcggt gacaagaaca gagtccaagc catcatcaac gctctacaag 540
aatctggtag aacttacact gccaatgaca tgaagggtat tgaaactttc accgaagttt 600
tgagagccgg tttttacttg ggctattaca acgatggtct ctcctacctg aacgaccgta 660
atttccaaga caagtgtatc ccagccatga ttgctattca aaagaaccca aacttcaagt 720
taggtactgc tgttcaagac gaagttatca catctttggg taagttgatt ggtaacgcct 780
ctgctaacgc tgaagttgtt aataactgtg tcccagtttt gaagcaattc agagaaaact 840
taaaccaata cgctccagac tacgtcaaag gtactgccgt aaatgaattg atcaagggta 900
ttgaatttga cttttccggt gctgcttacg aaaaggacgt caagaccatg ccatggtacg 960
gtaagattga cccattcatc aatgaattaa aggccttggg tctctacggt aacatcactt 1020
ctgccactga atgggcttca gatgttggta tctactactt gtctaagttc ggtttgtact 1080
caaccaatcg taacgacatt gttcaatcct tagaaaaggc tgttgatatg tacaaatatg 1140
gtaagatcgc tttcgttgct atggaaagaa tcacctggga ttacgatggt atcggttcca 1200
acgggaagaa ggtcgaccac gataagttct tggacgatgc agaaaagcat tatttgccaa 1260
agacctatac tttcgacaac ggaaccttca tcatcagagc tggagataag gtttctgaag 1320
aaaagattaa gagattgtac tgggcttccc gtgaagttaa gtctcaattc cacagagttg 1380
ttggtaacga taaggctttg gaagtgggta acgctgacga tgtcttgacc atgaaaatct 1440
tcaactctcc agaagaatac aagttcaaca ccaacattaa cggtgtctcc actgataacg 1500
gtggtttgta catcgaacca agaggtactt tctacactta cgaaagaacc ccacaacaat 1560
ctattttctc cttggaagaa ctgttcagac acgaatacac ccactacttg caagctagat 1620
acttggttga cggtttgtgg ggtcaaggtc ctttctacga aaaaaacaga ttgacttggt 1680
tcgatgaagg tactgctgaa tttttcgctg gttccaccag aacttctggt gtgttgccaa 1740
gaaagtccat cttgggttac cttgccaagg ataaggtcga tcatagatac tccttgaaga 1800
aaactttgaa ctctggttac gatgactctg actggatgtt ctacaactac ggttttgctg 1860
ttgctcacta cttatacgaa aaggacatgc caacttttat taagatgaac aaggccatct 1920
tgaacactga tgttaagtct tacgacgaaa tcattaaaaa actgtctgac gacgctaaca 1980
agaacactga ataccaaaac cacatccaag aactcgctga taagtaccaa ggtgccggta 2040
taccattggt gtctgatgac tacctgaagg accatggtta caagaaggct tccgaagttt 2100
actcagaaat ctccaaggct gcgtctttaa ctaacacttc tgttaccgct gaaaagtctc 2160
aatacttcaa cacattcact ttgaggggta cttacactgg tgaaacctct aagggtgaat 2220
ttaaggactg ggacgaaatg tctaagaagt tggacggtac tttggagtct ttagctaaga 2280
actcttggtc tggttacaag actttgaccg cttacttcac caactacaga gtcacttctg 2340
ataacaaagt ccaatatgat gttgtcttcc acggtgtttt gactgacaac gccgatattt 2400
ctaacaacaa ggctcctatt gctaaggtca ccggtccatc tactggtgct gttggtagaa 2460
acattgaatt ttctggtaag gattccaagg acgaagatgg caagattgtc tcttatgact 2520
gggatttcgg tgatggtgct acctccagag gtaagaactc tgtccacgct tacaagaagg 2580
ctggtactta caacgtcact ttgaaggtca ccgatgacaa gggtgctacg gctactgaaa 2640
gtttcactat tgaaattaaa aacgaagaca ctaccacccc aatcaccaag gaaatggaac 2700
caaacgatga cattaaggaa gccaacggcc cgattgtcga aggcgtcacg gtcaaaggtg 2760
acttgaacgg ttctgatgac gctgacactt tctactttga tgtcaaggaa gacggtgacg 2820
ttaccatcga attgccatac tctggttcct ctaactttac ctggttagtc tacaaggaag 2880
gtgacgacca aaaccacatc gcctccggta tcgataaaaa caactccaag gttggtactt 2940
tcaagtccac taagggtcgt cattacgttt tcatctacaa gcacgactct gcttccaaca 3000
tttcatactc cttgaacatc aagggtttgg gtaacgaaaa gttgaaagag aaggaaaaca 3060
atgactcctc tgacaaggca accgttattc caaatttcaa taccaccatg caaggttctc 3120
tattagggga cgattctaga gactactact ccttcgaagt caaggaagaa ggtgaagtta 3180
acattgaatt ggataaaaag gatgaatttg gtgttacttg gaccttgcac ccagaaagca 3240
acatcaacga cagaattacc tacggtcaag tcgatggtaa caaggtatcc aacaaggtca 3300
aattgagacc aggtaagtac tacttgttgg tttacaagta cagtggttcc ggtaactacg 3360
aattgcgggt taacaagtaa gctagcttta tcggaaagaa catgtgagca aaaggccagc 3420
aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc 3480
ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat 3540
aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc 3600
cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct 3660
cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg 3720
aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc 3780
cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga 3840
ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa 3900
gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta 3960
gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc 4020
agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg 4080
acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga 4140
tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg 4200
agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct 4260
gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg 4320
agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc 4380
cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa 4440
ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc 4500
cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt 4560
cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc 4620
ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt 4680
tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc 4740
catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt 4800
gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata 4860
gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga 4920
tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag 4980
catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa 5040
aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt 5100
attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga 5160
aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtc 5216
<210> 6
<211> 4925
<212> DNA
<213> Artificial Sequence
<400> 6
ccaatgataa aggtaccaac aaaatgaaga gaaaatgttt gtctaagaga ttgatgttgg 60
ctatcaccat ggctaccatc ttcaccgtca attctacatt gccaatttac gctgctgttg 120
acaagaacaa tgccactgct gccgtccaaa acgaatccaa gcgttacact gtttcttact 180
taaagacctt gaactattac gacttggtcg atctattggt taagaccgaa atcgaaaact 240
taccagactt gtttcaatac tcttccgacg ctaaggaatt ttacggtaac aagactagaa 300
tgtcctttat catggacgaa attggtcgta gagctccaca atacactgaa attgaccaca 360
agggtatacc aaccctcgtt gaagtcgtca gagccgggtt ctacttgggt tttcacaaca 420
aagaattgaa tgaaatcaac aagagatctt tcaaagaaag agttatcccg tctatcttgg 480
ctatccaaaa gaacccaaac ttcaaattag gtactgaagt tcaagacaag attgtctctg 540
ctaccggtct attggctggt aacgaaaccg ctccaccaga agttgttaac aacttcaccc 600
caatcttgca agattgtatc aagaacatcg acagatacgc cttggacgac ttgaaaagta 660
aagcattgtt caacgttttg gccgccccaa cctacgacat tactgaatac ctcagagcta 720
ccaaggaaaa gccagaaaac actccatggt acggtaagat cgatggtttc atcaacgaat 780
tgaagaagct cgctttgtac ggtaaaatta acgacaacaa ctcttggatt atcgataacg 840
gtatttacca cattgctcca ttgggtaaat tgcactctaa caacaaaatc ggtatcgaaa 900
ctctaactga agttatgaag gtctacccat acttgtccat gcaacacttg caatccgctg 960
accaaatcaa gagacactac gactctaagg atgccgaagg taacaagatc ccattagata 1020
agttcaagaa ggaaggcaag gaaaagtact gcccaaagac ctacactttt gatgatggta 1080
aggttatcat caaggccggt gctcgtgtcg aagaagaaaa ggtcaaaaga ttatactggg 1140
cttccaagga agtcaacagt caattcttca gagtttatgg tattgacaag ccattggaag 1200
aaggtaaccc agacgacatc cttaccatgg ttatttacaa ctctccagaa gaatacaaac 1260
ttaactccgt cttgtacggt tacgatacca acaacggtgg tatgtacatt gaaccagaag 1320
gtactttttt cacttacgaa agagaagctc aagaatctac gtacactttg gaagaattat 1380
tcagacacga atacacccat tacttgcaag gtagatacgc tgtcccaggt caatggggtc 1440
gtaccaagtt gtatgataac gacagattga cctggtacga agaaggtggt gctgaactgt 1500
ttgctggttc tactagaact tccggcatct tgccaagaaa gtctattgtc tctaacattc 1560
acaacaccac tagaaacaac agatacaagc tgtctgacac tgttcattcg aagtacggtg 1620
cctctttcga attttacaac tacgcttgta tgttcatgga ctacatgtac aataaggaca 1680
tgggtatttt gaacaagttg aatgatttgg ctaagaacaa cgacgttgac ggttacgaca 1740
attacatcag agatttgtca tctaattacg ctttgaacga taagtaccaa gatcacatgc 1800
aagaacgtat tgacaactat gaaaacttga ccgttccatt cgttgctgac gactacttag 1860
ttagacacgc ttacaagaac ccaaacgaaa tttactccga aatctccgaa gttgccaagt 1920
tgaaggatgc taagtctgaa gttaagaaat ctcaatactt ctcaactttc actttgcgtg 1980
gtagttacac cggaggtgct tccaagggta agctggaaga tcaaaaagct atgaacaagt 2040
tcattgacga ctctttgaag aagttggaca cctattcctg gtctggttac aagactctta 2100
ctgcttattt taccaactac aaggtcgatt cttctaacag agttacttac gatgtcgttt 2160
tccatggtta cttgccaaac gaaggtgatt ccaagaacag cttgccatac ggtaagatta 2220
acggtactta caagggtact gagaaggaaa agattaagtt ctcctccgaa ggttctttcg 2280
acccagacgg aaagatagtt tcctacgaat gggacttcgg tgatggtaac aagtccaacg 2340
aagagaaccc tgaacacagt tacgataagg tcggtactta cactgtcaag ttgaaggtta 2400
ccgatgataa gggtgaaagc tctgtttcta ctaccactgc tgaaatcaaa gatttgtctg 2460
aaaacaagtt gccagtcatc tatatgcacg ttcctaaatc gggtgcttta aaccaaaagg 2520
ttgtcttcta cggtaagggt acttacgacc cagacggttc tatcgctggc taccaatggg 2580
atttcggtga cggttctgac ttttcctctg aacagaaccc atctcacgtc tacaccaaga 2640
aaggtgaata cactgtgacc ttgagagtca tggattcttc cggtcaaatg tctgaaaaga 2700
ccatgaaaat caagatcact gatccagtat accctattgg tactgaaaag gaaccaaaca 2760
actctaagga aacagcctcc ggtccaatcg ttccgggtat tccagtctcc ggtactatcg 2820
aaaacacctc tgatcaagac tatttctact tcgatgttat tactccaggt gaagtcaaga 2880
tcgacatcaa caagctaggt tacggtggtg ctacatgggt tgtttacgat gaaaacaata 2940
acgctgtctc ctacgctacc gatgacggtc aaaatttgtc tggtaaattc aaggctgaca 3000
agcctgggag atactacatt catttgtaca tgttcaatgg ttcttacatg ccatacagaa 3060
ttaacattga aggttctgtt ggtagataag ctagctttat cggaaagaac atgtgagcaa 3120
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 3180
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 3240
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 3300
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 3360
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 3420
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 3480
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 3540
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 3600
acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 3660
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 3720
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 3780
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 3840
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 3900
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 3960
cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 4020
cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 4080
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 4140
gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 4200
gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 4260
cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 4320
catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 4380
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 4440
ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 4500
gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 4560
cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 4620
tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 4680
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 4740
atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 4800
ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 4860
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 4920
acgtc 4925
<210> 7
<211> 11468
<212> DNA
<213> Artificial Sequence
<400> 7
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accataccac agcttttcaa ttcaattcat catttttttt ttattctttt ttttgatttc 240
ggtttctttg aaattttttt gattcggtaa tctccgaaca gaaggaagaa cgaaggaagg 300
agcacagact tagattggta tatatacgca tatgtagtgt tgaagaaaca tgaaattgcc 360
cagtattctt aacccaactg cacagaacaa aaacctgcag gaaacgaaga taaatctaaa 420
aaactgtatt ataagtaaat gcatgtatac taaactcaca aattagagct tcaatttaat 480
tatatcagtt attaccctat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 540
cgcatcagga aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 600
tcagctcatt ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 660
agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 720
tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 780
catcacccta atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta 840
aagggagccc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 900
ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 960
taaccaccac acccgccgcg cttaatgcgc cgctacaggg cgcgtccatt cgccattcag 1020
gctgcgcaac tgttgggaag ggcgatcggt gcgggcctct tcgctattac gccagctgga 1080
taaaggcgcg ccgtcgacaa tatagaaatt tcttagtttt aatgattgaa tgaatccatt 1140
tcgaacacgt caatgtactt ggacaagctt tgtagaacat gaagtacttg agcaatggac 1200
gcccaactgg aatttacttt agcgacacta ttccaagacc agcaactcaa agaagtaaat 1260
tctttttttc acaatagttg tttttttgat gatgtacgtc ccagcagcac ctaatggtat 1320
ttttatgagt attggcagtt tgccttatta attaagcaaa ccggtcatat attttaaaaa 1380
ataggaactt tgtagtcctt attaatgctg aataaaaaaa taggaagaga aatcagcaag 1440
tataaacagt taaatgtttt ttaactataa tggttgcatt caagaggaca tgtaaattgc 1500
gtactcggta caagttggta aaaacgttaa agcactccta cgagatttag tcagtcctgg 1560
cattaataaa caaaaagacc tgtatctata tttcatgaac tgggtaaaat caccacacaa 1620
aaaagaataa acaaatataa tgaactactt ttggtcttgg actcgcagtt tatagtttgt 1680
taaaattgac aaacacacat gttctgatta acttacttcc gcttcaaagt actgaaaaca 1740
tcatacgaac tggtataaga aggaagatag caaataaaaa taaatcaaat attgcttgta 1800
aaaaattagg atataaaaaa atctctattc attaaagaca aaaagctaaa ggaataaaaa 1860
agtcattcgc tttacgccaa ataaaacgtc tatgcacgtt tttaaaggct gtgaacatta 1920
acaatgttgt ggaattcagg cttgagagaa gcaccgccaa ccaagaaacc gtcaatatcg 1980
tggaacttga ggaactcctt gcagttacca ccgttaacgg aaccaccgta gatgacacgg 2040
agaccctcgg caacagatgc accaagcttg ttggtagccc acttgcggat ctcagcgtga 2100
acctcttgag cttgctcagg ggtggcagtc ttaccagtac caatggccca gacaggctca 2160
taagcaatga caatcttgga ccagttctgg accttgtcag cgatggcgtt caattgacga 2220
acaacaacgt tgatggtctc gttagcctca cgctcggcca aagtctcacc aatgcaggca 2280
acgacagtaa gaccttgttc aagggcaaac ttggtcttgt cggcaacgaa ctcgtcagac 2340
tccttgaaga tggtacgacg ctcggagtga ccagtcaaag tgtaggtaat accagcatca 2400
atcaaagatt gagcactgtt ctcaccagtg taggcaccgt tcttcttgtc gaagacgttt 2460
tgggcaccaa cgccaatatc cttcttgact tgttggcggg tggtgatgag gtacatgttt 2520
tgagggaaga tgacagtttc gacatcacca acgttaagct tggtggtgtt caaaccctca 2580
ataatagtct tcatggactc caaagagcca ttcatcttaa agttaccacc gacaaagaat 2640
ttacgtgcca ttttgatcct agttctacta ttgaaatgta gttggggaga gacgagagtt 2700
ggcccgtccg ctcattatat ataacgtagc ggacagtcac acagttaagg ggaattaccg 2760
agcttcggca atttaccccg tcgatagcaa ccgttggcat ggatccggcc ggccagatct 2820
acgtatggtc atttcttctt cagattccct catggagaaa gtgcggcaga tgtatatgac 2880
agagtcgcca gtttccaaga gactttattc aggcacttcc atgataggca agagagaaga 2940
cccagagatg ttgttgtcct agttacacat ggtatttatt ccagagtatt cctgatgaaa 3000
tggtttagat ggacatacga agagtttgaa tcgtttacca atgttcctaa cgggagcgta 3060
atggtgatgg aactggacga atccatcaat agatacgtcc tgaggaccgt gctacccaaa 3120
tggactgatt gtgagggaga cctaactaca tagtgtttaa agattacgga tatttaactt 3180
acttagaata atgccatttt tttgagttat aataatccta cgttagtgtg agcgggattt 3240
aaactgtgag gaccttaata cattcagaca cttctgcggt atcaccctac ttattccctt 3300
cgagattata tctaggaacc catcaggttg gtggaagatt acccgttcta agacttttca 3360
gcttcctcta ttgatgttac acctggacac cccttttctg gcatccagtt tttaatcttc 3420
agtggcatgt gagattctcc gaaattaatt aaagcaatca cacaattctc tcggatacca 3480
cctcggttga aactgacagg tggtttgtta cgcatgctaa tgcaaaggag cctatatacc 3540
tttggctcgg ctgctgtaac agggaatata aagggcagca taatttagga gtttagtgaa 3600
cttgcaacat ttactatttt cccttcttac gtaaatattt ttctttttaa ttctaaatca 3660
atctttttca attttttgtt tgtattcttt tcttgcttaa atctataact acaaaaaaca 3720
catacataaa ctaaaaggta ccaacaaaat gagatttcca tctattttta ctgctgtttt 3780
gtttgctgct tcttctgctt tggctgctcc agttaatact actactgaag atgaaactgc 3840
tcaaattcca gctgaagctg ttattggtta ttctgatttg gagggtgact ttgatgttgc 3900
tgttttgcca ttttctaact ctactaacaa cggtttgcta ttcatcaaca ctactatcgc 3960
ttctatcgct gctaaagaag aaggtgtttc tttggataaa agagaagaag gtgaaccaaa 4020
aaagccaatc gaaaacacta acgacaccag catcaagaac gttgaaaagt tgagaaacgc 4080
cccaaacgaa gaaaactcta agaaagttga agactctaag aatgataagg ttgaacacgt 4140
taagaacatt gaagaagcta aggtcgaaca agttgctcca gaagtcaagt ctaaaagtac 4200
cttgagatct gcttctatag ctaatacaaa ttccgaaaag tacgacttcg aatatcttaa 4260
tggtttgagt tacaccgaat tgactaactt gatcaagaac atcaagtgga accaaatcaa 4320
cggtttattc aactactcca ccggttcgca aaaatttttc ggtgacaaga acagagtcca 4380
agccatcatc aacgctctac aagaatctgg tagaacttac actgccaatg acatgaaggg 4440
tattgaaact ttcaccgaag ttttgagagc cggtttttac ttgggctatt acaacgatgg 4500
tctctcctac ctgaacgacc gtaatttcca agacaagtgt atcccagcca tgattgctat 4560
tcaaaagaac ccaaacttca agttaggtac tgctgttcaa gacgaagtta tcacatcttt 4620
gggtaagttg attggtaacg cctctgctaa cgctgaagtt gttaataact gtgtcccagt 4680
tttgaagcaa ttcagagaaa acttaaacca atacgctcca gactacgtca aaggtactgc 4740
cgtaaatgaa ttgatcaagg gtattgaatt tgacttttcc ggtgctgctt acgaaaagga 4800
cgtcaagacc atgccatggt acggtaagat tgacccattc atcaatgaat taaaggcctt 4860
gggtctctac ggtaacatca cttctgccac tgaatgggct tcagatgttg gtatctacta 4920
cttgtctaag ttcggtttgt actcaaccaa tcgtaacgac attgttcaat ccttagaaaa 4980
ggctgttgat atgtacaaat atggtaagat cgctttcgtt gctatggaaa gaatcacctg 5040
ggattacgat ggtatcggtt ccaacgggaa gaaggtcgac cacgataagt tcttggacga 5100
tgcagaaaag cattatttgc caaagaccta tactttcgac aacggaacct tcatcatcag 5160
agctggagat aaggtttctg aagaaaagat taagagattg tactgggctt cccgtgaagt 5220
taagtctcaa ttccacagag ttgttggtaa cgataaggct ttggaagtgg gtaacgctga 5280
cgatgtcttg accatgaaaa tcttcaactc tccagaagaa tacaagttca acaccaacat 5340
taacggtgtc tccactgata acggtggttt gtacatcgaa ccaagaggta ctttctacac 5400
ttacgaaaga accccacaac aatctatttt ctccttggaa gaactgttca gacacgaata 5460
cacccactac ttgcaagcta gatacttggt tgacggtttg tggggtcaag gtcctttcta 5520
cgaaaaaaac agattgactt ggttcgatga aggtactgct gaatttttcg ctggttccac 5580
cagaacttct ggtgtgttgc caagaaagtc catcttgggt taccttgcca aggataaggt 5640
cgatcataga tactccttga agaaaacttt gaactctggt tacgatgact ctgactggat 5700
gttctacaac tacggttttg ctgttgctca ctacttatac gaaaaggaca tgccaacttt 5760
tattaagatg aacaaggcca tcttgaacac tgatgttaag tcttacgacg aaatcattaa 5820
aaaactgtct gacgacgcta acaagaacac tgaataccaa aaccacatcc aagaactcgc 5880
tgataagtac caaggtgccg gtataccatt ggtgtctgat gactacctga aggaccatgg 5940
ttacaagaag gcttccgaag tttactcaga aatctccaag gctgcgtctt taactaacac 6000
ttctgttacc gctgaaaagt ctcaatactt caacacattc actttgaggg gtacttacac 6060
tggtgaaacc tctaagggtg aatttaagga ctgggacgaa atgtctaaga agttggacgg 6120
tactttggag tctttagcta agaactcttg gtctggttac aagactttga ccgcttactt 6180
caccaactac agagtcactt ctgataacaa agtccaatat gatgttgtct tccacggtgt 6240
tttgactgac aacgccgata tttctaacaa caaggctcct attgctaagg tcaccggtcc 6300
atctactggt gctgttggta gaaacattga attttctggt aaggattcca aggacgaaga 6360
tggcaagatt gtctcttatg actgggattt cggtgatggt gctacctcca gaggtaagaa 6420
ctctgtccac gcttacaaga aggctggtac ttacaacgtc actttgaagg tcaccgatga 6480
caagggtgct acggctactg aaagtttcac tattgaaatt aaaaacgaag acactaccac 6540
cccaatcacc aaggaaatgg aaccaaacga tgacattaag gaagccaacg gcccgattgt 6600
cgaaggcgtc acggtcaaag gtgacttgaa cggttctgat gacgctgaca ctttctactt 6660
tgatgtcaag gaagacggtg acgttaccat cgaattgcca tactctggtt cctctaactt 6720
tacctggtta gtctacaagg aaggtgacga ccaaaaccac atcgcctccg gtatcgataa 6780
aaacaactcc aaggttggta ctttcaagtc cactaagggt cgtcattacg ttttcatcta 6840
caagcacgac tctgcttcca acatttcata ctccttgaac atcaagggtt tgggtaacga 6900
aaagttgaaa gagaaggaaa acaatgactc ctctgacaag gcaaccgtta ttccaaattt 6960
caataccacc atgcaaggtt ctctattagg ggacgattct agagactact actccttcga 7020
agtcaaggaa gaaggtgaag ttaacattga attggataaa aaggatgaat ttggtgttac 7080
ttggaccttg cacccagaaa gcaacatcaa cgacagaatt acctacggtc aagtcgatgg 7140
taacaaggta tccaacaagg tcaaattgag accaggtaag tactacttgt tggtttacaa 7200
gtacagtggt tccggtaact acgaattgcg ggttaacaag caccaccacc accaccacta 7260
agctagcctc gagtctagaa actaagatta atataattat ataaaaatat tatcttcttt 7320
tctttatatc tagtgttatg taaaataaat tgatgactac ggaaagcttt tttatattgt 7380
ttctttttca ttctgagcca cttaaatttc gtgaatgttc ttataaggga cggtagattt 7440
acaagtgata caacaaaaag caaggcgctt tttctaataa aaagaagaaa agcatttaac 7500
aattgaacac ctctatatca acgaagaata ttactttgtc tctaaatcct tgtaaaatgt 7560
gtacgatctc tatatgggtt actcagaagt gtaccgaaga ctgcattgaa agtttatgtt 7620
ttttcactgc aagcgtcatt ttcgctttga gaagatgttc ttattcaaat ttcaactgtt 7680
atatagaaga gcaaaaaatt gccaaaaaaa acaacattta ttcatttaaa atataaaatt 7740
tgggcttcta tattttaata ttgcttttca attactgtta ttaaatgtaa gtactgcgtc 7800
tatgaaaata tatgcaaatg ctaagaaaaa tcctaaaaat ttgaatatga gatattcctc 7860
agtatttctt tttcatcctt tcttctgcgg ctctagccct ttgttctctc atcaatctgc 7920
gtctctgttc atcggtcaaa gaattcaaat tttgttgctg aattgaagga ataacgcgtg 7980
tacgcatgta acattatact gaaaaccttg cttgagaagg ttttgggacg ctcgaagatc 8040
ctccggatcg tttcgccggc gtttatccag ctgcattaat gaatcggcca acgcgcgggg 8100
agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 8160
gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 8220
gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 8280
cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 8340
aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 8400
tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 8460
ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat 8520
ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 8580
cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 8640
ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 8700
gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 8760
atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 8820
aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 8880
aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 8940
gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 9000
cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 9060
gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 9120
tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 9180
ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 9240
ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 9300
atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 9360
cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 9420
tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 9480
aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 9540
tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 9600
ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 9660
agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa 9720
gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 9780
agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 9840
accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 9900
gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 9960
cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 10020
ggggttccgc gcacatttcc ccgaaaagtg ccacctgaac gaagcatctg tgcttcattt 10080
tgtagaacaa aaatgcaacg cgagagcgct aatttttcaa acaaagaatc tgagctgcat 10140
ttttacagaa cagaaatgca acgcgaaagc gctattttac caacgaagaa tctgtgcttc 10200
atttttgtaa aacaaaaatg caacgcgaga gcgctaattt ttcaaacaaa gaatctgagc 10260
tgcattttta cagaacagaa atgcaacgcg agagcgctat tttaccaaca aagaatctat 10320
acttcttttt tgttctacaa aaatgcatcc cgagagcgct atttttctaa caaagcatct 10380
tagattactt tttttctcct ttgtgcgctc tataatgcag tctcttgata actttttgca 10440
ctgtaggtcc gttaaggtta gaagaaggct actttggtgt ctattttctc ttccataaaa 10500
aaagcctgac tccacttccc gcgtttactg attactagcg aagctgcggg tgcatttttt 10560
caagataaag gcatccccga ttatattcta taccgatgtg gattgcgcat actttgtgaa 10620
cagaaagtga tagcgttgat gattcttcat tggtcagaaa attatgaacg gtttcttcta 10680
ttttgtctct atatactacg tataggaaat gtttacattt tcgtattgtt ttcgattcac 10740
tctatgaata gttcttacta caattttttt gtctaaagag taatactaga gataaacata 10800
aaaaatgtag aggtcgagtt tagatgcaag ttcaaggagc gaaaggtgga tgggtaggtt 10860
atatagggat atagcacaga gatatatagc aaagagatac ttttgagcaa tgtttgtgga 10920
agcggtattc gcaatatttt agtagctcgt tacagtccgg tgcgtttttg gttttttgaa 10980
agtgcgtctt cagagcgctt ttggttttca aaagcgctct gaagttccta tactttctag 11040
agaataggaa cttcggaata ggaacttcaa agcgtttccg aaaacgagcg cttccgaaaa 11100
tgcaacgcga gctgcgcaca tacagctcac tgttcacgtc gcacctatat ctgcgtgttg 11160
cctgtatata tatatacatg agaagaacgg catagtgcgt gtttatgctt aaatgcgtac 11220
ttatatgcgt ctatttatgt aggatgaaag gtagtctagt acctcctgtg atattatccc 11280
attccatgcg gggtatcgta tgcttccttc agcactaccc tttagctgtt ctatatgctg 11340
ccactcctca attggattag tctcatcctt caatgctatc atttcctttg atattggatc 11400
atactaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 11460
ctttcgtc 11468
<210> 8
<211> 11222
<212> DNA
<213> Artificial Sequence
<400> 8
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accataccac agcttttcaa ttcaattcat catttttttt ttattctttt ttttgatttc 240accataccac agcttttcaa ttcaattcat catttttttt ttattctttt ttttgatttc 240
ggtttctttg aaattttttt gattcggtaa tctccgaaca gaaggaagaa cgaaggaagg 300ggtttctttg aaattttttt gattcggtaa tctccgaaca gaaggaagaa cgaaggaagg 300
agcacagact tagattggta tatatacgca tatgtagtgt tgaagaaaca tgaaattgcc 360agcacagact tagattggta tatatacgca tatgtagtgt tgaagaaaca tgaaattgcc 360
cagtattctt aacccaactg cacagaacaa aaacctgcag gaaacgaaga taaatctaaa 420cagtattctt aacccaactg cacagaacaa aaacctgcag gaaacgaaga taaatctaaa 420
aaactgtatt ataagtaaat gcatgtatac taaactcaca aattagagct tcaatttaat 480aaactgtatt ataagtaaat gcatgtatac taaactcaca aattagagct tcaatttaat 480
tatatcagtt attaccctat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 540tatatcagtt attaccctat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 540
cgcatcagga aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 600cgcatcagga aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 600
tcagctcatt ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 660tcagctcattttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 660
agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 720agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 720
tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 780tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 780
catcacccta atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta 840catcacccta atcaagttttttggggtcga ggtgccgtaa agcactaaat cggaacccta 840
aagggagccc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 900aagggagccc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 900
ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 960ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 960
taaccaccac acccgccgcg cttaatgcgc cgctacaggg cgcgtccatt cgccattcag 1020
gctgcgcaac tgttgggaag ggcgatcggt gcgggcctct tcgctattac gccagctgga 1080
taaaggcgcg ccgtcgacaa tatagaaatt tcttagtttt aatgattgaa tgaatccatt 1140
tcgaacacgt caatgtactt ggacaagctt tgtagaacat gaagtacttg agcaatggac 1200
gcccaactgg aatttacttt agcgacacta ttccaagacc agcaactcaa agaagtaaat 1260
tctttttttc acaatagttg tttttttgat gatgtacgtc ccagcagcac ctaatggtat 1320
ttttatgagt attggcagtt tgccttatta attaagcaaa ccggtcatat attttaaaaa 1380
ataggaactt tgtagtcctt attaatgctg aataaaaaaa taggaagaga aatcagcaag 1440
tataaacagt taaatgtttt ttaactataa tggttgcatt caagaggaca tgtaaattgc 1500
gtactcggta caagttggta aaaacgttaa agcactccta cgagatttag tcagtcctgg 1560
cattaataaa caaaaagacc tgtatctata tttcatgaac tgggtaaaat caccacacaa 1620
aaaagaataa acaaatataa tgaactactt ttggtcttgg actcgcagtt tatagtttgt 1680
taaaattgac aaacacacat gttctgatta acttacttcc gcttcaaagt actgaaaaca 1740
tcatacgaac tggtataaga aggaagatag caaataaaaa taaatcaaat attgcttgta 1800
aaaaattagg atataaaaaa atctctattc attaaagaca aaaagctaaa ggaataaaaa 1860
agtcattcgc tttacgccaa ataaaacgtc tatgcacgtt tttaaaggct gtgaacatta 1920
acaatgttgt ggaattcagg cttgagagaa gcaccgccaa ccaagaaacc gtcaatatcg 1980
tggaacttga ggaactcctt gcagttacca ccgttaacgg aaccaccgta gatgacacgg 2040
agaccctcgg caacagatgc accaagcttg ttggtagccc acttgcggat ctcagcgtga 2100
acctcttgag cttgctcagg ggtggcagtc ttaccagtac caatggccca gacaggctca 2160
taagcaatga caatcttgga ccagttctgg accttgtcag cgatggcgtt caattgacga 2220
acaacaacgt tgatggtctc gttagcctca cgctcggcca aagtctcacc aatgcaggca 2280
acgacagtaa gaccttgttc aagggcaaac ttggtcttgt cggcaacgaa ctcgtcagac 2340
tccttgaaga tggtacgacg ctcggagtga ccagtcaaag tgtaggtaat accagcatca 2400
atcaaagatt gagcactgtt ctcaccagtg taggcaccgt tcttcttgtc gaagacgttt 2460
tgggcaccaa cgccaatatc cttcttgact tgttggcggg tggtgatgag gtacatgttt 2520
tgagggaaga tgacagtttc gacatcacca acgttaagct tggtggtgtt caaaccctca 2580
ataatagtct tcatggactc caaagagcca ttcatcttaa agttaccacc gacaaagaat 2640
ttacgtgcca ttttgatcct agttctacta ttgaaatgta gttggggaga gacgagagtt 2700
ggcccgtccg ctcattatat ataacgtagc ggacagtcac acagttaagg ggaattaccg 2760
agcttcggca atttaccccg tcgatagcaa ccgttggcat ggatccggcc ggccagatct 2820
acgtatggtc atttcttctt cagattccct catggagaaa gtgcggcaga tgtatatgac 2880
agagtcgcca gtttccaaga gactttattc aggcacttcc atgataggca agagagaaga 2940
cccagagatg ttgttgtcct agttacacat ggtatttatt ccagagtatt cctgatgaaa 3000
tggtttagat ggacatacga agagtttgaa tcgtttacca atgttcctaa cgggagcgta 3060
atggtgatgg aactggacga atccatcaat agatacgtcc tgaggaccgt gctacccaaa 3120
tggactgatt gtgagggaga cctaactaca tagtgtttaa agattacgga tatttaactt 3180
acttagaata atgccatttt tttgagttat aataatccta cgttagtgtg agcgggattt 3240
aaactgtgag gaccttaata cattcagaca cttctgcggt atcaccctac ttattccctt 3300
cgagattata tctaggaacc catcaggttg gtggaagatt acccgttcta agacttttca 3360
gcttcctcta ttgatgttac acctggacac cccttttctg gcatccagtt tttaatcttc 3420
agtggcatgt gagattctcc gaaattaatt aaagcaatca cacaattctc tcggatacca 3480
cctcggttga aactgacagg tggtttgtta cgcatgctaa tgcaaaggag cctatatacc 3540
tttggctcgg ctgctgtaac agggaatata aagggcagca taatttagga gtttagtgaa 3600
cttgcaacat ttactatttt cccttcttac gtaaatattt ttctttttaa ttctaaatca 3660
atctttttca attttttgtt tgtattcttt tcttgcttaa atctataact acaaaaaaca 3720
catacataaa ctaaaaggta ccaacaaaat gagatttcca tctattttta ctgctgtttt 3780
gtttgctgct tcttctgctt tggctgctcc agttaatact actactgaag atgaaactgc 3840
tcaaattcca gctgaagctg ttattggtta ttctgatttg gagggtgact ttgatgttgc 3900
tgttttgcca ttttctaact ctactaacaa cggtttgcta ttcatcaaca ctactatcgc 3960
ttctatcgct gctaaagaag aaggtgtttc tttggataaa agagaagaag gtgaaccaaa 4020
agctgttgac aagaacaatg ccactgctgc cgtccaaaac gaatccaagc gttacactgt 4080
ttcttactta aagaccttga actattacga cttggtcgat ctattggtta agaccgaaat 4140
cgaaaactta ccagacttgt ttcaatactc ttccgacgct aaggaatttt acggtaacaa 4200
gactagaatg tcctttatca tggacgaaat tggtcgtaga gctccacaat acactgaaat 4260
tgaccacaag ggtataccaa ccctcgttga agtcgtcaga gccgggttct acttgggttt 4320
tcacaacaaa gaattgaatg aaatcaacaa gagatctttc aaagaaagag ttatcccgtc 4380
tatcttggct atccaaaaga acccaaactt caaattaggt actgaagttc aagacaagat 4440
tgtctctgct accggtctat tggctggtaa cgaaaccgct ccaccagaag ttgttaacaa 4500
cttcacccca atcttgcaag attgtatcaa gaacatcgac agatacgcct tggacgactt 4560
gaaaagtaaa gcattgttca acgttttggc cgccccaacc tacgacatta ctgaatacct 4620
cagagctacc aaggaaaagc cagaaaacac tccatggtac ggtaagatcg atggtttcat 4680
caacgaattg aagaagctcg ctttgtacgg taaaattaac gacaacaact cttggattat 4740
cgataacggt atttaccaca ttgctccatt gggtaaattg cactctaaca acaaaatcgg 4800
tatcgaaact ctaactgaag ttatgaaggt ctacccatac ttgtccatgc aacacttgca 4860
atccgctgac caaatcaaga gacactacga ctctaaggat gccgaaggta acaagatccc 4920
attagataag ttcaagaagg aaggcaagga aaagtactgc ccaaagacct acacttttga 4980
tgatggtaag gttatcatca aggccggtgc tcgtgtcgaa gaagaaaagg tcaaaagatt 5040
atactgggct tccaaggaag tcaacagtca attcttcaga gtttatggta ttgacaagcc 5100
attggaagaa ggtaacccag acgacatcct taccatggtt atttacaact ctccagaaga 5160
atacaaactt aactccgtct tgtacggtta cgataccaac aacggtggta tgtacattga 5220
accagaaggt acttttttca cttacgaaag agaagctcaa gaatctacgt acactttgga 5280
agaattattc agacacgaat acacccatta cttgcaaggt agatacgctg tcccaggtca 5340
atggggtcgt accaagttgt atgataacga cagattgacc tggtacgaag aaggtggtgc 5400
tgaactgttt gctggttcta ctagaacttc cggcatcttg ccaagaaagt ctattgtctc 5460
taacattcac aacaccacta gaaacaacag atacaagctg tctgacactg ttcattcgaa 5520
gtacggtgcc tctttcgaat tttacaacta cgcttgtatg ttcatggact acatgtacaa 5580
taaggacatg ggtattttga acaagttgaa tgatttggct aagaacaacg acgttgacgg 5640
ttacgacaat tacatcagag atttgtcatc taattacgct ttgaacgata agtaccaaga 5700
tcacatgcaa gaacgtattg acaactatga aaacttgacc gttccattcg ttgctgacga 5760
ctacttagtt agacacgctt acaagaaccc aaacgaaatt tactccgaaa tctccgaagt 5820
tgccaagttg aaggatgcta agtctgaagt taagaaatct caatacttct caactttcac 5880
tttgcgtggt agttacaccg gaggtgcttc caagggtaag ctggaagatc aaaaagctat 5940
gaacaagttc attgacgact ctttgaagaa gttggacacc tattcctggt ctggttacaa 6000
gactcttact gcttatttta ccaactacaa ggtcgattct tctaacagag ttacttacga 6060
tgtcgttttc catggttact tgccaaacga aggtgattcc aagaacagct tgccatacgg 6120
taagattaac ggtacttaca agggtactga gaaggaaaag attaagttct cctccgaagg 6180
ttctttcgac ccagacggaa agatagtttc ctacgaatgg gacttcggtg atggtaacaa 6240
gtccaacgaa gagaaccctg aacacagtta cgataaggtc ggtacttaca ctgtcaagtt 6300
gaaggttacc gatgataagg gtgaaagctc tgtttctact accactgctg aaatcaaaga 6360
tttgtctgaa aacaagttgc cagtcatcta tatgcacgtt cctaaatcgg gtgctttaaa 6420
ccaaaaggtt gtcttctacg gtaagggtac ttacgaccca gacggttcta tcgctggcta 6480
ccaatgggat ttcggtgacg gttctgactt ttcctctgaa cagaacccat ctcacgtcta 6540
caccaagaaa ggtgaataca ctgtgacctt gagagtcatg gattcttccg gtcaaatgtc 6600
tgaaaagacc atgaaaatca agatcactga tccagtatac cctattggta ctgaaaagga 6660
accaaacaac tctaaggaaa cagcctccgg tccaatcgtt ccgggtattc cagtctccgg 6720
tactatcgaa aacacctctg atcaagacta tttctacttc gatgttatta ctccaggtga 6780
agtcaagatc gacatcaaca agctaggtta cggtggtgct acatgggttg tttacgatga 6840
aaacaataac gctgtctcct acgctaccga tgacggtcaa aatttgtctg gtaaattcaa 6900
ggctgacaag cctgggagat actacattca tttgtacatg ttcaatggtt cttacatgcc 6960
atacagaatt aacattgaag gttctgttgg tagacaccac caccaccacc actaagctag 7020
cctcgagtct agaaactaag attaatataa ttatataaaa atattatctt cttttcttta 7080
tatctagtgt tatgtaaaat aaattgatga ctacggaaag cttttttata ttgtttcttt 7140
ttcattctga gccacttaaa tttcgtgaat gttcttataa gggacggtag atttacaagt 7200
gatacaacaa aaagcaaggc gctttttcta ataaaaagaa gaaaagcatt taacaattga 7260
acacctctat atcaacgaag aatattactt tgtctctaaa tccttgtaaa atgtgtacga 7320
tctctatatg ggttactcag aagtgtaccg aagactgcat tgaaagttta tgttttttca 7380
ctgcaagcgt cattttcgct ttgagaagat gttcttattc aaatttcaac tgttatatag 7440
aagagcaaaa aattgccaaa aaaaacaaca tttattcatt taaaatataa aatttgggct 7500
tctatatttt aatattgctt ttcaattact gttattaaat gtaagtactg cgtctatgaa 7560
aatatatgca aatgctaaga aaaatcctaa aaatttgaat atgagatatt cctcagtatt 7620
tctttttcat cctttcttct gcggctctag ccctttgttc tctcatcaat ctgcgtctct 7680
gttcatcggt caaagaattc aaattttgtt gctgaattga aggaataacg cgtgtacgca 7740
tgtaacatta tactgaaaac cttgcttgag aaggttttgg gacgctcgaa gatcctccgg 7800
atcgtttcgc cggcgtttat ccagctgcat taatgaatcg gccaacgcgc ggggagaggc 7860
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 7920
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 7980
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 8040
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 8100
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 8160
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 8220
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 8280
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 8340
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 8400
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 8460
gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 8520
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 8580
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 8640
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 8700
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 8760
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 8820
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 8880
gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 8940
agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 9000
cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 9060
tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 9120
gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 9180
agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 9240
gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 9300
atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 9360
gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 9420
tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 9480
atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 9540
agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 9600
gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 9660
cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 9720
tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 9780
ccgcgcacat ttccccgaaa agtgccacct gaacgaagca tctgtgcttc attttgtaga 9840
acaaaaatgc aacgcgagag cgctaatttt tcaaacaaag aatctgagct gcatttttac 9900
agaacagaaa tgcaacgcga aagcgctatt ttaccaacga agaatctgtg cttcattttt 9960
gtaaaacaaa aatgcaacgc gagagcgcta atttttcaaa caaagaatct gagctgcatt 10020
tttacagaac agaaatgcaa cgcgagagcg ctattttacc aacaaagaat ctatacttct 10080
tttttgttct acaaaaatgc atcccgagag cgctattttt ctaacaaagc atcttagatt 10140
actttttttc tcctttgtgc gctctataat gcagtctctt gataactttt tgcactgtag 10200
gtccgttaag gttagaagaa ggctactttg gtgtctattt tctcttccat aaaaaaagcc 10260
tgactccact tcccgcgttt actgattact agcgaagctg cgggtgcatt ttttcaagat 10320
aaaggcatcc ccgattatat tctataccga tgtggattgc gcatactttg tgaacagaaa 10380
gtgatagcgt tgatgattct tcattggtca gaaaattatg aacggtttct tctattttgt 10440
ctctatatac tacgtatagg aaatgtttac attttcgtat tgttttcgat tcactctatg 10500
aatagttctt actacaattt ttttgtctaa agagtaatac tagagataaa cataaaaaat 10560
gtagaggtcg agtttagatg caagttcaag gagcgaaagg tggatgggta ggttatatag 10620
ggatatagca cagagatata tagcaaagag atacttttga gcaatgtttg tggaagcggt 10680
attcgcaata ttttagtagc tcgttacagt ccggtgcgtt tttggttttt tgaaagtgcg 10740
tcttcagagc gcttttggtt ttcaaaagcg ctctgaagtt cctatacttt ctagagaata 10800
ggaacttcgg aataggaact tcaaagcgtt tccgaaaacg agcgcttccg aaaatgcaac 10860
gcgagctgcg cacatacagc tcactgttca cgtcgcacct atatctgcgt gttgcctgta 10920
tatatatata catgagaaga acggcatagt gcgtgtttat gcttaaatgc gtacttatat 10980
gcgtctattt atgtaggatg aaaggtagtc tagtacctcc tgtgatatta tcccattcca 11040
tgcggggtat cgtatgcttc cttcagcact accctttagc tgttctatat gctgccactc 11100
ctcaattgga ttagtctcat ccttcaatgc tatcatttcc tttgatattg gatcatacta 11160
agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg 11220
tc 11222
<210> 9
<211> 11462
<212> DNA
<213> Artificial Sequence
<400> 9
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accataccac agcttttcaa ttcaattcat catttttttt ttattctttt ttttgatttc 240
ggtttctttg aaattttttt gattcggtaa tctccgaaca gaaggaagaa cgaaggaagg 300
agcacagact tagattggta tatatacgca tatgtagtgt tgaagaaaca tgaaattgcc 360
cagtattctt aacccaactg cacagaacaa aaacctgcag gaaacgaaga taaatctaaa 420
aaactgtatt ataagtaaat gcatgtatac taaactcaca aattagagct tcaatttaat 480
tatatcagtt attaccctat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 540
cgcatcagga aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 600
tcagctcatt ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 660
agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 720
tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 780
catcacccta atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta 840
aagggagccc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 900
ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 960
taaccaccac acccgccgcg cttaatgcgc cgctacaggg cgcgtccatt cgccattcag 1020
gctgcgcaac tgttgggaag ggcgatcggt gcgggcctct tcgctattac gccagctgga 1080
taaaggcgcg ccgtcgacaa tatagaaatt tcttagtttt aatgattgaa tgaatccatt 1140
tcgaacacgt caatgtactt ggacaagctt tgtagaacat gaagtacttg agcaatggac 1200
gcccaactgg aatttacttt agcgacacta ttccaagacc agcaactcaa agaagtaaat 1260
tctttttttc acaatagttg tttttttgat gatgtacgtc ccagcagcac ctaatggtat 1320
ttttatgagt attggcagtt tgccttatta attaagcaaa ccggtcatat attttaaaaa 1380
ataggaactt tgtagtcctt attaatgctg aataaaaaaa taggaagaga aatcagcaag 1440
tataaacagt taaatgtttt ttaactataa tggttgcatt caagaggaca tgtaaattgc 1500
gtactcggta caagttggta aaaacgttaa agcactccta cgagatttag tcagtcctgg 1560
cattaataaa caaaaagacc tgtatctata tttcatgaac tgggtaaaat caccacacaa 1620
aaaagaataa acaaatataa tgaactactt ttggtcttgg actcgcagtt tatagtttgt 1680
taaaattgac aaacacacat gttctgatta acttacttcc gcttcaaagt actgaaaaca 1740
tcatacgaac tggtataaga aggaagatag caaataaaaa taaatcaaat attgcttgta 1800
aaaaattagg atataaaaaa atctctattc attaaagaca aaaagctaaa ggaataaaaa 1860
agtcattcgc tttacgccaa ataaaacgtc tatgcacgtt tttaaaggct gtgaacatta 1920
acaatgttgt ggaattcagg cttgagagaa gcaccgccaa ccaagaaacc gtcaatatcg 1980
tggaacttga ggaactcctt gcagttacca ccgttaacgg aaccaccgta gatgacacgg 2040
agaccctcgg caacagatgc accaagcttg ttggtagccc acttgcggat ctcagcgtga 2100
acctcttgag cttgctcagg ggtggcagtc ttaccagtac caatggccca gacaggctca 2160
taagcaatga caatcttgga ccagttctgg accttgtcag cgatggcgtt caattgacga 2220
acaacaacgt tgatggtctc gttagcctca cgctcggcca aagtctcacc aatgcaggca 2280
acgacagtaa gaccttgttc aagggcaaac ttggtcttgt cggcaacgaa ctcgtcagac 2340
tccttgaaga tggtacgacg ctcggagtga ccagtcaaag tgtaggtaat accagcatca 2400
atcaaagatt gagcactgtt ctcaccagtg taggcaccgt tcttcttgtc gaagacgttt 2460
tgggcaccaa cgccaatatc cttcttgact tgttggcggg tggtgatgag gtacatgttt 2520
tgagggaaga tgacagtttc gacatcacca acgttaagct tggtggtgtt caaaccctca 2580
ataatagtct tcatggactc caaagagcca ttcatcttaa agttaccacc gacaaagaat 2640
ttacgtgcca ttttgatcct agttctacta ttgaaatgta gttggggaga gacgagagtt 2700
ggcccgtccg ctcattatat ataacgtagc ggacagtcac acagttaagg ggaattaccg 2760
agcttcggca atttaccccg tcgatagcaa ccgttggcat ggatccggcc ggccagatct 2820
acgtatggtc atttcttctt cagattccct catggagaaa gtgcggcaga tgtatatgac 2880
agagtcgcca gtttccaaga gactttattc aggcacttcc atgataggca agagagaaga 2940
cccagagatg ttgttgtcct agttacacat ggtatttatt ccagagtatt cctgatgaaa 3000
tggtttagat ggacatacga agagtttgaa tcgtttacca atgttcctaa cgggagcgta 3060
atggtgatgg aactggacga atccatcaat agatacgtcc tgaggaccgt gctacccaaa 3120
tggactgatt gtgagggaga cctaactaca tagtgtttaa agattacgga tatttaactt 3180
acttagaata atgccatttt tttgagttat aataatccta cgttagtgtg agcgggattt 3240
aaactgtgag gaccttaata cattcagaca cttctgcggt atcaccctac ttattccctt 3300
cgagattata tctaggaacc catcaggttg gtggaagatt acccgttcta agacttttca 3360
gcttcctcta ttgatgttac acctggacac cccttttctg gcatccagtt tttaatcttc 3420
agtggcatgt gagattctcc gaaattaatt aaagcaatca cacaattctc tcggatacca 3480
cctcggttga aactgacagg tggtttgtta cgcatgctaa tgcaaaggag cctatatacc 3540
tttggctcgg ctgctgtaac agggaatata aagggcagca taatttagga gtttagtgaa 3600
cttgcaacat ttactatttt cccttcttac gtaaatattt ttctttttaa ttctaaatca 3660
atctttttca attttttgtt tgtattcttt tcttgcttaa atctataact acaaaaaaca 3720
catacataaa ctaaaaggta ccaacaaaat gagatttcca tctattttta ctgctgtttt 3780
gtttgctgct tcttctgctt tggctgctcc agttaatact actactgaag atgaaactgc 3840
tcaaattcca gctgaagctg ttattggtta ttctgatttg gagggtgact ttgatgttgc 3900
tgttttgcca ttttctaact ctactaacaa cggtttgcta ttcatcaaca ctactatcgc 3960
ttctatcgct gctaaagaag aaggtgtttc tttggataaa agagaggctg aagctaagcc 4020
aatcgaaaac actaacgaca ccagcatcaa gaacgttgaa aagttgagaa acgccccaaa 4080
cgaagaaaac tctaagaaag ttgaagactc taagaatgat aaggttgaac acgttaagaa 4140
cattgaagaa gctaaggtcg aacaagttgc tccagaagtc aagtctaaaa gtaccttgag 4200
atctgcttct atagctaata caaattccga aaagtacgac ttcgaatatc ttaatggttt 4260
gagttacacc gaattgacta acttgatcaa gaacatcaag tggaaccaaa tcaacggttt 4320
attcaactac tccaccggtt cgcaaaaatt tttcggtgac aagaacagag tccaagccat 4380
catcaacgct ctacaagaat ctggtagaac ttacactgcc aatgacatga agggtattga 4440
aactttcacc gaagttttga gagccggttt ttacttgggc tattacaacg atggtctctc 4500
ctacctgaac gaccgtaatt tccaagacaa gtgtatccca gccatgattg ctattcaaaa 4560
gaacccaaac ttcaagttag gtactgctgt tcaagacgaa gttatcacat ctttgggtaa 4620
gttgattggt aacgcctctg ctaacgctga agttgttaat aactgtgtcc cagttttgaa 4680
gcaattcaga gaaaacttaa accaatacgc tccagactac gtcaaaggta ctgccgtaaa 4740
tgaattgatc aagggtattg aatttgactt ttccggtgct gcttacgaaa aggacgtcaa 4800
gaccatgcca tggtacggta agattgaccc attcatcaat gaattaaagg ccttgggtct 4860gaccatgcca tggtacggta agattgaccc attcatcaat gaattaaagg ccttgggtct 4860
ctacggtaac atcacttctg ccactgaatg ggcttcagat gttggtatct actacttgtc 4920ctacggtaac atcacttctg ccactgaatg ggcttcagat gttggtatct actacttgtc 4920
taagttcggt ttgtactcaa ccaatcgtaa cgacattgtt caatccttag aaaaggctgt 4980taagttcggt ttgtactcaa ccaatcgtaa cgacattgtt caatccttag aaaaggctgt 4980
tgatatgtac aaatatggta agatcgcttt cgttgctatg gaaagaatca cctgggatta 5040tgatatgtac aaatatggta agatcgcttt cgttgctatg gaaagaatca cctgggatta 5040
cgatggtatc ggttccaacg ggaagaaggt cgaccacgat aagttcttgg acgatgcaga 5100cgatggtatc ggttccaacg ggaagaaggt cgaccacgat aagttcttgg acgatgcaga 5100
aaagcattat ttgccaaaga cctatacttt cgacaacgga accttcatca tcagagctgg 5160aaagcattatttgccaaaga cctatacttt cgacaacgga accttcatca tcagagctgg 5160
agataaggtt tctgaagaaa agattaagag attgtactgg gcttcccgtg aagttaagtc 5220agataaggtt tctgaagaaa agattaagag attgtactgg gcttcccgtg aagttaagtc 5220
tcaattccac agagttgttg gtaacgataa ggctttggaa gtgggtaacg ctgacgatgt 5280tcaattccac agagttgttg gtaacgataa ggctttggaa gtgggtaacg ctgacgatgt 5280
cttgaccatg aaaatcttca actctccaga agaatacaag ttcaacacca acattaacgg 5340cttgaccatg aaaatcttca actctccaga agaatacaag ttcaacacca acattaacgg 5340
tgtctccact gataacggtg gtttgtacat cgaaccaaga ggtactttct acacttacga 5400tgtctccact gataacggtg gtttgtacat cgaaccaaga ggtactttct acacttacga 5400
aagaacccca caacaatcta ttttctcctt ggaagaactg ttcagacacg aatacaccca 5460aagaacccca caacaatcta ttttctcctt ggaagaactg ttcagacacg aatacaccca 5460
ctacttgcaa gctagatact tggttgacgg tttgtggggt caaggtcctt tctacgaaaa 5520ctacttgcaa gctagatact tggttgacgg tttgtggggt caaggtcctt tctacgaaaa 5520
aaacagattg acttggttcg atgaaggtac tgctgaattt ttcgctggtt ccaccagaac 5580aaacagattg acttggttcg atgaaggtac tgctgaattt ttcgctggtt ccaccagaac 5580
ttctggtgtg ttgccaagaa agtccatctt gggttacctt gccaaggata aggtcgatca 5640ttctggtgtg ttgccaagaa agtccatctt gggttacctt gccaaggata aggtcgatca 5640
tagatactcc ttgaagaaaa ctttgaactc tggttacgat gactctgact ggatgttcta 5700tagatactcc ttgaagaaaa ctttgaactc tggttacgat gactctgact ggatgttcta 5700
caactacggt tttgctgttg ctcactactt atacgaaaag gacatgccaa cttttattaa 5760caactacggt tttgctgttg ctcactactt atacgaaaag gacatgccaa cttttattaa 5760
gatgaacaag gccatcttga acactgatgt taagtcttac gacgaaatca ttaaaaaact 5820gatgaacaag gccatcttga acactgatgt taagtcttac gacgaaatca ttaaaaaact 5820
gtctgacgac gctaacaaga acactgaata ccaaaaccac atccaagaac tcgctgataa 5880gtctgacgac gctaacaaga acactgaata ccaaaaccac atccaagaac tcgctgataa 5880
gtaccaaggt gccggtatac cattggtgtc tgatgactac ctgaaggacc atggttacaa 5940gtaccaaggt gccggtatac cattggtgtc tgatgactac ctgaaggacc atggttacaa 5940
gaaggcttcc gaagtttact cagaaatctc caaggctgcg tctttaacta acacttctgt 6000gaaggcttcc gaagtttact cagaaatctc caaggctgcg tctttaacta acacttctgt 6000
taccgctgaa aagtctcaat acttcaacac attcactttg aggggtactt acactggtga 6060taccgctgaa aagtctcaat acttcaacac attcactttg aggggtactt acactggtga 6060
aacctctaag ggtgaattta aggactggga cgaaatgtct aagaagttgg acggtacttt 6120aacctctaag ggtgaattta aggactggga cgaaatgtct aagaagttgg acggtacttt 6120
ggagtcttta gctaagaact cttggtctgg ttacaagact ttgaccgctt acttcaccaa 6180ggagtcttta gctaagaact cttggtctgg ttacaagact ttgaccgctt acttcaccaa 6180
ctacagagtc acttctgata acaaagtcca atatgatgtt gtcttccacg gtgttttgac 6240ctacagagtc acttctgata acaaagtcca atatgatgtt gtcttccacg gtgttttgac 6240
tgacaacgcc gatatttcta acaacaaggc tcctattgct aaggtcaccg gtccatctac 6300tgacaacgcc gatatttcta acaacaaggc tcctattgct aaggtcaccg gtccatctac 6300
tggtgctgtt ggtagaaaca ttgaattttc tggtaaggat tccaaggacg aagatggcaa 6360tggtgctgtt ggtagaaaca ttgaattttc tggtaaggat tccaaggacg aagatggcaa 6360
gattgtctct tatgactggg atttcggtga tggtgctacc tccagaggta agaactctgt 6420gattgtctct tatgactggg atttcggtga tggtgctacc tccagaggta agaactctgt 6420
ccacgcttac aagaaggctg gtacttacaa cgtcactttg aaggtcaccg atgacaaggg 6480ccacgcttac aagaaggctg gtacttacaa cgtcactttg aaggtcaccg atgacaaggg 6480
tgctacggct actgaaagtt tcactattga aattaaaaac gaagacacta ccaccccaat 6540tgctacggct actgaaagtt tcactattga aattaaaaac gaagacacta ccaccccaat 6540
caccaaggaa atggaaccaa acgatgacat taaggaagcc aacggcccga ttgtcgaagg 6600caccaaggaa atggaaccaa acgatgacat taaggaagcc aacggcccga ttgtcgaagg 6600
cgtcacggtc aaaggtgact tgaacggttc tgatgacgct gacactttct actttgatgt 6660cgtcacggtc aaaggtgact tgaacggttc tgatgacgct gacactttct actttgatgt 6660
caaggaagac ggtgacgtta ccatcgaatt gccatactct ggttcctcta actttacctg 6720caaggaagac ggtgacgtta ccatcgaatt gccatactct ggttcctcta actttacctg 6720
gttagtctac aaggaaggtg acgaccaaaa ccacatcgcc tccggtatcg ataaaaacaa 6780gttagtctac aaggaaggtg acgaccaaaa ccacatcgcc tccggtatcg ataaaaacaa 6780
ctccaaggtt ggtactttca agtccactaa gggtcgtcat tacgttttca tctacaagca 6840ctccaaggtt ggtactttca agtccactaa gggtcgtcat tacgttttca tctacaagca 6840
cgactctgct tccaacattt catactcctt gaacatcaag ggtttgggta acgaaaagtt 6900cgactctgct tccaacattt catactcctt gaacatcaag ggtttgggta acgaaaagtt 6900
gaaagagaag gaaaacaatg actcctctga caaggcaacc gttattccaa atttcaatac 6960gaaagagaag gaaaacaatg actcctctga caaggcaacc gttattccaa atttcaatac 6960
caccatgcaa ggttctctat taggggacga ttctagagac tactactcct tcgaagtcaa 7020caccatgcaa ggttctctat taggggacga ttctagagac tactactcct tcgaagtcaa 7020
ggaagaaggt gaagttaaca ttgaattgga taaaaaggat gaatttggtg ttacttggac 7080ggaagaaggt gaagttaaca ttgaattgga taaaaaggat gaatttggtg ttacttggac 7080
cttgcaccca gaaagcaaca tcaacgacag aattacctac ggtcaagtcg atggtaacaa 7140cttgcaccca gaaagcaaca tcaacgacag aattacctac ggtcaagtcg atggtaacaa 7140
ggtatccaac aaggtcaaat tgagaccagg taagtactac ttgttggttt acaagtacag 7200ggtatccaac aaggtcaaat tgagaccagg taagtactac ttgttggttt acaagtacag 7200
tggttccggt aactacgaat tgcgggttaa caagcaccac caccaccacc actaagctag 7260tggttccggt aactacgaat tgcgggttaa caagcaccac caccaccacc actaagctag 7260
cctcgagtct agaaactaag attaatataa ttatataaaa atattatctt cttttcttta 7320cctcgagtct agaaactaag attaatataa ttatataaaa atattatctt cttttcttta 7320
tatctagtgt tatgtaaaat aaattgatga ctacggaaag cttttttata ttgtttcttt 7380tatctagtgt tatgtaaaat aaattgatga ctacggaaag ctttttttata ttgtttcttt 7380
ttcattctga gccacttaaa tttcgtgaat gttcttataa gggacggtag atttacaagt 7440ttcattctga gccacttaaa tttcgtgaat gttcttataa gggacggtag atttacaagt 7440
gatacaacaa aaagcaaggc gctttttcta ataaaaagaa gaaaagcatt taacaattga 7500gatacaacaa aaagcaaggc gctttttcta ataaaaagaa gaaaagcatt taacaattga 7500
acacctctat atcaacgaag aatattactt tgtctctaaa tccttgtaaa atgtgtacga 7560acacctctat atcaacgaag aatattactt tgtctctaaa tccttgtaaa atgtgtacga 7560
tctctatatg ggttactcag aagtgtaccg aagactgcat tgaaagttta tgttttttca 7620tctctatatg ggttactcag aagtgtaccg aagactgcat tgaaagttta tgttttttca 7620
ctgcaagcgt cattttcgct ttgagaagat gttcttattc aaatttcaac tgttatatag 7680ctgcaagcgt cattttcgct ttgagaagat gttcttattc aaatttcaac tgttatatag 7680
aagagcaaaa aattgccaaa aaaaacaaca tttattcatt taaaatataa aatttgggct 7740aagagcaaaa aattgccaaa aaaaacaaca tttattcatt taaaatataa aatttgggct 7740
tctatatttt aatattgctt ttcaattact gttattaaat gtaagtactg cgtctatgaa 7800tctatatttt aatattgctt ttcaattact gttattaaat gtaagtactg cgtctatgaa 7800
aatatatgca aatgctaaga aaaatcctaa aaatttgaat atgagatatt cctcagtatt 7860aatatatgca aatgctaaga aaaatcctaa aaatttgaat atgagatatt cctcagtatt 7860
tctttttcat cctttcttct gcggctctag ccctttgttc tctcatcaat ctgcgtctct 7920tctttttcat cctttcttct gcggctctag ccctttgttc tctcatcaat ctgcgtctct 7920
gttcatcggt caaagaattc aaattttgtt gctgaattga aggaataacg cgtgtacgca 7980gttcatcggt caaagaattc aaattttgtt gctgaattga aggaataacg cgtgtacgca 7980
tgtaacatta tactgaaaac cttgcttgag aaggttttgg gacgctcgaa gatcctccgg 8040tgtaacatta tactgaaaac cttgcttgag aaggttttgg gacgctcgaa gatcctccgg 8040
atcgtttcgc cggcgtttat ccagctgcat taatgaatcg gccaacgcgc ggggagaggc 8100atcgtttcgc cggcgtttat ccagctgcat taatgaatcg gccaacgcgc ggggagaggc 8100
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 8160ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 8160
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 8220cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 8220
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 8280ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 8280
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 8340aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 8340
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 8400cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 8400
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 8460cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 8460
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 8520gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 8520
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 8580tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 8580
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 8640cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 8640
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 8700ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 8700
gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 8760gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 8760
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 8820gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 8820
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 8880accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 8880
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 8940ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 8940
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 9000tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 9000
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 9060aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 9060
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 9120taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 9120
gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 9180gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 9180
agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 9240agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 9240
cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 9300cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 9300
tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 9360tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 9360
gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 9420gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 9420
agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 9480agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 9480
gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 9540gttagctccttcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 9540
atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 9600atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 9600
gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 9660gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 9660
tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 9720tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 9720
atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 9780atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 9780
agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 9840agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 9840
gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 9900gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 9900
cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 9960cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 9960
tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 10020tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 10020
ccgcgcacat ttccccgaaa agtgccacct gaacgaagca tctgtgcttc attttgtaga 10080ccgcgcacat ttccccgaaa agtgccacct gaacgaagca tctgtgcttc attttgtaga 10080
acaaaaatgc aacgcgagag cgctaatttt tcaaacaaag aatctgagct gcatttttac 10140acaaaaatgc aacgcgagag cgctaatttt tcaaacaaag aatctgagct gcatttttac 10140
agaacagaaa tgcaacgcga aagcgctatt ttaccaacga agaatctgtg cttcattttt 10200agaacagaaa tgcaacgcga aagcgctatt ttaccaacga agaatctgtg cttcattttt 10200
gtaaaacaaa aatgcaacgc gagagcgcta atttttcaaa caaagaatct gagctgcatt 10260gtaaaacaaa aatgcaacgc gagagcgcta atttttcaaa caaagaatct gagctgcatt 10260
tttacagaac agaaatgcaa cgcgagagcg ctattttacc aacaaagaat ctatacttct 10320tttacagaac agaaatgcaa cgcgagagcg ctattttacc aacaaagaat ctatacttct 10320
tttttgttct acaaaaatgc atcccgagag cgctattttt ctaacaaagc atcttagatt 10380tttttgttct acaaaaatgc atcccgagag cgctattttt ctaacaaagc atcttagatt 10380
actttttttc tcctttgtgc gctctataat gcagtctctt gataactttt tgcactgtag 10440actttttttc tcctttgtgc gctctataat gcagtctctt gataactttt tgcactgtag 10440
gtccgttaag gttagaagaa ggctactttg gtgtctattt tctcttccat aaaaaaagcc 10500gtccgttaag gttagaagaa ggctactttg gtgtctattt tctcttccat aaaaaaagcc 10500
tgactccact tcccgcgttt actgattact agcgaagctg cgggtgcatt ttttcaagat 10560tgactccact tcccgcgttt actgattact agcgaagctg cgggtgcatt ttttcaagat 10560
aaaggcatcc ccgattatat tctataccga tgtggattgc gcatactttg tgaacagaaa 10620aaaggcatcc ccgattatat tctataccga tgtggattgc gcatactttg tgaacagaaa 10620
gtgatagcgt tgatgattct tcattggtca gaaaattatg aacggtttct tctattttgt 10680gtgatagcgt tgatgattct tcattggtca gaaaattatg aacggtttct tctattttgt 10680
ctctatatac tacgtatagg aaatgtttac attttcgtat tgttttcgat tcactctatg 10740ctctatatac tacgtatagg aaatgtttac attttcgtat tgttttcgat tcactctatg 10740
aatagttctt actacaattt ttttgtctaa agagtaatac tagagataaa cataaaaaat 10800aatagttctt actacaattt ttttgtctaa agagtaatac tagagataaa cataaaaaat 10800
gtagaggtcg agtttagatg caagttcaag gagcgaaagg tggatgggta ggttatatag 10860gtagaggtcg agtttagatg caagttcaag gagcgaaagg tggatgggta ggttatatag 10860
ggatatagca cagagatata tagcaaagag atacttttga gcaatgtttg tggaagcggt 10920ggatatagca cagagatata tagcaaagag atacttttga gcaatgtttg tggaagcggt 10920
attcgcaata ttttagtagc tcgttacagt ccggtgcgtt tttggttttt tgaaagtgcg 10980attcgcaata ttttagtagc tcgttacagt ccggtgcgtt tttggttttt tgaaagtgcg 10980
tcttcagagc gcttttggtt ttcaaaagcg ctctgaagtt cctatacttt ctagagaata 11040tcttcagagc gcttttggtt ttcaaaagcg ctctgaagtt cctatacttt ctagagaata 11040
ggaacttcgg aataggaact tcaaagcgtt tccgaaaacg agcgcttccg aaaatgcaac 11100ggaacttcgg aataggaact tcaaagcgtt tccgaaaacg agcgcttccg aaaatgcaac 11100
gcgagctgcg cacatacagc tcactgttca cgtcgcacct atatctgcgt gttgcctgta 11160gcgagctgcg cacatacagc tcactgttca cgtcgcacct atatctgcgt gttgcctgta 11160
tatatatata catgagaaga acggcatagt gcgtgtttat gcttaaatgc gtacttatat 11220tatatatata catgagaaga acggcatagt gcgtgtttat gcttaaatgc gtacttatat 11220
gcgtctattt atgtaggatg aaaggtagtc tagtacctcc tgtgatatta tcccattcca 11280gcgtctattt atgtaggatg aaaggtagtc tagtacctcc tgtgatatta tcccattcca 11280
tgcggggtat cgtatgcttc cttcagcact accctttagc tgttctatat gctgccactc 11340tgcggggtat cgtatgcttc cttcagcact accctttagc tgttctatat gctgccactc 11340
ctcaattgga ttagtctcat ccttcaatgc tatcatttcc tttgatattg gatcatacta 11400ctcaattgga ttagtctcat ccttcaatgc tatcatttcc tttgatattg gatcatacta 11400
agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg 11460agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg 11460
tc 11462tc 11462
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210605225.4A CN114836461B (en) | 2022-05-31 | 2022-05-31 | Recombinant plasmid expressing collagenase, yeast strain, fermentation medium and fermentation culture method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114836461A (en) | 2022-08-02 |
CN114836461B (en) | 2024-03-29 |
Family
ID=82572076
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210605225.4A Active CN114836461B (en) | 2022-05-31 | 2022-05-31 | Recombinant plasmid expressing collagenase, yeast strain, fermentation medium and fermentation culture method thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114836461B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115232843A (en) * | 2022-08-18 | 2022-10-25 | Nanjing University of Chinese Medicine | Biological synthesis method of white ketone and iso white ketone |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102216454A (en) * | 2008-11-19 | 2011-10-12 | Meiji Seika Pharma Co., Ltd. | Fusion collagenase to which affinity tag is attached, and method for producing same |
CN102559740A (en) * | 2012-02-25 | 2012-07-11 | Shandong University | Method for improving secretory expression of heterologous protein by saccharomyces cerevisiae and special saccharomyces cerevisiae strain |
CN111996131A (en) * | 2020-09-07 | 2020-11-27 | Guangxi University | Pichia pastoris of shigella delavayi for degrading ammonia nitrogen and application |
CN112955547A (en) * | 2018-06-27 | 2021-06-11 | Boehringer Ingelheim RCV GmbH & Co KG | Means and methods for increasing protein expression by use of transcription factors |
TW202130804A (en) * | 2019-10-25 | 2021-08-16 | Nagase & Co., Ltd. (Japan) | Metabolizing-enzyme-destroyed strain of aerobic bacterium, and method for culturing same |
Non-Patent Citations (4)
Title |
---|
α-Factor-directed synthesis and secretion of mature foreign proteins in Saccharomyces cerevisiae; Anthony J. Brake et al.; Proc. Natl. Acad. Sci. USA; Vol. 81; pp. 4642-4646 * |
Janowska, K. et al.; RecName: Full=Collagenase ColG; AltName: Full=Class I collagenase; AltName: Full=Gelatinase ColG; AltName: Full=Microbial collagenase; Flags: Precursor; GenBank Accession No. Q9X721.1; GenBank; 2022; pp. 1-13 * |
Protein secretion from Saccharomyces cerevisiae directed by the prepro-alpha-factor leader region; K. M. Zsebo et al.; Journal of Biological Chemistry; Vol. 261, No. 13; pp. 5858-5865 * |
Structure of a Yeast Pheromone Gene (MFα): A Putative α-factor Precursor Contains Four Tandem Copies of Mature α-factor; Janet Kurjan and Ira Herskowitz; Cell; Vol. 30; pp. 933-943 * |
Also Published As
Publication number | Publication date |
---|---|
CN114836461A (en) | 2022-08-02 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
AU2021250992B2 (en) | Compositions and methods for directing proteins to specific loci in the genome | |
DK2601292T3 (en) | METHODS AND PRODUCTS FOR MANUFACTURING GROWERS | |
CN108753823B (en) | Method for realizing gene knockout by using base editing technology and application thereof | |
AU2020264412B2 (en) | Dna-binding protein using ppr motif, and use thereof | |
CN104059942B (en) | Avian pneumo-encephalitis virus heat-resisting live vaccine vectors system and application thereof | |
CN112501269B (en) | A method for rapid identification of high-affinity TCR antigen cross-reactivity | |
CN114836461B (en) | Recombinant plasmid expressing collagenase, yeast strain, fermentation medium and fermentation culture method thereof | |
CN113151178B (en) | Recombinant T cells knocking out Rc3h1 gene and/or Zc3h12a gene and its application | |
KR20160012562A (en) | Genetically engineered and acid resistant yeast cell with ehanced radiation sensitivity complementing kinase activity and method for producing lactate using the same | |
KR20080094910A (en) | New protein expression system | |
JP2024037797A (en) | Use of infectious nucleic acids to treat cancer | |
CN101688231B (en) | Allosteric trans-splicing type I ribozyme whose target-specific ribonucleic acid substitution activity is regulated by theophylline | |
CN113652405B (en) | pSFV-p54 replicon particles and preparation method and application thereof | |
CN109504663B (en) | Ad4/Ad7 type bivalent recombinant adenovirus and application thereof | |
US6531289B1 (en) | Regulated gene expression in yeast and method of use | |
CN112725211A (en) | Recombinant pichia pastoris, culture method and application | |
CN102517326B (en) | Method for improving content of lysine in maize by using zein gene ribonucleic acid interference (RNAi) vector | |
CN102533847B (en) | Constructing method of zein gene RNAi (Ribonucleic Acid Interference) carrier | |
CN103451181B (en) | A kind of resistance expression's box for efficiently building non-resistant mark recombinant mycobacterium | |
CN102517314B (en) | Zein gene RNAi (Ribonucleic Acid interference) vector | |
WO2014049580A2 (en) | Assay to monitor autophagy, a method and kit thereof | |
CN101652482A (en) | The production of butanol of being undertaken by the metabolic engineering yeast | |
KR102468650B1 (en) | Recombinant vector inducing expression of T7 RNA polymerase and mRNA capping enzyme and uses thereof | |
KR20230169221A (en) | Non-viral homology-mediated end joining | |
KR20230159994A (en) | Recombinant vector comprising hybrid signal sequence, and secretary preparation method of human insulin-like growth factor-1 using the same |
Legal Events
Code | Title | Date | Description |
---|---|---|---|
PB01 | Publication | | |
SE01 | Entry into force of request for substantive examination | | |
GR01 | Patent grant | | |