CN101883843A - 破坏过氧化物酶体生物合成因子蛋白(pex)以改变含油真核生物中多不饱和脂肪酸和总脂质含量 - Google Patents
破坏过氧化物酶体生物合成因子蛋白(pex)以改变含油真核生物中多不饱和脂肪酸和总脂质含量 Download PDFInfo
- Publication number
- CN101883843A CN101883843A CN2008801189174A CN200880118917A CN101883843A CN 101883843 A CN101883843 A CN 101883843A CN 2008801189174 A CN2008801189174 A CN 2008801189174A CN 200880118917 A CN200880118917 A CN 200880118917A CN 101883843 A CN101883843 A CN 101883843A
- Authority
- CN
- China
- Prior art keywords
- acid
- gene
- pufa
- polyunsaturated fatty
- pex
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 352
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 title claims abstract description 260
- 150000002632 lipids Chemical class 0.000 title claims abstract description 196
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 91
- 210000002824 peroxisome Anatomy 0.000 title abstract description 15
- 230000008436 biogenesis Effects 0.000 title abstract 2
- 238000000034 method Methods 0.000 claims abstract description 136
- 239000000194 fatty acid Substances 0.000 claims abstract description 73
- 235000014113 dietary fatty acids Nutrition 0.000 claims abstract description 66
- 229930195729 fatty acid Natural products 0.000 claims abstract description 66
- 150000004665 fatty acids Chemical class 0.000 claims abstract description 65
- 241000206602 Eukaryota Species 0.000 claims abstract description 39
- 101001126533 Arabidopsis thaliana Peroxisome biogenesis factor 10 Proteins 0.000 claims abstract description 30
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 226
- 235000019197 fats Nutrition 0.000 claims description 161
- 235000020673 eicosapentaenoic acid Nutrition 0.000 claims description 129
- 239000002253 acid Substances 0.000 claims description 117
- 102000004190 Enzymes Human genes 0.000 claims description 84
- 108090000790 Enzymes Proteins 0.000 claims description 84
- 230000003570 biosynthesizing effect Effects 0.000 claims description 76
- HOBAELRKJCKHQD-UHFFFAOYSA-N (8Z,11Z,14Z)-8,11,14-eicosatrienoic acid Natural products CCCCCC=CCC=CCC=CCCCCCCC(O)=O HOBAELRKJCKHQD-UHFFFAOYSA-N 0.000 claims description 60
- HOBAELRKJCKHQD-QNEBEIHSSA-N dihomo-γ-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCCCC(O)=O HOBAELRKJCKHQD-QNEBEIHSSA-N 0.000 claims description 60
- 235000020660 omega-3 fatty acid Nutrition 0.000 claims description 52
- 230000006696 biosynthetic metabolic pathway Effects 0.000 claims description 42
- 230000006378 damage Effects 0.000 claims description 36
- 108010022240 delta-8 fatty acid desaturase Proteins 0.000 claims description 34
- 230000001976 improved effect Effects 0.000 claims description 34
- 210000000130 stem cell Anatomy 0.000 claims description 32
- 241000235070 Saccharomyces Species 0.000 claims description 31
- 238000003209 gene knockout Methods 0.000 claims description 25
- 230000001066 destructive effect Effects 0.000 claims description 24
- -1 Δ 12 desaturases Proteins 0.000 claims description 22
- 230000008569 process Effects 0.000 claims description 21
- 241000223252 Rhodotorula Species 0.000 claims description 19
- VZCCETWTMQHEPK-QNEBEIHSSA-N gamma-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCC(O)=O VZCCETWTMQHEPK-QNEBEIHSSA-N 0.000 claims description 18
- 235000020664 gamma-linolenic acid Nutrition 0.000 claims description 18
- 230000008034 disappearance Effects 0.000 claims description 17
- MBMBGCFOFBJSGT-KUBAVDMBSA-N docosahexaenoic acid Natural products CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 claims description 16
- 102100034542 Acyl-CoA (8-3)-desaturase Human genes 0.000 claims description 13
- 102100034544 Acyl-CoA 6-desaturase Human genes 0.000 claims description 13
- 108010073542 Delta-5 Fatty Acid Desaturase Proteins 0.000 claims description 13
- 108010037138 Linoleoyl-CoA Desaturase Proteins 0.000 claims description 13
- DPLUDQAYKAQJBI-GWOCEWHOSA-J [H+].[H+].[H+].[H+].[Zn++].[Zn++].N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O Chemical group [H+].[H+].[H+].[H+].[Zn++].[Zn++].N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O DPLUDQAYKAQJBI-GWOCEWHOSA-J 0.000 claims description 13
- 235000013305 food Nutrition 0.000 claims description 13
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 claims description 12
- 101000579335 Arabidopsis thaliana Peroxisome biogenesis protein 12 Proteins 0.000 claims description 11
- PIFPCDRPHCQLSJ-WYIJOVFWSA-N 4,8,12,15,19-Docosapentaenoic acid Chemical compound CC\C=C\CC\C=C\C\C=C\CC\C=C\CC\C=C\CCC(O)=O PIFPCDRPHCQLSJ-WYIJOVFWSA-N 0.000 claims description 10
- 101000693832 Arabidopsis thaliana Peroxisome biogenesis protein 2 Proteins 0.000 claims description 10
- PIFPCDRPHCQLSJ-UHFFFAOYSA-N Clupanodonic acid Natural products CCC=CCCC=CCC=CCCC=CCCC=CCCC(O)=O PIFPCDRPHCQLSJ-UHFFFAOYSA-N 0.000 claims description 10
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 claims description 10
- 241000223230 Trichosporon Species 0.000 claims description 10
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 claims description 10
- 210000004899 c-terminal region Anatomy 0.000 claims description 10
- 229960004232 linoleic acid Drugs 0.000 claims description 10
- 235000021290 n-3 DPA Nutrition 0.000 claims description 10
- 229960005135 eicosapentaenoic acid Drugs 0.000 claims description 9
- 241001527609 Cryptococcus Species 0.000 claims description 8
- HXWJFEZDFPRLBG-UHFFFAOYSA-N Timnodonic acid Natural products CCCC=CC=CCC=CCC=CCC=CCCCC(O)=O HXWJFEZDFPRLBG-UHFFFAOYSA-N 0.000 claims description 8
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 claims description 8
- 101000579348 Arabidopsis thaliana Peroxisomal membrane protein 13 Proteins 0.000 claims description 7
- 101000600176 Arabidopsis thaliana Peroxisomal membrane protein PEX14 Proteins 0.000 claims description 7
- 101000987688 Arabidopsis thaliana Peroxisome biogenesis protein 5 Proteins 0.000 claims description 7
- 241000235575 Mortierella Species 0.000 claims description 7
- 108010077056 Peroxisomal Targeting Signal 2 Receptor Proteins 0.000 claims description 7
- 102000009913 Peroxisomal Targeting Signal 2 Receptor Human genes 0.000 claims description 7
- 241000233675 Thraustochytrium Species 0.000 claims description 7
- 108010087894 Fatty acid desaturases Proteins 0.000 claims description 6
- 101000912235 Rebecca salina Acyl-lipid (7-3)-desaturase Proteins 0.000 claims description 6
- 101000877236 Siganus canaliculatus Acyl-CoA Delta-4 desaturase Proteins 0.000 claims description 6
- 235000020661 alpha-linolenic acid Nutrition 0.000 claims description 6
- VZCCETWTMQHEPK-UHFFFAOYSA-N gamma-Linolensaeure Natural products CCCCCC=CCC=CCC=CCCCCC(O)=O VZCCETWTMQHEPK-UHFFFAOYSA-N 0.000 claims description 6
- 229960002733 gamolenic acid Drugs 0.000 claims description 6
- 229960004488 linolenic acid Drugs 0.000 claims description 6
- 241000222120 Candida <Saccharomycetales> Species 0.000 claims description 5
- 235000021298 Dihomo-γ-linolenic acid Nutrition 0.000 claims description 5
- 241001149698 Lipomyces Species 0.000 claims description 5
- 235000021342 arachidonic acid Nutrition 0.000 claims description 5
- 229940114079 arachidonic acid Drugs 0.000 claims description 5
- 108010011713 delta-15 desaturase Proteins 0.000 claims description 5
- DVSZKTAMJJTWFG-UHFFFAOYSA-N docosa-2,4,6,8,10,12-hexaenoic acid Chemical compound CCCCCCCCCC=CC=CC=CC=CC=CC=CC(O)=O DVSZKTAMJJTWFG-UHFFFAOYSA-N 0.000 claims description 5
- IQLUYYHUNSSHIY-HZUMYPAESA-N eicosatetraenoic acid Chemical compound CCCCCCCCCCC\C=C\C=C\C=C\C=C\C(O)=O IQLUYYHUNSSHIY-HZUMYPAESA-N 0.000 claims description 5
- 230000036961 partial effect Effects 0.000 claims description 5
- XSXIVVZCUAHUJO-AVQMFFATSA-N (11e,14e)-icosa-11,14-dienoic acid Chemical compound CCCCC\C=C\C\C=C\CCCCCCCCCC(O)=O XSXIVVZCUAHUJO-AVQMFFATSA-N 0.000 claims description 4
- 241000003595 Aurantiochytrium limacinum Species 0.000 claims description 4
- 235000021297 Eicosadienoic acid Nutrition 0.000 claims description 4
- 101150056463 PEX5 gene Proteins 0.000 claims description 4
- 241000235013 Yarrowia Species 0.000 claims description 4
- 230000033444 hydroxylation Effects 0.000 claims description 4
- 238000005805 hydroxylation reaction Methods 0.000 claims description 4
- 101150040807 pex14 gene Proteins 0.000 claims description 4
- BBWMTEYXFFWPIF-CJBMEHDJSA-N (2e,4e,6e)-icosa-2,4,6-trienoic acid Chemical compound CCCCCCCCCCCCC\C=C\C=C\C=C\C(O)=O BBWMTEYXFFWPIF-CJBMEHDJSA-N 0.000 claims description 3
- 241000233671 Schizochytrium Species 0.000 claims description 3
- 102100034543 Fatty acid desaturase 3 Human genes 0.000 claims 1
- 210000004027 cell Anatomy 0.000 abstract description 176
- 230000000694 effects Effects 0.000 abstract description 41
- 101150051239 PEX3 gene Proteins 0.000 abstract description 25
- 230000001965 increasing effect Effects 0.000 abstract description 5
- 230000002759 chromosomal effect Effects 0.000 abstract description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 329
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 243
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 241
- 230000001580 bacterial effect Effects 0.000 description 228
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 218
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 200
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 131
- 229940088598 enzyme Drugs 0.000 description 78
- 239000013612 plasmid Substances 0.000 description 75
- 235000018102 proteins Nutrition 0.000 description 74
- 239000003921 oil Substances 0.000 description 71
- 230000014509 gene expression Effects 0.000 description 69
- 239000002773 nucleotide Substances 0.000 description 53
- 125000003729 nucleotide group Chemical group 0.000 description 53
- 241000894006 Bacteria Species 0.000 description 52
- 108020004414 DNA Proteins 0.000 description 51
- 150000007523 nucleic acids Chemical class 0.000 description 51
- 239000012634 fragment Substances 0.000 description 48
- 150000001413 amino acids Chemical class 0.000 description 47
- 239000000047 product Substances 0.000 description 46
- 239000002609 medium Substances 0.000 description 44
- 239000000523 sample Substances 0.000 description 43
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 39
- 230000008859 change Effects 0.000 description 38
- 238000006243 chemical reaction Methods 0.000 description 38
- 239000000758 substrate Substances 0.000 description 38
- 238000013459 approach Methods 0.000 description 36
- 239000000203 mixture Substances 0.000 description 36
- 101100382243 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YAT1 gene Proteins 0.000 description 35
- 238000004519 manufacturing process Methods 0.000 description 35
- 229940024606 amino acid Drugs 0.000 description 33
- 235000001014 amino acid Nutrition 0.000 description 33
- 101150099000 EXPA1 gene Proteins 0.000 description 30
- 102100029095 Exportin-1 Human genes 0.000 description 30
- 101100119348 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) EXP1 gene Proteins 0.000 description 30
- 101100269618 Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) aliA gene Proteins 0.000 description 30
- 108700002148 exportin 1 Proteins 0.000 description 30
- 108020004705 Codon Proteins 0.000 description 27
- 108090000765 processed proteins & peptides Proteins 0.000 description 26
- 238000005516 engineering process Methods 0.000 description 24
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 23
- 230000012010 growth Effects 0.000 description 22
- 229920001184 polypeptide Polymers 0.000 description 22
- 102000004196 processed proteins & peptides Human genes 0.000 description 22
- 230000001105 regulatory effect Effects 0.000 description 22
- 108700012830 rat Lip2 Proteins 0.000 description 21
- 230000001276 controlling effect Effects 0.000 description 20
- 238000013461 design Methods 0.000 description 20
- 230000006870 function Effects 0.000 description 20
- 101150035194 pex16 gene Proteins 0.000 description 20
- 102220023258 rs387907548 Human genes 0.000 description 20
- 101100005882 Mus musculus Cel gene Proteins 0.000 description 19
- 101100289046 Mus musculus Lias gene Proteins 0.000 description 19
- 238000004458 analytical method Methods 0.000 description 19
- 229910052799 carbon Inorganic materials 0.000 description 19
- 101150091094 lipA gene Proteins 0.000 description 19
- YAFQFNOUYXZVPZ-UHFFFAOYSA-N liproxstatin-1 Chemical compound ClC1=CC=CC(CNC=2C3(CCNCC3)NC3=CC=CC=C3N=2)=C1 YAFQFNOUYXZVPZ-UHFFFAOYSA-N 0.000 description 19
- 102000039446 nucleic acids Human genes 0.000 description 19
- 108020004707 nucleic acids Proteins 0.000 description 19
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 18
- 241000196324 Embryophyta Species 0.000 description 17
- 239000002585 base Substances 0.000 description 17
- 238000009396 hybridization Methods 0.000 description 17
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 16
- SEHFUALWMUWDKS-UHFFFAOYSA-N 5-fluoroorotic acid Chemical compound OC(=O)C=1NC(=O)NC(=O)C=1F SEHFUALWMUWDKS-UHFFFAOYSA-N 0.000 description 15
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 15
- 108091026890 Coding region Proteins 0.000 description 15
- 241000233866 Fungi Species 0.000 description 15
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 15
- 108020004999 messenger RNA Proteins 0.000 description 15
- 238000011160 research Methods 0.000 description 15
- 241000195619 Euglena gracilis Species 0.000 description 14
- 239000000126 substance Substances 0.000 description 14
- 238000013519 translation Methods 0.000 description 14
- 229940012843 omega-3 fatty acid Drugs 0.000 description 13
- 208000016444 Benign adult familial myoclonic epilepsy Diseases 0.000 description 12
- 241000195493 Cryptophyta Species 0.000 description 12
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 12
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 12
- 238000013016 damping Methods 0.000 description 12
- 208000016427 familial adult myoclonic epilepsy Diseases 0.000 description 12
- ZGNITFSDLCMLGI-UHFFFAOYSA-N flubendiamide Chemical compound CC1=CC(C(F)(C(F)(F)F)C(F)(F)F)=CC=C1NC(=O)C1=CC=CC(I)=C1C(=O)NC(C)(C)CS(C)(=O)=O ZGNITFSDLCMLGI-UHFFFAOYSA-N 0.000 description 12
- 239000012530 fluid Substances 0.000 description 12
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 12
- 244000005700 microbiome Species 0.000 description 12
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 12
- 235000021313 oleic acid Nutrition 0.000 description 12
- 235000020665 omega-6 fatty acid Nutrition 0.000 description 12
- 229940033080 omega-6 fatty acid Drugs 0.000 description 12
- 102220023257 rs387907546 Human genes 0.000 description 12
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 11
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 11
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 11
- 239000005642 Oleic acid Substances 0.000 description 11
- 238000006555 catalytic reaction Methods 0.000 description 11
- 235000013350 formula milk Nutrition 0.000 description 11
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 11
- 239000002243 precursor Substances 0.000 description 11
- 238000012216 screening Methods 0.000 description 11
- 230000009466 transformation Effects 0.000 description 11
- 108700023863 Gene Components Proteins 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 10
- 241000233654 Oomycetes Species 0.000 description 10
- 241000235015 Yarrowia lipolytica Species 0.000 description 10
- 102220369447 c.1352G>A Human genes 0.000 description 10
- 239000000284 extract Substances 0.000 description 10
- 238000007429 general method Methods 0.000 description 10
- 230000002068 genetic effect Effects 0.000 description 10
- 239000008103 glucose Substances 0.000 description 10
- 230000010354 integration Effects 0.000 description 10
- 238000002360 preparation method Methods 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 241000255640 Loa loa Species 0.000 description 9
- 230000003197 catalytic effect Effects 0.000 description 9
- 230000000295 complement effect Effects 0.000 description 9
- 238000013467 fragmentation Methods 0.000 description 9
- 238000006062 fragmentation reaction Methods 0.000 description 9
- 239000001963 growth medium Substances 0.000 description 9
- 230000008676 import Effects 0.000 description 9
- 239000013067 intermediate product Substances 0.000 description 9
- 230000000968 intestinal effect Effects 0.000 description 9
- 239000007788 liquid Substances 0.000 description 9
- 239000000463 material Substances 0.000 description 9
- 229910052757 nitrogen Inorganic materials 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 8
- 241000233639 Pythium Species 0.000 description 8
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 230000007246 mechanism Effects 0.000 description 8
- 230000010076 replication Effects 0.000 description 8
- TUNFSRHWOTWDNC-UHFFFAOYSA-N tetradecanoic acid Chemical compound CCCCCCCCCCCCCC(O)=O TUNFSRHWOTWDNC-UHFFFAOYSA-N 0.000 description 8
- 108700010070 Codon Usage Proteins 0.000 description 7
- 108010025366 Peroxins Proteins 0.000 description 7
- 102000013772 Peroxins Human genes 0.000 description 7
- 229940100389 Sulfonylurea Drugs 0.000 description 7
- 102220369446 c.1274G>A Human genes 0.000 description 7
- 230000004087 circulation Effects 0.000 description 7
- 230000029087 digestion Effects 0.000 description 7
- 230000004060 metabolic process Effects 0.000 description 7
- 210000002220 organoid Anatomy 0.000 description 7
- 238000004321 preservation Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 239000011541 reaction mixture Substances 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 7
- 102000057234 Acyl transferases Human genes 0.000 description 6
- 108700016155 Acyl transferases Proteins 0.000 description 6
- 102000002148 Diacylglycerol O-acyltransferase Human genes 0.000 description 6
- 108010001348 Diacylglycerol O-acyltransferase Proteins 0.000 description 6
- 241000233732 Fusarium verticillioides Species 0.000 description 6
- 229940096437 Protein S Drugs 0.000 description 6
- 235000021355 Stearic acid Nutrition 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 238000011081 inoculation Methods 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 239000002502 liposome Substances 0.000 description 6
- 235000020978 long-chain polyunsaturated fatty acids Nutrition 0.000 description 6
- 230000007935 neutral effect Effects 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 6
- SECPZKHBENQXJG-FPLPWBNLSA-N palmitoleic acid Chemical compound CCCCCC\C=C/CCCCCCCC(O)=O SECPZKHBENQXJG-FPLPWBNLSA-N 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 229940081969 saccharomyces cerevisiae Drugs 0.000 description 6
- 239000008117 stearic acid Substances 0.000 description 6
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 6
- 238000005809 transesterification reaction Methods 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 102100039239 Amidophosphoribosyltransferase Human genes 0.000 description 5
- 108010039224 Amidophosphoribosyltransferase Proteins 0.000 description 5
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 5
- 239000006227 byproduct Substances 0.000 description 5
- 102220369445 c.668T>C Human genes 0.000 description 5
- 235000005911 diet Nutrition 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 229960003136 leucine Drugs 0.000 description 5
- 230000000813 microbial effect Effects 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 5
- 238000005457 optimization Methods 0.000 description 5
- 230000001131 transforming effect Effects 0.000 description 5
- AVKOENOBFIYBSA-WMPRHZDHSA-N (4Z,7Z,10Z,13Z,16Z)-docosa-4,7,10,13,16-pentaenoic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O AVKOENOBFIYBSA-WMPRHZDHSA-N 0.000 description 4
- 108010000700 Acetolactate synthase Proteins 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 4
- 241000024188 Andala Species 0.000 description 4
- 241000219194 Arabidopsis Species 0.000 description 4
- 239000002028 Biomass Substances 0.000 description 4
- 208000031639 Chromosome Deletion Diseases 0.000 description 4
- 241000219112 Cucumis Species 0.000 description 4
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 4
- 101150102653 Dgat2 gene Proteins 0.000 description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 4
- 241000907999 Mortierella alpina Species 0.000 description 4
- 101100313266 Mus musculus Tead1 gene Proteins 0.000 description 4
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- 102000003992 Peroxidases Human genes 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- WQDUMFSSJAZKTM-UHFFFAOYSA-N Sodium methoxide Chemical compound [Na+].[O-]C WQDUMFSSJAZKTM-UHFFFAOYSA-N 0.000 description 4
- 101150050575 URA3 gene Proteins 0.000 description 4
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 4
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 229940041514 candida albicans extract Drugs 0.000 description 4
- 150000001721 carbon Chemical group 0.000 description 4
- 239000007795 chemical reaction product Substances 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000000378 dietary effect Effects 0.000 description 4
- 238000006073 displacement reaction Methods 0.000 description 4
- 235000013399 edible fruits Nutrition 0.000 description 4
- 230000009483 enzymatic pathway Effects 0.000 description 4
- 238000011156 evaluation Methods 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 235000021588 free fatty acids Nutrition 0.000 description 4
- 238000006460 hydrolysis reaction Methods 0.000 description 4
- 150000002500 ions Chemical class 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000007899 nucleic acid hybridization Methods 0.000 description 4
- 108040007629 peroxidase activity proteins Proteins 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 238000003753 real-time PCR Methods 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 230000008521 reorganization Effects 0.000 description 4
- 230000003362 replicative effect Effects 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 150000004671 saturated fatty acids Chemical class 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 235000013311 vegetables Nutrition 0.000 description 4
- 239000012138 yeast extract Substances 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 108010051225 Diacylglycerol cholinephosphotransferase Proteins 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 3
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 3
- 241000235058 Komagataella pastoris Species 0.000 description 3
- 101710159527 Maturation protein A Proteins 0.000 description 3
- 101710091157 Maturation protein A2 Proteins 0.000 description 3
- 108091034057 RNA (poly(A)) Proteins 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 102000013275 Somatomedins Human genes 0.000 description 3
- 102100028897 Stearoyl-CoA desaturase Human genes 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000012620 biological material Substances 0.000 description 3
- 230000001851 biosynthetic effect Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 210000000172 cytosol Anatomy 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 239000008121 dextrose Substances 0.000 description 3
- 235000015872 dietary supplement Nutrition 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 3
- 150000002148 esters Chemical class 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 3
- 230000007062 hydrolysis Effects 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 3
- 239000002207 metabolite Substances 0.000 description 3
- 239000003471 mutagenic agent Substances 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 235000008935 nutritious Nutrition 0.000 description 3
- 230000002018 overexpression Effects 0.000 description 3
- 230000000858 peroxisomal effect Effects 0.000 description 3
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 3
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 3
- NLKNQRATVPKPDG-UHFFFAOYSA-M potassium iodide Chemical compound [K+].[I-] NLKNQRATVPKPDG-UHFFFAOYSA-M 0.000 description 3
- 102220023256 rs387907547 Human genes 0.000 description 3
- 229920006395 saturated elastomer Polymers 0.000 description 3
- 238000012882 sequential analysis Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000005303 weighing Methods 0.000 description 3
- 239000011701 zinc Substances 0.000 description 3
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 2
- 201000011452 Adrenoleukodystrophy Diseases 0.000 description 2
- 241001225321 Aspergillus fumigatus Species 0.000 description 2
- 241000351920 Aspergillus nidulans Species 0.000 description 2
- 241000222122 Candida albicans Species 0.000 description 2
- 241000222178 Candida tropicalis Species 0.000 description 2
- 102100035882 Catalase Human genes 0.000 description 2
- 108010053835 Catalase Proteins 0.000 description 2
- 108010051219 Cre recombinase Proteins 0.000 description 2
- 241000235646 Cyberlindnera jadinii Species 0.000 description 2
- 241001147476 Cyclotella Species 0.000 description 2
- 241001491720 Cyclotella sp. Species 0.000 description 2
- 102000013444 Diacylglycerol Cholinephosphotransferase Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- 241001465328 Eremothecium gossypii Species 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 102000009114 Fatty acid desaturases Human genes 0.000 description 2
- 241000223195 Fusarium graminearum Species 0.000 description 2
- ZRALSGWEFCBTJO-UHFFFAOYSA-N Guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 241001138401 Kluyveromyces lactis Species 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 206010027336 Menstruation delayed Diseases 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 102000029749 Microtubule Human genes 0.000 description 2
- 108091022875 Microtubule Proteins 0.000 description 2
- 241000180701 Nitzschia <flatworm> Species 0.000 description 2
- 241000486043 Nitzschia sp. (in: Bacillariophyta) Species 0.000 description 2
- 108091092724 Noncoding DNA Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000199911 Peridinium Species 0.000 description 2
- 102100037479 Peroxisomal membrane protein PEX16 Human genes 0.000 description 2
- 102100030554 Peroxisome biogenesis factor 10 Human genes 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- 241001467333 Thraustochytriaceae Species 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 108010020764 Transposases Proteins 0.000 description 2
- 102000008579 Transposases Human genes 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 2
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 244000301083 Ustilago maydis Species 0.000 description 2
- 235000015919 Ustilago maydis Nutrition 0.000 description 2
- 201000004525 Zellweger Syndrome Diseases 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical group [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- CKUAXEQHGKSLHN-UHFFFAOYSA-N [C].[N] Chemical compound [C].[N] CKUAXEQHGKSLHN-UHFFFAOYSA-N 0.000 description 2
- 241000222126 [Candida] glabrata Species 0.000 description 2
- DPDMMXDBJGCCQC-UHFFFAOYSA-N [Na].[Cl] Chemical compound [Na].[Cl] DPDMMXDBJGCCQC-UHFFFAOYSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000000735 allogeneic effect Effects 0.000 description 2
- 239000004411 aluminium Substances 0.000 description 2
- 229910052782 aluminium Inorganic materials 0.000 description 2
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 2
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 2
- 229940126575 aminoglycoside Drugs 0.000 description 2
- 230000003698 anagen phase Effects 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 229940091771 aspergillus fumigatus Drugs 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 229940095731 candida albicans Drugs 0.000 description 2
- 208000032343 candida glabrata infection Diseases 0.000 description 2
- 125000004432 carbon atom Chemical group C* 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- NSWAMPCUPHPTTC-UHFFFAOYSA-N chlorimuron-ethyl Chemical group CCOC(=O)C1=CC=CC=C1S(=O)(=O)NC(=O)NC1=NC(Cl)=CC(OC)=N1 NSWAMPCUPHPTTC-UHFFFAOYSA-N 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- SECPZKHBENQXJG-UHFFFAOYSA-N cis-palmitoleic acid Natural products CCCCCCC=CCCCCCCCC(O)=O SECPZKHBENQXJG-UHFFFAOYSA-N 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 230000001186 cumulative effect Effects 0.000 description 2
- 230000034994 death Effects 0.000 description 2
- GHVNFZFCNZKVNT-UHFFFAOYSA-N decanoic acid Chemical compound CCCCCCCCCC(O)=O GHVNFZFCNZKVNT-UHFFFAOYSA-N 0.000 description 2
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- POULHZVOKOAJMA-UHFFFAOYSA-N dodecanoic acid Chemical compound CCCCCCCCCCCC(O)=O POULHZVOKOAJMA-UHFFFAOYSA-N 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 230000032050 esterification Effects 0.000 description 2
- 238000005886 esterification reaction Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 238000010359 gene isolation Methods 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 235000011187 glycerol Nutrition 0.000 description 2
- 229930182470 glycoside Natural products 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- YQYJSBFKSSDGFO-FWAVGLHBSA-N hygromycin A Chemical compound O[C@H]1[C@H](O)[C@H](C(=O)C)O[C@@H]1Oc1ccc(\C=C(/C)C(=O)N[C@@H]2[C@@H]([C@H]3OCO[C@H]3[C@@H](O)[C@@H]2O)O)cc1O YQYJSBFKSSDGFO-FWAVGLHBSA-N 0.000 description 2
- VKOBVWXKNCXXDE-UHFFFAOYSA-N icosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCC(O)=O VKOBVWXKNCXXDE-UHFFFAOYSA-N 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 229910052500 inorganic mineral Inorganic materials 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000012269 metabolic engineering Methods 0.000 description 2
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Natural products C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 2
- 210000004688 microtubule Anatomy 0.000 description 2
- 239000011707 mineral Substances 0.000 description 2
- 235000010755 mineral Nutrition 0.000 description 2
- 150000002759 monoacylglycerols Chemical class 0.000 description 2
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical class CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 2
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 2
- 238000003499 nucleic acid array Methods 0.000 description 2
- 230000000050 nutritive effect Effects 0.000 description 2
- UTOPWMOLSKOLTQ-UHFFFAOYSA-N octacosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCCCCCCCCCC(O)=O UTOPWMOLSKOLTQ-UHFFFAOYSA-N 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 108010033653 omega-3 fatty acid desaturase Proteins 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 230000003647 oxidation Effects 0.000 description 2
- 238000007254 oxidation reaction Methods 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 230000037039 plant physiology Effects 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 102220004457 rs11567847 Human genes 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 2
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 2
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 2
- 229940045145 uridine Drugs 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 150000003722 vitamin derivatives Chemical class 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- WBBQTNCISCKUMU-PDBXOOCHSA-N (13Z,16Z,19Z)-docosatrienoic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCCCCCC(O)=O WBBQTNCISCKUMU-PDBXOOCHSA-N 0.000 description 1
- GWHCXVQVJPWHRF-KTKRTIGZSA-N (15Z)-tetracosenoic acid Chemical compound CCCCCCCC\C=C/CCCCCCCCCCCCCC(O)=O GWHCXVQVJPWHRF-KTKRTIGZSA-N 0.000 description 1
- CUXYLFPMQMFGPL-UHFFFAOYSA-N (9Z,11E,13E)-9,11,13-Octadecatrienoic acid Natural products CCCCC=CC=CC=CCCCCCCCC(O)=O CUXYLFPMQMFGPL-UHFFFAOYSA-N 0.000 description 1
- QXJSBBXBKPUZAA-CMDGGOBGSA-N (e)-octadec-10-enoic acid Chemical compound CCCCCCC\C=C\CCCCCCCCC(O)=O QXJSBBXBKPUZAA-CMDGGOBGSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- PORPENFLTBBHSG-MGBGTMOVSA-N 1,2-dihexadecanoyl-sn-glycerol-3-phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(O)=O)OC(=O)CCCCCCCCCCCCCCC PORPENFLTBBHSG-MGBGTMOVSA-N 0.000 description 1
- LDVVTQMJQSCDMK-UHFFFAOYSA-N 1,3-dihydroxypropan-2-yl formate Chemical compound OCC(CO)OC=O LDVVTQMJQSCDMK-UHFFFAOYSA-N 0.000 description 1
- OUKQYSCCWVCDKU-UHFFFAOYSA-N 1-[(3,4-diethoxyphenyl)methyl]-6,7-diethoxy-3,4-dihydroisoquinolin-2-ium;2-(1,3-dimethyl-2,6-dioxopurin-7-yl)acetate Chemical compound O=C1N(C)C(=O)N(C)C2=C1N(CC(O)=O)C=N2.C1=C(OCC)C(OCC)=CC=C1CC1=NCCC2=CC(OCC)=C(OCC)C=C12 OUKQYSCCWVCDKU-UHFFFAOYSA-N 0.000 description 1
- IHPYMWDTONKSCO-UHFFFAOYSA-N 2,2'-piperazine-1,4-diylbisethanesulfonic acid Chemical compound OS(=O)(=O)CCN1CCN(CCS(O)(=O)=O)CC1 IHPYMWDTONKSCO-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- WYMDDFRYORANCC-UHFFFAOYSA-N 2-[[3-[bis(carboxymethyl)amino]-2-hydroxypropyl]-(carboxymethyl)amino]acetic acid Chemical compound OC(=O)CN(CC(O)=O)CC(O)CN(CC(O)=O)CC(O)=O WYMDDFRYORANCC-UHFFFAOYSA-N 0.000 description 1
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical group O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- LODRRYMGPWQCTR-UHFFFAOYSA-N 5-fluoro-2,4-dioxo-1h-pyrimidine-6-carboxylic acid;hydrate Chemical compound O.OC(=O)C=1NC(=O)NC(=O)C=1F LODRRYMGPWQCTR-UHFFFAOYSA-N 0.000 description 1
- 101150014984 ACO gene Proteins 0.000 description 1
- 102100024643 ATP-binding cassette sub-family D member 1 Human genes 0.000 description 1
- 101150040074 Aco2 gene Proteins 0.000 description 1
- 241000242764 Aequorea victoria Species 0.000 description 1
- 241001439211 Almeida Species 0.000 description 1
- 241000590031 Alteromonas Species 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 108700012837 Arabidopsis Pex16 Proteins 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 101150098791 CPT1 gene Proteins 0.000 description 1
- 101100289888 Caenorhabditis elegans lys-5 gene Proteins 0.000 description 1
- 239000005632 Capric acid (CAS 334-48-5) Substances 0.000 description 1
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 102100027667 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Human genes 0.000 description 1
- 101710134389 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Proteins 0.000 description 1
- 206010053684 Cerebrohepatorenal syndrome Diseases 0.000 description 1
- 241001362614 Crassa Species 0.000 description 1
- 201000007336 Cryptococcosis Diseases 0.000 description 1
- 241000221204 Cryptococcus neoformans Species 0.000 description 1
- 241000123365 Cryptococcus neoformans var. neoformans Species 0.000 description 1
- 241000223233 Cutaneotrichosporon cutaneum Species 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000235036 Debaryomyces hansenii Species 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 108091027757 Deoxyribozyme Proteins 0.000 description 1
- 108010006731 Dimethylallyltranstransferase Proteins 0.000 description 1
- 102000005454 Dimethylallyltranstransferase Human genes 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 102000000429 Factor XII Human genes 0.000 description 1
- 108010080865 Factor XII Proteins 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 241001149475 Gaeumannomyces graminis Species 0.000 description 1
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 101100246753 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) pyrF gene Proteins 0.000 description 1
- 101000600189 Homo sapiens Peroxisomal membrane protein PEX16 Proteins 0.000 description 1
- 101001126498 Homo sapiens Peroxisome biogenesis factor 10 Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- RRHGJUQNOFWUDK-UHFFFAOYSA-N Isoprene Chemical group CC(=C)C=C RRHGJUQNOFWUDK-UHFFFAOYSA-N 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- RGHNJXZEOKUKBD-SKNVOMKLSA-N L-idonic acid Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SKNVOMKLSA-N 0.000 description 1
- 239000004395 L-leucine Substances 0.000 description 1
- 235000019454 L-leucine Nutrition 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 239000005639 Lauric acid Substances 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 235000021353 Lignoceric acid Nutrition 0.000 description 1
- CQXMAMUUWHYSIY-UHFFFAOYSA-N Lignoceric acid Natural products CCCCCCCCCCCCCCCCCCCCCCCC(=O)OCCC1=CC=C(O)C=C1 CQXMAMUUWHYSIY-UHFFFAOYSA-N 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 108090000856 Lyases Proteins 0.000 description 1
- 102000004317 Lyases Human genes 0.000 description 1
- 241001344131 Magnaporthe grisea Species 0.000 description 1
- 241001330975 Magnaporthe oryzae Species 0.000 description 1
- 208000002720 Malnutrition Diseases 0.000 description 1
- 108010049137 Member 1 Subfamily D ATP Binding Cassette Transporter Proteins 0.000 description 1
- 241001123676 Metschnikowia pulcherrima Species 0.000 description 1
- 101710169105 Minor spike protein Proteins 0.000 description 1
- 101710081079 Minor spike protein H Proteins 0.000 description 1
- 241001219224 Mortierella elongata Species 0.000 description 1
- 241000048020 Mortierella exigua Species 0.000 description 1
- 241000133355 Mortierella hygrophila Species 0.000 description 1
- CHJJGSNFBQVOTG-UHFFFAOYSA-N N-methyl-guanidine Natural products CNC(N)=N CHJJGSNFBQVOTG-UHFFFAOYSA-N 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 101150046494 PEX12 gene Proteins 0.000 description 1
- 101150100341 PEX2 gene Proteins 0.000 description 1
- 239000007990 PIPES buffer Substances 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 235000021319 Palmitoleic acid Nutrition 0.000 description 1
- 241000228150 Penicillium chrysogenum Species 0.000 description 1
- 102000002508 Peptide Elongation Factors Human genes 0.000 description 1
- 108010068204 Peptide Elongation Factors Proteins 0.000 description 1
- 102100029577 Peroxisomal biogenesis factor 3 Human genes 0.000 description 1
- 101710141840 Peroxisomal biogenesis factor 3 Proteins 0.000 description 1
- 101710170024 Peroxisomal membrane protein PEX16 Proteins 0.000 description 1
- 101710196890 Peroxisome biogenesis factor 10 Proteins 0.000 description 1
- 101150118397 Pex5l gene Proteins 0.000 description 1
- 102000001107 Phosphatidate Phosphatase Human genes 0.000 description 1
- 108010069394 Phosphatidate Phosphatase Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 240000007789 Pinus pinea Species 0.000 description 1
- 235000008575 Pinus pinea Nutrition 0.000 description 1
- 244000298647 Poinciana pulcherrima Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101710089165 Protein white Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 241000190932 Rhodopseudomonas Species 0.000 description 1
- 241000221523 Rhodotorula toruloides Species 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241000598397 Schizochytrium sp. Species 0.000 description 1
- 241000242583 Scyphozoa Species 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 241000863430 Shewanella Species 0.000 description 1
- 239000004141 Sodium laurylsulphate Substances 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241001634922 Tausonia pullulans Species 0.000 description 1
- 241001298230 Thraustochytrium sp. Species 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- 108010092464 Urate Oxidase Proteins 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- 241001532014 Xanthorrhoea Species 0.000 description 1
- 101100532752 Yarrowia lipolytica (strain CLIB 122 / E 150) SCP2 gene Proteins 0.000 description 1
- 208000036813 Zellweger spectrum disease Diseases 0.000 description 1
- 241000235017 Zygosaccharomyces Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000004026 adhesive bonding Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 101150099105 alien gene Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- AHANXAKGNAKFSK-PDBXOOCHSA-N all-cis-icosa-11,14,17-trienoic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCCCC(O)=O AHANXAKGNAKFSK-PDBXOOCHSA-N 0.000 description 1
- CUXYLFPMQMFGPL-SUTYWZMXSA-N all-trans-octadeca-9,11,13-trienoic acid Chemical compound CCCC\C=C\C=C\C=C\CCCCCCCC(O)=O CUXYLFPMQMFGPL-SUTYWZMXSA-N 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 229920006318 anionic polymer Polymers 0.000 description 1
- 229940121363 anti-inflammatory agent Drugs 0.000 description 1
- 239000002260 anti-inflammatory agent Substances 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 239000003529 anticholesteremic agent Substances 0.000 description 1
- 229940127226 anticholesterol agent Drugs 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 125000004429 atom Chemical group 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 150000005693 branched-chain amino acids Chemical class 0.000 description 1
- 229910052792 caesium Inorganic materials 0.000 description 1
- TVFDJXOCXUVLDH-UHFFFAOYSA-N caesium atom Chemical compound [Cs] TVFDJXOCXUVLDH-UHFFFAOYSA-N 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 235000011089 carbon dioxide Nutrition 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000006652 catabolic pathway Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 231100000481 chemical toxicant Toxicity 0.000 description 1
- 238000005660 chlorination reaction Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000013373 clone screening Methods 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 150000005690 diesters Chemical class 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- SWSQBOPZIKWTGO-UHFFFAOYSA-N dimethylaminoamidine Natural products CN(C)C(N)=N SWSQBOPZIKWTGO-UHFFFAOYSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- UKMSUNONTOPOIO-UHFFFAOYSA-N docosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCCCC(O)=O UKMSUNONTOPOIO-UHFFFAOYSA-N 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- PRHHYVQTPBEDFE-UHFFFAOYSA-N eicosatrienoic acid Natural products CCCCCC=CCC=CCCCCC=CCCCC(O)=O PRHHYVQTPBEDFE-UHFFFAOYSA-N 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000004146 energy storage Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- FARYTWBWLZAXNK-WAYWQWQTSA-N ethyl (z)-3-(methylamino)but-2-enoate Chemical compound CCOC(=O)\C=C(\C)NC FARYTWBWLZAXNK-WAYWQWQTSA-N 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 150000002190 fatty acyls Chemical group 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 229940044627 gamma-interferon Drugs 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- 235000011868 grain product Nutrition 0.000 description 1
- 239000004519 grease Substances 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- XMHIUKTWLZUKEX-UHFFFAOYSA-N hexacosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCCCCCCCC(O)=O XMHIUKTWLZUKEX-UHFFFAOYSA-N 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- YAMHXTCMCPHKLN-UHFFFAOYSA-N imidazolidin-2-one Chemical compound O=C1NCCN1 YAMHXTCMCPHKLN-UHFFFAOYSA-N 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 230000016507 interphase Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 101150077696 lip-1 gene Proteins 0.000 description 1
- 230000037356 lipid metabolism Effects 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000001071 malnutrition Effects 0.000 description 1
- 235000000824 malnutrition Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 235000013310 margarine Nutrition 0.000 description 1
- 239000003264 margarine Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 235000013622 meat product Nutrition 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 244000005706 microflora Species 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000002715 modification method Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 235000021281 monounsaturated fatty acids Nutrition 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- 230000001937 non-anti-biotic effect Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 208000015380 nutritional deficiency disease Diseases 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 229940049964 oleate Drugs 0.000 description 1
- JRZJOMJEPLMPRA-UHFFFAOYSA-N olefin Natural products CCCCCCCC=C JRZJOMJEPLMPRA-UHFFFAOYSA-N 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 229960005010 orotic acid Drugs 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 208000023269 peroxisome biogenesis disease Diseases 0.000 description 1
- 210000004332 phalangeal cell Anatomy 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 230000007096 poisonous effect Effects 0.000 description 1
- 239000003495 polar organic solvent Substances 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920000193 polymethacrylate Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 235000007715 potassium iodide Nutrition 0.000 description 1
- 229960004839 potassium iodide Drugs 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000000327 preparative centrifugation Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- FOGKDYADEBOSPL-UHFFFAOYSA-M rubidium(1+);acetate Chemical compound [Rb+].CC([O-])=O FOGKDYADEBOSPL-UHFFFAOYSA-M 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000007127 saponification reaction Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- AWUCVROLDVIAJX-GSVOUGTGSA-N sn-glycerol 3-phosphate Chemical compound OC[C@@H](O)COP(O)(O)=O AWUCVROLDVIAJX-GSVOUGTGSA-N 0.000 description 1
- 235000011888 snacks Nutrition 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- BAZAXWOYCMUHIX-UHFFFAOYSA-M sodium perchlorate Chemical compound [Na+].[O-]Cl(=O)(=O)=O BAZAXWOYCMUHIX-UHFFFAOYSA-M 0.000 description 1
- 229910001488 sodium perchlorate Inorganic materials 0.000 description 1
- VGTPCRGMBIAPIM-UHFFFAOYSA-M sodium thiocyanate Chemical compound [Na+].[S-]C#N VGTPCRGMBIAPIM-UHFFFAOYSA-M 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 238000000194 supercritical-fluid extraction Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000010809 targeting technique Methods 0.000 description 1
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 1
- 150000007970 thio esters Chemical class 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- RYYVLZVUVIJVGH-UHFFFAOYSA-N trimethylxanthine Natural products CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 229940005267 urate oxidase Drugs 0.000 description 1
- 238000003828 vacuum filtration Methods 0.000 description 1
- 239000000273 veterinary drug Substances 0.000 description 1
- PXQPEWDEAKTCGB-UHFFFAOYSA-N vitamin B13 Natural products OC(=O)C1=CC(=O)NC(=O)N1 PXQPEWDEAKTCGB-UHFFFAOYSA-N 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6472—Glycerides containing polyunsaturated fatty acid [PUFA] residues, i.e. having two or more double bonds in their backbone
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
- C12P7/6427—Polyunsaturated fatty acids [PUFA], i.e. having two or more double bonds in their backbone
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
- C12P7/6427—Polyunsaturated fatty acids [PUFA], i.e. having two or more double bonds in their backbone
- C12P7/6432—Eicosapentaenoic acids [EPA]
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Chemical & Material Sciences (AREA)
- Biotechnology (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Fodder In General (AREA)
- Coloring Foods And Improving Nutritive Qualities (AREA)
Abstract
通过改变过氧化物酶体生物合成因子(Pex)蛋白的活性,已经成功地找到在生产PUFA的含油真核生物中的总脂质级分和油级分中提高多不饱和脂肪酸(PUFA)的量的方法。在与不破坏天然Pex蛋白的亲本菌株比较时,在生产PUFA的含油真核生物菌种中破坏染色体Pex3基因、Pex10p基因或Pex16p基因,导致在菌株的总脂质级分和油级分中的PUFA量提高,所述PUFA的量以总脂肪酸百分比和干细胞重量百分比形式表示。
Description
本专利申请要求美国临时申请60/977,174和60/977,177的优先权,所述临时申请均提交于2007年10月3日,并且全文以引用方式并入本文中。
发明领域
本发明属于生物技术领域。更具体地讲,本发明涉及基于破坏过氧化物酶体生物合成因子(PEX)蛋白的、用于操纵真核生物的多不饱和脂肪酸(PUFA)组成和脂质含量的方法。
发明背景
已经有大量文献证明了与多不饱和脂肪酸[“PUFA”],尤其是ω-3和ω-6PUFA相关的健康有益效果。为了寻找大规模制备ω-3和ω-6
PUFA的方法,研究者已经将他们的工作转向发现基因以及了解产生脂质和脂肪酸的编码的生物合成途径。
一个用于生产这些PUFA的研究成果已经将ω-3/ω-6PUFA生物合成途径导入了不会天然产生ω-3/ω-6PUFA的生物体中。一种已经被广泛使用的所述生物是非含油酵母,啤酒糖酵母(Saccharomycescerevisiae)。然而,并无初步结果证明有限产量的亚油酸[“LA”]、γ-亚麻酸[“GLA”]、α-亚麻酸[“ALA”]、十八碳四烯酸[“STA”]和/或二十碳五烯酸[“EPA”]适于商业开发。
其它用于大规模生产ω-3/ω-6PUFA的研究成果已经培育出了天然生产选定脂肪酸的微生物,例如异养硅藻小环藻属(Cyclotella sp.)和菱形藻属(Nitzschia sp.)、假单胞菌属(Pseudomonas)、交替单胞菌属(Alteromonas)或希瓦氏菌(Shewanella)、腐霉属(Pythium)的丝状真菌、或长被孢霉属(Mortierella elongata)、M.exigua或M.hygrophila。
然而所有这些研究成果都不能够充分地改善油产量或控制所生产的油的组成的特性,因为发酵依赖微生物本身的天然能力。
共有的美国专利7,238,482描述了使用含油酵母解脂耶氏酵母作为生产宿主用于生产PUFA的用途。含油酵母被定义为是那些天然能够合成并积聚油的酵母,其中细胞干重一般大于25%。本领域已经描述了生产宿主的优化(参见例如国际申请公开WO 2006/033723、美国专利申请公开2006-0094092、美国专利申请公开2006-0115881、和美国专利申请公开2006-0110806)。本文所述的重组菌株包含多种嵌合基因,所述基因表达多拷贝的异源去饱和酶、延伸酶和酰基转移酶,并且任选地包含多个天然去饱和酶和酰基转移酶敲除以能够合成并积聚PUFA。商业生产PUFA需要对宿主细胞进行进一步优化。
Lin Y.等人提出过氧化物酶体是脂质的分解代谢和合成代谢所必需的(Plant Physiology,135:814-827(2004))。然而,该假说是以对拟南芥属突变体中的Pex16p同源物的研究为基础的,该突变体具有异常的过氧化物酶体生物合成以及脂肪酸合成(即,据报道ssel种子中生产的油比野生型减少了大约10-16%)。Binns,D.等人(J.Cell Biol.,173(5):719-731(2006))也提供了证明过氧化物酶体和脂质体在啤酒糖酵母中的密切协作的文献。但是以前尚未在生产PUFA的生物中进行过对Pex敲除的研究。
申请人已经通过在生产PUFA的生物体中破坏过氧化物酶体生物合成因子蛋白的非预知机制,解决了优化宿主细胞以用于PUFA商业生产的所述问题,所述机制导致在解脂耶氏酵母的重组PUFA生产菌株中提高PUFA量(以总脂肪酸百分比形式表示)的非预知结果。本文描述了在过氧化物酶体生物合成因子蛋白中存在破坏的新型菌株。
发明概述
本文所述的是提高在具有总脂质含量、总脂质级分和油级分的含油真核生物中至少一种多不饱和脂肪酸[“PUFA”]的重量%相对于总脂肪酸[“TFA”]的重量%的方法,所述方法包括:
a)提供含油真核生物,所述含油真核生物包含:
1)编码功能性多不饱和脂肪酸生物合成途径的基因;和
2)在编码过氧化物酶体生物合成因子蛋白的天然基因中的破坏,由此提供PEX破坏的生物,以及
b)使所述PEX破坏的生物在以下条件下生长:即,使得当与编码过氧化物酶体生物合成因子蛋白的天然基因未被破坏的含油真核生物中的总脂质级分或油级分中相对于总脂肪酸的重量百分比的至少一种多不饱和脂肪酸的重量百分比进行比较时,所述总脂质级分或油级分中至少一种多不饱和脂肪酸的重量百分比相对于总脂肪酸的重量百分比提高。
该提高方法也可通过实施相同步骤(a)和(b),用于提高至少一种多不饱和脂肪酸[“PUFA”]相对于干细胞重量[“DCW”]的百分比。
在本文所述的一些方法中,PUFA重量%相对于TFA重量%提高了至少1.3倍。
在一些所述方法中,PEX破坏的生物体中的总脂质含量与天然PEX基因中无破坏的含油真核生物的总脂质含量相比可能提高或降低。
在这些方法的任何一种中,提高的PUFA可以是单个PUFA或多个PUFA的组合。在任一种情况下,提高的PUFA或提高的PUFA组合可以包括亚油酸、共轭亚油酸、γ-亚麻酸、二高-γ-亚麻酸、花生四烯酸、二十二碳四烯酸、ω-6二十二碳五烯酸、α-亚麻酸、十八碳四烯酸、二十碳四烯酸、二十碳五烯酸、ω-3二十二碳五烯酸、二十碳二烯酸、二十碳三烯酸、二十二碳六烯酸、这些脂肪酸的羟基化或环氧脂肪酸、C18多不饱和脂肪酸或这些脂肪酸的组合、C20多不饱和脂肪酸或这些脂肪酸的组合、C20-22多不饱和脂肪酸的组合和C22多不饱和脂肪酸或这些脂肪酸的组合。
在这些方法的任何一种中,PEX破坏的生物可以是以下生物的一员:耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)、油脂酵母属(Lipomyces)、被孢霉属(Mortierella)、破囊壶菌属(Thraustochytrium)、裂殖壶菌属(Schizochytrium)、和糖酵母属(Saccharomyces),上述生物都具有含油特性。并且,在任何一种所述方法中,PUFA生物合成途径包括编码以下酶的任何一种或它们的组合的基因:Δ9去饱和酶、Δ12去饱和酶、Δ6去饱和酶、Δ5去饱和酶、Δ17去饱和酶、Δ8去饱和酶、Δ15去饱和酶、Δ4去饱和酶、C14/16延伸酶、C16/18延伸酶、C18/20延伸酶、C20/22延伸酶和Δ9延伸酶。
破坏可发生在编码以下过氧化物酶体生物合成因子蛋白的PEX基因中,所述蛋白包括:Pex1p、Pex 2p、Pex3p、Pex3Bp、Pex4p、Pex5p、Pex5Bp、Pex5Cp、Pex5/20p、Pex6p、Pex7p、Pex8p、Pex10p、Pex12p、Pex13p、Pex14p、Pex15p、Pex16p、Pex17p、Pex14/17p、Pex18p、Pex19p、Pex20p、Pex21p、Pex21Bp、Pex22p、Pex22p样和Pex26p。并且在这些方法的任何一种中,破坏可以是编码蛋白的C-末端部分的基因的一部分中的基因敲除或缺失。在这些方法的任何一种中,缺失发生在编码蛋白的C3HC4环状锌指基序的C-末端部分的基因部分。
本文也描述了PEX破坏的生物中的油级分或总脂质级分,通过如权利要求1所述的方法,其已经提高了至少一种PUFA的重量%。本文也描述了一种PEX破坏的解脂耶氏酵母,该酵母在编码Pex3p或Pex10p或Pex16p的天然基因中存在破坏。该解脂耶氏酵母的ATCC命名为ATCC PTA-8614(菌株Y4128)。
生物保藏
下列生物材料保藏于美国典型培养物保藏中心(American TypeCulture Collection)(ATCC)(10801 University Boulevard,Manassas,VA20110-2209),并具有下列名称、保藏号和保藏日期。
生物材料 | 保藏号 | 保藏日期 |
解脂耶氏酵母(Yarrowia lipolytica)Y2047 | ATCC PTA-7186 | 2005年10月26日 |
解脂耶氏酵母(Yarrowia lipolytica)Y2201 | ATCC PTA-7185 | 2005年10月26日 |
解脂耶氏酵母(Yarrowia lipolytica)Y2096 | ATCC PTA-7184 | 2005年10月26日 |
解脂耶氏酵母(Yarrowia lipolytica)Y3000 | ATCC PTA-7187 | 2005年10月26日 |
解脂耶氏酵母(Yarrowia lipolytica)Y4128 | ATCC PTA-8614 | 2007年8月23日 |
解脂耶氏酵母(Yarrowia lipolytica)Y4127 | ATCC PTA-8802 | 2007年11月29日 |
上面列出的生物材料将按照国际承认的用于专利程序目的的微生物保藏的布达佩斯条约的有关条款来保存。所列出的保藏物将会被保持在指定的国际保藏机构中至少30年,并且一旦准予专利公开它就将会对公众开放。在由政府行为授予的专利权的部分废除中,保藏物的可用性不会构成实施主题发明的许可。
附图简述和序列表
图1由图1A和图1B组成,它们一起示出了ω-3/ω-6脂肪酸生物合成途径,并且当考虑下文对该途径的描述时应被视为一起。
图2A提供了解脂耶氏酵母Pex10p(即,SEQ ID NO:10[GenBank保藏号CAG81606]的氨基酸327-364)、解脂耶氏酵母Pex2p(即,SEQID NO:2[GenBank保藏号CAG77647]的氨基酸266-323)和解脂耶氏酵母Pex12p(即,SEQ ID NO:11[GenBank保藏号CAG81532]的氨基酸342-391)的C3HC4环状锌指基序的比对,它们具有星号指示的保守C3HC4环状锌指基序的半胱氨酸和组氨酸残基。
图2B图示出了解脂耶氏酵母Pex10p C3HC4指状基序的不同氨基酸残基和它们结合的两个锌离子之间的本发明相互作用。
图3A图示了解脂耶氏酵母菌株Y4128的发育,该菌株在总脂质级分中生产37.6%的EPA。
图3B提供了pZP3-Pa777U的质粒图谱。
图4提供了下列质粒的质粒图谱:(A)pY117;和(B)pZP2-2988。
图5提供了下列质粒的质粒图谱:(A)pZKUE3S;和(B)pFBAIN-MOD-1。
图6提供了下列质粒的质粒图谱:(A)pFBAIN-PEX10;和(B)pEXP-MOD-1。
图7A提供了pPEX10-1的质粒图谱。图7B图示了解脂耶氏酵母菌株Y4184U的发育。
图8提供了如下质粒的质粒图谱:(A)pZKL1-2SP98C;和(B)pZKL2-5U89GC。
图9提供了下列质粒的质粒图谱:(A)pYPS161;和(B)pYRH13。
图10图示了解脂耶氏酵母菌株Y4305U3的发育。
图11提供了下列质粒的质粒图谱:(A)pZKUM;和(B)pZKD2-5U89A2。
图12提供了下列质粒的质粒图谱:(A)pY87;和(B)pY157。
根据下面的详细描述和附带的序列描述可以更充分地理解本发明,下面的详细描述和所附的序列描述形成了本申请的一部分。
下面的序列遵照37C.F.R.§1.821-1.825(“对包含核苷酸序列和/或氨基酸序列公开内容的专利申请的要求-序列规则”(“Requirements forPatent Applications Containing Nuceotide Sequences and/or Amino AcidSequence Disclosures-the Sequence Rules”)),并且符合世界知识产权组织(World Intellectual Property Organization)(WIPO)ST.25标准(1998)以及EPO和PCT的序列表要求(细则5.2和49.5(a-bis)以及行政指令(Administrative Instructions)的208节和附录C)。用于核苷酸和氨基酸序列数据的符号和格式遵循在37C.F.R.§1.822中示出的规则。
SEQ ID NO:1-86是引物、编码基因或蛋白(或它们的片段)的ORF、或质粒,如表1所述。
表1
核酸和蛋白质SEQ ID号总汇
描述和缩写 | 核酸SEQ ID NO | 蛋白质SEQ ID NO |
解脂耶氏酵母Pex1p(GenBank保藏号CAG82178) | -- | 1(1024AA) |
解脂耶氏酵母Pex2p(GenBank保藏号CAG77647) | -- | 2(381AA) |
解脂耶氏酵母Pex3p(GenBank保藏号CAG78565) | -- | 3(431AA) |
解脂耶氏酵母Pex3Bp(GenBank保藏号CAG83356) | -- | 4(395AA) |
解脂耶氏酵母Pex4p(GenBank保藏号CAG79130) | -- | 5(153AA) |
解脂耶氏酵母Pex5p(GenBank保藏号CAG78803) | -- | 6(598AA) |
解脂耶氏酵母Pex6p(GenBank保藏号CAG82306) | -- | 7(1024AA) |
解脂耶氏酵母Pex7p(GenBank保藏号CAG78389) | -- | 8(356AA) |
解脂耶氏酵母Pex8p(GenBank保藏号CAG80447) | -- | 9(671AA) |
解脂耶氏酵母Pex10p(GenBank保藏号CAG81606) | -- | 10(377AA) |
解脂耶氏酵母Pex12p(GenBank保藏号CAG81532) | -- | 11(408AA) |
解脂耶氏酵母Pex13p(GenBank保藏号CAG81789) | -- | 12(412AA) |
解脂耶氏酵母Pex14p(GenBank保藏号CAG79323) | -- | 13(380AA) |
解脂耶氏酵母Pex16p(GenBank保藏号CAG79622) | -- | 14(391AA) |
解脂耶氏酵母Pex17p(GenBank保藏号CAG84025) | -- | 15(225AA) |
描述和缩写 | 核酸SEQ ID NO | 蛋白质SEQ ID NO |
解脂耶氏酵母Pex19p(GenBank保藏号AAK84827) | -- | 16(324AA) |
解脂耶氏酵母Pex20p(GenBank保藏号CAG79226) | -- | 17(417AA) |
解脂耶氏酵母Pex22p(GenBank保藏号CAG77876) | -- | 18(195AA) |
解脂耶氏酵母Pex26p(GenBank保藏号NC_006072,核苷酸117230-118387的反义翻译) | -- | 19(386AA) |
包含解脂耶氏酵母Pex10基因的重叠群,该基因编码过氧化物酶体生物合成因子蛋白(Pex10p)(GenBank保藏号AB036770) | 20(3387bp) | -- |
解脂耶氏酵母Pex10(GenBank保藏号AB036770,核苷酸1038-2171)(蛋白序列与SEQ ID NO:10100%相同) | 21(1134bp) | 22(377AA) |
解脂耶氏酵母Pex10(GenBank保藏号AJ012084,它对应于GenBank保藏号AB036770的核苷酸1107-2171)(相对于SEQID NO:10和22的蛋白序列截短了最初的23个氨基酸) | 23(1065bp) | 24(354AA) |
解脂耶氏酵母Pex10p C3HC4环状锌指基序(即,SEQ ID NO:10的氨基酸327-364) | -- | 25(38AA) |
解脂耶氏酵母截短Pex10p(GenBank保藏号CAG81606[SEQ IDNO:10],缺失了C-末端的32个氨基酸) | -- | 26(345AA) |
解脂耶氏酵母乙酰羟酸合酶(AHAS)突变基因,该基因包含W497L突变 | 27(2987bp) | -- |
质粒pZP3-Pa777U | 28(13,066bp) | -- |
质粒pY117 | 29(9570bp) | -- |
质粒pZP2-2988 | 30(15,743bp) | -- |
质粒pZKUE3S | 31(6303bp) | -- |
引物pZP-GW-5-1 | 32 | -- |
引物pZP-GW-5-2 | 33 | -- |
描述和缩写 | 核酸SEQ ID NO | 蛋白质SEQ ID NO |
引物pZP-GW-5-3 | 34 | -- |
引物pZP-GW-5-4 | 35 | -- |
引物pZP-GW-3-1 | 36 | -- |
引物pZP-GW-3-2 | 37 | -- |
引物pZP-GW-3-3 | 38 | -- |
引物pZP-GW-3-4 | 39 | -- |
Genome Walker衔接子[顶链] | 40 | -- |
Genome Walker衔接子[底链] | 41 | -- |
巢式衔接子引物 | 42 | -- |
引物Per10F1 | 43 | -- |
引物ZPGW-5-5 | 44 | -- |
引物Per10R | 45 | -- |
质粒pFBAIN-MOD-1 | 46(7222bp) | -- |
质粒pFBAIn-PEX10 | 47(8133bp) | -- |
引物PEX10-R-BsiWI | 48 | -- |
引物PEX10-F1-SalI | 49 | -- |
引物PEX10-F2-SalI | 50 | -- |
质粒pEXP-MOD1 | 51(7277bp) | -- |
质粒pPEX10-1 | 52(7559bp) | -- |
质粒pPEX10-2 | 53(8051bp) | -- |
质粒pZKL1-2SP98C | 54(15,877bp) | -- |
描述和缩写 | 核酸SEQ ID NO | 蛋白质SEQ ID NO |
质粒pZKL2-5U89GC | 55(15,812bp) | -- |
质粒pYPS161 | 56(7966bp) | -- |
引物Pex-10del1 3′.正向 | 57 | -- |
引物Pex-10del2 5′.反向 | 58 | -- |
质粒pYRH13 | 59(8673bp) | -- |
引物PEX16Fii | 60 | -- |
引物PEX16Rii | 61 | -- |
引物3UTR-URA3 | 62 | -- |
引物Pex16-conf | 63 | -- |
实时PCR引物ef-324F | 64 | -- |
实时PCR引物ef-392R | 65 | -- |
实时PCR引物Pex16-741F | 66 | -- |
实时PCR引物Pex16-802R | 67 | -- |
TaqMan探针ef-345T的核苷酸部分 | 68 | -- |
TaqMan探针PEX16-760T的核苷酸部分 | 69 | -- |
质粒pZKUM | 70(4313bp) | -- |
质粒pZKD2-5U89A2 | 71(15,966bp) | -- |
解脂耶氏酵母二酰基甘油酰基转移酶(DGAT2)(美国专利公开7,267,976) | 72(2119bp) | 73(514AA) |
合成Δ12去饱和酶,该酶来源于串珠镰刀菌,其密码子经优化以在解脂耶氏酵母(“FmD12S”)中表达 | 74(1434bp) | 75(477AA) |
合成突变型Δ8去饱和酶(“EgD8M”),该酶来源于小眼虫(“EgD8S”;美国专利公开7,256,033) | 76(1272bp) | 77(422AA) |
描述和缩写 | 核酸SEQ ID NO | 蛋白质SEQ ID NO |
合成Δ9延伸酶,该酶来源于小型绿藻属CCMP389,其密码子经优化以在解脂耶氏酵母(“E389D9eS”)中表达 | 78(792bp) | 79(263AA) |
合成Δ5去饱和酶,该酶来源于小眼虫,其密码子经优化以在解脂耶氏酵母(“EgD5S”)中表达 | 80(1350bp) | 81(449AA) |
质粒pY157 | 82(6356bp) | -- |
质粒pY87 | 83(5910bp) | -- |
被Cre重组酶识别的大肠杆菌(Escherichia coli)LoxP重组位点 | 84(34bp) | -- |
引物UP 768 | 85 | -- |
引物LP 769 | 86 | -- |
发明详述
本文所述的是用于操纵在生产PUFA的真核生物中的长链多不饱和脂肪酸[“LC-PUFA”]的浓度(以总脂肪酸百分比形式表示)和含量(以干细胞重量百分比形式表示)的通用方法。这些方法依赖于破坏在宿主中的天然过氧化物酶体生物合成因子[“Pex”]蛋白并将对具有天然的或遗传工程的PUFA生产能力的多种真核生物具有广泛适用性,所述真核生物包括藻类、真菌、卵菌、酵母、类眼虫、原生藻菌、植物和一些哺乳动物系统。
PUFA或其衍生物可用作膳食替代品或补充剂,尤其是婴儿代乳品(formula),用于接受静脉内营养法的患者或用来预防或治疗营养不良。例如,PUFA可掺入烹饪油、脂肪或人造黄油并作为消费者的一般饮食部分被摄入,从而给予消费者所需的饮食补充剂。此外,PUFA还可以掺入婴儿代乳品、营养补充剂或其它食品中,并且可用作抗炎或降胆固醇剂。任选地,该组合物可以用于药用(人药或兽药)。
定义
在本公开中,使用了大量术语和缩写。给出了如下定义。
“开放阅读框”缩写为ORF。
“聚合酶链反应”缩写为PCR。
“美国典型培养物保藏中心”缩写为“ATCC”。
“多不饱和脂肪酸”缩写为“PUFA”。
“三酰基甘油”缩写为“TAG”。
“总脂肪酸”缩写为“TFA”。
“脂肪酸甲酯”缩写为“FAME”。
“干细胞重量”缩写为“DCW”。
如本文所用,术语“发明”或“本发明”不旨在限制而是通常适用于权利要求中的或本文所述的任何发明。
术语“过氧化物酶体”指普遍存在于所有真核细胞中的细胞器。它们具有单独的脂双层膜,该膜将它们的内容物与细胞溶质分离开来,并且包含多种下文所述功能必需的膜蛋白。过氧化物酶体经由“扩展的穿梭机制(extended shuttle mechanism)”选择性输入蛋白。更具体地讲,存在至少32个已知的过氧化物酶体蛋白(也称为peroxin),它们参与通过ATP水解穿过过氧化物酶体膜而输入蛋白的过程。一些过氧化物酶体蛋白在其N-末端或C-末端包含特异性蛋白信号,即,过氧化物酶体靶信号或“PTS”,它们引导蛋白通过过氧化物酶体膜。一旦细胞蛋白进入过氧化物酶体,它们通常通过一些方法降解。例如,过氧化物酶体包含氧化酶,例如过氧化氢酶、D-氨基酸氧化酶和尿酸氧化酶,这些酶能够降解对细胞有毒的物质。作为另外一种选择,过氧化物酶体降解脂肪酸分子以生成乙酰-CoA游离分子,它们被回输到细胞溶质中,该过程称为β-氧化。
术语“过氧化物酶体生物合成因子蛋白”、“过氧化物酶体蛋白”和“Pex蛋白”是可互换的,并且指涉及过氧化物酶体生物合成的和/或参与通过ATP水解使细胞蛋白穿过过氧化物酶体膜的过程的蛋白。编码任何这些蛋白的基因的首字母缩写是“Pex基因”。Pex基因的命名体系描述于Distel等人,J.Cell Biol.,135:1-3(1996)。迄今已经在多种真核生物中鉴定了至少32个不同的Pex基因。已经从对突变体的分析中分离出了多种Pex基因,所述突变体显示具有异常的过氧化物酶体功能或结构。根据Kiel,J.A.K.W.等人的文献(Traffic,7:1291-1303(2006)),其中对17个不同真菌菌种中的基因组序列进行了电脑模拟的信息学分析(silico analysis),鉴定了以下Pex蛋白:Pex1p、Pex2p、Pex3p、Pex3Bp、Pex4p、Pex5p、Pex5Bp、Pex5Cp、Pex5/20p、Pex6p、Pex7p、Pex8p、Pex10p、Pex12p、Pex13p、Pex14p、Pex15p、Pex16p、Pex17p、Pex14/17p、Pex18p、Pex19p、Pex20p、Pex21p、Pex21Bp、Pex22p、Pex22p样和Pex26p。因此,本文将这些蛋白中的每一个称为“Pex蛋白”、“过氧化物酶体蛋白”或“过氧化物酶体生物合成因子蛋白”,并且每个蛋白由至少一个“Pex基因”编码。
术语“保守结构域”或“基序”指进化上相关的蛋白质的比对序列中在特定位置处保守的一组氨基酸。虽然同源蛋白质之间在其它位置的氨基酸可以发生变化,但在特定位置高度保守的氨基酸表明为是对蛋白质的结构、稳定性或活性来说必需的氨基酸。因为它们可通过它们在蛋白质同系物家族的比对序列中的高度保守来鉴定,所以它们可用作识别标签或“签名”来确定具有新的测定序列的蛋白质是否属于以前鉴定的蛋白质家族。参考本文相关部分,Pex2p、Pex10p和Pex12p在它们的羧基末端都有一个富含半胱氨酸的基序,已知该基序是C3HC4环状锌指基序。该基序是它们的活性所必需的,涉及蛋白停靠并转移到过氧化物酶体中(Kiel,J.A.K.W.,等人,Traffic,7:1291-1303(2006))。
术语“C3HC4环状锌指基序”或“C3HC4基序”一般指结合两个锌离子的保守半胱氨酸富含基序,它们通过如式I所示的氨基酸序列进行鉴定:
式I:CX2CX9-27CX1-3HX2CX2CX4-48CX2C
在解脂耶氏酵母的编码过氧化物酶体生物合成因子10蛋白(即YlPex10p)的基因中的C3HC4环状锌指基序,位于SEQ ID NO:10的氨基酸327-364之间,并且由CX2CX11CX1HX2CX2CX10CX2C基序(SEQ IDNO:25)限定。在解脂耶氏酵母的编码过氧化物酶体生物合成因子2蛋白(即YlPex2p)的基因中的C3HC4环状锌指基序,位于SEQ ID NO:2的氨基酸266-323之间。解脂耶氏酵母过氧化物酶体生物合成因子12蛋白,即YlPex12p,包含位于SEQ ID NO:11的氨基酸342-391之间的不完全C3HC4环指基序。图2A列出了对应于YlPex10、YlPex2和YlPex12的C3HC4环状锌指基序的蛋白序列;星号表示基序的保守半胱氨酸或组氨酸残基。
认为YlPex10、YlPex2和YlPex12通过蛋白-蛋白相互作用形成环指复合物。本发明的YlPex10p C3HC4指状基序(具有两个锌残基)的胱氨酸和组氨酸残基间相互作用在图2B中用图表示出。
术语“Pex10”指编码过氧化物酶体生物合成因子10蛋白或过氧化物酶体装配蛋白Peroxin 10的基因,其中所述peroxin蛋白在下文中称为“Pex10p”。Pex10p的功能尚未被清楚的阐明,虽然对其它生物的研究已经揭示Pex10产物位于过氧化物酶体膜内并且是细胞器正常功能所必需的。C3HC4环状锌指基序在Pex10p的C-末端区域是保守的(Kalish,J.E.等人,Mol.Cell Biol.,15:6406-6419(1995);Tan,X.等人,J.CellBiol.,128:307-319(1995);Warren,D.S.,等人,Am.J.Hum.Genet.,63:347-359(1998)),并且是酶活性所必需的。
术语“YlPex10”指解脂耶氏酵母的编码过氧化物酶体生物合成因子10蛋白的基因,其中所述蛋白在下文中称为“YlPex10p”。该特定的peroxin近来由Sumita等人(FEMS Microbiol.Lett.,214:31-38(2002))进行了研究。YlPex10的核苷酸序列在GenBank中注册了多个保藏号,包括GenBank保藏号CAG81606(SEQ ID NO:10)、AB036770(SEQ IDNOs:20、21和22)以及AJ012084(SEQ ID NO:23和24)。如在SEQ IDNO:24中所示的YlPex10p的序列长度为354个氨基酸。相比之下,如在SEQ ID NO:10和SEQ ID NO:22中所示的YlPex10p序列每个长度为377个氨基酸,它们100%相同的序列在蛋白N-末端具有附加地23个氨基酸(对应于与GenBank保藏号AJ012084(SEQ ID NO:24)中鉴定的起始密码子不同的起始密码子)。
术语“Pex3”指编码过氧化物酶体生物合成因子3蛋白或过氧化物酶体装配蛋白Peroxin 3的基因,其中peroxin蛋白在下文中称为“Pex3p”。虽然尚未清楚地了解关于Pex3p的机制细节,但是清楚的是Pex3p是在早期的过氧化物酶体生物合成中所需的过氧化物酶体整合膜蛋白,该蛋白用于形成过氧化物酶体膜(参见例如Baerends,R.J.等人,J.Biol.Chem.,271:8887-8894(1996);Bascom,R.A.等人,Mol.Biol.Cell,14:939-957(2003))。
术语“YlPex3”指解脂耶氏酵母的编码过氧化物酶体生物合成因子3蛋白的基因,其中所述蛋白在下文中称为“YlPex3p”。YlPex3的核苷酸序列在GenBank中登记为保藏号CAG78565(SEQ ID NO:3)。
术术语“Pex16”指编码过氧化物酶体生物合成因子16蛋白或过氧化物酶体装配蛋白Peroxin 16的基因,其中peroxin蛋白在下文中称为“Pex16p”。虽然对多种生物的研究已经揭示Pex16产物在形成过氧化物酶体膜和调节过氧化物酶体增殖中起到作用,但是Pex16p的功能仍未被清楚地阐明(Platta,H.W.和R.Erdmann,Trends Cell Biol.,17(10):474-484(2007))。
术语“YlPex16”指解脂耶氏酵母的编码过氧化物酶体生物合成因子16蛋白的基因,其中所述蛋白在下文中称为“YlPex16p”。该特定peroxin描述于Elizen G.A.等人(J.Cell Biol.,137:1265-1278(1997))和Titorenko,V.I.等人(Mol.Cell Biol.,17:5210-5226(1997))。YlPex16的核苷酸序列在GenBank中登记为保藏号CAG79622(SEQ ID NO:14)。
在天然Pex基因中的或与之相关的术语“破坏”指在一部分该基因中的插入、缺失、或定向突变,所述破坏导致全部基因敲除使得该基因从基因组中缺失并且不翻译蛋白,或者导致翻译的Pex蛋白具有插入、缺失、氨基酸取代或其它定向突变。蛋白中的破坏位置可以是在例如蛋白的N-末端部分中或在蛋白的C-末端部分中。破坏的Pex蛋白相对于未破坏的Pex蛋白将具有减弱的活性,并且可能是无功能的。在编码Pex蛋白的天然基因中的破坏也包括导致Pex蛋白低表达或缺乏表达的替代方法,例如能够经由操纵调控序列、转录和翻译因子和/或信号转导途径或使用有义、反义或RNAi技术等方法导致破坏。
如本文所用,术语“PEX破坏的生物”指包含以下基因的任何含油真核生物,所述基因编码功能性多不饱和脂肪酸生物合成途径并且在编码过氧化物酶体生物合成因子蛋白的天然基因中具有上述破坏。
术语“脂质”指任何脂溶性的(即,亲脂的)、天然存在的分子。脂质是具有多种关键生物学功能的化合物的不同组,所述功能例如细胞膜的结构组分、能量贮存来源和信号途径的中间体。可以将脂质广泛定义为疏水性或两亲性小分子,它们完全地或部分地起源于酮脂酰或异戊二烯基团。表2显示了对脂质的综述,它基于Lipid Metabolites andPathways Strategy(LIPID MAPS)分类系统(National Institute of GeneralMedical Sciences,Bethesda,MD)。
表2
脂质类别综述
本文术语细胞的“总脂质级分”指细胞的所有酯化脂肪酸。可分离在总脂质级分中的不同亚级分,包括三酰基甘油[“油”]级分、磷脂酰胆碱级分和磷脂酰乙醇胺级分,但是这并不包括所有亚级分。
“脂质体”指通过单层磷脂以及通常通过特异性蛋白结合的脂质小滴。这些细胞器是大多数生物运输/贮存中性脂类的位点。认为脂质体来源于包含TAG生物合成酶的内质网的微区。它们的合成及尺寸受特异性蛋白组分的控制。
“中性脂类”指那些一般以贮存脂肪和油形式存在于脂质体中的细胞中的脂质,它们的名称是因为在细胞pH下,所述脂质无带电基团。它们一般是完全非极性的,对水无亲和力。中性脂类一般指脂肪酸的甘油单酯、二酯、和/或三酯,也分别称为单酰基甘油、二酰基甘油或三酰基甘油,或统称为酰基甘油。为了从酰基甘油中释放游离脂肪酸,必须发生水解反应。
术语“三酰基甘油”[“TAG”]和“油”是可互换的,它们指由酰化甘油分子的三个脂肪酰残基组成的中性脂类。TAG能够包含长链PUFA,以及较短的饱和的和不饱和的脂肪酸以及较长链的饱和脂肪酸。细胞的TAG级分也称作“油级分”,并且“油的生物合成”一般指在细胞中合成TAG。油或TAG级分是总脂质级分的亚级分,虽然它也是构成总脂质含量的主要部分,总脂质含量以含油生物细胞中的总脂肪酸重量占干细胞重量[参见下文]的百分比表示。油[“TAG”]级分中的脂肪酸组成和总脂质级分的脂肪酸组成一般是相似的。因此,总脂质级分中的PUFA浓度的提高或降低将对应于油[“TAG”]级分中的PUFA浓度的升高或降低,反之亦然。
术语“总脂肪酸”[“TFA”]本文指所有细胞脂肪酸的总量,所述脂肪酸在给定实例中可通过碱酯交换方法(本领域已知的方法)被衍生化成脂肪酸甲酯[“FAME”],例如它可以是总脂质级分或油级分。因此,总脂肪酸包括来自中性和极性脂质级分的脂肪酸,包括磷脂酰胆碱级分、磷脂酰乙醇胺级分(phosphatidyletanolamine fraction)和二酰基甘油、单酰基甘油和三酰基甘油[“TAG或油”]级分,但是不包括游离脂肪酸。
术语细胞的“总脂质含量”是TFA的量度,以干细胞重量[“DCW”]百分比的形式表示。因此,总脂质含量[“TFA%DCW”]等同于例如每100毫克DCW的总脂肪酸毫克数。
脂肪酸浓度本文一般表示为TFA的重量百分比[“%TFA”],例如每100毫克TFA的给定脂肪酸毫克数。除非在本文公开内容中另作具体说明,给定脂肪酸相对于总脂质的百分比等同于以%TFA表示的脂肪酸浓度(例如总脂质的%EPA等同于EPA%TFA)。
在一些情况下,可使用细胞中给定脂肪酸占干细胞重量的百分比[“%DCW”]形式表达给定脂肪酸的含量。因此例如二十碳五烯酸%DCW将根据下式进行测定:(二十碳五烯酸%TFA)*(TFA%DCW)]/100。
术语“脂质分布”和“脂质组成”是可互换的,并且指在特定脂质级分(例如在总脂质级分或油[“TAG”]级分中)中包含的单个脂肪酸的量,其中所述量用TFA百分比形式表示。混合物中存在的各单个脂肪酸的总量应当是100。
如本文所用,术语“成倍增加”指通过乘以某数获得的增加。例如,通过乘以1.3的数量、量、浓度、重量百分比等,提供1.3倍的增加。
术语“脂肪酸”指不同链长的长链脂族酸(链烷酸),链长为约C12至C22(尽管更长和更短链长的酸均是已知的)。主链长介于C16和C22之间。脂肪酸的结构可用简单的记号系统“X:Y”来表示,其中X表示具体脂肪酸中碳(“C”)原子的总数,而Y表示双键的数目。另外的关于“饱和脂肪酸”与“不饱和脂肪酸”、“单不饱和脂肪酸”与“多不饱和脂肪酸”(“PUFA”)以及“ω-6脂肪酸”(ω-6或n-6)与“ω-3脂肪酸”(ω-3或n-3)之间的区别的详细信息在美国专利7,238,482中有所提供。
表3提供了用于描述本文PUFA的命名。在标题为“简化符号”一栏中,ω-指代系统用于表明碳数目、双键的数目和最接近ω碳的双键位置,双键位置的计数从ω碳开始(为此ω碳的编号为1)。该表的其余部分汇总了ω-3和ω-6脂肪酸及其前体的俗名、在整个说明书中使用的缩写以及每种化合物的化学名称。
表3
多不饱和脂肪酸及其前体的命名
俗名 | 缩写 | 化学名称 | 简化符号 |
肉豆蔻酸 | -- | 十四酸 | 14:0 |
棕榈酸 | 棕榈酸 | 十六酸 | 16:0 |
棕榈油酸 | -- | 9-十六碳烯酸 | 16:1 |
硬脂酸 | -- | 十八酸 | 18:0 |
油酸 | -- | 顺式-9-十八碳烯酸 | 18:1 |
亚油酸 | LA | 顺式-9,12-十八碳二烯酸 | 18:2 ω-6 |
γ-亚麻酸 | GLA | 顺式-6,9,12-十八碳三烯酸 | 18:3 ω-6 |
附子脂酸 | EDA | 顺式-11,14-二十碳二烯酸 | 20:2 ω-6 |
二高-γ-亚麻酸 | DGLA | 顺式-8,11,14-二十碳三烯酸 | 20:3 ω-6 |
花生四烯酸 | ARA | 顺式-5,8,11,14-二十碳四烯酸 | 20:4 ω-6 |
俗名 | 缩写 | 化学名称 | 简化符号 |
α-亚麻酸 | ALA | 顺式-9,12,15-十八碳三烯酸 | 18:3 ω-3 |
硬脂艾杜糖酸 | STA | 顺式-6,9,12,15-十八碳四烯酸 | 18:4 ω-3 |
二十碳三烯酸 | ETrA | 顺式-11,14,17-二十碳三烯酸 | 20:3 ω-3 |
金松烯酸 | SCI | 顺式-5,11,14-二十碳三烯酸 | 20:3b ω-6 |
Juniperonic | JUP | 顺式-5,17,11,14-二十碳四烯酸 | 20:4b ω-3 |
二十碳四烯酸 | ETA | 顺式-8,11,14,17-二十碳四烯酸 | 20:4 ω-3 |
二十碳五烯酸 | EPA | 顺式-5,8,11,14,17-二十碳五烯酸 | 20:5 ω-3 |
俗名 | 缩写 | 化学名称 | 简化符号 |
二十二碳三烯酸 | DRA | 顺式-10,13,16-二十二碳三烯酸 | 22:3 ω-3 |
二十二碳四烯酸 | DTA | 顺式-7,10,13,16-二十二碳四烯酸 | 22:4 ω-3 |
二十二碳五烯酸 | DPAn-6 | 顺式-4,7,10,13,16-二十二碳五烯酸 | 22:5 ω-6 |
二十二碳五烯酸 | DPA | 顺式-7,10,13,16,19-二十二碳五烯酸 | 22:5 ω-3 |
二十二碳六烯酸 | DHA | 顺式-4,7,10,13,16,19-二十二碳六烯酸 | 22:6 ω-3 |
虽然使用本文所述方法,表3列出的ω-3/ω-6PUFA最可能在含油酵母的油级分中积聚,但是该列表不应理解为限制性的或完全的。
如本文所用,术语“多不饱和脂肪酸的组合”或“多不饱和脂肪酸的任何组合”指上表3中列出的任何两种或更多种多不饱和脂肪酸的混合物。此类组合具有能相对于细胞中的多种浓度或重量百分比进行测量,包括相对于细胞中的总脂肪酸的重量百分比进行测量的浓度和重量百分比属性。
代谢途径或生物合成途径在生物化学意义上可以认为是发生于细胞内由酶催化的一系列化学反应,以实现细胞待使用的或待贮存的代谢产物的形成,或启动另一代谢途径(称作流量产生步骤)。很多此类途径均很精细,并涉及对起始物质的逐步修饰以使之形成具有期望的精确化学结构的产物。
术语“PUFA生物合成途径”指将油酸转化成诸如LA、EDA、GLA、DGLA、ARA、DRA、DTA和DPAn-6之类的ω-6脂肪酸和诸如ALA、STA、ETrA、ETA、EPA、DPA和DHA之类的ω-3脂肪酸的代谢过程。文献中详细描述了该过程。参见例如Int′.App.Pub.No.WO 2006/052870。简而言之,该过程涉及通过添加碳原子来延长碳链和通过加入双键来使分子去饱和,这通过存在于内质网膜内的一系列特异性去饱和酶和延伸酶(称为“PUFA生物合成途径酶”)进行。更具体地讲,“PUFA生物合成途径酶”指如下与PUFA生物合成相关的任何酶(以及编码它们的基因),所述酶包括:Δ4去饱和酶、Δ5去饱和酶、Δ6去饱和酶、Δ12去饱和酶、Δ15去饱和酶、Δ17去饱和酶、Δ9去饱和酶、Δ8去饱和酶、Δ9延伸酶、C14/16延伸酶、C16/18延伸酶、C18/20延伸酶和/或C20/22延伸酶。
术语“ω-3/ω-6脂肪酸生物合成途径”指在合适条件下表达时编码催化ω-3和ω-6脂肪酸二者之一或两者产生的酶的一组基因。通常,参与ω-3/ω-6脂肪酸生物合成途径的基因编码PUFA生物合成途径酶。图1示出了一条代表性途径,提供了从肉豆蔻酸经过多种中间产物向DHA的转化,演示了ω-3和ω-6脂肪酸两者是如何可以从共同来源产生。该途径自然分成两部分,使得一部分只生成ω-3脂肪酸而另一部分只生成ω-6脂肪酸。只产生ω-3脂肪酸的部分在本文中将称作ω-3脂肪酸生物合成途径,而只产生ω-6脂肪酸的部分在本文中称作ω-6脂肪酸生物合成途径。
如本文中关于ω-3/ω-6脂肪酸生物合成途径所用的,术语“功能性的”指该途径中的一些(或全部)基因表达活性酶,导致体内催化或底物转化。应当理解,“ω-3/ω-6脂肪酸生物合成途径”或“功能性ω-3/ω-6脂肪酸生物合成途径”并不意味着上面段落中列出的所有基因都是必需的,因为许多脂肪酸产物将仅需要表达该途径中的一亚组基因。
术语“Δ6去饱和酶/Δ6延伸酶途径”指最低程度包括至少一种Δ6去饱和酶和至少一种C18/20延伸酶的PUFA生物合成途径,从而使得能分别从LA和ALA开始,以GLA和/或STA作为脂肪酸中间产物来生物合成DGLA和/或ETA。通过其它去饱和酶和延伸酶的表达,还可以合成ARA、EPA、DPA和DHA。
术语“Δ9延伸酶/Δ8去饱和酶途径”指最低程度包括至少一种Δ9延伸酶和至少一种Δ8去饱和酶的PUFA生物合成途径,从而使得能分别从LA和ALA开始,以EDA和/或ETrA作为脂肪酸中间产物来生物合成DGLA和/或ETA。通过其它去饱和酶和延伸酶的表达,还可以合成ARA、EPA、DPA和DHA。
术语“去饱和酶”指能够通过从邻接碳原子中的一个上移除氢原子、并且从而在碳原子之间导入双键来去饱和脂肪酸中的邻接碳原子的多肽。去饱和产生了脂肪酸或受关注的前体。尽管在整个说明书中使用ω-指代系统来指代特定的脂肪酸,但使用Δ-系统从底物的羧基端计数来表示去饱和酶的活性更方便。本文特别关注的是:1)Δ5去饱和酶,它催化脂肪酸底物DGLA转化成ARA和/或催化脂肪酸底物ETA转化成EPA;2)Δ17去饱和酶,它在从分子羧基末端编号为第17和第18的碳原子之间去饱和脂肪酸,并且它例如催化底物脂肪酸ARA转化成EPA和/或催化底物脂肪酸DGLA转化成ETA;3)Δ6去饱和酶,它催化脂肪酸底物LA转化成GLA和/或催化脂肪酸底物ALA转化成STA;4)Δ12去饱和酶,它催化底物脂肪酸油酸转化成LA;5)Δ15去饱和酶,它催化脂肪酸底物LA转化成ALA和/或催化脂肪酸底物GLA转化成STA;6)Δ4去饱和酶,它催化底物脂肪酸DPA转化成DHA和/或催化底物脂肪酸DTA转化成DPAn-6;7)Δ8去饱和酶,它催化底物脂肪酸EDA转化成DGLA和/或催化底物脂肪酸ETrA转化成ETA;和,8)Δ9去饱和酶,它催化底物脂肪酸棕榈酸转化成棕榈油酸(16:1)和/或催化底物脂肪酸硬脂酸转化成油酸。基于Δ15和Δ17去饱和酶将ω-6脂肪酸转化成它们的ω-3对应物的能力(如分别将LA转化成ALA以及将ARA转化成EPA),偶尔也将它们称作“ω-3去饱和酶”、“W-3去饱和酶”和/或“ω-3去饱和酶”。所期望的是通过用脂肪酸去饱和酶的基因来转化合适的宿主并测定它对该宿主脂肪酸分布的作用,从而经验性地测定特定脂肪酸去饱和酶的特异性。
术语“延伸酶”指能延长脂肪酸碳链从而产生比该延伸酶作用于其上的脂肪酸底物长2个碳原子的酸的多肽。该延长过程在与脂肪酸合酶相关的多步骤机制中发生,如美国专利公开2005/0132442和国际专利公开WO 2005/047480所述。延伸酶体系催化反应的实例如GLA转化成DGLA、STA转化成ETA以及EPA转化成DPA。通常,延伸酶的底物选择性有些广泛,但由链长度和不饱和的程度及类型两者来区分。例如,C14/16延伸酶利用C14底物(如肉豆蔻酸),C16/18延伸酶利用C16底物(如棕榈酸),C18/20延伸酶(也称为Δ6延伸酶,两个术语可以互换使用)利用C18底物(如GLA、STA),而C20/22延伸酶利用C20底物(如EPA)。以类似方式,Δ9延伸酶能够催化LA和ALA分别转化成EDA和ETrA。重要的是,须注意一些延伸酶具有广泛的特异性因而单个酶可能能够催化几种延伸酶反应。例如单个酶可因此作为C16/18延伸酶和C18/20延伸酶。
术语“转化效率”和“底物转化百分比”指特定酶(如去饱和酶)能够将底物转化成产物的效率。转化效率根据下面的公式测量:([产物]/[底物+产物])*100,其中‘产物’包括中间产物和该途径中来源于它的所有产物。
术语“含油的”指那些倾向于以油形式贮存它们的能源的生物(Weete,Fungal Lipid Biochemistry,第2版,Plenum,1980)。
术语“含油酵母”指那些能制造油(即TAG)的、分类为酵母的微生物。通常,含油微生物的细胞油或TAG含量符合S形曲线,其中脂质浓度增加直至在对数生长期晚期或稳定生长期早期它达到最高浓度,随后在稳定生长期晚期和死亡期期间逐渐下降(Yongrmanitchai和Ward,Appl.Environ.Microbiol.,57:419-25(1991))。本文所述的含油微生物通常积聚超过它们的干细胞重量约25%的油或TAG。含油酵母的实例包括但不限于如下属:耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)和油脂酵母属(Lipomyces)。
如本文所用的,术语“分离的核酸片段”和“分离的核酸分子”可互换使用,并且是指单链或双链的,任选含有合成的、非天然的或改变了的核苷酸碱基的RNA或DNA聚合物。-DNA聚合物形式的分离的核酸片段可由cDNA、基因组DNA或合成DNA的一个或多个片段构成。
当在合适的温度和溶液离子强度条件下单链形式的核酸片段可以退火至另一核酸片段时,核酸片段“可杂交”至另一核酸片段,例如cDNA、基因组DNA或RNA分子。杂交条件和洗涤条件是众所周知的,并在Sambrook,J.,Fritsch,E.F.和Maniatis,T.Molecular Cloning:A Laboratory Manual,第2版,Cold Spring Harbor Laboratory:Cold SpringHarbor,NY(1989)中得到举例说明。
氨基酸或核苷酸序列的“基本部分”指这样的部分,该部分包括的多肽的氨基酸序列或基因的核苷酸序列足以推定鉴定所述多肽或基因,所述鉴定或者可以由本领域技术人员通过人工评价序列来完成,或者可以利用诸如BLAST(Basic Local Alignment Search Tool;Altschul,S.F.等人,J.Mol.Biol.,215:403-410(1993))之类的算法通过计算机自动化序列比较和鉴定来完成。一般来讲,为了推测鉴定多肽或核酸序列是否与已知的蛋白质或基因同源,需要有10个或更多邻接氨基酸或者30个或更多邻接核苷酸的序列。此外,对于核苷酸序列,包含20-30个邻接核苷酸的基因特异性寡核苷酸探针可用于序列依赖性的基因鉴定(如DNA杂交)和基因分离(如细菌菌落或噬斑的原位杂交)的方法中。此外,12至15个碱基的短寡核苷酸可在PCR中用作扩增引物,以便获得包含该引物的特定核酸片段。因此,核苷酸序列的“基本部分”所包含的序列应足以特异性地鉴定和/或分离包含该序列的核酸片段。
术语“互补的”用于描述核苷酸碱基之间能够彼此杂交的关系。例如,对于DNA,腺嘌呤与胸腺嘧啶互补,而胞嘧啶与鸟嘌呤互补。
术语“同源性”或“同源”本文互换使用。它们指这样的核酸片段,即其中一个或多个核苷酸碱基改变并不会影响该核酸片段介导基因表达或产生某种表型的能力。这些术语也指本文所述的Pex核酸片段的修饰(例如缺失或插入一个或多个核苷酸),相对于初始的未经修饰的核酸片段,该修饰基本上不会改变所得核酸片段的功能特性。
此外,技术人员认识到,同源核酸序列也由它们在中等严格条件(如0.5×SSC,0.1%SDS,60℃)下,与本文所示例的序列杂交的能力,或杂交至本文公开的核苷酸序列的任何部分以及杂交至与其功能相当的序列的能力所限定。
“密码子简并性”指允许核苷酸序列在不影响所编码的多肽的氨基酸序列的情况下发生变化的遗传密码的性质。技术人员非常了解具体宿主细胞在使用核苷酸密码子确定给定氨基酸时所表现出的“密码子偏好性”。因此,当合成基因用以改善在宿主细胞中的表达时,希望对基因进行设计,使得其密码子使用频率接近该宿主细胞优选的密码子使用频率。
“合成的基因”可由使用本领域技术人员已知的方法化学合成的寡核苷酸构件装配而成。将这些寡核苷酸基本单位构件进行退火并随后连接以形成基因节段,该基因节段随后在酶促作用下装配而构建成完整的基因。因此,基于最优化核苷酸序列以反映宿主细胞的密码子偏好性,可以定制基因用以最优化基因表达。如果密码子使用偏向于宿主偏好的那些密码子,则技术人员能预期成功的基因表达的可能。优选的密码子的确定可基于对来源于宿主细胞的基因(其中序列信息可获得)的检测。
“基因”指表达特定蛋白的核酸片段,并且可以指单独的编码区或可以包含位于编码序列之前的调控序列(5′非编码区)和之后的调控序列(3′非编码区)。“天然基因”是指天然存在的具有其自己的调控序列的基因。“嵌合基因”是指不是天然基因的任何基因,包含在天然情况下不是一起存在的调控序列和编码序列。因此,嵌合基因可包括源于不同来源的调控序列和编码序列,或者包括源于同一来源但以不同于天然存在的方式排列的调控序列和编码序列。“内源性基因”指位于生物基因组内它的天然位置的天然基因。“外来”基因指通过基因转移导入到宿主生物内的基因。外来基因可包括插入到非天然生物内的天然基因、导入到天然宿主内的新位置的天然基因,或嵌合基因。“转基因”是已通过转化方法导入基因组内的基因。“密码子优化的基因”是其密码子使用频率经设计用以模仿宿主细胞优选的密码子使用频率的基因。
“编码序列”指编码特定氨基酸序列的DNA序列。“合适的调控序列”指位于编码序列的上游(5′非编码序列)、中间或下游(3′非编码序列)的核苷酸序列,其可影响相关编码序列的转录、RNA加工或稳定性或者翻译。调控序列可包括启动子、增强子、静默子、5′非翻译前导序列(例如在转录起始位点和翻译启动密码子之间的序列)、内含子、多腺苷酸化识别序列、RNA加工位点、效应子结合位点和茎-环结构。
“启动子”指能够控制编码序列或功能性RNA表达的DNA序列。一般来讲,编码序列位于启动子序列的3′端。启动子可整个源于天然基因,或者由源于不同的天然存在的启动子的不同元件组成,或者甚至包含合成的DNA片段。本领域内的技术人员应当理解,不同的启动子可以在不同的组织或细胞类型中,或者在不同的发育阶段,或者响应不同的环境条件或生理条件而引导基因的表达。导致基因在大部分时间内在大多数细胞类型中表达的启动子通常称为“组成型启动子”。还应当进一步认识到,由于在大多数情况下调控序列的确切边界尚未完全确定,因此不同长度的DNA片段可能具有相同的启动子活性。
术语“3′非编码序列”和“转录终止子”指位于编码序列下游的DNA序列。这包括多腺苷酸化识别序列和编码能影响mRNA加工或基因表达的调控信号的其它序列。多腺苷酸化信号通常特征在于影响多腺苷酸片添加到mRNA前体的3′末端。3′区可影响相关编码序列的转录、RNA加工或稳定性或翻译。
“RNA转录物”指由RNA聚合酶催化DNA序列的转录所产生的产物。当RNA转录物是DNA序列的完全互补的拷贝时,它被称为初级转录物,或者它可以是源自初级转录物的转录后加工的RNA序列并被称作成熟RNA。“信使RNA”或“mRNA”指无内含子并且可以由细胞翻译成蛋白质的RNA。“eDNA”指与mRNA互补并源于mRNA的双链DNA。“有义”RNA指包含mRNA并因而能由细胞翻译成蛋白质的RNA转录物。“反义RNA”指与全部或部分靶初级转录物或mRNA互补,并阻止靶基因表达的RNA转录物(美国专利5,107,065;国际申请公开WO 99/28508)。反义RNA可以与特定基因转录物的任何部分,即5′非编码序列、3′非编码序列或编码序列互补。“功能性RNA”指反义RNA、核酶RNA或其它不被翻译但是对细胞过程有影响作用的RNA。
术语“可操作地连接”指单个核酸片段上的核酸序列的关联,使得其中一个核酸序列的功能受到另一个核酸序列的影响。例如,当启动子能够影响编码序列的表达时,它可操作地连接编码序列。即,编码序列处于启动子的转录控制下。编码序列可以以有义或反义的取向可操作地连接至调控序列。
如本文所用,术语“表达”指源于核酸片段的有义RNA(mRNA)或反义RNA的转录和稳定积聚。表达也可指将mRNA翻译成多肽。
“成熟”蛋白指经翻译后加工的多肽,即已经去除了存在于初始翻译产物中的任何前肽或肽原的多肽。“前体”蛋白质指mRNA的初级翻译产物,即前肽和肽原仍然存在。前肽和肽原可以是但不限于细胞内定位信号。
“转化”指将核酸分子转移至宿主生物中,导致在遗传上稳定遗传。例如,核酸分子可以是自主复制的质粒,或者它可以整合进宿主生物的基因组中。含有转化核酸片段的宿主生物被称为“转基因”或“重组”或“转化”生物体。
“稳定转化”指将核酸片段转移至宿主生物的基因组(包括核基因组和细胞器基因组)中,导致在遗传上稳定遗传。相反,“瞬时转化”指将核酸片段转移至宿主生物的核中或包含DNA的细胞器中,导致基因表达而不整合或不稳定遗传。含有转化核酸片段的宿主生物被称为“转基因”生物体。
术语“质粒”和“载体”指通常携带有不属于细胞中心代谢部分的基因的染色体外元件,并且常常是环状双链DNA片段的形式。这类元件可以是源自任何来源的自主复制序列、基因组整合序列、噬菌体或单链或双链DNA或RNA的核苷酸序列(线性或环状),其中多个核苷酸序列已连接或重组为一种独特构建体,该独特构建体能够将表达盒引入细胞中。
术语“表达盒”指包含如下编码序列的DNA片段:所选基因的编码序列和所选基因产物表达所需的位于编码序列之前(5′非编码序列)和之后(3′非编码序列)的调控序列。因此,表达盒通常由如下序列构成:(1)启动子序列;2)编码序列,即,开放阅读框[“ORF”]和,3)3′非翻译区域,即,在真核细胞中通常包含聚腺苷酸位点的终止子。表达盒通常包含于载体中以有利于克隆和转化。可以将不同表达盒转化进包括细菌、酵母、植物和哺乳动物细胞在内的不同生物体中,只要能针对每种宿主使用正确的调控序列。
术语“同一性百分比”指两种或更多种多肽序列之间或两种或更多种多核苷酸序列之间的关系,该关系通过对序列进行比较而确定。“同一性”还表示多肽或多核苷酸序列之间的序列关联的程度,根据具体情况,它由比较序列的序列串之间的匹配百分比确定。“同一性百分比”和“相似性百分比”可容易地通过已知方法计算出来,所述的方法包括但不限于以下文献中所描述的那些:1.)Computational Molecular Biology(Lesk,A.M.编辑)Oxford University:NY(1988);2)Biocomputing: Informatics and Genome Projects(Smith,D.W.编辑)Academic:NY(1993);3.)Computer Analysis of Sequence Data,Part I(Griffin,A.M.和Griffin,H.G.编辑)Humania:NJ(1994);4)Sequence Analysis in Molecular Biology(von Heinje,G.编辑)Academic(1987);和5.)Sequence Analysis Primer(Gribskov,M.和Devereux,J.编辑)Stockton:NY(1991)。
设定确定同一性百分比的优选方法来用于给出待测试序列之间的最佳匹配。用于测定同一性百分比和相似性百分比的方法在公开可获得的计算机程序中进行编辑。序列比对和同一性百分比计算可以用LASERGENE生物信息学计算软件包(LASERGENE bioinformaticscomputing suite(DNASTAR Inc.,Madison,WI))中的MegAlignTM程序进行。序列的多重比对采用包括几种改变形式的算法在内的“Clustal比对方法”进行,包括“Clustal V比对方法”和“Clustal W比对方法”(在Higgins和Sharp,CABIOS,5:151-153(1989);Higgins,D.G.等人,Comput.Appl.Biosci.,8:189-191(1992)中有所描述)和可以在LASERGENE生物信息学计算软件包(DNASTAR Inc.)中的MegAlignTMv6.1程序中找到的比对方法。用Clustal程序比对序列后,可通过查看程序中的“序列距离”表来获得“同一性百分比”。
本领域的技术人员非常清楚,多种程度的序列同一性百分比可用于从其它物种中鉴定多肽,其中这类多肽具有相同或相似的功能或活性。同一性百分比的有用实例包括但不限于50%、55%、60%、65%、70%、75%、80%、85%、90%、或95%、或从50%至100%的任何整数百分比。实际上,从50%至100%的任何整数氨基酸同一性可用于描述在本文所述的方法和宿主细胞中编码多肽的合适核酸片段(分离的多核苷酸),例如51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%。在一些情况下,合适的核酸片段(分离的多核苷酸)编码与本文报道的氨基酸序列至少约70%相同,优选地至少约75%相同,更优选地至少约80%相同的多肽。优选的核酸片段编码与本文所报道的氨基酸序列具有至少约85%同一性的氨基酸序列。更优选的核酸片段编码与本文所报道的氨基酸序列具有至少约90%同一性的氨基酸序列。最优选的是编码与本文所报道的氨基酸序列具有至少约95%同一性的氨基酸序列的核酸片段。
合适的核酸片段不但具有上述同源性,而且通常编码具有至少50个氨基酸,优选至少100个氨基酸,更优选至少150个氨基酸,更优选至少200个氨基酸,最优选至少250个氨基酸的多肽。
术语“序列分析软件”指可用于分析核苷酸或氨基酸序列的任何计算机算法或软件程序。“序列分析软件”可商购获得或独立开发。典型的序列分析软件包括但不限于:1.)GCG程序包(Wisconsin PackageVersion 9.0,Genetics Computer Group(GCG),Madison,WI);2)BLASTP,BLASTN,BLASTX(Altschul等人,J.Mol.Biol.,215:403-410(1990));3)DNASTAR(DNASTAR,Inc.Madison,WI);4)Sequencher(Gene Codes Corporation,Ann Arbor,MI);和5.)整合了Smith-Waterman算法的FASTA程序(W.R.Pearson,Comput.MethodsGenome Res.,[Proc.Int.Symp.](1994),召开年份:1992,111-20.编辑:Suhai,Sandor.Plenum:New York,NY)。在这一描述中,除非另外指明,只要序列分析软件用于分析,分析结果都基于所用程序的“默认值”。在此所用的“默认值”指在首次初始化软件时软件最初加载的任何值或参数集。
本文使用的标准重组DNA和分子克隆技术是本领域熟知的并且已经在如下文献中有所描述:Sambrook,J.,Fritsch,E.F.和Maniatis,T.,Molecular Cloning:A Laboratory Manual,第2版,Cold Spring HarborLaboratory:Cold Spring Harbor,NY(1989)(下文称为“Maniatis”);Silhavy,T.J.、Bennan,M.L.和Enquist,L.W.,Experiments with Gene Fusions,Cold Spring Harbor Laboratory:Cold Spring Harbor,NY(1984);以及Ausubel,F.M.等人,Current Protocols in Molecular Biology,GreenePublishing Assoc.and Wiley-Interscience出版,Hoboken,NJ(1987)。
综述:脂肪酸和三酰基甘油的生物合成
通常,含油微生物中的脂质蓄积是响应存在于生长培养基中的总的碳氮比而触发。该过程(导致含油微生物中游离棕榈酸(16:0)的从头合成)在美国专利7,238,482中有详细描述。棕榈酸是长链饱和和不饱和脂肪酸衍生物的前体,这些脂肪酸衍生物通过延伸酶和去饱和酶的作用形成(图1)。
TAG(脂肪酸的主要贮存单位)通过涉及如下反应的一系列反应形成:(1)一分子的酰基辅酶A和甘油-3-磷酸通过酰基转移酶酯化而生成溶血磷脂酸;(2)第二分子的酰基辅酶A通过酰基转移酶酯化而生成1,2-二酰基甘油磷酸(通常称为磷脂酸);3)通过磷脂酸磷酸酶移除磷酸以生成1,2-二酰基甘油[“DAG”];和4)通过酰基转移酶的作用加入第三脂肪酸以形成TAG。
广谱的脂肪酸可被引入TAG中,包括饱和的和不饱和的脂肪酸以及短链和长链的脂肪酸。一些能通过酰基转移酶被引入TAG的脂肪酸的非限制性实例包括:癸酸(10:0)、月桂酸(12:0)、肉豆蔻酸(14:0)、棕榈酸(16:0)、棕榈油酸(16:1)、硬脂酸(18:0)、油酸(18:1)、异油酸(18:1)、LA(18:2)、桐油酸(18:3)、GLA(18:3)、ALA(18:3)、STA(18:4)、花生酸(20:0)、EDA(20:2)、DGLA(20:3)、ETrA(20:3)、ARA(20:4)、ETA(20:4)、EPA(20:5)、二十二烷酸(22:0)、DPA(22:5)、DHA(22:6)、木蜡酸(24:0)、神经酸(24:1)、蜡酸(26:0)和褐煤酸(28:0)脂肪酸。在本文所述的方法和宿主细胞中,最期望将“长链”PUFA引入TAG中,其中长链PUFA包括来源于18:1底物的任何脂肪酸,所述底物具有至少18个碳原子,即,C18或更长。其也包括羟基化脂肪酸、环氧脂肪酸和共轭亚油酸。
虽然大多数PUFA以中性脂类的形式掺入TAG并贮存于脂质体中,但是重要的是认识到在含油生物中测量总PUFA应包括那些位于磷脂酰胆碱级分、磷脂酰乙醇胺级分、和三酰基甘油(也称为TAG或油)级分中的PUFA。
ω脂肪酸的生物合成
其中油酸转化成ω-3/ω-6脂肪酸的代谢过程包括通过添加碳原子使碳链延长和通过加入双键使分子去饱和。这需要存在于内质网膜内的一系列专门的去饱和酶和延伸酶。然而,如在图1中看到的和如下所述的,通常存在多个用于产生特定ω-3/ω-6脂肪酸的替代途径。
具体地讲,图1描述了下文所述的途径。所有途径都需要最初通过Δ12去饱和酶将油酸转化成亚油酸LA(第一个ω-6脂肪酸)。然后,利用“Δ6去饱和酶/Δ6延伸酶途径”并将LA用作底物来如下形成长链ω-6脂肪酸:1)通过Δ6去饱和酶将LA转化成γ-亚麻酸[“GLA”];2)通过C18/20延伸酶将GLA转化成二高-γ-亚麻酸[“DGLA”];3)通过Δ5去饱和酶将DGLA转化成花生四烯酸[“ARA”];4)通过C20/22延伸酶将ARA转化成二十二碳四烯酸[“DTA”];以及,5)通过Δ4去饱和酶将DTA转化成二十二碳五烯酸[“DPAn-6”]。
作为另外一种选择,“Δ6去饱和酶/Δ6延伸酶途径”能够如下使用α-亚麻酸[“ALA”]作底物生产长链ω-3脂肪酸:(1)通过Δ15去饱和酶将LA转化成ALA(第一个ω-3脂肪酸);2)通过Δ6去饱和酶将ALA转化成十八碳四烯酸[“STA”];3)通过C18/20延伸酶将STA转化成二十碳四烯酸[“ETA”];4)通过Δ5去饱和酶将ETA转化成二十碳五烯酸[“EPA”];5)通过C20/22延伸酶将EPA转化成二十二碳五烯酸[“DPA”];以及,6)通过Δ4去饱和酶将DPA转化成二十二碳六烯酸[“DHA”]。任选地,ω-6脂肪酸可转化成ω-3脂肪酸;例如,ETA和EPA通过Δ17去饱和酶活性分别从DGLA和ARA生成。
ω-3/ω-6脂肪酸生物合成的替代途径利用Δ9延伸酶和Δ8去饱和酶,即,“Δ9延伸酶/Δ8去饱和酶途径”。更具体地讲,LA和ALA可通过Δ9延伸酶分别转化成EDA和ETrA;Δ8去饱和酶然后将EDA转化成DGLA和/或将ETrA转化成ETA。随后如上所述形成下游PUFA。
本文宿主生物必须天然或经由基因工程技术具有生产PUFA的能力。虽然多种微生物能够在常见的细胞代谢过程中合成PUFA(包括ω-3/ω-6脂肪酸),并且它们中的一些能进行商业培养,但是这些生物很少生产或不生产具有期望的油含量的油和用于药物、膳食替代品、医疗食品、营养补充剂、其它食物产品、工业油脂化学制品或其它终端产品中的组合物。因此,越来越着重于工程化微生物以生产“设计”的脂质和油的能力,其中脂肪酸含量和组成通过基因工程进行详细指定。在此基础上,期望宿主可能包含编码功能性PUFA生物合成途径的异源基因,但这是非必需的。
如果宿主生物不天然生产期望的PUFA或具有期望的脂质分布,本领域的技术人员熟悉将编码用于PUFA生物合成的合适酶的一个或多个表达盒导入所选的宿主生物所必需的构思和技术。文献为技术人员提供了多个教导,用于将此类表达盒导入多个宿主生物。将使用宿主生物解脂耶氏酵母的一些文献提供如下:美国专利7,238,482、国际申请公布WO 2006/033723、专利申请公布US-2006-0094092、专利申请公布US-2006-0115881-A1和专利申请公布US-2006-0110806-A1。上述列举并非穷尽性的因此不应理解为限制性的。
简而言之,多种ω-3/ω-6PUFA产品能在它们转化成TAG之前被生产出来,这取决于脂肪酸底物和存在于或被转化到宿主细胞中的ω-3/ω-6脂肪酸生物合成途径的特定基因。同样地,期望脂肪酸产品的生产可能直接发生或间接发生。当不经过任何中间步骤或中间途径将脂肪酸底物直接转化成期望的脂肪酸产品时,直接生产发生。当编码PUFA生物合成途径的多个基因可被组合使用,使得一系列反应发生以生产期望的PUFA时,间接生产发生。具体地讲,可期望用包含Δ12去饱和酶、Δ6去饱和酶、C18/20延伸酶、Δ5去饱和酶和Δ17去饱和酶的表达盒转化含油酵母以过剩生产EPA。参见美国专利7,238,482和国际申请公布WO 2006/052870。为本领域的技术人员熟知的是可将编码PUFA生物合成途径酶的基因的多种其它组合用于在含油生物中表达(参见图1)。包含于具体表达盒中的具体基因取决于宿主生物、它的PFUA分布和/或去饱和酶/延伸酶分布、底物的可利用性和期望的终产物。
许多候选基因具有所需去饱和酶和/或延伸酶活性,它们能根据公开可获得的文献进行鉴定,如GenBank、专利文献、和对具有生产PUFA能力的生物进行的实验分析。可用的去饱和酶和延伸酶序列可源自任何来源,例如,分离自天然来源如细菌、藻类、真菌、卵菌、酵母、植物、动物等、经由半合成途径产生或从头合成。在鉴定这些候选基因后,选择具有去饱和酶或延伸酶活性的特定多肽时的考虑事项包括:1)多肽的底物特异性;2)多肽或其组件是否为限速酶;3)去饱和酶或延伸酶是否是合成期望的PUFA所必需的;4)多肽所需的辅因子;和/或5)多肽在其产生后是否被修饰,例如,通过激酶或异戊烯转移酶修饰。
表达的多肽优选具有与它在宿主细胞中的位置的生化环境相容的参数。参见美国专利7,238,482。考虑每种特定的去饱和酶和/或延伸酶的转化效率也可以是有用的。更具体地讲,由于每种酶极少能以100%的效率将底物转化成产物,宿主细胞内所产生的未纯化的油的最终脂质分布通常是由期望的ω-3/ω-6脂肪酸及多种上游PUFA中间产物组成的多种PUFA的混合物。因此,当使期望的脂肪酸的生物合成最优化时,每种酶的转换效率也是要考虑的变量。
过氧化物酶体生物合成和Pex基因
如前文所述,过氧化物酶体是普遍存在于所有真核细胞中的细胞器。它们的主要作用是降解细胞定位细胞器中的各种物质,例如毒性化合物、脂肪酸等。例如,在过氧化物酶体中能够发生β-氧化过程,其中脂肪酸分子发生降解,最后产生乙酰-CoA的游离分子(它被回输到细胞溶质中)。虽然线粒体中的β-氧化过程导致ATP合成,在过氧化物酶体中的β-氧化引起高电位电子转移到O2中并导致形成H2O2,H2O2随后被过氧化物酶体过氧化氢酶转化成水和O2。特长链(如C18至C22)脂肪酸在过氧化物酶体中经过初始β-氧化,然后经过线粒体β-氧化。
已知负责通过ATP水解使蛋白穿过过氧化物酶体膜的蛋白是过氧化物酶体生物合成因子蛋白,或“peroxins”。这些过氧化物酶体生物合成因子蛋白也包括那些涉及过氧化物酶体生物合成/装配的蛋白。过氧化物酶体生物合成因子蛋白的基因首字母缩写是Pex;并且命名体系描述于Distel等人,J.Cell Biol.,135:1-3(1996)。迄今已经在多种真核生物中鉴定了至少32个不同的Pex基因。然而在真菌中,Kiel等人近来的文献(Traffic,7:1291-1303(2006))提出过氧化物酶体生物合成/基质蛋白输入所需的最小数目是17,因此仅需要Pex1p、Pex2p、Pex3p、Pex4p、Pex5p、Pex6p、Pex7p、Pex8p、Pex10p、Pex12p、Pex13p、Pex14p、Pex17p、Pex19p、Pex20p、Pex22p和Pe26p。这些蛋白以协同方式作用以增生(复制)过氧化物酶体并经由转移进入过氧化物酶体以输入蛋白(参见Waterham,H.R.和J.M.Cregg,BioEssays.19(1):57-66(1996))。
最初从对突变体的分析中分离了多种Pex基因,所述突变体显示具有异常的过氧化物酶体功能或结构。然而,由于全部基因组序列是可获得的,所以通过基于同源性的计算机序列搜索来鉴定Pex基因变得更加容易。Kiel等人(Traffic,7:1291-1303(2006))引用过氧化物酶体生物合成机制的强保守性,尽管它偶尔会出现低序列相似性。更具体地讲,在酵母和丝状真菌中,它们的数据指示迄今鉴定的几乎所有Pex蛋白都是保守的。下表4显示Kiel等人鉴定的过氧化物酶体生物合成因子蛋白(同上),它们来源于啤酒糖酵母(Saccharomyces cerevisiae)、光滑假丝酵母(Candida glabrata)、棉阿舒囊霉(Ashbya gossypii)、乳酸克鲁维酵母(Kluyveromyces lactis)、白假丝酵母(Candida albicans)、汉逊德巴利酵母(Debaryomyces hansenii)、巴斯德毕赤酵母(Pichiapastoris)、多形汉逊酵母(Hansenula polymorpha)、解脂耶氏酵母(Yarrowia lipolytica)、烟曲霉(Aspergillus fumigatus)、构巢曲霉(Aspergillus nidulans)、产黄青霉(Penicillium chrysogenum)、稻瘟病菌(Magnaporthe grisea)、粗糙脉孢菌(Neurospora crassa)、玉蜀黍赤霉(Gibberella zeae)、玉米黑粉菌(Ustilago maydis)、新生隐球菌新生变种(Cryptococcus neoformans var.neoformans)和粟酒裂殖酵母(Schizosaccharomyces pombe)。
导致过氧化物酶体生物合成减弱的Pex基因突变在酵母、人类和植物中引起严重的代谢和发育紊乱(Eckert,J.H.和R.Erdmann,Rev.Physiol.Biochem Pharmacol.,147:75-121(2003);Weller,S.等人,AnnualReview of Genomics and Human Genetics,4:165-211(2003);Wanders,R.J.,Am.J.Med.Genet.,126A:355-375(2004);Mano,S.和M.Nishimura,Vitam Horm.,72:111-154(2005);Wanders,J.A.,和H.R.Waterham,Annu.Rev.Biochem.,75:295-332(2006);Fujiki,Yukio.Peroxisome Biogenesis Disorders.In,Encyclopedia of Life Sciences.JohnWiley&Sons,2006)。例如X-连锁肾上腺脑白质营养不良[“X-ALD”]和Zellweger综合征、以及若干个严重程度较低的疾病,它们可能由单个酶缺乏和/或过氧化物酶体生物合成失调引起。
在酵母、解脂耶氏酵母中,已经分离并鉴定了多种不同的Pex基因,如上表4中鉴定的那些Pex基因。更具体地讲,Bascom,R.A.等人(Mol.Biol.Cell,14:939-957(2003))描述了YlPex3p;Szilard,R.K.等人(J.Cell Biol.,131:1453-1469(1995))描述了YlPex5p;Nuttley,W.M.等人(J.Biol.Chem.,269:556-566(1994))描述了YlPex6p;Elizen G.A.,等人(J.Biol.Chem.,270:1429-1436(1995))描述了YlPex9p;ElizenG.A.,等人(J.Cell Biol.,137:1265-1278(1997))和Titorenko,V.I.等人(Mol.Cell Biol.,17:5210-5226(1997))描述了YlPex16p;Lambkin,G.R.和R.A.Rachubinski(Mol.Biol.Cell.,12(11):3353-3364(2001))描述了YlPex19;以及Titorenko V.I.,等人(J.Cell Biol.,142:403-420(1998))和Smith J.J.以及R.A.Rachubinski(J.Cell Biol.,276:1618-1625(2001))描述了YlPex20p。
本文最初关注的基因是YlPex10p(GenBank保藏号CAG81606、AB036770和AJ012084)。它由Sumita等人证明(FEMS Microbiol.Lett.,214:31-38(2002)):1)YlPex10p用作过氧化物酶体的组分;并且2)YlPex10p的C3HC4环状锌指基序是蛋白功能必需的,如经由制造C341S、C346S和H343W点突变并随后进行生长分析来所测定的。
已经在其它生物中完成了对Pex10的C3HC4环状锌指基序的研究并具有类似结果。例如,发现改变在巴斯德毕赤酵
母(Pichia pastoris)的Pex10p C3HC4基序中的保守残基的点突变会使蛋白无功能(Kalish,J.E.等人,Mol.Cell Biol.,15:6406-6419(1995))。同样地,在对纤维原细胞系进行功能互补分析后,Warren D.S.等人(Hum.Mutat.,15(6):509-521(2000))得出结论:C3HC4基序对Pex10p功能是至关重要的。若干个研究结果显示Pex10p在拟南芥属中丧失功能会引起胚芽在心期死亡(Hu,J.等人,Science,297:405-409(2002);Schmumann,U.等人,Proc.Natl.Acad.Sci.U.S.A.,100:9626-9631(2003);Sparkes,I.A.等人,Plant Physiol.,133:1809-1819(2003);Fan,J.等人,Plant Physiol.,139:231-239(2005))。在后继研究中,Schemann,U.等人(Proc.Natl.Acad.Sci.U.S.A,104:1069-1074(2007))研究了Pex10p在非致命的功能部分丧失拟南芥属突变体中的功能。具体地讲,在拟南芥属野生型植株中制备具有功能异常的C3HC4基序的表达Pex10p的四个T-DNA插入序列。突变植物显示具有减少的叶片过氧化物酶体,并且作者提出Pex10p中环指基序的失活消除了连结过氧化物酶体与叶绿体以及在过氧化物酶体和叶绿体之间转移代谢物所需的蛋白相互作用。
虽然研究尚未鉴定其它Pex蛋白中必需的结构域,但是研究认识到了多种Pex突变体的效应,从而认识到各种不同的生物进化出的用于装配、保持、繁殖和遗传过氧化物酶体(一种已知在脂质代谢中起作用的细胞器)的方法和分子机制。例如,Bascom,R.A.等人已经进行了解脂耶氏酵母Pex3p的基因敲除和超表达(Mol.Biol.Cell,14:939-957(2003))。基因敲除细胞不包含野生型过氧化物酶体,相反具有多个小囊泡;超表达导致细胞具有较少、较大和聚集的过氧化物酶体。他们假定Pex3p涉及通过固定过氧化物酶体生物合成组件启动过氧化物酶体装配,即,过氧化物酶体靶信号(PTS)1和2输入机制。同样地,Guo,T.等人,敲除解脂耶氏酵母Pex16p导致过多地增殖不成熟的过氧化物酶体囊泡并显著地降低它们转化成成熟过氧化物酶体的速率和效率(J.Cell Biol.,162:1255-1266(2003)),然而超表达导致产生很少但是增大的过氧化物酶体(Eitzen等人,J.Cell Biol.,137:1265-1278(1997))。Guo等人得出结论:Pex16p负调节早期的过氧化物酶体前体分裂所需的膜分裂事件。
尽管已有上文概述的研究进展,关于不同Pex蛋白的作用、它们彼此相互作用和过氧化物酶体中的生物合成/装配机制的细节仍未被阐明。同样地,本专利申请所述的数据是在对其它植物或动物的研究中尚未验证的新观察数据,其中在YlPex10p的C3HC4基序中的突变或YlPex3p、YlPex10p或YlPex16p的敲除导致产生解脂耶氏酵母突变体,该突变体具有提高的PUFA掺入能力,尤其是掺入长链PUFA如C20至C22分子到细胞的总脂质级分和油级分中的能力。
已经提出过氧化物酶体是脂质的分解代谢和合成代谢所必需的(LinY.等人,Plant Physiology,135:814-827(2004));然而,该假说是基于对Pex16p的同源物的研究。更具体地讲,Lin,Y.等人(同上)报道拟南芥属Shrunken种子1(sse1)突变体具有异常的过氧化物酶体生物合成和脂肪酸合成,这基于sse1种子中的油与野生型相比减少了10-16%。Binns,D.等人(J.Cell Biol.,173(5):719-731(2006))检查了啤酒糖酵母中的过氧化物酶体-脂质体相互作用并测定两个细胞器之间的广泛物理接触促进脂质体内的结合脂解及过氧化物酶体脂肪酸氧化。更具体地讲,检查不同Pex基因敲除菌株的游离脂肪酸与TAG的比率,发现比率相对于野生型提高。很明显,为了了解过氧化物酶体,尤其是Pex3p、Pex10p和Pex16p蛋白的代谢作用,进一步的研究将是必须的。
不希望受任何具体说明或理论的限制,假定在含油酵母细胞中的Pex基因破坏或敲除影响过氧化物酶体中天然存在的脂质分解代谢和合成代谢,或受过氧化物酶体的影响。与尚未破坏其天然过氧化物酶体生物合成因子蛋白的含油酵母相比,破坏或敲除导致总脂质级分和油级分中的PUFA量增加,所述PUFA量以总脂肪酸百分比形式表示。在一些情况下,也观察到在总脂质级分和油级分中以干细胞重量百分比形式表示的PUFA量增加,和/或以干细胞重量百分比形式表示的总脂质含量增加。假定这种通用机制可应用于所有真核生物中,如藻类、真菌、卵菌、酵母、类眼虫、原生藻菌、植物和一些哺乳动物体系,因为所有这些生物都包含过氧化物酶体。
鉴定并分离Pex同源物
当在优选宿主生物中的特定Pex基因或蛋白的序列不是已知的时,本领域技术人员认识到在调控基因编码的蛋白活性之前鉴定和分离这些基因或它们的部分基因将是最期望的,所述调控继而促进掺入真核生物总脂质级分和油级分中的PUFA量的变化,所述量以总脂肪酸百分比形式表示。对优选的宿主Pex基因序列的认识将有利于通过定向破坏来破坏同源染色体基因。
表4中的Pex序列或它们的序列部分可用于在相同或其它藻类、真菌、卵菌、类眼虫、原生藻菌、酵母或植物物种中使用序列分析软件搜索Pex同源物。通常,这种计算机软件通过将同源程度赋予多种置换、缺失和其它修饰来匹配相似的序列。使用软件算法,如具有低复杂度滤波器和以下参数的BLASTP比对方法:Expect value=10,matrix=Blosum 62(Altschul,等人,Nucleic Acids Res.25:3389-3402(1997)),该方法是用于将表4中的任何Pex蛋白对核酸或蛋白序列数据库进行比较并从而鉴定在优选宿主生物中的相似已知序列的熟知方法。
使用算法软件搜索已知序列数据库尤其适于分离对公开可获得的Pex序列具有相对低的同一性百分比的同源物,如那些表4中描述的序列。可预测的一点是:分离与公开可获得的Pex序列具有至少约70%-85%同一性的Pex同源物将是相对更容易的。此外,那些至少约85%-90%相同的序列将尤其适于分离,那些至少约90%-95%相同的序列将最容易分离。
通过使用Pex酶独有的基序已经分离了一些Pex同源物。例如,为人熟知的是Pex2p、Pex10p和Pex12p在它们的羧基末端都有一个富含半胱氨酸的基序,已知该基序是C3HC4环状锌指基序(图2A)。该“保守结构域”区域对应于一组在特定位点高度保守的氨基酸,并且可能提供对蛋白结构、稳定性或活性来说必需的Pex蛋白区域。基序通过它们在蛋白同源物家族的比对序列中的高度保守性进行鉴定。作为独有的“标记”,它们能决定具有新测定序列的蛋白是否属于以前鉴定过的蛋白家族。这些基序可用作诊断工具以分别迅速鉴定新的Pex2、Pex10和/或Pex12基因。
作为另外一种选择,公开可获得的Pex序列或它们的基序可以是用于鉴定同源物的杂交试剂。核酸杂交试验的基本组成包括探针、怀疑含有目的基因或基因片段的样本及特定的杂交方法。探针通常是与待检测核酸序列互补的单链核酸序列。探针与待检测的核酸序列是可杂交的。尽管探针的长度可在5个碱基到数万个碱基之间变化,但通常约15个碱基到约30个碱基的探针长度是合适的。只需要探针分子的部分与待检测的核酸序列互补。另外,探针和靶序列之间不需要完全互补。杂交确实可以在并不完全互补的分子之间发生,结果是杂交区内的一定比率的碱基未与适当的互补碱基配对。
杂交方法是已知的。通常探针和样品必须在允许核酸杂交的条件下混合。这涉及在适当浓度和温度条件下在存在无机或有机盐时使探针和样品接触。探针和样品核酸必须接触足够长的时间,使探针和样品核酸之间的任何可能的杂交均会发生。混合物中的探针或靶标的浓度决定杂交发生所需的时间。探针或靶标的浓度越高,所需的杂交孵育时间就越短。任选地,可以加入离液剂(如氯化胍、硫氰酸胍、硫氰酸钠、四氯乙酸锂、高氯酸钠、四氯乙酸铷、碘化钾或三氟乙酸铯)。如果需要,能将甲酰胺加入杂交混合物,通常为30-50%(v/v)[“按体积计”]。
可以采用多种杂交溶液。通常,这些杂交溶液包含约20%至60%体积,优选30%体积的极性有机溶剂。通常的杂交溶液采用约30-50%v/v甲酰胺、约0.15至1M氯化钠、约0.05至0.1M缓冲液(如柠檬酸钠、Tris-HCl、PIPES或HEPES(pH范围约6-9))、约0.05至0.2%的去污剂(如十二烷基硫酸钠)或0.5-20mM的EDTA、FICOLL(Pharmacia Inc.)(约300-500千道尔顿)、聚乙烯吡咯烷酮(约250-500千道尔顿)和血清白蛋白。一般的杂交溶液还包含约0.1至5mg/mL未经标记的载体核酸、片段化的核酸DNA(如小牛胸腺或鲑精DNA或酵母RNA),以及任选约0.5%至2%wt/vol[“重量体积比”]的甘氨酸。可以包含其它添加剂,例如包括极性水溶性或可膨胀试剂(如聚乙二醇)、阴离子聚合物(如聚丙烯酸酯或聚甲基丙烯酸酯)和阴离子糖类聚合物(如硫酸葡聚糖)在内的体积排阻剂。
核酸杂交可适用于多种测定形式。最合适的形式之一是夹心测定形式。夹心测定尤其适用于在非变性条件下杂交。夹心型测定的主要成分是固体支持体。固体支持体具有吸附或共价连接至其上的固定核酸探针,该探针未经标记并且与序列的一部分互补。
任何Pex核酸片段或任何鉴定的同源物可用于从相同或其它藻类、真菌、卵菌、类眼虫、原生藻菌、酵母或植物物种中分离编码同源蛋白的基因。使用序列依赖性规程分离同源基因是本领域熟知的。序列依赖性规程的实例包括但不限于:1)核酸杂交方法;2)DNA和RNA扩增方法,例如核酸扩增技术的多种应用,如聚合酶链反应[“PCR”](美国专利4,683,202);连接酶链反应[“LCR”](Tabor,S.等人,Proc.Natl.Acad.Sci.U.S.A.,82:1074(1985));或链置换扩增[“SDA”],Walker等人,Proc.Natl.Acad.Sci.U.S.A.,89:392(1992));和3.)文库构建和互补筛选方法。
例如,编码蛋白或多肽的基因类似于公开可获得的Pex基因或它们的基序,可使用所有或部分那些公开可获得的核酸片段作为DNA杂交探针,以使用熟知的方法筛选来自任何期望生物的文库而直接分离该基因。基于公开可获得核酸序列的特异性,寡核苷酸探针可通过本领域已知的方法(Maniatis,同上)设计并合成。而且,整个序列可直接用于通过熟练技术人员已知的方法,例如随机引物DNA标记、切口平移或末端标记技术,来合成DNA探针,或使用可获得的体外转录体系来合成RNA探针。此外,能设计特异性引物并用于扩增部分或全长公开可获得的序列或它们的基序。所得的扩增产物可在扩增反应过程中直接标记或在扩增反应后标记,并用作探针以在合适的严格条件下分离全长的DNA片段。
通常,在PCR类型的扩增技术中,引物具有不同的序列而且彼此之间不互补。取决于期望的检测条件,应当设计引物序列以提供既有效又可靠的靶核酸的复制。PCR引物设计方法是常见且熟知的(Thein和Wallace,“The use of oligonucleotides as specific hybridization probes inthe Diagnosis of Genetic Disorders”,Human Genetic Diseases:A PracticalApproach,K.E.Davis编辑,(1986)第33-50页,IRL:Herndon,VA;Rychlik,W.,In Methods in Molecular Biology,White,B.A.Ed.,(1993)第15卷,第31-39页,PCR Protocols:Current Methods and Applications.Humania:Totowa,NJ)。
通常,可以将可获得的Pex序列的两个短片段在PCR规程中用于从DNA或RNA扩增编码同源基因的更长的核酸片段。也可以对克隆的核酸片段文库进行PCR,其中一个引物的序列来源于可获得的核酸片段或它们的基序。其它引物序列利用mRNA前体编码基因3′末端存在的多腺苷酸片。
作为另一种选择,第二个引物序列可以基于来源于克隆载体的序列。例如,技术人员可以按照RACE规程(Frohman等人,Proc.Natl.Acad.Sci.U.S.A.,85:8998(1988)),通过用PCR扩增在转录物内单个位点与3′或5′端之间的区域的拷贝来产生cDNA。以3′和5′方向取向的引物可以从能利用可获得的序列设计。使用可商业获得的3′RACE或5′RACE体系(例如BRL,Gaithersburg,MD),可以分离特异性的3′或5′cDNA片段(Ohara等人,Proc.Natl.Acad.Sci.U.S.A.,86:5673(1989);Loh等人,Science,243:217(1989))。
基于所讨论的这些熟知方法中的任何一种,在选择的任何优选真核生物中鉴定和/或分离Pex基因同源物将是可能的。能通过定向破坏PUFA生产宿主生物中的内源基因容易地确认任何推定的Pex基因的活性,因为总脂质级分和油级分的脂质分布相对于它们在缺乏定向Pex基因破坏的生物中的那些脂质分布发生了改变。
经由破坏天然过氧化物酶体生物合成因子蛋白提高总脂质级分和油级
分中的PUFA量
如上所述,本发明的公开内容涉及下述提高含油真核生物中的一种PUFA或PUFA组合的重量百分比的方法,所述方法包括:
a)提供含油真核生物,所述生物在编码过氧化物酶体生物合成因子蛋白的天然基因中包含破坏,这产生PEX破坏生物;和编码功能性PUFA生物合成途径的基因;以及
b)在下述条件下培养(a)的真核生物:当与未破坏其天然过氧化物酶体生物合成因子蛋白的含油真核生物中的那些的重量%相比较时,(a)真核生物的总脂质级分和油级分中的一种PUFA或PUFA组合的重量%相对于总脂肪酸的重量%增加。
以总脂肪酸百分比形式提高的PUFA的量可能是:1)作为功能性PUFA生物合成途径期望终产品形式的PUFA,这与以中间体或副产品形式产生的PUFA相反;2)C20至C22的PUFA;和/或3)总PUFA。
除了相对于总脂肪酸重量%提高一种PUFA或PUFA组合的重量%之外,在一些情况下,还可以提高或降低细胞的总脂质含量(TFA%DCW)。这意味着无论PEX基因破坏是否引起PEX破坏细胞中的总脂质量提高或降低,该破坏总是引起一种PUFA或PUFA组合的重量%提高。
本文提供的另一种方法涉及在编码过氧化物酶体生物合成因子蛋白的天然基因中的破坏,其中当与亲本菌株中的该百分比进行比较时,所述破坏能够导致一种PUFA或PUFA组合相对于干细胞重量的百分比提高,所述亲本菌株的天然Pex蛋白未被破坏或其表达破坏天然Pex蛋白的“后备”拷贝。
在上述方法的优选方面,在编码过氧化物酶体生物合成因子蛋白的天然基因中的破坏导致PUFA的量提高,PUFA是功能性PUFA生物合成途径期望的终产品,而不是PUFA中间体或副产品,PUFA量以干细胞重量百分比相对于亲本菌株的重量百分比表示,所述亲本菌株的天然Pex蛋白未被破坏或其表达破坏天然Pex蛋白的“后备”拷贝。在一些情况下,组合PUFA相对于干细胞重量的百分比的提高是C20至C22PUFA的组合或总PUFA的提高。
上文也描述了通过这些方法产生的生物,所述生物包含至少一个过氧化物酶体生物合成因子蛋白的破坏。也描述了获取自这些生物的脂质和油、获取自脂质和油加工的产品、这些脂质和油用于食品、动物饲料、或工业应用的用途和/或副产品用于食品或动物饲料的用途。
上述方法中优选的真核生物包括藻类、真菌、卵菌、酵母、类眼虫、原生藻菌、植物和一些哺乳动物系统。
用于这些方法中任何一种的过氧化物酶体生物合成因子蛋白可选自:Pex1p、Pex2p、Pex3p、Pex3Bp、Pex4p、Pex5p、Pex5Bp、Pex5Cp、Pex5/20p、Pex6p、Pex7p、Pex8p、Pex10p、Pex12p、Pex13p、Pex14p、Pex15p、Pex16p、Pex17p、Pex14/17p、Pex18p、Pex19p、Pex20p、Pex21p、Pex21B、Pex22p、Pex22p样和Pex26p(以及它们的蛋白同源物)。在本文所述的一些优选方法中,被破坏的过氧化物酶体生物合成因子蛋白选自:Pex2p、Pex3p、Pex10p、Pex12p和/或Pex16p。然而在一些更优选的方法中,被破坏的过氧化物酶体生物合成因子蛋白选自:Pex3p、Pex10p和/或Pex16p。
在编码过氧化物酶体生物合成因子蛋白的天然基因中的破坏可能是在部分基因(例如在蛋白N-末端部分或在蛋白C-末端部分)中进行的插入、缺失、或定向突变。作为另外一种选择,该破坏能导致完全的基因敲除,使得基因从宿主细胞基因组中除去。或者该破坏可能是导致无功能蛋白的定向突变。
破坏方法
本发明包括在天然基因中的破坏,所述基因编码优选宿主细胞内的过氧化物酶体生物合成因子蛋白。虽然本领域的技术人员可使用多种技术获得破坏,但是特定基因的内源活性一般能通过以下技术减少或消除,例如:1)通过插入、取代和/或缺失所有或部分靶基因破坏所述基因;或2)操纵控制所述蛋白表达的调控序列。这些技术在下文中讨论。然而,本领域的技术人员认识到这些技术在现有文献中已有详细描述,并且不受本文所述的方法、宿主细胞、和产品的限制。本领域的技术人员也认识到大多数适用的技术使用任何特定的含油酵母。
经由插入、取代和/或缺失的破坏:就基因破坏而言,将外源DNA片段(通常是一个选择性标记基因)插入结构基因。这打断结构基因的编码序列并引起基因失活。将破坏盒转化到宿主细胞中导致非功能性破坏基因通过同源重组置换功能性天然基因。参见例如:Hamilton等人,J.Bacteriol.,171:4617-4622(1989);Balbas等人,Gene,136:211-213(1993);Gueldener等人,Nucleic Acids Res.,24:2519-2524(1996);和Smith等人,Methods Mol.Cell.Biol.,5:270-277(1996)。本领域的技术人员了解基因定向常规方法的许多改良方法,所述方法容许有阳性选择和阴性选择、生成基因敲除、以及将外来DNA序列插入到哺乳动物系统、植物细胞、丝状真菌、藻类、卵菌、类眼虫、原生藻菌、酵母和/或微生物系统的特定基因组位点中。
相反地,基因破坏的非特异性方法是使用转座元件或转座子。转座子是随机插入DNA但随后能根据序列进行检索以测定插入位点的基因元件。体内和体外转座技术是已知的,并且涉及转座元件与转座酶的组合使用。当转座元件或转座子与核酸片段在存在转座酶的情况下接触时,转座元件随机插入核酸片段中。该技术用于随机诱变和基因分离,因为被破坏的基因可基于转座元件的序列进行鉴定。用于体外转座的试剂盒是可商购获得的并包括:Primer Island Transposition Kit,得自PerkinElmer Applied Biosystems,Branchburg,NJ,基于酵母的Ty1元件;GenomePriming System,得自New England Biolabs,Beverly,MA,基于细菌转座子Tn7;和EZ::TN转座子插入系统,得自Epicentre Technologies,Madison,WI,基于Tn5细菌转座元件。
Pex调控序列的操纵:本领域熟知与编码序列附连的调控序列包括转录和翻译“控制”核苷酸序列,所述序列位于编码序列的上游(5′非编码序列)、内部、或下游(3′非编码序列),并且影响附连的编码序列的转录、RNA加工或稳定性、或翻译。因此,操纵Pex基因调控序列可以指操纵特定Pex基因的启动子、静默子、5′非转录前导序列(介于转录起始位点好翻译启动密码子之间)、内含子、增强子、启动控制区、多腺苷酸化识别序列、RNA加工位点、效应子结合位点和茎-环结构。然而在所有情况下,操纵的结果是下调Pex基因的表达,这促使与天然过氧化物酶体生物合成因子蛋白未被破坏的含油酵母相比,总脂质级分和油级分中以总脂肪酸百分比形式表示的PUFA量增加。
例如,能缺失或破坏Pex10基因的启动子。作为另外一种选择,驱动Pex10基因表达的天然启动子可以用与天然启动子相比启动子活性减弱的异源启动子取代。用于操纵调控序列的方法是为人们熟知的。
技术人员能够使用这些技术和其它熟知技术破坏在本文所述的优选宿主细胞中的天然过氧化物酶体生物合成因子蛋白,所述优选宿主例如哺乳动物系统、植物细胞、丝状真菌、藻类、卵菌、类眼虫、原生藻菌和酵母。
本领域的技术人员能够识别破坏天然Pex基因的最佳方法,以使得与天然过氧化物酶体生物合成因子蛋白未被破坏的真核生物相比,在总脂质级分和油级分中积聚的PUFA量增加,所述PUFA量以总脂肪酸的百分比形式表示。
ω-3和/或ω-6脂肪酸生物合成的代谢工程
本文所述方法除了用于破坏天然过氧化物酶体生物合成因子蛋白之外,还可使用操纵ω-3和/或ω-6脂肪酸生物合成。这种操纵可能需要直接在PUFA生物合成途径中进行代谢工程改造或需要另外对为PUFA生物合成途径贡献碳的途径进行操纵。可用于上调期望的生化途径和下调不期望的生化途径的技术是本领域熟知的。例如,与ω-3和/或ω-6脂肪酸生物合成途径竞争能量或碳的生化途径或干扰特定PUFA终产物产生的天然PUFA生物合成途径中的酶可以通过基因破坏来去除或通过其它手段(如反义mRNA和锌指靶向技术(zinc-finger targetingtechnologies))来下调。
以下讨论改变PUFA生物合成途径从而分别提高GLA、ARA、EPA或DHA含量的方法,以及在TAG生物合成途径和TAG降解途径中的所期望的操纵:分别是国际申请公布WO 2006/033723,国际申请公布WO 2006/055322[美国专利申请公布2006-0094092-A1],国际申请公布WO 2006/052870[美国专利申请公布2006-0115881-A1]和国际申请公布WO 2006/052871[美国专利申请公布2006-0110806-A1]。
表达系统、表达盒、载体和宿主细胞转化
制备重组构建体并将其导入到优选真核宿主中以破坏天然过氧化物酶体生物合成因子蛋白和/或导入编码PUFA生物合成途径的基因可能是必需的,所述真核宿主例如哺乳动物系统、植物细胞、丝状真菌、藻类、卵菌、类眼虫、原生藻菌和酵母。本领域的技术人员了解的标准来源材料如下所述:1)构建、操纵和分离大分子(例如DNA分子、质粒等)的特定条件和程序;2)生成重组DNA片段和重组表达构建体;和3)克隆的筛选和分离。参见Sambrook等人,Molecular Cloning:A Laboratory Manual,第2版,Cold Spring Harbor Laboratory:Cold SpringHarbor,NY(1989);Maliga等人,Methods in Plant Molecular Biology,Cold Spring Harbor,NY(1995);Birren等人,Genome Analysis:DetectingGenes,v.1,Cold Spring Harbor,NY(1998);Birren等人,Genome Analysis:Analyzing DNA,v.2,Cold Spring Harbor:NY(1998);Plant Molecular Biology:A Laboratory Manual,Clark,ed.Springer:NY(1997)。
一般来讲,存在于构建体中的序列的具体选择取决于期望的表达产物、宿主细胞的性质以及提出的相对于未转化细胞分离转化细胞的手段。技术人员熟知基因元件必须存在于质粒载体上以成功地转化、选择并增殖包含嵌合基因的宿主细胞。然而,通常载体或盒含有引导相关基因的转录和翻译的序列、选择标记和允许自主复制或染色体整合的序列。合适的载体包括控制转录起始的基因5′区(即启动子)和控制转录终止的DNA片段3′区(即终止子)。最优选两个控制区都来源于来自转化宿主细胞的基因。
用于驱动在期望的宿主细胞中的异源基因或部分异源基因表达的启动控制区或启动子有多种,并且为人们所熟知。这些控制区可包含启动子、增强子、静默子、内含子序列、3′UTR和/或5′UTR区域、以及蛋白和/或RNA稳定元件。此类元件的长度和特异性可以不同。实际上能够引导这些基因在所选宿主细胞中的表达的任何启动子(即,天然的、合成的、或嵌合的启动子)都是适用的。在宿主细胞中的表达能够以诱导型或组成型的方式发生。诱导型表达通过诱导可操作地连接至受关注的Pex基因的可调控启动子的活性发生。组成型表达通过利用可操作地连接至受关注基因的组成型启动子发生。
当宿主细胞是例如酵母时,酵母细胞中的功能性转录和翻译区域尤其从宿主菌种中提供。用于在解脂耶氏酵母中使用的优选转录起始调节区参见国际申请公布WO 2006/052870。可以使用许多调控序列的任意一种,这取决于是期望组成型转录还是诱导型转录、启动子在表达所关注的ORF中的效率、构建的容易性等。
必须在重组构建体中提供编码转录终止信号的3′非编码序列,即,“终止区”,该序列可以来自从中获取初始区的基因或不同基因的3′区域。大量的终止区是已知的并且当用于与它们源自的属和物种相同和不同的属和物种两者时,其在多种宿主中发挥的功能令人满意。选择终止区更多的是从方便的角度出发而不是为了任何特性。终止区也可以源于优选宿主的多种天然基因。
尤其有用的酵母终止区是那些来源于酵母基因,尤其是糖酵母属、裂殖糖酵母属、假丝酵母属、耶氏酵母属或克鲁维酵母属的终止区。编码γ-干扰素和α-2干扰素的哺乳动物基因的3′区域也已知能在酵母中起作用。3′-区也可以是合成的,因为本领域的技术人员可利用可获得的信息来设计和合成用作转录终止子的3′-区序列。终止区可以是非必需的,但是是高度优选的。
载体除了上述调控元件之外还可包含选择性的和/或可评分的标记。优选地,标记基因是抗生素抗性基因,使得用抗生素处理细胞引起未转化细胞的生长抑制或死亡,但不抑制转化细胞的生长。为了选择酵母转化体,可使用在酵母中发挥功能的任何标记,使之具有对卡那霉素、潮霉素、和氨基配醣物G418的抗性并能够在缺乏尿嘧啶、赖氨酸、组氨酸、或亮氨酸的培养基上生长的标记是尤其有用的。
仅仅将基因插入克隆载体不确保它能以期望的速率、浓度、量等表达。根据对高表达率的需要,通过操作许多控制转录、RNA稳定性、翻译、蛋白质稳定性和位置、氧限制和从宿主细胞分泌的不同遗传元件,已经建立了许多专用的表达载体。一些操纵特征包括:相关转录启动子和终止子序列的性质,所克隆基因的拷贝数目以及该基因是质粒携带的还是整合进了宿主细胞的基因组中,所合成的外来蛋白质的最终细胞定位,蛋白质在宿主生物内的翻译和正确折叠的效率,所克隆基因的mRNA和蛋白质在宿主细胞内的固有稳定性和所克隆基因中的密码子使用,以使得其频率与宿主细胞的优选密码子使用频率接近。这些特征的每一种可用于本文所述的方法和宿主细胞中,以进一步优化PUFA生物合成途径基因的表达并减少天然Pex基因的表达。
在制备适于破坏或敲除天然过氧化物酶体生物合成因子蛋白和/或表达编码PUFA生物合成途径活性的基因的重组构建体(例如包含具有启动子、ORF和终止子的嵌合基因的构建体)后,将其置于能够在宿主细胞中自主复制的质粒载体中,或直接整合进宿主细胞基因组中。表达盒的整合可以在宿主基因组中随机地发生或者可以通过使用下述构建体来靶向,所述构建体含有足以靶向宿主基因座中的重组的与宿主基因组同源的区。如果构建体靶向内源性基因座,则转录和翻译调控区的全部或某些可以由内源性基因座提供。
当两个或更多个基因从独立的复制载体表达时,每个载体可具有不同的选择手段并且应当缺乏与其它构建体的同源性以便维持稳定表达和防止元件在构建体之间重配(reassortment)。可用实验方法确定对调控区、选择手段和所引入的构建体的增殖方法的正确选择,以便所有导入的基因均以需要的水平表达从而提供期望的产品的合成。
包含目的基因的构建体可以通过任何标准技术导入到宿主细胞中。这些技术包括转化(如醋酸锂转化[Methods in Enzymology,194:186-187(1991)])、原生质体融合、基因枪轰击(bolistic impact)、电穿孔、显微注射、真空过滤或将所关注的基因引入宿主细胞中的任何其它方法。
为方便起见,已经通过任何方法被操纵以摄取DNA序列(如表达盒)的宿主细胞在本文中将称为“转化的”或“重组的”。转化的宿主将具有至少一个表达构建体的拷贝,而且可以具有两个或更多个拷贝,这取决于该基因是整合进基因组内、被扩增还是存在于具有多拷贝数的染色体外元件上。
转化宿主细胞能通过选择导入构建体上包含的标记进行鉴定。作为另外一种选择,分离的标记构建体可用期望的构建体进行共转化,该方法与多种将多个DNA分子导入宿主细胞的转化技术一样。通常转化宿主通过它们在选择培养基上生长的能力进行选择。选择培养基可掺入抗生素或缺乏未转化宿主必需的生长因子,如营养物质或生长因子。导入的标记基因可赋予抗生素抗性,或编码必需的生长因子或酶,从而当在转化宿主中表达时允许在选择培养基上生长。转化宿主的选择也能发生在表达标记蛋白能被直接地或间接地检测出来时。标记蛋白可单独表达或融合到另一种蛋白中进行表达。标记蛋白可通过其酶活性(例如β-半乳醣苷酶可将X-gal[“5-溴-4-氯-3-吲哚基-β-D-半乳糖苷”]底物转化成着色产物;荧光素酶能将荧光素转化成发光产物)或其产光或改性特性(例如维多利亚多管水母(Aequorea victoria)的绿荧光蛋白,它当用蓝光照亮时发荧光)进行检测。作为另外一种选择,能使用抗体检测标记蛋白或在例如所关注蛋白上的分子标记。能对表达标记蛋白或标记物的细胞进行选择,例如通过视觉检测或通过例如使用抗体分选或批选荧光活化细胞的技术。
无论选择哪种宿主或表达构建体,必须筛选多个转化体以获取显示期望的表达水平、调节和模式的菌株或植物品系,因为不同的独立转化事件导致不同水平和模式的表达(Jones等人,EMBO J.,4:2411-2418(1985);De Almeida等人,Mol.Gen.Genetics,218:78-86(1989))。这种筛选可以通过DNA印迹的Southern分析(Southern,J.Mol.Biol.,98:503(1975))、mRNA表达的Northern分析(Kroczek,J.Chromatogr.Biomed.Appl.,618(1-2):133-145(1993))、蛋白质表达的Western和/或Elisa分析、PUFA产物的表型分析或GC分析来完成。
优选的真核宿主生物
多种真核生物适合作为本文的宿主,因此产生在天然过氧化物酶体生物合成因子蛋白和编码PUFA生物合成途径的基因中的包含破坏的转化宿主生物,与尚未破坏其天然过氧化物酶体生物合成因子蛋白的真核生物相比,其中转化的真核宿主生物的PUFA含量提高,所述PUFA被掺入到总脂质级分和油级分中,以总脂肪酸百分比形式表示。可获得的宿主可以是不同的哺乳动物体系、植物细胞、真菌、藻类、卵菌、酵母、原生藻菌和/或类眼虫。虽然优选含油生物,但本文也可使用非含油生物,使得当它们的一种天然PEX基因被破坏时,在总脂质级分或油级分中至少一种多不饱和脂肪酸的重量%相对于总脂肪酸的重量%提高,并且可导致PUFA提高1.3倍。此外,PUFA百分比可相对于非含油生物的干细胞重量提高。在另一个实施方案中,能基因修饰非含油生物使之变成含油生物,例如酵母如啤酒糖酵母。
含油生物天然能够合成并积聚油,其中总油含量通常大于细胞干重的约25%。将多种藻类、藓、真菌、酵母、原生藻菌和植物被天然地归类为含油生物。
优选的含油微生物包括那些藻类、原生藻菌和真菌生物,它们天然生产ω-3/ω-6PUFA。例如,ARA、EPA和/或DHA经由小环藻属(Cyclotella sp.)、菱形藻属(Nitzschia sp.)、腐霉属(Pythium)、破囊壶菌属(Thraustochytrium sp.)、裂殖壶菌属(Schizochytrium sp.)和被孢霉属(Mortierella)生产。Mackenzie等人(Appl.Environ.Microbiol.,66:4655(2000))描述了转化高山被孢霉(M.alpina)的方法。类似地,用于转化破囊壶菌目(Thraustochytriales)微生物(例如破囊壶菌属(Thraustochytrium)、裂殖壶菌属(Schizochytrium))的方法在美国专利7,001,772中公开。
更优选含油酵母,包括那些天然生产和那些经遗传工程化以生产ω-3/ω-6PUFA的酵母。通常鉴定为含油酵母的属包括但不限于:耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)和油脂酵母属(Lipomyces)。更具体地讲,示例性的油合成酵母包括:圆红冬孢酵母(Rhodosporidium toruloides)、斯达氏油脂酵母(Lipomyces starkeyii)、产油油脂酵母(L.lipoferus)、拉可夫氏假丝酵母(Candida revkaufi)、铁红假丝酵母(C.pulcherrima)、热带假丝酵母(C.tropicalis)、产朊假丝酵母(C.utilis)、茁芽丝孢酵母(Trichosporon pullans)、皮状丝孢酵母(T.cutaneum)、胶粘红酵母(Rhodotorula glutinus)、禾木科红酵母(R.graminis)和解脂耶氏酵母(Yarrowia lipolytica)(以前归类为解脂假丝酵母(Candidalipolytica))。
最优选的是含油酵母解脂耶氏酵母;而且,在其它实施方案中,最优选的是命名为ATCC#76982、ATCC#20362、ATCC#8862、ATCC#18944和/或LGAM S(7)1的解脂耶氏酵母菌株(Papanikolaou S.和Aggelis G.,Bioresour.Technol.,82(1):43-9(2002))。
涉及转化解脂耶氏酵母的具体教导内容包括美国专利4,880,741和美国专利5,071,764,以及Chen,D.C.等人(Appl.Microbiol Biotechnol..,48(2):232-235(1997)),而合适的选择技术描述于美国专利7,238,482和国际申请公布WO 2005/003310和WO 2006/052870。
在解脂耶氏酵母中表达基因的优选方法是通过将线性DNA整合进宿主基因组中。当期望基因高水平表达时,整合进基因组中的多个位置可能是尤其有用的,例如在Ura3基因座(GenBank保藏号AJ306421)、Leu2基因座(GenBank保藏号AF260230)、Lys5基因座(GenBank保藏号M34929)、Aco2基因座(GenBank保藏号AJ001300)、Pox3基因座(Pox3:GenBank保藏号XP_503244或Aco3:GenBank保藏号AJ001301)、Δ12去饱和酶基因座(美国专利7,214,491)、Lip1基因座(GenBank保藏号Z50020)、Lip2基因座(GenBank保藏号AJ012632)、SCP2基因座(GenBank保藏号AJ431362)、Pex3基因座(GenBank保藏号CAG78565)、Pex16基因座(GenBank保藏号CAG79622)和/或Pex10基因座(GenBank保藏号CAG81606)。
优选的用于解脂耶氏酵母的选择方法是对卡那霉素、潮霉素和氨基糖苷G418的抗性以及在缺乏尿嘧啶、亮氨酸、赖氨酸、色氨酸或组氨酸的培养基上生长的能力。5-氟乳清酸[5-氟尿嘧啶-6-羧酸一水合物或“5-FOA”]也可用于选择酵母Ura-突变体。该化合物对具有编码乳清酸苷5′-单磷酸脱羧酶[OMP脱羧酶]的功能性URA3基因的酵母细胞有毒性;因此,基于这种毒性,5-FOA特别可用于选择和鉴定Ura-突变型酵母菌株(Bartel,P.L.和Fields,S.,Yeast 2-Hybrid System,Oxford University:New York,第7卷,第109-147页,1997;也参见国际申请公开WO2006/052870,用于5-FOA在耶氏酵母属中的应用)。
用于耶氏酵母属的备选的优选选择方法依赖于用于解脂耶氏酵母的基于磺酰脲(氯嘧磺隆;E.I.duPont de Nemours & Co.,Inc.,Wilmington,DE)抗性的显性非抗生素标记。更具体地讲,该标记基因是具有单个氨基酸改变(即W497L)的天然乙酰羟酸合酶(“AHAS”或乙酰乳酸合酶;E.C.4.1.3.18),从而赋予了磺酰脲除草剂抗性(国际申请公布WO 2006/052870)。AHAS是用于生物合成支链氨基酸(即缬氨酸、亮氨酸、异亮氨酸)的途径中的第一常用酶,并且它是磺酰脲和咪唑啉酮除草剂的靶标。
用于多不饱和脂肪酸生产的发酵方法
使转化的宿主细胞在优化PUFA生物合成基因的表达并且产生最大并且最经济产量的期望PUFA的条件下生长。通常,可通过改变碳源的类型和量、氮源的类型和量、碳氮比、不同矿物离子的量、氧水平、生长温度、pH、生物质生产期的时长、油积聚期的时长以及细胞收获的时间和方法来优化培养基条件。通常使所关注的含油酵母例如解脂耶氏酵母在复合培养基(例如酵母提取物-蛋白胨-葡萄糖肉汤培养基(YPD))或缺乏生长所必需的成分并强迫选择期望表达盒的确定成分的极限培养基(如Yeast Nitrogen Base(DIFCO Laboratories,Detroit,MI))中生长。
用于本文所述方法和宿主细胞的发酵培养基必须包含合适的碳源如在美国专利7,238,482中提出的碳源。合适的碳源涵盖多种来源,优选糖、甘油和/或脂肪酸。最优选的碳源是葡萄糖和/或包含介于10-22个碳之间的脂肪酸。
氮可以由无机源(如(NH4)2SO4)或有机源(如尿素或谷氨酸)提供。除了合适的碳源和氮源之外,发酵培养基还必须含有合适的矿物质、盐、辅因子、缓冲液、维生素和本领域技术人员已知的适合含油酵母生长并促进PUFA产生的酶促途径的其它组分。要特别注意促进脂质和PUFA合成的几种金属离子(如Fe+2、Cu+2、Mn+2、Co+2、Zn+2和Mg+2)(Nakahara,T.等人,Ind.Appl.Single Cell Oils,D.J.Kyle和R.Colin编辑,第61-97页(1992))。
用于本文所述方法和宿主细胞的优选生长培养基是常规的商业制备培养基,如酵母氮源培养基(Yeast Nitrogen Base,DIFCO Laboratories,Detroit,MI)。也可以使用其它确定成分的生长培养基或合成的生长培养基,适于转化宿主细胞生长的培养基对微生物学或发酵科学来说是已知的。适于发酵的pH范围通常介于约pH 4.0至pH 8.0之间,其中优选将pH 5.5至pH 7.5作为初始生长条件的范围。发酵可以在有氧或厌氧条件下进行,其中优选为微好氧条件。
通常,在含油酵母细胞中积聚高水平的PUFA和TAG需要一个两阶段过程,因为代谢状态必须在生长和脂肪的合成/贮存之间达到“平衡”。因此,最优选的是,两阶段的发酵过程是在含油酵母中产生油所必需的。这种方法在美国专利7,238,482中有所描述,其也描述了多种合适的发酵工艺设计(即分批式、补料分批式和连续式)和生长过程中的注意事项。
PUFA油的纯化和处理
包括PUFA在内的脂肪酸可以游离脂肪酸或酯化形式如酰基甘油、磷脂、硫脂或糖脂形式存在于宿主生物中。这些脂肪酸可通过本领域熟知的多种方法从宿主细胞中提取。关于酵母脂质的提取技术、质量分析和可接受标准的一篇综述是Z.Jacobs(Critical Reviews in Biotechnology,12(5/6):463-491(1992))的综述。有关下游加工的简述也可由A.Singh和O.Ward(Adv.Appl.Microbiol.,45:271-312(1997))提供。
一般来讲,用于纯化脂肪酸(包括PUFA)的手段可包括用有机溶剂萃取(例如,美国专利6,797,303和美国专利5,648,564)、超声处理、超临界流体萃取(如使用二氧化碳)、皂化和物理手段(例如挤压)、或它们的组合。参见美国专利7,238,482.
油在食物、保健食品、药物和动物饲料中的用途
市场包含许多种掺入了ω-3和/或ω-6脂肪酸(尤其是ALA、GLA、ARA、EPA、DPA和DHA)的食物和饲料产品。可以预期,通过本文所述方法和宿主细胞制备的包含长链PUFA的微生物生物质、包含PUFA的部分纯化微生物生物质、包含PUFA的纯化微生物油、和/或纯化的PUFA赋予健康有益效果,通过加入这些物质改善了对食物或饲料的摄取。举例来说,可将这些油加到食物类似物、饮料、肉产品、谷类产品、烘焙食品、小吃食品和乳制品中。参见美国专利公开2006/0094092。
这些组合物通过加入到医疗食品(包括医疗营养物质、饮食补充剂、婴儿代乳品和药品)中可赋予健康有益效果。技术人员将会知道加到食品、饲料、饮食补充剂、营养物质、药品、和其它可摄取的产品中以赋予健康有益效果的油的量。本领域描述了摄取这些油获得的健康有益效果,它是技术人员已知的并进行了持续的研究。本文所指的量是“有效”量,除了其它方面的原因,它取决于摄入的包含这些油的产品的性质以及它们旨在治疗的病症。
优选实施方案的描述
如实施例所证明并如表5所概述的(下文),破坏YlPex10p的C3HC4环状锌指基序的C-末端部分、缺失整个染色体YlPex10基因或整个染色体YlPex16基因、缺失整个染色体YlPex10和YlPex16基因、和缺失整个染色体YlPex3基因都导致产生解脂耶氏酵母的工程化PUFA生产菌株,相对于不破坏天然Pex蛋白的亲本菌株,该菌株具有以总脂肪酸百分比形式表示的、提高的PUFA重量%。在解脂耶氏酵母的工程化EPA生产菌株中表达染色体外的YlPex10p逆转了该效应,所述菌株在基因组Pex10p中具有破坏和提高的总脂质级分和油级分中的PUFA的量。
表5汇编了来自实施例3、4、5、7、9、11和12的数据,使得关于总脂质含量[“TFA%DCW”]、以总脂肪酸重量%表示的给定脂肪酸浓度[“TFA%”]、和干细胞重量百分比形式表示的给定脂肪酸含量[“DCW%”]的趋势能基于存在/不存在Pex破坏或敲除被推论出来。“期望PUFA%TFA”和“期望PUFA%DCW”分别定量期望的PUFA产物(即DGLA或EPA)的特定浓度或含量,所述产物通过设计的工程化PUFA生物合成途径进行生产。“全部PUFA”包括LA、ALA、EDA、DGLA、ETrA、ETA和EPA、然而“C20PUFA”被限定为EDA、DGLA、ETrA、ETA和EPA。
虽然数据不能直接地在实施例之间进行比较,但作为不同的耶氏酵母属菌株和生长条件的结果,可得出以下结论(相对于其天然Pex蛋白未被破坏的亲本菌株或其表达破坏天然Pex蛋白的“后备”拷贝的亲本菌株):
1)在生产PUFA的耶氏酵母属中的Pex破坏导致总脂质级分和油级分中的单个PUFA例如EPA或DLGA的重量%相对于总脂肪酸的重量%(%TFA)提高;
2)在生产PUFA的耶氏酵母属中的Pex破坏导致总脂质级分和油级分中的C20PUFA的重量%相对于总脂肪酸的重量%(%TFA)提高;
3)通过延伸点1),在生产PUFA的耶氏酵母属中的Pex破坏导致总脂质级分和油级分中的任何PUFA和所有PUFA组合的量相对于总脂肪酸的重量%提高;和
4)在生产PUFA的耶氏酵母属中的Pex破坏导致单个PUFA例如EPA或DLGA相对于干细胞重量的百分比提高。
当比较在“所有PUFA%DCW”、“C20PUFA%DCW”和TFA%DCW中的Pex破坏的效应时,观察到不同的结果。具体地讲,在一些情况下,在生产PUFA的耶氏酵母属中的Pex破坏导致总脂质级分和油级分中的C20PUFA或所有PUFA的量(以干细胞重量百分比形式表示)提高(相对于天然Pex蛋白未被破坏的亲本菌株)。在其它情况下,在总脂质级分和油级分中存在以干细胞重量百分比形式表示的减少量的C20PUFA或所有PUFA(相对于天然Pex蛋白未被破坏的亲本菌株)。也观察到相对于总脂质含量(TFA%DCW)的类似结果,因此Pex破坏的效应可能导致总脂质含量提高或降低。
虽然上文概述的每项内容都是受关注的,尤其有用的是检查Pex破坏对期望的PUFA相对于总PUFA量的比率的效应,对生物进行工程化以生产期望的PUFA。
例如,在包含Pex10破坏的菌株Y4128中54%的PUFA(%TFA)是EPA,所述破坏导致截去C末端的最后32个氨基酸,然而在亲本菌株Y4086中仅有16.3%的PUFA(%TFA)是EPA。因此,所述破坏导致期望的PUFA(%TFA)量提高3.3倍(实施例3、4)。在类似情况下,在菌株Y4036(ΔPex16)中62.8%的PUFA(%TFA)是DGLA,然而在Y4036-中仅有38.1%的PUFA(%TFA)是DGLA,提高了1.65倍(实施例9)。并且在菌株Y4036(ΔPex3)中67.7%的PUFA(%TFA)是DGLA,然而在Y4036-中仅有33.3%的PUFA(%TFA)是DGLA,提高了2.0倍(实施例12)。这些结果证实这一假说:Pex破坏导致总脂质级分和油级分中的期望的PUFA的量(%TFA)选择性地提高,所述PUFA通过工程化生物进行生产。
当检查Pex破坏对C20PUFA相对于总PUFA量的比率的效应时,观察到较低的选择性。例如,在包含Pex10破坏的菌株Y4128中,73%的PUFA(%TFA)是C20PUFA,然而在菌株Y4086中,仅有42%的PUFA(%TFA)是C20PUFA。因此,破坏导致总脂质级分和油级分中积聚的C20PUFA相对于总PUFA的量提高1.7倍(实施例3、4)。在类似情况下,在菌株Y4036(ΔPex16)中71%的PUFA(%TFA)是C20PUFA,然而在Y4036-中仅有54.8%的PUFA(%TFA)是C20PUFA,提高了1.3倍(实施例9)。并且在菌株Y4036(ΔPex3)中82.4%的PUFA(%TFA)是C20PUFA,然而在Y4036-中仅有47.4%的PUFA(%TFA)是C20PUFA,提高了1.7倍(实施例12)。
基于教导和本文所述的结果,期望将实现以下方法的可行性和商业可用性:利用在编码过氧化物酶体生物合成因子蛋白的天然基因中的多个破坏作为提高在生产PUFA的真核生物中的PUFA量的方法。生产PUFA的真核生物能够合成多种ω-3和/或ω-6PUFA,该合成利用Δ9延伸酶/Δ8去饱和酶途径或Δ6去饱和酶/Δ6延伸酶途径。
实施例
在以下实施例中进一步描述本发明,所述实施例示出了实施本发明的简要方法,但不完全限定其所有可能变型。
一般方法
实施例中使用的标准重组DNA技术和分子克隆技术是本领域所熟知的并且在如下文献中有所描述:1)Sambrook,J.,Fritsch,E.F.和Maniatis,T.的Molecular Cloning:A Laboratory Manual;Cold SpringHarbor Laboratory:Cold Spring Harbor,NY(1989)(Maniatis);2)T.J.Silhavy,M.L.Bennan和L.W.Enquist,Experiments with GeneFusions;Cold Spring Harbor Laboratory:Cold Spring Harbor,NY(1984);以及3.)Ausubel,F.M.等人,Current Protocols in Molecular Biology,由Greene Publishing Assoc.和Wiley-Interscience出版,Hoboken,NJ(1987)。
适用于微生物培养物的维持和生长的材料和方法是本领域熟知的。适用于下面实施例的技术可在Manual of Methods for GeneralBacteriology(Phillipp Gerhardt,R.G.E.Murray,Ralph N.Costilow,Eugene W.Nester,Willis A.Wood,Noel R.Krieg和G.Briggs Phillips编辑),American Society for Microbiology:Washington,D.C.(1994));或Thomas D.Brock in Biotechnology:A Textbook of IndustrialMicrobiology,第2版,Sinauer Associates:Sunderland,MA(1989)的描述中找到。用于生长和维持微生物细胞的所有试剂、限制性酶和材料得自Aldrich Chemicals(Milwaukee,WI)、DIFCO Laboratories(Detroit,MI)、New England Biolabs,Inc.(Beverly,MA)、GIBCO/BRL(Gaithersburg,MD)、或Sigma Chemical Company(St.Louis,MO)。通常于37℃下在Luria Bertani(LB)平板上培养大肠杆菌菌株。
一般的分子克隆根据标准方法(Sambrook等人,同上)来完成。DNA序列是在ABI自动测序仪上采用染料终止剂技术(美国专利5,366,860;EP 272,007)使用载体和插入特异性引物的组合来产生的。序列编辑是在Sequencher(Gene Codes Corporation,Ann Arbor,MI)中进行的。所有序列在两个方向上覆盖至少两次。除非另外指明,使用DNASTAR软件(DNASTAR Inc.,Madison,WI)比较本文的基因序列。
缩写的含义如下:“sec”表示秒,“min”表示分钟,“h”表示小时,“d”表示天,“μL”表示微升,“mL”表示毫升,“L”表示升,“μM”表示微摩尔浓度,“mM”表示毫摩尔浓度,“M”表示摩尔浓度,“mmol”表示毫摩尔,“μmole”表示微摩尔,“g”表示克,“μg”表示微克,“ng”表示纳克,“U”表示单位,“bp”表示碱基对,并且“kB”表示千碱基对。
表达盒的命名:
表达盒的结构通过简单的记号系统“X::Y::Z”表示,其中X描述启动子片段,Y描述基因片段而Z描述终止子片段,它们均彼此可操作地连接。
解脂耶氏酵母的转化和培养
解脂耶氏酵母菌株ATCC#20362购自美国典型培养物保藏中心(Rockville,MD)。解脂耶氏酵母菌株通常是在根据下面所示配方的数种培养基中于28-30℃下培育。根据标准方法,按需要通过将20g/L琼脂添加到每种液体培养基中来制备琼脂平板。
YPD琼脂培养基(每升):10g酵母提取物[Difco]、20g细菌用蛋白胨[Difco]、以及20g葡萄糖。
基本培养基(MM)(每升):20g葡萄糖、1.7g无氨基酸酵母氮源、1.0g脯氨酸,并且pH6.1(不需要调节)。
基本培养基+尿嘧啶(MM+尿嘧啶或MMU)(每升):如上制备MM培养基并加入0.1g尿嘧啶和0.1g尿苷。
基本培养基+尿嘧啶+磺酰脲类(MMU+SU)(每升):如上制备MMU培养基并加入280mg磺酰脲类。
基本培养基+亮氨酸+赖氨酸(MMLeuLys)(每升):如上制备MM培养基并加入0.1g亮氨酸和0.1g赖氨酸。
基本培养基+5-氟乳清酸(MM+5-FOA)(每升):20a葡萄糖、6.7g酵母氮源、75mg尿嘧啶、75mg尿苷,并且基于对一系列从100mg/L至1000mg/L的浓度的FOA活性试验(因为供应商提供的每批料中会发生改变),选用适量的FOA(Zymo Research Corp.,Orange,CA)。
高葡萄糖培养基(HGM)(每升):80葡萄糖、2.58g KH2PO4和5.36g K2HPO4,pH7.5(不需要调节)。
无酵母提取物的发酵培养基(FM无YE)(每升):6.70g酵母氮源、6.00g KH2PO4,2.00g K2HPO4、1.50g MgSO4*7H2O和20g葡萄糖。
发酵培养基(FM)(每升):如上制备无YE培养基的FM并加入5.00g酵母提取物(BBL)。
合成右旋糖培养基(SD)(每升):6.7g酵母氮源,具有硫酸铵并且无氨基酸;和20g葡萄糖。
Complete Minimal Glucose Broth Minus Uracil(CSM-Ura):目录号
C8140,Teknova,Hollister,CA(0.13%氨基酸缺陷型粉末,无尿嘧啶。0.17%酵母氮源,0.5%(NH4)2SO4,2.0%葡萄糖)。
解脂耶氏酵母的转化根据Chen,D.C.等人(Appl.MicrobiolBiotechnol.,48(2):232-235(1997))的方法完成,除非另外说明。简而言之,将耶氏酵母划线到YPD平板上,并在30℃下生长大约18小时。从平板上刮掉几大环量的细胞并将其重悬浮于含有下面成分的lmL转化缓冲液中:2.25mL 50%PEG,平均分子量为3350;0.125mL 2M醋酸锂,pH6.0;以及0.125mL 2M DTT。然后,将大约500ng线性化的质粒DNA在100μL重悬浮的细胞中孵育,并将其在39℃下维持1小时,每间隔15分钟进行涡旋混合。将细胞铺在选择培养基平板上并在30℃下维持2至3天。
解脂耶氏酵母的脂肪酸分析
为了进行脂肪酸分析,将细胞通过离心收集并如Bligh,E.G.&Dyer,W.J.(Can.J.Biochem.Physiol.,37:911-917(1959))中所述提取脂质。通过将脂质提取物与甲醇钠进行酯交换反应而制备脂肪酸甲酯[“FAME”(Roughan,G.和Nishida I.,Arch Biochem Biophys.,276(1):38-46(1990))并随后将其用配有30m×0.25mm(内径)HP-INNOWAX(Hewlett-Packard)柱的Hewlett-Packard 6890气相色谱仪(GC)进行分析。炉温以3.5℃/分钟从170℃(保持25分钟)升到185℃。
为了进行直接的碱催化酯交换反应,收获耶氏酵母培养物(3mL),在蒸馏水中洗涤一次,并在Speed-Vac中在真空下干燥5-10分钟。将甲醇钠(100μL,浓度为1%)加入样本中,然后将样本涡旋振荡20分钟。在加入3滴1M氯化钠和400μL己烷后,涡旋并离心样本。移出上层并如上所述用GC进行分析。
实施例1
产生解脂耶氏酵母菌株Y4086以通过Δ9延伸酶/Δ8去饱和酶途径生产
占总脂质约14%的EPA
本实施例描述了来源于解脂耶氏酵母ATCC#20362的菌株Y4086的构建,该菌株能通过Δ9延伸酶/Δ8去饱和酶途径的表达生产相对于总脂质而言约14%的EPA(图3A)。
菌株Y4086的产生需要构建菌株Y2224(由野生型耶氏酵母菌株ATCC#20362的Ura3基因自主突变而产生的FOA抗性突变体)、菌株Y4001(产生17%EDA,具有Leu-表型)、菌株Y4001U(Leu-和Ura表型)、菌株Y4036(产生18%DGLA,具有Leu-表型)、菌株Y4036U(具有Leu-和Ura-表型)和菌株Y4070(产生12%ARA,具有Ura-表型)。关于构建菌株Y2224、Y4001、Y4001U、Y4036、Y4036U和Y4070的更多细节描述于实施例7,国际申请公布WO 2008/073367。
相对于野生型解脂耶氏酵母ATCC#20362,菌株Y4070的最终基因型是Ura3-、unknown 1-、unknown 3-、Leu+、Lys+、GPD::FmD12::Pex20、YAT1::FmD12::OCT、YAT1::ME3S::Pex16、GPAT::EgD9e::Lip2、EXP1::EgD9eS::Lip1、FBAINm::EgD9eS::Lip2、FBAINm::EgD8M::Pex20、EXP1::EgD8M::Pex16、FBAIN::EgD5::Aco、EXP1::EgD5S::Pex20、YAT1::RD5S::OCT(其中FmD12是串珠镰刀菌Δ12去饱和酶基因[国际申请公布WO 2005/047485];ME3S是经密码子优化的C16/18延伸酶基因,来源于高山被孢霉[国际申请公布WO 2007/046817];EgD9e是小眼虫Δ9延伸酶基因[国际申请公布WO 2007/061742];EgD9eS是经密码子优化的Δ9延伸酶基因,来源于小眼虫[国际申请公布WO 2007/061742];EgD8M是合成突变型Δ8去饱和酶[国际申请公布WO 2008/073271],来源于小眼虫[美国专利7,256,033];EgD5是小眼虫Δ5去饱和酶[美国专利申请公布US 2007-0292924-A1];EgD5S是经密码子优化的Δ5去饱和酶基因,来源于小眼虫[美国专利申请公布2007-0292924];以及RD5S是经密码子优化的Δ5去饱和酶,来源于多甲藻属CCMP626[美国专利申请公布2007-0271632])。
产生菌株Y4086以生产占总脂质约14%的EPA
构建体pZP3-Pa777U(图3B;SEQ ID NO:28)描述于表19,国际申请公布WO 2008/07336,它被生成以将三个Δ17去饱和酶基因整合进菌株Y4070的Pox3位点中(GenBank保藏号AJ001301),从而生成EPA。Δ17去饱和酶基因是PaD17,一种瓜果腐霉菌Δ17去饱和酶(国际申请公布WO 2008/054565),和PaD17S,一种经密码子优化的Δ17去饱和酶,来源于瓜果腐霉菌(国际申请公布WO 2008/054565)。
将pZP3-Pa777U质粒用AscI/SphI消化,然后根据“一般方法”将其用于转化菌株Y4070。将转化细胞铺在MM平板上并在30℃下维持2至3天。将单菌落再次划线至MM平板上,随后将其在30℃下接种至液体MMLeuLys中并以250rpm/min摇动2天。离心收集细胞,提取脂质,通过酯交换反应来制备FAME,并随后用Hewlett-Packard 6890GC进行分析。
GC分析显示,EPA存在于含有pZP3-Pa777U的3种嵌合基因的转化体中,但不存在于亲本Y4070菌株中。所选的96个菌株中大部分产生占总脂质约10-13%的EPA。有2个菌株(即#58和#79)产生了占总脂质约14.2%和13.8%的EPA。将这两个菌株分别命名为Y4085和Y4086。
相对于野生型解脂耶氏酵母ATCC#20362,菌株Y4086的最终基因型是Ura3+、Leu+、Lys+、unknown 1-、unknown 2-、YALI0F24167g-、GPD::FmD12::Pex20、YAT1::FmD12::OCT、YAT1::ME3S::Pex16、GPAT::EgD9e::Lip2、EXP1::EgD9eS::Lip1、FBAINm::EgD9eS::Lip2、FBAINm::EgD8M::Pex20、EXP 1::EgD8M::Pex16、FBAIN::EgD5::Aco、EXP1::EgD5S::Pex20、YAT1::RD5S::OCT、YAT1::PaD17S::Lip1、EXP1::PaD17::Pex16、FBAINm::PaD17::Aco。
实施例2
产生解脂耶氏酵母菌株Y4128以通过Δ9延伸酶/Δ8去饱和酶途径生产
占总脂质约37%的EPA
本实施例描述来源于解脂耶氏酵母ATCC#20362的菌株Y4128的构建,该菌株能够产生相对于总脂质约37.6%的EPA(即,相对于Y4086以总脂肪酸的百分比表示的EPA浓度有大于2倍的提高;图3A)。
菌株Y4128的产生需要构建菌株Y2224、Y4001、Y4001U、Y4036、Y4036U、Y4070和Y4086(如实施例1所述),以及构建菌株Y4086U1(Ura-)。
产生菌株Y4086U1(Ura-)
菌株Y4086U1经由在菌株Y4086的构建体pY117中暂时表达Cre重组酶(图4A;SEQ ID NO:29;描述于国际申请公布WO 2008/073367的表20中)制备Ura-表型。这从基因组释放出LoxP夹着的Ura3基因。突变的耶氏酵母属乙酰羟酸合酶[“AHAS”;E.C.4.1.3.18](即,GenBank保藏号XP501277,包含W497L突变,如SEQ ID NO:27所示;参见国际申请公布WO 2006/052870)在质粒pY117中,赋予其磺酰脲类抗除草剂性(SUR),该抗性用作阳性筛选标记。
根据“一般方法”将质粒pY117用于转化菌株Y4086。在转化后,将细胞置于MMU+SU(280μg/mL磺酰脲类;也称作氯嘧磺隆,E.I.duPont de Nemours & Co.,Inc.,Wilmington,DE)平板上并在30℃下保持2至3天。挑取在MMU+SU平板上生长的SUR单菌落,将其在30℃下划线至YPD液体培养基中并以250rpm/min摇动1天以固化pY117质粒。将生长培养基划线接种在MMU平板上。在30℃下保持两天后,将单克隆再划线接种在MM和MMU平板上。选择可以在MMU平板上生长但不在MM平板上生长的那些菌落。将这些菌株中具有Ura-表型的两个菌株命名为Y4086U1和Y4086U2。
产生菌株Y4128以生产占总脂质约37%的EPA
制备构建体pZP2-2988(图4B;SEQ ID NO:30;描述于表21,国际申请公布WO 2008/073367)以将一个Δ12去饱和酶基因(即,FmD12S,一种经密码子优化的Δ12去饱和酶基因,来源于串珠镰刀菌[国际申请公布WO 2005/047485])、两个Δ8去饱和酶基因(即,EgD8M)和一个Δ9延伸酶基因(即,EgD9eS)整合进菌株Y4086U1的Pox2位点(GenBank保藏号AJ001300)中,从而在较高水平上生产EPA。将pZP2-2988质粒用AscI/SphI消化,然后根据“一般方法”将其用于转化菌株Y4086U1。将转化细胞铺在MM平板上并在30℃下维持2至3天。将单菌落再次划线至MM平板上,随后将其在30℃下接种至液体MMLeuLys中并以250rpm/min摇动2天。通过离心收集细胞,将其重悬浮于HGM中,然后以250rpm/min摇动5天。离心收集细胞,提取脂质,通过酯交换反应来制备FAME,并随后用Hewlett-Packard 6890GC进行分析。
GC分析显示所选择的96个菌株中的大多数生产了占总脂12-15.6%的EPA。有2个菌株(即,Group I中的#37和Group II中的#33)生产占总脂质约37.6%和16.3%的EPA。将这两个菌株分别命名为Y4128和Y4129。
相对于野生型解脂耶氏酵母ATCC#20362的菌株Y4128的最终基因型是:YALI0F24167g-、Pex10-、unknown 1-、unknown 2-、GPD::FmD12::Pex20、YAT1::FmD12::OCT、GPM/FBAIN::FmD12S::OCT、YAT1::ME3S::Pex16、GPAT::EgD9e::Lip2、EXP1::EgD9eS::Lip1、FBAINm::EgD9eS::Lip2、FBA::EgD9eS::Pex20、FBAINm::EgD8M::Pex20、EXP1::EgD8M::Pex16、GPDIN::EgD8M::Lip1、YAT1::EgD8M::Aco、FBAIN::EgD5::Aco、EXP1::EgD5S::Pex20、YAT1::RD5S::OCT、YAT1::PaD17S::Lip1、EXP1::PaD17::Pex16、FBAINm::PaD17::Aco。
在2007年8月23日将解脂耶氏酵母菌株Y4128保藏于美国典型培养物保藏中心,并且命名为ATCC PTA-8614。
产生具有Ura-表型的Y4128U菌株
为了破坏菌株Y4128中的Ura3基因,制备构建体pZKUE3S(图5ASEQ ID NO:31;描述于表22,国际申请公布WO 2008/073367)以将EXP1::ME3S::Pex20嵌合基因整合进菌株Y4128的Ura3基因中。将质粒pZKUE3S用SphI/PacI消化,然后根据“一般方法”将其用于转化菌株Y4128。转化后,将细胞铺至MM+5-FOA选择平板上并在30℃下维持2至3天。
挑取在MM+5-FOA选择平板上生长的总共24个转化体,并且再次划线接种到新MM+5-FOA平板上。从板上剥离细胞,提取脂质,通过酯交换反应来制备FAME,并随后用Hewlett-Packard 6890GC进行分析。
GC分析显示来自平板的具有pZKUE3S的所有转化体中存在10-15%的EPA。鉴定为#3、#4、#10、#12、#19和#21的菌株分别生产占总脂质12.9%、14.4%、15.2%、15.4%、14%和10.9%的EPA,将它们分别命名为Y4128U1、Y4128U2、Y4128U3、Y4128U4、Y4128U5和Y4128U6(统称为Y4128U)。
Y4128(37.6%)对Y4128U(平均13.8%)%EPA定量差异是因为不同的生长条件而产生的。具体地讲,前者的培养基是在液体培养基中生长两天后进行分析的,而后者的培养基是在琼脂平板上生长后进行分析的。申请人已经观察到当比较琼脂平板与液体培养基中的结果时,%EPA提高了2-3倍。因此,虽然不能直接比较结果,但Y4128和Y4128U菌株都证明产生了EPA。
实施例3
测定解脂耶氏酵母菌株Y4128的总脂质含量
通过GC分析法测定菌株Y4128生产的脂质总量和脂质中的每种脂肪酸物质的百分比。具体地讲,如一般方法所述,提取总脂质,通过酯交换反应来制备FAME,并随后用Hewlett-Packard 6890GC进行分析。
干细胞重量如下进行测定:从10mL培养物中经由离心收集细胞,用水洗涤细胞一次以除去残余培养基,在80℃真空炉中干燥细胞过夜,称量干细胞重量。样本中的FAME总量通过比较GC特征图中的所有峰面积与加入的已知量内部标准品C15:0脂肪酸的峰面积而进行测定。
基于上述分析,测定菌株Y4086和Y4128的以干细胞重量百分比形式表示的脂质含量(DCW)和脂质组成。菌株Y4128与菌株Y4086相比脂质含量降低(11.2TFA%DCW对28.6TFA%DCW)。相反地,菌株Y4128与菌株Y4086相比脂质中的EPA浓度提高,如下表6所示。脂肪酸被鉴定为18:0(硬脂酸)、18:1(油酸)、LA、ALA、EDA、DGLA、ETrA、ETA和EPA;脂肪酸组成表示为总脂肪酸的重量百分比(wt.%)(TFA)。
表6
在解脂耶氏酵母菌株Y4086和Y4128中的脂质组成
样本 | 18:0 | 18:1 | 18:2[LA] | 18:3(n-3)[ALA] | 20:2[EDA] | 20:3(n-6)[DGLA] | 20:3(n-3)[ETrA] | 20:4(n-3)[ETA] | 20:5(n-3)[EPA] |
Y4086 | 4.6 | 26.8 | 28.0 | 6.9 | 7.6 | 0.9 | 4.9 | 2.0 | 9.8 |
Y4128 | 1.8 | 6.7 | 19.6 | 1.8 | 4.2 | 3.4 | 1.5 | 6.0 | 42.8 |
细胞中的EPA含量以mg EPA/g干细胞形式表示,并且按照下式计算:(%EPA/脂质)*(%脂质/干细胞重量)*0.1,其从菌株Y4086中的28mg EPA/g DCW提高到菌株Y4128中的47.9mg EPA/g DCW。
因此,表6中的结果显示与亲本菌株Y4086相比,菌株Y4128具有更低的总脂质含量(TFA%DCW)(11.2%对28.6%),更高的EPA%TFA(42.8%对9.8%),和更高的EPA%DCW(4.8%对2.8%)。此外,菌株Y4128的EPA相对于总PUFA的量提高了3.3倍(54%的PUFA[%TFA]对16.3%的PUFA[%TFA]),而C20PUFA相对于总PUFA的量提高了1.7倍(73%的PUFA[%TFA]对42%的PUFA[%TFA])。
实施例4
测定在解脂耶氏酵母菌株Y4128中的整合位点pZP2-2988发生Pex10整
合
菌株Y4128中的pZP2-2988基因组整合位点通过基因组步移法,使用来自Clontech(Palo Alto,CA)的Universal GenomeWalkerTM试剂盒,按照制造商推荐的规程进行测定。基因质粒序列,设计以下引物用于基因组步移法:pZP-GW-5-1(SEQ ID NO:32)、pZP-GW-5-2(SEQ IDNO:33)、pZP-GW-5-3(SEQ ID NO:34)、pZP-GW-5-4(SEQ ID NO:35)、pZP-GW-3-1(SEQ ID NO:36)、pZP-GW-3-2(SEQ ID NO:37)、pZP-GW-3-3(SEQ ID NO:38)和pZP-GW-3-4(SEQ ID NO:39)。
使用Qiagen Miniprep试剂盒用改进规程从菌株Y4128中制备基因组DNA。从YPD培养基平板上刮除细胞并置于1.5mL微管中。将细胞沉淀物(100μL)用250μL缓冲液P1重悬,该缓冲液P1包含0.125M β-巯基乙醇和1mg/mL酵母裂解酶20T(MP Biomedicals,Inc.,Solon,OH)。细胞悬浮液在37℃下培养30分钟。然后将缓冲液P2(250μL)加到管中。在将管颠倒几次进行混合后,加入350μL缓冲液N3。然后将微管中的混合物在14,000rpm下离心5分钟。将上清液注入Qiagen小量制备离心柱中,离心1分钟。加入0.75mL缓冲液PE洗涤一次柱子,然后在14,000rpm下离心1分钟。在14,000rpm下再离心1分钟干燥该柱。通过加入50μL的缓冲液EB到柱中洗脱基因组DNA,允许静置1分钟并在14,000rpm下离心1分钟。
使用纯化的基因组DNA进行基因组步移法。按照GenomeWalker试剂盒的规程,用限制性酶DraI、EcoRV、PvuII和StuI分别消化DNA。就每次消化而言,反应混合物包含10μL 10X限制缓冲液、10μL适用的限制性酶和8μg基因组DNA,总体积为100μL。反应混合物在37℃下培养4小时。然后使用Qiagen PCR纯化试剂盒,严格按照制造商规程纯化经消化的DNA样本。DNA样本用16μL水洗脱。然后将纯化的经消化基因组DNA样本连接到genome walker衔接子上(下文)。每个连接混合物包含1.9μL genome walker衔接子、1.6μL 10X连接缓冲液、0.5μL T4DNA连接酶和4μL消化过的DNA。反应混合物在16℃下培养过夜。然后将72μL的50mM TrisHCl、1mM EDTA,pH7.5加入到每个连接混合物中。
就5′末端基因组步移法而言,使用1μL的每个连接混合物作模板进行四个PCR反应。此外,每个反应混合物包含1μL的10μM引物pZP-GW-5-1(SEQ ID NO:32)、1μL的10μM试剂盒配送的GenomeWalker衔接子、41μL水、5μL 10X cDNA PCR反应缓冲液和1μL来自Clontech的Advantage cDNA聚合酶混合物。Genome Walker衔接子序列(SEQ ID NOs:40[顶链]和41[底链])显示如下:
5′-GTAATACGACTCACTATAGGGCACGCGTGGTCGACGGCCCG
GGCTGGT-3′
3′-H2N-CCCGACCA-5′
PCR条件如下:95℃1分钟,随后是30个95℃20秒和68℃3分钟的循环,最后在68℃下延伸7分钟。将PCR产物每个进行1:100的稀释,并且将1μL稀释PCR产物用作模板进行第二轮PCR。条件是完全相同的,除了用pZP-GW-5-2(SEQ ID NO:33)代替pZP-GW-5-1(SEQ IDNO:32)。
就3′-末端基因组步移而言,如上文所述进行四个PCR反应,除了使用引物pZP-GW-3-1(SEQ ID NO:36)和巢式衔接子引物(SEQ IDNO:42)。PCR产物同样进行稀释并用作第二轮PCR的模板,使用pZP-GW-3-2(SEQ ID NO:37)代替pZP-GW-3-1(SEQ ID NO:36)。
通过凝胶电泳分析PCR产物。使用EcoRV消化的基因组DNA作为模板和引物pZP-GW-3-2以及巢式衔接子引物,一个反应产物生成~1.6kB的片段。分离该片段,用Qiagen凝胶纯化试剂盒纯化并克隆到pCR2.1-TOPO中。序列分析显示该片段包括两部分:质粒pZP2-2988和耶氏酵母属来自染色体C的基因组DNA。它们在染色体C的核苷酸位点139826上接合。这是位于Pex10基因的编码区内部(GenBank保藏号CAG81606;SEQ ID NO:10)。
为了测定接合的5′末端,使用来自菌株Y4128的基因组DNA作为模板以及引物Per10F1(SEQ ID NO:43)和ZPGW-5-5(SEQ ID NO:44)进行PCR扩增。反应混合物包括1μL的每个均为20μM的引物、1μL基因组DNA、22μL水和25μL TaKaRa ExTaq 2X预混物(TaKaRa BioInc.,Otsu Shiga,Japan)。热循环仪条件是:94℃1分钟,然后为30个94℃20秒、55℃20秒和72℃2分钟的循环,之后在72℃下进行7分钟最终延伸反应。扩增了1.6kB的DNA片段并将其克隆到pCR2.1-TOPO中。序列分析显示它是一个嵌合片段,该片段位于耶氏酵母属来自染色体C的基因组DNA和pZP2-2988之间。接合位于染色体C的核苷酸位点139817。因此,染色体C的10个核苷酸的片段被来自菌株Y4128中的pZP2-2988(图4B)AscI/SphI片段置换。因此,菌株Y4128中的Pex10缺少编码蛋白的最后32个氨基酸。
基于上述结论,实施例2中分离的Y4128U菌株2(同上)后来称为Δpex10菌株。为清楚起见,菌株Y4128U1等同于菌株Y4128U1(Δpex10)。
实施例5
在解脂耶氏酵母菌株Y4128U1(Δpex10)中的Pex10质粒表达
构建递送解脂耶氏酵母Pex10基因的三个质粒:1)pFBAIn-PEX10允许Pex10 ORF在FBAINm启动子控制下表达;和2)pPEX10-1和pPEX10-2允许在天然Pex10启动子控制下表达Pex10,虽然pPEX10-1使用较短版本(~500bp)的启动子而pPEX10-2使用较长版本(~900bp)的启动子。在构建这些表达质粒并转化后,测定Pex10质粒表达对解脂耶氏酵母菌株Y4128U1(Δpex10)总油含量和EPA含量的效应。缺失Pex10导致细胞中以TFA百分比形式表示的EPA的量提高,但是以DCW百分比形式表示的总脂质的量降低。
构建pFBAIn-PEX10、pPEX10-1和pPEX10-2
为了构建pFBAIn-PEX10,使用引物Per10F1(SEQ ID NO:43)和Per10R(SEQ ID NO:45),使用解脂耶氏酵母基因组DNA作为模板扩增Pex10基因的编码区。PCR反应混合物包含1μL每个均为20μM的引物、1μL解脂耶氏酵母基因组DNA(~100ng)、25μL ExTaq 2X预混物和22μL水。反应如下进行:94℃1分钟,然后为30个94℃20秒、55℃20秒和72℃90秒的循环,之后在72℃下进行7分钟最终延伸反应。PCR产物是一个1168bp的DNA片段,它用Qiagen PCR纯化试剂盒进行纯化,用NcoI和NotI消化,并克隆到用相同的两个限制性酶消化的pFBAIn-MOD-1(SEQ ID NO:46;图5B)中。
8个单克隆进行序列分析,2个具有无错的正确Pex10序列。pFBAIn-PEX10(SEQ ID NO:47;图6A)组分在下表7中列出。
表7
质粒pFBAIn-PEX10(SEQ ID NO:47)的组分
SEQ ID NO:47内的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
BglII-BsiWI(6040-318) | FBAINm::Pex10::Pex20,它包含:●FBAINm:解脂耶氏酵母FBAINm启动子(美国专利7,202,356);●Pex10:解脂耶氏酵母Pex10ORF(GenBank保藏号AB036770,核苷酸1038-2171;SEQ ID NO:21);●Pex20:来自耶氏酵母属Pex20基因(GenBank保藏号AF054613)的Pex20终止子序列 |
SEQ ID NO:47内的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
PacI-BglII(4530-6040) | 耶氏酵母属URA3(GenBank保藏号AJ306421) |
(3123-4487) | 耶氏酵母属自主复制序列18(ARS18;GenBank保藏号A17608) |
(2464-2864) | 大肠杆菌f1复制起点 |
(1424-2284) | 用于在大肠杆菌中选择的氨苄青霉素抗性基因(AmpR) |
(474-1354) | ColE1质粒复制起点 |
为了构建pPEX10-1和pPEX10-2,设计并合成引物PEx10-R-BsiWI(SEQ ID NO:48)、PEX10-F1-SalI(SEQ ID NO:49)和PEX10-F2-SalI(SEQ ID NO:50)。使用基因组解脂耶氏酵母DNA以及引物PEX10-R-BsiWI和PEX10-F1-SalI进行的PCR扩增生成1873bp的片段,该片段包含Pex10 ORF,Pex10基因的500bp的5′上游区域和215bp的3′下游区域,该基因的两端侧接SalI和BsiWI限制性位点。该片段用Qiagen PCR纯化试剂盒进行纯化,用SalI和BsiWI消化,并克隆到用相同的两个酶消化的pEXP-MOD-1(SEQ ID NO:51;图6B)中以生成pPEX10-1(SEQ ID NO:52;图7A)。质粒pEXP-MOD1类似于pFBAIn-MOD-1(SEQ ID NO:46;图5B),除了后者的FBAINm启动子被EXP1启动子置换。表8列出pPEX10-1的组分。
表8
质粒pPEX10-1(SEQ ID NO:52)的组分
SEQ ID NO:52中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
SalI-BsiWI(5705-1) | Pex10-5′::Pex10::Pex10-3′,它包含:●Pex10-5′:解脂耶氏酵母Pex10基因的500bp的5′启动子区域;●Pex10:解脂耶氏酵母Pex10ORF(GenBank保藏号AB036770,核苷酸1038-2171;SEQ ID NO:21);●Pex10-3′:来自耶氏酵母属Pex10基因的215bp的Pex10终止子序列(GenBank保藏号AB036770)[注意图中的整个Pex10-5′::Pex10::Pex10-3′表达盒统一标记成“PEX10”] |
SEQ ID NO:52中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
PacI-SalI(4216-5703) | 耶氏酵母属URA3基因(GenBank保藏号AJ306421) |
(2806-4170) | 耶氏酵母属自主复制序列18(ARS18;GenBank保藏号A17608) |
(2147-2547) | 大肠杆菌f1复制起点 |
(1107-1967) | 用于在大肠杆菌中选择的氨苄青霉素抗性基因(AmpR) |
(157-1037) | ColE1质粒复制起点 |
使用PEX10-R-BsiWI(SEQ ID NO:48)和PEX10-F2-SalI(SEQ IDNO:50)对解脂耶氏酵母基因组DNA进行PCR扩增生成2365bp的片段,该片段包含Pex10ORF,Pex10基因的991bp的5′上游区域和215bp的3′下游区域,该基因的两端侧接SalI和BsiWI限制性位点。用Qiagen PCR纯化试剂盒纯化该片段,用SalI和BsiWI进行消化,然后将其克隆到进行了相似消化的pEXP-MOD-1中。这导致合成pPEX10-2(SEQ IDNO:53),其构建类似于质粒pPEX10-1的构建(表8,同上),除了在嵌合Pex10-5′::Pex10::Pex10-3′基因中有更长的Pex10-5′启动子。
在菌株Y4128U1(Δpex10)中表达Pex10
按照一般方法中的规程将质粒pFBAIN-MOD-1(对照物;SEQ IDNO:46)、pFBAIn-PEX10(SEQ ID NO:47)、pPEX10-1(SEQ ID NO:52)和pPEX10-2(SEQ ID NO:53)转化到Y4128U1(Δpex10)中。将转化体置于MM平板上。递送上述质粒的转化体总脂质含量和脂肪酸组成按照实施例3所述进行分析。
下表9显示以干细胞重量(DCW)百分比表示的脂质含量和脂质组成。具体地讲,将脂肪酸鉴定为18:0(硬脂酸)、18:1(油酸)、LA、ALA、EDA、DGLA、ETrA、ETA和EPA;脂肪酸组成表示为总脂肪酸的重量百分比(wt.%)。
表9中的结果显示在Y4128U1(Δpex10)中从天然解脂耶氏酵母Pex10启动子或从解脂耶氏酵母FBAINm启动子表达Pex10,将EPA百分比降低回Y4086的水平,同时将总脂质含量(TFA%DCW)提高到Y4086的水平(参见表6数据用于比较)。每克干细胞的EPA含量从对照样本中(即,递送pFBAIn-MOD-1的细胞)的63.2mg变成递送pFBAIn-PEX10的细胞中的31.5mg、递送pPEX10-1的细胞中的29mg以及递送pPEX10-2的细胞中的30.8mg。这些结果证明破坏Pex10的环指结构域提高了细胞中的EPA的量但降低细胞中的总脂质含量。
因此,表9的结果显示与具有对照质粒的Y4128U1(Δpex10)转化体相比,所有具有Pex10表达质粒的转化体显示更高的脂质含量(TFA%DCW)(>27%对22.8%)、更低的EPA%TFA(大约10.8%对27.7%)、和更低的EPA%DCW(<3.1%对6.3%)。此外,具有对照质粒的菌株Y4128U1(Δpex10)转化体与那些具有Pex10表达质粒的转化体相比,EPA相对于总PUFA的量提高了2.5倍(44%的PUFA[%TFA]对17.5%(平均值)的PUFA[%TFA]),C20PUFA相对于总PUFA的量提高了1.5倍(67%的PUFA[%TFA]对44%(平均值)的PUFA[%TFA])。
实施例6
生成Y4184U菌株用于生产EPA
解脂耶氏酵母菌株Y4184U用作实施例7中的宿主,下文菌株Y4184U来源于解脂耶氏酵母ATCC#20362,并且能够经由Δ9延伸酶/Δ8去饱和酶途径的表达生产EPA。该菌株具有Ura-表型并且其构建如实施例7所述,国际申请公布WO 2008/073367。
然而概括地说,菌株Y4184U的产生需要构建菌株Y2224、菌株Y4001、菌株Y4001U、菌株Y4036、菌株Y4036U和菌株Y4069(如上文实施例1所述)。进一步产生菌株Y4184U(在图7B用图表表示)需要生成菌株Y4084、菌株Y4084U1、菌株Y4127(在2007年11月29日保藏于美国典型培养物保藏中心,保藏号ATCC PTA-8802)、菌株Y4127U2、菌株Y4158、菌株Y4158U1和菌株Y4184。质粒构建体pZKL1-2SP98C用于转化菌株Y4127U2,在图8A中用图表表示(SEQ IDNO:54;描述于表23,国际申请公布WO 2008/073367)。质粒pZKL2-5U89GC用于转化菌株Y4158U1,在图8B中显示(SEQ IDNO:55;描述于表24,国际申请公布WO 2008/073367)。
相对于野生型解脂耶氏酵母ATCC#20362的菌株Y4184(生产占总脂质31%的EPA)的最终基因型是unknown 1-、unknown 2-、unknown 4-、unknown 5-、unknown 6-、unknown 7-、YAT1::ME3S::Pex16、EXP1::ME3S::Pex20(2拷贝)、GPAT::EgD9e::Lip2、FBAINm::EgD9eS::Lip2、EXP1::EgD9eS::Lip1、FBA::EgD9eS::Pex20、YAT1::EgD9eS::Lip2、GPD::EgD9eS::Lip2、GPDIN::EgD8M::Lip1、YAT 1::EgD8M::Aco、EXP1::EgD8M::Pex16、FBAINm::EgD8M::Pex20、FBAIN::EgD8M::Lip1(2拷贝)、GPM/FBAIN::FmD 12S::Oct、EXP1::FmD12S::Aco、YAT1::FmD12::Oct、GPD::FmD12::Pex20、EXP1::EgD5S::Pex20、YAT1::EgD5S::Aco、YAT1::Rd5S::Oct、FBAIN::EgD5::Aco、FBAINm::PaD17::Aco、EXP1::PaD17::Pex16、YAT1::PaD17S::Lip1、YAT1::Y1CPT1::Aco、GPD::YlCPT1::Aco(其中FmD12是串珠镰刀菌Δ12去饱和酶基因[国际申请公布WO2005/047485];FmD12S是经密码子优化的Δ12去饱和酶基因,来源于串珠镰刀菌[国际申请公布WO 2005/047485];ME3S是经密码子优化的C16/18延伸酶基因,来源于高山被孢霉[国际申请公布WO 2007/046817];EgD9e是小眼虫Δ9延伸酶基因[国际申请公布WO 2007/061742];EgD9eS是经密码子优化的Δ9延伸酶基因,来源于小眼虫[国际申请公布WO 2007/061742];EgD8M是合成突变型Δ8去饱和酶[国际申请公布WO 2008/073271],来源于小眼虫[美国专利7,256,033];EgD5是小眼虫Δ5去饱和酶[美国专利申请公布US 2007-0292924-A1];EgD5S是经密码子优化的Δ5去饱和酶基因,来源于小眼虫[美国专利申请公布2007-0292924];RD5S是经密码子优化的Δ5去饱和酶,来源于多甲藻属CCMP626[美国专利申请公布2007-0271632];PaD17是瓜果腐霉菌Δ17去饱和酶[国际申请公布WO 2008/054565];PaD17S是一种经密码子优化的Δ17去饱和酶,来源于瓜果腐霉菌[国际申请公布WO2008/054565];和YlCPT1是一种解脂耶氏酵母二酰基甘油胆碱磷酸转移酶基因[国际申请公布WO 2006/052870])。
为了破坏菌株Y4184中的Ura3基因,构建体pZKUE3S(图5A SEQID NO:31;描述于表22,国际申请公布WO 2008/073367)用于将EXP1::ME3S::Pex20嵌合基因整合进菌株Y4184的Ura3基因中以分别产生菌株Y4184U1(占总脂质11.2%的EPA)、Y4184U2(占总脂质10.6%的EPA)和Y4184U4(占总脂质15.5%的EPA)(统称为Y4184U)。
实施例7
解脂耶氏酵母菌株Y4184U4中的Pex10染色体缺失提高了EPA的积聚
和总脂质含量
构建体pYPS161(图9A,SEQ ID NO:56)用于从生产EPA的耶氏酵母属菌株Y4184U4中敲除染色体Pex10基因(实施例6)。用Pex10敲除构建体转化解脂耶氏酵母菌株Y4184U4导致生成菌株Y4184(Δpex10)。测定并比较Pex10敲除对总油量和EPA含量的效应。具体地讲,敲除Pex10导致细胞中EPA(%TFA和%DCW)百分比提高以及总脂质含量提高。
构建体pYSP161
构建体pYPS161包含以下组分:
表10
质粒pYPS161(SEQ ID NO:56)的描述
SEQ ID NO:56中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
AscI/BsiW I(1521-157) | 耶氏酵母属Pex10基因的1364bp的Pex10敲除片段#1(GenBank保藏号AB036770) |
PacI/SphI(5519-4229) | 耶氏酵母属Pex10基因的1290bp的Pex10敲除片段#2(GenBank保藏号AB036770) |
SalI/EcoRI(7170-5551) | 耶氏酵母属URA3基因(GenBank保藏号AJ306421) |
2451-1571 | ColE1质粒复制起点 |
SEQ ID NO:56中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
3369-2509 | 用于在大肠杆菌中选择的氨苄青霉素抗性基因(AmpR) |
3977-3577 | 大肠杆菌f1复制起点 |
生成解脂耶氏酵母基因敲除菌株Y4184(ΔPex10)
用于转化解脂耶氏酵母菌株Y4184U4(实施例6)的标准规程使用Pex10敲除构建体pYPS161的纯化5.3kB AscI/SphI片段(同上),并且还制备了细胞单独对照。在每个转化实验中有大约200至250个菌落存在,然而在细胞单独平板上无菌落存在(每期望值)。
菌落PCR用于筛选具有Pex10缺失的细胞。具体地讲,PCR反应使用MasterAmp Taq聚合酶(Epicentre Technologies,Madison,WI),使用PCR引物Pex-10del1 3′.正向(SEQ ID NO:57)和Pex-10del2 5′.反向(SEQ ID NO:58),按照标准规程进行。PCR反应条件为94℃5分钟,然后30个94℃30秒、60℃30秒和72℃2分钟的循环,之后在72℃下进行6分钟的最终延伸反应。然后将该反应保持在4℃下。如果Pex10基因敲除构建体在Pex10区域整合,期望生成长度为2.8KB的单一PCR产物。相反地,如果该菌株在除Pex10区域之外的染色体区域内整合Pex10基因敲除构建体,则将生成两个PCR片段,即2.8kB和1.1kB的片段。在288个筛选的菌落中,大部分菌落具有在随机位点整合的Pex10基因敲除构建体。288个菌落中仅有一个菌落包含Pex10敲除。将该菌株命名为Y4184(Δpex10)。
评估解脂耶氏酵母菌株Y4184和Y4184(ΔPex10)的总油和EPA产量
为了评估Pex10基因敲除对总脂质级分中的PUFA百分比和细胞中的总脂质含量的效应,菌株Y4184和Y4184(Δpex10)在可比较的含油条件下生长。具体地讲,培养物在初始OD600为~0.1的情况下开始生长,在含有25mL发酵培养基(FM)或无酵母提取物(FM无YE)的FM培养基的250mL烧瓶中生长48小时。在50mL圆锥管中,以8000rpm离心10分钟收获细胞。弃去上清液,将细胞重悬于25mL HGM中并转移到新的250mL烧瓶中。细胞在30℃下透气培养另外120小时。
为了测定干细胞重量(DCW),处理来自5mL FM生长培养基和10mL无YE生长培养物的FM的细胞。培养的细胞在4300rpm下离心10分钟。使用10mL生理盐水重悬沉淀物并在相同条件下二次离心。然后用1mL无菌H2O(第三次)重悬沉淀物,将其转移到预先称重的铝盘中。细胞在80℃真空炉中干燥过夜。测量细胞重量。
递送上述质粒的转化体的总脂质含量和脂肪酸组成如实施例3所述分析。DCW、总脂质含量(TFA%DCW)、总EPA%TFA、和EPA%DCW在下表11中显示。
表11
在解脂耶氏酵母菌株Y4184和Y4184中的脂质组成(ΔPex10)
表11中的结果显示敲除Y4184(ΔPex10)中的染色体Pex10基因与菌株Y4184中的EPA百分比和总油含量相比提高EPA百分比(%TFA和%DCW)并提高总油含量,菌株Y4184的天然Pex10p未被敲除。更具体地讲,在FM培养基中,EPA(%TFA)有约109%的提高,EPA产量(%DCW)有约216%的提高,总油量(TFA%DCW)有约49%的提高。在无YE的FM培养基中,EPA(%TFA)有约100%的提高,EPA产量(%DCW)有约205%的提高,总油量(TFA%DCW)有约50%的提高。
因此,表11中的结果显示与亲本菌株Y4184相比,Y4184(ΔPex10)菌株在FM培养基中具有更高的脂质含量(TFA%DCW)(17.6%对11.8%),更高的EPA%TFA(43.2%对20.6%),和更高的EPA%DCW(7.6%对2.4%)。同样的,与亲本菌株Y4184相比,Y4184(ΔPex10)菌株在无YE的FM培养基中具有更高的脂质含量(TFA%DCW)(13.2%对8.8%),更高的EPA%TFA(46.1%对23.2%),和更高的EPA%DCW(6.1%对2.0%)。
实施例8(预测的)
在解脂耶氏酵母的PUFA生产菌株中的替代Pex基因的染色体敲除
本实施例描述了已经进行了工程化以生产ω-3/ω-6PUFA的解脂耶氏酵母的多个菌株。如果使用实施例7的方法同上破坏编码Pex1p、Pex2p、Pex3p、Pex3Bp、Pex4p、Pex5p、Pex6p、Pex7p、Pex8p、Pex12p、Pex13p、Pex14p、Pex16p、Pex17p、Pex19p、Pex20p、Pex22p或Pex26p的染色体基因,预期这些解脂耶氏酵母宿主菌株中的任何一种都能进行工程化以生产在总脂质级分和油级分中含量提高的ω-3/ω-6PUFA。
更具体地讲,申请人的受让人已经工程化了多种解脂耶氏酵母菌株,经由表达异源Δ6去饱和酶/Δ6延伸酶PUFA途径或异源Δ9延伸酶/Δ8去饱和酶PUFA途径来生产高浓度的多种ω-3/.ω-6PUFA。
生产ω-3/ω-6PUFA的代表性解脂耶氏酵母菌株概述
虽然在下表中概述了一些代表性菌株,但公开的生产ω-3/ω-6PUFA的解脂耶氏酵母菌株不受本文菌株的任何方式的限制。相反地,除了以下共有的和共同未决的专利申请之外,在本专利申请中提供的所有教导对于开发合适的工程化解脂耶氏酵母菌株以生产ω-3/ω-6PUFA也是有用的。这些教导具体地讲包括以下申请人的受让人的共同未决的专利和专利申请:美国专利7,125,672、美国专利7,189,559、美国专利7,192,762、美国专利7,198,937、美国专利7,202,356、美国专利7,214,491、美国专利7,238,482、美国专利7,256,033、美国专利7,259,255、美国专利7,264,949、美国专利7,267,976、美国专利7,273,746、美国专利申请10/985254和10/985691(提交于2004年11月10日)、美国专利申请11/183664(提交于2005年7月18日)、美国专利申请11/185301(提交于2005年7月20日)、美国专利申请11/190750(提交于2005年7月27日)、美国专利申请11/198975(提交于2005年8月8日)、美国专利申请11/253882(提交于2005年10月19日)、美国专利申请11/264784和11/264737(提交于2005年11月1日)、美国专利申请11/265761(提交于2005年11月2日)、美国专利申请11/601563和11/601564(提交于2006年11月16日)、美国专利申请11/635258(提交于2006年12月7日)、美国专利申请11/613420(提交于2006年12月20日)、美国专利申请11/787772(提交于2007年4月18日)、美国专利申请11/737772(提交于2007年4月20日)、美国专利申请11/740298(提交于2007年4月26日)、美国专利申请12/111237(提交于2008年4月29日)、美国专利申请11/748629和11/748637(提交于2007年5月15日)、美国专利申请11/779915(提交于2007年7月19日)、美国专利申请60/991266(提交于2007年11月30日)、美国专利申请11/952243(提交于2007年12月7日)、美国专利申请61/041716(提交于2008年4月2日)、美国专利申请12/061738(提交于2008年4月3日)、美国专利申请12/099811(提交于2008年4月9日)、美国专利申请12/102879(提交于2008年4月15日)、美国专利申请12/111237(提交于2008年4月29日)、美国专利申请61/055511(提交于2008年5月23日)和美国专利申请61/093007(提交于2008年8月29日)。
Pex基因的染色体敲除
在选择优选解脂耶氏酵母菌株生产期望的ω-3/ω-6PUFA(或它们的PUFA组合)后,本领域的技术人员能够容易地工程化合适的基因敲除构建体(类似于在实施例7中的pYPS161)以使得染色体Pex敲除基因通过转化进入解脂耶氏酵母菌株。优选的Pex基因将包括:YlPex1p(GenBank保藏号CAG82178;SEQ ID NO:1)、YlPex2p(GenBank保藏号CAG77647;SEQ ID NO:2)、YlPex3p(GenBank保藏号CAG78565;SEQ ID NO:3)、YlPex3Bp(GenBank保藏号CAG83356;SEQ ID NO:4)、YlPex4p(GenBank保藏号CAG79130;SEQ ID NO:5)、YlPex5p(GenBank保藏号CAG78803;SEQ ID NO:6)、YlPex6p(GenBank保藏号CAG82306;SEQ ID NO:7)、YlPex7p(GenBank保藏号CAG78389;SEQ ID NO:8)、YlPex8p(GenBank保藏号CAG80447;SEQ ID NO:9)、YlPex12p(GenBank保藏号CAG81532;SEQ ID NO:11)、YlPex13p(GenBank保藏号CAG81789;SEQ ID NO:12)、YlPex14p(GenBank保藏号CAG79323;SEQ ID NO:13)、YlPex16p(GenBank保藏号CAG79622;SEQ ID NO:14)、YlPex17p(GenBank保藏号CAG84025;SEQ ID NO:15)、YlPex19p(GenBank保藏号AAK84827;SEQ ID NO:16)、YlPex20p(GenBank保藏号CAG79226;SEQ ID NO:17)、YlPex22p(GenBank保藏号CAG77876;SEQ ID NO:18)和YlPex26p(GenBank保藏号NC_006072,反义翻译核苷酸117230-118387;SEQ ID NO:19)。
期望的是染色体破坏Pex基因将导致与天然过氧化物酶体生物合成因子蛋白未被破坏的真核生物相比,在总脂质级分和油级分中积聚的以总脂肪酸百分比形式表示的PUFA量增加,其中所述PUFA量可能是:1)以功能性PUFA生物合成途径期望的终产品形式产生的PUFA,与以中间体或副产品形式产生的PUFA相反,2)C20和C22PUFA,和/或3)总PUFA。优选的结果是与天然过氧化物酶体生物合成因子蛋白未被破坏的真核生物相比,不但以总脂肪酸百分比表示的PUFA量提高,而且以干细胞重量百分比形式表示的PUFA量也提高。此外,PUFA量可能是:1)以功能性PUFA生物合成途径期望的终产品形式产生的PUFA,与以中间体或副产品形式产生的PUFA相反,2)C20和C22PUFA,和/或3)总PUFA。在一些情况下,相对于未破坏天然过氧化物酶体生物合成因子蛋白的真核生物,总脂质含量也提高。
实施例9
在解脂耶氏酵母菌株Y4036U中的Pex16染色体缺失提高了积聚的
DGLA百分比
本实施例描述使用构建体pYRH13(图9B;SEQ ID NO:59)敲除DGLA生产耶氏酵母属菌株Y4036U(实施例1)中的染色体Pex16基因。用Pex16敲除构建体转化解脂耶氏酵母菌株Y4036U导致生成菌株Y4036U(Dpex16)。测定并比较Pex16敲除对DGLA含量的效应。具体地讲,敲除Pex16导致细胞中以总脂肪酸百分比形式存在的DGLA百分比提高。
构建体pYRH13
质粒pYRH13来源于质粒pYPS161(图9A,SEQ ID NO:56;实施例7)。具体地讲,解脂耶氏酵母Pex16基因(GenBank保藏号CAG79622)的1982bp的5′启动子区域置换pYPS161的AscI/BsiWI片段,解脂耶氏酵母Pex16基因(GenBank保藏号CAG79622)的448bp的3′终止子区域置换pYPS161的PacI/SphI片段以生成pYRH13(SEQ ID NO:59;图9B)。
生成解脂耶氏酵母基因敲除菌株Y4036(ΔPex16)
使用标准规程,用Pex16敲除构建体pYRH13的纯化6.0kBAscI/SphI片段转化解脂耶氏酵母的菌株Y4036U(实施例1)。
为了筛选具有Pex16缺失的细胞,使用Taq聚合酶(Invitrogen;Carlsbad,CA)、PCR引物PEX16Fii(SEQ ID NO:60)和PEX16Rii(SEQID NO:61)进行菌落PCR。设计这组引物以扩增完整Pex16基因的1.1kB区域,因此Pex16缺失突变体(即,Δpex16)将不生成该条带。设计第二组引物生成仅当Pex16基因被缺失时出现的条带。具体地讲,一个引物(即3UTR-URA3;SEQ ID NO:62)结合到导入的6.0kB AscI/SphI破坏片段的载体序列区域上,而另一个引物(即,PEX16-conf;SEQ IDNO:63)结合到在破坏片段的同源序列之外染色体Pex16终止子序列上。
更具体地讲,使用反应混合物进行菌落PCR,所述混合物包含:20mM Tris-HCl(pH 8.4)、50mM KCl、1.5mM MgCl2、400μM各种dGTP、dCTP、dATP、和dTTP、2μM的各个引物、20μL的水和2U Taq聚合酶。扩增如下进行:94℃120秒进行初始变性,随后在94℃60秒进行35个循环的变性,在55℃下退火60秒,在72℃下延伸120秒。最后在72℃下进行5分钟的最终延伸循环,然后在4℃下终止反应。
在205个筛选的菌落中,195个菌落具有在染色体随机位点整合的Pex16敲除片段,因此不是Δpex16突变体(然而,由于存在pYRH13,细胞能够在ura-平板上生长)。这些随机整合的菌株中的三个,命名为Y4036U-17、Y4036U-19和Y4036U-33,用作脂质生产实验中的对照物(下文)。
筛选的剩余10个菌落(即,总计205个菌落中的剩余菌落)包含Pex16基因敲除。将在Y4036U菌株背景中的这十个Δpex16突变体命名为RHY25至RHY34。
通过定量实时PCR确认解脂耶氏酵母基因敲除菌株Y4036U(ΔPex16)
通过定量实时PCR在菌株RHY25至RHY34中对Pex16基因敲除进行进一步确认,使用耶氏酵母属翻译延伸因子(tef-1)基因(GenBank保藏号AF054510)作为对照物。
首先,设计分别靶向于Pex16基因和tef-1基因的实时PCR引物和TaqMan探针,该设计使用Primer Express软件v 2.0(AppliedBiosystems,Foster City,CA)。具体地讲,设计实时PCR引物ef-324F(SEQ ID NO:64)、ef-392R(SEQ ID NO:65)、PEX16-741F(SEQ ID NO:66)和PEX16-802R(SEQ ID NO:67)以及TaqMan探针ef-345T(即,5′6-FAMTM-TGCTGGTGGTGTTGGTGAGTT-TAMRATM,其中核苷酸序列如SEQ ID NO:68所示)和PEX16-760T(即,5′-6FAMTM-CTGTCCATTCTGCGACCCCTC-TAMRATM,其中核苷酸序列如EQ ID NO:69所示)。TaqMan荧光探针5′末端具有结合的6FAMTM荧光报告基因染料,而3′末端包含TAMRATM淬灭剂。所有引物和探针获取自Sigma-Genosys(Woodlands,TX)。
敲除候选DNA通过在50μL水中悬浮1个菌落进行制备。tef-1和PEX16的反应分别进行,每个样本重复三次。实时PCR反应包括20pmoles各个正向和反向引物(即,ef-324F、ef-392R、PEX16-741F和PEX16-802R 5′,同上)、5pmoles TaqMan探针(即,ef-345T和PEX16-760T)、10μL TaqMan Universal PCR Master Mix--No AmpEraseUracil-N-Glycosylase(UNG)(目录号PN 4326614,AppliedBiosystems)、1μL菌落悬浮液和8.5μL去RNase/DNase水,每个反应总体积为20μL。反应在ABI PRISM7900Sequence Detection System上进行,使用以下条件:初始在95℃变性10分钟,随后进行95℃变性15秒、60℃退火1分钟的循环40次。在每个循环期间通过监控6-FAMTM荧光自动收集实时数据。使用用于数据归一化的tef-1基因阈值循环(CT)值,按照ABIPRISM7900Sequence Detection System使用说明书进行数据分析。
基于这一分析,得出结论:所有十个Y4036U(Δpex16)菌落(即,RHY25至RHY34)是有效的Pex16基因敲除菌落,其中pYRH13构建体已经整合进染色体YlPex16中。
评估用于DGLA生产的解脂耶氏酵母菌株Y4036U和Y4036U(δPex16)
为了评估Pex16基因敲除对总脂质级分中的PUFA百分比和细胞中的总脂质含量的效应,使Y4036U和Y4036U(Δpex16)菌株在可比较的含油条件下生长。更具体地讲,菌株Y4036U-17、Y4036U-19和Y4036U-33具有在染色体随机位点整合的Pex16基因敲除片段,上述菌株被认为是Pex16野生型(即,Y4036U),而菌株RHY25至RHY34是Pex16突变型菌株(即,Y4036U(Δpex16))。每个菌株在起始OD600为~0.1的情况下,在包含90mg/L L-亮氨酸的25mL MM中,置于125mL烧瓶中培养48小时。在50mL圆锥管中,以4300rpm离心5分钟收获细胞。弃去上清液,将细胞重悬于25mL HGM中并转移到新的125mL烧瓶中。细胞在30℃下透气培养另外120小时。
下表13中显示了每个菌株的脂肪酸组成(即,LA(18:2),ALA,EDA和DGLA);脂肪酸组成以占总脂肪酸的重量百分比(wt.%)的形式表示。菌株Y4036U和Y4036U(Δpex16)的平均脂肪酸组成用灰色突出显示并且用“Ave”指示。在MM+L-亮氨酸培养基中无受试菌株提供足够的细胞群,因此不能分析总脂质含量。
表13
解脂耶氏酵母菌株Y4036U和Y4036U(Δpex16)中的脂质组成
表13中的结果显示敲除Y4036U(Δpex16)中的染色体Pex16基因与天然Pex16p未被敲除的菌株Y4036U中的DGLA%TFA相比较,DGLA%TFA提高了大约85%。然而,Y4036U(Δpex16)积聚的LA(18:2)也降低了~40%。
因此,表13中的结果显示与亲本菌株Y4036相比,Y4036(ΔPex16)菌株具有更高的平均DGLA%TFA(43.4%对23.4%)。此外,菌株Y4036U(Δpex16)的DGLA相对于总PUFA的量提高了1.65倍(62.8%的PUFA[%TFA]对38.1%的PUFA[%TFA]),而C20PUFA相对于总PUFA的量提高了1.3倍(71%的PUFA[%TFA]对54.8%的PUFA[%TFA])。
实施例10
产生菌株Y4305以生产占总脂质约53.2%的EPA
解脂耶氏酵母菌株Y4305U具有Ura-表型,它在下文的实施例11中被用作宿主。菌株Y4305(Ura+菌株,它是Y4305U的亲本)来源于解脂耶氏酵母ATCC#20362,该菌株能通过Δ9延伸酶/Δ8去饱和酶途径的表达生产相对于总脂质而言约53.2%的EPA。
菌株Y4305U的产生需要构建菌株Y2224、菌株Y4001、菌株Y4001U、菌株Y4036、菌株Y4036U、菌株Y4070和菌株Y4086(如上文实施例1所述)。进一步开发菌株Y4305U需要构建菌株Y4086U1、菌株Y4128和菌株Y4128U3(如上文实施例2所述)。随后产生菌株Y4305U(如图10所示)需要构建菌株Y4217(生产42%的EPA)、菌株Y4217U2(Ura-)、菌株Y4259(生产46.5%的EPA)、菌株Y4259U2(Ura-)和菌株Y4305(生产53.2%的EPA)。
虽然本文未详细阐述关于转化和选择EPA生产菌株(在菌株Y4128U3后开发的菌株)的细节,但是在实施例1和2中描述了分离菌株Y4217、菌株Y4217U2、菌株Y4259、菌株Y4259U2、菌株Y4305和菌株Y4305U的方法。
简而言之,生成构建体pZKL2-5U89GC(图8B;SEQ ID NO:55;描述于表24,国际申请公布WO 2008/073367)以将一个Δ9延伸酶基因(即,EgD9eS)、一个Δ8去饱和酶基因(即,EgD8M)、一个Δ5去饱和酶基因(即,EgD5S)、和一个解脂耶氏酵母二酰基甘油胆碱磷酸转移酶(CPT1)基因整合进菌株Y4128U3的Lip2位点(GenBank保藏号AJ012632)中,从而产生较高含量的EPA。六个分别命名为Y4215、Y4216、Y4217、Y4218、Y4219和Y4220的菌株分别生产占总脂质约41.1%、41.8%、41.7%、41.1%、41%和41.1%的EPA。
菌株Y4217U1和Y4217U2经由构建体pZKUE3S破坏菌株Y4217中的Ura3基因进行制备(图5A SEQ ID NO:31;描述于表22,国际申请公布WO 2008/073367),它包含定向于Ura3基因的嵌合EXP1::ME3S::Pex20基因。利用构建体pZKL1-2SP98C(图8A SEQ IDNO:54;描述于表23,国际申请公布WO 2008/073367)将一个Δ9延伸酶基因(即,EgD9eS)、一个Δ8去饱和酶基因(即,EgD8M)、一个Δ12去饱和酶基因(即,FmD12S)、和一个解脂耶氏酵母CPT1基因整合进菌株Y4217U2的Lip1位点(GenBank保藏号Z50020)中,从而产生分离菌株Y4259、Y4260、Y4261、Y4262、Y4263和Y4264,分别生产占总脂质约46.5%、44.5%、44.5%、44.8%、44.5%和44.3%的EPA。
然后经由用构建体pZKUM进行转化制备Ura-衍生物(即,菌株Y4259U2)(图11A;SEQ ID NO:70;描述于表33,国际申请公布WO2008/073367),该构建体将Ura3突变基因整合进菌株Y4259的Ura3基因中,从而分别产生分离的菌株Y4259U1、Y4259U2和Y4259U3(统称为Y4259U)(分别生产占总脂质31.4%、31%和31.3%的EPA)。
最后,生成构建体pZKD2-5U89A2(图11B;SEQ ID NO:71)以将一个Δ9延伸酶基因、一个Δ5去饱和酶基因、一个Δ8去饱和酶基因、和一个Δ12去饱和酶基因整合进菌株Y4259U2的二酰基甘油酰基转移酶(DGAT2)位点中,从而使EPA产量提高。pZKD2-5U89A2质粒含有如下组分:
表14
描述质粒pZKD2-5U89A2(SEQ ID NO:71)
SEQ ID NO:71中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
AscI/BsiWI(1-736) | 耶氏酵母属DGAT2基因的728bp的5′部分(SEQ IDNO:72)(图中标记为“YLDGAT5”;美国专利7,267,976) |
PacI/SphI(4164-3444) | 耶氏酵母属DGAT2基因的714bp的3′部分(SEQ IDNO:72)(图中标记为“YLDGAT3”;美国专利7,267,976) |
SEQ ID NO:71中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
SwaI/BsiWI(13377-1) | YAT1::FmD12S::Lip2,它包含:●YAT1:解脂耶氏酵母YAT1启动子(在图中标记为“YAT”;专利申请公布US 2006/0094102-A1);●FmD12S:经密码子优化的Δ12延伸酶(SEQ IDNO:74),来源于串珠镰刀菌(图中标记为“F.D12S”;国际申请公布WO 2005/047485);●Lip2:来自耶氏酵母属Lip2基因(GenBank保藏号AJ012632)的Lip2终止子序列 |
PmeI/SwaI(10740-13377) | FBAIN::EgD8M::Lip1,它包含:●FBAIN:解脂耶氏酵母FBAIN启动子(美国专利7,202,356);●EgD8M:合成突变型Δ8去饱和酶(SEQ ID NO:76;专利申请公布US 2008-0138868A1),来源于小眼虫(“EgD8S”;美国专利7,256,033);●Lip1:来自耶氏酵母属Lip1基因(GenBank保藏号Z50020)的Lip1终止子序列 |
ClaI/PmeI(8846-10740) | YAT1::E389D9eS::OCT,它包含:●YAT1:解脂耶氏酵母YAT1启动子(在图中标记为“YAT”;专利申请公布US 2006/0094102-A1);●E389D9eS:经密码子优化的Δ9延伸酶(SEQ IDNO:78),来源于小型绿藻属CCMP389(图中标记为“D9ES-389”;国际申请公布WO 2007/061742);●OCT:耶氏酵母属OCT基因(GenBank保藏号X69988)的OCT终止子序列 |
ClaI/EcoRI(8846-6777) | 耶氏酵母属Ura3基因(GenBank保藏号AJ306421) |
SEQ ID NO:71中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
EcoRI/PacI(6777-4164) | EXP1::EgD5S::ACO,它包含:●EXP1:解脂耶氏酵母输出蛋白(EXP1)启动子(在图中标记为“Exp”;国际申请公布WO 2006/052870)。●EgD5S:经密码子优化的Δ5去饱和酶(SEQ IDNO:80),来源于小眼虫(专利申请公布US2007-0292924-A1);●Aco:耶氏酵母属Aco基因(GenBank保藏号AJ001300)的Aco终止子序列 |
将pZKD2-5U89A2质粒用AscI/SphI消化,然后根据“一般方法”将其用于转化菌株Y4259U2。将转化细胞铺在MM平板上并在30℃下维持3至4天。将单菌落再划线接种到MM平板上,所得菌落用于接种液体MM。液体培养物以250rpm/min,在30℃下摇动2天。通过离心收集细胞,将其重悬浮于HGM中,然后以250rpm/min摇动5天。通过离心收集细胞,并提取脂质。通过酯交换制备FAME并随后用Hewlett-Packard 6890GC进行分析。
GC分析显示大多数选择的96个菌株生产占总脂质40-46%的EPA。命名为Y4305、Y4306、Y4307和Y4308的四个菌株分别生产占总脂质约53.2%、46.4%、46.8%和47.8%的EPA。Y4305的全部脂质分布如下:16:0(2.8%)、16:1(0.7%)、18:0(1.3%)、18:1(4.9%)、18:2(17.6%)、ALA(2.3%)、EDA(3.4%)、DGLA(2.0%)、ARA(0.6%)、ETA(1.7%)和EPA(53.2%)。总脂质%干细胞重量是27.5。
相对于野生型解脂耶氏酵母ATCC#20362的菌株Y4305的最终基因型是SCP2-(YALI0E01298g)、YALI0C18711g-、Pex10-、YALI0F24167g-,unknown 1-、unknown 3-、unknown 8-、GPD::FmD12::Pex20、YAT1::FmD12::OCT、GPM/FBAIN::FmD12S::OCT、EXP1::FmD12S::Aco、YAT1::FmD12S::Lip2、YAT1::ME3S::Pex16、EXP1::ME3S::Pex20(3copies)、GPAT::EgD9e::Lip2、EXP1::EgD9eS::Lip1、FBAINm::EgD9eS::Lip2、FBA::EgD9eS::Pex20、GPD::EgD9eS::Lip2、YAT1::EgD9eS::Lip2、YAT1::E389D9eS::OCT、FBAINm::EgD8M::Pex20、FBAIN::EgD8M::Lip1(2个拷贝)、EXP1::EgD8M::Pex16、GPDIN::EgD8M::Lip1、YAT1::EgD8M::Aco、FBAIN::EgD5::Aco、EXP1::EgD5S::Pex20、YAT1::EgD5S::Aco、EXP1::EgD5S::ACO、YAT1::RD5S::OCT、YAT1::PaD17S::Lip1、EXP1::PaD17::Pex16、FBAINm::PaD17::Aco、YAT1::Y1CPT1::ACO、GPD::YlCPT1::ACO。
为了破坏菌株Y4305中的Ura3基因,使用构建体pZKUM(图11A;SEQ ID NO:70;描述于表33,国际申请公布WO 2008/073367)以将Ura3突变基因整合进菌株Y4305的Ura3基因中。挑取在MM+5-FOA平板上生长的总共8个转化体,并且再次分别划线接种到MM平板和MM+5-FOA平板上。所有8个菌株具有Ura-表型(即,细胞能在MM+5-FOA平板上生长,但是不能在MM平板上生长)。从MM+5-FOA平板上刮除细胞并提取脂质。通过酯交换制备FAME并随后用Hewlett-Packard6890GC进行分析。
GC分析显示在MM+5-FOA平板上生长的pZKUM转化体#1、#6和#7中存在占总脂质37.6%,37.3%和36.5%的EPA。这三个菌株分别命名为菌株Y4305U1、Y4305U2和Y4305U3(统称为Y4305U)。为清楚起见,在实施例11中菌株Y4305U被称为菌株Y4305U(Δpex10)。
实施例11
在解脂耶氏酵母菌株Y4305U(Δpex10)中的Pex16染色体缺失进一步
提高了积聚的EPA百分比(Pex10-Pex16双敲除)
本实施例描述使用构建体pYRH13(图9B;SEQ ID NO:59)敲除在耶氏酵母属菌株Y4305U中的染色体Pex16(Δpex10)(实施例10),从而导致生成Pex10-Pex16双突变体。测定并比较Pex10-Pex16双敲除对总油量和EPA含量的效应。具体地讲,在菌株Y4305U(Δpex10)(Δpex16)中的Pex10-Pex16双突变体的效应导致细胞中EPA的量与单突变体相比(即,菌株Y4305U(Δpex10))有所提高(EPA%TFA和EPA%DCW)。
生成解脂耶氏酵母基因敲除菌株Y4305U(Δpex10)(Δpex16)
采用Pex16敲除构建体pYRH13(实施例9;SEQ ID NO:59)的纯化6.0kB AscI/SphI片段,使用标准规程来转化解脂耶氏酵母菌株Y4305U(Δpex10)(实施例10)。如实施例9所述,通过菌落PCR筛选并鉴定具有Pex16缺失的细胞。
在93个筛选的菌落中,88个菌落具有在染色体随机位点整合的Pex16敲除片段,因此不是Δpex16突变体(然而,由于存在pYRH13,细胞能够在ura-平板上生长)。这些随机整合的菌株中的两个,命名为Y4305U-22和Y4305U-25,用作脂质生产实验中的对照物(下文)。
筛选的剩余5个菌落(即,总计93个菌落中的剩余菌落)包含Pex16基因敲除。将在Y4305U菌株中的这五个Δpex16突变体命名为RHY20、RHY21、RHY22、RHY23和RHY24。如实施例9所述,通过如实施例9所述的定量实时PCR进行对YlPex16基因敲除的进一步确认。
评估用于EPA生产的解脂耶氏酵母菌株Y4305U(ΔPex10)和Y4305U
(ΔPex10)(Δpex16)
为了评估多个Pex基因突变对总脂质级分中的PUFA百分比和细胞中的总脂质含量的效应,将Y4305U(Δpex10)和Y4305U(Δpex10)(Δpex16)菌株在可比较的含油条件下生长。更具体地讲,菌株Y4305U-22和Y4305U-25具有在染色体随机位点整合的Pex16基因敲除片段,它们被认为是Pex16野生型,Pex10基因敲除型(即,Y4305U(Δpex10))。菌株RHY22、RHY23和RHY24是双敲除突变型菌株(即,Y4305U(Δpex10)(Δpex16))。在可比较的含油条件下平行培养每个菌株。
具体地讲,在25mL合成右旋糖培养基(SD)中在起始OD600为~0.1的情况下,在125mL烧瓶中培养48小时。在50mL圆锥管中,以4300rpm离心5分钟收获细胞。弃去上清液,将细胞重悬于25mL HGM中并转移到新的125mL烧瓶中。细胞在30℃下透气培养另外120小时。
为了测定干细胞重量(DCW),对来自5mL HGM生长培养物的细胞进行处理。培养的细胞在4300rpm下离心5分钟。使用10mL无菌水重悬沉淀物并在相同条件下二次离心。然后用1mL无菌H2O(第三次)重悬沉淀物,将其转移到预先称重的铝盘中。细胞悬浮液在80℃真空炉中干燥过夜。测量细胞重量。
为了测定总脂质含量,1mL的HGM培养细胞通过在13,000rpm下离心1分钟进行收集,提取总脂质并通过酯交换制备FAME,随后用Hewlett-Packard 6890GC(一般方法)进行分析。
表15显示了每个菌株的脂肪酸组成(即,16:0(棕榈酸))、16:1(棕榈油酸)、18:0、18:1(油酸)、18:2(LA)、18:3(ALA)、EDA、DGLA、ARA、ETrA、ETA和EPA)(以总脂肪酸(TFA)的重量百分比(wt.%)形式表示),以及DCW(g/L)和总脂质含量(TFA%DCW)。菌株Y4305U(Δpex10)和Y4305U(Δpex10)(Δpex16)的平均脂肪酸组成用灰色突出显示并用“Ave”指示。
表15中的结果显示敲除Y4305U(Δpex10)(Δpex16)中的染色体Pex16基因与天然Pex16p未被敲除的菌株Y4305U(Δpex10)中的EPA%TFA相比较,EPA%TFA提高了大约8%。此外,与单突变菌株相比,EPA%DCW也在双突变体中提高,而TFA%DCW保持不变。
因此,表15中的结果显示与对照Y4305(ΔPex10)菌株相比,Y4305(ΔPex10,ΔPex16)菌株平均具有更高的EPA%TFA(48.3%对44.7%)和更高的EPA%DCW(14.57%对13.23%)。菌株Y4305(ΔPex10,ΔPex16)相对于菌株Y4305(ΔPex10)的EPA相对于总PUFA的量(61%的PUFA[%TFA]对58.3%的PUFA[%TFA])仅提高了1.05倍,而C20PUFA相对于总PUFA的量的提高二者基本上相同(73%的PUFA[%TFA]对72%的PUFA[%TFA])。
实施例12
在解脂耶氏酵母菌株Y4036U中的Pex3染色体缺失提高了积聚的DGLA
百分比
本实施例描述了使用构建体pY157(图12B;SEQ ID NO:82)敲除在Ura-、DGLA-生产耶氏酵母属菌株Y4036U(实施例1)中的染色体Pex3基因(SEQ ID NO:3)。用Pex3敲除构建体转化解脂耶氏酵母菌株Y4036U导致生成菌株Y4036(Dpex3)。测量Pex3敲除对DGLA含量的效应并与对照菌株Y4036(Ura+菌株,它是菌株Y4036U的亲本)比较。具体地讲,敲除Pex3提高DGLA占总脂肪酸的百分比并且与对照相比改善了大约3倍的DGLA%DCW。
构建体pY157
质粒pY87(图12A)包含表达盒以敲除解脂耶氏酵母二酰基甘油酰基转移酶(DGAT2)基因,如下表16所述:
表16
质粒pY87(SEQ ID NO:83)的描述
SEQ ID NO:83中的RE位点和核苷酸 | 片段和嵌合基因组分的描述 |
SphI/PacI(1-721) | 耶氏酵母属DGAT2基因的5′部分(SEQ ID NO:72的碱基1-720)(美国专利公开7,267,976) |
PacI/BglII(721-2459) | LoxP::Ura3::LoxP,它包含:●LoxP序列(SEQ ID NO:84);●耶氏酵母属Ura3基因(GenBank保藏号AJ306421);●LoxP序列(SEQ ID NO:84) |
BglII/AscI(2459-3203) | 耶氏酵母属DGAT2基因的3′部分(SEQ ID NO:72的碱基2468-3202)(美国专利公开7,267,976) |
AscI/SphI(3203-5910) | 载体主链包括:●ColE1质粒复制起点●用于在大肠杆菌中选择的氨苄青霉素抗性基因(AmpR)(4191-5051);●大肠杆菌f1复制起点 |
质粒pY157来源于质粒pY87。具体地讲,解脂耶氏酵母Pex3基因的704bp的5′启动子区域置换pY87的SphI/PacI片段,解脂耶氏酵母Pex3基因的448bp的3′终止子区域置换pY87的BglII/AscI片段以生成pYR157(SEQ ID NO:82;图12B)。
生成解脂耶氏酵母基因敲除菌株Y4036(ΔPex3)
使用标准规程,用Pex3敲除构建体pY157的纯化3648bp的AscI/SphI片段来转化解脂耶氏酵母的菌株Y4036U(实施例1)(同上)。
为了筛选具有Pex3缺失的细胞,使用Taq聚合酶(Invitrogen;Carlsbad,CA)和PCR引物UP 768(SEQ ID NO:85)以及LP 769(SEQID NO:86)进行菌落PCR。设计这组引物用于扩增完整Pex3基因的2039bp的野生型条带,以及当Pex3基因被定向敲除破坏时,3719bp的敲除特异性条带。
更具体地讲,菌落PCR使用MasterAmp Taq试剂盒(EpicentreTechnologies,Madison,WI;目录号82250),并且按照制造商说明书进行,在25μL的反应中包含:2.5μL 10X MasterAmp Taq缓冲液、2.0μL25mM MgCl2、7.5μL 10X MasterAmp Enhancer、2.5μL 2.5mM dNTP(TaKaRa Bio Inc.,Otsu Shiga,Japan)、1.0μL 10μM上游引物、1.0μL10μM下游引物、0.25μL MasterAmp Taq DNA聚合酶和19.75μL水。扩增如下进行:95℃5分钟进行初始变性,随后在95℃30秒进行40个循环的变性,在56℃下退火60秒,在72℃下延伸4分钟。最后在72℃下进行10分钟的最终延伸循环,然后在4℃下终止反应。
筛选的48个菌落中,46个菌落具有2039bp的预期条带,该条带来自野生型(即,未破坏的)Pex3基因,因此不是Δpex3突变体。剩余的2个菌落只显示2039bp的弱条带,说明它们是在背景中存在一些污染的非转化细胞的Δpex3突变体。通过划线接种2个推定敲除菌落到选择性平板上分离单菌落,这一点得到了证实。然后从每个推定基因敲除菌株的3个单菌落中来分离基因组DNA并用相同引物对来进行筛选。即,UP 768和LP 769(SEQ ID NO:85和86)。该方法被认为是比菌落PCR更灵敏的方法。来自主要转化体的所有三个单菌落都缺少2039bp的野生型条带,相反地具有3719bp的基因敲除特异性条带。将在Y4036U菌株中的Δpex3突变体命名为L134和L135。
评估用于DGLA生产的解脂耶氏酵母菌株YY4036和Y4036(ΔPex3)
为了评估Pex3基因敲除对总脂质级分中的PUFA百分比和细胞中的总脂质含量的效应,Y4036和Y4036(Δpex3)菌株6在可比较的含油条件下生长。将菌株Y4036、L134(即,Y4036(Δpex3))和L135(即,Y4036(Δpex3))接种到25mL的CSM-Ura并在振荡器中在30℃下培养过夜。将预培养物等分到新的25mL CSM-Ura烧瓶中,最终OD600为0.4。培养物在30℃下在振荡器中生长。在48小时后,离心细胞(它几乎不生长)并重悬在新的25mL CSM-Ura中并继续生长72小时。将细胞离心、重悬于25mL HGM中,并继续如上文所述生长72小时。离心收获细胞,用蒸馏水洗涤一次并重悬于25mL水中,终体积为20.5mL。使用等分试样(1.5mL)测量脂质含量,提取总脂质,通过酯交换制备FAME并通过Hewlett-Packard 6890GC进行分析(一般方法)。如实施例11所述干燥剩余的等分试样以测量干细胞重量(DCW)。
表17显示了每个菌株的脂肪酸组成(即,16:0(棕榈酸)、16:1(棕榈油酸)、18:0、18:1(油酸)、18:2(LA)、EDA和DGLA)(以占总脂肪酸(TFA)的重量百分比(wt.%)形式表示),以及总脂质含量(TFA%DCW)。转化效率(“CE”)根据下面的公式测量:([产物]/[底物+产物])*100,其中‘产物’包括中间产物和该途径中来源于它的所有产物。因此,Δ12去饱和酶转化效率(Δ12%CE)计算如下:([LA+EDA+DGLA]/[18:1+LA+EDA+DGLA])*100;Δ9延伸酶转化效率(Δ9elo%CE)计算如下:([EDA+DGLA]/[LA+EDA+DGLA])*100;并且Δ8去饱和酶转化效率(Δ8%CE)计算如下:([DGLA]/[EDA+DGLA])*100。菌株Y4036、L134和L135的平均脂肪酸组成用灰色突出显示并用“Ave”表示,而“S.D.”指示标准偏差。正如预期的那样,Δpex3菌株不在用油酸盐作唯一碳源的平板上生长。
表17中的结果显示敲除Y4036(Δpex3)中的染色体Pex3基因与天然Pex3p未被敲除的菌株Y4036中的DGLA%TFA相比较,DGLA%TFA提高了大约142%。具体地讲,Pex3敲除提高了DGLA含量,从在Y4036中的大约19%提高到在Y4036(Δpex3)菌株L134和L135中的46%。此外,Δ9延伸酶转化效率百分比从Y4036中的大约48%提高到Y4036(Δpex3)菌株L134和L135中的83%;并且菌株L134和L135中的TFA%DCW从4.7%提高到6%。LA%TFA从30%下降到12%。Pex3缺失实际上提高了脂肪酸的通量并从而提高了用于Δ9延伸的底物可利用性。
因此,表17中的结果显示与亲本菌株Y4036相比,Y4036(ΔPex3)菌株具有平均更高的脂质含量(TFA%DCW)(大约6.0%对4.7%),更高的DGLA%TFA(46%对19%),以及更高的DGLA%DCW(大约2.8%对0.9%)。此外,菌株Y4036(ΔPex3)的DGLA相对于总PUFA的量提高了2倍(67.7%的PUFA[%TFA]对33.3%的PUFA[%TFA]),而C20PUFA相对于总PUFA的量提高了1.7倍(82%的PUFA[%TFA]对47%的PUFA[%TFA])。
预测改善的DGLA产量也将导致在经基因工程化用于EPA生产的解脂耶氏酵母菌株中的EPA产量的改善(例如实施例10所述并从中衍生出的解脂耶氏酵母菌株Y4305U)。
序列表
<110>E.I.du Pont de Nemours & Co.,Inc.
Zhu,Quinn
Xue,Zhixiong
<120)破坏过氧化物酶体生物合成因子蛋白(PEX)以改变含油真核生物中多不饱和脂肪酸和总脂质含量
<130>CL3847
<160>89
<170>PatentIn版本3.4
<210>1
<211>1024
<212>PRT
<213>解脂耶氏酵母CLIBl22(GenBank保藏号CAG82178)
<220>
<221>MISC_FEATURE
<222>(1)..(1024)
<223>YlPex1p
<400>1
Met Thr Ser Lys Ser Asp Tyr Ser Gly Lys Asp Lys Ile Glu Leu Asp
1 5 10 15
Pro Val Phe Ala Lys Ser Ile Asp Leu Leu Pro Asn Thr Gln Val Val
20 25 30
Ile Asp Ile Gln Leu Asn Pro Lys Ile Ala His Thr Ile His Leu Glu
35 40 45
Pro Val Thr Val Ala Asp Trp Glu Ile Val Glu Leu His Ala Ala Tyr
50 55 60
Leu Glu Ser Arg Met Ile Asn Gln Val Arg Ala Val Ser Pro Asn Gln
65 70 75 80
Pro Val Thr Val Tyr Pro Ser Ser Thr Thr Ser Ala Thr Leu Lys Val
85 90 95
Ile Arg Ile Glu Pro Asp Leu Gly Ala Ala Gly Phe Ala Lys Leu Ser
100 105 110
Pro Asp Ser Glu Val Val Val Ala Pro Lys Gln Arg Lys Lys Glu Glu
115 120 125
Lys Gln Val Lys Lys Arg Ser Gly Ser Ala Arg Ser Thr Gly Ser Gln
130 135 140
Lys Arg Lys Gly Gly Arg Gly Pro His Ala Leu Arg Arg Ala Ile Ser
145 150 155 160
Glu Asp Phe Asp Gly His Leu Arg Leu Glu Val Ser Leu Asp Val Ser
165 170 175
Gln Leu Pro Pro Glu Phe His Gln Leu Lys Asn Val Ser Ile Lys Val
180 185 190
Ile Thr Pro Pro Asn Leu Ala Ser Pro Gln Gln Ala Ala Ser Ile Ala
195 200 205
Val Glu Glu Lys Ser Glu Glu Ser Leu Ser Gln Asn Lys Pro Pro Ser
210 215 220
Ser Glu Pro Lys Val Glu Val Pro Pro Asp Ile Ile Asn Pro Ala Ser
225 230 235 240
Glu Ile Val Ala Thr Leu Val Asn Asp Thr Thr Ser Pro Thr Gly His
245 250 255
Ala Lys Leu Ser Tyr Ala Leu Ala Asp Ala Leu Gly Ile Pro Ser Ser
260 265 270
Val Gly His Val Ile Arg Phe Glu Ser Ala Ser Lys Pro Leu Ser Gln
275 280 285
Lys Pro Gly Ala Leu Val Ile His Arg Phe Ile Thr Lys Thr Val Gly
290 295 300
Ala Ala Glu Gln Lys Ser Leu Arg Leu Lys Gly Glu Lys Asn Ala Asp
305 310 315 320
Asp Gly Val Ser Ala Asp Asp Gln Phe Ser Leu Leu Glu Glu Leu Lys
325 330 335
Lys Leu Gln Mer Leu Glu Gly Pro Ile Thr Asn Phe Gln Arg Leu Pro
340 345 350
Pro Ile Pro Glu Leu Leu Pro Leu Gly Gly Val Ile Gly Leu Gln Asn
355 360 365
Ser Glu Gly Trp Ile Gln Gly Gly Tyr Leu Gly Glu Glu Pro Ile Pro
370 375 380
Phe Val Ser Gly Ser Glu Ile Leu Arg Ser Glu Ser Ser Leu Ser Pro
385 390 395 400
Ser Asn Ile Glu Ser Glu Asp Lys Arg Val Val Gly Leu Asp Asn Met
405 410 415
Leu Asn Lys Ile Asn Glu Val Leu Ser Arg Asp Ser Ile Gly Cys Leu
420 425 430
Val Tyr Gly Ser Arg Gly Ser Gly Lys Ser Ala Val Leu Asn His Ile
435 440 445
Lys Lys Glu Cys Lys Val Ser His Thr His Thr Val Ser Ile Ala Cys
450 455 460
Gly Leu Ile Ala Gln Asp Arg Val Gln Ala Val Arg Glu Ile Leu Thr
465 470 475 480
Lys Ala Phe Leu Glu Ala Ser Trp Phe Ser Pro Ser Val Leu Phe Leu
485 490 495
Asp Asp Ile Asp Ala Leu Met Pro Ala Glu Val Glu His Ala Asp Ser
500 505 510
Ser Arg Thr Arg Gln Leu Thr Gln Leu Phe Leu Glu Leu Ala Leu Pro
515 520 525
Ile Met Lys Ser Arg His Val Ser Val Val Ala Ser Ala Gln Ala Lys
530 535 540
Glu Ser Leu His Met Asn Leu Val Thr Gly His Val Phe Glu Glu Leu
545 550 555 560
Phe His Leu Lys Ser Pro Asp Lys Glu Ala Arg Leu Ala Ile Leu Ser
565 570 575
Glu Ala Val Lys Leu Met Asp Gln Asn Val Ser Phe Ser Gln Asn Asp
580 585 590
Val Leu Glu Ile Ala Ser Gln Val Asp Gly Tyr Leu Pro Gly Asp Leu
595 600 605
Trp Thr Leu Ser Glu Arg Ala Gln His Glu Met Ala Leu Arg Gln Ile
610 615 620
Glu Ile Gly Leu Glu Asn Pro Ser Ile Gln Leu Ala Asp Phe Met Lys
625 630 635 640
Ala Leu Glu Asp Phe Val Pro Ser Ser Leu Arg Gly Val Lys Leu Gln
645 650 655
Lys Ser Asn Val Lys Trp Asn Asp Ile Gly Gly Leu Lys Glu Thr Lys
660 665 670
Ala Val Leu Leu Glu Thr Leu Glu Trp Pro Thr Lys Tyr Ala Pro Ile
675 680 685
Phe Ala Ser Cys Pro Leu Arg Leu Arg Ser Gly Leu Leu Leu Tyr Gly
690 695 700
Tyr Pro Gly Cys Gly Lys Thr Tyr Leu Ala Ser Ala Val Ala Ala Gln
705 710 715 720
Cys Gly Leu Asn Phe Ile Ser Ile Lys Gly Pro Glu Ile Leu Asn Lys
725 730 735
Tyr Ile Gly Ala Ser Glu Gln Ser Val Arg Glu Leu Phe Glu Arg Ala
740 745 750
Gln Ala Ala Lys Pro Cys Ile Leu Phe Phe Asp Glu Phe Asp Ser Ile
755 760 765
Ala Pro Lys Arg Gly His Asp Ser Thr Gly Val Thr Asp Arg Val Val
770 775 780
Asn Gln Met Leu Thr Gln Met Asp Gly Ala Glu Gly Leu Asp Gly Val
785 790 795 800
Tyr Val Leu Ala Ala Thr Ser Arg Pro Asp Leu Ile Asp Pro Ala Leu
805 810 815
Leu Arg Pro Gly Arg Leu Asp Lys Met Leu Ile Cys Asp Leu Pro Ser
820 825 830
Tyr Glu Asp Arg Leu Asp Ile Leu Arg Ala Ile Val Asp Gly Lys Met
835 840 845
His Leu Asp Gly Glu Val Glu Leu Glu Tyr Val Ala Ser Arg Thr Asp
850 855 860
Gly Phe Ser Gly Ala Asp Leu Gln Ala Val Met Phe Asn Ala Tyr Leu
865 870 875 880
Glu Ala Ile His Glu Val Val Asp Val Ala Asp Asp Thr Ala Ala Asp
885 890 895
Thr Pro Ala Leu Glu Asp Lys Arg Leu Glu Phe Phe Gln Thr Thr Leu
900 905 910
Gly Asp Ala Lys Lys Asp Pro Ala Ala Val Gln Asn Glu Val Met Asn
915 920 925
Ala Arg Ala Ala Val Ala Glu Lys Ala Arg Val Thr Ala Lys Leu Glu
930 935 940
Ala Leu Phe Lys Gly Met Ser Val Gly Val Asp Asn Asp Asp Asp Lys
945 950 955 960
Pro Arg Lys Lys Ala Val Val Val Ile Lys Pro Gln His Met Asn Lys
965 970 975
Ser Leu Asp Glu Thr Ser Pro Ser Ile Ser Lys Lys Glu Leu Leu Lys
980 985 990
Leu Lys Gly Ile Tyr Ser Gln Phe Val Ser Gly Arg Ser Gly Asp Met
995 1000 1005
Pro Pro Gly Thr Ala Ser Thr Asp ValGly Gly Arg Ala Thr Leu
1010 1015 1020
Ala
<210>2
<211>381
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG77647)
<220>
<221>MISC_FEATURE
<222>(1)..(381)
<223>YlPex2p
<400>2
Met Ser Ser Val Leu Arg Leu Phe Lys Ile Gly Ala Pro Val Pro Asn
1 5 10 15
Val Arg Val His Gln Leu Asp Ala Ser Leu Leu Asp Ala Glu Leu Val
20 25 30
Asp Leu Leu Lys Asn Gln Leu Phe Lys Gly Phe Thr Asn Phe His Pro
35 40 45
Glu Phe Arg Asp Lys Tyr Glu Ser Glu Leu Val Leu Ala Leu Lys Leu
50 55 60
Ile Leu Phe Lys Leu Thr Val Trp Asp His Ala Ile Thr Tyr Gly Gly
65 70 75 80
Lys Leu Gln Asn Leu Lys Phe Ile Asp Ser Arg His Ser Ser Lys Leu
85 90 95
Gln Ile Gln Pro Ser Val Ile Gln Lys Leu Gly Tyr Gly Ile Leu Val
100 105 110
Val Gly Gly Gly Tyr Leu Trp Ser Lys Ile Glu Gly Tyr Leu Leu Ala
115 120 125
Arg Ser Glu Asp Asp Val Ala Thr Asp Gly Thr Ser Val Arg Gly Ala
130 135 140
Ser Ala Ala Arg Gly Ala Leu Lys Val Ala Asn Phe Ala Ser Leu Leu
145 150 155 160
Tyr Ser Ala Ala Thr Leu Gly Asn Phe Val Ala Phe Leu Tyr Thr Gly
165 170 175
Arg Tyr Ala Thr Val Ile Met Arg Leu Leu Arg Ile Arg Leu Val Pro
180 185 190
Ser Gln Arg Thr Ser Ser Arg Gln Val Ser Tyr Glu Phe Gln Asn Arg
195 200 205
Gln Leu Val Trp Asn Ala Phe Thr Glu Phe Leu Ile Phe Ile Leu Pro
210 215 220
Leu Leu Gln Leu Pro Lys Leu Lys Arg Arg Ile Glu Arg Lys Leu Gln
225 230 235 240
Ser Leu Asn Val Thr Arg Val Gly Asn Val Glu Glu Ala Ser Glu Gly
245 250 255
Glu Leu Ala His Leu Pro Gln Lys Thr Cys Ala Ile Cys Phe Arg Asp
260 265 270
Glu Glu Glu Gln Glu Gly Gly Gly Gly Ala Ser His Tyr Ser Thr Asp
275 280 285
Val Thr Asn Pro Tyr Gln Ala Asp Cys Gly His Val Tyr Cys Tyr Val
290 295 300
Cys Leu Val Thr Lys Leu Ala Gln Gly Asp Gly Asp Gly Trp Asn Cys
305 310 315 320
Tyr Arg Cys Ala Lys Gln ValGln Lys Met Lys Pro Trp Val Asp Val
325 330 335
Asp Glu Ala Ala Val Val Gly Ala Ala Glu Met His Glu Lys Val Asp
340 345 350
Val Ile Glu His Ala Glu Asp Asn Glu Gln Glu Glu Glu Glu Phe Asp
355 360 365
Asp Asp Asp Glu Asp Ser Asn Phe Gln Leu Met Lys Asp
370 375 380
<210>3
<211>431
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG78565)
<220>
<221>MISC_FEATURE
<222>(1)..(431)
<223>YlPex3p
<400>3
Met Asp Phe Phe Arg Arg His Gln Lys Lys Val Leu Ala Leu Val Gly
1 5 10 15
Val Ala Leu Ser Ser Tyr Leu Phe Ile Asp Tyr Val Lys Lys Lys Phe
20 25 30
Phe Glu Ile Gln Gly Arg Leu Ser Ser Glu Arg Thr Ala Lys Gln Asn
35 40 45
Leu Arg Arg Arg Phe Glu Gln Asn Gln Gln Asp Ala Asp Phe Thr Ile
50 55 60
Met Ala Leu Leu Ser Ser Leu Thr Thr Pro Val Met Glu Arg Tyr Pro
65 70 75 80
Val Asp Gln Ile Lys Ala Glu Leu Gln Ser Lys Arg Arg Pro Thr Asp
85 90 95
Arg Val Leu Ala Leu Glu Ser Ser Thr Ser Ser Ser Ala Thr Ala Gln
100 105 110
Thr Val Pro Thr Met Thr Ser Gly Ala Thr Glu Glu Gly Glu Lys Ser
115 120 125
Lys Thr Gln Leu Trp Gln Asp Leu Lys Arg Thr Thr Ile Ser Arg Ala
130 135 140
Phe Ser Leu Val Tyr Ala Asp Ala Leu Leu Ile Phe Phe Thr Arg Leu
145 150 155 160
Gln Leu Asn Ile Leu Gly Arg Arg Asn Tyr Val Asn Ser Val Val Ala
165 170 175
Leu Ala Gln Gln Gly Arg Glu Gly Asn Ala Glu Gly Arg Val Ala Pro
180 185 190
Ser Phe Gly Asp Leu Ala Asp Met Gly Tyr Phe Gly Asp Leu Ser Gly
195 200 205
Ser Ser Ser Phe Gly Glu Thr Ile Val Asp Pro Asp Leu Asp Glu Gln
210 215 220
Tyr Leu Thr Phe Ser Trp Trp Leu Leu Asn Glu Gly Trp Val Ser Leu
225 230 235 240
Ser Glu Arg Val Glu Glu Ala Val Arg Arg Val Trp Asp Pro Val Ser
245 250 255
Pro Lys Ala Glu Leu Gly Phe Asp Glu Leu Ser Glu Leu Ile Gly Arg
260 265 270
Thr Gln Met Leu Ile Asp Arg Pro Leu Asn Pro Ser Ser Pro Leu Asn
275 280 285
Phe Leu Ser Gln Leu Leu Pro Pro Arg Glu Gln Glu Glu Tyr Val Leu
290 295 300
Ala Gln Asn Pro Ser Asp Thr Ala Ala Pro Ile Val Gly Pro Thr Leu
305 310 315 320
Arg Arg Leu Leu Asp Glu Thr Ala Asp Phe Ile Glu Ser Pro Asn Ala
325 330 335
Ala Glu Val Ile Glu Arg Leu Val His Ser Gly Leu Ser Val Phe Met
340 345 350
Asp Lys Leu Ala Val Thr Phe Gly Ala Thr Pro Ala Asp Ser Gly Ser
355 360 365
Pro Tyr Pro Val Val Leu Pro Thr Ala Lys Val Lys Leu Pro Ser Ile
70 375 380
Leu Ala Asn Met Ala Arg Gln Ala Gly Gly Met Ala Gln Gly Ser Pro
385 390 395 400
Gly Val Glu Asn Glu Tyr Ile Asp Val Met Asn Gln Val Gln Glu Leu
405 410 415
Thr Ser Phe Ser Ala Val Val Tyr Ser Ser Phe Asp Trp Ala Leu
420 425 430
<210>4
<211>395
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG83356)
<220>
<221>MISC_FEATURE
<222>(1)..(395)
<223>YlPex3Bp
<400>4
Met Leu Gln Ser Leu Asn Arg Asn Lys Lys Arg Leu Ala Val Ser Thr
1 5 10 15
Gly Leu Ile Ala Val Ala Tyr Val Val Ile Ser Tyr Thr Thr Lys Arg
20 25 30
Leu Ile Glu Lys Gln Glu Gln Lys Leu Glu Glu Glu Arg Ala Lys Glu
35 40 45
Arg Leu Lys Gln Leu Phe Ala Gln Thr Gln Asn Glu Ala Ala Phe His
50 55 60
Thr Ala Ser Val Leu Pro Gln Leu Cys Glu Gln Ile Met Glu Phe Val
65 70 75 80
Ala Val Glu Lys Ile Ala Glu Gln Leu Gln Asn Met Arg Ala Glu Lys
85 90 95
Arg Lys Lys Gln Asn Met Asp Asp Asp Lys His Ser Val Leu Ser Leu
100 105 110
Gly Thr Glu Thr Thr Ala Ser Met Ala Asp Gly Gln Lys Met Ser Lys
115 120 125
Ile Gln Leu Trp Asp Glu Leu Lys Ile Glu Ser Leu Thr Arg Ile Val
130 135 140
Thr Leu Ile Tyr Cys Val Ser Leu Leu Asn Tyr Leu Ile Arg Leu Gln
145 150 155 160
Thr Asn Ile Val Gly Arg Lys Arg Tyr Gln Asn Glu Ala Gly Pro Ala
165 170 175
Gly Ala Thr Tyr Asp Met Ser Leu Glu Gln Cys Tyr Thr Trp Leu Leu
180 185 190
Thr Arg Gly Trp Lys Ser Val Val Asp Asn Val Arg Arg Ser Val Gln
195 200 205
Gln Val Phe Thr Gly Val Asn Pro Arg Gln Asn Leu Ser Leu Asp Glu
210 215 220
Phe Ala Thr Leu Leu Lys Arg Val Gln Thr Leu Val Asn Ser Pro Pro
225 230 235 240
Tyr Ser Thr Thr Pro Asn Thr Phe Leu Thr Ser Leu Leu Pro Pro Arg
245 250 255
Glu Leu Glu Gln Leu Arg Leu Glu Lys Glu Lys Gln Ser Leu Ser Pro
260 265 270
Asn Tyr Thr Tyr Gly Ser Pro Leu Lys Asp Leu Val Phe Glu Ser Ala
275 280 285
Gln His Ile Gln Ser Pro Gln Gly Met Ser Ser Phe Arg Ala Ile Ile
290 295 300
Asp Gln Ser Phe Lys Val Phe Leu Glu Lys Val Asn Glu Ser Gln Tyr
305 310 315 320
Val Asn Pro Pro Ser Thr Gly Gly Lys Arg Ile Ala Val Gly Ala Leu
325 330 335
Gln Pro Pro Ile Ile Ser Gly Gly Pro Lys Lys Val Lys Leu Ala Ser
340 345 350
Leu Leu Ser Val Ala Thr Arg Gln Ser Ser Val Ile Ser His Ala Gln
355 360 365
Pro Asn Pro Tyr Val Asp Ala Ile Asn Ser Val Ala Glu Tyr Asn Gly
370 375 380
Leu Cys Ala Val Ile Tyr Ser Ser Phe Glu Gln
385 390 395
<210>5
<211>153
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG79130)
<220>
<221>MISC_FEATURE
<222>(1)..(153)
<223>YlPex4p
<400>5
Met Ala Ser Gln Lys Arg Leu Ile Lys Glu Leu Ala Ala Tyr Lys Lys
1 5 10 15
Asp Pro Asn Pro Cys Leu Ala Ser Leu Thr Ala Asp Gly Asp Ser Leu
20 25 30
Tyr Lys Trp Thr Ala Val Met Arg Gly Thr Glu Gly Thr Ala Tyr Glu
35 40 45
Asn Gly Leu Trp Gln Val Glu Ile Asn Ile Pro Glu Asn Tyr Pro Leu
50 55 60
Gln Pro Pro Thr Met Phe Phe Arg Thr Lys Ile Cys His Pro Asn Ile
65 70 75 80
His Phe Glu Thr Gly Glu Val Cys Ile Asp Val Leu Lys Thr Gln Trp
85 90 95
Ser Pro Ala Trp Thr Ile Ser Ser Ala Cys Thr Ala Val Ser Ala Met
100 105 110
Leu Ser Leu Pro Glu Pro Asp Ser Pro Leu Asn Ile Asp Ala Ala Asn
115 120 125
Leu Val Arg Cys Gly Asp GluSer Ala Met Glu Gly Leu Val Arg Tyr
130 135 140
Tyr Val Asn Lys Tyr Ala Ser Gly Asn
145 150
<210>6
<211>598
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG78803)
<220>
<221>MISC_FEATURE
<222>(1)..(598)
<223>YlPex5p
<400>6
Met Ser Phe Met Arg Gly Gly Ser Glu Cys Ser Thr Gly Arg Asn Pro
1 5 10 15
Leu Ser Gln Phe Thr Lys His Thr Ala Glu Asp Arg Ser Leu Gln His
20 25 30
Asp Arg Val Ala Gly Pro Ser Gly Gly Arg Val Gly Gly Met Arg Ser
35 40 45
Asn Thr Gly Glu Met Ser Gln Gln Asp Arg Glu Met Met Ala Arg Phe
50 55 60
Gly Ala Ala Gly Pro Glu Gln Ser Ser Phe Asn Tyr Glu Gln Met Arg
65 70 75 80
His Glu Leu His Asn Met Gly Ala Gln Gly Gly Gln Ile Pro Gln Val
85 90 95
Pro Ser Gln Gln Gly Ala Ala Asn Gly Gly Gln Trp Ala Arg Asp Phe
100 105 110
Gly Gly Gln Gln Thr Ala Pro Gly Ala Ala Pro Gln Asp Ala Lys Asn
115 120 125
Trp Asn Ala Glu Phe Gln Arg Gly Gly Ser Pro Ala Glu Ala Met Gln
130 135 140
Gln Gln Gly Pro Gly Pro Met Gln Gly Gly Met Gly Met Gly Gly Met
145 150 155 160
Pro Met Tyr Gly Met Ala Arg Pro Met Tyr Ser Gly Met Ser Ala Asn
165 170 175
Met Ala Pro Gln Phe Gln Pro Gln Gln Ala Asn Ala Arg Val Val Glu
180 185 190
Leu Asp Glu Gln Asn Trp Glu Glu Gln Phe Lys Gln Met Asp Ser Ala
195 200 205
Val Gly Lys Gly Lys Glu Val Glu Glu Gln Thr Ala Glu Thr Ala Thr
210 215 220
Ala Thr Glu Thr Val Thr Glu Thr Glu Thr Thr Thr Glu Asp Lys Pro
225 230 235 240
Met Asp Ile Lys Asn Met Asp Phe Glu Asn Ile Trp Lys Asn Leu Gln
245 250 255
Val Asn Val Leu Asp Asn Met Asp Glu Trp Leu Glu Glu Thr Asn Ser
260 265 270
Pro Ala Trp Glu Arg Asp Phe His Glu Tyr Thr His Asn Arg Pro Glu
275 280 285
Phe Ala Asp Tyr Gln Phe Glu Glu Asn Asn Gln Phe Met Glu His Pro
290 295 300
Asp Pro Phe Lys Ile Gly Val Glu Leu Met Glu Thr Gly Gly Arg Leu
305 310 315 320
Ser Glu Ala Ala Leu Ala Phe Glu Ala Ala Val Gln Lys Asn Thr Glu
325 330 335
His Ala Glu Ala Trp Gly Arg Leu Gly Ala Cys Gln Ala Gln Asn Glu
340 345 350
Lys Glu Asp Pro Ala Ile Arg Ala Leu Glu Arg Cys Ile Lys Leu Glu
355 360 365
Pro Gly Asn Leu Ser Ala Leu Met Asn Leu Ser Val Ser Tyr Thr Asn
370 375 380
Glu Gly Tyr Glu Asn Ala Ala Tyr Ala Thr Leu Glu Arg Trp Leu Ala
385 390 395 400
Thr Lys Tyr Pro Glu Val Val Asp Gln Ala Arg Asn Gln Glu Pro Arg
405 410 415
Leu Gly Asn Glu Asp Lys Phe Gln Leu His Ser Arg Val Thr Glu Leu
420 425 430
Phe Ile Arg Ala Ala Gln Leu Ser Pro Asp Gly Ala Asn Ile Asp Ala
435 440 445
Asp Val Gln Val Gly Leu Gly Val Leu Phe Tyr Gly Asn Glu Glu Tyr
450 455 460
Asp Lys Ala Ile Asp Cys Phe Asn Ala Ala Ile Ala Val Arg Pro Asp
465 470 475 480
Asp Ala Leu Leu Trp Asn Arg Leu Gly Ala Thr Leu Ala Asn Ser His
485 490 495
Arg Ser Glu Glu Ala Ile Asp Ala Tyr Tyr Lys Ala Leu Glu Leu Arg
500 505 510
Pro Ser Phe Val Arg Ala Arg Tyr Asn Leu Gly Val Ser Cys Ile Asn
515 520 525
Ile Gly Cys Tyr Lys Glu Ala Ala Gln Tyr Leu Leu Gly Ala Leu Ser
530 535 540
Met His Lys Val Glu Gly Val Gln Asp Asp Val Leu Ala Asn Gln Ser
545v550 555 560
Thr Asn Leu Tyr Asp Thr Leu Lys Arg Val Phe Leu Gly Met Asp Arg
565 570 575
Arg Asp Leu Val Ala Lys Val Gly Asn Gly Met Asp Val Asn Gln Phe
580 585 590
Arg Asn Glu Phe Glu Phe
595
<210>7
<211>1024
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG82306)
<220>
<221>MISC_FEATURE
<222>(1)..(1024)
<223>YlPex6p
<400>7
Met Pro Ser Ile Ser His Lys Pro Ile Thr Ala Lys Leu Val Ala Ala
1 5 10 15
Pro Asp Ala Thr Lys Leu Glu Leu Ser Ser Tyr Leu Tyr Gln Gln Leu
20 25 30
Phe Ser Asp Lys Pro Ala Glu Pro Tyr Val Ala Phe Glu Ala Pro Gly
35 40 45
Ile Lys Trp Ala Leu Tyr Pro Ala Ser Glu Asp Arg Ser Leu Pro Gln
50 55 60
Tyr Thr Cys Lys Ala Asp Ile Arg His Val Ala Gly Ser Leu Lys Lys
65 70 75 80
Phe Met Pro Val Val Leu Lys Arg Val Asn Pro Val Thr Ile Glu His
85 90 95
Ala Ile Val Thr Val Pro Ala Ser Gln Tyr Glu Thr Leu Asn Thr Pro
100 105 110
Glu Gln Val Leu Lys Ala Leu Glu Pro Gln Leu Asp Lys Asp Arg Pro
115 120 125
Val Ile Arg Gln Gly Asp Val Leu Leu Asn Gly Cys Arg Val Arg Leu
130 135 140
Cys Glu Pro Val Asn Gln Gly Lys Val Val Lys Gly Thr Thr Lys Leu
145 150 155 160
Thr Val Ala Lys Glu Gln Glu Thr Ile Gln Pro Ala Asp Glu Ala Ala
165 170 175
Asp Val Ala Phe Asp Ile Ala Glu Phe Leu Asp Phe Asp Thr Ser Val
180 185 190
Ala Lys Thr Arg Glu Ser Thr Asn Leu Gln Val Ala Pro Leu Glu Gly
195 200 205
Ala Ile Pro Thr Pro Leu Ser Asp Arg Phe Asp Asp Cys Glu Ser Arg
210 215 220
Gly Phe Val Lys Ser Glu Thr Met Ser Lys Leu Gly Val Phe Ser Gly
225 230 235 240
Asp Ile Val Ser Ile Lys Thr Lys Asn Gly Ala Glu Arg Val Leu Arg
245 250 255
Leu Phe Ala Tyr Pro Glu Pro Asn Thr Val Lys Tyr Asp Val Val Tyr
260 265 270
Val Ser Pro Ile Leu Tyr His Asn Ile Gly Asp Lys Glu Ile Glu Val
275 280 285
Thr Pro Asn Gly Glu Thr His Lys Ser Val Gly Glu Ala Leu Asp Ser
290 295 300
Val Leu Glu AIa Ala Glu Glu Val Lys Leu Ala Arg Val Leu Gly Pro
305 310 315 320
Thr Thr Thr Asp Arg Thr Phe Gln Thr Ala Tyr His Ala Gly Leu Gln
325 330 335
Ala Tyr Phe Lys Pro Val Lys Arg Ala Val Arg Val Gly Asp Leu Ile
340 345 350
Pro Ile Pro Phe Asp Ser Ile Leu Ala Arg Thr Ile Gly Glu Asp Pro
355 360 365
Glu Met Ser His Ile Pro Leu Glu Ala Leu Ala Val Lys Pro Asp Ser
370 375 380
Val Ala Trp Phe Gln Val Thr Ser Leu Asn Gly Ser Glu Asp Pro Ala
385 390 395 400
Ser Lys Gln Tyr Leu Val Asp Ser Ser Gln Thr Lys Leu Ile Glu Gly
405 410 415
Gly Thr Thr Ser Ser Ala Val Ile Pro Thr Ser Val Pro Trp Arg Glu
420 425 430
Tyr Leu Gly Leu Asp Thr Leu Pro Lys Phe Gly Ser Glu Phe Ala Tyr
435 440 445
Ala Asp Lys Ile Arg Asn Leu Val Gln Ile Ser Thr Ser Ala Leu Ser
450 455 460
His Ala Lys Leu Asn Thr Ser Val Leu Leu His Ser Ala Lys Arg Gly
465 470 475 480
Val Gly Lys Ser Thr Val Leu Arg Ser Val Ala Ala Gln Cys Gly Ile
485 490 495
Ser Val Phe Glu Ile Ser Cys Phe Gly Leu Ile Gly Asp Asn Glu Ala
500 505 510
Gln Thr Leu Gly Thr Leu Arg Ala Lys Leu Asp Arg Ala Tyr Gly Cys
515 520 525
Ser Pro Cys Val Val Val Leu Gln His Leu Glu Ser Ile Ala Lys Lys
530 535 540
Ser Asp Gln Asp Gly Lys Asp Glu Gly Ile Val Ser Lys Leu Val Asp
545 550 555 560
Val Leu Ala Asp Tyr Ser Gly His Gly Val Leu Leu Ala Ala Thr Ser
565 570 575
Asn Asp Pro Asp Lys Ile Ser Glu Ala Ile Arg Ser Arg Phe Gln Phe
580 585 590
Glu Ile Glu Ile Gly Val Pro Ser Glu Pro Gln Arg Arg Gln Ile Phe
595 600 605
Ser His Leu Thr Lys Ser Gly Pro Gly Gly Asp Ser Ile Arg Asn Ala
610 615 620
Pro Ile Ser Leu Arg Ser Asp Val Ser Val Glu Asn Leu Ala Leu Gln
625 630 635 640
Ser Ala Gly Leu Thr Pro Pro Asp Leu Thr Ala Ile Val Gln Thr Thr
645 650 655
Arg Leu Arg Ala Ile Asp Arg Leu Asn Lys Leu Thr Lys Asp Ser Asp
660 665 670
Thr Thr Leu Asp Asp Leu Leu Thr Leu Ser His Gly Thr Leu Gln Leu
675 680 685
Thr Pro Ser Asp Phe Asp Asp Ala Ile Ala Asp Ala Arg Gln Lys Tyr
690 695 700
Ser Asp Ser Ile Gly Ala Pro Arg Ile Pro Asn Val Gly Trp Asp Asp
705 710 715 720
Val Gly Gly Met Glu Gly Val Lys Lys Asp Ile Leu Asp Thr Ile Glu
725 730 735
Thr Pro Leu Lys Tyr Pro His Trp Phe Ser Asp Gly Val Lys Lys Arg
740 745 750
Ser Gly Ile Leu Phe Tyr Gly Pro Pro Gly Thr Gly Lys Thr Leu Leu
755 760 765
Ala Lys Ala Ile Ala Thr Thr Phe Ser Leu Asn Phe Phe Ser Val Lys
770 775 780
Gly Pro Glu Leu Leu Asn Met Tyr Ile Gly Glu Ser Glu Ala Asn Val
785 790 795 800
Arg Arg Val Phe Gln Lys Ala Arg Asp Ala Lys Pro Cys Val Val Phe
805 810 815
Phe Asp Glu Leu Asp Ser Val Ala Pro Gln Arg Gly Asn Gln Gly Asp
820 825 830
Ser Gly Gly Val Mer Asp Arg Ile Val Ser Gln Leu Leu Ala Glu Leu
835 840 845
Asp Gly Met Ser Thr Ala Gly Gly Glu Gly Val Phe Val Val Gly Ala
850 855 860
Thr Asn Arg Pro Asp Leu Leu Asp Glu Ala Leu Leu Arg Pro Gly Arg
865 870 875 880
Phe Asp Lys Met Leu Tyr Leu Gly Ile Ser Asp Thr His Glu Lys Gln
885 890 895
Gln Thr Ile Met Glu Ala Leu Thr Arg Lys Phe Arg Leu Ala Ala Asp
900 905 910
Val Ser Leu Glu Ala Ile Ser Lys Arg Cys Pro Phe Thr Phe Thr Gly
915 920 925
Ala Asp Phe Tyr Ala Leu Cys Ser Asp Ala Met Leu Asn Ala Met Thr
930 935 940
Arg Thr Ala Asn Glu Val Asp Ala Lys Ile Lys Leu Leu Asn Lys Asn
945 950 955 960
Arg Glu Glu Ala Gly Glu Glu Pro Val Ser Ile Arg Trp Trp Phe Asp
965 970 975
His Glu Ala Thr Lys Ser Asp Ile Glu Val Glu Val Ala Gln Gln Asp
980 985 990
Phe Glu Lys Ala Lys Asp Glu Leu Ser Pro Ser Val Ser Ala Glu Glu
995 1000 1005
Leu Gln His Tyr Leu Lys Leu Arg Gln Gln Phe Glu Gly Gly Lys
1010 1015 1020
Lys
<210>8
<211>356
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG78389)
<220>
<221>MISC_FEATURE
<222>(1)..(356)
<223>YlPex7p
<400>8
Met Leu Gly Phe Lys Thr Gln Gly Phe Asn Gly Tyr Ala Ala Asn Tyr
1 5 10 15
Ser Pro Phe Phe Asn Asp Lys Ile Ala Val Gly Thr Ala Ala Asn Tyr
20 25 30
Gly Leu Val Gly Asn Gly Lys Leu Phe Ile Leu Gly Ile Ser Pro Glu
35 40 45
Gly Arg Met Val Cys Glu Gly Gln Phe Asp Thr Gln Asp Gly Ile Phe
50 55 60
Asp Val Ala Trp Ser Glu Gln His Glu Asn His Val Ala Thr Ala Cys
65 70 75 80
Gly Asp Gly Ser Val Lys Leu Phe Asp Ile Lys Ala Gly Ala Phe Pro
85 90 95
Leu Val Ser Phe Lys Glu His Thr Arg Glu Val Phe Ser Val Asn Trp
100 105 110
Asn Met Ala Asn Lys Ala Leu Phe Cys Thr Ser Ser Trp Asp Ser Thr
115 120 125
Ile Lys Ile Trp Thr Pro Glu Arg Thr Asn Ser Ile Met Thr Leu Gly
130 135 140
Gln Pro Ala Pro Ala Gln Gly Thr Asn Ala Ser Ala His Ile Gly Arg
145 150 155 160
Gln Thr Ala Pro Asn Gln Ala Ala Ala Gln Glu Cys Ile Tyr Ser Ala
165 170 175
Lys Phe Ser Pro His Thr Asp Ser Ile Ile Ala Ser Ala His Ser Thr
180 185 190
Gly Met Val Lys Val Trp Asp Thr Arg Ala Pro Gln Pro Leu Gln Gln
195 200 205
Gln Phe Ser Thr Gln Gln Thr Glu Ser Gly Gly Pro Pro Glu Val Leu
210 215 220
Ser Leu Asp Trp Asn Lys Tyr Arg Pro Thr Val Ile Ala Thr Gly Gly
225 230 235 240
Val Asp Arg Ser Val Gln Val Tyr Asp Ile Arg Met Thr Gln Pro Ala
245 250 255
Ala Asn Gln Pro Val Gln Pro Leu Ser Leu Ile Leu Gly His Arg Leu
260 265 270
Pro Val Arg Gly Val Ser Trp Ser Pro His His Ala Asp Leu Leu Leu
275 280 285
Ser Cys Ser Tyr Asp Met Thr Ala Arg Val Trp Arg Asp Ala Ser Thr
290 295 300
Gly Gly Asn Tyr Leu Ala Arg Gln Arg Gly Gly Thr Glu Val Lys Cys
305 310 315 320
Met Asp Arg His Thr Glu Phe Val Ile Gly Gly Asp Trp Ser Leu Trp
325 330 335
Gly Asp Pro Gly Trp Ile Thr Thr Val Gly Trp Asp Gln Met Val Tyr
340 345 350
Val Trp His Ala
355
<210>9
<211>671
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG80447)
<220>
<221>MISC_FEATURE
<222>(1)..(671)
<223>YlPex8p
<400>9
Met Asn Lys Tyr Leu Val Pro Pro Pro Gln Ala Asn Arg Thr Val Thr
1 5 10 15
Asn Leu Asp Leu Leu Ile Asn Asn Leu Arg Gly Ser Ser Thr Pro Gly
20 25 30
Ala Ala Glu Val Asp Thr Arg Asp Ile Leu Gln Arg Ile Val Phe Ile
35 40 45
Leu Pro Thr Ile Lys Asn Pro Leu Asn Leu Asp Leu Val Ile Lys Glu
50 55 60
Ile Ile Asn Ser Pro Arg Leu Leu Pro Pro Leu Ile Asp Leu His Asp
65 70 75 80
Tyr Gln Gln Leu Thr Asp Ala Phe Arg Ala Thr Ile Lys Arg Lys Ala
85 90 95
Leu Val Thr Asp Pro Thr Ile Ser Phe Glu Ala Trp Leu Glu Thr Cys
100 105 110
Phe Gln Val Ile Thr Arg Phe Ala Gly Pro Gly Trp Lys Lys Leu Pro
115 120 125
Leu Leu Ala Gly Leu Ile Leu Ala Asp Tyr Asp Ile Ser Ala Asp Gly
130 135 140
Pro Thr Leu Glu Arg Lys Pro Gly Phe Pro Ser Lys Leu Lys His Leu
145 150 155 160
Leu Lys Arg Glu Phe Val Thr Thr Phe Asp Gln Cys Leu Ser Ile Asp
165 170 175
Thr Arg Asn Arg Ser Asp Ala Thr Lys Trp Val Pro Val Leu Ala Cys
180 185 190
Ile Ser Ile Ala Gln Val Tyr Ser Leu Leu Gly Asp Val Ala Ile Asn
195 200 205
Tyr Arg Arg Phe Leu Gln Val Gly Leu Asp Leu Ile Phe Ser Asn Tyr
210 215 220
Gly Leu Glu Met Gly Thr Ala Leu Ala Arg Leu His Ala Glu Ser Gly
225 230 235 240
Gly Asp Ala Thr Thr Ala Gly Gly Leu Ile Gly Lys Lys Leu Lys Glu
245 250 255
Pro Val Val Ala Leu Leu Asn Thr Phe Ala His Ile Ala Ser Ser Cys
260 265 270
Ile Val His Val Asp Ile Asp Tyr Ile Asp Arg Ile Gln Asn Lys Ile
275 280 285
Ile Leu Val Cys Glu Asn Gln Ala Glu Thr Trp Arg Ile Leu Thr Ile
290 295 300
Glu Ser Pro Thr Val Met His His Gln Glu Ser Val Gln Tyr Leu Lys
305 310 315 320
Trp Glu Leu Phe Thr Leu Cys Ile Ile Met Gln Gly Ile Ala Asn Met
325 330 335
Leu Leu Thr Gln Lys Met Asn Gln Phe Met Tyr Leu Gln Leu Ala Tyr
340 345 350
Lys Gln Leu Gln Ala Leu His Ser Ile Tyr Phe Ile Val Asp Gln Met
355 360 365
Gly Ser Gln Phe Ala Ala Tyr Asp Tyr Val Phe Phe Ser Ala Ile Asp
370 375 380
Val Leu Leu Ser Glu Tyr Ala Pro Tyr Ile Lys Asn Arg Gly Thr Ile
385 390 395 400
Pro Pro Asn Lys Glu Phe Val Ala Glu Arg Leu Ala Ala Asn Leu Ala
405 410 415
Gly Thr Ser Asn Val Gly Ser His Leu Pro Ile Asp Arg Ser Arg Val
420 425 430
Leu Phe Ala Leu Asn Tyr Tyr Glu Gln Leu Val Thr Val Cys His Asp
435 440 445
Ser Cys Val Glu Thr Ile Ile Tyr Pro Met Ala Arg Ser Phe Leu Tyr
450 455 460
Pro Thr Ser Asp Ile Gln Gln Leu Lys Pro Leu Val Glu Ala Ala His
465 470 475 480
Ser Val Ile Leu Ala Gly Leu Ala Val Pro Thr Asn Ala Val Val Asn
485 490 495
Ala Lys Leu Ile Pro Glu Tyr Met Gly Gly Val Leu Pro Leu Phe Pro
500 505 510
Gly Val Phe Ser Trp Asn Gln Phe Val Leu Ala Ile Gln Ser Ile Val
515 520 525
Asn Thr Val Ser Pro Pro Ser Glu ValPhe Lys Thr Asn Gln Lys Leu
530 535 540
Phe Arg Leu Val Leu Asp Ser Leu Met Lys Lys Cys Arg Asp Thr Pro
545 550 555 560
Val Gly Ile Pro Val Pro His Ser Val Thr Val Ser Gln Glu Gln Glu
565 570 575
Asp Ile Pro Pro Thr Gln Arg Ala Val Val Met Leu Ala Leu Ile Asn
580 585 590
Ser Leu Pro Tyr Val Asp Ile Arg Ser Phe Glu Leu Trp Leu Gln Glu
595 600 605
Thr Trp Asn Met Ile Glu Ala Thr Pro Met Leu Ala Glu Asn Ala Pro
610 615 620
Asn Lys Glu Leu Ala His Ala Glu His Glu Phe Leu Val Leu Glu Met
625 630 635 640
Trp Lys Met Ile Ser Gly Asn Ile Asp Gln Arg Leu Asn Asp Val Ala
645 650 655
Ile Arg Trp Trp Tyr Lys Lys Asn Ala Arg Val His Gly Thr Leu
660 665 670
<210>10
<211>377
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG81606)
<220>
<221>MISC_FEATURE
<222>(1)..(377)
<223>YlPex10p
<400>10
Met Trp Gly Ser Ser His Ala Phe Ala Gly Glu Ser Asp Leu Thr Leu
1 5 10 15
Gln Leu His Thr Arg Ser Asn Met Ser Asp Asn Thr Thr Ile Lys Lys
20 25 30
Pro Ile Arg Pro Lys Pro Ile Arg Thr Glu Arg Leu Pro Tyr Ala Gly
35 40 45
Ala Ala Glu Ile Ile Arg Ala Asn Gln Lys Asp His Tyr Phe Glu Ser
50 55 60
Val Leu Glu Gln His Leu Val Thr Phe Leu Gln Lys Trp Lys Gly Val
65 70 75 80
Arg Phe Ile His Gln Tyr Lys Glu Glu Leu Glu Thr Ala Ser Lys Phe
85 90 95
Ala Tyr Leu Gly Leu Cys Thr Leu Val Gly Ser Lys Thr Leu Gly Glu
100 105 110
Glu Tyr Thr Asn Leu Met Tyr Thr Ile Arg Asp Arg Thr Ala Leu Pro
115 120 125
Gly Val Val Arg Arg Phe Gly Tyr Val Leu Ser Asn Thr Leu Phe Pro
130 135 140
Tyr Leu Phe Val Arg Tyr Met Gly Lys Leu Arg Ala Lys Leu Met Arg
145 150 155 160
Glu Tyr Pro His Leu Val Glu Tyr Asp Glu Asp Glu Pro Val Pro Ser
165 170 175
Pro Glu Thr Trp Lys Glu Arg Val Ile Lys Thr Phe Val Asn Lys Phe
180 185 190
Asp Lys Phe Thr Ala Leu Glu Gly Phe Thr Ala Ile His Leu Ala Ile
195 200 205
Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln Leu Ser Lys Arg Ile Trp Gly
210 215 220
Met Arg Tyr Val Phe Gly His Arg Leu Asp Lys Asn Glu Pro Arg Ile
225 230 235 240
Gly Tyr Glu Met Leu Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr Ser
245 250 255
Phe Val Gln Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu Glu Lys Ser
260 265 270
Val Glu Lys Glu Ala Gly Glu Lys Glu Asp Glu Lys Glu Ala Val Val
275 280 285
Pro Lys Lys Lys Ser Ser Ile Pro Phe Ile Glu Asp Thr Glu Gly Glu
290 295 300
Thr Glu Asp Lys Ile Asp Leu Glu Asp Pro Arg Gln Leu Lys Phe Ile
305 310 315 320
Pro Glu Ala Ser Arg Ala Cys Thr Leu Cys Leu Ser Tyr Ile Ser Ala
325 330 335
Pro Ala Cys Thr Pro Cys Gly His Phe Phe Cys Trp Asp Cys Ile Ser
340 345 350
Glu Trp Val Arg Glu Lys Pro Glu Cys Pro Leu Cys Arg Gln Gly Val
355 360 365
Arg Glu Gln Asn Leu Leu Pro Ile Arg
370 375
<210>11
<211>408
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG81532)
<220>
<221>MISC_FEATURE
<222>(1)..(408)
<223>YlPex12p
<400>11
Met Asp Tyr Phe Ser Ser Leu Asn Ala Ser Gln Leu Asp Pro Asp Val
1 5 10 15
Pro Thr Leu Phe Glu Leu Leu Ser Ala Lys Gln Leu Glu Gly Leu Ile
20 25 30
Ala Pro Ser Val Arg Tyr Ile Leu Ala Phe Tyr Ala Gln Arg His Pro
35 40 45
Arg Tyr Leu Leu Arg Ile Val Asn Arg Tyr Asp Glu Leu Tyr Ala Leu
50 55 60
Phe Mer Gly Leu Val Glu Tyr Tyr Asn Leu Lys Thr Trp Asn Ala Ser
65 70 75 80
Phe Thr Glu Lys Phe Tyr Gly Leu Lys Arg Thr Gln Ile Leu Thr Asn
85 90 95
Pro Ala Leu Arg Thr Arg Gln Ala Val Pro Asp Leu Val Glu Ala Glu
100 105 110
Lys Arg Leu Ser Lys Lys Lys Ile Trp Gly Ser Leu Phe Phe Leu Ile
115 120 125
Val Val Pro Tyr Val Lys Glu Lys Leu Asp Ala Arg Tyr Glu Arg Leu
130 135 140
Lys Gly Arg Tyr Leu Ala Arg Asp Ile Asn Glu Glu Arg Ile Glu Ile
145 150 155 160
Lys Arg Thr Gly Thr Ala Gln Gln Ile Ala Val Phe Glu Phe Asp Tyr
165 170 175
Trp Leu Leu Lys Leu Tyr Pro Ile Val Thr Met Gly Cys Thr Thr Ala
180 185 190
Thr Leu Ala Phe His Met Leu Phe Leu Phe Ser Val Thr Arg Ala Tyr
195 200 205
Ser Ile Asp Asp Phe Leu Leu Asn Ile Gln Phe Ser Arg Met Thr Arg
210 215 220
Tyr Asp Tyr Gln Met Glu Thr Gln Arg Asp Ser Arg Asn Ala Ala Asn
225 230 235 240
Val Ala His Thr Met Lys Ser Ile Ser Glu Tyr Pro Val Ala Glu Arg
245 250 255
Val Met Leu Leu Leu Thr Thr Lys Ala Gly Ala Asn Ala Met Arg Ser
260 265 270
Ala Ala Leu Ser Gly Leu Ser Tyr Val Leu Pro Thr Ser Ile Phe Ala
275 280 285
Leu Lys Phe Leu Glu Trp Trp Tyr Ala Ser Asp Phe Ala Arg Gln Leu
290 295 300
Asn Gln Lys Arg Arg Gly Asp Leu Glu Asp Asn Leu Pro Val Pro Asp
305 310 315 320
Lys Val Lys Gly Ala Asp Lys Leu Ala Glu Ser Val Ala Lys Trp Lys
325 330 335
Glu Asp Thr Ser Lys Cys Pro Leu Cys Ser Lys Glu Leu Val Asn Pro
340 345 350
Thr Val Ile Glu Ser Gly Tyr Val Phe Cys Tyr Thr Cys Ile Tyr Arg
355 360 365
His Leu Glu Asp Gly Asp Glu Glu Thr Gly Gly Arg Cys Pro Val Thr
370 375 380
Gly Gln Lys Leu Leu Gly Cys Arg Trp Gln Asp Asp Val Trp Gln Val
385 390 395 400
Thr Gly Leu Arg Arg Leu Met Val
405
<210>12
<211>412
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG81789)
<220>
<221>MISC_FEATURE
<222>(1)..(412)
<223>YlPex13p
<400>12
Met Ser Val Pro Arg Pro Lys Pro Trp Glu Gly Ala Ser Gly Ser Ser
1 5 10 15
Ala Ala Thr Ala Thr Pro Ala Ala Thr Ala Thr Pro Ala Ser Thr Asp
20 25 30
Ala Val Ser Ser Ser Ala Gly Ser Ala Thr Gly Ala Pro Glu Leu Pro
35 40 45
Ser Arg Pro Ser Ala Met Gly Ser Thr Ser Asn Ala Leu Ser Ser Pro
50 55 60
Met Gly Ser Ser Met Asn Ser Gly Tyr Gly Gly Met Asn Ser Gly Tyr
65 70 75 80
Gly Gly Met Gly Ser Ser Tyr Gly Ser Gly Tyr Gly Ser Ser Tyr Gly
85 90 95
Met Gly Ser Ser Tyr Gly Ser Gly Tyr Gly Ser Gly Leu Gly Gly Tyr
100 105 110
Gly Ser Tyr Gly Gly Met Gly Gly Met Gly Gly Met Tyr Gly Ser Arg
115 120 125
Tyr Gly Gly Tyr Gly Ser Tyr Gly Gly Met Gly Gly Tyr Gly Gly Tyr
130 135 140
Gly Gly Met Gly Gly Gly Pro Met Gly Gln Asn Gly Leu Ala Gly Gly
145 150 155 160
Thr Gln Ala Thr Phe Gln Leu Ile Glu Ser Ile Val Gly Ala Val Gly
165 170 175
Gly Phe Ala Gln Met Leu Glu Ser Thr Tyr Met Ala Thr Gln Ser Ser
180 185 190
Phe Phe Ala Met Val Ser Val Ala Glu Gln Phe Gly Asn Leu Lys Asn
195 200 205
Thr Leu Gly Ser Leu Leu Gly Ile Tyr Ala Ile Met Arg Trp Ala Arg
210 215 220
Arg Leu Val Ala Lys Leu Ser Gly Gln Pro Val Thr Gly Ala Asn Gly
225 230 235 240
Ile Thr Pro Ala Gly Phe Ala Lys Phe Glu Ala Thr Gly Gly Ala Ala
245 250 255
Gly Pro Gly Arg Gly Pro Arg Pro Ser Tyr Lys Pro Leu Leu Phe Phe
260 265 270
Leu Thr Ala Val Phe Gly Leu Pro Tyr Leu Leu Gly Arg Leu Ile Lys
275 280 285
Ala Leu Ala Ala Lys Gln Glu Gly Met Tyr Asp Glu His Gly Asn Leu
290 295 300
Leu Pro Gly Ala Gln Met Gly Met Gly Gly Pro Gly Met Glu Gly Gly
305 310 315 320
Ala Glu Ile Asp Pro Ser Lys Leu Glu Phe Cys Arg Ala Asn Phe Asp
325 330 335
Phe Val Pro Glu Asn Pro Gln Leu Glu Leu Glu Leu Arg Lys Gly Asp
340 345 350
Leu Val Ala Val Leu Ala Lys Thr Asp Pro Met Gly Asn Pro Ser Gln
355 360 365
Trp Trp Arg Val Arg Thr Arg Asp Gly Arg Ser Gly Tyr Val Pro Ala
370 375 380
Asn Tyr Leu Glu Val Ile Pro Arg Pro Ala Val Glu Ala Pro Lys Lys
385 390 395 400
Val Glu Glu Ile Gly Ala Ser Ala Val Pro Val Asn
405 410
<210>13
<211>380
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG79323)
<220>
<221>MISC_FEATURE
<222>(1)..(380)
<223>YlPex14p
<400>13
Met Ile Pro Ser Cys Leu Ser Thr Gln His Met Ala Pro Arg Glu Asp
1 5 10 15
Leu Val Gln Ser Ala Val Ala Phe Leu Asn Asp Pro Gln Ala Ala Thr
20 25 30
Ala Pro Leu Ala Lys Arg Ile Glu Phe Leu Glu Ser Lys Asp Met Thr
35 40 45
Pro Glu Glu Ile Glu Glu Ala Leu Lys Arg Ala Gly Ser Gly Ser Ala
50 55 60
Gln Ser His Pro Gly Ser Val Val Ser His Gly Gly Ala Ala Pro Thr
65 70 75 80
Val Pro Ala Ser Tyr Ala Phe Gln Ser Ala Pro Pro Leu Pro Glu Arg
85 90 95
Asp Trp Lys Asp Val Phe Ile Met Ala Thr Val Thr Val Gly Val Gly
100 105 110
Phe Gly Leu Tyr Thr Val Ala Lys Arg Tyr Leu Met Pro Leu Ile Leu
115 120 125
Pro Pro Thr Pro Pro Ser Leu Glu Ala Asp Lys Glu Ala Leu Glu Ala
130 135 140
Glu Phe Ala Arg Val Gln Gly Leu Leu Asp Gln Val Gln Gln Asp Thr
145 150 155 160
Glu Glu Val Lys Asn Ser Gln Val Glu Val Ala Lys Arg Val Thr Asp
165 170 175
Ala Leu Lys Gly Val Glu Glu Thr Ile Asp Gln Leu Lys Ser Gln Thr
180 185 190
Lys Lys Arg Asp Asp Glu Met Lys Leu Val Thr Ala Glu Val Glu Arg
195 200 205
Ile Arg Asp Arg Leu Pro Lys Asn Ile Asp Lys Leu Lys Asp Ser Gln
210 215 220
Glu Gln Gly Leu Ala Asp Ile Gln Ser Glu Leu Lys Ser Leu Lys Gln
225 230 235 240
Leu Leu Ser Thr Arg Thr Ala Ala Ser Ser Gly Pro Lys Leu Pro Pro
245 250 255
Ile Pro Pro Pro Ser Ser Tyr Leu Thr Arg Lys Ala Ser Pro Ala Val
260 265 270
Pro Ala Ala Ala Pro Ala Pro Val Thr Pro Gly Ser Pro Val His Asn
275 280 285
Val Ser Ser Ser Ser Thr Val Pro Ala Asp Arg Asp Asp Phe Ile Pro
290 295 300
Thr Pro Ala Gly Ala Val Pro Met Ile Pro Gln Pro Ala Ser Met Ser
305 310 315 320
Ser Ser Ser Thr Ser Thr Val Pro Asn Ser Ala Ile Ser Ser Ala Pro
325 330 335
Ser Pro Ile Gln Glu Pro Glu Pro Phe Val Pro Glu Pro Gly Asn Ser
340 345 350
Ala Val Lys Lys Pro Ala Pro Lys Ala Ser Ile Pro Ala Trp Gln Leu
355 360 365
Ala Ala Leu Glu Lys Glu Lys Glu Lys Glu Lys Glu
370 375 380
<210>14
<211>391
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG79622)
<220>
<221>MISC_FEATURE
<222>(1)..(391)
<223>YlPex16p
<400>14
Met Thr Asp Lys Leu Val Lys Val Met Gln Lys Lys Lys Ser Ala Pro
1 5 10 15
Gln Thr Trp Leu Asp Ser Tyr Asp Lys Phe Leu Val Arg Asn Ala Ala
20 25 30
Ser Ile Gly Ser Ile Glu Ser Thr Leu Arg Thr Val Ser Tyr Val Leu
35 40 45
Pro Gly Arg Phe Asn Asp Val Glu Ile Ala Thr Glu Thr Leu Tyr Ala
50 55 60
Val Leu Asn Val Leu Gly Leu Tyr His Asp Thr Ile Ile Ala Arg Ala
65 70 75 80
Val Ala Ala Ser Pro Asn Ala Ala Ala Val Tyr Arg Pro Ser Pro His
85 90 95
Asn Arg Tyr Thr Asp Trp Phe Ile Lys Asn Arg Lys Gly Tyr Lys Tyr
100 105 110
Ala Ser Arg Ala Val Thr Phe Val Lys Phe Gly Glu Leu Val Ala Glu
115 120 125
Met Val Ala Lys Lys Asn Gly Gly Glu Met Ala Arg Trp Lys Cys Ile
130 135 140
Ile Gly Ile Glu Gly Ile Lys Ala Gly Leu Arg Ile Tyr Met Leu Gly
145 150 155 160
Ser Thr Leu Tyr Gln Pro Leu Cys Thr Thr Pro Tyr Pro Asp Arg Glu
165 170 175
Val Thr Gly Glu Leu Leu Glu Thr Ile Cys Arg Asp Glu Gly Glu Leu
180 185 190
Asp Ile Glu Lys Gly Leu Met Asp Pro Gln Trp Lys Met Pro Arg Thr
195 200 205
Gly Arg Thr Ile Pro Glu Ile Ala Pro Thr Asn Val Glu Gly Tyr Leu
210 215 220
Leu Thr Lys Val Leu Arg Ser Glu Asp Val Asp Arg Pro Tyr Asn Leu
225 230 235 240
Leu Ser Arg Leu Asp Asn Trp Gly Val Val Ala Glu Leu Leu Ser Ile
245 250 255
Leu Arg Pro Leu Ile Tyr Ala Cys Leu Leu Phe Arg Gln His Val Asn
260 265 270
Lys Thr Val Pro Ala Ser Thr Lys Ser Lys Phe Pro Phe Leu Asn Ser
275 280 285
Pro Trp Ala Pro Trp Ile Ile Gly Leu Val Ile Glu Ala Leu Ser Arg
290 295 300
Lys Met Met Gly Ser Trp Leu Leu Arg Gln Arg Gln Ser Gly Lys Thr
305 310 315 320
Pro Thr Ala Leu Asp Gln Met Glu Val Lys Gly Arg Thr Asn Leu Leu
325 330 335
Gly Trp Trp Leu Phe Arg Gly Glu Phe Tyr Gln Ala Tyr Thr Arg Pro
340 345 350
Leu Leu Tyr Ser Ile Val Ala Arg Leu Glu Lys Ile Pro Gly Leu Gly
355 360 365
Leu Phe Gly Ala Leu Ile Ser Asp Tyr Leu Tyr Leu Phe Asp Arg Tyr
370 375 380
Tyr Phe Thr Ala Ser Thr Leu
385 390
<210>15
<211>225
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG84025)
<220>
<221>MISC_FEATURE
<222>(1)..(225)
<223>YlPex17p
<400>15
Met Ser Ala Phe Pro Glu Pro Ser Ser Phe Glu Ile Glu Phe Ala Lys
1 5 10 15
Gln Met Asn Arg Pro Arg Thr Val Gln Phe Lys Gln Leu Val Ala Val
20 25 30
Leu Tyr Ile Phe Gly Gly Thr Ser Ala Leu Ile Tyr Ile Ile Ser Lys
35 40 45
Thr Ile Leu Asn Pro Leu Phe Glu Glu Leu Thr Phe Ala Arg Ser Glu
50 55 60
Tyr Ala Ile His Ala Arg Arg Leu Met Glu Gln Leu Asn Ala Lys Leu
65 70 75 80
Ser Ser Met Ala Ser Tyr Ile Pro Pro Val Arg Ala Leu Gln Gly Gln
85 90 95
Arg Phe Val Asp Ala Gln Thr Gln Thr Glu Asp Glu Glu Gly Glu Asp
100 105 110
Ile Pro Asn Pro Ser Leu Gly Lys Ser Ser His Val Ser Phe Gly Glu
115 120 125
Ser Pro Met Gln Leu Lys Leu Ala Glu Lys Glu Lys Gln Gln Lys Leu
130 135 140
Ile Asp Asp Ser Val Asp Asn Leu Glu Arg Leu Ala Asp Ser Leu Lys
145 150 155 160
His Ala Gly Glu Val Ser Asp Leu Ser Ala Leu Ser Gly Phe Lys Tyr
165 170 175
Gln Val Glu Glu Leu Thr Asn Tyr Ser Asp Gln Leu Ala Met Ser Gly
180 185 190
Tyr Ser Met Met Lys Ser Gly Leu Pro Gly His Glu Thr Ala Met Ser
195 200 205
Glu Thr Lys Lys Glu Ile Arg Ser Leu Lys Gly Ser Val Leu Ser Val
210 215 220
Arg
225
<210>16
<211>324
<212>PRT
<213>解脂耶氏酵母(GenBank保藏号AAK84827)
<220>
<221>MISC_FEATURE
<222>(1)..(324)
<223>YlPex19p
<400>16
Met Ser His Glu Glu Asp Leu Asp Asp Leu Asp Asp Phe Leu Asp Glu
1 5 10 15
Phe Asp Glu Gln Val Leu Ser Lys Pro Pro Gly Ala Gln Lys Asp Ala
20 25 30
Thr Pro Thr Thr Ser Thr Ala Pro Thr Thr Ala Glu Ala Lys Pro Asp
35 40 45
Ala Thr Lys Lys Ser Thr Glu Thr Ser Gly Thr Asp Ser Lys Thr Glu
50 55 60
Gly Ala Asp Thr Ala Asp Lys Asn Ala Ala Thr Asp Ser Ala Glu Ala
65 70 75 80
Gly Ala Glu Lys Val Ser Leu Pro Asn Leu Glu Asp Gln Leu Ala Gly
85 90 95
Leu Lys Met Asp Asp Phe Leu Lys Asp Ile Glu Ala Asp Pro Glu Ser
100 105 110
Lys Ala Gln Phe Glu Ser Leu Leu Lys Glu Ile Asn Asn Val Thr Ser
115 120 125
Ala Thr Ala Ser Glu Lys Ala Gln Gln Pro Lys Ser Phe Lys Glu Thr
130 135 140
Ile Ser Ala Thr Ala Asp Arg Leu Asn Gln Ser Asn Gln Glu Met Gly
145 150 155 160
Asp Met Pro Leu Gly Asp Asp Met Leu Ala Gly Leu Met Glu Gln Leu
165 170 175
Ser Gly Ala Gly Gly Phe Gly Glu Gly Gly Glu Gly Asp Phe Gly Asp
180 185 190
Met Leu Gly Gly Ile Met Arg Gln Leu Ala Ser Lys Glu Val Leu Tyr
195 200 205
Gln Pro Leu Lys Glu Met His Asp Asn Tyr Pro Lys Trp Trp Asp Glu
210 215 220
His Gly Ser Lys Val Thr Glu Glu Lys Glu Arg Asp Arg Leu Lys Leu
225 230 235 240
Gln Gln Asp Ile Val Gly Lys Ile Cys Ala Lys Phe Glu Asp Pro Ser
245 250 255
Tyr Ser Asp Asp Ser Glu Ala Asp Arg Ala ValIle Thr Gln Leu Met
260 265 270
Asp Glu Met Gln Glu Thr Gly Ala Pro Pro Asp Glu Ile Met Ser Asn
275 280 285
Val Ala Asp Gly Ser Ile Pro Gly Gly Leu Asp Gly Leu Gly Leu Gly
290 295 300
Gly Leu Gly Gly Gly Lys Met Pro Glu Met Pro Glu Asn Met Pro Glu
305 310 315 320
Cys Asn Gln Gln
<210>17
<211>417
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG79226)
<220>
<221>MISC_FEATURE
<222>(1)..(417)
<223>YlPex20p
<400>17
Met Ala Ser Cys Gly Pro Ser Asn Ala Leu Gln Asn Leu Ser Lys His
1 5 10 15
Ala Ser Ala Asp Arg Ser Leu Gln His Asp Arg Met Ala Pro Gly Gly
20 25 30
Ala Pro Gly Ala Gln Arg Gln Gln Phe Arg Ser Gln Thr Gln Gly Gly
35 40 45
Gln Leu Asn Asn Glu Phe Gln Gln Phe Ala Gln Ala Gly Pro Ala His
50 55 60
Asn Ser Phe Glu Gln Ser Gln Met Gly Pro His Phe Gly Gln Gln His
65 70 75 80
Phe Gly Gln Pro His Gln Pro Gln Met Gly Gln His Ala Pro Met Ala
85 90 95
His Gly Gln Gln Ser Asp Trp Ala Gln Ser Phe Ser Gln Leu Asn Leu
100 105 110
Gly Pro Gln Thr Gly Pro Gln His Thr Gln Gln Ser Asn Trp Gly Gln
115 120 125
Asp Phe Met Arg Gln Ser Pro Gln Ser His Gln Val Gln Pro Gln Met
130 135 140
Ala Asn Gly Val Met Gly Ser Met Ser Gly Met Ser Ser Phe Gly Pro
145 150 155 160
Met Tyr Ser Asn Ser Gln Leu Met Asn Ser Thr Tyr Gly Leu Gln Thr
165 170 175
Glu His Gln Gln Thr His Lys Thr Glu Thr Lys Ser Ser Gln Asp Ala
180 185 190
Ala Phe Glu Ala Ala Phe Gly Ala Val Glu Glu Ser Ile Thr Lys Thr
195 200 205
Ser Asp Lys Gly Lys Glu Val Glu Lys Asp Pro Met Glu Gln Thr Tyr
210 215 220
Arg Tyr Asp Gln Ala Asp Ala Leu Asn Arg Gln Ala Glu His Ile Ser
225 230 235 240
Asp Asn Ile Ser Arg Glu Glu Val Asp Ile Lys Thr Asp Glu Asn Gly
245 250 255
Glu Phe Ala Ser Ile Ala Arg Gln Ile Ala Ser Ser Leu Glu Glu Ala
260 265 270
Asp Lys Ser Lys Phe Glu Lys Ser Thr Phe Met Asn Leu Met Arg Arg
275 280 285
Ile Gly Asn His Glu Val Thr Leu Asp Gly Asp Lys Leu Val Asn Lys
290 295 300
Glu Gly Glu Asp Ile Arg Glu Glu Val Arg Asp Glu Leu Leu Arg Glu
305 310 315 320
Gly Ala Ser Gln Glu Asn Gly Phe Gln Ser Glu Ala Gln Gln Thr Ala
325 330 335
Pro Leu Pro Val His His Glu Ala Pro Pro Pro Glu Gln Ile His Pro
340 345 350
His Thr Glu Thr Gly Asp Lys Gln Leu Glu Asp Pro Met Val Tyr Ile
355 360 365
Glu Gln Glu Ala Ala Arg Arg Ala Ala Glu Ser Gly Arg Thr Val Glu
370 375 380
Glu Glu Lys Leu Asn Phe Tyr Ser Pro Phe Glu Tyr Ala Gln Lys Leu
385 390 395 400
Gly Pro Gln Gly Val Ala Lys Gln Ser Asn Trp Glu Glu Asp Tyr Asp
405 410 415
Phe
<210>18
<211>195
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号CAG77876)
<220>
<221>MISC_FEATURE
<222>(1)..(195)
<223>YlPex22p
<400>18
Val Pro Arg Cys Thr Ser His Pro Cys Asn Leu Thr Leu His Leu Pro
1 5 10 15
Val Thr Thr Met Ala Pro Arg Lys Thr Arg Leu Pro Ala Val Ile Gly
20 25 30
Ala Ala Ala Ala Ala Ala Ala Val Ala Tyr Leu Val Tyr Ser Phe Val
35 40 45
Ala Lys Ser Asn Ser Asp Gln Asp Thr Phe Asp Ser Ser Val Gln Ser
50 55 60
Ser Ser Lys Ser Ser Thr Lys Ser Pro Lys Ser Thr Ala Thr Asn Ser
65 70 75 80
Lys Ile Thr Val Val Val Ser Gln Glu Leu Val Gln Ser Gln Leu Val
85 90 95
Asp Phe Lys His Leu Met Ser Val His Pro Asn Leu Val Val Ile Val
100 105 110
Pro Pro Met Val Ala Asn Lys Phe His Arg Ala Leu Lys Ser Ser Val
115 120 125
Gly His Asp His Gly Val Lys Val Ile Arg Cys Asp Thr Asp Val Gly
130 135 140
Val Ile His Val Ile Lys His Ile Arg Pro Asp Leu Ala Leu Ile Ala
145 150 155 160
Asp Gly Val Gly Asp Asn Ile Gln Gly Glu Ile Lys Arg Phe Val Gly
165 170 175
Ser Ser Glu Ala Leu Ser Gly Asp Val Asn Leu Ala Ala Glu Arg Leu
180 185 190
Thr Gly Leu
195
<210>19
<211>386
<212>PRT
<213>解脂耶氏酵母CLIB122(GenBank保藏号NC_006072,核苷酸117230至118387的
反义翻译)
<220>
<221>MISC_FEATURE
<222>(1)..(386)
<223>YlPex26p
<400>19
Met Pro Pro Ala Met Pro Gln Met Thr Thr Ser Thr Leu Leu Thr Asp
1 5 10 15
Ser Val Thr Ser Ala Val Asn Gln Ala Ala Thr Pro Lys Val Asp Gln
20 25 30
Met Tyr Gln Thr Phe Gly Glu Ser Ala Arg Glu Phe Val Asn Lys Asn
35 40 45
Phe Tyr Asn Ser Tyr Glu Leu Ile Arg Pro Phe Phe Asp Glu Ile Thr
50 55 60
Ala Lys Gly Ala Gln Gln Asn Gly Ser Thr Val Leu Asp Ala Glu Asn
65 70 75 80
Pro His Asn Ile Pro Leu Ser Leu Trp Ile Lys Val Trp Ser Leu Tyr
85 90 95
Leu Ala Ile Leu Asp Ala Ser Cys Lys Gln Ala Gly Glu Ala Leu Leu
100 105 110
Asn Ser Thr Gly Asp Leu Ser Gly Ser Asp Ser Gly Glu Trp Asn Gln
115 120 125
Thr Arg Lys Leu Leu Ala Arg Lys Leu Thr Ser Gly Ser Val Trp Asp
130 135 140
Glu Leu Val Thr Ala Ser Gly Gly Thr Gly Asn Ile His Pro Thr Ile
145 150 155 160
Leu Ala Leu Leu Ala Ser Leu Ser Ile Arg His Asp Thr Asp Ala Lys
165 170 175
Leu Met Ala Asp Asn Leu Glu Lys Phe Ile Val Thr Tyr Asn Asp Asn
180 185 190
Gly Ser Asp Asp Val Lys Thr Lys Thr Ala Phe Tyr Lys Val Leu Asp
195 200 205
Leu Tyr Leu Leu Arg Val Leu Pro Asp Leu Gly Gln Trp Asp Val Ala
210 215 220
His Ser Phe Val Asn Asn Thr Asn Leu Phe Ser His Glu Gln Lys Lys
225 230 235 240
Glu Met Thr His Lys Leu Asp Gln Ser Gln Lys His Ala Glu Gln Glu
245 250 255
His Lys Arg Leu Leu Glu Glu Ala Gln Glu Lys Glu Lys Ser Asp Ala
260 265 270
Lys Glu Lys Glu Arg Glu Glu Arg Val Ser Arg Asp Thr Gln Ser Arg
275 280 285
Glu Ile Lys Ser Pro Ile Val Asp Ser Ser Thr Ser Ser Arg Asp Val
290 295 300
Thr Arg Asp Thr Thr Arg Glu Leu Ser Lys Ser Ser Arg Gln Pro Arg
305 310 315 320
Thr Leu Ser Gln Ile Ile Ser Thr Ser Leu Lys Ser Gln Phe Asp Gly
325 330 335
Asn Ala Ile Phe Arg Thr Leu Ala Leu Ile Val Ile Val Ser Leu Ser
340 345 350
Ala Ala Asn Pro Leu Ile Arg Lys Arg Val Val Asp Thr Leu Lys Met
355 360 365
Leu Trp Ile Lys Ile Leu Gln Thr Leu Ser Met Gly Phe Lys Val Ser
370 375 380
Tyr Leu
385
<210>20
<211>3387
<212>DNA
<213>解脂耶氏酵母(GenBank保藏号AB036770)
<400>20
ggtaccatca agggtaaaat caaggctatc atcaagggcc atatatcgca agtttggggg 60
aagataatat gttcatagtg aatcgggttg tggatttcct catctaacgg cattataact 120
agtcctggag ggtctttttt atggataacc tccatgtacg atgtatccaa gatctccacg 180
tactgtgttc tgtttcctaa gtaataccca acaacctctc caacaaacac ttgggaagat 240
gcacttgtgc tgagatgtca agatgttaga gagtagagac agtagcaagc gtaaaaggcg 300
gccgaggcca ccgagagaac agcgtagcag ggcgcgtagt caccacaggg gacgcagaac 360
caaacaaatg acgaagaaga accacaagga gacgttttca aaggcaatgc aaacgaagag 420
ggcaatggaa ggattgagat tagagaactg gagactggag tggcgttttc ccgatgaacg 480
aacaaacacg cgaagctatg tggaccaaca tacaacacgg actgaaccag gtttttttat 540
gattttttta ctggaaatag gtacgtgcca agttggacca tgacactaaa cgtgtttaat 600
tagtaatatt cgtgtaagcg tacattcatt tcaaaggtta ttctttcacg gcaaagttat 660
aattaaatga atgtatatgc agaaaaaaaa aaaaaaagta ctgtactgga tggagagaat 720
attaataaat aattgttacc caactacatc ttgtcgattg aaagagaccc ctaagacaga 780
taggatatct gcaacccgag gaatgaaccc cccagcaccg gcaccctttc tattaacaaa 840
atgccaactg aaatttgaaa agttcaacta aacttatttg acccacaaaa actcgtcaaa 900
agtggcggcg aaagctggca aatgatgaca tccccttgga accatgatat cctctcggaa 960
tcttcgtccc catttgccac atctacttgc aacgccacat ctgcttacta agcaacccaa 1020
atctgcctcg gctcaaaatg tggggaagtt cacatgcatt cgctggtgaa tctgatctga 1080
cactacaact acacaccagg tccaacatga gcgacaatac gacaatcaaa aagccgatcc 1140
gacccaaacc gatccggacg gaacgcctgc cttacgctgg ggccgcagaa atcatccgag 1200
ccaaccagaa agaccactac tttgagtccg tgcttgaaca gcatctcgtc acgtttctgc 1260
agaaatggaa gggagtacga tttatccacc agtacaagga ggagctggag acggcgtcca 1320
agtttgcata tctcggtttg tgtacgcttg tgggctccaa gactctcgga gaagagtaca 1380
ccaatctcat gtacactatc agagaccgaa cagctctacc gggggtggtg agacggtttg 1440
gctacgtgct ttccaacact ctgtttccat acctgtttgt gcgctacatg ggcaagttgc 1500
gcgccaaact gatgcgcgag tatccccatc tggtggagta cgacgaagat gagcctgtgc 1560
ccagcccgga aacatggaag gagcgggtca tcaagacgtt tgtgaacaag tttgacaagt 1620
tcacggcgct ggaggggttt accgcgatcc acttggcgat tttctacgtc tacggctcgt 1680
actaccagct cagtaagcgg atctggggca tgcgttatgt atttggacac cgactggaca 1740
agaatgagcc tcgaatcggt tacgagatgc tcggtctgct gattttcgcc cggtttgcca 1800
cgtcatttgt gcagacggga agagagtacc tcggagcgct gctggaaaag agcgtggaga 1860
aagaggcagg ggagaaggaa gatgaaaagg aagcggttgt gccgaaaaag aagtcgtcaa 1920
ttccgttcat tgaggataca gaaggggaga cggaagacaa gatcgatctg gaggaccctc 1980
gacagctcaa gttcattcct gaggcgtcca gagcgtgcac tctgtgtctg tcatacatta 2040
gtgcgccggc atgtacgcca tgtggacact ttttctgttg ggactgtatt tccgaatggg 2100
tgagagagaa gcccgagtgt cccttgtgtc ggcagggtgt gagagagcag aacttgttgc 2160
ctatcagata atgacgaggt ctggatggaa ggactagtca gcgagacaca gagcatcagg 2220
gaccagacac gaccaattca atcgacaaca ctgtgctgca tagcagtgca cagaggtcct 2280
gggcatgaat atattttagc attggagata tgagtggtag agcgtataca gtattaattg 2340
tggaggtatc tcgtcgcatt gatagagcaa tacagttact gctgaaggga atgataccga 2400
gtatttcggc ccgattcagt tcttgatatc gtcattttgt ctctattgtc tacttttcag 2460
ataacctcaa caaatcttca acaaatctcc cagtaaacag tcagagatca tatccgagat 2520
catatcagat atgtcacgat ccgagtacaa taatggatat taatctgctt gattttgaat 2580
tctgttgcga ttatgatttc tttgatttcg atatgaacac atacggcgac tcccagacct 2640
ttagaagctc cagtttggat tcttagcaat ggttacactc aactatatcc caagtaatac 2700
ttggtaacaa tatgccaagt tagtcattca ttcgttatag gagttagcaa gtgtttgtca 2760
gctaaaaatg gttagtcggt cgattaccac ttagatcttt tcagcgtgga acttgatggt 2820
acgcttgaac cgacacttgg agtagtcggg gctgttgatg acgtagatga cgtttcgctc 2880
agggtgagga gtgcaatagt agtactcctt ggggccgtct ctcagctcaa aggttccatc 2940
ggcggcaatg tcaaagaccg agccctggag cttgtagccg tagtcgccgg tccagaacaa 3000
agcctgcagc tccagatagg cgatgggcat gtcgttaaca gagaaggtgt tgccctcgcc 3060
ctcggtgatg gtgatgggtt cgccgtcggt ggaggcggtg atcaggtcat cttggtaggt 3120
gacgggcaga gattcgaccg attgggcgtc tgatctggta taggtcagct tgtacttgtc 3180
tccgacagcc gccagagcgg tggtagcgac ggtgatgagg gagatgagtt tcatattggc 3240
ggcaagttta gcaaaagatg gcagtgggat tgagggacaa gagtgtttat atagatatag 3300
atacaacaca acgagtctga atgagacaac cgagacaacc actcccgaag cctcactaat 3360
agttactaac ggcatatttc aggtacc 3387
<210>21
<211>1134
<212>DNA
<213>解脂耶氏酵母(GenBank保藏号AB036770,核苷酸1038-2171)
<220>
<221>CDS
<222>(1)..(1134)
<223>Pex10
<400>21
atg tgg gga agt tca cat gca ttc gct ggt gaa tct gat ctg aca cta 48
Met Trp Gly Ser Ser His Ala Phe Ala Gly Glu Ser Asp Leu Thr Leu
1 5 10 15
caa cta cac acc agg tcc aac atg agc gac aat acg aca atc aaa aag 96
Gln Leu His Thr Arg Ser Asn Met Ser Asp Asn Thr Thr Ile Lys Lys
20 25 30
ccg atc cga ccc aaa ccg atc cgg acg gaa cgc ctg cct tac gct ggg 144
Pro Ile Arg Pro Lys Pro Ile Arg Thr Glu Arg Leu Pro Tyr Ala Gly
35 40 45
gcc gca gaa atc atc cga gcc aac cag aaa gac cac tac ttt gag tcc 192
Ala Ala Glu Ile Ile Arg Ala Asn Gln Lys Asp His Tyr Phe Glu Ser
50 55 60
gtg ctt gaa cag cat ctc gtc acg ttt ctg cag aaa tgg aag gga gta 240
Val Leu Glu Gln His Leu Val Thr Phe Leu Gln Lys Trp Lys Gly Val
65 70 75 80
cga ttt atc cac cag tac aag gag gag ctg gag acg gcg tcc aag ttt 288
Arg Phe Ile His Gln Tyr Lys Glu Glu Leu Glu Thr Ala Ser Lys Phe
85 90 95
gca tat ctc ggt ttg tgt acg ctt gtg ggc tcc aag act ctc gga gaa 336
Ala Tyr Leu Gly Leu Cys Thr Leu Val Gly Ser Lys Thr Leu Gly Glu
100 105 110
gag tac acc aat ctc atg tac act atc aga gac cga aca gct cta ccg 384
Glu Tyr Thr Asn Leu Met Tyr Thr Ile Arg Asp Arg Thr Ala Leu Pro
115 120 125
ggg gtg gtg aga cgg ttt ggc tac gtg ctt tcc aac act ctg ttt cca 432
Gly Val Val Arg Arg Phe Gly Tyr Val Leu Ser Asn Thr Leu Phe Pro
130 135 140
tac ctg ttt gtg cgc tac atg ggc aag ttg cgc gcc aaa ctg atg cgc 480
Tyr Leu Phe Val Arg Tyr Met Gly Lys Leu Arg Ala Lys Leu Met Arg
145 150 155 160
gag tat ccc cat ctg gtg gag tac gac gaa gat gag cct gtg ccc agc 528
Glu Tyr Pro His Leu Val Glu Tyr Asp Glu Asp Glu Pro Val Pro Ser
165 170 175
ccg gaa aca tgg aag gag cgg gtc atc aag acg ttt gtg aac aag ttt 576
Pro Glu Thr Trp Lys Glu Arg Val Ile Lys Thr Phe Val Asn Lys Phe
180 185 190
gac aag ttc acg gcg ctg gag ggg ttt acc gcg atc cac ttg gcg att 624
Asp Lys Phe Thr Ala Leu Glu Gly Phe Thr Ala Ile His Leu Ala Ile
195 200 205
ttc tac gtc tac ggc tcg tac tac cag ctc agt aag cgg atc tgg ggc 672
Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln Leu Ser Lys Arg Ile Trp Gly
210 215 220
atg cgt tat gta ttt gga cac cga ctg gac aag aat gag cct cga atc 720
Met Arg Tyr Val Phe Gly His Arg Leu Asp Lys Asn Glu Pro Arg Ile
225 230 235 240
ggt tac gag atg ctc ggt ctg ctg att ttc gcc cgg ttt gcc acg tca 768
Gly Tyr Glu Met Leu Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr Ser
245 250 255
ttt gtg cag acg gga aga gag tac ctc gga gcg ctg ctg gaa aag agc 816
Phe Val Gln Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu Glu Lys Ser
260 265 270
gtg gag aaa gag gca ggg gag aag gaa gat gaa aag gaa gcg gtt gtg 864
Val Glu Lys Glu Ala Gly Glu Lys Glu Asp Glu Lys Glu Ala Val Val
275 280 285
ccg aaa aag aag tcg tca att ccg ttc att gag gat aca gaa ggg gag 912
Pro Lys Lys Lys Ser Ser Ile Pro Phe Ile Glu Asp Thr Glu Gly Glu
290 295 300
acg gaa gac aag atc gat ctg gag gac cct cga cag ctc aag ttc att 960
Thr Glu Asp Lys Ile Asp Leu Glu Asp Pro Arg Gln Leu Lys Phe Ile
305 310 315 320
cct gag gcg tcc aga gcg tgc act ctg tgt ctg tca tac att agt gcg 1008
Pro Glu Ala Ser Arg Ala Cys Thr Leu Cys Leu Ser Tyr Ile Ser Ala
325 330 335
ccg gca tgt acg cca tgt gga cac ttt ttc tgt tgg gac tgt att tcc 1056
Pro Ala Cys Thr Pro Cys Gly His Phe Phe Cys Trp Asp Cys Ile Ser
340 345 350
gaa tgg gtg aga gag aag ccc gag tgt ccc ttg tgt cgg cag ggt gtg 1104
Glu Trp Val Arg Glu Lys Pro Glu Cys Pro Leu Cys Arg Gln Gly Val
355 360 365
aga gag cag aac ttg ttg cct atc aga taa 1134
Arg Glu Gln Asn Leu Leu Pro Ile Arg
370 375
<210>22
<211>377
<212>PRT
<213>解脂耶氏酵母(GenBank保藏号AB036770,核苷酸1038-2171)
<400>22
Met Trp Gly Ser Ser His Ala Phe Ala Gly Glu Ser Asp Leu Thr Leu
1 5 10 15
Gln Leu His Thr Arg Ser Asn Met Ser Asp Asn Thr Thr Ile Lys Lys
20 25 30
Pro Ile Arg Pro Lys Pro Ile Arg Thr Glu Arg Leu Pro Tyr Ala Gly
35 40 45
Ala Ala Glu Ile Ile Arg Ala Asn Gln Lys Asp His Tyr Phe Glu Ser
50 55 60
Val Leu Glu Gln His Leu Val Thr Phe Leu Gln Lys Trp Lys Gly Val
65 70 75 80
Arg Phe Ile His Gln Tyr Lys Glu Glu Leu Glu Thr Ala Ser Lys Phe
85 90 95
Ala Tyr Leu Gly Leu Cys Thr Leu Val Gly Ser Lys Thr Leu Gly Glu
100 105 110
Glu Tyr Thr Asn Leu Met Tyr Thr Ile Arg Asp Arg Thr Ala Leu Pro
115 120 125
Gly Val Val Arg Arg Phe Gly Tyr Val Leu Ser Asn Thr Leu Phe Pro
130 135 140
Tyr Leu Phe Val Arg Tyr Met Gly Lys Leu Arg Ala Lys Leu Met Arg
145 150 155 160
Glu Tyr Pro His Leu Val Glu Tyr Asp Glu Asp Glu Pro Val Pro Ser
165 170 175
Pro Glu Thr Trp Lys Glu Arg Val Ile Lys Thr Phe Val Asn Lys Phe
180 185 190
Asp Lys Phe Thr Ala Leu Glu Gly Phe Thr Ala Ile His Leu Ala Ile
195 200 205
Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln Leu Ser Lys Arg Ile Trp Gly
210 215 220
Met Arg Tyr Val Phe Gly His Arg Leu Asp Lys Asn Glu Pro Arg Ile
225 230 235 240
Gly Tyr Glu Met Leu Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr Ser
245 250 255
Phe ValGln Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu Glu Lys Ser
260 265 270
ValGlu Lys Glu Ala Gly Glu Lys Glu Asp Glu Lys Glu Ala Val Val
275 280 285
Pro Lys Lys Lys Ser Ser Ile Pro Phe Ile Glu Asp Thr Glu Gly Glu
290 295 300
Thr Glu Asp Lys Ile Asp Leu Glu Asp Pro Arg Gln Leu Lys Phe Ile
305 310 315 320
Pro Glu Ala Ser Arg Ala Cys Thr Leu Cys Leu Ser Tyr Ile Ser Ala
325 330 335
Pro Ala Cys Thr Pro Cys Gly His Phe Phe Cys Trp Asp Cys Ile Ser
340 345 350
Glu Trp Val Arg Glu Lys Pro Glu Cys Pro Leu Cys Arg Gln Gly Val
355 360 365
Arg Glu Gln Asn Leu Leu Pro Ile Arg
370 375
<210>23
<211>1065
<212>DNA
<213>解脂耶氏酵母(GenBank保藏号AJ012084,对应于GenBank保藏号AB036770的核
苷酸1107-2171)
<220>
<221>CDS
<222>(1)..(1065)
<223>YlPEX10
<400>23
atg agc gac aat acg aca atc aaa aag ccg atc cga ccc aaa ccg atc 48
Met Ser Asp Asn Thr Thr Ile Lys Lys Pro Ile Arg Pro Lys Pro Ile
1 5 10 15
cgg acg gaa cgc ctg cct tac gct ggg gcc gca gaa atc atc cga gcc 96
Arg Thr Glu Arg Leu Pro Tyr Ala Gly Ala Ala Glu Ile Ile Arg Ala
20 25 30
aac cag aaa gac cac tac ttt gag tcc gtg ctt gaa cag cat ctc gtc 144
Asn Gln Lys Asp His Tyr Phe Glu Ser Val Leu Glu Gln His Leu Val
35 40 45
acg ttt ctg cag aaa tgg aag gga gta cga ttt atc cac cag tac aag 192
Thr Phe Leu Gln Lys Trp Lys Gly Val Arg Phe Ile His Gln Tyr Lys
50 55 60
gag gag ctg gag acg gcg tcc aag ttt gca tat ctc ggt ttg tgt acg 240
Glu Glu Leu Glu Thr Ala Ser Lys Phe Ala Tyr Leu Gly Leu Cys Thr
65 70 75 80
ctt gtg ggc tcc aag act ctc gga gaa gag tac acc aat ctc atg tac 288
Leu Val Gly Ser Lys Thr Leu Gly Glu Glu Tyr Thr Asn Leu Met Tyr
85 90 95
act atc aga gac cga aca gct cta ccg ggg gtg gtg aga cgg ttt ggc 336
Thr Ile Arg Asp Arg Thr Ala Leu Pro Gly Val Val Arg Arg Phe Gly
100 105 110
tac gtg ctt tcc aac act ctg ttt cca tac ctg ttt gtg cgc tac atg 384
Tyr Val Leu Ser Asn Thr Leu Phe Pro Tyr Leu Phe Val Arg Tyr Met
115 120 125
ggc aag ttg cgc gcc aaa ctg atg cgc gag tat ccc cat ctg gtg gag 432
Gly Lys Leu Arg Ala Lys Leu Met Arg Glu Tyr Pro His Leu Val Glu
130 135 140
tac gac gaa gat gag cct gtg ccc agc ccg gaa aca tgg aag gag cgg 480
Tyr Asp Glu Asp Glu Pro Val Pro Ser Pro Glu Thr Trp Lys Glu Arg
145 150 155 160
gtc atc aag acg ttt gtg aac aag ttt gac aag ttc acg gcg ctg gag 528
Val Ile Lys Thr Phe Val Asn Lys Phe Asp Lys Phe Thr Ala Leu Glu
165 170 175
ggg ttt acc gcg atc cac ttg gcg att ttc tac gtc tac ggc tcg tac 576
Gly Phe Thr Ala Ile His Leu Ala Ile Phe Tyr Val Tyr Gly Ser Tyr
180 185 190
tac cag ctc agt aag cgg atc tgg ggc atg cgt tat gta ttt gga cac 624
Tyr Gln Leu Ser Lys Arg Ile Trp Gly Met Arg Tyr Val Phe Gly His
195 200 205
cga ctg gac aag aat gag cct cga atc ggt tac gag atg ctc ggt ctg 672
Arg Leu Asp Lys Asn Glu Pro Arg Ile Gly Tyr Glu Met Leu Gly Leu
210 215 220
ctg att ttc gcc cgg ttt gcc acg tca ttt gtg cag acg gga aga gag 720
Leu Ile Phe Ala Arg Phe Ala Thr Ser Phe Val Gln Thr Gly Arg Glu
225 230 235 240
tac ctc gga gcg ctg ctg gaa aag agc gtg gag aaa gag gca ggg gag 768
Tyr Leu Gly Ala Leu Leu Glu Lys Ser Val Glu Lys Glu Ala Gly Glu
245 250 255
aag gaa gat gaa aag gaa gcg gtt gtg ccg aaa aag aag tcg tca att 816
Lys Glu Asp Glu Lys Glu Ala ValVal Pro Lys Lys Lys Ser Ser Ile
260 265 270
ccg ttc att gag gat aca gaa ggg gag acg gaa gac aag atc gat ctg 864
Pro Phe Ile Glu Asp Thr Glu Gly Glu Thr Glu Asp Lys Ile Asp Leu
275 280 285
gag gac cct cga cag ctc aag ttc att cct gag gcg tcc aga gcg tgc 912
Glu Asp Pro Arg Gln Leu Lys Phe Ile Pro Glu Ala Ser Arg Ala Cys
290 295 300
act ctg tgt ctg tca tac att agt gcg ccg gca tgt acg cca tgt gga 960
Thr Leu Cys Leu Ser Tyr Ile Ser Ala Pro Ala Cys Thr Pro Cys Gly
305 310 315 320
cac ttt ttc tgt tgg gac tgt att tcc gaa tgg gtg aga gag aag ccc 1008
His Phe Phe Cys Trp Asp Cys Ile Ser Glu Trp Val Arg Glu Lys Pro
325 330 335
gag tgt ccc ttg tgt cgg cag ggt gtg aga gag cag aac ttg ttg cct 1056
Glu Cys Pro Leu Cys Arg Gln Gly Val Arg Glu Gln Asn Leu Leu Pro
340 345 350
atc aga taa 1065
Ile Arg
<210>24
<211>354
<212>PRT
<213>解脂耶氏酵母(GenBank保藏号AJ012084,对应于GenBank保藏号AB036770的核
苷酸1107-2171)
<400>24
Met Ser Asp Asn Thr Thr Ile Lys Lys Pro Ile Arg Pro Lys Pro Ile
1 5 10 15
Arg Thr Glu Arg Leu Pro Tyr Ala Gly Ala Ala Glu Ile Ile Arg Ala
20 25 30
Asn Gln Lys Asp His Tyr Phe Glu Ser Val Leu Glu Gln His Leu Val
35 40 45
Thr Phe Leu Gln Lys Trp Lys Gly Val Arg Phe Ile His Gln Tyr Lys
50 55 60
Glu Glu Leu Glu Thr Ala Ser Lys Phe Ala Tyr Leu Gly Leu Cys Thr
65 70 75 80
Leu Val Gly Ser Lys Thr Leu Gly Glu Glu Tyr Thr Asn Leu Met Tyr
85 90 95
Thr Ile Arg Asp Arg Thr Ala Leu Pro Gly Val Val Arg Arg Phe Gly
100 105 110
Tyr Val Leu Ser Asn Thr Leu Phe Pro Tyr Leu Phe Val Arg Tyr Met
115 120 125
Gly Lys Leu Arg Ala Lys Leu Met Arg Glu Tyr Pro His Leu Val Glu
130 135 140
Tyr Asp Glu Asp Glu Pro Val Pro Ser Pro Glu Thr Trp Lys Glu Arg
145 150 155 160
Val Ile Lys Thr Phe Val Asn Lys Phe Asp Lys Phe Thr Ala Leu Glu
165 170 175
Gly Phe Thr Ala Ile His Leu Ala Ile Phe Tyr Val Tyr Gly Ser Tyr
180 185 190
Tyr Gln Leu Ser Lys Arg Ile Trp Gly Met Arg Tyr Val Phe Gly His
195 200 205
Arg Leu Asp Lys Asn Glu Pro Arg Ile Gly Tyr Glu Met Leu Gly Leu
210 215 220
Leu Ile Phe Ala Arg Phe Ala Thr Ser Phe Val Gln Thr Gly Arg Glu
225 230 235 240
Tyr Leu Gly Ala Leu Leu Glu Lys Ser Val Glu Lys Glu Ala Gly Glu
245 250 255
Lys Glu Asp Glu Lys Glu Ala Val Val Pro Lys Lys Lys Ser Ser Ile
260 265 270
Pro Phe Ile Glu Asp Thr Glu Gly Glu Thr Glu Asp Lys Ile Asp Leu
275 280 285
Glu Asp Pro Arg Gln Leu Lys Phe Ile Pro Glu Ala Ser Arg Ala Cys
290 295 300
Thr Leu Cys Leu Ser Tyr Ile Ser Ala Pro Ala Cys Thr Pro Cys Gly
305 310 315 320
His Phe Phe Cys Trp Asp Cys Ile Ser Glu Trp Val Arg Glu Lys Pro
325 330 335
Glu Cys Pro Leu Cys Arg Gln Gly Val Arg Glu Gln Asn Leu Leu Pro
340 345 350
Ile Arg
<210>25
<211>38
<212>PRT
<213>解脂耶氏酵母
<220>
<221>misc_feature
<222>(2)..(3)
<223>Xaa可以是任何天然存在的氨基酸
<220>
<221>misc_feature
<222>(5)..(15)
<223>Xaa可以是任何天然存在的氨基酸
<220>
<221>misc_feature
<222>(17)..(17)
<223>Xaa可以是任何天然存在的氨基酸
<220>
<221>misc_feature
<222>(19)..(20)
<223>Xaa可以是任何天然存在的氨基酸
<220>
<221>misc_feature
<222>(22)..(23)
<223>Xaa可以是任何天然存在的氨基酸
<220>
<221>misc_feature
<222>(25)..(34)
<223>Xaa可以是任何天然存在的氨基酸
<220>
<221>misc_feature
<222>(36)..(37)
<223>Xaa可以是任何天然存在的氨基酸
<400>25
Cys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys
1 5 10 15
Xaa His Xaa Xaa Cys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30
Xaa Xaa Cys Xaa Xaa Cys
35
<210>26
<211>345
<212>PRT
<213>解脂耶氏酵母
<400>26
Met Trp Gly Ser Ser His Ala Phe Ala Gly Glu Ser Asp Leu Thr Leu
1 5 10 15
Gln Leu His Thr Arg Ser Asn Met Ser Asp Asn Thr Thr Ile Lys Lys
20 25 30
Pro Ile Arg Pro Lys Pro Ile Arg Thr Glu Arg Leu Pro Tyr Ala Gly
35 40 45
Ala Ala Glu Ile Ile Arg Ala Asn Gln Lys Asp His Tyr Phe Glu Ser
50 55 60
Val Leu Glu Gln His Leu Val Thr Phe Leu Gln Lys Trp Lys Gly Val
65 70 75 80
Arg Phe Ile His Gln Tyr Lys Glu Glu Leu Glu Thr Ala Ser Lys Phe
85 90 95
Ala Tyr Leu Gly Leu Cys Thr Leu Val Gly Ser Lys Thr Leu Gly Glu
100 105 110
Glu Tyr Thr Asn Leu Met Tyr Thr Ile Arg Asp Arg Thr Ala Leu Pro
115 120 125
Gly Val Val Arg Arg Phe Gly Tyr Val Leu Ser Asn Thr Leu Phe Pro
130 135 140
Tyr Leu Phe Val Arg Tyr Met Gly Lys Leu Arg Ala Lys Leu Met Arg
145 150 155 160
Glu Tyr Pro His Leu Val Glu Tyr Asp Glu Asp Glu Pro Val Pro Ser
165 170 175
Pro Glu Thr Trp Lys Glu Arg Val Ile Lys Thr Phe Val Asn Lys Phe
180 185 190
Asp Lys Phe Thr Ala Leu Glu Gly Phe Thr Ala Ile His Leu Ala Ile
195 200 205
Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln Leu Ser Lys Arg Ile Trp Gly
210 215 220
Met Arg Tyr Val Phe Gly His Arg Leu Asp Lys Asn Glu Pro Arg Ile
225 230 235 240
Gly Tyr Glu Met Leu Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr Ser
245 250 255
Phe Val Gln Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu Glu Lys Ser
260 265 270
Val Glu Lys Glu Ala Gly Glu Lys Glu Asp Glu Lys Glu Ala Val Val
275 280 285
Pro Lys Lys Lys Ser Ser Ile Pro Phe Ile Glu Asp Thr Glu Gly Glu
290 295 300
Thr Glu Asp Lys Ile Asp Leu Glu Asp Pro Arg Gln Leu Lys Phe Ile
305 310 315 320
Pro Glu Ala Ser Arg Ala Cys Thr Leu Cys Leu Ser Tyr Ile Ser Ala
325 330 335
Pro Ala Cys Thr Pro Cys Gly His Phe
340 345
<210>27
<211>2987
<212>DNA
<213>解脂耶氏酵母
<220>
<221>misc_feature
<223>具有W497L突变的突变型乙酰羟酸合酶(AHAS)
<300>
<302>解脂耶氏酵母的高二十碳五烯酸生产菌株
<310>US 2006-0115881-A1
<311>2005-11-02
<312>2006-06-01
<313>(1)..(2987)
<300>
<302>解脂耶氏酵母的高二十碳五烯酸生产菌株
<310>WO 2006/052870
<311>2005-11-03
<312>2006-05-18
<313>(1)..(2987)
<400>27
ttccctagtc ccagtgtaca cccgccgata tcgcttaccc tgcagccgga ttaaggttgg 60
caatttttca cgtccttgtc tccgcaatta ctcaccgggt ggtttataag attgcaagcg 120
tcttgatttg tctctgtata ctaacatgca atcgcgactc gcccgacggg ccactaacct 180
ggccagaatc tccagatcca agtattctct tggtctgcga tatgtttcca acacaaaagc 240
ccctgctgcc cagccggcaa ctgctgagtg agtattcctt gccataaacg acccagaacc 300
actgtatagt gtttggaagc actagtcaga agaccagcga aaacaggtgg aaaaaactga 360
gacgaaaagc aacgaccaga aatgtaatgt gtggaaaagc gacacacaca gagcagataa 420
agaggtgaca aataacgaca aatgaaatat cagtatcttc ccacaatcac tacctctcag 480
ctgtctgaag gtgcggctga tatatccatc ccacgtctaa cgtatggagt gtgatagaat 540
atgacgacac aagcatgaga actcgctctc tatccaacca ccgaaacact gtcactacag 600
ccgttcttgt tgctccattc gcttttgtga ttccatgcct tctctggtga ctgacaacat 660
tccttccttt tctccagccc tgttgttatc tgctcatgac ctacggccac tctctatcgc 720
atactaacat agacgatccc agcccgctcc ccacttccag ggcaccgttg gcaagcctcc 780
tatcctcaag aaggctgagg ctgccaacgc tgacatggac gagtccttca tcggaatgtc 840
tggaggagag atcttccacg agatgatgct gcgacacaac gtcgacactg tcttcggtta 900
ccccggtgga gccattctcc ccgtctttga cgccattcac aactctgagt acttcaactt 960
tgtgctccct cgacacgagc agggtgccgg ccacatggcc gagggctacg ctcgagcctc 1020
tggtaagccc ggtgtcgttc tcgtcacctc tggccccggt gccaccaacg tcatcacccc 1080
catgcaggac gctctttccg atggtacccc catggttgtc ttcaccggtc aggtcctgac 1140
ctccgttatc ggcactgacg ccttccagga ggccgatgtt gtcggcatct cccgatcttg 1200
caccaagtgg aacgtcatgg tcaagaacgt tgctgagctc ccccgacgaa tcaacgaggc 1260
ctttgagatt gctacttccg gccgacccgg tcccgttctc gtcgatctgc ccaaggatgt 1320
tactgctgcc atcctgcgag agcccatccc caccaagtcc accattccct cgcattctct 1380
gaccaacctc acctctgccg ccgccaccga gttccagaag caggctatcc agcgagccgc 1440
caacctcatc aaccagtcca agaagcccgt cctttacgtc ggacagggta tccttggctc 1500
cgaggagggt cctaagctgc ttaaggagct ggctgagaag gccgagattc ccgtcaccac 1560
tactctgcag ggtcttggtg cctttgacga gcgagacccc aagtctctgc acatgctcgg 1620
tatgcacggt tccggctacg ccaacatggc catgcagaac gctgactgta tcattgctct 1680
cggcgcccga tttgatgacc gagttaccgg ctccatcccc aagtttgccc ccgaggctcg 1740
agccgctgcc cttgagggtc gaggtggtat tgttcacttt gagatccagg ccaagaacat 1800
caacaaggtt gttcaggcca ccgaagccgt tgagggagac gttaccgagt ctgtccgaca 1860
gctcatcccc ctcatcaaca aggtctctgc cgctgagcga gctccctgga ctgagactat 1920
ccagtcctgg aagcagcagt tccccttcct cttcgaggct gaaggtgagg atggtgttat 1980
caagccccag tccgtcattg ctctgctctc tgacctgaca gagaacaaca aggacaagac 2040
catcatcacc accggtgttg gtcagcatca gatgtggact gcccagcatt tccgatggcg 2100
acaccctcga accatgatca cttctggtgg tcttggaact atgggttacg gcctgcccgc 2160
cgctatcggc gccaaggttg cccgacctga ctgcgacgtc attgacatcg atggtgacgc 2220
ttctttcaac atgactctga ccgagctgtc caccgccgtt cagttcaaca ttggcgtcaa 2280
ggctattgtc ctcaacaacg aggaacaggg tatggtcacc cagctgcagt ctctcttcta 2340
cgagaaccga tactgccaca ctcatcagaa gaaccccgac ttcatgaagc tggccgagtc 2400
catgggcatg aagggtatcc gaatcactca cattgaccag ctggaggccg gtctcaagga 2460
gatgctcgca tacaagggcc ctgtgctcgt tgaggttgtt gtcgacaaga agatccccgt 2520
tcttcccatg gttcccgctg gtaaggcttt gcatgagttc cttgtctacg acgctgacgc 2580
cgaggctgct tctcgacccg atcgactgaa gaatgccccc gcccctcacg tccaccagac 2640
cacctttgag aactaagtgg aaaggaacac aagcaatccg aaccaaaaat aattggggtc 2700
ccgtgcccac agagtctagt gcagacctaa aatgaccaca gtaaattata gctgttatta 2760
aacatgagat tttgaccaac aagagcgtag gaatgttatt agctactact tgtacataca 2820
cagcatttgt tttaaataat gttgcctcca ggggcagtga gatcaggacc cagatccgtg 2880
gccagctctc tgacttcaga ccgcttgtac ttaagcagct cgcaacactg ttgtcgagga 2940
ttgaacttgc catattcgat tttgtggtca tgaatccagc acacctc 2987
<210>28
<211>14688
<212>DNA
<213>人工序列
<220>
<223>质粒pZKLeuN-29E3
<400>28
cgattgttgt ctactaacta tcgtacgata acttcgtata gcatacatta tacgaagtta 60
tcgcgtcgac gagtatctgt ctgactcgtc attgccgcct ttggagtacg actccaacta 120
tgagtgtgct tggatcactt tgacgataca ttcttcgttg gaggctgtgg gtctgacagc 180
tgcgttttcg gcgcggttgg ccgacaacaa tatcagctgc aacgtcattg ctggctttca 240
tcatgatcac atttttgtcg gcaaaggcga cgcccagaga gccattgacg ttctttctaa 300
tttggaccga tagccgtata gtccagtcta tctataagtt caactaactc gtaactatta 360
ccataacata tacttcactg ccccagataa ggttccgata aaaagttctg cagactaaat 420
ttatttcagt ctcctcttca ccaccaaaat gccctcctac gaagctcgag ctaacgtcca 480
caagtccgcc tttgccgctc gagtgctcaa gctcgtggca gccaagaaaa ccaacctgtg 540
tgcttctctg gatgttacca ccaccaagga gctcattgag cttgccgata aggtcggacc 600
ttatgtgtgc atgatcaaaa cccatatcga catcattgac gacttcacct acgccggcac 660
tgtgctcccc ctcaaggaac ttgctcttaa gcacggtttc ttcctgttcg aggacagaaa 720
gttcgcagat attggcaaca ctgtcaagca ccagtaccgg tgtcaccgaa tcgccgagtg 780
gtccgatatc accaacgccc acggtgtacc cggaaccgga atcattgctg gcctgcgagc 840
tggtgccgag gaaactgtct ctgaacagaa gaaggaggac gtctctgact acgagaactc 900
ccagtacaag gagttcctag tcccctctcc caacgagaag ctggccagag gtctgctcat 960
gctggccgag ctgtcttgca agggctctct ggccactggc gagtactcca agcagaccat 1020
tgagcttgcc cgatccgacc ccgagtttgt ggttggcttc attgcccaga accgacctaa 1080
gggcgactct gaggactggc ttattctgac ccccggggtg ggtcttgacg acaagggaga 1140
cgctctcgga cagcagtacc gaactgttga ggatgtcatg tctaccggaa cggatatcat 1200
aattgtcggc cgaggtctgt acggccagaa ccgagatcct attgaggagg ccaagcgata 1260
ccagaaggct ggctgggagg cttaccagaa gattaactgt tagaggttag actatggata 1320
tgtaatttaa ctgtgtatat agagagcgtg caagtatgga gcgcttgttc agcttgtatg 1380
atggtcagac gacctgtctg atcgagtatg tatgatactg cacaacctgt gtatccgcat 1440
gatctgtcca atggggcatg ttgttgtgtt tctcgatacg gagatgctgg gtacagtgct 1500
aatacgttga actacttata cttatatgag gctcgaagaa agctgacttg tgtatgactt 1560
attctcaact acatccccag tcacaatacc accactgcac taccactaca ccaaaaccat 1620
gatcaaacca cccatggact tcctggaggc agaagaactt gttatggaaa agctcaagag 1680
agagatcata acttcgtata gcatacatta tacgaagtta tcctgcaggt aaaggaattc 1740
tggagtttct gagagaaaaa ggcaagatac gtatgtaaca aagcgacgca tggtacaata 1800
ataccggagg catgtatcat agagagttag tggttcgatg atggcactgg tgcctggtat 1860
gactttatac ggctgactac atatttgtcc tcagacatac aattacagtc aagcacttac 1920
ccttggacat ctgtaggtac cccccggcca agacgatctc agcgtgtcgt atgtcggatt 1980
ggcgtagctc cctcgctcgt caattggctc ccatctactt tcttctgctt ggctacaccc 2040
agcatgtctg ctatggctcg ttttcgtgcc ttatctatcc tcccagtatt accaactcta 2100
aatgacatga tgtgattggg tctacacttt catatcagag ataaggagta gcacagttgc 2160
ataaaaagcc caactctaat cagcttcttc ctttcttgta attagtacaa aggtgattag 2220
cgaaatctgg aagcttagtt ggccctaaaa aaatcaaaaa aagcaaaaaa cgaaaaacga 2280
aaaaccacag ttttgagaac agggaggtaa cgaaggatcg tatatatata tatatatata 2340
tatacccacg gatcccgaga ccggcctttg attcttccct acaaccaacc attctcacca 2400
ccctaattca caaccatgga gtctggaccc atgcctgctg gcattccctt ccctgagtac 2460
tatgacttct ttatggactg gaagactccc ctggccatcg ctgccaccta cactgctgcc 2520
gtcggtctct tcaaccccaa ggttggcaag gtctcccgag tggttgccaa gtcggctaac 2580
gcaaagcctg ccgagcgaac ccagtccgga gctgccatga ctgccttcgt ctttgtgcac 2640
aacctcattc tgtgtgtcta ctctggcatc accttctact acatgtttcc tgctatggtc 2700
aagaacttcc gaacccacac actgcacgaa gcctactgcg acacggatca gtccctctgg 2760
aacaacgcac ttggctactg gggttacctc ttctacctgt ccaagttcta cgaggtcatt 2820
gacaccatca tcatcatcct gaagggacga cggtcctcgc tgcttcagac ctaccaccat 2880
gctggagcca tgattaccat gtggtctggc atcaactacc aagccactcc catttggatc 2940
tttgtggtct tcaactcctt cattcacacc atcatgtact gttactatgc cttcacctct 3000
atcggattcc atcctcctgg caaaaagtac ctgacttcga tgcagattac tcagtttctg 3060
gtcggtatca ccattgccgt gtcctacctc ttcgttcctg gctgcatccg aacacccggt 3120
gctcagatgg ctgtctggat caacgtcggc tacctgtttc ccttgaccta tctgttcgtg 3180
gactttgcca agcgaaccta ctccaagcga tctgccattg ccgctcagaa aaaggctcag 3240
taagcggccg cattgatgat tggaaacaca cacatgggtt atatctaggt gagagttagt 3300
tggacagtta tatattaaat cagctatgcc aacggtaact tcattcatgt caacgaggaa 3360
ccagtgactg caagtaatat agaatttgac caccttgcca ttctcttgca ctcctttact 3420
atatctcatt tatttcttat atacaaatca cttcttcttc ccagcatcga gctcggaaac 3480
ctcatgagca ataacatcgt ggatctcgtc aatagagggc tttttggact ccttgctgtt 3540
ggccaccttg tccttgctgt ctggctcatt ctgtttcaac gccttttaat taacggagta 3600
ggtctcggtg tcggaagcga cgccagatcc gtcatcctcc tttcgctctc caaagtagat 3660
acctccgacg agctctcgga caatgatgaa gtcggtgccc tcaacgtttc ggatggggga 3720
gagatcggcg agcttgggcg acagcagctg gcagggtcgc aggttggcgt acaggttcag 3780
gtcctttcgc agcttgagga gaccctgctc gggtcgcacg tcggttcgtc cgtcgggagt 3840
ggtccatacg gtgttggcag cgcctccgac agcaccgagc ataatagagt cagcctttcg 3900
gcagatgtcg agagtagcgt cggtgatggg ctcgccctcc ttctcaatgg cagctcctcc 3960
aatgagtcgg tcctcaaaca caaactcggt gccggaggcc tcagcaacag acttgagcac 4020
cttgacggcc tcggcaatca cctcggggcc acagaagtcg ccgccgagaa gaacaatctt 4080
cttggagtca gtcttggtct tcttagtttc gggttccatt gtggatgtgt gtggttgtat 4140
gtgtgatgtg gtgtgtggag tgaaaatctg tggctggcaa acgctcttgt atatatacgc 4200
acttttgccc gtgctatgtg gaagactaaa cctccgaaga ttgtgactca ggtagtgcgg 4260
tatcggctag ggacccaaac cttgtcgatg ccgatagcat gcgacgtcgg gcccaattcg 4320
ccctatagtg agtcgtatta caattcactg gccgtcgttt tacaacgtcg tgactgggaa 4380
aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 4440
aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 4500
tggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 4560
ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 4620
ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat 4680
ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 4740
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 4800
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 4860
tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 4920
ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc ctgatgcggt attttctcct 4980
tacgcatctg tgcggtattt cacaccgcat caggtggcac ttttcgggga aatgtgcgcg 5040
gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 5100
aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc 5160
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa 5220
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac 5280
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga 5340
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag 5400
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca 5460
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca 5520
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa 5580
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc 5640
tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa 5700
cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag 5760
actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct 5820
ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac 5880
tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa 5940
ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt 6000
aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat 6060
ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg 6120
agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc 6180
ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg 6240
tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag 6300
cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact 6360
ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg 6420
gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc 6480
ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg 6540
aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg 6600
cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag 6660
ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc 6720
gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct 6780
ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc 6840
ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc 6900
gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac 6960
cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcgcgcccac tgagctcgtc 7020
taacggactt gatatacaac caattaaaac aaatgaaaag aaatacagtt ctttgtatca 7080
tttgtaacaa ttaccctgta caaactaagg tattgaaatc ccacaatatt cccaaagtcc 7140
acccctttcc aaattgtcat gcctacaact catataccaa gcactaacct accaaacacc 7200
actaaaaccc cacaaaatat atcttaccga atatacagta acaagctacc accacactcg 7260
ttgggtgcag tcgccagctt aaagatatct atccacatca gccacaactc ccttccttta 7320
ataaaccgac tacacccttg gctattgagg ttatgagtga atatactgta gacaagacac 7380
tttcaagaag actgtttcca aaacgtacca ctgtcctcca ctacaaacac acccaatctg 7440
cttcttctag tcaaggttgc tacaccggta aattataaat catcatttca ttagcagggc 7500
agggcccttt ttatagagtc ttatacacta gcggaccctg ccggtagacc aacccgcagg 7560
cgcgtcagtt tgctccttcc atcaatgcgt cgtagaaacg acttactcct tcttgagcag 7620
ctccttgacc ttgttggcaa caagtctccg acctcggagg tggaggaaga gcctccgata 7680
tcggcggtag tgataccagc ctcgacggac tccttgacgg cagcctcaac agcgtcaccg 7740
gcgggcttca tgttaagaga gaacttgagc atcatggcgg cagacagaat ggtggcgtac 7800
gcaactaaca tgaatgaata cgatatacat caaagactat gatacgcagt attgcacact 7860
gtacgagtaa gagcactagc cactgcactc aagtgaaacc gttgcccggg tacgagtatg 7920
agtatgtaca gtatgtttag tattgtactt ggacagtgct tgtatcgtac attctcaagt 7980
gtcaaacata aatatccgtt gctatatcct cgcaccacca cgtagctcgc tatatccctg 8040
tgttgaatcc atccatcttg gattgccaat tgtgcacaca gaaccgggca ctcacttccc 8100
catccacact tgcggccgct taagcaacgg gcttgataac agcggggggg gtgcccacgt 8160
tgttgcggtt gcggaagaac agaacaccct taccagcacc ctcggcacca gcgctgggct 8220
caacccactg gcacatacgc gcactgcggt acatggcgcg gatgaagcca cgaggaccat 8280
cctggacatc agcccggtag tgcttgccca tgatgggctt aatggcctcg gtggcctcgt 8340
ccgcgttgta gaaggggatg ctgctgacgt agtggtggag gacatgagtc tcgatgatgc 8400
cgtggagaag gtggcggccg atgaagccca tctcacggtc aatggtagca gcggcaccac 8460
ggacgaagtt ccactcgtcg ttggtgtagt ggggaagggt agggtcggtg tgctggagga 8520
aggtgatggc aacgagccag tggttaaccc agaggtaggg aacaaagtac cagatggcca 8580
tgttgtagaa accgaacttc tgaacgagga agtacagagc agtggccatc agaccgatac 8640
caatatcgct gaggacgatg agcttagcgt cactgttctc gtacagaggg ctgcggggat 8700
cgaagtggtt aacaccaccg ccgaggccgt tatgcttgcc cttgccgcga ccctcacgct 8760
ggcgctcgtg gtagttgtgg ccggtaacat tggtgatgag gtagttgggc cagccaacga 8820
gctgctgaag gacgagcatg agaagagtga aagcgggggt ctcctcagta agatgagcga 8880
gctcgtgggt catctttccg agacgagtag cctgctgctc gcgggttcgg ggaacgaaga 8940
ccatgtcacg ctccatgttg ccagtggcct tgtggtgctt tcggtgggag atttgccagc 9000
tgaagtaggg gacaaggagg gaagagtgaa gaacccagcc agtaatgtcg ttgatgatgc 9060
gagaatcgga gaaagcaccg tgaccgcact catgggcaat aacccagaga ccagtaccga 9120
aaagaccctg aagaacggtg tacacggccc acagaccagc gcgggcgggg gtggagggga 9180
tatattcggg ggtcacaaag ttgtaccaga tgctgaaagt ggtagtcagg aggacaatgt 9240
cgcggaggat ataaccgtat cccttgagag cggagcgctt gaagcagtgc ttagggatgg 9300
cattgtagat gtccttgatg gtaaagtcgg gaacctcgaa ctggttgccg taggtgtcga 9360
gcatgacacc atactcggac ttgggcttgg cgatatcaac ctcggacatg gacgagagcg 9420
atgtggaaga ggccgagtgg cggggagagt ctgaaggaga gacggcggca gactcagaat 9480
ccgtcacagt agttgaggtg acggtgcgtc taagcgcagg gttctgcttg ggcagagccg 9540
aagtggacgc catggttgat gtgtgtttaa ttcaagaatg aatatagaga agagaagaag 9600
aaaaaagatt caattgagcc ggcgatgcag acccttatat aaatgttgcc ttggacagac 9660
ggagcaagcc cgcccaaacc tacgttcggt ataatatgtt aagcttttta acacaaaggt 9720
ttggcttggg gtaacctgat gtggtgcaaa agaccgggcg ttggcgagcc attgcgcggg 9780
cgaatggggc cgtgactcgt ctcaaattcg agggcgtgcc tcaattcgtg cccccgtggc 9840
tttttcccgc cgtttccgcc ccgtttgcac cactgcagcc gcttctttgg ttcggacacc 9900
ttgctgcgag ctaggtgcct tgtgctactt aaaaagtggc ctcccaacac caacatgaca 9960
tgagtgcgtg ggccaagaca cgttggcggg gtcgcagtcg gctcaatggc ccggaaaaaa 10020
cgctgctgga gctggttcgg acgcagtccg ccgcggcgta tggatatccg caaggttcca 10080
tagcgccatt gccctccgtc ggcgtctatc ccgcaacctc taaatagagc gggaatataa 10140
cccaagcttc ttttttttcc tttaacacgc acacccccaa ctatcatgtt gctgctgctg 10200
tttgactcta ctctgtggag gggtgctccc acccaaccca acctacaggt ggatccggcg 10260
ctgtgattgg ctgataagtc tcctatccgg actaattctg accaatggga catgcgcgca 10320
ggacccaaat gccgcaatta cgtaacccca acgaaatgcc tacccctctt tggagcccag 10380
cggccccaaa tccccccaag cagcccggtt ctaccggctt ccatctccaa gcacaagcag 10440
cccggttcta ccggcttcca tctccaagca cccctttctc cacaccccac aaaaagaccc 10500
gtgcaggaca tcctactgcg tcgacatcat ttaaattcct tcacttcaag ttcattcttc 10560
atctgcttct gttttacttt gacaggcaaa tgaagacatg gtacgacttg atggaggcca 10620
agaacgccat ttcaccccga gacaccgaag tgcctgaaat cctggctgcc cccattgata 10680
acatcggaaa ctacggtatt ccggaaagtg tatatagaac ctttccccag cttgtgtctg 10740
tggatatgga tggtgtaatc ccctttgagt actcgtcttg gcttctctcc gagcagtatg 10800
aggctctcta atctagcgca tttaatatct caatgtattt atatatttat cttctcatgc 10860
ggccgctcac tgaatctttt tggctccctt gtgcttcctg acgatatacg tttgcacata 10920
gaaattcaag aacaaacaca agactgtgcc aacataaaag taattgaaga accagccaaa 10980
catcctcatc ccatcttggc gataacaggg aatgttcctg tacttccaga caatgtagaa 11040
accaacattg aattgaatga tctgcattga tgtaatcagg gattttggca tggggaactt 11100
cagcttgatc aatctggtcc aataataacc gtacatgatc cagtggatga aaccattcaa 11160
cagcacaaaa atccaaacag cttcatttcg gtaattatag aacagccaca tatccatcgg 11220
tgcccccaaa tgatggaaga attgcaacca ggtcagaggc ttgcccatca gtggcaaata 11280
gaaggagtca atatactcca ggaacttgct caaatagaac aactgcgtgg tgatcctgaa 11340
gacgttgttg tcaaaagcct tctcgcagtt gtcagacata acaccgatgg tgtacatggc 11400
atatgccatt gagaggaatg atcccaacga ataaatggac atgagaaggt tgtaattggt 11460
gaaaacaaac ttcatacgag actgaccttt tggaccaagg gggccaagag tgaacttcaa 11520
gatgacaaat gcgatggaca agtaaagcac ctcacagtga ctggcatcac tccagagttg 11580
ggcataatca actggttggg taaaacttcc tgcccaattg agactatttc attcaccacc 11640
tccatggcca ttgctgtaga tatgtcttgt gtgtaagggg gttggggtgg ttgtttgtgt 11700
tcttgacttt tgtgttagca agggaagacg ggcaaaaaag tgagtgtggt tgggagggag 11760
agacgagcct tatatataat gcttgtttgt gtttgtgcaa gtggacgccg aaacgggcag 11820
gagccaaact aaacaaggca gacaatgcga gcttaattgg attgcctgat gggcaggggt 11880
tagggctcga tcaatggggg tgcgaagtga caaaattggg aattaggttc gcaagcaagg 11940
ctgacaagac tttggcccaa acatttgtac gcggtggaca acaggagcca cccatcgtct 12000
gtcacgggct agccggtcgt gcgtcctgtc aggctccacc taggctccat gccactccat 12060
acaatcccac tagtgtaccg ctaggccgct tttagctccc atctaagacc cccccaaaac 12120
ctccactgta cagtgcactg tactgtgtgg cgatcaaggg caagggaaaa aaggcgcaaa 12180
catgcacgca tggaatgacg taggtaaggc gttactagac tgaaaagtgg cacatttcgg 12240
cgtgccaaag ggtcctaggt gcgtttcgcg agctgggcgc caggccaagc cgctccaaaa 12300
cgcctctccg actccctcca gcggcctcca tatccccatc cctctccaca gcaatgttgt 12360
taagccttgc aaacgaaaaa atagaaaggc taataagctt ccaatattgt ggtgtacgct 12420
gcataacgca acaatgagcg ccaaacaaca cacacacaca gcacacagca gcattaacca 12480
cgatgaacag catgacatta caggtgggtg tgtaatcagg gccctgattg ctggtggtgg 12540
gagcccccat catgggcaga tctgcgtaca ctgtttaaac agtgtacgca gatctactat 12600
agaggaacat ttaaattgcc ccggagaaga cggccaggcc gcctagatga caaattcaac 12660
aactcacagc tgactttctg ccattgccac tagggggggg cctttttata tggccaagcc 12720
aagctctcca cgtcggttgg gctgcaccca acaataaatg ggtagggttg caccaacaaa 12780
gggatgggat ggggggtaga agatacgagg ataacggggc tcaatggcac aaataagaac 12840
gaatactgcc attaagactc gtgatccagc gactgacacc attgcatcat ctaagggcct 12900
caaaactacc tcggaactgc tgcgctgatc tggacaccac agaggttccg agcactttag 12960
gttgcaccaa atgtcccacc aggtgcaggc agaaaacgct ggaacagcgt gtacagtttg 13020
tcttaacaaa aagtgagggc gctgaggtcg agcagggtgg tgtgacttgt tatagccttt 13080
agagctgcga aagcgcgtat ggatttggct catcaggcca gattgagggt ctgtggacac 13140
atgtcatgtt agtgtacttc aatcgccccc tggatatagc cccgacaata ggccgtggcc 13200
tcattttttt gccttccgca catttccatt gctcgatacc cacaccttgc ttctcctgca 13260
cttgccaacc ttaatactgg tttacattga ccaacatctt acaagcgggg ggcttgtcta 13320
gggtatatat aaacagtggc tctcccaatc ggttgccagt ctcttttttc ctttctttcc 13380
ccacagattc gaaatctaaa ctacacatca cagaattccg agccgtgagt atccacgaca 13440
agatcagtgt cgagacgacg cgttttgtgt aatgacacaa tccgaaagtc gctagcaaca 13500
cacactctct acacaaacta acccagctct ggtaccatgg aggtcgtgaa cgaaatcgtc 13560
tccattggcc aggaggttct tcccaaggtc gactatgctc agctctggtc tgatgcctcg 13620
cactgcgagg tgctgtacct ctccatcgcc ttcgtcatcc tgaagttcac ccttggtcct 13680
ctcggaccca agggtcagtc tcgaatgaag tttgtgttca ccaactacaa cctgctcatg 13740
tccatctact cgctgggctc cttcctctct atggcctacg ccatgtacac cattggtgtc 13800
atgtccgaca actgcgagaa ggctttcgac aacaatgtct tccgaatcac cactcagctg 13860
ttctacctca gcaagttcct cgagtacatt gactccttct atctgcccct catgggcaag 13920
cctctgacct ggttgcagtt ctttcaccat ctcggagctc ctatggacat gtggctgttc 13980
tacaactacc gaaacgaagc cgtttggatc tttgtgctgc tcaacggctt cattcactgg 14040
atcatgtacg gctactattg gacccgactg atcaagctca agttccctat gcccaagtcc 14100
ctgattactt ctatgcagat cattcagttc aacgttggct tctacatcgt ctggaagtac 14160
cggaacattc cctgctaccg acaagatgga atgagaatgt ttggctggtt tttcaactac 14220
ttctacgttg gtactgtcct gtgtctgttc ctcaacttct acgtgcagac ctacatcgtc 14280
cgaaagcaca agggagccaa aaagattcag tgagcggccg catgtacata caagattatt 14340
tatagaaatg aatcgcgatc gaacaaagag tacgagtgta cgagtagggg atgatgataa 14400
aagtggaaga agttccgcat ctttggattt atcaacgtgt aggacgatac ttcctgtaaa 14460
aatgcaatgt ctttaccata ggttctgctg tagatgttat taactaccat taacatgtct 14520
acttgtacag ttgcagacca gttggagtat agaatggtac acttaccaaa aagtgttgat 14580
ggttgtaact acgatatata aaactgttga cgggatcccc gctgatatgc ctaaggaaca 14640
atcaaagagg aagatattaa ttcagaatgc tagtatacag ttagggat 14688
<210>29
<211>1434
<212>DNA
<213>串珠镰孢菌
<220>
<221>CDS
<222>(1)..(1434)
<223>Δ-12去饱和酶
<300>
<302>适于改变含油酵母中多不饱和脂肪酸含量的Δ-12去饱和酶
<310>WO 2005/047485
<311>2004-11-12
<312>2005-05-26
<313>(1)..(1434)
<300>
<302>适于改变含油酵母中多不饱和脂肪酸含量的Δ-12去饱和酶
<310>US 2005-0216975-A1
<311>2004-11-10
<312>2005-09-29
<313>(1)..(1434)
<400>29
atg gcg tcc act tcg gct ctg ccc aag cag aac cct gcg ctt aga cgc 48
Met Ala Ser Thr Ser Ala Leu Pro Lys Gln Asn Pro Ala Leu Arg Arg
1 5 10 15
acc gtc acc tca act act gtg acg gat tct gag tct gcc gcc gtc tct 96
Thr Val Thr Ser Thr Thr Val Thr Asp Ser Glu Ser Ala Ala Val Ser
20 25 30
cct tca gac tct ccc cgc cac tcg gcc tct tcc aca tcg ctc tcg tcc 144
Pro Ser Asp Ser Pro Arg His Ser Ala Ser Ser Thr Ser Leu Ser Ser
35 40 45
atg tcc gag gtt gat atc gcc aag ccc aag tcc gag tat ggt gtc atg 192
Met Ser Glu Val Asp Ile Ala Lys Pro Lys Ser Glu Tyr Gly Val Met
50 55 60
ctc gac acc tac ggc aac cag ttc gag gtt ccc gac ttt acc atc aag 240
Leu Asp Thr Tyr Gly Asn Gln Phe Glu Val Pro Asp Phe Thr Ile Lys
65 70 75 80
gac atc tac aat gcc atc cct aag cac tgc ttc aag cgc tcc gct ctc 288
Asp Ile Tyr Asn Ala Ile Pro Lys His Cys Phe Lys Arg Ser Ala Leu
85 90 95
aag gga tac ggt tat atc ctc cgc gac att gtc ctc ctg act acc act 336
Lys Gly Tyr Gly Tyr Ile Leu Arg Asp Ile Val Leu Leu Thr Thr Thr
100 105 110
ttc agc atc tgg tac aac ttt gtg acc ccc gaa tat atc ccc tcc acc 384
Phe Ser Ile Trp Tyr Asn Phe Val Thr Pro Glu Tyr Ile Pro Ser Thr
115 120 125
ccc gcc cgc gct ggt ctg tgg gcc gtg tac acc gtt ctt cag ggt ctt 432
Pro Ala Arg Ala Gly Leu Trp Ala Val Tyr Thr Val Leu Gln Gly Leu
130 135 140
ttc ggt act ggt ctc tgg gtt att gcc cat gag tgc ggt cac ggt gct 480
Phe Gly Thr Gly Leu Trp Val Ile Ala His Glu Cys Gly His Gly Ala
145 150 155 160
ttc tcc gat tct cgc atc atc aac gac att act ggc tgg gtt ctt cac 528
Phe Ser Asp Ser Arg Ile Ile Asn Asp Ile Thr Gly Trp Val Leu His
165 170 175
tct tcc ctc ctt gtc ccc tac ttc agc tgg caa atc tcc cac cga aag 576
Ser Ser Leu Leu Val Pro Tyr Phe Ser Trp Gln Ile Ser His Arg Lys
180 185 190
cac cac aag gcc act ggc aac atg gag cgt gac atg gtc ttc gtt ccc 624
His His Lys Ala Thr Gly Asn Met Glu Arg Asp Met Val Phe Val Pro
195 200 205
cga acc cgc gag cag cag gct act cgt ctc gga aag atg acc cac gag 672
Arg Thr Arg Glu Gln Gln Ala Thr Arg Leu Gly Lys Met Thr His Glu
210 215 220
ctc gct cat ctt act gag gag acc ccc gct ttc act ctt ctc atg ctc 720
Leu Ala His Leu Thr Glu Glu Thr Pro Ala Phe Thr Leu Leu Met Leu
225 230 235 240
gtc ctt cag cag ctc gtt ggc tgg ccc aac tac ctc atc acc aat gtt 768
Val Leu Gln Gln Leu Val Gly Trp Pro Asn Tyr Leu Ile Thr Asn Val
245 250 255
acc ggc cac aac tac cac gag cgc cag cgt gag ggt cgc ggc aag ggc 816
Thr Gly His Asn Tyr His Glu Arg Gln Arg Glu Gly Arg Gly Lys Gly
260 265 270
aag cat aac ggc ctc ggc ggt ggt gtt aac cac ttc gat ccc cgc agc 864
Lys His Asn Gly Leu Gly Gly Gly Val Asn His Phe Asp Pro Arg Ser
275 280 285
cct ctg tac gag aac agt gac gct aag ctc atc gtc ctc agc gat att 912
Pro Leu Tyr Glu Asn Ser Asp Ala Lys Leu Ile Val Leu Ser Asp Ile
290 295 300
ggt atc ggt ctg atg gcc act gct ctg tacttc ctc gtt cag aag ttc 960
Gly Ile Gly Leu Met Ala Thr Ala Leu Tyr Phe Leu ValGln Lys Phe
305 310 315 320
ggt ttc tac aac atg gcc atc tgg tac ttt gtt ccc tac ctc tgg gtt 1008
Gly Phe Tyr Asn Met Ala Ile Trp Tyr Phe Val Pro Tyr Leu Trp Val
325 330 335
aac cac tgg ctc gtt gcc atc acc ttc ctc cag cac acc gac cct acc 1056
Asn His Trp Leu Val Ala Ile Thr Phe Leu Gln His Thr Asp Pro Thr
340 345 350
ctt ccc cac tac acc aac gac gag tgg aac ttc gtc cgt ggt gcc gct 1104
Leu Pro His Tyr Thr Asn Asp Glu Trp Asn Phe Val Arg Gly Ala Ala
355 360 365
gct acc att gac cgt gag atg ggc ttc atc ggc cgc cac ctt ctc cac 1152
Ala Thr Ile Asp Arg Glu Met Gly Phe Ile Gly Arg His Leu Leu His
370 375 380
ggc atc atc gag act cat gtc ctc cac cac tac gtc agc agc atc ccc 1200
Gly Ile Ile Glu Thr His Val Leu His His Tyr Val Ser Ser Ile Pro
385 390 395 400
ttc tac aac gcg gac gag gcc acc gag gcc att aag ccc atc atg ggc 1248
Phe Tyr Asn Ala Asp Glu Ala Thr Glu Ala Ile Lys Pro Ile Met Gly
405 410 415
aag cac tac cgg gct gat gtc cag gat ggt cct cgt ggc ttc atc cgc 1296
Lys His Tyr Arg Ala Asp Val Gln Asp Gly Pro Arg Gly Phe Ile Arg
420 425 430
gcc atg tac cgc agt gcg cgt atg tgc cag tgg gtt gag ccc agc gct 1344
Ala Met Tyr Arg Ser Ala Arg Met Cys Gln Trp Val Glu Pro Ser Ala
435 440 445
ggt gcc gag ggt gct ggt aag ggt gtt ctg ttc ttc cgc aac cgc aac 1392
Gly Ala Glu Gly Ala Gly Lys Gly Val Leu Phe Phe Arg Asn Arg Asn
450 455 460
aac gtg ggc acc ccc ccc gct gtt atc aag ccc gtt gct taa 1434
Asn Val Gly Thr Pro Pro Ala Val Ile Lys Pro Val Ala
465 470 475
<210>30
<211>477
<212>PRT
<213>串珠镰孢菌
<400>30
Met Ala Ser Thr Ser Ala Leu Pro Lys Gln Asn Pro Ala Leu Arg Arg
1 5 10 15
Thr Val Thr Ser Thr Thr Val Thr Asp Ser Glu Ser Ala Ala Val Ser
20 25 30
Pro Ser Asp Ser Pro Arg His Ser Ala Ser Ser Thr Ser Leu Ser Ser
35 40 45
Met Ser Glu Val Asp Ile Ala Lys Pro Lys Ser Glu Tyr Gly Val Met
50 55 60
Leu Asp Thr Tyr Gly Asn Gln Phe Glu Val Pro Asp Phe Thr Ile Lys
65 70 75 80
Asp Ile Tyr Asn Ala Ile Pro Lys His Cys Phe Lys Arg Ser Ala Leu
85 90 95
Lys Gly Tyr Gly Tyr Ile Leu Arg Asp Ile Val Leu Leu Thr Thr Thr
100 105 110
Phe Ser Ile Trp Tyr Asn Phe Val Thr Pro Glu Tyr Ile Pro Ser Thr
115 120 125
Pro Ala Arg Ala Gly Leu Trp Ala Val Tyr Thr Val Leu Gln Gly Leu
130 135 140
Phe Gly Thr Gly Leu Trp Val Ile Ala His Glu Cys Gly His Gly Ala
145 150 155 160
Phe Ser Asp Ser Arg Ile Ile Asn Asp Ile Thr Gly Trp Val Leu His
165 170 175
Ser Ser Leu Leu Val Pro Tyr Phe Ser Trp Gln Ile Ser His Arg Lys
180 185 190
His His Lys Ala Thr Gly Asn Met Glu Arg Asp Met Val Phe Val Pro
195 200 205
Arg Thr Arg Glu Gln Gln Ala Thr Arg Leu Gly Lys Met Thr His Glu
210 215 220
Leu Ala His Leu Thr Glu Glu Thr Pro Ala Phe Thr Leu Leu Met Leu
225 230 235 240
Val Leu Gln Gln Leu Val Gly Trp Pro Asn Tyr Leu Ile Thr Asn Val
245 250 255
Thr Gly His Asn Tyr His Glu Arg Gln Arg Glu Gly Arg Gly Lys Gly
260 265 270
Lys His Asn Gly Leu Gly Gly Gly Val Asn His Phe Asp Pro Arg Ser
275 280 285
Pro Leu Tyr Glu Asn Ser Asp Ala Lys Leu Ile Val Leu Ser Asp Ile
290 295 300
Gly Ile Gly Leu Met Ala Thr Ala Leu Tyr Phe Leu Val Gln Lys Phe
305 310 315 320
Gly Phe Tyr Asn Met Ala Ile Trp Tyr Phe Val Pro Tyr Leu Trp Val
325 330 335
Asn His Trp Leu Val Ala Ile Thr Phe Leu Gln His Thr Asp Pro Thr
340 345 350
Leu Pro His Tyr Thr Asn Asp Glu Trp Asn Phe Val Arg Gly Ala Ala
355 360 365
Ala Thr Ile Asp Arg Glu Met Gly Phe Ile Gly Arg His Leu Leu His
370 375 380
Gly Ile Ile Glu Thr His Val Leu His His Tyr Val Ser Ser Ile Pro
385 390 395 400
Phe Tyr Asn Ala Asp Glu Ala Thr Glu Ala Ile Lys Pro Ile Met Gly
405 410 415
Lys His Tyr Arg Ala Asp Val Gln Asp Gly Pro Arg Gly Phe Ile Arg
420 425 430
Ala Met Tyr Arg Ser Ala Arg Met Cys Gln Trp Val Glu Pro Ser Ala
435 440 445
Gly Ala Glu Gly Ala Gly Lys Gly Val Leu Phe Phe Arg Asn Arg Asn
450 455 460
Asn Val Gly Thr Pro Pro Ala Val Ile Lys Pro Val Ala
465 470 475
<210>31
<211>777
<212>DNA
<213>小眼虫
<220>
<221>CDS
<222>(1)..(777)
<223>合成Δ-9延伸酶(经密码子优化用于解脂耶氏酵母)
<300>
<302>Δ-9延伸酶及其在制备多不饱和脂肪酸中的应用
<310>WO 2007/061742
<311>2006-11-16
<312>2007-05-31
<313>(1)..(777)
<300>
<302>Δ-9延伸酶及其在制备多不饱和脂肪酸中的应用
<310>US-2007-0117190-A1
<311>2006-11-16
<312>2007-05-24
<313>(1)..(777)
<400>31
atg gag gtc gtg aac gaa atc gtc tcc att ggc cag gag gtt ctt ccc 48
Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro
1 5 10 15
aag gtc gac tat gct cag ctc tgg tct gat gcc tcg cac tgc gag gtg 96
Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val
20 25 30
ctg tac ctc tcc atc gcc ttc gtc atc ctg aag ttc acc ctt ggt cct 144
Leu Tyr Leu Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro
35 40 45
ctc gga ccc aag ggt cag tct cga atg aag ttt gtg ttc acc aac tac 192
Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr
50 55 60
aac ctg ctc atg tcc atc tac tcg ctg ggc tcc ttc ctc tct atg gcc 240
Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala
65 70 75 80
tac gcc atg tac acc att ggt gtc atg tcc gac aac tgc gag aag gct 288
Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala
85 90 95
ttc gac aac aat gtc ttc cga atc acc act cag ctg ttc tac ctc agc 336
Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser
100 105 110
aag ttc ctc gag tac att gac tcc ttc tat ctg ccc ctc atg ggc aag 384
Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys
115 120 125
cct ctg acc tgg ttg cag ttc ttt cac cat ctc gga gct cct atg gac 432
Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp
130 135 140
atg tgg ctg ttc tac aac tac cga aac gaa gcc gtt tgg atc ttt gtg 480
Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val
145 150 155 160
ctg ctc aac ggc ttc att cac tgg atc atg tac ggc tac tat tgg acc 528
Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr
165 170 175
cga ctg atc aag ctc aag ttc cct atg ccc aag tcc ctg att act tct 576
Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser
180 185 190
atg cag atc att cag ttc aac gtt ggc ttc tac atc gtc tgg aag tac 624
Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr
195 200 205
cgg aac att ccc tgc tac cga caa gat gga atg aga atg ttt ggc tgg 672
Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp
210 215 220
ttt ttc aac tac ttc tac gtt ggt act gtc ctg tgt ctg ttc ctc aac 720
Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn
225 230 235 240
ttc tac gtg cag acc tac atc gtc cga aag cac aag gga gcc aaa aag 768
Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys
245 250 255
att cag tga 777
Ile Gln
<210>32
<211>258
<212>PRT
<213>小眼虫
<400>32
Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro
1 5 10 15
Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val
20 25 30
Leu Tyr Leu Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro
35 40 45
Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr
50 55 60
Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala
65 70 75 80
Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala
85 90 95
Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser
100 105 110
Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys
115 120 125
Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp
130 135 140
Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val
145 150 155 160
Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr
165 170 175
Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser
180 185 190
Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr
195 200 205
Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp
210 215 220
Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn
225 230 235 240
Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys
245 250 255
Ile Gln
<210>33
<211>34
<212>DNA
<213>大肠杆菌
<400>33
ataacttcgt ataatgtatg ctatacgaag ttat 34
<210>34
<211>828
<212>DNA
<213>高山被孢霉
<220>
<221>CDS
<222>(1)..(828)
<223>合成C16/18延伸酶(经密码子优化用于解脂耶氏酵母)
<300>
<302>高山被孢霉C16/18脂肪酸延伸酶
<310>US 2007-0087420-A1
<311>2005-10-19
<312>2007-04-19
<313>(1)..(828)
<300>
<302>高山被孢霉C16/18脂肪酸延伸酶
<310>WO 2007/046817
<311>2005-11-04
<312>2007-04-26
<313>(1)..(828)
<400>34
atg gag tct gga ccc atg cct gct ggc att ccc ttc cct gag tac tat 48
Met Glu Ser Gly Pro Met Pro Ala Gly Ile Pro Phe Pro Glu Tyr Tyr
1 5 10 15
gac ttc ttt atg gac tgg aag act ccc ctg gcc atc gct gcc acc tac 96
Asp Phe Phe Met Asp Trp Lys Thr Pro Leu Ala Ile Ala Ala Thr Tyr
20 25 30
act gct gcc gtc ggt ctc ttc aac ccc aag gtt ggc aag gtc tcc cga 144
Thr Ala Ala Val Gly Leu Phe Asn Pro Lys Val Gly Lys Val Ser Arg
35 40 45
gtg gtt gcc aag tcg gct aac gca aag cct gcc gag cga acc cag tcc 192
Val Val Ala Lys Ser Ala Asn Ala Lys Pro Ala Glu Arg Thr Gln Ser
50 55 60
gga gct gcc atg act gcc ttc gtc ttt gtg cac aac ctc att ctg tgt 240
Gly Ala Ala Met Thr Ala Phe Val Phe Val His Asn Leu Ile Leu Cys
65 70 75 80
gtc tac tct ggc atc acc ttc tac tac atg ttt cct gct atg gtc aag 288
Val Tyr Ser Gly Ile Thr Phe Tyr Tyr Met Phe Pro Ala Met Val Lys
85 90 95
aac ttc cga acc cac aca ctg cac gaa gcc tac tgc gac acg gat cag 336
Asn Phe Arg Thr His Thr Leu His Glu Ala Tyr Cys Asp Thr Asp Gln
100 105 110
tcc ctc tgg aac aac gca ctt ggc tac tgg ggt tac ctc ttc tac ctg 384
Ser Leu Trp Asn Asn Ala Leu Gly Tyr Trp Gly Tyr Leu Phe Tyr Leu
115 120 125
tcc aag ttc tac gag gtc att gac acc atc atc atc atc ctg aag gga 432
Ser Lys Phe Tyr Glu Val Ile Asp Thr Ile Ile Ile Ile Leu Lys Gly
130 135 140
cga cgg tcc tcg ctg ctt cag acc tac cac cat gct gga gcc atg att 480
Arg Arg Ser Ser Leu Leu Gln Thr Tyr His His Ala Gly Ala Met Ile
145 150 155 160
acc atg tgg tct ggc atc aac tac caa gcc act ccc att tgg atc ttt 528
Thr Met Trp Ser Gly Ile Asn Tyr Gln Ala Thr Pro Ile Trp Ile Phe
165 170 175
gtg gtc ttc aac tcc ttc att cac acc atc atg tac tgt tac tat gcc 576
Val Val Phe Asn Ser Phe Ile His Thr Ile Met Tyr Cys Tyr Tyr Ala
180 185 190
ttc acc tct atc gga ttc cat cct cct ggc aaa aag tac ctg act tcg 624
Phe Thr Ser Ile Gly Phe His Pro Pro Gly Lys Lys Tyr Leu Thr Ser
195 200 205
atg cag att act cag ttt ctg gtc ggt atc acc att gcc gtg tcc tac 672
Met Gln Ile Thr Gln Phe Leu Val Gly Ile Thr Ile Ala Val Ser Tyr
210 215 220
ctc ttc gtt cct ggc tgc atc cga aca ccc ggt gct cag atg gct gtc 720
Leu Phe Val Pro Gly Cys Ile Arg Thr Pro Gly Ala Gln Met Ala Val
225 230 235 240
tgg atc aac gtc ggc tac ctg ttt ccc ttg acc tat ctg ttc gtg gac 768
Trp Ile Asn Val Gly Tyr Leu Phe Pro Leu Thr Tyr Leu Phe Val Asp
245 250 255
ttt gcc aag cga acc tac tcc aag cga tct gcc att gcc gct cag aaa 816
Phe Ala Lys Arg Thr Tyr Ser Lys Arg Ser Ala Ile Ala Ala Gln Lys
260 265 270
aag gct cag taa 828
Lys Ala Gln
275
<210>35
<211>275
<212>PRT
<213>高山被孢霉
<400>35
Met Glu Ser Gly Pro Met Pro Ala Gly Ile Pro Phe Pro Glu Tyr Tyr
1 5 10 15
Asp Phe Phe Met Asp Trp Lys Thr Pro Leu Ala Ile Ala Ala Thr Tyr
20 25 30
Thr Ala Ala Val Gly Leu Phe Asn Pro Lys Val Gly Lys Val Ser Arg
35 40 45
Val Val Ala Lys Ser Ala Asn Ala Lys Pro Ala Glu Arg Thr Gln Ser
50 55 60
Gly Ala Ala Met Thr Ala Phe Val Phe Val His Asn Leu Ile Leu Cys
65 70 75 80
Val Tyr Ser Gly Ile Thr Phe Tyr Tyr Met Phe Pro Ala Met Val Lys
85 90 95
Asn Phe Arg Thr His Thr Leu His Glu Ala Tyr Cys Asp Thr Asp Gln
100 105 110
Ser Leu Trp Asn Asn Ala Leu Gly Tyr Trp Gly Tyr Leu Phe Tyr Leu
115 120 125
Ser Lys Phe Tyr Glu Val Ile Asp Thr Ile Ile Ile Ile Leu Lys Gly
130 135 140
Arg Arg Ser Ser Leu Leu Gln Thr Tyr His His Ala Gly Ala Met Ile
145 150 155 160
Thr Met Trp Ser Gly Ile Asn Tyr Gln Ala Thr Pro Ile Trp Ile Phe
165 170 175
Val Val Phe Asn Ser Phe Ile His Thr Ile Met Tyr Cys Tyr Tyr Ala
180 185 190
Phe Thr Ser Ile Gly Phe His Pro Pro Gly Lys Lys Tyr Leu Thr Ser
195 200 205
Met Gln Ile Thr Gln Phe Leu Val Gly Ile Thr Ile Ala Val Ser Tyr
210 215 220
Leu Phe Val Pro Gly Cys Ile Arg Thr Pro Gly Ala Gln Met Ala Val
225 230 235 240
Trp Ile Asn Val Gly Tyr Leu Phe Pro Leu Thr Tyr Leu Phe Val Asp
245 250 255
Phe Ala Lys Arg Thr Tyr Ser Lys Arg Ser Ala Ile Ala Ala Gln Lys
260 265 270
Lys Ala Gln
275
<210>36
<211>8739
<212>DNA
<213>人工序列
<220>
<223>质粒pY116
<400>36
ggccgccacc gcggcccgag attccggcct cttcggccgc caagcgaccc gggtggacgt 60
ctagaggtac ctagcaatta acagatagtt tgccggtgat aattctctta acctcccaca 120
ctcctttgac ataacgattt atgtaacgaa actgaaattt gaccagatat tgtgtccgcg 180
gtggagctcc agcttttgtt ccctttagtg agggtttaaa cgagcttggc gtaatcatgg 240
tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa cgtacgagcc 300
ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 360
ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 420
ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact 480
gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta 540
atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag 600
caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 660
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 720
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 780
ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc 840
tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 900
gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 960
ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 1020
aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 1080
aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 1140
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 1200
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 1260
gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 1320
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 1380
gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 1440
tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 1500
gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 1560
ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 1620
actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 1680
ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg 1740
tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 1800
cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 1860
ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 1920
ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 1980
tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 2040
agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 2100
atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 2160
gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 2220
aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 2280
tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 2340
aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgcgccc 2400
tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 2460
gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 2520
ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 2580
cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 2640
tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 2700
ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 2760
ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 2820
tttaacaaaa tattaacgct tacaatttcc attcgccatt caggctgcgc aactgttggg 2880
aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg 2940
caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg 3000
ccagtgaatt gtaatacgac tcactatagg gcgaattggg taccgggccc cccctcgagg 3060
tcgatggtgt cgataagctt gatatcgaat tcatgtcaca caaaccgatc ttcgcctcaa 3120
ggaaacctaa ttctacatcc gagagactgc cgagatccag tctacactga ttaattttcg 3180
ggccaataat ttaaaaaaat cgtgttatat aatattatat gtattatata tatacatcat 3240
gatgatactg acagtcatgt cccattgcta aatagacaga ctccatctgc cgcctccaac 3300
tgatgttctc aatatttaag gggtcatctc gcattgttta ataataaaca gactccatct 3360
accgcctcca aatgatgttc tcaaaatata ttgtatgaac ttatttttat tacttagtat 3420
tattagacaa cttacttgct ttatgaaaaa cacttcctat ttaggaaaca atttataatg 3480
gcagttcgtt catttaacaa tttatgtaga ataaatgtta taaatgcgta tgggaaatct 3540
taaatatgga tagcataaat gatatctgca ttgcctaatt cgaaatcaac agcaacgaaa 3600
aaaatccctt gtacaacata aatagtcatc gagaaatatc aactatcaaa gaacagctat 3660
tcacacgtta ctattgagat tattattgga cgagaatcac acactcaact gtctttctct 3720
cttctagaaa tacaggtaca agtatgtact attctcattg ttcatacttc tagtcatttc 3780
atcccacata ttccttggat ttctctccaa tgaatgacat tctatcttgc aaattcaaca 3840
attataataa gatataccaa agtagcggta tagtggcaat caaaaagctt ctctggtgtg 3900
cttctcgtat ttatttttat tctaatgatc cattaaaggt atatatttat ttcttgttat 3960
ataatccttt tgtttattac atgggctgga tacataaagg tattttgatt taattttttg 4020
cttaaattca atcccccctc gttcagtgtc aactgtaatg gtaggaaatt accatacttt 4080
tgaagaagca aaaaaaatga aagaaaaaaa aaatcgtatt tccaggttag acgttccgca 4140
gaatctagaa tgcggtatgc ggtacattgt tcttcgaacg taaaagttgc gctccctgag 4200
atattgtaca tttttgcttt tacaagtaca agtacatcgt acaactatgt actactgttg 4260
atgcatccac aacagtttgt tttgtttttt tttgtttttt ttttttctaa tgattcatta 4320
ccgctatgta tacctacttg tacttgtagt aagccgggtt attggcgttc aattaatcat 4380
agacttatga atctgcacgg tgtgcgctgc gagttacttt tagcttatgc atgctacttg 4440
ggtgtaatat tgggatctgt tcggaaatca acggatgctc aaccgatttc gacagtaatt 4500
aattaatttg aatcgaatcg gagcctaaaa tgaacccgag tatatctcat aaaattctcg 4560
gtgagaggtc tgtgactgtc agtacaaggt gccttcatta tgccctcaac cttaccatac 4620
ctcactgaat gtagtgtacc tctaaaaatg aaatacagtg ccaaaagcca aggcactgag 4680
ctcgtctaac ggacttgata tacaaccaat taaaacaaat gaaaagaaat acagttcttt 4740
gtatcatttg taacaattac cctgtacaaa ctaaggtatt gaaatcccac aatattccca 4800
aagtccaccc ctttccaaat tgtcatgcct acaactcata taccaagcac taacctacca 4860
aacaccacta aaaccccaca aaatatatct taccgaatat acagtaacaa gctaccacca 4920
cactcgttgg gtgcagtcgc cagcttaaag atatctatcc acatcagcca caactccctt 4980
cctttaataa accgactaca cccttggcta ttgaggttat gagtgaatat actgtagaca 5040
agacactttc aagaagactg tttccaaaac gtaccactgt cctccactac aaacacaccc 5100
aatctgcttc ttctagtcaa ggttgctaca ccggtaaatt ataaatcatc atttcattag 5160
cagggcaggg ccctttttat agagtcttat acactagcgg accctgccgg tagaccaacc 5220
cgcaggcgcg tcagtttgct ccttccatca atgcgtcgta gaaacgactt actccttctt 5280
gagcagctcc ttgaccttgt tggcaacaag tctccgacct cggaggtgga ggaagagcct 5340
ccgatatcgg cggtagtgat accagcctcg acggactcct tgacggcagc ctcaacagcg 5400
tcaccggcgg gcttcatgtt aagagagaac ttgagcatca tggcggcaga cagaatggtg 5460
gcaatggggt tgaccttctg cttgccgaga tcgggggcag atccgtgaca gggctcgtac 5520
agaccgaacg cctcgttggt gtcgggcaga gaagccagag aggcggaggg cagcagaccc 5580
agagaaccgg ggatgacgga ggcctcgtcg gagatgatat cgccaaacat gttggtggtg 5640
atgatgatac cattcatctt ggagggctgc ttgatgagga tcatggcggc cgagtcgatc 5700
agctggtggt tgagctcgag ctgggggaat tcgtccttga ggactcgagt gacagtcttt 5760
cgccaaagtc gagaggaggc cagcacgttg gccttgtcaa gagaccacac gggaagaggg 5820
gggttgtgct gaagggccag gaaggcggcc attcgggcaa ttcgctcaac ctcaggaacg 5880
gagtaggtct cggtgtcgga agcgacgcca gatccgtcat cctcctttcg ctctccaaag 5940
tagatacctc cgacgagctc tcggacaatg atgaagtcgg tgccctcaac gtttcggatg 6000
ggggagagat cggcgagctt gggcgacagc agctggcagg gtcgcaggtt ggcgtacagg 6060
ttcaggtcct ttcgcagctt gaggagaccc tgctcgggtc gcacgtcggt tcgtccgtcg 6120
ggagtggtcc atacggtgtt ggcagcgcct ccgacagcac cgagcataat agagtcagcc 6180
tttcggcaga tgtcgagagt agcgtcggtg atgggctcgc cctccttctc aatggcagct 6240
cctccaatga gtcggtcctc aaacacaaac tcggtgccgg aggcctcagc aacagacttg 6300
agcaccttga cggcctcggc aatcacctcg gggccacaga agtcgccgcc gagaagaaca 6360
atcttcttgg agtcagtctt ggtcttctta gtttcgggtt ccattgtgga tgtgtgtggt 6420
tgtatgtgtg atgtggtgtg tggagtgaaa atctgtggct ggcaaacgct cttgtatata 6480
tacgcacttt tgcccgtgct atgtggaaga ctaaacctcc gaagattgtg actcaggtag 6540
tgcggtatcg gctagggacc caaaccttgt cgatgccgat agcgctatcg aacgtacccc 6600
agccggccgg gagtatgtcg gaggggacat acgagatcgt caagggtttg tggccaactg 6660
gtatttaaat gtagctaacg gtagcaggcg aactactggt acatacctcc cccggaatat 6720
gtacaggcat aatgcgtatc tgtgggacat gtggtcgttg cgccattatg taagcagcgt 6780
gtactcctct gactgtccat atggtttgct ccatctcacc ctcatcgttt tcattgttca 6840
caggcggcca caaaaaaact gtcttctctc cttctctctt cgccttagtc tactcggacc 6900
agttttagtt tagcttggcg ccactggata aatgagacct caggccttgt gatgaggagg 6960
tcacttatga agcatgttag gaggtgcttg tatggataga gaagcaccca aaataataag 7020
aataataata aaacaggggg cgttgtcatt tcatatcgtg ttttcaccat caatacacct 7080
ccaaacaatg cccttcatgt ggccagcccc aatattgtcc tgtagttcaa ctctatgcag 7140
ctcgtatctt attgagcaag taaaactctg tcagccgata ttgcccgacc cgcgacaagg 7200
gtcaacaagg tggtgtaagg ccttcgcaga agtcaaaact gtgccaaaca aacatctaga 7260
gtctctttgg tgtttctcgc atatatttwa tcggctgtct tacgtatttg cgcctcggta 7320
ccggactaat ttcggatcat ccccaatacg ctttttcttc gcagctgtca acagtgtcca 7380
tgatctatcc acctaaatgg gtcatatgag gcgtataatt tcgtggtgct gataataatt 7440
cccatatatt tgacacaaaa cttccccccc tagacataca tctcacaatc tcacttcttg 7500
tgcttctgtc acacatctcc tccagctgac ttcaactcac acctctgccc cagttggtct 7560
acagcggtat aaggtttctc cgcatagagg tgcaccactc ctcccgatac ttgtttgtgt 7620
gacttgtggg tcacgacata tatatctaca cacattgcgc caccctttgg ttcttccagc 7680
acaacaaaaa cacgacacgc taaccatggc caatttactg accgtacacc aaaatttgcc 7740
tgcattaccg gtcgatgcaa cgagtgatga ggttcgcaag aacctgatgg acatgttcag 7800
ggatcgccag gcgttttctg agcatacctg gaaaatgctt ctgtccgttt gccggtcgtg 7860
ggcggcatgg tgcaagttga ataaccggaa atggtttccc gcagaacctg aagatgttcg 7920
cgattatctt ctatatcttc aggcgcgcgg tctggcagta aaaactatcc agcaacattt 7980
gggccagcta aacatgcttc atcgtcggtc cgggctgcca cgaccaagtg acagcaatgc 8040
tgtttcactg gttatgcggc ggatccgaaa agaaaacgtt gatgccggtg aacgtgcaaa 8100
acaggctcta gcgttcgaac gcactgattt cgaccaggtt cgttcactca tggaaaatag 8160
cgatcgctgc caggatatac gtaatctggc atttctgggg attgcttata acaccctgtt 8220
acgtatagcc gaaattgcca ggatcagggt taaagatatc tcacgtactg acggtgggag 8280
aatgttaatc catattggca gaacgaaaac gctggttagc accgcaggtg tagagaaggc 8340
acttagcctg ggggtaacta aactggtcga gcgatggatt tccgtctctg gtgtagctga 8400
tgatccgaat aactacctgt tttgccgggt cagaaaaaat ggtgttgccg cgccatctgc 8460
caccagccag ctatcaactc gcgccctgga agggattttt gaagcaactc atcgattgat 8520
ttacggcgct aaggatgact ctggtcagag atacctggcc tggtctggac acagtgcccg 8580
tgtcggagcc gcgcgagata tggcccgcgc tggagtttca ataccggaga tcatgcaagc 8640
tggtggctgg accaatgtaa atattgtcat gaactatatc cgtaacctgg atagtgaaac 8700
aggggcaatg gtgcgcctgc tggaagatgg cgattaagc 8739
<210>37
<211>15337
<212>DNA
<213>人工序列
<220>
<223>质粒pK02UF8289
<400>37
cgatcgagga agaggacaag cggctgcttc ttaagtttgt gacatcagta tccaaggcac 60
cattgcaagg attcaaggct ttgaacccgt catttgccat tcgtaacgct ggtagacagg 120
ttgatcggtt ccctacggcc tccacctgtg tcaatcttct caagctgcct gactatcagg 180
acattgatca acttcggaag aaacttttgt atgccattcg atcacatgct ggtttcgatt 240
tgtcttagag gaacgcatat acagtaatca tagagaataa acgatattca tttattaaag 300
tagatagttg aggtagaagt tgtaaagagt gataaatagc ggccgctcac tgaatctttt 360
tggctccctt gtgcttcctg acgatatacg tttgcacata gaaattcaag aacaaacaca 420
agactgtgcc aacataaaag taattgaaga accagccaaa catcctcatc ccatcttggc 480
gataacaggg aatgttcctg tacttccaga caatgtagaa accaacattg aattgaatga 540
tctgcattga tgtaatcagg gattttggca tggggaactt cagcttgatc aatctggtcc 600
aataataacc gtacatgatc cagtggatga aaccattcaa cagcacaaaa atccaaacag 660
cttcatttcg gtaattatag aacagccaca tatccatcgg tgcccccaaa tgatggaaga 720
attgcaacca ggtcagaggc ttgcccatca gtggcaaata gaaggagtca atatactcca 780
ggaacttgct caaatagaac aactgcgtgg tgatcctgaa gacgttgttg tcaaaagcct 840
tctcgcagtt gtcagacata acaccgatgg tgtacatggc atatgccatt gagaggaatg 900
atcccaacga ataaatggac atgagaaggt tgtaattggt gaaaacaaac ttcatacgag 960
actgaccttt tggaccaagg gggccaagag tgaacttcaa gatgacaaat gcgatggaca 1020
agtaaagcac ctcacagtga ctggcatcac tccagagttg ggcataatca actggttggg 1080
taaaacttcc tgcccaattg agactatttc attcaccacc tccatggtta gcgtgtcgtg 1140
tttttgttgt gctggaagaa ccaaagggtg gcgcaatgtg tgtagatata tatgtcgtga 1200
cccacaagtc acacaaacaa gtatcgggag gagtggtgca cctctatgcg gagaaacctt 1260
ataccgctgt agaccaactg gggcagaggt gtgagttgaa gtcagctgga ggagatgtgt 1320
gacagaagca caagaagtga gattgtgaga tgtatgtcta gggggggaag ttttgtgtca 1380
aatatatggg aattattatc agcaccacga aattatacgc ctcatatgac ccatttaggt 1440
ggatagatca tggacactgt tgacagctgc gaagaaaaag cgtattgggg atgatccgaa 1500
attagtccgg taccgaggcg caaatacgta agacagccga twaaatatat gcgagaaaca 1560
ccaaagagac tctagatgtt tgtttggcac agttttgact tctgcgaagg ccttacacca 1620
ccttgttgac ccttgtcgcg ggtcgggcaa tatcggctga cagagtttta cttgctcaat 1680
aagatacgag ctgcatagag ttgaactaca ggacaatatt ggggctggcc acatgaaggg 1740
cattgtttgg aggtgtattg atggtgaaaa cacgatatga aatgacaacg ccccctgttt 1800
tattattatt cttattattt tgggtgcttc tctatccata caagcacctc ctaacatgct 1860
tcataagtga cctcctcatc acaaggcctg aggtctcatt tatccagtgg cgccaagcta 1920
aactaaaact ggtccgagta gactaaggcg aagagagaag gagagaagac agtttttttg 1980
tggccgcctg tgaacaatga aaacgatgag ggtgagatgg agcaaaccat atggtttaaa 2040
cagtcagagg agtacacgct gcttacataa tggcgcaacg accacatgtc ccacagatac 2100
gcatcgattc gattcaaatt aattaaaagg cgttgaaaca gaatgagcca gacagcaagg 2160
acaaggtggc caacagcaag gagtccaaaa agccctctat tgacgagatc cacgatgtta 2220
ttgctcatga ggtttccgag ctcgatgctg ggaagaagaa gtgatttgta tataagaaat 2280
aaatgagata tagtaaagga gtgcaagaga atggcaaggt ggtcaaattc tatattactt 2340
gcagtcactg gttcctcgtt gacatgaatg aagttaccgt tggcatagct gatttaatat 2400
ataactgtcc aactaactct cacctagata taacccatgt gtgtgtttcc aatcatcaat 2460
gcggccgctt actgagcctt ggcaccgggc tgcttctcgg ccattcgagc gaactgggac 2520
aggtatcgga gcaggatgac gagaccttca tggggcagag ggtttcggta ggggaggttg 2580
tgcttctggc acagctgttc cacctggtag gaaacggcag tgaggttgtg tcgaggcagg 2640
gtgggccaga gatggtgctc gatctggtag ttcaggcctc caaagaacca gtcagtaatg 2700
atgcctcgtc gaatgttcat ggtctcatgg atctgaccca cagagaagcc atgtccgtcc 2760
cagacggaat caccgatctt ctccagaggg tagtggttca tgaagaccac gatggcaatt 2820
ccgaagccac cgacgagctc ggaaacaaag aacaccagca tcgaggtcag gatggagggc 2880
ataaagaaga ggtggaacag ggtcttgaga gtccagtgca gagcgagtcc aatggcctct 2940
ttcttgtact gagatcggta gaactggttg tctcggtcct tgagggatcg aacggtcagc 3000
acagactgga aacaccagat gaatcgcagg agaatacaga tgaccaggaa atagtactgt 3060
tggaactgaa tgagctttcg ggagatggga gaagctcgag tgacatcgtc ctcggaccag 3120
gcgagcagag gcaggttatc aatgtcggga tcgtgaccct gaacgttggt agcagaatga 3180
tgggcgttgt gtctgtcctt ccaccaggtc acggagaagc cctggagtcc gttgccaaag 3240
accagaccca ggacgttatt ccagtttcgg ttcttgaagg tctggtggtg gcagatgtca 3300
tgagacagcc atcccatttg ctggtagtgc ataccgagca cgagagcacc aatgaagtac 3360
aggtggtact ggaccagcat gaagaaggca agcacgccaa gacccagggt ggtcaagatc 3420
ttgtacgagt accagagggg agaggcgtca aacatgccag tggcgatcag ctcttctcgg 3480
agctttcgga aatcctcctg agcttcgttg acggcagcct ggggaggcag ctcggaagcc 3540
tggttgatct tgggcattcg cttgagcttg tcgaaggctt cctgagagtg cataaccatg 3600
aaggcgtcag tagcatctcg tccctggtag ttctcaatga tttcagctcc accagggtgg 3660
aagttcaccc aagcggagac gtcgtacacc tttccgtcga tgacgagggg cagagcctgt 3720
cgagaagcct tcaccatggc cattgctgta gatatgtctt gtgtgtaagg gggttggggt 3780
ggttgtttgt gttcttgact tttgtgttag caagggaaga cgggcaaaaa agtgagtgtg 3840
gttgggaggg agagacgagc cttatatata atgcttgttt gtgtttgtgc aagtggacgc 3900
cgaaacgggc aggagccaaa ctaaacaagg cagacaatgc gagcttaatt ggattgcctg 3960
atgggcaggg gttagggctc gatcaatggg ggtgcgaagt gacaaaattg ggaattaggt 4020
tcgcaagcaa ggctgacaag actttggccc aaacatttgt acgcggtgga caacaggagc 4080
cacccatcgt ctgtcacggg ctagccggtc gtgcgtcctg tcaggctcca cctaggctcc 4140
atgccactcc atacaatccc actagtgtac cgctaggccg cttttagctc ccatctaaga 4200
cccccccaaa acctccactg tacagtgcac tgtactgtgt ggcgatcaag ggcaagggaa 4260
aaaaggcgca aacatgcacg catggaatga cgtaggtaag gcgttactag actgaaaagt 4320
ggcacatttc ggcgtgccaa agggtcctag gtgcgtttcg cgagctgggc gccaggccaa 4380
gccgctccaa aacgcctctc cgactccctc cagcggcctc catatcccca tccctctcca 4440
cagcaatgtt gttaagcctt gcaaacgaaa aaatagaaag gctaataagc ttccaatatt 4500
gtggtgtacg ctgcataacg caacaatgag cgccaaacaa cacacacaca cagcacacag 4560
cagcattaac cacgatgttt aaacagtgta cgcagatccc gtcaacagtt ttatatatcg 4620
tagttacaac catcaacact ttttggtaag tgtaccattc tatactccaa ctggtctgca 4680
actgtacaag tagacatgtt aatggtagtt aataacatct acagcagaac ctatggtaaa 4740
gacattgcat ttttacagga agtatcgtcc tacacgttga taaatccaaa gatgcggaac 4800
ttcttccact tttatcatca tcccctactc gtacactcgt actctttgtt cgatcgcgat 4860
tcatttctat aaataatctt gtatgtacat gcggccgctt aagcaacggg cttgataaca 4920
gcgggggggg tgcccacgtt gttgcggttg cggaagaaca gaacaccctt accagcaccc 4980
tcggcaccag cgctgggctc aacccactgg cacatacgcg cactgcggta catggcgcgg 5040
atgaagccac gaggaccatc ctggacatca gcccggtagt gcttgcccat gatgggctta 5100
atggcctcgg tggcctcgtc cgcgttgtag aaggggatgc tgctgacgta gtggtggagg 5160
acatgagtct cgatgatgcc gtggagaagg tggcggccga tgaagcccat ctcacggtca 5220
atggtagcag cggcaccacg gacgaagttc cactcgtcgt tggtgtagtg gggaagggta 5280
gggtcggtgt gctggaggaa ggtgatggca acgagccagt ggttaaccca gaggtaggga 5340
acaaagtacc agatggccat gttgtagaaa ccgaacttct gaacgaggaa gtacagagca 5400
gtggccatca gaccgatacc aatatcgctg aggacgatga gcttagcgtc actgttctcg 5460
tacagagggc tgcggggatc gaagtggtta acaccaccgc cgaggccgtt atgcttgccc 5520
ttgccgcgac cctcacgctg gcgctcgtgg tagttgtggc cggtaacatt ggtgatgagg 5580
tagttgggcc agccaacgag ctgctgaagg acgagcatga gaagagtgaa agcgggggtc 5640
tcctcagtaa gatgagcgag ctcgtgggtc atctttccga gacgagtagc ctgctgctcg 5700
cgggttcggg gaacgaagac catgtcacgc tccatgttgc cagtggcctt gtggtgcttt 5760
cggtgggaga tttgccagct gaagtagggg acaaggaggg aagagtgaag aacccagcca 5820
gtaatgtcgt tgatgatgcg agaatcggag aaagcaccgt gaccgcactc atgggcaata 5880
acccagagac cagtaccgaa aagaccctga agaacggtgt acacggccca cagaccagcg 5940
cgggcggggg tggaggggat atattcgggg gtcacaaagt tgtaccagat gctgaaagtg 6000
gtagtcagga ggacaatgtc gcggaggata taaccgtatc ccttgagagc ggagcgcttg 6060
aagcagtgct tagggatggc attgtagatg tccttgatgg taaagtcggg aacctcgaac 6120
tggttgccgt aggtgtcgag catgacacca tactcggact tgggcttggc gatatcaacc 6180
tcggacatgg acgagagcga tgtggaagag gccgagtggc ggggagagtc tgaaggagag 6240
acggcggcag actcagaatc cgtcacagta gttgaggtga cggtgcgtct aagcgcaggg 6300
ttctgcttgg gcagagccga agtggacgcc atggttgtga attagggtgg tgagaatggt 6360
tggttgtagg gaagaatcaa aggccggtct cgggatccgt gggtatatat atatatatat 6420
atatatacga tccttcgtta cctccctgtt ctcaaaactg tggtttttcg tttttcgttt 6480
tttgcttttt ttgatttttt tagggccaac taagcttcca gatttcgcta atcacctttg 6540
tactaattac aagaaaggaa gaagctgatt agagttgggc tttttatgca actgtgctac 6600
tccttatctc tgatatgaaa gtgtagaccc aatcacatca tgtcatttag agttggtaat 6660
actgggagga tagataaggc acgaaaacga gccatagcag acatgctggg tgtagccaag 6720
cagaagaaag tagatgggag ccaattgacg agcgagggag ctacgccaat ccgacatacg 6780
acacgctgag atcgtcttgg ccggggggta cctacagatg tccaagggta agtgcttgac 6840
tgtaattgta tgtctgagga caaatatgta gtcagccgta taaagtcata ccaggcacca 6900
gtgccatcat cgaaccacta actctctatg atacatgcct ccggtattat tgtaccatgc 6960
gtcgctttgt tacatacgta tcttgccttt ttctctcaga aactccagac tttggctatt 7020
ggtcgagata agcccggacc atagtgagtc tttcacactc tacatttctc ccttgctcca 7080
actatttaaa ttgccccgga gaagacggcc aggccgccta gatgacaaat tcaacaactc 7140
acagctgact ttctgccatt gccactaggg gggggccttt ttatatggcc aagccaagct 7200
ctccacgtcg gttgggctgc acccaacaat aaatgggtag ggttgcacca acaaagggat 7260
gggatggggg gtagaagata cgaggataac ggggctcaat ggcacaaata agaacgaata 7320
ctgccattaa gactcgtgat ccagcgactg acaccattgc atcatctaag ggcctcaaaa 7380
ctacctcgga actgctgcgc tgatctggac accacagagg ttccgagcac tttaggttgc 7440
accaaatgtc ccaccaggtg caggcagaaa acgctggaac agcgtgtaca gtttgtctta 7500
acaaaaagtg agggcgctga ggtcgagcag ggtggtgtga cttgttatag cctttagagc 7560
tgcgaaagcg cgtatggatt tggctcatca ggccagattg agggtctgtg gacacatgtc 7620
atgttagtgt acttcaatcg ccccctggat atagccccga caataggccg tggcctcatt 7680
tttttgcctt ccgcacattt ccattgctcg gtacccacac cttgcttctc ctgcacttgc 7740
caaccttaat actggtttac attgaccaac atcttacaag cggggggctt gtctagggta 7800
tatataaaca gtggctctcc caatcggttg ccagtctctt ttttcctttc tttccccaca 7860
gattcgaaat ctaaactaca catcacagaa ttccgagccg tgagtatcca cgacaagatc 7920
agtgtcgaga cgacgcgttt tgtgtaatga cacaatccga aagtcgctag caacacacac 7980
tctctacaca aactaaccca gctctggtac catggtgaag gcttctcgac aggctctgcc 8040
cctcgtcatc gacggaaagg tgtacgacgt ctccgcttgg gtgaacttcc accctggtgg 8100
agctgaaatc attgagaact accagggacg agatgctact gacgccttca tggttatgca 8160
ctctcaggaa gccttcgaca agctcaagcg aatgcccaag atcaaccagg cttccgagct 8220
gcctccccag gctgccgtca acgaagctca ggaggatttc cgaaagctcc gagaagagct 8280
gatcgccact ggcatgtttg acgcctctcc cctctggtac tcgtacaaga tcttgaccac 8340
cctgggtctt ggcgtgcttg ccttcttcat gctggtccag taccacctgt acttcattgg 8400
tgctctcgtg ctcggtatgc actaccagca aatgggatgg ctgtctcatg acatctgcca 8460
ccaccagacc ttcaagaacc gaaactggaa taacgtcctg ggtctggtct ttggcaacgg 8520
actccagggc ttctccgtga cctggtggaa ggacagacac aacgcccatc attctgctac 8580
caacgttcag ggtcacgatc ccgacattga taacctgcct ctgctcgcct ggtccgagga 8640
cgatgtcact cgagcttctc ccatctcccg aaagctcatt cagttccaac agtactattt 8700
cctggtcatc tgtattctcc tgcgattcat ctggtgtttc cagtctgtgc tgaccgttcg 8760
atccctcaag gaccgagaca accagttcta ccgatctcag tacaagaaag aggccattgg 8820
actcgctctg cactggactc tcaagaccct gttccacctc ttctttatgc cctccatcct 8880
gacctcgatg ctggtgttct ttgtttccga gctcgtcggt ggcttcggaa ttgccatcgt 8940
ggtcttcatg aaccactacc ctctggagaa gatcggtgat tccgtctggg acggacatgg 9000
cttctctgtg ggtcagatcc atgagaccat gaacattcga cgaggcatca ttactgactg 9060
gttctttgga ggcctgaact accagatcga gcaccatctc tggcccaccc tgcctcgaca 9120
caacctcact gccgtttcct accaggtgga acagctgtgc cagaagcaca acctccccta 9180
ccgaaaccct ctgccccatg aaggtctcgt catcctgctc cgatacctgt cccagttcgc 9240
tcgaatggcc gagaagcagc ccggtgccaa ggctcagtaa gcggccgcaa gtgtggatgg 9300
ggaagtgagt gcccggttct gtgtgcacaa ttggcaatcc aagatggatg gattcaacac 9360
agggatatag cgagctacgt ggtggtgcga ggatatagca acggatattt atgtttgaca 9420
cttgagaatg tacgatacaa gcactgtcca agtacaatac taaacatact gtacatactc 9480
atactcgtac ccgggcaacg gtttcacttg agtgcagtgg ctagtgctct tactcgtaca 9540
gtgtgcaata ctgcgtatca tagtctttga tgtatatcgt attcattcat gttagttgcg 9600
tacgggtgaa gcttccactg gtcggcgtgg tagtggggca gagtggggtc ggtgtgctgc 9660
aggtaggtga tggccacgag ccagtggttg acccacaggt aggggatcag gtagtagagg 9720
gtgacggaag ccaggcccca tcggttgatg gagtatgcga tgacggacat ggtgatacca 9780
ataccgacgt tagagatcca gatgttgaac cagtccttct tctcaaacag cggggcgttg 9840
gggttgaagt ggttgacagc ccatttgttg agcttggggt acttctgtcc ggtaacgtaa 9900
gacagcagat acagaggcca tccaaacacc tgctgggtga tgaggccgta gagggtcatg 9960
aggggagcgt cctcagcaag ctcagaccag tcatgggcgc ctcggttctc cataaactcc 10020
tttcggtcct tgggcacaaa caccatatca cgggtgaggt gaccagtgga cttgtggtgc 10080
atggagtggg tcagcttcca ggcgtagtaa gggaccagca tggaggagtg cagaacccat 10140
ccggtgacgt tgttgacggt gttagagtcg gagaaagcag agtggccaca ctcgtgggca 10200
agaacccaca gaccggtgcc aaacagaccc tggacaatgg agtacatggc ccaggccaca 10260
gctcggccgg aagccgaggg aataagaggc aggtacgcgt aggccatgta ggcaaaaacg 10320
gcgataaaga agcaggcgcg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc 10380
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 10440
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 10500
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 10560
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 10620
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 10680
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 10740
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 10800
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 10860
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 10920
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 10980
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 11040
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 11100
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 11160
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 11220
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 11280
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 11340
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 11400
gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 11460
agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 11520
cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 11580
tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 11640
gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 11700
agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 11760
gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 11820
atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 11880
gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 11940
tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 12000
atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 12060
agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 12120
gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 12180
cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 12240
tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 12300
ccgcgcacat ttccccgaaa agtgccacct gatgcggtgt gaaataccgc acagatgcgt 12360
aaggagaaaa taccgcatca ggaaattgta agcgttaata ttttgttaaa attcgcgtta 12420
aatttttgtt aaatcagctc attttttaac caataggccg aaatcggcaa aatcccttat 12480
aaatcaaaag aatagaccga gatagggttg agtgttgttc cagtttggaa caagagtcca 12540
ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 12600
ccactacgtg aaccatcacc ctaatcaagt tttttggggt cgaggtgccg taaagcacta 12660
aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg 12720
gcgagaaagg aagggaagaa agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg 12780
gtcacgctgc gcgtaaccac cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc 12840
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 12900
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 12960
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt gtaatacgac tcactatagg 13020
gcgaattggg cccgacgtcg catgcttgaa tctacaagta ggagggttgg agtgattaag 13080
tgaaacttct ttaacggctc tatgccagtt ctattgatat ccgaaacatc agtatgaagg 13140
tctgataagg gtgacttctt cccacagatt cgtatcagta cgagtacgag accggtactt 13200
gtaacagtat tgatactaaa gggaaactac aacggttgtc agcgtaatgt gacttcgccc 13260
atgaacgcag acacgcagtg ccgagtgcgg tgatatcgcc tactcgttac gtccatggac 13320
tacacaaccc ctcggcttcg cttggcttag cctcgggctc ggtgctgttc agttaaaaca 13380
caatcaaata acatttctac tttttagaag gcaggccgtc aggagcaact ccgactccat 13440
tgacgtttct aaacatctga atgccttcct taccttcaac aaactggcag gttcgggcga 13500
cagtgtaaag agacttgatg aagttggtgt cgtcgtgtcg gtagtgcttg cccatgacct 13560
tcttgatctt ctcagtggcg attcgggcgt tgtagaaggg aattccttta cctgcaggat 13620
aacttcgtat aatgtatgct atacgaagtt atgatctctc tcttgagctt ttccataaca 13680
agttcttctg cctccaggaa gtccatgggt ggtttgatca tggttttggt gtagtggtag 13740
tgcagtggtg gtattgtgac tggggatgta gttgagaata agtcatacac aagtcagctt 13800
tcttcgagcc tcatataagt ataagtagtt caacgtatta gcactgtacc cagcatctcc 13860
gtatcgagaa acacaacaac atgccccatt ggacagatca tgcggataca caggttgtgc 13920
agtatcatac atactcgatc agacaggtcg tctgaccatc atacaagctg aacaagcgct 13980
ccatacttgc acgctctcta tatacacagt taaattacat atccatagtc taacctctaa 14040
cagttaatct tctggtaagc ctcccagcca gccttctggt atcgcttggc ctcctcaata 14100
ggatctcggt tctggccgta cagacctcgg ccgacaatta tgatatccgt tccggtagac 14160
atgacatcct caacagttcg gtactgctgt ccgagagcgt ctcccttgtc gtcaagaccc 14220
accccggggg tcagaataag ccagtcctca gagtcgccct taggtcggtt ctgggcaatg 14280
aagccaacca caaactcggg gtcggatcgg gcaagctcaa tggtctgctt ggagtactcg 14340
ccagtggcca gagagccctt gcaagacagc tcggccagca tgagcagacc tctggccagc 14400
ttctcgttgg gagaggggac taggaactcc ttgtactggg agttctcgta gtcagagacg 14460
tcctccttct tctgttcaga gacagtttcc tcggcaccag ctcgcaggcc agcaatgatt 14520
ccggttccgg gtacaccgtg ggcgttggtg atatcggacc actcggcgat tcggtgacac 14580
cggtactggt gcttgacagt gttgccaata tctgcgaact ttctgtcctc gaacaggaag 14640
aaaccgtgct taagagcaag ttccttgagg gggagcacag tgccggcgta ggtgaagtcg 14700
tcaatgatgt cgatatgggt tttgatcatg cacacataag gtccgacctt atcggcaagc 14760
tcaatgagct ccttggtggt ggtaacatcc agagaagcac acaggttggt tttcttggct 14820
gccacgagct tgagcactcg agcggcaaag gcggacttgt ggacgttagc tcgagcttcg 14880
taggagggca ttttggtggt gaagaggaga ctgaaataaa tttagtctgc agaacttttt 14940
atcggaacct tatctggggc agtgaagtat atgttatggt aatagttacg agttagttga 15000
acttatagat agactggact atacggctat cggtccaaat tagaaagaac gtcaatggct 15060
ctctgggcgt cgcctttgcc gacaaaaatg tgatcatgat gaaagccagc aatgacgttg 15120
cagctgatat tgttgtcggc caaccgcgcc gaaaacgcag ctgtcagacc cacagcctcc 15180
aacgaagaat gtatcgtcaa agtgatccaa gcacactcat agttggagtc gtactccaaa 15240
ggcggcaatg acgagtcaga cagatactcg tcgacgcgat aacttcgtat aatgtatgct 15300
atacgaagtt atcgtacgat agttagtaga caacaat 15337
<210>38
<211>1272
<212>DNA
<213>人工序列
<220>
<223>突变型EgD8MΔ-8去饱和酶(也称为“EgD8S-23”)
<220>
<221>CDS
<222>(2)..(1270)
<400>38
c atg gtg aag gct tct cga cag gct ctg ccc ctc gtc atc gac gga aag 49
Met Val Lys Ala Ser Arg Gln Ala Leu Pro Leu Val Ile Asp Gly Lys
1 5 10 15
gtg tac gac gtc tcc gct tgg gtg aac ttc cac cct ggt gga gct gaa 97
Val Tyr Asp Val Ser Ala Trp Val Asn Phe His Pro Gly Gly Ala Glu
20 25 30
atc att gag aac tac cag gga cga gat gct act gac gcc ttc atg gtt 145
Ile Ile Glu Asn Tyr Gln Gly Arg Asp Ala Thr Asp Ala Phe Met Val
35 40 45
atg cac tct cag gaa gcc ttc gac aag ctc aag cga atg ccc aag atc 193
Met His Ser Gln Glu Ala Phe Asp Lys Leu Lys Arg Met Pro Lys Ile
50 55 60
aac cag gct tcc gag ctg cct ccc cag gct gcc gtc aac gaa gct cag 241
Asn Gln Ala Ser Glu Leu Pro Pro Gln Ala Ala Val Asn Glu Ala Gln
65 70 75 80
gag gat ttc cga aag ctc cga gaa gag ctg atc gcc act ggc atg ttt 289
Glu Asp Phe Arg Lys Leu Arg Glu Glu Leu Ile Ala Thr Gly Met Phe
85 90 95
gac gcc tct ccc ctc tgg tac tcg tac aag atc ttg acc acc ctg ggt 337
Asp Ala Ser Pro Leu Trp Tyr Ser Tyr Lys Ile Leu Thr Thr Leu Gly
100 105 110
ctt ggc gtg ctt gcc ttc ttc atg ctg gtc cag tac cac ctg tac ttc 385
Leu Gly Val Leu Ala Phe Phe Met Leu Val Gln Tyr His Leu Tyr Phe
115 120 125
att ggt gct ctc gtg ctc ggt atg cac tac cag caa atg gga tgg ctg 433
Ile Gly Ala Leu Val Leu Gly Met His Tyr Gln Gln Met Gly Trp Leu
130 135 140
tct cat gac atc tgc cac cac cag acc ttc aag aac cga aac tgg aat 481
Ser His Asp Ile Cys His His Gln Thr Phe Lys Asn Arg Asn Trp Asn
145 150 155 160
aac gtc ctg ggt ctg gtc ttt ggc aac gga ctc cag ggc ttc tcc gtg 529
Asn Val Leu Gly Leu Val Phe Gly Asn Gly Leu Gln Gly Phe Ser Val
165 170 175
acc tgg tgg aag gac aga cac aac gcc cat cat tct gct acc aac gtt 577
Thr Trp Trp Lys Asp Arg His Asn Ala His His Ser Ala Thr Asn Val
180 185 190
cag ggt cac gat ccc gac att gat aac ctg cct ctg ctc gcc tgg tcc 625
Gln Gly His Asp Pro Asp Ile Asp Asn Leu Pro Leu Leu Ala Trp Ser
195 200 205
gag gac gat gtc act cga gct tct ccc atc tcc cga aag ctc att cag 673
Glu Asp Asp Val Thr Arg Ala Ser Pro Ile Ser Arg Lys Leu Ile Gln
210 215 220
ttc caa cag tac tat ttc ctg gtc atc tgt att ctc ctg cga ttc atc 721
Phe Gln Gln Tyr Tyr Phe Leu Val Ile Cys Ile Leu Leu Arg Phe Ile
225 230 235 240
tgg tgt ttc cag tct gtg ctg acc gtt cga tcc ctc aag gac cga gac 769
Trp Cys Phe Gln Ser Val Leu Thr Val Arg Ser Leu Lys Asp Arg Asp
245 250 255
aac cag ttc tac cga tct cag tac aag aaa gag gcc att gga ctc gct 817
Asn Gln Phe Tyr Arg Ser Gln Tyr Lys Lys Glu Ala Ile Gly Leu Ala
260 265 270
ctg cac tgg act ctc aag acc ctg ttc cac ctc ttc ttt atg ccc tcc 865
Leu His Trp Thr Leu Lys Thr Leu Phe His Leu Phe Phe Met Pro Ser
275 280 285
atc ctg acc tcg atg ctg gtg ttc ttt gtt tcc gag ctc gtc ggt ggc 913
Ile Leu Thr Ser Met Leu Val Phe Phe Val Ser Glu Leu Val Gly Gly
290 295 300
ttc gga att gcc atc gtg gtc ttc atg aac cac tac cct ctg gag aag 961
Phe Gly Ile Ala Ile Val Val Phe Met Asn His Tyr Pro Leu Glu Lys
305 310 315 320
atc ggt gat tcc gtc tgg gac gga cat ggc ttc tct gtg ggt cag atc 1009
Ile Gly Asp Ser Val Trp Asp Gly His Gly Phe Ser Val Gly Gln Ile
325 330 335
cat gag acc atg aac att cga cga ggc atc att act gac tgg ttc ttt 1057
His Glu Thr Met Asn Ile Arg Arg Gly Ile Ile Thr Asp Trp Phe Phe
340 345 350
gga ggc ctg aac tac cag atc gag cac cat ctc tgg ccc acc ctg cct 1105
Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu Trp Pro Thr Leu Pro
355 360 365
cga cac aac ctc act gcc gtt tcc tac cag gtg gaa cag ctg tgc cag 1153
Arg His Asn Leu Thr Ala Val Ser Tyr Gln Val Glu Gln Leu Cys Gln
370 375 380
aag cac aac ctc ccc tac cga aac cct ctg ccc cat gaa ggt ctc gtc 1201
Lys His Asn Leu Pro Tyr Arg Asn Pro Leu Pro His Glu Gly Leu Val
385 390 395 400
atc ctg ctc cga tac ctg tcc cag ttc gct cga atg gcc gag aag cag 1249
Ile Leu Leu Arg Tyr Leu Ser Gln Phe Ala Arg Met Ala Glu Lys Gln
405 410 415
ccc ggt gcc aag gct cag taa gc 1272
Pro Gly Ala Lys Ala Gln
420
<210>39
<211>422
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>39
Met Val Lys Ala Ser Arg Gln Ala Leu Pro Leu Val Ile Asp Gly Lys
1 5 10 15
Val Tyr Asp Val Ser Ala Trp Val Asn Phe His Pro Gly Gly Ala Glu
20 25 30
Ile Ile Glu Asn Tyr Gln Gly Arg Asp Ala Thr Asp Ala Phe Met Val
35 40 45
Met His Ser Gln Glu Ala Phe Asp Lys Leu Lys Arg Met Pro Lys Ile
50 55 60
Asn Gln Ala Ser Glu Leu Pro Pro Gln Ala Ala Val Asn Glu Ala Gln
65 70 75 80
Glu Asp Phe Arg Lys Leu Arg Glu Glu Leu Ile Ala Thr Gly Met Phe
85 90 95
Asp Ala Ser Pro Leu Trp Tyr Ser Tyr Lys Ile Leu Thr Thr Leu Gly
100 105 110
Leu Gly Val Leu Ala Phe Phe Met Leu Val Gln Tyr His Leu Tyr Phe
115 120 125
Ile Gly Ala Leu Val Leu Gly Met His Tyr Gln Gln Met Gly Trp Leu
130 135 140
Ser His Asp Ile Cys His His Gln Thr Phe Lys Asn Arg Asn Trp Asn
145 150 155 160
Asn Val Leu Gly Leu Val Phe Gly Asn Gly Leu Gln Gly Phe Ser Val
165 170 175
Thr Trp Trp Lys Asp Arg His Asn Ala His His Ser Ala Thr Asn Val
180 185 190
Gln Gly His Asp Pro Asp Ile Asp Asn Leu Pro Leu Leu Ala Trp Ser
195 200 205
Glu Asp Asp Val Thr Arg Ala Ser Pro Ile Ser Arg Lys Leu Ile Gln
210 215 220
Phe Gln Gln Tyr Tyr Phe Leu Val Ile Cys Ile Leu Leu Arg Phe Ile
225 230 235 240
Trp Cys Phe Gln Ser Val Leu Thr Val Arg Ser Leu Lys Asp Arg Asp
245 250 255
Asn Gln Phe Tyr Arg Ser Gln Tyr Lys Lys Glu Ala Ile Gly Leu Ala
260 265 270
Leu His Trp Thr Leu Lys Thr Leu Phe His Leu Phe Phe Met Pro Ser
275 280 285
Ile Leu Thr Ser Met Leu Val Phe Phe Val Ser Glu Leu Va lGly Gly
290 295 300
Phe Gly Ile Ala Ile Val Val Phe Met Asn His Tyr Pro Leu Glu Lys
305 310 315 320
Ile Gly Asp Ser Val Trp Asp Gly His Gly Phe Ser Val Gly Gln Ile
325 330 335
His Glu Thr Met Asn Ile Arg Arg Gly Ile Ile Thr Asp Trp Phe Phe
340 345 350
Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu Trp Pro Thr Leu Pro
355 360 365
Arg His Asn Leu Thr Ala Val Ser Tyr Gln Val Glu Gln Leu Cys Gln
370 375 380
Lys His Asn Leu Pro Tyr Arg Asn Pro Leu Pro His Glu Gly Leu Val
385 390 395 400
Ile Leu Leu Arg Tyr Leu Ser Gln Phe Ala Arg Met Ala Glu Lys Gln
405 410 415
Pro Gly Ala Lys Ala Gln
420
<210>40
<211>777
<212>DNA
<213>小眼虫
<220>
<221>CDS
<222>(1)..(777)
<223>Δ-9延伸酶
<300>
<302>Δ-9延伸酶及其在制备多不饱和脂肪酸中的应用
<310>WO 2007/061742
<311>2006-11-16
<312>2007-05-31
<313>(1)..(777)
<300>
<302>Δ-9延伸酶及其在制备多不饱和脂肪酸中的应用
<310>US 2007-0117190-A1
<311>2006-11-16
<312>2007-05-24
<313>(1)..(777)
<400>40
atg gag gtg gtg aat gaa ata gtc tca att ggg cag gaa gtt tta ccc 48
Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro
1 5 10 15
aaa gtt gat tat gcc caa ctc tgg agt gat gcc agt cac tgt gag gtg 96
Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val
20 25 30
ctt tac ttg tcc atc gca ttt gtc atc ttg aag ttc act ctt ggc ccc 144
Leu Tyr Leu Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro
35 40 45
ctt ggt cca aaa ggt cag tct cgt atg aag ttt gtt ttc acc aat tac 192
Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr
50 55 60
aac ctt ctc atg tcc att tat tcg ttg gga tca ttc ctc tca atg gca 240
Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala
65 70 75 80
tat gcc atg tac acc atc ggt gtt atg tct gac aac tgc gag aag gct 288
Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala
85 90 95
ttt gac aac aac gtc ttc agg atc acc acg cag ttg ttc tat ttg agc 336
Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser
100 105 110
aag ttc ctg gag tat att gac tcc ttc tat ttg cca ctg atg ggc aag 384
Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys
115 120 125
cct ctg acc tgg ttg caa ttc ttc cat cat ttg ggg gca ccg atg gat 432
Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp
130 135 140
atg tgg ctg ttc tat aat tac cga aat gaa gct gtt tgg att ttt gtg 480
Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val
145 150 155 160
ctg ttg aat ggt ttc atc cac tgg atc atg tac ggt tat tat tgg acc 528
Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr
165 170 175
aga ttg atc aag ctg aag ttc ccc atg cca aaa tcc ctg att aca tca 576
Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser
180 185 190
atg cag atc att caa ttc aat gtt ggt ttc tac att gtc tgg aag tac 624
Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr
195 200 205
agg aac att ccc tgt tat cgc caa gat ggg atg agg atg ttt ggc tgg 672
Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp
210 215 220
ttc ttc aat tac ttt tat gtt ggc aca gtc ttg tgt ttg ttc ttg aat 720
Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn
225 230 235 240
ttc tat gtg caa acg tat atc gtc agg aag cac aag gga gcc aaa aag 768
Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys
245 250 255
att cag tga 777
Ile Gln
<210>41
<211>258
<212>PRT
<213>小眼虫
<400>41
Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro
1 5 10 15
Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val
20 25 30
Leu Tyr Leu Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro
35 40 45
Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr
50 55 60
Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala
65 70 75 80
Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala
85 90 95
Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser
100 105 110
Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys
115 120 125
Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp
130 135 140
Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val
145 150 155 160
Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr
165 170 175
Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser
180 185 190
Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr
195 200 205
Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp
210 215 220
Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn
225 230 235 240
Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys
245 250 255
Ile Gln
<210>42
<211>13707
<212>DNA
<213>人工序列
<220>
<223>质粒pZKSL-555R
<400>42
aaacagtgta cgcagatctg cccatgatgg gggctcccac caccagcaat cagggccctg 60
attacacacc cacctgtaat gtcatgctgt tcatcgtggt taatgctgct gtgtgctgtg 120
tgtgtgtgtt gtttggcgct cattgttgcg ttatgcagcg tacaccacaa tattggaagc 180
ttattagcct ttctattttt tcgtttgcaa ggcttaacaa cattgctgtg gagagggatg 240
gggatatgga ggccgctgga gggagtcgga gaggcgtttt ggagcggctt ggcctggcgc 300
ccagctcgcg aaacgcacct aggacccttt ggcacgccga aatgtgccac ttttcagtct 360
agtaacgcct tacctacgtc attccatgcg tgcatgtttg cgcctttttt cccttgccct 420
tgatcgccac acagtacagt gcactgtaca gtggaggttt tgggggggtc ttagatggga 480
gctaaaagcg gcctagcggt acactagtgg gattgtatgg agtggcatgg agcctaggtg 540
gagcctgaca ggacgcacga ccggctagcc cgtgacagac gatgggtggc tcctgttgtc 600
caccgcgtac aaatgtttgg gccaaagtct tgtcagcctt gcttgcgaac ctaattccca 660
attttgtcac ttcgcacccc cattgatcga gccctaaccc ctgcccatca ggcaatccaa 720
ttaagctcgc attgtctgcc ttgtttagtt tggctcctgc ccgtttcggc gtccacttgc 780
acaaacacaa acaagcatta tatataaggc tcgtctctcc ctcccaacca cactcacttt 840
tttgcccgtc ttcccttgct aacacaaaag tcaagaacac aaacaaccac cccaaccccc 900
ttacacacaa gacatatcta cagcaatggc catggctctc tcccttacta ccgagcagct 960
gctcgagcga cccgacctgg ttgccatcga cggcattctc tacgatctgg aaggtcttgc 1020
caaggtccat cccggaggcg acttgatcct cgcttctggt gcctccgatg cttctcctct 1080
gttctactcc atgcaccctt acgtcaagcc cgagaactcg aagctgcttc aacagttcgt 1140
gcgaggcaag cacgaccgaa cctccaagga cattgtctac acctacgact ctccctttgc 1200
acaggacgtc aagcgaacta tgcgagaggt catgaaaggt cggaactggt atgccacacc 1260
tggattctgg ctgcgaaccg ttggcatcat tgctgtcacc gccttttgcg agtggcactg 1320
ggctactacc ggaatggtgc tgtggggtct cttgactgga ttcatgcaca tgcagatcgg 1380
cctgtccatt cagcacgatg cctctcatgg tgccatcagc aaaaagccct gggtcaacgc 1440
tctctttgcc tacggcatcg acgtcattgg atcgtccaga tggatctggc tgcagtctca 1500
catcatgcga catcacacct acaccaatca gcatggtctc gacctggatg ccgagtccgc 1560
agaaccattc cttgtgttcc acaactaccc tgctgccaac actgctcgaa agtggtttca 1620
ccgattccag gcctggtaca tgtacctcgt gcttggagcc tacggcgttt cgctggtgta 1680
caaccctctc tacatcttcc gaatgcagca caacgacacc attcccgagt ctgtcacagc 1740
catgcgagag aacggctttc tgcgacggta ccgaaccctt gcattcgtta tgcgagcttt 1800
cttcatcttt cgaaccgcct tcttgccctg gtatctcact ggaacctccc tgctcatcac 1860
cattcctctg gtgcccactg ctaccggtgc cttcctcacc ttctttttca tcttgtctca 1920
caacttcgat ggctcggagc gaatccccga caagaactgc aaggtcaaga gctccgagaa 1980
ggacgttgaa gccgatcaga tcgactggta cagagctcag gtggagacct cttccaccta 2040
cggtggaccc attgccatgt tctttactgg cggtctcaac ttccagatcg agcatcacct 2100
ctttcctcga atgtcgtctt ggcactatcc cttcgtgcag caagctgtcc gagagtgttg 2160
cgaacgacac ggagttcggt acgtcttcta ccctaccatt gtgggcaaca tcatttccac 2220
cctcaagtac atgcacaaag tcggtgtggt tcactgtgtc aaggacgctc aggattccta 2280
agcggccgca agtgtggatg gggaagtgag tgcccggttc tgtgtgcaca attggcaatc 2340
caagatggat ggattcaaca cagggatata gcgagctacg tggtggtgcg aggatatagc 2400
aacggatatt tatgtttgac acttgagaat gtacgataca agcactgtcc aagtacaata 2460
ctaaacatac tgtacatact catactcgta cccgggcaac ggtttcactt gagtgcagtg 2520
gctagtgctc ttactcgtac agtgtgcaat actgcgtatc atagtctttg atgtatatcg 2580
tattcattca tgttagttgc gtacgctgtg ttgttgtatg tggtgaagct tgacaatgga 2640
tggtgtgtcg tatcaggctg gggaacaatt gtgcttaagt atgctgcagt tgagtaagag 2700
tcatcgctcc accaaaataa agtttgccat tagggttgga gagagagatg gtggctggaa 2760
gaattaaatg acatcaagct gaggattgtg ggtgtgcaat aacacatgtt aggggtgacc 2820
tgtggctcga aatctgataa ttattttgta actttatgat tattcttaga ttttttaata 2880
ttcctctata taacacataa gtagctgtcg tctagttgtt catagcctga ctcctgcaat 2940
agattagtgc agagtgattt tgtgcaattg agagccacgg ttgagtcaag tgactttgtg 3000
tgtgaagtca tcttacgttt caagtctcac aggttactca attggttggt tgtctgccct 3060
ttacagatat ttacagtacc tgagcgtaaa gtcgttcatc cacggaatga ctgttcctgt 3120
cacgcagtca tgatcatgga tgtggctggt caggaaccat tttggatagg agacttaggg 3180
attggactat tattgaaaaa actgagccga atatgatata gttctatttg aatgcagaac 3240
ttctgatggt caattcactt atttcaggca tatcggtcat ggtggcagct gccacgatgt 3300
tatctcgttg gaaacctcgg cgcgccagct gcattaatga atcggccaac gcgcggggag 3360
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 3420
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 3480
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 3540
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 3600
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 3660
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 3720
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 3780
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 3840
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 3900
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 3960
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 4020
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 4080
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 4140
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 4200
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 4260
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 4320
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 4380
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 4440
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 4500
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 4560
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 4620
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 4680
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 4740
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 4800
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 4860
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 4920
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 4980
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 5040
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 5100
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 5160
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 5220
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 5280
ggttccgcgc acatttcccc gaaaagtgcc acctgatgcg gtgtgaaata ccgcacagat 5340
gcgtaaggag aaaataccgc atcaggaaat tgtaagcgtt aatattttgt taaaattcgc 5400
gttaaatttt tgttaaatca gctcattttt taaccaatag gccgaaatcg gcaaaatccc 5460
ttataaatca aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag 5520
tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga 5580
tggcccacta cgtgaaccat caccctaatc aagttttttg gggtcgaggt gccgtaaagc 5640
actaaatcgg aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa 5700
cgtggcgaga aaggaaggga agaaagcgaa aggagcgggc gctagggcgc tggcaagtgt 5760
agcggtcacg ctgcgcgtaa ccaccacacc cgccgcgctt aatgcgccgc tacagggcgc 5820
gtccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 5880
ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 5940
gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta 6000
tagggcgaat tgggcccgac gtcgcatgca ttccatagcc acacctttgc ctatggcttc 6060
acaaccgaag gcaattcgag aggtcgcgct tatggaatcg actcgtataa agctgaaggg 6120
aaagggagac gttccgagcg ctcagatgca atagtcgtcc agctaatgtg gattcaaaaa 6180
caaccccaac agtaatcttg aaaatttgaa cggatcaatc tgaacactct tgctccaggt 6240
cattcttcta acgcacatcc ccagagtcta gagggagttg tgttgtgaac atcctaataa 6300
acaatgcaat ggattcggga tatcttctgt ctcgccccct actcgatgtc gagtaaaccg 6360
atcaccaact aacaatactc ctccgcgttc tgccattgac tctcaaacag acatcgctat 6420
caacggaaca gcatatttta gcttcttagg acaataaata ttgataatgc cggctctccc 6480
tcggtatatt aagcaatcca ttcatacact cattcatcag gttaatttta tatatataat 6540
ttgtctattc aaacaccgta aattactggt accatcatct cctccttttc aaatacacgt 6600
ctatttgcat taatgaaatt actcgccaat tcgcagaacg tgtttgtcga acagagcctt 6660
agctcgggtc cagacaggag cagtgtctcg ctgaggaagc tgcaggagag ttaattaact 6720
cacctgcagg attgagacta tgaatggatt cccgtgcccg tattactcta ctaatttgat 6780
cttggaacgc gaaaatacgt ttctaggact ccaaagaatc tcaactcttg tccttactaa 6840
atatactacc catagttgat ggtttacttg aacagagagg acatgttcac ttgacccaaa 6900
gtttctcgca tctcttggat atttgaacaa cggcgtccac tgaccgtcag ttatccagtc 6960
acaaaacccc cacattcata cattcccatg tacgtttaca aagttctcaa ttccatcgtg 7020
caaatcaaaa tcacatctat tcattcatca tatataaacc catcatgtct actaacactc 7080
acaactccat agaaaacatc gactcagaac acacgctcca tgcggccgct taggaatcct 7140
gtgcgtcctt cacgcagtgg acgacaccca ccttatgcat gtacttcagg gtggagatga 7200
tgttgccgac gatggtaggg tagaaaacat atcgcactcc atgtcgttcg caacactccc 7260
ggaccgcctg ctggacgaag gggtagtgcc aagacgacat ccggggaaag aggtggtgct 7320
cgatctggaa attgagaccg ccagtgaaga acatggcgat ggggccaccg tatgtggagg 7380
acgtctccac ctgcgcccga taccagtcaa tttggtcagc ctcaacgtcc ttctcagatc 7440
gcttaacctt gcagttcttg tcggggatcc gttcggagcc atcaaaattg tgggacaaaa 7500
tgaagaagaa cgtcaagaag gcaccagttg cggtgggcac cagaggaatg gtgatcagca 7560
atgaggtccc agtgaggtac cagggcaaga atgcggtccg gaagatgaag aaagctcgca 7620
tcacgaatgc aagtgtgcgg tagcgccgca gaaagccatt ttcccgcatg gccgtgacag 7680
actctgggat ggtgtcattg tgctgcatcc ggaaaatgta gagcgggttg tacaccagcg 7740
ataccccgta tgcccccagc acaaggtaca tgtaccaagc ctggaagcgg tggaaccact 7800
ttcgggcggt gtttgcggcg gggtagttgt ggaacaccag gaacggctct gccgactccg 7860
catccaggtc gaggccgtgc tggttggtgt aggtgtggtg ccgcatgatg tgcgactgca 7920
gccaaatcca ccgggacgat ccgatgacgt caatgccgta ggcgaagagg gcgttgaccc 7980
aaggcttctt gctgatggcc ccgtgggacg catcatgctg gatggataag ccgatctgca 8040
tgtgcatgaa tccagtcaac aggccccaca gcaccatccc cgtggtagcc cagtgccact 8100
cgcaaaaggc cgtcacggcg atgatcccaa cggtgcgcag ccagaagcca ggggttgcgt 8160
accagttcct ccctttcatc acctcgcgca ttgtccgctt aacgtcttgt gcgaagggag 8220
aatcatacgt gtagacaatg tccttcgagg tgcggtcatg cttccctcgg acgaactgtt 8280
gaagcaattt ggagttctcc ggtttgacgt atggatgcat tgaataaaag agaggggagg 8340
catcagaggc accagaagcg agaatcaaat ctcctcctgg atgaactttg gcaagccctt 8400
caaggtcgta gaggatgcca tcaatcgcaa ccaaatcagg gcgttctaac agctgttctg 8460
tggtaagact gagagccatg gagagctggg ttagtttgtg tagagagtgt gtgttgctag 8520
cgactttcgg attgtgtcat tacacaaaac gcgtcgtctc gacactgatc ttgtcgtgga 8580
tactcacggc tcggacatcg tcgccgacga tgacaccgga ctttcgctta aggacgtcag 8640
taacaggcat tgtgtgatgt gtagtttaga tttcgaatct gtggggaaag aaaggaaaaa 8700
agagactggc aaccgattgg gagagccact gtttatatat accctagaca agccccccgc 8760
ttgtaagatg ttggtcaatg taaaccagta ttaaggttgg caagtgcagg agaagcaagg 8820
tgtgggtacc gagcaatgga aatgtgcgga aggcaaaaaa atgaggccac ggcctattgt 8880
cggggctata tccagggggc gattgaagta cactaacatg acatgtgtcc acagaccctc 8940
aatctggcct gatgagccaa atccatacgc gctttcgcag ctctaaaggc tataacaagt 9000
cacaccaccc tgctcgacct cagcgccctc actttttgtt aagacaaact gtacacgctg 9060
ttccagcgtt ttctgcctgc acctggtggg acatttggtg caacctaaag tgctcggaac 9120
ctctgtggtg tccagatcag cgcagcagtt ccgaggtagt tttgaggccc ttagatgatg 9180
caatggtgtc agtcgctgga tcacgagtct taatggcagt attcgttctt atttgtgcca 9240
ttgagccccg ttatcctcgt atcttctacc ccccatccca tccctttgtt ggtgcaaccc 9300
tacccattta ttgttgggtg cagcccaacc gacgtggaga gcttggcttg gccatataaa 9360
aaggcccccc cctagtggca atggcagaaa gtcagctgtg agttgttgaa tttgtcatct 9420
aggcggcctg gccgtcttct ccggggcaat tggggctgtt ttttgggaca caaatacgcc 9480
gccaacccgg tctctcctga attccgtcgt cgcctgagtc gacatcattt atttaccagt 9540
tggccacaaa cccttgacga tctcgtatgt cccctccgac atactcccgg ccggctgggg 9600
tacgttcgat agcgctatcg gcatcgacaa ggtttgggtc cctagccgat accgcactac 9660
ctgagtcaca atcttcggag gtttagtctt ccacatagca cgggcaaaag tgcgtatata 9720
tacaagagcg tttgccagcc acagattttc actccacaca ccacatcaca catacaacca 9780
cacacatcca caatggaacc cgaaactaag aagaccaaga ctgactccaa gaagattgtt 9840
cttctcggcg gcgacttctg tggccccgag gtgattgccg aggccgtcaa ggtgctcaag 9900
tctgttgctg aggcctccgg caccgagttt gtgtttgagg accgactcat tggaggagct 9960
gccattgaga aggagggcga gcccatcacc gacgctactc tcgacatctg ccgaaaggct 10020
gactctatta tgctcggtgc tgtcggaggc gctgccaaca ccgtatggac cactcccgac 10080
ggacgaaccg acgtgcgacc cgagcagggt ctcctcaagc tgcgaaagga cctgaacctg 10140
tacgccaacc tgcgaccctg ccagctgctg tcgcccaagc tcgccgatct ctcccccatc 10200
cgaaacgttg agggcaccga cttcatcatt gtccgagagc tcgtcggagg tatctacttt 10260
ggagagcgaa aggaggatga cggatctggc gtcgcttccg acaccgagac ctactccgtt 10320
cctgaggttg agcgaattgc ccgaatggcc gccttcctgg cccttcagca caacccccct 10380
cttcccgtgt ggtctcttga caaggccaac gtgctggcct cctctcgact ttggcgaaag 10440
actgtcactc gagtcctcaa ggacgaactc ccccagctcg agctcaacca ccagctgatc 10500
gactcggccg ccatgatcct catcaagcag ccctccaaga tgaatggtat catcatcacc 10560
accaacatgt ttggcgatat catctccgac gaggcctccg tcatccccgg ttctctgggt 10620
ctgctgccct ccgcctctct ggcttctctg cccgacacca acgaggcgtt cggtctgtac 10680
gagccctgtc acggatctgc ccccgatctc ggcaagcaga aggtcaaccc cattgccacc 10740
attctgtctg ccgccatgat gctcaagttc tctcttaaca tgaagcccgc cggtgacgct 10800
gttgaggctg ccgtcaagga gtccgtcgag gctggtatca ctaccgccga tatcggaggc 10860
tcttcctcca cctccgaggt cggagacttg ttgccaacaa ggtcaaggag ctgctcaaga 10920
aggagtaagt cgtttctacg acgcattgat ggaaggagca aactgacgcg cctgcgggtt 10980
ggtctaccgg cagggtccgc tagtgtataa gactctataa aaagggccct gccctgctaa 11040
tgaaatgatg atttataatt taccggtgta gcaaccttga ctagaagaag cagattgggt 11100
gtgtttgtag tggaggacag tggtacgttt tggaaacagt cttcttgaaa gtgtcttgtc 11160
tacagtatat tcactcataa cctcaatagc caagggtgta gtcggtttat taaaggaagg 11220
gagttgtggc tgatgtggat atcgatagtt ggagcaaggg agaaatgtag agtgtgaaag 11280
actcactatg gtccgggctt atctcgacca atagccaaag tctggagttt ctgagagaaa 11340
aaggcaagat acgtatgtaa caaagcgacg catggtacaa taataccgga ggcatgtatc 11400
atagagagtt agtggttcga tgatggcact ggtgcctggt atgactttat acggctgact 11460
acatatttgt cctcagacat acaattacag tcaagcactt acccttggac atctgtaggt 11520
accccccggc caagacgatc tcagcgtgtc gtatgtcgga ttggcgtagc tccctcgctc 11580
gtcaattggc tcccatctac tttcttctgc ttggctacac ccagcatgtc tgctatggct 11640
cgttttcgtg ccttatctat cctcccagta ttaccaactc taaatgacat gatgtgattg 11700
ggtctacact ttcatatcag agataaggag tagcacagtt gcataaaaag cccaactcta 11760
atcagcttct tcctttcttg taattagtac aaaggtgatt agcgaaatct ggaagcttag 11820
ttggccctaa aaaaatcaaa aaaagcaaaa aacgaaaaac gaaaaaccac agttttgaga 11880
acagggaggt aacgaaggat cgtatatata tatatatata tatataccca cggatcccga 11940
gaccggcctt tgattcttcc ctacaaccaa ccattctcac caccctaatt cacaaccatg 12000
gctcccgacg ccgacaagct gcgacagcga aaggctcagt ccatccagga cactgccgat 12060
tctcaggcta ccgagctcaa gattggcacc ctgaagggtc tccaaggcac cgagatcgtc 12120
attgatggcg acatctacga catcaaagac ttcgatcacc ctggaggcga atccatcatg 12180
acctttggtg gcaacgacgt tactgccacc tacaagatga ttcatcccta ccactcgaag 12240
catcacctgg agaagatgaa aaaggtcggt cgagtgcccg actacacctc cgagtacaag 12300
ttcgatactc ccttcgaacg agagatcaaa caggaggtct tcaagattgt gcgaagaggt 12360
cgagagtttg gaacacctgg ctacttcttt cgagccttct gctacatcgg tctcttcttt 12420
tacctgcagt atctctgggt taccactcct accactttcg cccttgctat cttctacggt 12480
gtgtctcagg ccttcattgg cctgaacgtc cagcacgacg ccaaccacgg agctgcctcc 12540
aaaaagccct ggatcaacaa tttgctcggc ctgggtgccg actttatcgg aggctccaag 12600
tggctctgga tgaaccagca ctggacccat cacacttaca ccaaccatca cgagaaggat 12660
cccgacgccc tgggtgcaga gcctatgctg ctcttcaacg actatccctt gggtcacccc 12720
aagcgaaccc tcattcatca cttccaagcc ttctactatc tgtttgtcct tgctggctac 12780
tgggtgtctt cggtgttcaa ccctcagatc ctggacctcc agcaccgagg tgcccaggct 12840
gtcggcatga agatggagaa cgactacatt gccaagtctc gaaagtacgc tatcttcctg 12900
cgactcctgt acatctacac caacattgtg gctcccatcc agaaccaagg cttttcgctc 12960
accgtcgttg ctcacattct tactatgggt gtcgcctcca gcctgaccct cgctactctg 13020
ttcgccctct cccacaactt cgagaacgca gatcgggatc ccacctacga ggctcgaaag 13080
ggaggcgagc ctgtctgttg gttcaagtcg caggtggaaa cctcctctac ttacggtggc 13140
ttcatttccg gttgccttac aggcggactc aactttcagg tcgagcatca cctgtttcct 13200
cgaatgtcct ctgcctggta cccctacatc gctcctaccg ttcgagaggt ctgcaaaaag 13260
cacggcgtca agtacgccta ctatccctgg gtgtggcaga acctcatctc gaccgtcaag 13320
tacctgcatc agtccggaac tggctcgaac tggaagaacg gtgccaatcc ctactctggc 13380
aagctgtaag cggccgcatg tacatacaag attatttatagaaatgaatc gcgatcgaac 13440
aaagagtacg agtgtacgag taggggatga tgataaaagt ggaagaagtt ccgcatcttt 13500
ggatttatca acgtgtagga cgatacttcc tgtaaaaatg caatgtcttt accataggtt 13560
ctgctgtaga tgttattaac taccattaac atgtctactt gtacagttgc agaccagttg 13620
gagtatagaa tggtacactt accaaaaagt gttgatggtt gtaactacga tatataaaac 13680
tgttgacggg atctgcgtac actgttt 13707
<210>43
<211>1350
<212>DNA
<213>小眼虫
<220>
<221>CDS
<222>(1)..(1350)
<223>合成Δ-5去饱和酶(经密码子优化用于解脂耶氏酵母)
<400>43
atg gct ctc tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg 48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu
1 5 10 15
gtt gcc atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc 96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val
20 25 30
cat ccc gga ggc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct 144
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser
35 40 45
cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag 192
Pro Leu Phc Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60
ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag gac 240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp
65 70 75 80
att gtc tac acc tac gac tct ccc ttt gca cag gac gtc aag cga act 288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr
85 90 95
atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc aca cct gga ttc 336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe
100 105 110
tgg ctg cga acc gtt ggc atc att gct gtc acc gcc ttt tgc gag tgg 384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp
115 120 125
cac tgg gct act acc gga atg gtg ctg tgg ggt ctc ttg act gga ttc 432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe
130 135 140
atg cac atg cag atc ggc ctg tcc att cag cac gat gcc tct cat ggt 480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly
145 150 155 160
gcc atc agc aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc 528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile
165 170 175
gac gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg 576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met
180 185 190
cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag 624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205
tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc aac act 672
Sar Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr
210 215 220
gct cga aag tgg ttt cac cga ttc cag gcc tgg tac atg tac ctc gtg 720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val
225 230 235 240
ctt gga gcc tac ggc gtt tcg ctg gtg tac aac cct ctc tac atc ttc 768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe
245 250 255
cga atg cag cac aac gac acc att ccc gag tct gtc aca gcc atg cga 816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg
260 265 270
gag aac ggc ttt ctg cga cgg tac cga acc ctt gca ttc gtt atg cga 864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg
275 280 285
gct ttc ttc atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga 912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly
290 295 300
acc tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc 960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala
305 310 315 320
ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg gag 1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu
325 330 335
cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag aag gac gtt 1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val
340 345 350
gaa gcc gat cag atc gac tgg tac aga gct cag gtg gag acc tct tcc 1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser
355 360 365
acc tac ggt gga ccc att gcc atg ttc ttt act ggc ggt ctc aac ttc 1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe
370 375 380
cag atc gag cat cac ctc ttt cct cga atg tcg tct tgg cac tat ccc 1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro
385 390 395 400
ttc gtg cag caa gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg 1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg
405 410 415
tac gtc ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag 1296
Tyr ValPhe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys
420 425 430
tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat 1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445
tcc taa 1350
Ser
<210>44
<211>449
<212>PRT
<213>小眼虫
<400>44
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu
1 5 10 15
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val
20 25 30
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser
35 40 45
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp
65 70 75 80
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr
85 90 95
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe
100 105 110
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp
115 120 125
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe
130 135 140
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly
145 150 155 160
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile
165 170 175
Asp ValIle Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met
180 185 190
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr
210 215 220
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val
225 230 235 240
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe
245 250 255
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg
260 265 270
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg
275 280 285
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly
290 295 300
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala
305 310 315 320
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu
325 330 335
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val
340 345 350
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser
355 360 365
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe
370 375 380
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro
385 390 395 400
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg
405 410 415
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys
420 425 430
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445
Ser
<210>45
<211>1392
<212>DNA
<213>多甲藻属CCMP626
<220>
<221>CDS
<222>(1)..(1392)
<223>合成Δ-5去饱和酶(经密码子优化用于解脂耶氏酵母)
<400>45
atg gct ccc gac gcc gac aag ctg cga cag cga aag gct cag tcc atc 48
Met Ala Pro Asp Ala Asp Lys Leu Arg Gln Arg Lys Ala Gln Ser Ile
1 5 10 15
cag gac act gcc gat tct cag gct acc gag ctc aag att ggc acc ctg 96
Gln Asp Thr Ala Asp Ser Gln Ala Thr Glu Leu Lys Ile Gly Thr Leu
20 25 30
aag ggt ctc caa ggc acc gag atc gtc att gat ggc gac atc tac gac 144
Lys Gly Leu Gln Gly Thr Glu Ile Val Ile Asp Gly Asp Ile Tyr Asp
35 40 45
atc aaa gac ttc gat cac cct gga ggc gaa tcc atc atg acc ttt ggt 192
Ile Lys Asp Phe Asp His Pro Gly Gly Glu Ser Ile Met Thr Phe Gly
50 55 60
ggc aac gac gtt act gcc acc tac aag atg att cat ccc tac cac tcg 240
Gly Asn Asp Val Thr Ala Thr Tyr Lys Met Ile His Pro Tyr His Ser
65 70 75 80
aag cat cac ctg gag aag atg aaa aag gtc ggt cga gtg ccc gac tac 288
Lys His His Leu Glu Lys Met Lys Lys Val Gly Arg Val Pro Asp Tyr
85 90 95
acc tcc gag tac aag ttc gat act ccc ttc gaa cga gag atc aaa cag 336
Thr Ser Glu Tyr Lys Phe Asp Thr Pro Phe Glu Arg Glu Ile Lys Gln
100 105 110
gag gtc ttc aag att gtg cga aga ggt cga gag ttt gga aca cct ggc 384
Glu Val Phe Lys Ile Val Arg Arg Gly Arg Glu Phe Gly Thr Pro Gly
115 120 125
tac ttc ttt cga gcc ttc tgc tac atc ggt ctc ttc ttt tac ctg cag 432
Tyr Phe Phe Arg Ala Phe Cys Tyr Ile Gly Leu Phe Phe Tyr Leu Gln
130 135 140
tat ctc tgg gtt acc act cct acc act ttc gcc ctt gct atc ttc tac 480
Tyr Leu Trp Val Thr Thr Pro Thr Thr Phe Ala Leu Ala Ile Phe Tyr
145 150 155 160
ggt gtg tct cag gcc ttc att ggc ctg aac gtc cag cac gac gcc aac 528
Gly Val Ser Gln Ala Phe Ile Gly Leu Asn Val Gln His Asp Ala Asn
165 170 175
cac gga gct gcc tcc aaa aag ccc tgg atc aac aat ttg ctc ggc ctg 576
His Gly Ala Ala Ser Lys Lys Pro Trp Ile Asn Asn Leu Leu Gly Leu
180 185 190
ggt gcc gac ttt atc gga ggc tcc aag tgg ctc tgg atg aac cag cac 624
Gly Ala Asp Phe Ile Gly Gly Ser Lys Trp Leu Trp Met Asn Gln His
195 200 205
tgg acc cat cac act tac acc aac cat cac gag aag gat ccc gac gcc 672
Trp Thr His His Thr Tyr Thr Asn His His Glu Lys Asp Pro Asp Ala
210 215 220
ctg ggt gca gag cct atg ctg ctc ttc aac gac tat ccc ttg ggt cac 720
Leu Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Gly His
225 230 235 240
ccc aag cga acc ctc att cat cac ttc caa gcc ttc tac tat ctg ttt 768
Pro Lys Arg Thr Leu Ile His His Phe Gln Ala Phe Tyr Tyr Leu Phe
245 250 255
gtc ctt gct ggc tac tgg gtg tct tcg gtg ttc aac cct cag atc ctg 816
Val Leu Ala Gly Tyr Trp Val Ser Ser Val Phe Asn Pro Gln Ile Leu
260 265 270
gac ctc cag cac cga ggt gcc cag gct gtc ggc atg aag atg gag aac 864
Asp Leu Gln His Arg Gly Ala Gln Ala Val Gly Met Lys Met Glu Asn
275 280 285
gac tac att gcc aag tct cga aag tac gct atc ttc ctg cga ctc ctg 912
Asp Tyr Ile Ala Lys Ser Arg Lys Tyr Ala Ile Phe Leu Arg Leu Leu
290 295 300
tac atc tac acc aac att gtg gct ccc atc cag aac caa ggc ttt tcg 960
Tyr Ile Tyr Thr Asn Ile Val Ala Pro Ile Gln Asn Gln Gly Phe Ser
305 310 315 320
ctc acc gtc gtt gct cac att ctt act atg ggt gtc gcc tcc agc ctg 1008
Leu Thr Val Val Ala His Ile Leu Thr Met Gly Val Ala Ser Ser Leu
325 330 335
acc ctc gct act ctg ttc gcc ctc tcc cac aac ttc gag aac gca gat 1056
Thr Leu Ala Thr Leu Phe Ala Leu Ser His Asn Phe Glu Asn Ala Asp
340 345 350
cgg gat ccc acc tac gag gct cga aag gga ggc gag cct gtc tgt tgg 1104
Arg Asp Pro Thr Tyr Glu Ala Arg Lys Gly Gly Glu Pro Val Cys Trp
355 360 365
ttc aag tcg cag gtg gaa acc tcc tct act tac ggt ggc ttc att tcc 1152
Phe Lys Ser Gln Val Glu Thr Ser Ser Thr Tyr Gly Gly Phe Ile Ser
370 375 380
ggt tgc ctt aca ggc gga ctc aac ttt cag gtc gag cat cac ctg ttt 1200
Gly Cys Leu Thr Gly Gly Leu Asn Phe Gln Val Glu His His Leu Phe
385 390 395 400
cct cga atg tcc tct gcc tgg tac ccc tac atc gct cct acc gtt cga 1248
Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr Ile Ala Pro Thr Val Arg
405 410 415
gag gtc tgc aaa aag cac ggc gtc aag tac gcc tac tat ccc tgg gtg 1296
Glu Val Cys Lys Lys His Gly Val Lys Tyr Ala Tyr Tyr Pro Trp Val
420 425 430
tgg cag aac ctc atc tcg acc gtc aag tac ctg cat cag tcc gga act 1344
Trp Gln Asn Leu Ile Ser Thr Val Lys Tyr Leu His Gln Ser Gly Thr
435 440 445
ggc tcg aac tgg aag aac ggt gcc aat ccc tac tct ggc aag ctg taa 1392
Gly Ser Asn Trp Lys Asn Gly Ala Asn Pro Tyr Ser Gly Lys Leu
450 455 460
<210>46
<211>463
<212>PRT
<213>多甲藻属CCMP626
<400>46
Met Ala Pro Asp Ala Asp Lys Leu Arg Gln Arg Lys Ala Gln Ser Ile
1 5 10 15
Gln Asp Thr Ala Asp Ser Gln Ala Thr Glu Leu Lys Ile Gly Thr Leu
20 25 30
Lys Gly Leu Gln Gly Thr Glu Ile Val Ile Asp Gly Asp Ile Tyr Asp
35 40 45
Ile Lys Asp Phe Asp His Pro Gly Gly Glu Ser Ile Met Thr Phe Gly
50 55 60
Gly Asn Asp Val Thr Ala Thr Tyr Lys Met Ile His Pro Tyr His Ser
65 70 75 80
Lys His His Leu Glu Lys Met Lys Lys Val Gly Arg Val Pro Asp Tyr
85 90 95
Thr Ser Glu Tyr Lys Phe Asp Thr Pro Phe Glu Arg Glu Ile Lys Gln
100 105 110
Glu Val Phe Lys Ile Val Arg Arg Gly Arg Glu Phe Gly Thr Pro Gly
115 120 125
Tyr Phe Phe Arg Ala Phe Cys Tyr Ile Gly Leu Phe Phe Tyr Leu Gln
130 135 140
Tyr Leu Trp Val Thr Thr Pro Thr Thr Phe Ala Leu Ala Ile Phe Tyr
145 150 155 160
Gly Val Ser Gln Ala Phe Ile Gly Leu Asn Val Gln His Asp Ala Asn
165 170 175
His Gly Ala Ala Ser Lys Lys Pro Trp Ile Asn Asn Leu Leu Gly Leu
180 185 190
Gly Ala Asp Phe Ile Gly Gly Ser Lys Trp Leu Trp Met Asn Gln His
195 200 205
Trp Thr His His Thr Tyr Thr Asn His His Glu Lys Asp Pro Asp Ala
210 215 220
Leu Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Gly His
225 230 235 240
Pro Lys Arg Thr Leu Ile His His Phe Gln Ala Phe Tyr Tyr Leu Phe
245 250 255
Val Leu Ala Gly Tyr Trp Val Ser Ser Val Phe Asn Pro Gln Ile Leu
260 265 270
Asp Leu Gln His Arg Gly Ala Gln Ala Val Gly Met Lys Met Glu Asn
275 280 285
Asp Tyr Ile Ala Lys Ser Arg Lys Tyr Ala Ile Phe Leu Arg Leu Leu
290 295 300
Tyr Ile Tyr Thr Asn Ile Val Ala Pro Ile Gln Asn Gln Gly Phe Ser
305 310 315 320
Leu Thr Val Val Ala His Ile Leu Thr Met Gly ValAla Ser Ser Leu
325 330 335
Thr Leu Ala Thr Leu Phe Ala Leu Ser His Asn Phe Glu Asn Ala Asp
340 345 350
Arg Asp Pro Thr Tyr Glu Ala Arg Lys Gly Gly Glu Pro ValCys Trp
355 360 365
Phe Lys Ser Gln Val Glu Thr Ser Ser Thr Tyr Gly Gly Phe Ile Ser
370 375 380
Gly Cys Leu Thr Gly Gly Leu Asn Phe Gln ValGlu His His Leu Phe
385 390 395 400
Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr Ile Ala Pro Thr Val Arg
405 410 415
Glu ValCys Lys Lys His Gly ValLys Tyr Ala Tyr Tyr Pro Trp Val
420 425 430
Trp Gln Asn Leu Ile Ser Thr Val Lys Tyr Leu His Gln Ser Gly Thr
435 440 445
Gly Ser Asn Trp Lys Asn Gly Ala Asn Pro Tyr Ser Gly Lys Leu
450 455 460
<210>47
<211>1350
<212>DNA
<213>小眼虫
<220>
<221>CDS
<222>(1)..(1350)
<223>Δ-5去饱和酶
<400>47
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg 48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu
1 5 10 15
gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt 96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val
20 25 30
cat cca gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc 144
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser
35 40 45
cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aat tcc aaa 192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60
ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac 240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp
65 70 75 80
att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca 288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr
85 90 95
atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc 336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe
100 105 110
tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg 384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp
115 120 125
cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc 432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe
130 135 140
atg cac atg cag atc ggc tta tcc atc cag cat gat gcg tcc cac ggg 480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly
145 150 155 160
gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att 528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile
165 170 175
gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg 576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met
180 185 190
cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag 624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205
tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc 672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr
210 215 220
gcc cga aag tgg ttc cac cgc ttc caa gct tgg tac atg tac ctt gtg 720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val
225 230 235 240
ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc 768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe
245 250 255
cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg 816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg
260 265 270
gag aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga 864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg
275 280 285
gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg 912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly
290 295 300
acc tca ttg ctg atc acc att cct ctg gtg ccc act gca act ggt gcc 960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala
305 310 315 320
ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa 1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu
325 330 335
cgg atc ccc gac aag aac tgc aag gtt aag agc tct gag aag gac gtt 1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val
340 345 350
gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc 1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser
355 360 365
aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc 1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe
370 375 380
cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc 1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro
385 390 395 400
ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cgc cat gga gtg cga 1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg
405 410 415
tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag 1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys
420 425 430
tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat 1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445
tcc tga 1350
Ser
<210>48
<211>449
<212>PRT
<213>小眼虫
<400>48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu
1 5 10 15
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val
20 25 30
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser
35 40 45
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp
65 70 75 80
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr
85 90 95
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe
100 105 110
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp
115 120 125
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe
130 135 140
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly
145 150 155 160
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile
165 170 175
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met
180 185 190
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr
210 215 220
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val
225 230 235 240
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe
245 250 255
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg
260 265 270
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg
275 280 285
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly
290 295 300
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala
305 310 315 320
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu
325 330 335
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val
340 345 350
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser
355 360 365
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe
370 375 380
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro
385 390 395 400
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg
405 410 415
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys
420 425 430
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445
Ser
<210>49
<211>13066
<212>DNA
<213>人工序列
<220>
<223>质粒pZP3-Pa777U
<400>49
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 60
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 120
cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tcaggtggca 180
cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata 240
tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga 300
gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc 360
ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg 420
cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc 480
ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat 540
cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact 600
tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat 660
tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga 720
tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc 780
ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga 840
tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag 900
cttcccggca acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc 960
gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt 1020
ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct 1080
acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg 1140
cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg 1200
atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca 1260
tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga 1320
tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa 1380
aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga 1440
aggtaactgg cttcagcaga gcgcagatac caaatactgt tcttctagtg tagccgtagt 1500
taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt 1560
taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat 1620
agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 1680
tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca 1740
cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 1800
agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 1860
gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 1920
aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 1980
tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag 2040
ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg 2100
aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct 2160
ggcgcgccac caatcacaat tctgaaaagc acatcttgat ctcctcattg cggggagtcc 2220
aacggtggtc ttattccccc gaatttcccg ctcaatctcg ttccagaccg acccggacac 2280
agtgcttaac gccgttccga aactctaccg cagatatgct ccaacggact gggctgcata 2340
gatgtgatcc tcggcttgga gaaatggata aaagccggcc aaaaaaaaag cggaaaaaag 2400
cggaaaaaaa gagaaaaaaa atcgcaaaat ttgaaaaata gggggaaaag acgcaaaaac 2460
gcaaggaggg gggagtatat gacactgata agcaagctca caacggttcc tcttattttt 2520
ttcctcatct tctgcctagg ttcccaaaat cccagatgct tctctccagt gccaaaagta 2580
agtaccccac aggttttcgg ccgaaaattc cacgtgcagc aacgtcgtgt ggggtgttaa 2640
aatgtggggg gggggaacca ggacaagagg ctcttgtggg agccgaatga gagcacaaag 2700
cgggcgggtg tgataagggc atttttgccc attttccctt ctcctgtctc tccgacggtg 2760
atggcgttgt gcgtcctcta tttcttttta tttctttttg ttttatttct ctgactaccg 2820
atttggtttg atttcctcaa ccccacacaa ataagctcgg gccgaggaat atatatatac 2880
acggacacag tcgccctgtg gacaacacgt cactacctct acgatacaca ccgtacgttg 2940
tgtggaagct tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 3000
ccaagctcga aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggacaca 3060
atatctggtc aaatttcagt ttcgttacat ttaaattcct tcacttcaag ttcattcttc 3120
atctgcttct gttttacttt gacaggcaaa tgaagacatg gtacgacttg atggaggcca 3180
agaacgccat ttcaccccga gacaccgaag tgcctgaaat cctggctgcc cccattgata 3240
acatcggaaa ctacggtatt ccggaaagtg tatatagaac ctttccccag cttgtgtctg 3300
tggatatgga tggtgtaatc ccctttgagt actcgtcttg gcttctctcc gagcagtatg 3360
aggctctcta atctagcgca tttaatatct caatgtattt atatatttat cttctcatgc 3420
ggccgcttag ttggctttgg tcttggcagc cttggcctcc ttgagggtaa acatcttggc 3480
atccttgtcg accacgccgt acttggcgta cataagacca attcggatga aggtgggaat 3540
gatgggagaa gccgactttc gcaccagttc gggaaaggcc tgagcgaagg cagcagtggc 3600
ctcgttgagc ttgtagtgag gaatgatggg aaacagatgg tggatctgat gtgtaccaat 3660
gttgtgggac aggttgtcga tgagggctcc gtagcttcgg tccacagagg acaagttgcc 3720
cttgacatag gtccactccg aatcggcgta ccagggagtt tcctcgtcgt tgtgatggag 3780
gaaggtagtg acaaccagca tggtggcgaa tccaaagaga ggtgcgaagt aatacagagc 3840
catggtcttg aggccgtaga cgtaggtaag gtaggcgtac agaccagcaa aggccacgag 3900
agagccgagg gaaatgatga cggcagacat tcttcgcagg tagagaggct cccagggatt 3960
gaagtggttg acctttcggg gaggaaatcc agcaacgagg taggcaaacc aagccgaacc 4020
aagggagatg accatgtgtc gggacagggg atgagagtcg gcttctcgct gagggtagaa 4080
gatctcatcc ttgtcgatgt tgccggtgtt cttgtgatgg tgtcgatggc tgatcttcca 4140
cgactcgtag ggagtcagaa tgatggagtg aatgagtgtg ccaacagaga agttgagcag 4200
gtgggatcgc gagaaggcac catgtccaca gtcgtgaccg atggtaaaga atccccagaa 4260
cacgataccc tggagcagaa tgtagccagt gcaaaggacg gcatcgagca gtgcaaactc 4320
ctgcacgata gcaagggctc gagcatagta cagtccgaga gcaagggaac cggcaatgcc 4380
cagagctcgc acggtatagt agagggacca gggaacagag gcttcgaagc agtgggcagg 4440
cagggatcgc ttgatctcgg tgagagtagg gaactcgtag ggagcggcaa cggtagagga 4500
agccatggtt gtgaattagg gtggtgagaa tggttggttg tagggaagaa tcaaaggccg 4560
gtctcgggat ccgtgggtat atatatatat atatatatat acgatccttc gttacctccc 4620
tgttctcaaa actgtggttt ttcgtttttc gttttttgct ttttttgatt tttttagggc 4680
caactaagct tccagatttc gctaatcacc tttgtactaa ttacaagaaa ggaagaagct 4740
gattagagtt gggcttttta tgcaactgtg ctactcctta tctctgatat gaaagtgtag 4800
acccaatcac atcatgtcat ttagagttgg taatactggg aggatagata aggcacgaaa 4860
acgagccata gcagacatgc tgggtgtagc caagcagaag aaagtagatg ggagccaatt 4920
gacgagcgag ggagctacgc caatccgaca tacgacacgc tgagatcgtc ttggccgggg 4980
ggtacctaca gatgtccaag ggtaagtgct tgactgtaat tgtatgtctg aggacaaata 5040
tgtagtcagc cgtataaagt cataccaggc accagtgcca tcatcgaacc actaactctc 5100
tatgatacat gcctccggta ttattgtacc atgcgtcgct ttgttacata cgtatcttgc 5160
ctttttctct cagaaactcc agactttggc tattggtcga gataagcccg gaccatagtg 5220
agtctttcac actctacatt tctcccttgc tccaactatc gattgttgtc tactaactat 5280
cgtacgataa cttcgtatag catacattat acgaagttat cgcgtcgacg agtatctgtc 5340
tgactcgtca ttgccgcctt tggagtacga ctccaactat gagtgtgctt ggatcacttt 5400
gacgatacat tcttcgttgg aggctgtggg tctgacagct gcgttttcgg cgcggttggc 5460
cgacaacaat atcagctgca acgtcattgc tggctttcat catgatcaca tttttgtcgg 5520
caaaggcgac gcccagagag ccattgacgt tctttctaat ttggaccgat agccgtatag 5580
tccagtctat ctataagttc aactaactcg taactattac cataacatat acttcactgc 5640
cccagataag gttccgataa aaagttctgc agactaaatt tatttcagtc tcctcttcac 5700
caccaaaatg ccctcctacg aagctcgagc taacgtccac aagtccgcct ttgccgctcg 5760
agtgctcaag ctcgtggcag ccaagaaaac caacctgtgt gcttctctgg atgttaccac 5820
caccaaggag ctcattgagc ttgccgataa ggtcggacct tatgtgtgca tgatcaaaac 5880
ccatatcgac atcattgacg acttcaccta cgccggcact gtgctccccc tcaaggaact 5940
tgctcttaag cacggtttct tcctgttcga ggacagaaag ttcgcagata ttggcaacac 6000
tgtcaagcac cagtaccggt gtcaccgaat cgccgagtgg tccgatatca ccaacgccca 6060
cggtgtaccc ggaaccggaa tcattgctgg cctgcgagct ggtgccgagg aaactgtctc 6120
tgaacagaag aaggaggacg tctctgacta cgagaactcc cagtacaagg agttcctagt 6180
cccctctccc aacgagaagc tggccagagg tctgctcatg ctggccgagc tgtcttgcaa 6240
gggctctctg gccactggcg agtactccaa gcagaccatt gagcttgccc gatccgaccc 6300
cgagtttgtg gttggcttca ttgcccagaa ccgacctaag ggcgactctg aggactggct 6360
tattctgacc cccggggtgg gtcttgacga caagggagac gctctcggac agcagtaccg 6420
aactgttgag gatgtcatgt ctaccggaac ggatatcata attgtcggcc gaggtctgta 6480
cggccagaac cgagatccta ttgaggaggc caagcgatac cagaaggctg gctgggaggc 6540
ttaccagaag attaactgtt agaggttaga ctatggatat gtaatttaac tgtgtatata 6600
gagagcgtgc aagtatggag cgcttgttca gcttgtatga tggtcagacg acctgtctga 6660
tcgagtatgt atgatactgc acaacctgtg tatccgcatg atctgtccaa tggggcatgt 6720
tgttgtgttt ctcgatacgg agatgctggg tacagtgcta atacgttgaa ctacttatac 6780
ttatatgagg ctcgaagaaa gctgacttgt gtatgactta ttctcaacta catccccagt 6840
cacaatacca ccactgcact accactacac caaaaccatg atcaaaccac ccatggactt 6900
cctggaggca gaagaacttg ttatggaaaa gctcaagaga gagatcataa cttcgtatag 6960
catacattat acgaagttat cctgcaggta aaggaattca tgctgttcat cgtggttaat 7020
gctgctgtgt gctgtgtgtg tgtgttgttt ggcgctcatt gttgcgttat gcagcgtaca 7080
ccacaatatt ggaagcttat tagcctttct attttttcgt ttgcaaggct taacaacatt 7140
gctgtggaga gggatgggga tatggaggcc gctggaggga gtcggagagg cgttttggag 7200
cggcttggcc tggcgcccag ctcgcgaaac gcacctagga ccctttggca cgccgaaatg 7260
tgccactttt cagtctagta acgccttacc tacgtcattc catgcgtgca tgtttgcgcc 7320
ttttttccct tgcccttgat cgccacacag tacagtgcac tgtacagtgg aggttttggg 7380
ggggtcttag atgggagcta aaagcggcct agcggtacac tagtgggatt gtatggagtg 7440
gcatggagcc taggtggagc ctgacaggac gcacgaccgg ctagcccgtg acagacgatg 7500
ggtggctcct gttgtccacc gcgtacaaat gtttgggcca aagtcttgtc agccttgctt 7560
gcgaacctaa ttcccaattt tgtcacttcg cacccccatt gatcgagccc taacccctgc 7620
ccatcaggca atccaattaa gctcgcattg tctgccttgt ttagtttggc tcctgcccgt 7680
ttcggcgtcc acttgcacaa acacaaacaa gcattatata taaggctcgt ctctccctcc 7740
caaccacact cacttttttg cccgtcttcc cttgctaaca caaaagtcaa gaacacaaac 7800
aaccacccca acccccttac acacaagaca tatctacagc aatggccatg gcttcttcca 7860
ctgttgctgc gccgtacgag ttcccgacgc tgacggagat caagcgctcg ctgccagcgc 7920
actgctttga ggcctcggtc ccgtggtcgc tctactacac cgtgcgcgcg ctgggcatcg 7980
ccggctcgct cgcgctcggc ctctactacg cgcgcgcgct cgcgatcgtg caggagtttg 8040
ccctgctgga tgcggtgctc tgcacggggt acattctgct gcagggcatc gtattctggg 8100
ggttcttcac catcggccat gactgcggcc acggcgcgtt ctcgcgttcg cacctgctca 8160
acttcagcgt cggcacgctc attcactcga tcatcctcac gccgtacgag tcatggaaga 8220
tctcgcaccg ccaccaccac aagaacacgg gcaacatcga caaggacgag attttctacc 8280
cgcagcgcga ggccgactcg cacccactgt cccgacacat ggtgatctcg ctcggctcgg 8340
cctggttcgc gtacctcgtt gcgggcttcc ctcctcgcaa ggtgaaccac ttcaaccctt 8400
gggaaccgtt gtacctgcgc cgcatgtctg ccgtcatcat ctcactcggc tcgctcgtgg 8460
cgttcgcggg cttgtatgcg tatctcacct acgtctatgg ccttaagacc atggcgctgt 8520
actacttcgc ccctctcttt gggttcgcca cgatgctcgt ggtcactacc tttttgcacc 8580
acaatgacga ggaaacgcca tggtacgccg actcggagtg gacgtacgtc aagggcaacc 8640
tctcgtccgt ggaccgctcg tacggcgcgc tcatcgacaa cctgagccac aacatcggca 8700
cgcaccagat ccaccacctg tttccgatca tcccgcacta caagctgaac gaggcgacgg 8760
cagcgttcgc gcaggcgttc ccggagctcg tgcgcaagag cgcgtcgccg atcatcccga 8820
cgttcatccg catcgggctc atgtacgcca agtacggcgt cgtggacaag gacgccaaga 8880
tgtttacgct caaggaggcc aaggccgcca agaccaaggc caactaggcg gccgcattga 8940
tgattggaaa cacacacatg ggttatatct aggtgagagt tagttggaca gttatatatt 9000
aaatcagcta tgccaacggt aacttcattc atgtcaacga ggaaccagtg actgcaagta 9060
atatagaatt tgaccacctt gccattctct tgcactcctt tactatatct catttatttc 9120
ttatatacaa atcacttctt cttcccagca tcgagctcgg aaacctcatg agcaataaca 9180
tcgtggatct cgtcaataga gggctttttg gactccttgc tgttggccac cttgtccttg 9240
ctgtttaaac agtgtacgca gatctactat agaggaacat ttaaattgcc ccggagaaga 9300
cggccaggcc gcctagatga caaattcaac aactcacagc tgactttctg ccattgccac 9360
tagggggggg cctttttata tggccaagcc aagctctcca cgtcggttgg gctgcaccca 9420
acaataaatg ggtagggttg caccaacaaa gggatgggat ggggggtaga agatacgagg 9480
ataacggggc tcaatggcac aaataagaac gaatactgcc attaagactc gtgatccagc 9540
gactgacacc attgcatcat ctaagggcct caaaactacc tcggaactgc tgcgctgatc 9600
tggacaccac agaggttccg agcactttag gttgcaccaa atgtcccacc aggtgcaggc 9660
agaaaacgct ggaacagcgt gtacagtttg tcttaacaaa aagtgagggc gctgaggtcg 9720
agcagggtgg tgtgacttgt tatagccttt agagctgcga aagcgcgtat ggatttggct 9780
catcaggcca gattgagggt ctgtggacac atgtcatgtt agtgtacttc aatcgccccc 9840
tggatatagc cccgacaata ggccgtggcc tcattttttt gccttccgca catttccatt 9900
gctcggtacc cacaccttgc ttctcctgca cttgccaacc ttaatactgg tttacattga 9960
ccaacatctt acaagcgggg ggcttgtcta gggtatatat aaacagtggc tctcccaatc 10020
ggttgccagt ctcttttttc ctttctttcc ccacagattc gaaatctaaa ctacacatca 10080
cagaattccg agccgtgagt atccacgaca agatcagtgt cgagacgacg cgttttgtgt 10140
aatgacacaa tccgaaagtc gctagcaaca cacactctct acacaaacta acccagctct 10200
ggtaccatgg cttcttccac tgttgctgcg ccgtacgagt tcccgacgct gacggagatc 10260
aagcgctcgc tgccagcgca ctgctttgag gcctcggtcc cgtggtcgct ctactacacc 10320
gtgcgcgcgc tgggcatcgc cggctcgctc gcgctcggcc tctactacgc gcgcgcgctc 10380
gcgatcgtgc aggagtttgc cctgctggat gcggtgctct gcacggggta cattctgctg 10440
cagggcatcg tattctgggg gttcttcacc atcggccatg actgcggcca cggcgcgttc 10500
tcgcgttcgc acctgctcaa cttcagcgtc ggcacgctca ttcactcgat catcctcacg 10560
ccgtacgagt catggaagat ctcgcaccgc caccaccaca agaacacggg caacatcgac 10620
aaggacgaga ttttctaccc gcagcgcgag gccgactcgc acccactgtc ccgacacatg 10680
gtgatctcgc tcggctcggc ctggttcgcg tacctcgttg cgggcttccc tcctcgcaag 10740
gtgaaccact tcaacccttg ggaaccgttg tacctgcgcc gcatgtctgc cgtcatcatc 10800
tcactcggct cgctcgtggc gttcgcgggc ttgtatgcgt atctcaccta cgtctatggc 10860
cttaagacca tggcgctgta ctacttcgcc cctctctttg ggttcgccac gatgctcgtg 10920
gtcactacct ttttgcacca caatgacgag gaaacgccat ggtacgccga ctcggagtgg 10980
acgtacgtca agggcaacct ctcgtccgtg gaccgctcgt acggcgcgct catcgacaac 11040
ctgagccaca acatcggcac gcaccagatc caccacctgt ttccgatcat cccgcactac 11100
aagctgaacg aggcgacggc agcgttcgcg caggcgttcc cggagctcgt gcgcaagagc 11160
gcgtcgccga tcatcccgac gttcatccgc atcgggctca tgtacgccaa gtacggcgtc 11220
gtggacaagg acgccaagat gtttacgctc aaggaggcca aggccgccaa gaccaaggcc 11280
aactaggcgg ccgcatggag cgtgtgttct gagtcgatgt tttctatgga gttgtgagtg 11340
ttagtagaca tgatgggttt atatatgatg aatgaataga tgtgattttg atttgcacga 11400
tggaattgag aactttgtaa acgtacatgg gaatgtatga atgtgggggt tttgtgactg 11460
gataactgac ggtcagtgga cgccgttgtt caaatatcca agagatgcga gaaactttgg 11520
gtcaagtgaa catgtcctct ctgttcaagt aaaccatcaa ctatgggtag tatatttagt 11580
aaggacaaga gttgagattc tttggagtcc tagaaacgta ttttcgcgtt ccaagatcaa 11640
attagtagag taatacgggc acgggaatcc attcatagtc tcaatcctgc aggtgagtta 11700
attaagatga cgacatttgc gagctggacg aggaatagat ggagcgtgtg ttctgagtcg 11760
atgttttcta tggagttgtg agtgttagta gacatgatgg gtttatatat gatgaatgaa 11820
tagatgtgat tttgatttgc acgatggaat tgagaacttt gtaaacgtac atgggaatgt 11880
atgaatgtgg gggttttgtg actggataac tgacggtcag tggacgccgt tgttcaaata 11940
tccaagagat gcgagaaact ttgggtcaag tgaacatgtc ctctctgttc aagtaaacca 12000
tcaactatgg gtagtatatt tagtaaggac aagagttgag attctttgga gtcctagaaa 12060
cgtattttcg cgttccaaga tcaaattagt agagtaatac gggcacggga atccattcat 12120
agtctcaatt ttcccatagg tgtgctacaa ggtgttgaga tgtggtacag taccaccatg 12180
attcgaggta aagagcccag aagtcattga tgaggtcaag aaatacacag atctacagct 12240
caatacaatg aatatcttct ttcatattct tcaggtgaca ccaagggtgt ctattttccc 12300
cagaaatgcg tgaaaaggcg cgtgtgtagc gtggagtatg ggttcggttg gcgtatcctt 12360
catatatcga cgaaatagta gggcaagaga tgacaaaaag tatctatatg tagacagcgt 12420
agaatatgga tttgattggt ataaattcat ttattgcgtg tctcacaaat actctcgata 12480
agttggggtt aaactggaga tggaacaatg tcgatatctc gacgcatgcg acgtcgggcc 12540
caattcgccc tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga 12600
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 12660
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 12720
tggcgaatgg acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc 12780
agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc 12840
tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg 12900
ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca 12960
cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc 13020
tttaatagtg gactcttgtt ccaaactgga acaacactca acccta 13066
<210>50
<211>1080
<212>DNA
<213>瓜果腐霉菌
<220>
<221>CDS
<222>(1)..(1080)
<223>合成Δ-17去饱和酶(经密码子优化用于解脂耶氏酵母)
<400>50
atg gct tcc tct acc gtt gcc gct ccc tac gag ttc cct act ctc acc 48
Met Ala Ser Ser Thr Val Ala Ala Pro Tyr Glu Phe Pro Thr Leu Thr
1 5 10 15
gag arc aag cga tcc ctg cct gcc cac tgc ttc gaa gcc tct gtt ccc 96
Glu Ile Lys Arg Ser Leu Pro Ala His Cys Phe Glu Ala Ser Val Pro
20 25 30
tgg tcc ctc tac tat acc gtg cga gct ctg ggc att gcc ggt tcc ctt 144
Trp Ser Leu Tyr Tyr Thr Val Arg Ala Leu Gly Ile Ala Gly Ser Leu
35 40 45
gct ctc gga ctg tac tat gct cga gcc ctt gct atc gtg cag gag ttt 192
Ala Leu Gly Leu Tyr Tyr Ala Arg Ala Leu Ala Ile Val Gln Glu Phe
50 55 60
gca ctg ctc gat gcc gtc ctt tgc act ggc tac att ctg ctc cag ggt 240
Ala Leu Leu Asp Ala Val Leu Cys Thr Gly Tyr Ile Leu Leu Gln Gly
65 70 75 80
atc gtg ttc tgg gga ttc ttt acc atc ggt cac gac tgt gga cat ggt 288
Ile Val Phe Trp Gly Phe Phe Thr Ile Gly His Asp Cys Gly His Gly
85 90 95
gcc ttc tcg cga tcc cac ctg ctc aac ttc tct gtt ggc aca ctc att 336
Ala Phe Ser Arg Ser His Leu Leu Asn Phe Ser Val Gly Thr Leu Ile
100 105 110
cac tcc atc att ctg act ccc tac gag tcg tgg aag atc agc cat cga 384
His Ser Ile Ile Leu Thr Pro Tyr Glu Ser Trp Lys Ile Ser His Arg
115 120 125
cac cat cac aag aac acc ggc aac atc gac aag gat gag atc ttc tac 432
His His His Lys Asn Thr Gly Asn Ile Asp Lys Asp Glu Ile Phe Tyr
130 135 140
cct cag cga gaa gcc gac tct cat ccc ctg tcc cga cac atg gtc atc 480
Pro Gln Arg Glu Ala Asp Ser His Pro Leu Ser Arg His Met Val Ile
145 150 155 160
tcc ctt ggt tcg gct tgg ttt gcc tac ctc gtt gct gga ttt cct ccc 528
Ser Leu Gly Ser Ala Trp Phe Ala Tyr Leu Val Ala Gly Phe Pro Pro
165 170 175
cga aag gtc aac cac ttc aat ccc tgg gag cct ctc tac ctg cga aga 576
Arg Lys Val Asn His Phe Asn Pro Trp Glu Pro Leu Tyr Leu Arg Arg
180 185 190
atg tct gcc gtc atc att tcc ctc ggc tct ctc gtg gcc ttt gct ggt 624
Met Ser Ala Val Ile Ile Ser Leu Gly Ser Leu Val Ala Phe Ala Gly
195 200 205
ctg tac gcc tac ctt acc tac gtc tac ggc ctc aag acc atg gct ctg 672
Leu Tyr Ala Tyr Leu Thr Tyr Val Tyr Gly Leu Lys Thr Met Ala Leu
210 215 220
tat tac ttc gca cct ctc ttt gga ttc gcc acc atg ctg gtt gtc act 720
Tyr Tyr Phe Ala Pro Leu Phe Gly Phe Ala Thr Met Leu Val Val Thr
225 230 235 240
acc ttc ctc cat cac aac gac gag gaa act ccc tgg tac gcc gat tcg 768
Thr Phe Leu His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser
245 250 255
gag tgg acc tat gtc aag ggc aac ttg tcc tct gtg gac cga agc tac 816
Glu Trp Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr
260 265 270
gga gcc ctc atc gac aac ctg tcc cac aac att ggt aca cat cag atc 864
Gly Ala Leu Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile
275 280 285
cac cat ctg ttt ccc atc att cct cac tac aag ctc aac gag gcc act 912
His His Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Glu Ala Thr
290 295 300
gct gcc ttc gct cag gcc ttt ccc gaa ctg gtg cga aag tcg gct tct 960
Ala Ala Phe Ala Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Ala Ser
305 310 315 320
ccc atc att ccc acc ttc atc cga att ggt ctt atg tac gcc aag tac 1008
Pro Ile Ile Pro Thr Phe Ile Arg Ile Gly Leu Met Tyr Ala Lys Tyr
325 330 335
ggc gtg gtc gac aag gat gcc aag atg ttt acc ctc aag gaggcc aag 1056
Gly ValVal Asp Lys Asp Ala Lys Met Phe Thr Leu Lys Glu Ala Lys
340 345 350
gct gcc aag acc aaa gcc aac taa 1080
Ala Ala Lys Thr Lys Ala Asn
355
<210>51
<211>359
<212>PRT
<213>瓜果腐霉菌
<400>51
Met Ala Ser Ser Thr Val Ala Ala Pro Tyr Glu Phe Pro Thr Leu Thr
1 5 10 15
Glu Ile Lys Arg Ser Leu Pro Ala His Cys Phe Glu Ala Ser Val Pro
20 25 30
Trp Ser Leu Tyr Tyr Thr Val Arg Ala Leu Gly Ile Ala Gly Ser Leu
35 40 45
Ala Leu Gly Leu Tyr Tyr Ala Arg Ala Leu Ala Ile Val Gln Glu Phe
50 55 60
Ala Leu Leu Asp Ala Val Leu Cys Thr Gly Tyr Ile Leu Leu Gln Gly
65 70 75 80
Ile Val Phe Trp Gly Phe Phe Thr Ile Gly His Asp Cys Gly His Gly
85 90 95
Ala Phe Ser Arg Ser His Leu Leu Asn Phe Ser Val Gly Thr Leu Ile
100 105 110
His Ser Ile Ile Leu Thr Pro Tyr Glu Ser Trp Lys Ile Ser His Arg
115 120 125
His His His Lys Asn Thr Gly Asn Ile Asp Lys Asp Glu Ile Phe Tyr
130 135 140
Pro Gln Arg Glu Ala Asp Ser His Pro Leu Ser Arg His Met Val Ile
145 150 155 160
Ser Leu Gly Ser Ala Trp Phe Ala Tyr Leu Val Ala Gly Phe Pro Pro
165 170 175
Arg Lys Val Asn His Phe Asn Pro Trp Glu Pro Leu Tyr Leu Arg Arg
180 185 190
Met Ser Ala Val Ile Ile Ser Leu Gly Ser Leu Val Ala Phe Ala Gly
195 200 205
Leu Tyr Ala Tyr Leu Thr Tyr Val Tyr Gly Leu Lys Thr Met Ala Leu
210 215 220
Tyr Tyr Phe Ala Pro Leu Phe Gly Phe Ala Thr Met Leu Val Val Thr
225 230 235 240
Thr Phe Leu His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser
245 250 255
Glu Trp Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr
260 265 270
Gly Ala Leu Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile
275 280 285
His His Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Glu Ala Thr
290 295 300
Ala Ala Phe Ala Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Ala Ser
305 310 315 320
Pro Ile Ile Pro Thr Phe Ile Arg Ile Gly Leu Met Tyr Ala Lys Tyr
325 330 335
Gly Val Val Asp Lys Asp Ala Lys Met Phe Thr Leu Lys Glu Ala Lys
340 345 350
Ala Ala Lys Thr Lys Ala Asn
355
<210>52
<211>1080
<212>DNA
<213>瓜果腐霉菌
<220>
<221>CDS
<222>(1)..(1080)
<223>Δ-17去饱和酶
<400>52
atg gct tct tcc act gtt gct gcg ccg tac gag ttc ccg acg ctg acg 48
Met Ala Ser Ser Thr Val Ala Ala Pro Tyr Glu Phe Pro Thr Leu Thr
1 5 10 15
gag atc aag cgc tcg ctg cca gcg cac tgc ttt gag gcc tcg gtc ccg 96
Glu Ile Lys Arg Ser Leu Pro Ala His Cys Phe Glu Ala Ser Val Pro
20 25 30
tgg tcg ctc tac tac acc gtg cgc gcg ctg ggc atc gcc ggc tcg ctc 144
Trp Ser Leu Tyr Tyr Thr Val Arg Ala Leu Gly Ile Ala Gly Ser Leu
35 40 45
gcg ctc ggc ctc tac tac gcg cgc gcg ctc gcg atc gtg cag gag ttt 192
Ala Leu Gly Leu Tyr Tyr Ala Arg Ala Leu Ala Ile Val Gln Glu Phe
50 55 60
gcc ctg ctg gat gcg gtg ctc tgc acg ggg tac att ctg ctg cag ggc 240
Ala Leu Leu Asp Ala Val Leu Cys Thr Gly Tyr Ile Leu Leu Gln Gly
65 70 75 80
atc gta ttc tgg ggg ttc ttc acc atc ggc cat gac tgc ggc cac ggc 288
Ile Val Phe Trp Gly Phe Phe Thr Ile Gly His Asp Cys Gly His Gly
85 90 95
gcg ttc tcg cgt tcg cac ctg ctc aac ttc agc gtc ggc acg ctc att 336
Ala Phe Ser Arg Ser His Leu Leu Asn Phe Ser ValGly Thr Leu Ile
100 105 110
cac tcg atc atc ctc acg ccg tac gag tca tgg aag atc tcg cac cgc 384
His Ser Ile Ile Leu Thr Pro Tyr Glu Ser Trp Lys Ile Ser His Arg
115 120 125
cac cac cac aag aac acg ggc aac atc gac aag gac gag att ttc tac 432
His His His Lys Asn Thr Gly Asn Ile Asp Lys Asp Glu Ile Phe Tyr
130 135 140
ccg cag cgc gag gcc gac tcg cac cca ctg tcc cga cac atg gtg atc 480
Pro Gln Arg Glu Ala Asp Ser His Pro Leu Ser Arg His Met Val Ile
145 150 155 160
tcg ctc ggc tcg gcc tgg ttc gcg tac ctc gtt gcg ggc ttc cct cct 528
Ser Leu Gly Ser Ala Trp Phe Ala Tyr Leu Val Ala Gly Phe Pro Pro
165 170 175
cgc aag gtg aac cacttc aac cct tgg gaa ccg ttg tac ctg cgc cgc 576
Arg Lys Val Asn His Phe Asn Pro Trp Glu Pro Leu Tyr Leu Arg Arg
180 185 190
atg tct gcc gtc a tc atctca ctc ggc tcg ctc gtg gcg ttc gcg ggc 624
Met Ser Ala Val Ile Ile Ser Leu Gly Ser Leu Val Ala Phe Ala Gly
195 200 205
ttg tat gcg tat ctc acc tac gtc tat ggc ctt aag acc atg gcg ctg 672
Leu Tyr Ala Tyr Leu Thr Tyr Val Tyr Gly Leu Lys Thr Met Ala Leu
210 215 220
tac tac ttc gcc cct ctc ttt ggg ttc gcc acg atg ctc gtg gtc act 720
Tyr Tyr Phe Ala Pro Leu Phe Gly Phe Ala Thr Met Leu Val Val Thr
225 230 235 240
acc ttt ttg cac cac aat gac gag gaa acg cca tgg tac gcc gac tcg 768
Thr Phe Leu His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser
245 250 255
gag tgg acg tac gtc aag ggc aac ctc tcg tcc gtg gac cgc tcg tac 816
Glu Trp Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr
260 265 270
ggc gcg ctc atc gac aac ctg agc cac aac atc ggc acg cac cag atc 864
Gly Ala Leu Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile
275 280 285
cac cac ctg ttt ccg atc atc ccg cac tac aag ctg aac gag gcg acg 912
His His Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Glu Ala Thr
290 295 300
gca gcg ttc gcg cag gcg ttc ccg gag ctc gtg cgc aag agc gcg tcg 960
Ala Ala Phe Ala Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Ala Ser
305 310 315 320
ccg atc atc ccg acg ttc atc cgc atc ggg ctc atg tac gcc aag tac 1008
Pro Ile Ile Pro Thr Phe Ile Arg Ile Gly Leu Met Tyr Ala Lys Tyr
325 330 335
ggc gtc gtg gac aag gac gcc aag atg ttt acg ctc aag gag gcc aag 1056
Gly Val Val Asp Lys Asp Ala Lys Met Phe Thr Leu Lys Glu Ala Lys
340 345 350
gcc gcc aag acc aag gcc aac tag 1080
Ala Ala Lys Thr Lys Ala Asn
355
<210>53
<211>359
<212>PRT
<213>瓜果腐霉菌
<400>53
Met Ala Ser Ser Thr Val Ala Ala Pro Tyr Glu Phe Pro Thr Leu Thr
1 5 10 15
Glu Ile Lys Arg Ser Leu Pro Ala His Cys Phe Glu Ala Ser Val Pro
20 25 30
Trp Ser Leu Tyr Tyr Thr Val Arg Ala Leu Gly Ile Ala Gly Ser Leu
35 40 45
Ala Leu Gly Leu Tyr Tyr Ala Arg Ala Leu Ala Ile Val Gln Glu Phe
50 55 60
Ala Leu Leu Asp Ala Val Leu Cys Thr Gly Tyr Ile Leu Leu Gln Gly
65 70 75 80
Ile Val Phe Trp Gly Phe Phe Thr Ile Gly His Asp Cys Gly His Gly
85 90 95
Ala Phe Ser Arg Ser His Leu Leu Asn Phe Ser Val Gly Thr Leu Ile
100 105 110
His Ser Ile Ile Leu Thr Pro Tyr Glu Ser Trp Lys Ile Ser His Arg
115 120 125
His His His Lys Asn Thr Gly Asn Ile Asp Lys Asp Gln Ile Phe Tyr
130 135 140
Pro Gln Arg Glu Ala Asp Ser His Pro Leu Ser Arg His Met Val Ile
145 150 155 160
Ser Leu Gly Ser Ala Trp Phe Ala Tyr Leu Val Ala Gly Phe Pro Pro
165 170 175
Arg Lys Val Asn His Phe Asn Pro Trp Glu Pro Leu Tyr Leu Arg Arg
180 185 190
Met Ser Ala Val Ile Ile Ser Leu Gly Ser Leu Val Ala Phe Ala Gly
195 200 205
Leu Tyr Ala Tyr Leu Thr Tyr Val Tyr Gly Leu Lys Thr Met Ala Leu
210 215 220
Tyr Tyr Phe Ala Pro Leu Phe Gly Phe Ala Thr Met Leu Val Val Thr
225 230 235 240
Thr Phe Leu His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser
245 250 255
Glu Trp Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr
260 265 270
Gly Ala Leu Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile
275 280 285
His His Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Glu Ala Thr
290 295 300
Ala Ala Phe Ala Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Ala Ser
305 310 315 320
Pro Ile Ile Pro Thr Phe Ile Arg Ile Gly Leu Met Tyr Ala Lys Tyr
325 330 335
Gly Val Val Asp Lys Asp Ala Lys Met Phe Thr Leu Lys Glu Ala Lys
340 345 350
Ala Ala Lys Thr Lys Ala Asn
355
<210>54
<211>9570
<212>DNA
<213>人工序列
<220>
<223>质粒pY117
<400>54
ggccgccacc gcggcccgag attccggcct cttcggccgc caagcgaccc gggtggacgt 60
ctagaggtac ctagcaatta acagatagtt tgccggtgat aattctctta acctcccaca 120
ctcctttgac ataacgattt atgtaacgaa actgaaattt gaccagatat tgtgtccgcg 180
gtggagctcc agcttttgtt ccctttagtg agggtttaaa cgagcttggc gtaatcatgg 240
tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa cgtacgagcc 300
ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 360
ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 420
ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact 480
gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta 540
atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag 600
caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 660
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 720
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 780
ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc 840
tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 900
gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 960
ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 1020
aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 1080
aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 1140
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 1200
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 1260
gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 1320
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 1380
gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 1440
tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 1500
gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 1560
ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 1620
actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 1680
ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg 1740
tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 1800
cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 1860
ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 1920
ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 1980
tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 2040
agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 2100
atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 2160
gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 2220
aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 2280
tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 2340
aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgcgccc 2400
tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 2460
gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 2520
ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 2580
cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 2640
tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 2700
ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 2760
ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 2820
tttaacaaaa tattaacgct tacaatttcc attcgccatt caggctgcgc aactgttggg 2880
aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg 2940
caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg 3000
ccagtgaatt gtaatacgac tcactatagg gcgaattggg taccgggccc cccctcgagg 3060
tcgatggtgt cgataagctt gatatcgaat tcatgtcaca caaaccgatc ttcgcctcaa 3120
ggaaacctaa ttctacatcc gagagactgc cgagatccag tctacactga ttaattttcg 3180
ggccaataat ttaaaaaaat cgtgttatat aatattatat gtattatata tatacatcat 3240
gatgatactg acagtcatgt cccattgcta aatagacaga ctccatctgc cgcctccaac 3300
tgatgttctc aatatttaag gggtcatctc gcattgttta ataataaaca gactccatct 3360
accgcctcca aatgatgttc tcaaaatata ttgtatgaac ttatttttat tacttagtat 3420
tattagacaa cttacttgct ttatgaaaaa cacttcctat ttaggaaaca atttataatg 3480
gcagttcgtt catttaacaa tttatgtaga ataaatgtta taaatgcgta tgggaaatct 3540
taaatatgga tagcataaat gatatctgca ttgcctaatt cgaaatcaac agcaacgaaa 3600
aaaatccctt gtacaacata aatagtcatc gagaaatatc aactatcaaa gaacagctat 3660
tcacacgtta ctattgagat tattattgga cgagaatcac acactcaact gtctttctct 3720
cttctagaaa tacaggtaca agtatgtact attctcattg ttcatacttc tagtcatttc 3780
atcccacata ttccttggat ttctctccaa tgaatgacat tctatcttgc aaattcaaca 3840
attataataa gatataccaa agtagcggta tagtggcaat caaaaagctt ctctggtgtg 3900
cttctcgtat ttatttttat tctaatgatc cattaaaggt atatatttat ttcttgttat 3960
ataatccttt tgtttattac atgggctgga tacataaagg tattttgatt taattttttg 4020
cttaaattca atcccccctc gttcagtgtc aactgtaatg gtaggaaatt accatacttt 4080
tgaagaagca aaaaaaatga aagaaaaaaa aaatcgtatt tccaggttag acgttccgca 4140
gaatctagaa tgcggtatgc ggtacattgt tcttcgaacg taaaagttgc gctccctgag 4200
atattgtaca tttttgcttt tacaagtaca agtacatcgt acaactatgt actactgttg 4260
atgcatccac aacagtttgt tttgtttttt tttgtttttt ttttttctaa tgattcatta 4320
ccgctatgta tacctacttg tacttgtagt aagccgggtt attggcgttc aattaatcat 4380
agacttatga atctgcacgg tgtgcgctgc gagttacttt tagcttatgc atgctacttg 4440
ggtgtaatat tgggatctgt tcggaaatca acggatgctc aaccgatttc gacagtaatt 4500
aattaattcc ctagtcccag tgtacacccg ccgatatcgc ttaccctgca gccggattaa 4560
ggttggcaat ttttcacgtc cttgtctccg caattactca ccgggtggtt tataagattg 4620
caagcgtctt gatttgtctc tgtatactaa catgcaatcg cgactcgccc gacgggccac 4680
taacctggcc agaatctcca gatccaagta ttctcttggt ctgcgatatg tttccaacac 4740
aaaagcccct gctgcccagc cggcaactgc tgagtgagta ttccttgcca taaacgaccc 4800
agaaccactg tatagtgttt ggaagcacta gtcagaagac cagcgaaaac aggtggaaaa 4860
aactgagacg aaaagcaacg accagaaatg taatgtgtgg aaaagcgaca cacacagagc 4920
agataaagag gtgacaaata acgacaaatg aaatatcagt atcttcccac aatcactacc 4980
tctcagctgt ctgaaggtgc ggctgatata tccatcccac gtctaacgta tggagtgtga 5040
tagaatatga cgacacaagc atgagaactc gctctctatc caaccaccga aacactgtca 5100
ctacagccgt tcttgttgct ccattcgctt ttgtgattcc atgccttctc tggtgactga 5160
caacattcct tccttttctc cagccctgtt gttatctgct catgacctac ggccactctc 5220
tatcgcatac taacatagac gatcccagcc cgctccccac ttccagggca ccgttggcaa 5280
gcctcctatc ctcaagaagg ctgaggctgc caacgctgac atggacgagt ccttcatcgg 5340
aatgtctgga ggagagatct tccacgagat gatgctgcga cacaacgtcg acactgtctt 5400
cggttacccc ggtggagcca ttctccccgt ctttgacgcc attcacaact ctgagtactt 5460
caactttgtg ctccctcgac acgagcaggg tgccggccac atggccgagg gctacgctcg 5520
agcctctggt aagcccggtg tcgttctcgt cacctctggc cccggtgcca ccaacgtcat 5580
cacccccatg caggacgctc tttccgatgg tacccccatg gttgtcttca ccggtcaggt 5640
cctgacctcc gttatcggca ctgacgcctt ccaggaggcc gatgttgtcg gcatctcccg 5700
atcttgcacc aagtggaacg tcatggtcaa gaacgttgct gagctccccc gacgaatcaa 5760
cgaggccttt gagattgcta cttccggccg acccggtccc gttctcgtcg atctgcccaa 5820
ggatgttact gctgccatcc tgcgagagcc catccccacc aagtccacca ttccctcgca 5880
ttctctgacc aacctcacct ctgccgccgc caccgagttc cagaagcagg ctatccagcg 5940
agccgccaac ctcatcaacc agtccaagaa gcccgtcctt tacgtcggac agggtatcct 6000
tggctccgag gagggtccta agctgcttaa ggagctggct gagaaggccg agattcccgt 6060
caccactact ctgcagggtc ttggtgcctt tgacgagcga gaccccaagt ctctgcacat 6120
gctcggtatg cacggttccg gctacgccaa catggccatg cagaacgctg actgtatcat 6180
tgctctcggc gcccgatttg atgaccgagt taccggctcc atccccaagt ttgcccccga 6240
ggctcgagcc gctgcccttg agggtcgagg tggtattgtt cactttgaga tccaggccaa 6300
gaacatcaac aaggttgttc aggccaccga agccgttgag ggagacgtta ccgagtctgt 6360
ccgacagctc atccccctca tcaacaaggt ctctgccgct gagcgagctc cctggactga 6420
gactatccag tcctggaagc agcagttccc cttcctcttc gaggctgaag gtgaggatgg 6480
tgttatcaag ccccagtccg tcattgctct gctctctgac ctgacagaga acaacaagga 6540
caagaccatc atcaccaccg gtgttggtca gcatcagatg tggactgccc agcatttccg 6600
atggcgacac cctcgaacca tgatcacttc tggtggtctt ggaactatgg gttacggcct 6660
gcccgccgct atcggcgcca aggttgcccg acctgactgc gacgtcattg acatcgatgg 6720
tgacgcttct ttcaacatga ctctgaccga gctgtccacc gccgttcagt tcaacattgg 6780
cgtcaaggct attgtcctca acaacgagga acagggtatg gtcacccagc tgcagtctct 6840
cttctacgag aaccgatact gccacactca tcagaagaac cccgacttca tgaagctggc 6900
cgagtccatg ggcatgaagg gtatccgaat cactcacatt gaccagctgg aggccggtct 6960
caaggagatg ctcgcataca agggccctgt gctcgttgag gttgttgtcg acaagaagat 7020
ccccgttctt cccatggttc ccgctggtaa ggctttgcat gagttccttg tctacgacgc 7080
tgacgccgag gctgcttctc gacccgatcg actgaagaat gcccccgccc ctcacgtcca 7140
ccagaccacc tttgagaact aagtggaaag gaacacaagc aatccgaacc aaaaataatt 7200
ggggtcccgt gcccacagag tctagtgcag acctaaaatg accacagtaa attatagctg 7260
ttattaaaca tgagattttg accaacaaga gcgtaggaat gttattagct actacttgta 7320
catacacagc atttgtttta aataatgttg cctccagggg cagtgagatc aggacccaga 7380
tccgtggcca gctctctgac ttcagaccgc ttgtacttaa gcagctcgca acactgttgt 7440
cgaggattga acttgccata ttcgattttg tggtcatgaa tccagcacac ctcatttaaa 7500
tgtagctaac ggtagcaggc gaactactgg tacatacctc ccccggaata tgtacaggca 7560
taatgcgtat ctgtgggaca tgtggtcgtt gcgccattat gtaagcagcg tgtactcctc 7620
tgactgtcca tatggtttgc tccatctcac cctcatcgtt ttcattgttc acaggcggcc 7680
acaaaaaaac tgtcttctct ccttctctct tcgccttagt ctactcggac cagttttagt 7740
ttagcttggc gccactggat aaatgagacc tcaggccttg tgatgaggag gtcacttatg 7800
aagcatgtta ggaggtgctt gtatggatag agaagcaccc aaaataataa gaataataat 7860
aaaacagggg gcgttgtcat ttcatatcgt gttttcacca tcaatacacc tccaaacaat 7920
gcccttcatg tggccagccc caatattgtc ctgtagttca actctatgca gctcgtatct 7980
tattgagcaa gtaaaactct gtcagccgat attgcccgac ccgcgacaag ggtcaacaag 8040
gtggtgtaag gccttcgcag aagtcaaaac tgtgccaaac aaacatctag agtctctttg 8100
gtgtttctcg catatatttw atcggctgtc ttacgtattt gcgcctcggt accggactaa 8160
tttcggatca tccccaatac gctttttctt cgcagctgtc aacagtgtcc atgatctatc 8220
cacctaaatg ggtcatatga ggcgtataat ttcgtggtgc tgataataat tcccatatat 8280
ttgacacaaa acttcccccc ctagacatac atctcacaat ctcacttctt gtgcttctgt 8340
cacacatctc ctccagctga cttcaactca cacctctgcc ccagttggtc tacagcggta 8400
taaggtttct ccgcatagag gtgcaccact cctcccgata cttgtttgtg tgacttgtgg 8460
gtcacgacat atatatctac acacattgcg ccaccctttg gttcttccag cacaacaaaa 8520
acacgacacg ctaaccatgg ccaatttact gaccgtacac caaaatttgc ctgcattacc 8580
ggtcgatgca acgagtgatg aggttcgcaa gaacctgatg gacatgttca gggatcgcca 8640
ggcgttttct gagcatacct ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg 8700
gtgcaagttg aataaccgga aatggtttcc cgcagaacct gaagatgttc gcgattatct 8760
tctatatctt caggcgcgcg gtctggcagt aaaaactatc cagcaacatt tgggccagct 8820
aaacatgctt catcgtcggt ccgggctgcc acgaccaagt gacagcaatg ctgtttcact 8880
ggttatgcgg cggatccgaa aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct 8940
agcgttcgaa cgcactgatt tcgaccaggt tcgttcactc atggaaaata gcgatcgctg 9000
ccaggatata cgtaatctgg catttctggg gattgcttat aacaccctgt tacgtatagc 9060
cgaaattgcc aggatcaggg ttaaagatat ctcacgtact gacggtggga gaatgttaat 9120
ccatattggc agaacgaaaa cgctggttag caccgcaggt gtagagaagg cacttagcct 9180
gggggtaact aaactggtcg agcgatggat ttccgtctct ggtgtagctg atgatccgaa 9240
taactacctg ttttgccggg tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca 9300
gctatcaact cgcgccctgg aagggatttt tgaagcaact catcgattga tttacggcgc 9360
taaggatgac tctggtcaga gatacctggc ctggtctgga cacagtgccc gtgtcggagc 9420
cgcgcgagat atggcccgcg ctggagtttc aataccggag atcatgcaag ctggtggctg 9480
gaccaatgta aatattgtca tgaactatat ccgtaacctg gatagtgaaa caggggcaat 9540
ggtgcgcctg ctggaagatg gcgattaagc 9570
<210>55
<211>15743
<212>DNA
<213>人工序列
<220>
<223>质粒pZP2-2988
<400>55
ggccgcatgt acatacaaga ttatttatag aaatgaatcg cgatcgaaca aagagtacga 60
gtgtacgagt aggggatgat gataaaagtg gaagaagttc cgcatctttg gatttatcaa 120
cgtgtaggac gatacttcct gtaaaaatgc aatgtcttta ccataggttc tgctgtagat 180
gttattaact accattaaca tgtctacttg tacagttgca gaccagttgg agtatagaat 240
ggtacactta ccaaaaagtg ttgatggttg taactacgat atataaaact gttgacggga 300
tctgtatatt cggtaagata tattttgtgg ggttttagtg gtgtttaaac agtgtacgca 360
gtactataga ggaacaattg ccccggagaa gacggccagg ccgcctagat gacaaattca 420
acaactcaca gctgactttc tgccattgcc actagggggg ggccttttta tatggccaag 480
ccaagctctc cacgtcggtt gggctgcacc caacaataaa tgggtagggt tgcaccaaca 540
aagggatggg atggggggta gaagatacga ggataacggg gctcaatggc acaaataaga 600
acgaatactg ccattaagac tcgtgatcca gcgactgaca ccattgcatc atctaagggc 660
ctcaaaacta cctcggaact gctgcgctga tctggacacc acagaggttc cgagcacttt 720
aggttgcacc aaatgtccca ccaggtgcag gcagaaaacg ctggaacagc gtgtacagtt 780
tgtcttaaca aaaagtgagg gcgctgaggt cgagcagggt ggtgtgactt gttatagcct 840
ttagagctgc gaaagcgcgt atggatttgg ctcatcaggc cagattgagg gtctgtggac 900
acatgtcatg ttagtgtact tcaatcgccc cctggatata gccccgacaa taggccgtgg 960
cctcattttt ttgccttccg cacatttcca ttgctcggta cccacacctt gcttctcctg 1020
cacttgccaa ccttaatact ggtttacatt gaccaacatc ttacaagcgg ggggcttgtc 1080
tagggtatat ataaacagtg gctctcccaa tcggttgcca gtctcttttt tcctttcttt 1140
ccccacagat tcgaaatcta aactacacat cacaccatgg aggtcgtgaa cgaaatcgtc 1200
tccattggcc aggaggttct tcccaaggtc gactatgctc agctctggtc tgatgcctcg 1260
cactgcgagg tgctgtacct ctccatcgcc ttcgtcatcc tgaagttcac ccttggtcct 1320
ctcggaccca agggtcagtc tcgaatgaag tttgtgttca ccaactacaa cctgctcatg 1380
tccatctact cgctgggctc cttcctctct atggcctacg ccatgtacac cattggtgtc 1440
atgtccgaca actgcgagaa ggctttcgac aacaatgtct tccgaatcac cactcagctg 1500
ttctacctca gcaagttcct cgagtacatt gactccttct atctgcccct catgggcaag 1560
cctctgacct ggttgcagtt ctttcaccat ctcggagctc ctatggacat gtggctgttc 1620
tacaactacc gaaacgaagc cgtttggatc tttgtgctgc tcaacggctt cattcactgg 1680
atcatgtacg gctactattg gacccgactg atcaagctca agttccctat gcccaagtcc 1740
ctgattactt ctatgcagat cattcagttc aacgttggct tctacatcgt ctggaagtac 1800
cggaacattc cctgctaccg acaagatgga atgagaatgt ttggctggtt tttcaactac 1860
ttctacgttg gtactgtcct gtgtctgttc ctcaacttct acgtgcagac ctacatcgtc 1920
cgaaagcaca agggagccaa aaagattcag tgagcggccg caagtgtgga tggggaagtg 1980
agtgcccggt tctgtgtgca caattggcaa tccaagatgg atggattcaa cacagggata 2040
tagcgagcta cgtggtggtg cgaggatata gcaacggata tttatgtttg acacttgaga 2100
atgtacgata caagcactgt ccaagtacaa tactaaacat actgtacata ctcatactcg 2160
tacccgggca acggtttcac ttgagtgcag tggctagtgc tcttactcgt acagtgtgca 2220
atactgcgta tcatagtctt tgatgtatat cgtattcatt catgttagtt gcgtacgggc 2280
gtcgttgctt gtgtgatttt tgaggaccca tccctttggt atataagtat actctggggt 2340
taaggttgcc cgtgtagtct aggttatagt tttcatgtga aataccgaga gccgagggag 2400
aataaacggg ggtatttgga cttgtttttt tcgcggaaaa gcgtcgaatc aaccctgcgg 2460
gccttgcacc atgtccacga cgtgtttctc gccccaattc gccccttgca cgtcaaaatt 2520
aggcctccat ctagacccct ccataacatg tgactgtggg gaaaagtata agggaaacca 2580
tgcaaccata gacgacgtga aagacgggga ggaaccaatg gaggccaaag aaatggggta 2640
gcaacagtcc aggagacaga caaggagaca aggagagggc gcccgaaaga tcggaaaaac 2700
aaacatgtcc aattggggca gtgacggaaa cgacacggac acttcagtac aatggaccga 2760
ccatctccaa gccagggtta ttccggtatc accttggccg taacctcccg ctggtacctg 2820
atattgtaca cgttcacatt caatatactt tcagctacaa taagagaggc tgtttgtcgg 2880
gcatgtgtgt ccgtcgtatg gggtgatgtc cgagggcgaa attcgctaca agcttaactc 2940
tggcgcttgt ccagtatgaa tagacaagtc aagaccagtg gtgccatgat tgacagggag 3000
gtacaagact tcgatactcg agcattactc ggacttgtgg cgattgaaca gacgggcgat 3060
cgcttctccc ccgtattgcc ggcgcgccag ctgcattaat gaatcggcca acgcgcgggg 3120
agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 3180
gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 3240
gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 3300
cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 3360
aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 3420
tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 3480
ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat 3540
ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 3600
cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 3660
ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 3720
gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt 3780
atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 3840
aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 3900
aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 3960
gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 4020
cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 4080
gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 4140
tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 4200
ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 4260
ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 4320
atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 4380
cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 4440
tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 4500
aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 4560
tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 4620
ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 4680
agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa 4740
gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 4800
agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 4860
accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 4920
gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 4980
cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 5040
ggggttccgc gcacatttcc ccgaaaagtg ccacctgatg cggtgtgaaa taccgcacag 5100
atgcgtaagg agaaaatacc gcatcaggaa attgtaagcg ttaatatttt gttaaaattc 5160
gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat cggcaaaatc 5220
ccttataaat caaaagaata gaccgagata gggttgagtg ttgttccagt ttggaacaag 5280
agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 5340
gatggcccac tacgtgaacc atcaccctaa tcaagttttt tggggtcgag gtgccgtaaa 5400
gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccggcg 5460
aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc gctggcaagt 5520
gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc ttaatgcgcc gctacagggc 5580
gcgtccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt 5640
cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc 5700
cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac 5760
tatagggcga attgggcccg acgtcgcatg cgctgatgac actttggtct gaaagagatg 5820
cattttgaat cccaaacttg cagtgcccaa gtgacataca tctccgcgtt ttggaaaatg 5880
ttcagaaaca gttgattgtg ttggaatggg gaatggggaa tggaaaaatg actcaagtat 5940
caattccaaa aacttctctg gctggcagta cctactgtcc atactactgc attttctcca 6000
gtcaggccac tctatactcg acgacacagt agtaaaaccc agataatttc gacataaaca 6060
agaaaacaga cccaataata tttatatata gtcagccgtt tgtccagttc agactgtaat 6120
agccgaaaaa aaatccaaag tttctattct aggaaaatat attccaatat ttttaattct 6180
taatctcatt tattttattc tagcgaaata catttcagct acttgagaca tgtgataccc 6240
acaaatcgga ttcggactcg gttgttcaga agagcatatg gcattcgtgc tcgcttgttc 6300
acgtattctt cctgttccat ctcttggccg acaatcacac aaaaatgggg tttttttttt 6360
aattctaatg attcattaca gcaaaattga gatatagcag accacgtatt ccataatcac 6420
caaggaagtt cttgggcgtc ttaattaact cacctgcagg attgagacta tgaatggatt 6480
cccgtgcccg tattactcta ctaatttgat cttggaacgc gaaaatacgt ttctaggact 6540
ccaaagaatc tcaactcttg tccttactaa atatactacc catagttgat ggtttacttg 6600
aacagagagg acatgttcac ttgacccaaa gtttctcgca tctcttggat atttgaacaa 6660
cggcgtccac tgaccgtcag ttatccagtc acaaaacccc cacattcata cattcccatg 6720
tacgtttaca aagttctcaa ttccatcgtg caaatcaaaa tcacatctat tcattcatca 6780
tatataaacc catcatgtct actaacactc acaactccat agaaaacatc gactcagaac 6840
acacgctcca tgcggccgct tactgagcct tggcaccggg ctgcttctcg gccattcgag 6900
cgaactggga caggtatcgg agcaggatga cgagaccttc atggggcaga gggtttcggt 6960
aggggaggtt gtgcttctgg cacagctgtt ccacctggta ggaaacggca gtgaggttgt 7020
gtcgaggcag ggtgggccag agatggtgct cgatctggta gttcaggcct ccaaagaacc 7080
agtcagtaat gatgcctcgt cgaatgttca tggtctcatg gatctgaccc acagagaagc 7140
catgtccgtc ccagacggaa tcaccgatct tctccagagg gtagtggttc atgaagacca 7200
cgatggcaat tccgaagcca ccgacgagct cggaaacaaa gaacaccagc atcgaggtca 7260
ggatggaggg cataaagaag aggtggaaca gggtcttgag agtccagtgc agagcgagtc 7320
caatggcctc tttcttgtac tgagatcggt agaactggtt gtctcggtcc ttgagggatc 7380
gaacggtcag cacagactgg aaacaccaga tgaatcgcag gagaatacag atgaccagga 7440
aatagtactg ttggaactga atgagctttc gggagatggg agaagctcga gtgacatcgt 7500
cctcggacca ggcgagcaga ggcaggttat caatgtcggg atcgtgaccc tgaacgttgg 7560
tagcagaatg atgggcgttg tgtctgtcct tccaccaggt cacggagaag ccctggagtc 7620
cgttgccaaa gaccagaccc aggacgttat tccagtttcg gttcttgaag gtctggtggt 7680
ggcagatgtc atgagacagc catcccattt gctggtagtg cataccgagc acgagagcac 7740
caatgaagta caggtggtac tggaccagca tgaagaaggc aagcacgcca agacccaggg 7800
tggtcaagat cttgtacgag taccagaggg gagaggcgtc aaacatgcca gtggcgatca 7860
gctcttctcg gagctttcgg aaatcctcct gagcttcgtt gacggcagcc tggggaggca 7920
gctcggaagc ctggttgatc ttgggcattc gcttgagctt gtcgaaggct tcctgagagt 7980
gcataaccat gaaggcgtca gtagcatctc gtccctggta gttctcaatg atttcagctc 8040
caccagggtg gaagttcacc caagcggaga cgtcgtacac ctttccgtcg atgacgaggg 8100
gcagagcctg tcgagaagcc ttcaccatgg ttgtgaatta gggtggtgag aatggttggt 8160
tgtagggaag aatcaaaggc cggtctcggg atccgtgggt atatatatat atatatatat 8220
atacgatcct tcgttacctc cctgttctca aaactgtggt ttttcgtttt tcgttttttg 8280
ctttttttga tttttttagg gccaactaag cttccagatt tcgctaatca cctttgtact 8340
aattacaaga aaggaagaag ctgattagag ttgggctttt tatgcaactg tgctactcct 8400
tatctctgat atgaaagtgt agacccaatc acatcatgtc atttagagtt ggtaatactg 8460
ggaggataga taaggcacga aaacgagcca tagcagacat gctgggtgta gccaagcaga 8520
agaaagtaga tgggagccaa ttgacgagcg agggagctac gccaatccga catacgacac 8580
gctgagatcg tcttggccgg ggggtaccta cagatgtcca agggtaagtg cttgactgta 8640
attgtatgtc tgaggacaaa tatgtagtca gccgtataaa gtcataccag gcaccagtgc 8700
catcatcgaa ccactaactc tctatgatac atgcctccgg tattattgta ccatgcgtcg 8760
ctttgttaca tacgtatctt gcctttttct ctcagaaact ccagactttg gctattggtc 8820
gagataagcc cggaccatag tgagtctttc acactctaca tttctccctt gctccaacta 8880
tttaaattcc ttcacttcaa gttcattctt catctgcttc tgttttactt tgacaggcaa 8940
atgaagacat ggtacgactt gatggaggcc aagaacgcca tttcaccccg agacaccgaa 9000
gtgcctgaaa tcctggctgc ccccattgat aacatcggaa actacggtat tccggaaagt 9060
gtatatagaa cctttcccca gcttgtgtct gtggatatgg atggtgtaat cccctttgag 9120
tactcgtctt ggcttctctc cgagcagtat gaggctctct aatctagcgc atttaatatc 9180
tcaatgtatt tatatattta tcttctcatg cggccgctta ctgagccttg gcaccgggct 9240
gcttctcggc cattcgagcg aactgggaca ggtatcggag caggatgacg agaccttcat 9300
ggggcagagg gtttcggtag gggaggttgt gcttctggca cagctgttcc acctggtagg 9360
aaacggcagt gaggttgtgt cgaggcaggg tgggccagag atggtgctcg atctggtagt 9420
tcaggcctcc aaagaaccag tcagtaatga tgcctcgtcg aatgttcatg gtctcatgga 9480
tctgacccac agagaagcca tgtccgtccc agacggaatc accgatcttc tccagagggt 9540
agtggttcat gaagaccacg atggcaattc cgaagccacc gacgagctcg gaaacaaaga 9600
acaccagcat cgaggtcagg atggagggca taaagaagag gtggaacagg gtcttgagag 9660
tccagtgcag agcgagtcca atggcctctt tcttgtactg agatcggtag aactggttgt 9720
ctcggtcctt gagggatcga acggtcagca cagactggaa acaccagatg aatcgcagga 9780
gaatacagat gaccaggaaa tagtactgtt ggaactgaat gagctttcgg gagatgggag 9840
aagctcgagt gacatcgtcc tcggaccagg cgagcagagg caggttatca atgtcgggat 9900
cgtgaccctg aacgttggta gcagaatgat gggcgttgtg tctgtccttc caccaggtca 9960
cggagaagcc ctggagtccg ttgccaaaga ccagacccag gacgttattc cagtttcggt 10020
tcttgaaggt ctggtggtgg cagatgtcat gagacagcca tcccatttgc tggtagtgca 10080
taccgagcac gagagcacca atgaagtaca ggtggtactg gaccagcatg aagaaggcaa 10140
gcacgccaag acccagggtg gtcaagatct tgtacgagta ccagagggga gaggcgtcaa 10200
acatgccagt ggcgatcagc tcttctcgga gctttcggaa atcctcctga gcttcgttga 10260
cggcagcctg gggaggcagc tcggaagcct ggttgatctt gggcattcgc ttgagcttgt 10320
cgaaggcttc ctgagagtgc ataaccatga aggcgtcagt agcatctcgt ccctggtagt 10380
tctcaatgat ttcagctcca ccagggtgga agttcaccca agcggagacg tcgtacacct 10440
ttccgtcgat gacgaggggc agagcctgtc gagaagcctt caccatgggc aggacctgtg 10500
ttagtacatt gtcggggagt catcaattgg ttcgacaggt tgtcgactgt tagtatgagc 10560
tcaattgggc tctggtgggt cgatgacact tgtcatctgt ttctgttggg tcatgtttcc 10620
atcaccttct atggtactca caattcgtcc gattcgcccg aatccgttaa taccgacttt 10680
gatggccatg ttgatgtgtg tttaattcaa gaatgaatat agagaagaga agaagaaaaa 10740
agattcaatt gagccggcga tgcagaccct tatataaatg ttgccttgga cagacggagc 10800
aagcccgccc aaacctacgt tcggtataat atgttaagct ttttaacaca aaggtttggc 10860
ttggggtaac ctgatgtggt gcaaaagacc gggcgttggc gagccattgc gcgggcgaat 10920
ggggccgtga ctcgtctcaa attcgagggc gtgcctcaat tcgtgccccc gtggcttttt 10980
cccgccgttt ccgccccgtt tgcaccactg cagccgcttc tttggttcgg acaccttgct 11040
gcgagctagg tgccttgtgc tacttaaaaa gtggcctccc aacaccaaca tgacatgagt 11100
gcgtgggcca agacacgttg gcggggtcgc agtcggctca atggcccgga aaaaacgctg 11160
ctggagctgg ttcggacgca gtccgccgcg gcgtatggat atccgcaagg ttccatagcg 11220
ccattgccct ccgtcggcgt ctatcccgca acctctaaat agagcgggaa tataacccaa 11280
gcttcttttt tttcctttaa cacgcacacc cccaactatc atgttgctgc tgctgtttga 11340
ctctactctg tggaggggtg ctcccaccca acccaaccta caggtggatc cggcgctgtg 11400
attggctgat aagtctccta tccggactaa ttctgaccaa tgggacatgc gcgcaggacc 11460
caaatgccgc aattacgtaa ccccaacgaa atgcctaccc ctctttggag cccagcggcc 11520
ccaaatcccc ccaagcagcc cggttctacc ggcttccatc tccaagcaca agcagcccgg 11580
aattccttta cctgcaggat aacttcgtat aatgtatgct atacgaagtt atgatctctc 11640
tcttgagctt ttccataaca agttcttctg cctccaggaa gtccatgggt ggtttgatca 11700
tggttttggt gtagtggtag tgcagtggtg gtattgtgac tggggatgta gttgagaata 11760
agtcatacac aagtcagctt tcttcgagcc tcatataagt ataagtagtt caacgtatta 11820
gcactgtacc cagcatctcc gtatcgagaa acacaacaac atgccccatt ggacagatca 11880
tgcggataca caggttgtgc agtatcatac atactcgatc agacaggtcg tctgaccatc 11940
atacaagctg aacaagcgct ccatacttgc acgctctcta tatacacagt taaattacat 12000
atccatagtc taacctctaa cagttaatct tctggtaagc ctcccagcca gccttctggt 12060
atcgcttggc ctcctcaata ggatctcggt tctggccgta cagacctcgg ccgacaatta 12120
tgatatccgt tccggtagac atgacatcct caacagttcg gtactgctgt ccgagagcgt 12180
ctcccttgtc gtcaagaccc accccggggg tcagaataag ccagtcctca gagtcgccct 12240
taggtcggtt ctgggcaatg aagccaacca caaactcggg gtcggatcgg gcaagctcaa 12300
tggtctgctt ggagtactcg ccagtggcca gagagccctt gcaagacagc tcggccagca 12360
tgagcagacc tctggccagc ttctcgttgg gagaggggac taggaactcc ttgtactggg 12420
agttctcgta gtcagagacg tcctccttct tctgttcaga gacagtttcc tcggcaccag 12480
ctcgcaggcc agcaatgatt ccggttccgg gtacaccgtg ggcgttggtg atatcggacc 12540
actcggcgat tcggtgacac cggtactggt gcttgacagt gttgccaata tctgcgaact 12600
ttctgtcctc gaacaggaag aaaccgtgct taagagcaag ttccttgagg gggagcacag 12660
tgccggcgta ggtgaagtcg tcaatgatgt cgatatgggt tttgatcatg cacacataag 12720
gtccgacctt atcggcaagc tcaatgagct ccttggtggt ggtaacatcc agagaagcac 12780
acaggttggt tttcttggct gccacgagct tgagcactcg agcggcaaag gcggacttgt 12840
ggacgttagc tcgagcttcg taggagggca ttttggtggt gaagaggaga ctgaaataaa 12900
tttagtctgc agaacttttt atcggaacct tatctggggc agtgaagtat atgttatggt 12960
aatagttacg agttagttga acttatagat agactggact atacggctat cggtccaaat 13020
tagaaagaac gtcaatggct ctctgggcgt cgcctttgcc gacaaaaatg tgatcatgat 13080
gaaagccagc aatgacgttg cagctgatat tgttgtcggc caaccgcgcc gaaaacgcag 13140
ctgtcagacc cacagcctcc aacgaagaat gtatcgtcaa agtgatccaa gcacactcat 13200
agttggagtc gtactccaaa ggcggcaatg acgagtcaga cagatactcg tcgacgcgat 13260
aacttcgtat aatgtatgct atacgaagtt atcgtacgat agttagtaga caacaatcga 13320
taacgtctcg taccaaccac agattacgac ccattcgcag tcacagttca ctagggtttg 13380
ggttgcatcc gttgagagcg gtttgttttt aaccttctcc atgtgctcac tcaggttttg 13440
ggttcagatc aaatcaaggc gtgaaccact ttgtttgagg acaaatgtga cacaaccaac 13500
cagtgtcagg ggcaagtccg tgacaaaggg gaagatacaa tgcaattact gacagttaca 13560
gactgcctcg atgccctaac cttgccccaa aataagacaa ctgtcctcgt ttaagcgcaa 13620
ccctattcag cgtcacgtca taatagcgtt tggatagcac tagtctatga ggagcgtttt 13680
atgttgcggt gagggcgatt ggtgctcata tgggttcaat tgaggtggcg gaacgagctt 13740
agtcttcaat tgaggtgcga gcgacacaat tgggtgtcac gtggcctaat tgacctcggg 13800
tcgtggagtc cccagttata cagcaaccac gaggtgcatg ggtaggagac gtcaccagac 13860
aatagggttt tttttggact ggagagggtt gggcaaaagc gctcaacggg ctgtttgggg 13920
agctgtgggg gaggaattgg cgatatttgt gaggttaacg gctccgattt gcgtgttttg 13980
tcgctcctgc atctccccat acccatatct tccctcccca cctctttcca cgataatttt 14040
acggatcagc aataaggttc cttctcctag tttccacgtc catatatatc tatgctgcgt 14100
cgtccttttc gtgacatcac caaaacacat acaacaatgg ctgttactga cgtccttaag 14160
cgaaagtccg gtgtcatcgt cggcgacgat gtccgagccg tgagtatcca cgacaagatc 14220
agtgtcgaga cgacgcgttt tgtgtaatga cacaatccga aagtcgctag caacacacac 14280
tctctacaca aactaaccca gctctccatg gcctccacct cggctctgcc caagcagaac 14340
cctgccctcc gacgaaccgt cacttccacc actgtgaccg actcggagtc tgctgccgtc 14400
tctccctccg attctcccag acactcggcc tcctctacat cgctgtcttc catgtccgag 14460
gtggacattg ccaagcccaa gtccgagtac ggtgtcatgc tggataccta cggcaaccag 14520
ttcgaagttc ccgacttcac catcaaggac atctacaacg ctattcccaa gcactgcttc 14580
aagcgatctg ctctcaaggg atacggctac attcttcgag acattgtcct cctgactacc 14640
actttcagca tctggtacaa ctttgtgaca cccgagtaca ttccctccac tcctgctcga 14700
gccggtctgt gggctgtgta caccgttctt cagggactct tcggtactgg actgtgggtc 14760
attgcccacg agtgtggaca tggtgctttc tccgattccc gaatcatcaa cgacattact 14820
ggctgggtgc ttcactcttc cctgcttgtt ccctacttca gctggcaaat ctcccaccgg 14880
aagcatcaca aggccactgg aaacatggag cgagacatgg tcttcgttcc tcgaacccga 14940
gagcagcaag ctactcgact cggcaagatg acccacgaac tcgcccatct taccgaggaa 15000
actcctgctt tcaccctgct catgcttgtg cttcagcaac tggtcggttg gcccaactat 15060
ctcattacca acgttactgg acacaactac catgagcggc agcgagaggg tcgaggcaag 15120
ggaaagcaca acggtcttgg cggtggagtt aaccatttcg atccccgatc tcctctgtac 15180
gagaacagcg acgccaagct catcgtgctc tccgacattg gcattggtct tatggccacc 15240
gctctgtact ttctcgttca gaagttcgga ttctacaaca tggccatctg gtacttcgtt 15300
ccctacttgt gggttaacca ctggctcgtc gccattacct ttctgcagca cacagatcct 15360
actcttcccc actacaccaa cgacgagtgg aactttgtgc gaggtgccgc tgcaaccatc 15420
gaccgagaga tgggcttcat tggacgtcat ctgctccacg gcattatcga gactcacgtc 15480
ctgcatcact acgtctcttc cattcccttc tacaatgcgg acgaagctac cgaggccatc 15540
aaacctatca tgggcaagca ctatcgagct gatgtccagg acggtcctcg aggattcatt 15600
cgagccatgt accgatctgc acgaatgtgc cagtgggttg aaccctccgc tggtgccgag 15660
ggagctggca agggtgtcct gttctttcga aaccgaaaca atgtgggcac tcctcccgct 15720
gtcatcaagc ccgttgccta agc 15743
<210>56
<211>1434
<212>DNA
<213>串珠镰刀菌
<220>
<221>CDS
<222>(1)..(1434)
<223>合成Δ-12去饱和酶(经密码子优化用于解脂耶氏酵母)
<300>
<302>适于改变含油酵母中多不饱和脂肪酸含量的Δ-12去饱和酶
<310>WO 2005/047485
<311>2004-11-12
<312>2005-05-26
<313>(1)..(1434)
<300>
<302>适于改变含油酵母中多不饱和脂肪酸含量的Δ-12去饱和酶
<310>US 2005-0216975-A1
<311>2004-11-10
<312>2005-09-29
<313>(1)..(1434)
<400>56
atg gcc tcc acc tcg gct ctg ccc aag cag aac cct gcc ctc cga cga 48
Met Ala Ser Thr Ser Ala Leu Pro Lys Gln Asn Pro Ala Leu Arg Arg
1 5 10 15
acc gtc act tcc acc act gtg acc gac tcg gag tct gct gcc gtc tct 96
Thr Val Thr Ser Thr Thr Val Thr Asp Ser Glu Ser Ala Ala Val Ser
20 25 30
ccc tcc gat tct ccc aga cac tcg gcc tcc tct aca tcg ctg tct tcc 144
Pro Ser Asp Ser Pro Arg His Ser Ala Ser Ser Thr Ser Leu Ser Ser
35 40 45
atg tcc gag gtg gac att gcc aag ccc aag tcc gag tac ggt gtc atg 192
Met Ser Glu Val Asp Ile Ala Lys Pro Lys Ser Glu Tyr Gly Val Met
50 55 60
ctg gat acc tac ggc aac cag ttc gaa gtt ccc gac ttc acc atc aag 240
Leu Asp Thr Tyr Gly Asn Gln Phe Glu Val Pro Asp Phe Thr Ile Lys
65 70 75 80
gac atc tac aac gct att ccc aag cac tgc ttc aag cga tct gct ctc 288
Asp Ile Tyr Asn Ala Ile Pro Lys His Cys Phe Lys Arg Ser Ala Leu
85 90 95
aag gga tac ggc tac att ctt cga gac att gtc ctc ctg act acc act 336
Lys Gly Tyr Gly Tyr Ile Leu Arg Asp Ile Val Leu Leu Thr Thr Thr
100 105 110
ttc agc atc tgg tac aac ttt gtg aca ccc gag tac att ccc tcc act 384
Phe Ser Ile Trp Tyr Asn Phe Val Thr Pro Glu Tyr Ile Pro Ser Thr
115 120 125
cct gct cga gcc ggt ctg tgg gct gtg tac acc gtt ctt cag gga ctc 432
Pro Ala Arg Ala Gly Leu Trp Ala Val Tyr Thr Val Leu Gln Gly Leu
130 135 140
ttc ggt act gga ctg tgg gtc att gcc cac gag tgt gga cat ggt gct 480
Phe Gly Thr Gly Leu Trp Val Ile Ala His Glu Cys Gly His Gly Ala
145 150 155 160
ttc tcc gat tcc cga atc atc aac gac att act ggc tgg gtg ctt cac 528
Phe Ser Asp Ser Arg Ile Ile Asn Asp Ile Thr Gly Trp Val Leu His
165 170 175
tct tcc ctg ctt gtt ccc tac ttc agc tgg caa atc tcc cac cgg aag 576
Ser Ser Leu Leu Val Pro Tyr Phe Ser Trp Gln Ile Ser His Arg Lys
180 185 190
cat cac aag gcc act gga aac atg gag cga gac atg gtc ttc gtt cct 624
His His Lys Ala Thr Gly Asn Met Glu Arg Asp Met Val Phe Val Pro
195 200 205
cga acc cga gag cag caa gct act cga ctc ggc aag atg acc cac gaa 672
Arg Thr Arg Glu Gln Gln Ala Thr Arg Leu Gly Lys Met Thr His Glu
210 215 220
ctc gcc cat ctt acc gag gaa act cct gct ttc acc ctg ctc atg ctt 720
Leu Ala His Leu Thr Glu Glu Thr Pro Ala Phe Thr Leu Leu Met Leu
225 230 235 240
gtg ctt cag caa ctg gtc ggt tgg ccc aac tat ctc att acc aac gtt 768
Val Leu Gln Gln Leu Val Gly Trp Pro Asn Tyr Leu Ile Thr Asn Val
245 250 255
act gga cac aac tac cat gag cgg cag cga gag ggt cga ggc aag gga 816
Thr Gly His Asn Tyr His Glu Arg Gln Arg Glu Gly Arg Gly Lys Gly
260 265 270
aag cac aac ggt ctt ggc ggt gga gtt aac cat ttc gat ccc cga tct 864
Lys His Asn Gly Leu Gly Gly Gly Val Asn His Phe Asp Pro Arg Ser
275 280 285
cct ctg tac gag aac agc gac gcc aag ctc atc gtg ctc tcc gac att 912
Pro Leu Tyr Glu Asn Ser Asp Ala Lys Leu Ile Val Leu Ser Asp Ile
290 295 300
ggc att ggt ctt atg gcc acc gct ctg tac ttt ctc gtt cag aag ttc 960
Gly Ile Gly Leu Met Ala Thr Ala Leu Tyr Phe Leu Val Gln Lys Phe
305 310 315 320
gga ttc tac aac atg gcc atc tgg tac ttc gtt ccc tac ttg tgg gtt 1008
Gly Phe Tyr Asn Met Ala Ile Trp Tyr Phe Val Pro Tyr Leu Trp Val
325 330 335
aac cac tgg ctc gtc gcc att acc ttt ctg cag cac aca gat cct act 1056
Asn His Trp Leu Val Ala Ile Thr Phe Leu Gln His Thr Asp Pro Thr
340 345 350
ctt ccc cac tac acc aac gac gag tgg aac ttt gtg cga ggt gcc gct 1104
Leu Pro His Tyr Thr Asn Asp Glu Trp Asn Phe Val Arg Gly Ala Ala
355 360 365
gca acc atc gac cga gag atg ggc ttc att gga cgt cat ctg ctc cac 1152
Ala Thr Ile Asp Arg Glu Met Gly Phe Ile Gly Arg His Leu Leu His
370 375 380
ggc att atc gag act cac gtc ctg cat cac tac gtc tct tcc att ccc 1200
Gly Ile Ile Glu Thr His Val Leu His His Tyr Val Ser Ser Ile Pro
385 390 395 400
ttc tac aat gcg gac gaa gct acc gag gcc atc aaa cct atc atg ggc 1248
Phe Tyr Asu Ala Asp Glu Ala Thr Glu Ala Ile Lys Pro Ile Met Gly
405 410 415
aag cac tat cga gct gat gtc cag gac ggt cct cga gga ttc att cga 1296
Lys His Tyr Arg Ala Asp Val Gln Asp Gly Pro Arg Gly Phe Ile Arg
420 425 430
gcc atg tac cga tct gca cga atg tgc cag tgg gtt gaa ccc tcc gct 1344
Ala Met Tyr Arg Ser Ala Arg Met Cys Gln Trp Val Glu Pro Ser Ala
435 440 445
ggt gcc gag gga gct ggc aag ggt gtc ctg ttc ttt cga aac cga aac 1392
Gly Ala Glu Gly Ala Gly Lys Gly Val Leu Phe Phe Arg Asn Arg Asn
450 455 460
aat gtg ggc act cct ccc gct gtc atc aag ccc gtt gcc taa 1434
Asn Val Gly Thr Pro Pro Ala Val Ile Lys Pro Val Ala
465 470 475
<210>57
<211>477
<212>PRT
<213>串珠镰刀菌
<400>57
Met Ala Ser Thr Ser Ala Leu Pro Lys Gln Asn Pro Ala Leu Arg Arg
1 5 10 15
Thr Val Thr Ser Thr Thr Val Thr Asp Ser Glu Ser Ala Ala Val Ser
20 25 30
Pro Ser Asp Ser Pro Arg His Ser Ala Ser Ser Thr Ser Leu Ser Ser
35 40 45
Met Ser Glu Val Asp Ile Ala Lys Pro Lys Ser Glu Tyr Gly Val Met
50 55 60
Leu Asp Thr Tyr Gly Asn Gln Phe Glu Val Pro Asp Phe Thr Ile Lys
65 70 75 80
Asp Ile Tyr Asn Ala Ile Pro Lys His Cys Phe Lys Arg Ser Ala Leu
85 90 95
Lys Gly Tyr Gly Tyr Ile Leu Arg Asp Ile Val Leu Leu Thr Thr Thr
100 105 110
Phe Ser Ile Trp Tyr Asn Phe Val Thr Pro Glu Tyr Ile Pro Ser Thr
115 120 125
Pro Ala Arg Ala Gly Leu Trp Ala Val Tyr Thr Val Leu Gln Gly Leu
130 135 140
Phe Gly Thr Gly Leu Trp Val Ile Ala His Glu Cys Gly His Gly Ala
145 150 155 160
Phe Ser Asp Ser Arg Ile Ile Asn Asp Ile Thr Gly Trp Val Leu His
165 170 175
Ser Ser Leu Leu Val Pro Tyr Phe Ser Trp Gln Ile Ser His Arg Lys
180 185 190
His His Lys Ala Thr Gly Asn Met Glu Arg Asp Met Val Phe Val Pro
195 200 205
Arg Thr Arg Glu Gln Gln Ala Thr Arg Leu Gly Lys Met Thr His Glu
210 215 220
Leu Ala His Leu Thr Glu Glu Thr Pro Ala Phe Thr Leu Leu Met Leu
225 230 235 240
Val Leu Gln Gln Leu Val Gly Trp Pro Asn Tyr Leu Ile Thr Asn Val
245 250 255
Thr Gly His Asn Tyr His Glu Arg Gln Arg Glu Gly Arg Gly Lys Gly
260 265 270
Lys His Asn Gly Leu Gly Gly Gly Val Asn His Phe Asp Pro Arg Ser
275 280 285
Pro Leu Tyr Glu Asn Ser Asp Ala Lys Leu Ile Val Leu Ser Asp Ile
290 295 300
Gly Ile Gly Leu Met Ala Thr Ala Leu Tyr Phe Leu Val Gln Lys Phe
305 310 315 320
Gly Phe Tyr Asn Met Ala Ile Trp Tyr Phe Val Pro Tyr Leu Trp Val
325 330 335
Asn His Trp Leu Val Ala Ile Thr Phe Leu Gln His Thr Asp Pro Thr
340 345 350
Leu Pro His Tyr Thr Asn Asp Glu Trp Asn Phe Val Arg Gly Ala Ala
355 360 365
Ala Thr Ile Asp Arg Glu Met Gly Phe Ile Gly Arg His Leu Leu His
370 375 380
Gly Ile Ile Glu Thr His Val Leu His His Tyr Val Ser Ser Ile Pro
385 390 395 400
Phe Tyr Asn Ala Asp Glu Ala Thr Glu Ala Ile Lys Pro Ile Met Gly
405 410 415
Lys His Tyr Arg Ala Asp Val Gln Asp Gly Pro Arg Gly Phe Ile Arg
420 425 430
Ala Met Tyr Arg Ser Ala Arg Met Cys Gln Trp Val Glu Pro Ser Ala
435 440 445
Gly Ala Glu Gly Ala Gly Lys Gly Val Leu Phe Phe Arg Asn Arg Asn
450 455 460
Asn Val Gly Thr Pro Pro Ala Val Ile Lys Pro Val Ala
465 470 475
<210>58
<211>6303
<212>DNA
<213>人工序列
<220>
<223>质粒pZKUE3S
<400>58
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa 60
gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac 120
ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta 180
aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct 240
agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat 300
tcattcatgt tagttgcgta cgaggaaact gtctctgaac agaagaagga ggacgtctct 360
gactacgaga actcccagta caaggagttc ctagtcccct ctcccaacga gaagctggcc 420
agaggtctgc tcatgctggc cgagctgtct tgcaagggct ctctggccac tggcgagtac 480
tccaagcaga ccattgagct tgcccgatcc gaccccgagt ttgtggttgg cttcattgcc 540
cagaaccgac ctaagggcga ctctgaggac tggcttattc tgacccccgg ggtgggtctt 600
gacgacaagg gagacgctct cggacagcag taccgaactg ttgaggatgt catgtctacc 660
ggaacggata tcataattgt cggccgaggt ctgtacggcc agaaccgaga tcctattgag 720
gaggccaagc gataccagaa ggctggctgg gaggcttacc agaagattaa ctgttagagg 780
ttagactatg gatatgtaat ttaactgtgt atatagagag cgtgcaagta tggagcgctt 840
gttcagcttg tatgatggtc agacgacctg tctgatcgag tatgtatgat actgcacaac 900
ctgtgtatcc gcatgatctg tccaatgggg catgttgttg tgtttctcga tacggagatg 960
ctgggtacag tgctaatacg ttgaactact tatacttata tgaggctcga agaaagctga 1020
cttgtgtatg acttaattaa tcgagcttgg cgtaatcatg gtcatagctg tttcctgtgt 1080
gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 1140
cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 1200
tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 1260
gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 1320
ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 1380
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 1440
aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 1500
atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 1560
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 1620
ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 1680
gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 1740
accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 1800
cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 1860
cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct 1920
gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 1980
aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 2040
aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 2100
actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 2160
taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 2220
gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 2280
tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 2340
ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 2400
accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 2460
agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 2520
acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 2580
tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 2640
cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 2700
tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 2760
ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt 2820
gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc 2880
tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 2940
ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 3000
gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 3060
cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 3120
gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 3180
ttccgcgcac atttccccga aaagtgccac ctgacgcgcc ctgtagcggc gcattaagcg 3240
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 3300
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 3360
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 3420
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 3480
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 3540
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 3600
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 3660
ttacaatttc cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc 3720
ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat taagttgggt 3780
aacgccaggg ttttcccagt cacgacgttg taaaacgacg gccagtgaat tgtaatacga 3840
ctcactatag ggcgaattgg gtaccgggcc ccacctcgag gtcgacgagt atctgtctga 3900
ctcgtcattg catgcctttg gagtacgact ccaactatga gtgtgcttgg atcactttga 3960
cgatacattc ttcgttggag gctgtgggtc tgacagctgc gttttcggcg cggttggccg 4020
acaacaatat cagctgcaac gtcattgctg gctttcatca tgatcacatt tttgtcggca 4080
aaggcgacgc ccagagagcc attgacgttc tttctaattt ggaccgatag ccgtatagtc 4140
cagtctatct ataagttcaa ctaactcgta actattacca taacatatac ttcactgccc 4200
cagataaggt tccgataaaa agttctgcag actaaattta tttcagtctc ctcttcacca 4260
ccaaaatgcc ctcctacgaa gctcgagtgc tcaagctcgt ggcagccaag aaaaccaacc 4320
tgtgtgcttc tctggatgtt accaccacca aggagctcat tgagcttgcc gataaggtcg 4380
gaccttatgt gtgcatgatc aaaacccata tcgacatcat tgacgacttc acctacgccg 4440
gcactgtgct ccccctcaag gaacttgctc ttaagcacgg tttcttcctg ttcgaggaca 4500
gaaagttcgc agatattggc aacactgtca agcaccagta ccggtgtcac cgaatcgccg 4560
agtggtccga tatcaccaac gcccacggtg tttaaacccg gaaccggaat cgataagctt 4620
gatatcgaat tcatgctgtt catcgtggtt aatgctgctg tgtgctgtgt gtgtgtgttg 4680
tttggcgctc attgttgcgt tatgcagcgt acaccacaat attggaagct tattagcctt 4740
tctatttttt cgtttgcaag gcttaacaac attgctgtgg agagggatgg ggatatggag 4800
gccgctggag ggagtcggag aggcgttttg gagcggcttg gcctggcgcc cagctcgcga 4860
aacgcaccta ggaccctttg gcacgccgaa atgtgccact tttcagtcta gtaacgcctt 4920
acctacgtca ttccatgcgt gcatgtttgc gccttttttc ccttgccctt gatcgccaca 4980
cagtacagtg cactgtacag tggaggtttt gggggggtct tagatgggag ctaaaagcgg 5040
cctagcggta cactagtggg attgtatgga gtggcatgga gcctaggtgg agcctgacag 5100
gacgcacgac cggctagccc gtgacagacg atgggtggct cctgttgtcc accgcgtaca 5160
aatgtttggg ccaaagtctt gtcagccttg cttgcgaacc taattcccaa ttttgtcact 5220
tcgcaccccc attgatcgag ccctaacccc tgcccatcag gcaatccaat taagctcgca 5280
ttgtctgcct tgtttagttt ggctcctgcc cgtttcggcg tccacttgca caaacacaaa 5340
caagcattat atataaggct cgtctctccc tcccaaccac actcactttt ttgcccgtct 5400
tcccttgcta acacaaaagt caagaacaca aacaaccacc ccaaccccct tacacacaag 5460
acatatctac accatggagt ctggacccat gcctgctggc attcccttcc ctgagtacta 5520
tgacttcttt atggactgga agactcccct ggccatcgct gccacctaca ctgctgccgt 5580
cggtctcttc aaccccaagg ttggcaaggt ctcccgagtg gttgccaagt cggctaacgc 5640
aaagcctgcc gagcgaaccc agtccggagc tgccatgact gccttcgtct ttgtgcacaa 5700
cctcattctg tgtgtctact ctggcatcac cttctactac atgtttcctg ctatggtcaa 5760
gaacttccga acccacacac tgcacgaagc ctactgcgac acggatcagt ccctctggaa 5820
caacgcactt ggctactggg gttacctctt ctacctgtcc aagttctacg aggtcattga 5880
caccatcatc atcatcctga agggacgacg gtcctcgctg cttcagacct accaccatgc 5940
tggagccatg attaccatgt ggtctggcat caactaccaa gccactccca tttggatctt 6000
tgtggtcttc aactccttca ttcacaccat catgtactgt tactatgcct tcacctctat 6060
cggattccat cctcctggca aaaagtacct gacttcgatg cagattactc agtttctggt 6120
cggtatcacc attgccgtgt cctacctctt cgttcctggc tgcatccgaa cacccggtgc 6180
tcagatggct gtctggatca acgtcggcta cctgtttccc ttgacctatc tgttcgtgga 6240
ctttgccaag cgaacctact ccaagcgatc tgccattgcc gctcagaaaa aggctcagta 6300
agc 6303
<210>59
<211>21
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-5-1
<400>59
cgacaagatg gaatgagaat g 21
<210>60
<211>22
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-5-2
<400>60
ctggtttttc aactacttct ac 22
<210>61
<211>21
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-5-3
<400>61
gtactgtcct gtgtctgttc c 21
<210>62
<211>22
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-5-4
<400>62
ctacatcgtc cgaaagcaca ag 22
<210>63
<211>24
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-3-1
<400>63
ctaccagatc gagcaccatc tctg 24
<210>64
<211>21
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-3-2
<400>64
ctaccaggtg gaacagctgt g 21
<210>65
<211>22
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-3-3
<400>65
tctgccccat gaaggtctcg tc 22
<210>66
<211>22
<212>DNA
<213>人工序列
<220>
<223>引物pZP-GW-3-4
<400>66
cctgtcccag ttcgctcgaa tg 22
<210>67
<211>44
<212>DNA
<213>人工序列
<220>
<223>Genome Walker衔接子-1
<400>67
gtaatacgac tatagggcac gcgtggtcga cggcccgggc tggt 44
<210>68
<211>8
<212>DNA
<213>人工序列
<220>
<223>Genome Walker衔接子-2
<220>
<221>misc_feature
<222>(1)..(1)
<223>5’末端与-PO4基附连
<220>
<221>misc_feature
<222>(8)..(8)
<223>3’末端与-H2N基附连
<400>68
accagccc 8
<210>69
<211>22
<212>DNA
<213>人工序列
<220>
<223>巢式衔接子引物
<400>69
gtaatacgac tcactatagg gc 22
<210>70
<211>36
<212>DNA
<213>人工序列
<220>
<223>引物Per10F1
<400>70
gatcaaccat ggggggaagt tcacatgcat tcgctg 36
<210>71
<211>29
<212>DNA
<213>人工序列
<220>
<223>引物ZPGW-5-5
<400>71
gttatagttt tcatgtgaaa taccgagag 29
<210>72
<211>37
<212>DNA
<213>人工序列
<220>
<223>引物Per10R
<400>72
gatcaagcgg ccgccagacc tcgtcattat ctgatag 37
<210>73
<211>7222
<212>DNA
<213>人工序列
<220>
<223>质粒pFBAIn-MOD-1
<400>73
catggatcca ggcctgttaa cggccattac ggcctgcagg atccgaaaaa acctcccaca 60
cctccccctg aacctgaaac ataaaatgaa tgcaattgtt gttgttaact tgtttattgc 120
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 180
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctgcgg 240
ccgcaagtgt ggatggggaa gtgagtgccc ggttctgtgt gcacaattgg caatccaaga 300
tggatggatt caacacaggg atatagcgag ctacgtggtg gtgcgaggat atagcaacgg 360
atatttatgt ttgacacttg agaatgtacg atacaagcac tgtccaagta caatactaaa 420
catactgtac atactcatac tcgtacccgg gcaacggttt cacttgagtg cagtggctag 480
tgctcttact cgtacagtgt gcaatactgc gtatcatagt ctttgatgta tatcgtattc 540
attcatgtta gttgcgtacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 600
gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 660
tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 720
cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 780
gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 840
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 900
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 960
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 1020
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 1080
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 1140
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 1200
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 1260
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 1320
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 1380
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 1440
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 1500
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 1560
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 1620
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 1680
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 1740
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 1800
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 1860
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 1920
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 1980
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 2040
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 2100
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 2160
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 2220
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 2280
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 2340
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 2400
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 2460
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 2520
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 2580
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 2640
cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 2700
acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc 2760
ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct 2820
ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat 2880
ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 2940
acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc 3000
tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg 3060
atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat ttccattcgc 3120
cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc 3180
agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc 3240
agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat 3300
tgggtaccgg gccccccctc gaggtcgatg gtgtcgataa gcttgatatc gaattcatgt 3360
cacacaaacc gatcttcgcc tcaaggaaac ctaattctac atccgagaga ctgccgagat 3420
ccagtctaca ctgattaatt ttcgggccaa taatttaaaa aaatcgtgtt atataatatt 3480
atatgtatta tatatataca tcatgatgat actgacagtc atgtcccatt gctaaataga 3540
cagactccat ctgccgcctc caactgatgt tctcaatatt taaggggtca tctcgcattg 3600
tttaataata aacagactcc atctaccgcc tccaaatgat gttctcaaaa tatattgtat 3660
gaacttattt ttattactta gtattattag acaacttact tgctttatga aaaacacttc 3720
ctatttagga aacaatttat aatggcagtt cgttcattta acaatttatg tagaataaat 3780
gttataaatg cgtatgggaa atcttaaata tggatagcat aaatgatatc tgcattgcct 3840
aattcgaaat caacagcaac gaaaaaaatc ccttgtacaa cataaatagt catcgagaaa 3900
tatcaactat caaagaacag ctattcacac gttactattg agattattat tggacgagaa 3960
tcacacactc aactgtcttt ctctcttcta gaaatacagg tacaagtatg tactattctc 4020
attgttcata cttctagtca tttcatccca catattcctt ggatttctct ccaatgaatg 4080
acattctatc ttgcaaattc aacaattata ataagatata ccaaagtagc ggtatagtgg 4140
caatcaaaaa gcttctctgg tgtgcttctc gtatttattt ttattctaat gatccattaa 4200
aggtatatat ttatttcttg ttatataatc cttttgttta ttacatgggc tggatacata 4260
aaggtatttt gatttaattt tttgcttaaa ttcaatcccc cctcgttcag tgtcaactgt 4320
aatggtagga aattaccata cttttgaaga agcaaaaaaa atgaaagaaa aaaaaaatcg 4380
tatttccagg ttagacgttc cgcagaatct agaatgcggt atgcggtaca ttgttcttcg 4440
aacgtaaaag ttgcgctccc tgagatattg tacatttttg cttttacaag tacaagtaca 4500
tcgtacaact atgtactact gttgatgcat ccacaacagt ttgttttgtt tttttttgtt 4560
tttttttttt ctaatgattc attaccgcta tgtataccta cttgtacttg tagtaagccg 4620
ggttattggc gttcaattaa tcatagactt atgaatctgc acggtgtgcg ctgcgagtta 4680
cttttagctt atgcatgcta cttgggtgta atattgggat ctgttcggaa atcaacggat 4740
gctcaatcga tttcgacagt aattaattaa gtcatacaca agtcagcttt cttcgagcct 4800
catataagta taagtagttc aacgtattag cactgtaccc agcatctccg tatcgagaaa 4860
cacaacaaca tgccccattg gacagatcat gcggatacac aggttgtgca gtatcataca 4920
tactcgatca gacaggtcgt ctgaccatca tacaagctga acaagcgctc catacttgca 4980
cgctctctat atacacagtt aaattacata tccatagtct aacctctaac agttaatctt 5040
ctggtaagcc tcccagccag ccttctggta tcgcttggcc tcctcaatag gatctcggtt 5100
ctggccgtac agacctcggc cgacaattat gatatccgtt ccggtagaca tgacatcctc 5160
aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca ccccgggggt 5220
cagaataagc cagtcctcag agtcgccctt aggtcggttc tgggcaatga agccaaccac 5280
aaactcgggg tcggatcggg caagctcaat ggtctgcttg gagtactcgc cagtggccag 5340
agagcccttg caagacagct cggccagcat gagcagacct ctggccagct tctcgttggg 5400
agaggggact aggaactcct tgtactggga gttctcgtag tcagagacgt cctccttctt 5460
ctgttcagag acagtttcct cggcaccagc tcgcaggcca gcaatgattc cggttccggg 5520
tacaccgtgg gcgttggtga tatcggacca ctcggcgatt cggtgacacc ggtactggtg 5580
cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga aaccgtgctt 5640
aagagcaagt tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt caatgatgtc 5700
gatatgggtt ttgatcatgc acacataagg tccgacctta tcggcaagct caatgagctc 5760
cttggtggtg gtaacatcca gagaagcaca caggttggtt ttcttggctg ccacgagctt 5820
gagcactcga gcggcaaagg cggacttgtg gacgttagct cgagcttcgt aggagggcat 5880
tttggtggtg aagaggagac tgaaataaat ttagtctgca gaacttttta tcggaacctt 5940
atctggggca gtgaagtata tgttatggta atagttacga gttagttgaa cttatagata 6000
gactggacta tacggctatc ggtccaaatt agaaagaacg tcaatggctc tctgggcgtc 6060
gcctttgccg acaaaaatgt gatcatgatg aaagccagca atgacgttgc agctgatatt 6120
gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca acgaagaatg 6180
tatcgtcaaa gtgatccaag cacactcata gttggagtcg tactccaaag gcggcaatga 6240
cgagtcagac agatactcgt cgaaaacagt gtacgcagat ctactataga ggaacattta 6300
aattgccccg gagaagacgg ccaggccgcc tagatgacaa attcaacaac tcacagctga 6360
ctttctgcca ttgccactag gggggggcct ttttatatgg ccaagccaag ctctccacgt 6420
cggttgggct gcacccaaca ataaatgggt agggttgcac caacaaaggg atgggatggg 6480
gggtagaaga tacgaggata acggggctca atggcacaaa taagaacgaa tactgccatt 6540
aagactcgtg atccagcgac tgacaccatt gcatcatcta agggcctcaa aactacctcg 6600
gaactgctgc gctgatctgg acaccacaga ggttccgagc actttaggtt gcaccaaatg 6660
tcccaccagg tgcaggcaga aaacgctgga acagcgtgta cagtttgtct taacaaaaag 6720
tgagggcgct gaggtcgagc agggtggtgt gacttgttat agcctttaga gctgcgaaag 6780
cgcgtatgga tttggctcat caggccagat tgagggtctg tggacacatg tcatgttagt 6840
gtacttcaat cgccccctgg atatagcccc gacaataggc cgtggcctca tttttttgcc 6900
ttccgcacat ttccattgct cggtacccac accttgcttc tcctgcactt gccaacctta 6960
atactggttt acattgacca acatcttaca agcggggggc ttgtctaggg tatatataaa 7020
cagtggctct cccaatcggt tgccagtctc ttttttcctt tctttcccca cagattcgaa 7080
atctaaacta cacatcacag aattccgagc cgtgagtatc cacgacaaga tcagtgtcga 7140
gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct agcaacacac actctctaca 7200
caaactaacc cagctctggt ac 7222
<210>74
<211>8133
<212>DNA
<213>人工序列
<220>
<223>质粒pFBA IN-Pex10
<400>74
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa 60
gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac 120
ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta 180
aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct 240
agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat 300
tcattcatgt tagttgcgta cgagccggaa gcataaagtg taaagcctgg ggtgcctaat 360
gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 420
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 480
ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 540
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 600
gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 660
tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 720
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 780
tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 840
cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 900
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 960
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 1020
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 1080
ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc 1140
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 1200
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 1260
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 1320
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 1380
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 1440
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 1500
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 1560
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 1620
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 1680
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg 1740
ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 1800
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 1860
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 1920
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 1980
actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt 2040
caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac 2100
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 2160
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 2220
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 2280
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 2340
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 2400
cccgaaaagt gccacctgac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg 2460
ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct 2520
tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc 2580
ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg 2640
atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt 2700
ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg 2760
tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc 2820
tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttccattc 2880
gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg 2940
ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc 3000
ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac tatagggcga 3060
attgggtacc gggccccccc tcgaggtcga tggtgtcgat aagcttgata tcgaattcat 3120
gtcacacaaa ccgatcttcg cctcaaggaa acctaattct acatccgaga gactgccgag 3180
atccagtcta cactgattaa ttttcgggcc aataatttaa aaaaatcgtg ttatataata 3240
ttatatgtat tatatatata catcatgatg atactgacag tcatgtccca ttgctaaata 3300
gacagactcc atctgccgcc tccaactgat gttctcaata tttaaggggt catctcgcat 3360
tgtttaataa taaacagact ccatctaccg cctccaaatg atgttctcaa aatatattgt 3420
atgaacttat ttttattact tagtattatt agacaactta cttgctttat gaaaaacact 3480
tcctatttag gaaacaattt ataatggcag ttcgttcatt taacaattta tgtagaataa 3540
atgttataaa tgcgtatggg aaatcttaaa tatggatagc ataaatgata tctgcattgc 3600
ctaattcgaa atcaacagca acgaaaaaaa tcccttgtac aacataaata gtcatcgaga 3660
aatatcaact atcaaagaac agctattcac acgttactat tgagattatt attggacgag 3720
aatcacacac tcaactgtct ttctctcttc tagaaataca ggtacaagta tgtactattc 3780
tcattgttca tacttctagt catttcatcc cacatattcc ttggatttct ctccaatgaa 3840
tgacattcta tcttgcaaat tcaacaatta taataagata taccaaagta gcggtatagt 3900
ggcaatcaaa aagcttctct ggtgtgcttc tcgtatttat ttttattcta atgatccatt 3960
aaaggtatat atttatttct tgttatataa tccttttgtt tattacatgg gctggataca 4020
taaaggtatt ttgatttaat tttttgctta aattcaatcc cccctcgttc agtgtcaact 4080
gtaatggtag gaaattacca tacttttgaa gaagcaaaaa aaatgaaaga aaaaaaaaat 4140
cgtatttcca ggttagacgt tccgcagaat ctagaatgcg gtatgcggta cattgttctt 4200
cgaacgtaaa agttgcgctc cctgagatat tgtacatttt tgcttttaca agtacaagta 4260
catcgtacaa ctatgtacta ctgttgatgc atccacaaca gtttgttttg tttttttttg 4320
tttttttttt ttctaatgat tcattaccgc tatgtatacc tacttgtact tgtagtaagc 4380
cgggttattg gcgttcaatt aatcatagac ttatgaatct gcacggtgtg cgctgcgagt 4440
tacttttagc ttatgcatgc tacttgggtg taatattggg atctgttcgg aaatcaacgg 4500
atgctcaatc gatttcgaca gtaattaatt aagtcataca caagtcagct ttcttcgagc 4560
ctcatataag tataagtagt tcaacgtatt agcactgtac ccagcatctc cgtatcgaga 4620
aacacaacaa catgccccat tggacagatc atgcggatac acaggttgtg cagtatcata 4680
catactcgat cagacaggtc gtctgaccat catacaagct gaacaagcgc tccatacttg 4740
cacgctctct atatacacag ttaaattaca tatccatagt ctaacctcta acagttaatc 4800
ttctggtaag cctcccagcc agccttctgg tatcgcttgg cctcctcaat aggatctcgg 4860
ttctggccgt acagacctcg gccgacaatt atgatatccg ttccggtaga catgacatcc 4920
tcaacagttc ggtactgctg tccgagagcg tctcccttgt cgtcaagacc caccccgggg 4980
gtcagaataa gccagtcctc agagtcgccc ttaggtcggt tctgggcaat gaagccaacc 5040
acaaactcgg ggtcggatcg ggcaagctca atggtctgct tggagtactc gccagtggcc 5100
agagagccct tgcaagacag ctcggccagc atgagcagac ctctggccag cttctcgttg 5160
ggagagggga ctaggaactc cttgtactgg gagttctcgt agtcagagac gtcctccttc 5220
ttctgttcag agacagtttc ctcggcacca gctcgcaggc cagcaatgat tccggttccg 5280
ggtacaccgt gggcgttggt gatatcggac cactcggcga ttcggtgaca ccggtactgg 5340
tgcttgacag tgttgccaat atctgcgaac tttctgtcct cgaacaggaa gaaaccgtgc 5400
ttaagagcaa gttccttgag ggggagcaca gtgccggcgt aggtgaagtc gtcaatgatg 5460
tcgatatggg ttttgatcat gcacacataa ggtccgacct tatcggcaag ctcaatgagc 5520
tccttggtgg tggtaacatc cagagaagca cacaggttgg ttttcttggc tgccacgagc 5580
ttgagcactc gagcggcaaa ggcggacttg tggacgttag ctcgagcttc gtaggagggc 5640
attttggtgg tgaagaggag actgaaataa atttagtctg cagaactttt tatcggaacc 5700
ttatctgggg cagtgaagta tatgttatgg taatagttac gagttagttg aacttataga 5760
tagactggac tatacggcta tcggtccaaa ttagaaagaa cgtcaatggc tctctgggcg 5820
tcgcctttgc cgacaaaaat gtgatcatga tgaaagccag caatgacgtt gcagctgata 5880
ttgttgtcgg ccaaccgcgc cgaaaacgca gctgtcagac ccacagcctc caacgaagaa 5940
tgtatcgtca aagtgatcca agcacactca tagttggagt cgtactccaa aggcggcaat 6000
gacgagtcag acagatactc gtcgaaaaca gtgtacgcag atctactata gaggaacatt 6060
taaattgccc cggagaagac ggccaggccg cctagatgac aaattcaaca actcacagct 6120
gactttctgc cattgccact aggggggggc ctttttatat ggccaagcca agctctccac 6180
gtcggttggg ctgcacccaa caataaatgg gtagggttgc accaacaaag ggatgggatg 6240
gggggtagaa gatacgagga taacggggct caatggcaca aataagaacg aatactgcca 6300
ttaagactcg tgatccagcg actgacacca ttgcatcatc taagggcctc aaaactacct 6360
cggaactgct gcgctgatct ggacaccaca gaggttccga gcactttagg ttgcaccaaa 6420
tgtcccacca ggtgcaggca gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa 6480
agtgagggcg ctgaggtcga gcagggtggt gtgacttgtt atagccttta gagctgcgaa 6540
agcgcgtatg gatttggctc atcaggccag attgagggtc tgtggacaca tgtcatgtta 6600
gtgtacttca atcgccccct ggatatagcc ccgacaatag gccgtggcct catttttttg 6660
ccttccgcac atttccattg ctcggtaccc acaccttgct tctcctgcac ttgccaacct 6720
taatactggt ttacattgac caacatctta caagcggggg gcttgtctag ggtatatata 6780
aacagtggct ctcccaatcg gttgccagtc tcttttttcc tttctttccc cacagattcg 6840
aaatctaaac tacacatcac agaattccga gccgtgagta tccacgacaa gatcagtgtc 6900
gagacgacgc gttttgtgta atgacacaat ccgaaagtcg ctagcaacac acactctcta 6960
cacaaactaa cccagctctg gtaccatggg gggaagttca catgcattcg ctggtgaatc 7020
tgatctgaca ctacaactac acaccaggtc caacatgagc gacaatacga caatcaaaaa 7080
gccgatccga cccaaaccga tccggacgga acgcctgcct tacgctgggg ccgcagaaat 7140
catccgagcc aaccagaaag accactactt tgagtccgtg cttgaacagc atctcgtcac 7200
gtttctgcag aaatggaagg gagtacgatt tatccaccag tacaaggagg agctggagac 7260
ggcgtccaag tttgcatatc tcggtttgtg tacgcttgtg ggctccaaga ctctcggaga 7320
agagtacacc aatctcatgt acactatcag agaccgaaca gctctaccgg gggtggtgag 7380
acggtttggc tacgtgcttt ccaacactct gtttccatac ctgtttgtgc gctacatggg 7440
caagttgcgc gccaaactga tgcgcgagta tccccatctg gtggagtacg acgaagatga 7500
gcctgtgccc agcccggaaa catggaagga gcgggtcatc aagacgtttg tgaacaagtt 7560
tgacaagttc acggcgctgg aggggtttac cgcgatccac ttggcgattt tctacgtcta 7620
cggctcgtac taccagctca gtaagcggat ctggggcatg cgttatgtat ttggacaccg 7680
actggacaag aatgagcctc gaatcggtta cgagatgctc ggtctgctga ttttcgcccg 7740
gtttgccacg tcatttgtgc agacgggaag agagtacctc ggagcgctgc tggaaaagag 7800
cgtggagaaa gaggcagggg agaaggaaga tgaaaaggaa gcggttgtgc cgaaaaagaa 7860
gtcgtcaatt ccgttcattg aggatacaga aggggagacg gaagacaaga tcgatctgga 7920
ggaccctcga cagctcaagt tcattcctga ggcgtccaga gcgtgcactc tgtgtctgtc 7980
atacattagt gcgccggcat gtacgccatg tggacacttt ttctgttggg actgtatttc 8040
cgaatgggtg agagagaagc ccgagtgtcc cttgtgtcgg cagggtgtga gagagcagaa 8100
cttgttgcct atcagataat gacgaggtct ggc 8133
<210>75
<211>35
<212>DNA
<213>人工序列
<220>
<223>引物PEX10-R-BsiWI
<400>75
gatcaacgta cgcttcagca gtaactgtat tgctc 35
<210>76
<211>35
<212>DNA
<213>人工序列
<220>
<223>引物PEX10-F1-SalI
<400>76
gatcaagtcg acattgtaac tagtcctgga gggtc 35
<210>77
<211>36
<212>DNA
<213>人工序列
<220>
<223>引物PEX10-F2-SalI
<400>77
gatcaagtcg acgtcttagc gtcatgtatt ctcaag 36
<210>78
<211>7277
<212>DNA
<213>人工序列
<220>
<223>质粒pEXP-MOD1
<400>78
catggatcca ggcctgttaa cggccattac ggcctgcagg atccgaaaaa acctcccaca 60
cctccccctg aacctgaaac ataaaatgaa tgcaattgtt gttgttaact tgtttattgc 120
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 180
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctgcgg 240
ccgcaagtgt ggatggggaa gtgagtgccc ggttctgtgt gcacaattgg caatccaaga 300
tggatggatt caacacaggg atatagcgag ctacgtggtg gtgcgaggat atagcaacgg 360
atatttatgt ttgacacttg agaatgtacg atacaagcac tgtccaagta caatactaaa 420
catactgtac atactcatac tcgtacccgg gcaacggttt cacttgagtg cagtggctag 480
tgctcttact cgtacagtgt gcaatactgc gtatcatagt ctttgatgta tatcgtattc 540
attcatgtta gttgcgtacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 600
gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 660
tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 720
cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 780
gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 840
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 900
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 960
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 1020
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 1080
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 1140
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 1200
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 1260
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 1320
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 1380
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 1440
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 1500
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 1560
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 1620
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 1680
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 1740
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 1800
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 1860
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 1920
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 1980
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 2040
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 2100
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 2160
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 2220
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 2280
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 2340
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 2400
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 2460
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 2520
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 2580
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 2640
cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 2700
acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc 2760
ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct 2820
ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat 2880
ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 2940
acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc 3000
tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg 3060
atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat ttccattcgc 3120
cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc 3180
agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc 3240
agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat 3300
tgggtaccgg gccccccctc gaggtcgatg gtgtcgataa gcttgatatc gaattcatgt 3360
cacacaaacc gatcttcgcc tcaaggaaac ctaattctac atccgagaga ctgccgagat 3420
ccagtctaca ctgattaatt ttcgggccaa taatttaaaa aaatcgtgtt atataatatt 3480
atatgtatta tatatataca tcatgatgat actgacagtc atgtcccatt gctaaataga 3540
cagactccat ctgccgcctc caactgatgt tctcaatatt taaggggtca tctcgcattg 3600
tttaataata aacagactcc atctaccgcc tccaaatgat gttctcaaaa tatattgtat 3660
gaacttattt ttattactta gtattattag acaacttact tgctttatga aaaacacttc 3720
ctatttagga aacaatttat aatggcagtt cgttcattta acaatttatg tagaataaat 3780
gttataaatg cgtatgggaa atcttaaata tggatagcat aaatgatatc tgcattgcct 3840
aattcgaaat caacagcaac gaaaaaaatc ccttgtacaa cataaatagt catcgagaaa 3900
tatcaactat caaagaacag ctattcacac gttactattg agattattat tggacgagaa 3960
tcacacactc aactgtcttt ctctcttcta gaaatacagg tacaagtatg tactattctc 4020
attgttcata cttctagtca tttcatccca catattcctt ggatttctct ccaatgaatg 4080
acattctatc ttgcaaattc aacaattata ataagatata ccaaagtagc ggtatagtgg 4140
caatcaaaaa gcttctctgg tgtgcttctc gtatttattt ttattctaat gatccattaa 4200
aggtatatat ttatttcttg ttatataatc cttttgttta ttacatgggc tggatacata 4260
aaggtatttt gatttaattt tttgcttaaa ttcaatcccc cctcgttcag tgtcaactgt 4320
aatggtagga aattaccata cttttgaaga agcaaaaaaa atgaaagaaa aaaaaaatcg 4380
tatttccagg ttagacgttc cgcagaatct agaatgcggt atgcggtaca ttgttcttcg 4440
aacgtaaaag ttgcgctccc tgagatattg tacatttttg cttttacaag tacaagtaca 4500
tcgtacaact atgtactact gttgatgcat ccacaacagt ttgttttgtt tttttttgtt 4560
tttttttttt ctaatgattc attaccgcta tgtataccta cttgtacttg tagtaagccg 4620
ggttattggc gttcaattaa tcatagactt atgaatctgc acggtgtgcg ctgcgagtta 4680
cttttagctt atgcatgcta cttgggtgta atattgggat ctgttcggaa atcaacggat 4740
gctcaatcga tttcgacagt aattaattaa gtcatacaca agtcagcttt cttcgagcct 4800
catataagta taagtagttc aacgtattag cactgtaccc agcatctccg tatcgagaaa 4860
cacaacaaca tgccccattg gacagatcat gcggatacac aggttgtgca gtatcataca 4920
tactcgatca gacaggtcgt ctgaccatca tacaagctga acaagcgctc catacttgca 4980
cgctctctat atacacagtt aaattacata tccatagtct aacctctaac agttaatctt 5040
ctggtaagcc tcccagccag ccttctggta tcgcttggcc tcctcaatag gatctcggtt 5100
ctggccgtac agacctcggc cgacaattat gatatccgtt ccggtagaca tgacatcctc 5160
aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca ccccgggggt 5220
cagaataagc cagtcctcag agtcgccctt aggtcggttc tgggcaatga agccaaccac 5280
aaactcgggg tcggatcggg caagctcaat ggtctgcttg gagtactcgc cagtggccag 5340
agagcccttg caagacagct cggccagcat gagcagacct ctggccagct tctcgttggg 5400
agaggggact aggaactcct tgtactggga gttctcgtag tcagagacgt cctccttctt 5460
ctgttcagag acagtttcct cggcaccagc tcgcaggcca gcaatgattc cggttccggg 5520
tacaccgtgg gcgttggtga tatcggacca ctcggcgatt cggtgacacc ggtactggtg 5580
cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga aaccgtgctt 5640
aagagcaagt tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt caatgatgtc 5700
gatatgggtt ttgatcatgc acacataagg tccgacctta tcggcaagct caatgagctc 5760
cttggtggtg gtaacatcca gagaagcaca caggttggtt ttcttggctg ccacgagctt 5820
gagcactcga gcggcaaagg cggacttgtg gacgttagct cgagcttcgt aggagggcat 5880
tttggtggtg aagaggagac tgaaataaat ttagtctgca gaacttttta tcggaacctt 5940
atctggggca gtgaagtata tgttatggta atagttacga gttagttgaa cttatagata 6000
gactggacta tacggctatc ggtccaaatt agaaagaacg tcaatggctc tctgggcgtc 6060
gcctttgccg acaaaaatgt gatcatgatg aaagccagca atgacgttgc agctgatatt 6120
gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca acgaagaatg 6180
tatcgtcaaa gtgatccaag cacactcata gttggagtcg tactccaaag gcggcaatga 6240
cgagtcagac agatactcgt cgaccgtacg gggagtttgg cgcccgtttt ttcgagcccc 6300
acacgtttcg gtgagtatga gcggcggcag attcgagcgt ttccggtttc cgcggctgga 6360
cgagagccca tgatgggggc tcccaccacc agcaatcagg gccctgatta cacacccacc 6420
tgtaatgtca tgctgttcat cgatggttaa tgctgctgtg tgctgtgtgt gtgtgttgtt 6480
tggcgctcat tgttgcgtta tgcagcgtac accacaatat tggaagctta ttagcctttc 6540
tattttttcg tttgcaaggc ttaacaacat tgctgtggag agggatgggg atatggaggc 6600
cgctggaggg agtcggagag gcgttttgga gcggcttggc ctggcgccca gctcgcgaaa 6660
cgcacctagg accctttggc acgccgaaat gtgccacttt tcagtctagt aacgccttac 6720
ctacgtcatt ccatgcgtgc atgtttgcgc cttttttccc ttgcccttga tcgccacaca 6780
gtacagtgca ctgtacagtg gaggttttgg gggggtctta gatgggagct aaaagcggcc 6840
tagcggtaca ctagtgggat tgtatggagt ggcatggagc ctaggtggag cctgacagga 6900
cgcacgaccg gctagcccgt gacagacgat gggtggctcc tgttgtccac cgcgtacaaa 6960
tgtttgggcc aaagtcttgt cagccttgct tgcgaaccta attcccaatt ttgtcacttc 7020
gcacccccat tgatcgagcc ctaacccctg cccatcaggc aatccaatta agctcgcatt 7080
gtctgccttg tttagtttgg ctcctgcccg tttcggcgtc cacttgcaca aacacaaaca 7140
agcattatat ataaggctcg tctctccctc ccaaccacac tcactttttt gcccgtcttc 7200
ccttgctaac acaaaagtca agaacacaaa caaccacccc aaccccctta cacacaagac 7260
atatctacag caatggc 7277
<210>79
<211>7559
<212>DNA
<213>人工序列
<220>
<223>质粒pPEX10-1
<400>79
gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 60
ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 120
taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 180
tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 240
aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 300
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 360
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 420
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 480
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 540
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 600
tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 660
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 720
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 780
tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 840
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 900
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 960
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 1020
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 1080
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 1140
tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 1200
acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 1260
tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 1320
ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 1380
agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 1440
tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 1500
acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 1560
agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 1620
actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 1680
tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 1740
gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 1800
ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 1860
tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 1920
aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 1980
tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 2040
tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 2100
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 2160
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 2220
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 2280
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 2340
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 2400
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 2460
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 2520
aacgcgaatt ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca 2580
actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg 2640
gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta 2700
aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc 2760
ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct 2820
tcgcctcaag gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat 2880
taattttcgg gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat 2940
atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc 3000
gcctccaact gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag 3060
actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt 3120
acttagtatt attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa 3180
tttataatgg cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat 3240
gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca 3300
gcaacgaaaa aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag 3360
aacagctatt cacacgttac tattgagatt attattggac gagaatcaca cactcaactg 3420
tctttctctc ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct 3480
agtcatttca tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca 3540
aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc 3600
tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt 3660
tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt attttgattt 3720
aattttttgc ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta 3780
ccatactttt gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga 3840
cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg 3900
ctccctgaga tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta 3960
ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat 4020
gattcattac cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca 4080
attaatcata gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca 4140
tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg 4200
acagtaatta attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt 4260
agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc 4320
cattggacag atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag 4380
gtcgtctgac catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca 4440
cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca 4500
gccagccttc tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc 4560
tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg 4620
ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc 4680
ctcagagtcg cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga 4740
tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga 4800
cagctcggcc agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa 4860
ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt 4920
ttcctcggca ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt 4980
ggtgatatcg gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc 5040
aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt 5100
gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat 5160
catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac 5220
atccagagaa gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc 5280
aaaggcggac ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag 5340
gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa 5400
gtatatgtta tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg 5460
ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa 5520
aatgtgatca tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg 5580
cgccgaaaac gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat 5640
ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata 5700
ctcgtcgaca ttgtaactag tcctggaggg tcttttttat ggataacctc catgtacgat 5760
gtatccaaga tctccacgta ctgtgttctg tttcctaagt aatacccaac aacctctcca 5820
acaaacactt gggaagatgc acttgtgctg agatgtcaag atgttagtac tgtactggat 5880
ggagagaata ttaataaata attgttaccc aactacatct tgtcgattga aagagatacc 5940
cctaagacag ataggatatc tgcaacccga ggaatgaacc ccccagcacc ggcacccttt 6000
ctattaacaa aatgccaact gaaatttgaa aagttcaact aaacttattt gacccacaaa 6060
aactcgtcaa aagtggcggc gaaagctggc aaatgatgac atccccttgg aactatgata 6120
tcccctcgga atcttcgtcc ccatttgcca catctacttg caacgccacg tctgcttact 6180
aagcaaccca aatctgcctc ggctcaaaat gtggggaagt tcacatgcat tcgctggtga 6240
atctgatctg acactacaac tacacaccag gtccaacatg agcgacaata cgacaatcaa 6300
aaagccgatc cgacccaaac cgatccggac ggaacgcctg ccttacgctg gggccgcaga 6360
aatcatccga gccaaccaga aagaccacta ctttgagtcc gtgcttgaac agcatctcgt 6420
cacgtttctg cagaaatgga agggagtacg atttatccac cagtacaagg aggagctgga 6480
gacggcgtcc aagtttgcat atctcggttt gtgtacgctt gtgggctcca agactctcgg 6540
agaagagtac accaatctca tgtacactat cagagaccga acagctctac cgggggtggt 6600
gagacggttt ggctacgtgc tttccaacac tctgtttcca tacctgtttg tgcgctacat 6660
gggcaagttg cgcgccaaac tgatgcgcga gtatccccat ctggtggagt acgacgaaga 6720
tgagcctgtg cccagcccgg aaacatggaa ggagcgggtc atcaagacgt ttgtgaacaa 6780
gtttgacaag ttcacggcgc tggaggggtt taccgcgatc cacttggcga ttttctacgt 6840
ctacggctcg tactaccagc tcagtaagcg gatctggggc atgcgttatg tatttggaca 6900
ccgactggac aagaatgagc ctcgaatcgg ttacgagatg ctcggtctgc tgattttcgc 6960
ccggtttgcc acgtcatttg tgcagacggg aagagagtac ctcggagcgc tgctggaaaa 7020
gagcgtggag aaagaggcag gggagaagga agatgaaaag gaagcggttg tgccgaaaaa 7080
gaagtcgtca attccgttca ttgaggatac agaaggggag acggaagaca agatcgatct 7140
ggaggaccct cgacagctca agttcattcc tgaggcgtcc agagcgtgca ctctgtgtct 7200
gtcatacatt agtgcgccgg catgtacgcc atgtggacac tttttctgtt gggactgtat 7260
ttccgaatgg gtgagagaga agcccgagtg tcccttgtgt cggcagggtg tgagagagca 7320
gaacttgttg cctatcagat aatgacgagg tctggatgga aggactagtc agcgagacac 7380
agagcatcag ggaccagaca cgaccaattc aatcgacaac actgtgctgc atagcagtgc 7440
acagaggtcc tgggcatgaa tatattttag cattggagat atgagtggta gagcgtatac 7500
agtattaatt gtggaggtat ctcgtcgcat tgatagagca atacagttac tgctgaagc 7559
<210>80
<211>8051
<212>DNA
<213>人工序列
<220>
<223>质粒pPEX10-2
<400>80
gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 60
ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 120
taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 180
tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 240
aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 300
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 360
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 420
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 480
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 540
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 600
tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 660
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 720
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 780
tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 840
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 900
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 960
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 1020
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 1080
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 1140
tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 1200
acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 1260
tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 1320
ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 1380
agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 1440
tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 1500
acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 1560
agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 1620
actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 1680
tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 1740
gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 1800
ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 1860
tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 1920
aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 1980
tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 2040
tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 2100
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 2160
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 2220
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 2280
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 2340
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 2400
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 2460
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 2520
aacgcgaatt ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca 2580
actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg 2640
gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta 2700
aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc 2760
ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct 2820
tcgcctcaag gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat 2880
taattttcgg gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat 2940
atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc 3000
gcctccaact gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag 3060
actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt 3120
acttagtatt attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa 3180
tttataatgg cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat 3240
gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca 3300
gcaacgaaaa aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag 3360
aacagctatt cacacgttac tattgagatt attattggac gagaatcaca cactcaactg 3420
tctttctctc ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct 3480
agtcatttca tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca 3540
aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc 3600
tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt 3660
tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt attttgattt 3720
aattttttgc ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta 3780
ccatactttt gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga 3840
cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg 3900
ctccctgaga tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta 3960
ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat 4020
gattcattac cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca 4080
attaatcata gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca 4140
tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg 4200
acagtaatta attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt 4260
agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc 4320
cattggacag atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag 4380
gtcgtctgac catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca 4440
cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca 4500
gccagccttc tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc 4560
tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg 4620
ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc 4680
ctcagagtcg cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga 4740
tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga 4800
cagctcggcc agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa 4860
ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt 4920
ttcctcggca ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt 4980
ggtgatatcg gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc 5040
aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt 5100
gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat 5160
catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac 5220
atccagagaa gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc 5280
aaaggcggac ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag 5340
gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa 5400
gtatatgtta tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg 5460
ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa 5520
aatgtgatca tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg 5580
cgccgaaaac gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat 5640
ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata 5700
ctcgtcgacg tcttagcgtc atgtattctc aagcttagtc agagagaagg actatggagg 5760
agaaggggag aattgagaag ggtatttgaa gggactttga aggtcgcgtg gaagaggtac 5820
ttgaagaggt atttgaaggt cacgtggaag aggtatttga agatcacgtg gaagaagtac 5880
ttgttttaca gagaatatcg gggtgatttt gacagtggga ttgtctccca agtcctaatc 5940
gtttgacatg ggagcagtga aaagtcgggc taaaaaaggg aatatcggaa atcggaaaga 6000
cggaaagaat tactggactc atgtttagta gatctgagca cttcaaattt gaaaatatct 6060
cttcaaacag cagatcggtt ggtcgtggag gtaccatcaa gggtaaaatc aaggctatca 6120
tcaagggcca tatatcgcaa gtttggggga agataatatg ttcatagtga atcagggttg 6180
tggatttcct catctaacgg cattgtaact agtcctggag ggtctttttt atggataacc 6240
tccatgtacg atgtatccaa gatctccacg tactgtgttc tgtttcctaa gtaataccca 6300
acaacctctc caacaaacac ttgggaagat gcacttgtgc tgagatgtca agatgttagt 6360
actgtactgg atggagagaa tattaataaa taattgttac ccaactacat cttgtcgatt 6420
gaaagagata cccctaagac agataggata tctgcaaccc gaggaatgaa ccccccagca 6480
ccggcaccct ttctattaac aaaatgccaa ctgaaatttg aaaagttcaa ctaaacttat 6540
ttgacccaca aaaactcgtc aaaagtggcg gcgaaagctg gcaaatgatg acatcccctt 6600
ggaactatga tatcccctcg gaatcttcgt ccccatttgc cacatctact tgcaacgcca 6660
cgtctgctta ctaagcaacc caaatctgcc tcggctcaaa atgtggggaa gttcacatgc 6720
attcgctggt gaatctgatc tgacactaca actacacacc aggtccaaca tgagcgacaa 6780
tacgacaatc aaaaagccga tccgacccaa accgatccgg acggaacgcc tgccttacgc 6840
tggggccgca gaaatcatcc gagccaacca gaaagaccac tactttgagt ccgtgcttga 6900
acagcatctc gtcacgtttc tgcagaaatg gaagggagta cgatttatcc accagtacaa 6960
ggaggagctg gagacggcgt ccaagtttgc atatctcggt ttgtgtacgc ttgtgggctc 7020
caagactctc ggagaagagt acaccaatct catgtacact atcagagacc gaacagctct 7080
accgggggtg gtgagacggt ttggctacgt gctttccaac actctgtttc catacctgtt 7140
tgtgcgctac atgggcaagt tgcgcgccaa actgatgcgc gagtatcccc atctggtgga 7200
gtacgacgaa gatgagcctg tgcccagccc ggaaacatgg aaggagcggg tcatcaagac 7260
gtttgtgaac aagtttgaca agttcacggc gctggagggg tttaccgcga tccacttggc 7320
gattttctac gtctacggct cgtactacca gctcagtaag cggatctggg gcatgcgtta 7380
tgtatttgga caccgactgg acaagaatga gcctcgaatc ggttacgaga tgctcggtct 7440
gctgattttc gcccggtttg ccacgtcatt tgtgcagacg ggaagagagt acctcggagc 7500
gctgctggaa aagagcgtgg agaaagaggc aggggagaag gaagatgaaa aggaagcggt 7560
tgtgccgaaa aagaagtcgt caattccgtt cattgaggat acagaagggg agacggaaga 7620
caagatcgat ctggaggacc ctcgacagct caagttcatt cctgaggcgt ccagagcgtg 7680
cactctgtgt ctgtcataca ttagtgcgcc ggcatgtacg ccatgtggac actttttctg 7740
ttgggactgt atttccgaat gggtgagaga gaagcccgag tgtcccttgt gtcggcaggg 7800
tgtgagagag cagaacttgt tgcctatcag ataatgacga ggtctggatg gaaggactag 7860
tcagcgagac acagagcatc agggaccaga cacgaccaat tcaatcgaca acactgtgct 7920
gcatagcagt gcacagaggt cctgggcatg aatatatttt agcattggag atatgagtgg 7980
tagagcgtat acagtattaa ttgtggaggt atctcgtcgc attgatagag caatacagtt 8040
actgctgaag c 8051
<210>81
<211>15877
<212>DNA
<213>人工序列
<220>
<223>质粒pZKL1-2SP98C
<400>81
aaatgatgtc gacgcagtag gatgtcctgc acgggtcttt ttgtggggtg tggagaaagg 60
ggtgcttgga tcgatggaag ccggtagaac cgggctgctt gtgcttggag atggaagccg 120
gtagaaccgg gctgcttggg gggatttggg gccgctgggc tccaaagagg ggtaggcatt 180
tcgttggggt tacgtaattg cggcatttgg gtcctgcgcg catgtcccat tggtcagaat 240
tagtccggat aggagactta tcagccaatc acagcgccgg atccacctgt aggttgggtt 300
gggtgggagc acccctccac agagtagagt caaacagcag cagcaacatg atagttgggg 360
gtgtgcgtgt taaaggaaaa aaaagaagct tgggttatat tcccgctcta tttagaggtt 420
gcgggataga cgccgacgga gggcaatggc gctatggaac cttgcggata tccatacgcc 480
gcggcggact gcgtccgaac cagctccagc agcgtttttt ccgggccatt gagccgactg 540
cgaccccgcc aacgtgtctt ggcccacgca ctcatgtcat gttggtgttg ggaggccact 600
ttttaagtag cacaaggcac ctagctcgca gcaaggtgtc cgaaccaaag aagcggctgc 660
agtggtgcaa acggggcgga aacggcggga aaaagccacg ggggcacgaa ttgaggcacg 720
ccctcgaatt tgagacgagt cacggcccca ttcgcccgcg caatggctcg ccaacgcccg 780
gtcttttgca ccacatcagg ttaccccaag ccaaaccttt gtgttaaaaa gcttaacata 840
ttataccgaa cgtaggtttg ggcgggcttg ctccgtctgt ccaaggcaac atttatataa 900
gggtctgcat cgccggctca attgaatctt ttttcttctt ctcttctcta tattcattct 960
tgaattaaac acacatcaac catgggcgta ttcattaaac aggagcagct tccggctctc 1020
aagaagtaca agtactccgc cgaggatcac tcgttcatct ccaacaacat tctgcgcccc 1080
ttctggcgac agtttgtcaa aatcttccct ctgtggatgg cccccaacat ggtgactctg 1140
ctgggcttct tctttgtcat tgtgaacttc atcaccatgc tcattgttga tcccacccac 1200
gaccgcgagc ctcccagatg ggtctacctc acctacgctc tgggtctgtt cctttaccag 1260
acatttgatg cctgtgacgg atcccatgcc cgacgaactg gccagagtgg accccttgga 1320
gagctgtttg accactgtgt cgacgccatg aatacctctc tgattctcac ggtggtggtg 1380
tccaccaccc atatgggata taacatgaag ctactgattg tgcagattgc cgctctcgga 1440
aacttctacc tgtcgacctg ggagacctac cataccggaa ctctgtacct ttctggcttc 1500
tctggtcctg ttgaaggtat cttgattctg gtggctcttt tcgtcctcac cttcttcact 1560
ggtcccaacg tgtacgctct gaccgtctac gaggctcttc ccgagtccat cacttcgctg 1620
ctgcctgcca gcttcctgga cgtcaccatc acccagatct acattggatt cggagtgctg 1680
ggcatggtgt tcaacatcta cggcgcctgc ggaaacgtga tcaagtacta caacaacaag 1740
ggcaagagcg ctctccccgc cattctcgga atcgccccct ttggcatctt ctacgtcggc 1800
gtctttgcct gggcccatgt tgctcctctg cttctctcca agtacgccat cgtctatctg 1860
tttgccattg gggctgcctt tgccatgcaa gtcggccaga tgattcttgc ccatctcgtg 1920
cttgctccct ttccccactg gaacgtgctg ctcttcttcc cctttgtggg actggcagtg 1980
cactacattg cacccgtgtt tggctgggac gccgatatcg tgtcggttaa cactctcttc 2040
acctgttttg gcgccaccct ctccatttac gccttctttg tgcttgagat catcgacgag 2100
atcaccaact acctcgatat ctggtgtctg cgaatcaagt accctcagga gaagaagacc 2160
gaataagcgg ccgcatggag cgtgtgttct gagtcgatgt tttctatgga gttgtgagtg 2220
ttagtagaca tgatgggttt atatatgatg aatgaataga tgtgattttg atttgcacga 2280
tggaattgag aactttgtaa acgtacatgg gaatgtatga atgtgggggt tttgtgactg 2340
gataactgac ggtcagtgga cgccgttgtt caaatatcca agagatgcga gaaactttgg 2400
gtcaagtgaa catgtcctct ctgttcaagt aaaccatcaa ctatgggtag tatatttagt 2460
aaggacaaga gttgagattc tttggagtcc tagaaacgta ttttcgcgtt ccaagatcaa 2520
attagtagag taatacgggc acgggaatcc attcatagtc tcaatcctgc aggtgagtta 2580
attaatcgag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 2640
tcacaattcc acacaacgta cgatagttag tagacaacaa tcagaacatc tccctcctta 2700
tataatcaca caggccagaa cgcgctaaac taaagcgctt tggacactat gttacattgg 2760
cattgattga actgaaacca cagtctccct cgcctgaatc gagcaatgga tgttgtcgga 2820
agtcaacttc actagaagag cggttctatg ccttgtcaag atcatatcat aaactcactc 2880
tgtattaccc catctataga acacttgtta tgaatgggcg gaaacattcc gctatatgca 2940
cctttccaca ctaatgcaaa gatgtgcatc ttcaacgggt agtaagactg gttccgactt 3000
ccgttgcatg gagagcaatg acctcgataa tgcgaacatc ccccacatat acactcttac 3060
acaggccaat ataatctgtg catttactaa atatttaagt ctatgcacct gcttgatgaa 3120
aagcggcacg gatggtatca tctagtttcc gccaatccaa gaaccaactg tgttggcagt 3180
ggtgtagccc atggcacaca gaccaaagat gaaaatacag acatcggcgg ttcgagccgt 3240
ggtgcctcga gcaacaccct tgtaatgcaa aagaggaggg taaatgtaca ccagaggcac 3300
acatgcaaac gatccggtga gagcgacgaa ccgatcgaga tcgtcggcac ctccccatgc 3360
aacaaaggcg gtgacaaaca caaggaagaa ccggaaaatg ttcttctgcc acttgatggt 3420
agagttgtac ttgcctgatc gggtgaagag accattctcg atgattcgga tggcgcgcca 3480
gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc 3540
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 3600
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 3660
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 3720
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 3780
aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 3840
tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 3900
ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 3960
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 4020
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 4080
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 4140
ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt 4200
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 4260
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 4320
cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 4380
gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 4440
aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 4500
acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta 4560
gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga 4620
cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg 4680
cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc 4740
tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat 4800
cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag 4860
gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat 4920
cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa 4980
ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa 5040
gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga 5100
taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg 5160
gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc 5220
acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg 5280
aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact 5340
cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat 5400
atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt 5460
gccacctgat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcagga 5520
aattgtaagc gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt 5580
ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat agaccgagat 5640
agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg tggactccaa 5700
cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac catcacccta 5760
atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta aagggagccc 5820
ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc 5880
gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg taaccaccac 5940
acccgccgcg cttaatgcgc cgctacaggg cgcgtccatt cgccattcag gctgcgcaac 6000
tgttgggaag ggcgatcggt gcgggcctct tcgctattac gccagctggc gaaaggggga 6060
tgtgctgcaa ggcgattaag ttgggtaacg ccagggtttt cccagtcacg acgttgtaaa 6120
acgacggcca gtgaattgta atacgactca ctatagggcg aattgggccc gacgtcgcat 6180
gcttagaagt gaggattaca agaagcctct ggatatcaat gatgaacgta ctcagcggct 6240
ggtcaagcat ttcgaccgtc gaatcgacga ggtgttcacc tttgacaagc gagggttccc 6300
aattgatcac gttctcgagt tgttcaaatc ttctctcaac atctctctgc atgaactatc 6360
tctgttgacg aacgtgtcac ccactgttcc tcgaacgccc ttctccgagt ttggtctgaa 6420
catcttcgat ctcaaactga cccccgcagt gatcaatagt gccatgccac tgccgatgcg 6480
gtgcgaacat ccctggaggg attctcggag ctctacacaa tgcagattct gtcgtcgagt 6540
actctctacc ttgctcgaat gacttattgt gctactactg cactcatgct tcgatcatgt 6600
gccctactgc accccaaatt tggtgatctg attgagacag agtaccctct tcagctgatt 6660
cagaagatca tcagcaacat gaatgatgtg gttgaccagg caggctgttg tagtcacgtc 6720
cttcacttca agttcattct tcatctgctt ctgttttact ttgacaggca aatgaagaca 6780
tggtacgact tgatggaggc caagaacgcc atttcacccc gagacaccga agtgcctgaa 6840
atcctggctg cccccattga taacatcgga aactacggta ttccggaaag tgtatataga 6900
acctttcccc agcttgtgtc tgtggatatg gatggtgtaa tccccttaat taactcacct 6960
gcaggattga gactatgaat ggattcccgt gcccgtatta ctctactaat ttgatcttgg 7020
aacgcgaaaa tacgtttcta ggactccaaa gaatctcaac tcttgtcctt actaaatata 7080
ctacccatag ttgatggttt acttgaacag agaggacatg ttcacttgac ccaaagtttc 7140
tcgcatctct tggatatttg aacaacggcg tccactgacc gtcagttatc cagtcacaaa 7200
acccccacat tcatacattc ccatgtacgt ttacaaagtt ctcaattcca tcgtgcaaat 7260
caaaatcaca tctattcatt catcatatat aaacccatca tgtctactaa cactcacaac 7320
tccatagaaa acatcgactc agaacacacg ctccatgcgg ccgcttaggc aacgggcttg 7380
atgacagcgg gaggagtgcc cacattgttt cggtttcgaa agaacaggac acccttgcca 7440
gctccctcgg caccagcgga gggttcaacc cactggcaca ttcgtgcaga tcggtacatg 7500
gctcgaatga atcctcgagg accgtcctgg acatcagctc gatagtgctt gcccatgata 7560
ggtttgatgg cctcggtagc ttcgtccgca ttgtagaagg gaatggaaga gacgtagtga 7620
tgcaggacgt gagtctcgat aatgccgtgg agcagatgac gtccaatgaa gcccatctct 7680
cggtcgatgg ttgcagcggc acctcgcaca aagttccact cgtcgttggt gtagtgggga 7740
agagtaggat ctgtgtgctg cagaaaggta atggcgacga gccagtggtt aacccacaag 7800
tagggaacga agtaccagat ggccatgttg tagaatccga acttctgaac gagaaagtac 7860
agagcggtgg ccataagacc aatgccaatg tcggagagca cgatgagctt ggcgtcgctg 7920
ttctcgtaca gaggagatcg gggatcgaaa tggttaactc caccgccaag accgttgtgc 7980
tttcccttgc ctcgaccctc tcgctgccgc tcatggtagt tgtgtccagt aacgttggta 8040
atgagatagt tgggccaacc gaccagttgc tgaagcacaa gcatgagcag ggtgaaagca 8100
ggagtttcct cggtaagatg ggcgagttcg tgggtcatct tgccgagtcg agtagcttgc 8160
tgctctcggg ttcgaggaac gaagaccatg tctcgctcca tgtttccagt ggccttgtga 8220
tgcttccggt gggagatttg ccagctgaag tagggaacaa gcagggaaga gtgaagcacc 8280
cagccagtaa tgtcgttgat gattcgggaa tcggagaaag caccatgtcc acactcgtgg 8340
gcaatgaccc acagtccagt accgaagagt ccctgaagaa cggtgtacac agcccacaga 8400
ccggctcgag caggagtgga gggaatgtac tcgggtgtca caaagttgta ccagatgctg 8460
aaagtggtag tcaggaggac aatgtctcga agaatgtagc cgtatccctt gagagcagat 8520
cgcttgaagc agtgcttggg aatagcgttg tagatgtcct tgatggtgaa gtcgggaact 8580
tcgaactggt tgccgtaggt atccagcatg acaccgtact cggacttggg cttggcaatg 8640
tccacctcgg acatggaaga cagcgatgta gaggaggccg agtgtctggg agaatcggag 8700
ggagagacgg cagcagactc cgagtcggtc acagtggtgg aagtgacggt tcgtcggagg 8760
gcagggttct gcttgggcag agccgaggtg gaggccatgg ccattgctgt agatatgtct 8820
tgtgtgtaag ggggttgggg tggttgtttg tgttcttgac ttttgtgtta gcaagggaag 8880
acgggcaaaa aagtgagtgt ggttgggagg gagagacgag ccttatatat aatgcttgtt 8940
tgtgtttgtg caagtggacg ccgaaacggg caggagccaa actaaacaag gcagacaatg 9000
cgagcttaat tggattgcct gatgggcagg ggttagggct cgatcaatgg gggtgcgaag 9060
tgacaaaatt gggaattagg ttcgcaagca aggctgacaa gactttggcc caaacatttg 9120
tacgcggtgg acaacaggag ccacccatcg tctgtcacgg gctagccggt cgtgcgtcct 9180
gtcaggctcc acctaggctc catgccactc catacaatcc cactagtgta ccgctaggcc 9240
gcttttagct cccatctaag acccccccaa aacctccact gtacagtgca ctgtactgtg 9300
tggcgatcaa gggcaaggga aaaaaggcgc aaacatgcac gcatggaatg acgtaggtaa 9360
ggcgttacta gactgaaaag tggcacattt cggcgtgcca aagggtccta ggtgcgtttc 9420
gcgagctggg cgccaggcca agccgctcca aaacgcctct ccgactccct ccagcggcct 9480
ccatatcccc atccctctcc acagcaatgt tgttaagcct tgcaaacgaa aaaatagaaa 9540
ggctaataag cttccaatat tgtggtgtac gctgcataac gcaacaatga gcgccaaaca 9600
acacacacac acagcacaca gcagcattaa ccacgatgaa cagcatgaat tcctttacct 9660
gcaggataac ttcgtataat gtatgctata cgaagttatg atctctctct tgagcttttc 9720
cataacaagt tcttctgcct ccaggaagtc catgggtggt ttgatcatgg ttttggtgta 9780
gtggtagtgc agtggtggta ttgtgactgg ggatgtagtt gagaataagt catacacaag 9840
tcagctttct tcgagcctca tataagtata agtagttcaa cgtattagca ctgtacccag 9900
catctccgta tcgagaaaca caacaacatg ccccattgga cagatcatgc ggatacacag 9960
gttgtgcagt atcatacata ctcgatcaga caggtcgtct gaccatcata caagctgaac 10020
aagcgctcca tacttgcacg ctctctatat acacagttaa attacatatc catagtctaa 10080
cctctaacag ttaatcttct ggtaagcctc ccagccagcc ttctggtatc gcttggcctc 10140
ctcaatagga tctcggttct ggccgtacag acctcggccg acaattatga tatccgttcc 10200
ggtagacatg acatcctcaa cagttcggta ctgctgtccg agagcgtctc ccttgtcgtc 10260
aagacccacc ccgggggtca gaataagcca gtcctcagag tcgcccttag gtcggttctg 10320
ggcaatgaag ccaaccacaa actcggggtc ggatcgggca agctcaatgg tctgcttgga 10380
gtactcgcca gtggccagag agcccttgca agacagctcg gccagcatga gcagacctct 10440
ggccagcttc tcgttgggag aggggactag gaactccttg tactgggagt tctcgtagtc 10500
agagacgtcc tccttcttct gttcagagac agtttcctcg gcaccagctc gcaggccagc 10560
aatgattccg gttccgggta caccgtgggc gttggtgata tcggaccact cggcgattcg 10620
gtgacaccgg tactggtgct tgacagtgtt gccaatatct gcgaactttc tgtcctcgaa 10680
caggaagaaa ccgtgcttaa gagcaagttc cttgaggggg agcacagtgc cggcgtaggt 10740
gaagtcgtca atgatgtcga tatgggtttt gatcatgcac acataaggtc cgaccttatc 10800
ggcaagctca atgagctcct tggtggtggt aacatccaga gaagcacaca ggttggtttt 10860
cttggctgcc acgagcttga gcactcgagc ggcaaaggcg gacttgtgga cgttagctcg 10920
agcttcgtag gagggcattt tggtggtgaa gaggagactg aaataaattt agtctgcaga 10980
actttttatc ggaaccttat ctggggcagt gaagtatatg ttatggtaat agttacgagt 11040
tagttgaact tatagataga ctggactata cggctatcgg tccaaattag aaagaacgtc 11100
aatggctctc tgggcgtcgc ctttgccgac aaaaatgtga tcatgatgaa agccagcaat 11160
gacgttgcag ctgatattgt tgtcggccaa ccgcgccgaa aacgcagctg tcagacccac 11220
agcctccaac gaagaatgta tcgtcaaagt gatccaagca cactcatagt tggagtcgta 11280
ctccaaaggc ggcaatgacg agtcagacag atactcgtcg acgcgataac ttcgtataat 11340
gtatgctata cgaagttatc gtacgatagt tagtagacaa caatcgatcg aggaagagga 11400
caagcggctg cttcttaagt ttgtgacatc agtatccaag gcaccattgc aaggattcaa 11460
ggctttgaac ccgtcatttg ccattcgtaa cgctggtaga caggttgatc ggttccctac 11520
ggcctccacc tgtgtcaatc ttctcaagct gcctgactat caggacattg atcaacttcg 11580
gaagaaactt ttgtatgcca ttcgatcaca tgctggtttc gatttgtctt agaggaacgc 11640
atatacagta atcatagaga ataaacgata ttcatttatt aaagtagata gttgaggtag 11700
aagttgtaaa gagtgataaa tagcggccgc tcactgaatc tttttggctc ccttgtgctt 11760
tcggacgatg taggtctgca cgtagaagtt gaggaacaga cacaggacag taccaacgta 11820
gaagtagttg aaaaaccagc caaacattct cattccatct tgtcggtagc agggaatgtt 11880
ccggtacttc cagacgatgt agaagccaac gttgaactga atgatctgca tagaagtaat 11940
cagggacttg ggcataggga acttgagctt gatcagtcgg gtccaatagt agccgtacat 12000
gatccagtga atgaagccgt tgagcagcac aaagatccaa acggcttcgt ttcggtagtt 12060
gtagaacagc cacatgtcca taggagctcc gagatggtga aagaactgca accaggtcag 12120
aggcttgccc atgaggggca gatagaagga gtcaatgtac tcgaggaact tgctgaggta 12180
gaacagctga gtggtgattc ggaagacatt gttgtcgaaa gccttctcgc agttgtcgga 12240
catgacacca atggtgtaca tggcgtaggc catagagagg aaggagccca gcgagtagat 12300
ggacatgagc aggttgtagt tggtgaacac aaacttcatt cgagactgac ccttgggtcc 12360
gagaggacca agggtgaact tcaggatgac gaaggcgatg gagaggtaca gcacctcgca 12420
gtgcgaggca tcagaccaga gctgagcata gtcgaccttg ggaagaacct cctggccaat 12480
ggagacgatt tcgttcacga cctccatggt tgtgaattag ggtggtgaga atggttggtt 12540
gtagggaaga atcaaaggcc ggtctcggga tccgtgggta tatatatata tatatatata 12600
tacgatcctt cgttacctcc ctgttctcaa aactgtggtt tttcgttttt cgttttttgc 12660
tttttttgat ttttttaggg ccaactaagc ttccagattt cgctaatcac ctttgtacta 12720
attacaagaa aggaagaagc tgattagagt tgggcttttt atgcaactgt gctactcctt 12780
atctctgata tgaaagtgta gacccaatca catcatgtca tttagagttg gtaatactgg 12840
gaggatagat aaggcacgaa aacgagccat agcagacatg ctgggtgtag ccaagcagaa 12900
gaaagtagat gggagccaat tgacgagcga gggagctacg ccaatccgac atacgacacg 12960
ctgagatcgt cttggccggg gggtacctac agatgtccaa gggtaagtgc ttgactgtaa 13020
ttgtatgtct gaggacaaat atgtagtcag ccgtataaag tcataccagg caccagtgcc 13080
atcatcgaac cactaactct ctatgataca tgcctccggt attattgtac catgcgtcgc 13140
tttgttacat acgtatcttg cctttttctc tcagaaactc cagactttgg ctattggtcg 13200
agataagccc ggaccatagt gagtctttca cactctgttt aaacaccact aaaaccccac 13260
aaaatatatc ttaccgaata tacagatcta ctatagagga acaattgccc cggagaagac 13320
ggccaggccg cctagatgac aaattcaaca actcacagct gactttctgc cattgccact 13380
aggggggggc ctttttatat ggccaagcca agctctccac gtcggttggg ctgcacccaa 13440
caataaatgg gtagggttgc accaacaaag ggatgggatg gggggtagaa gatacgagga 13500
taacggggct caatggcaca aataagaacg aatactgcca ttaagactcg tgatccagcg 13560
actgacacca ttgcatcatc taagggcctc aaaactacct cggaactgct gcgctgatct 13620
ggacaccaca gaggttccga gcactttagg ttgcaccaaa tgtcccacca ggtgcaggca 13680
gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa agtgagggcg ctgaggtcga 13740
gcagggtggt gtgacttgtt atagccttta gagctgcgaa agcgcgtatg gatttggctc 13800
atcaggccag attgagggtc tgtggacaca tgtcatgtta gtgtacttca atcgccccct 13860
ggatatagcc ccgacaatag gccgtggcct catttttttg ccttccgcac atttccattg 13920
ctcggtaccc acaccttgct tctcctgcac ttgccaacct taatactggt ttacattgac 13980
caacatctta caagcggggg gcttgtctag ggtatatata aacagtggct ctcccaatcg 14040
gttgccagtc tcttttttcc tttctttccc cacagattcg aaatctaaac tacacatcac 14100
acaatgcctg ttactgacgt ccttaagcga aagtccggtg tcatcgtcgg cgacgatgtc 14160
cgagccgtga gtatccacga caagatcagt gtcgagacga cgcgttttgt gtaatgacac 14220
aatccgaaag tcgctagcaa cacacactct ctacacaaac taacccagct ctccatggtg 14280
aaggcttctc gacaggctct gcccctcgtc atcgacggaa aggtgtacga cgtctccgct 14340
tgggtgaact tccaccctgg tggagctgaa atcattgaga actaccaggg acgagatgct 14400
actgacgcct tcatggttat gcactctcag gaagccttcg acaagctcaa gcgaatgccc 14460
aagatcaacc aggcttccga gctgcctccc caggctgccg tcaacgaagc tcaggaggat 14520
ttccgaaagc tccgagaaga gctgatcgcc actggcatgt ttgacgcctc tcccctctgg 14580
tactcgtaca agatcttgac caccctgggt cttggcgtgc ttgccttctt catgctggtc 14640
cagtaccacc tgtacttcat tggtgctctc gtgctcggta tgcactacca gcaaatggga 14700
tggctgtctc atgacatctg ccaccaccag accttcaaga accgaaactg gaataacgtc 14760
ctgggtctgg tctttggcaa cggactccag ggcttctccg tgacctggtg gaaggacaga 14820
cacaacgccc atcattctgc taccaacgtt cagggtcacg atcccgacat tgataacctg 14880
cctctgctcg cctggtccga ggacgatgtc actcgagctt ctcccatctc ccgaaagctc 14940
attcagttcc aacagtacta tttcctggtc atctgtattc tcctgcgatt catctggtgt 15000
ttccagtctg tgctgaccgt tcgatccctc aaggaccgag acaaccagtt ctaccgatct 15060
cagtacaaga aagaggccat tggactcgct ctgcactgga ctctcaagac cctgttccac 15120
ctcttcttta tgccctccat cctgacctcg atgctggtgt tctttgtttc cgagctcgtc 15180
ggtggcttcg gaattgccat cgtggtcttc atgaaccact accctctgga gaagatcggt 15240
gattccgtct gggacggaca tggcttctct gtgggtcaga tccatgagac catgaacatt 15300
cgacgaggca tcattactga ctggttcttt ggaggcctga actaccagat cgagcaccat 15360
ctctggccca ccctgcctcg acacaacctc actgccgttt cctaccaggt ggaacagctg 15420
tgccagaagc acaacctccc ctaccgaaac cctctgcccc atgaaggtct cgtcatcctg 15480
ctccgatacc tgtcccagtt cgctcgaatg gccgagaagc agcccggtgc caaggctcag 15540
taagcggccg catgagaaga taaatatata aatacattga gatattaaat gcgctagatt 15600
agagagcctc atactgctcg gagagaagcc aagacgagta ctcaaagggg attacaccat 15660
ccatatccac agacacaagc tggggaaagg ttctatatac actttccgga ataccgtagt 15720
ttccgatgtt atcaatgggg gcagccagga tttcaggcac ttcggtgtct cggggtgaaa 15780
tggcgttctt ggcctccatc aagtcgtacc atgtcttcat ttgcctgtca aagtaaaaca 15840
gaagcagatg aagaatgaac ttgaagtgaa ggaattt 15877
<210>82
<211>1185
<212>DNA
<213>解脂耶氏酵母
<220>
<221>CDS
<222>(1)..(1185)
<223>二酰基甘油胆碱磷酸转移酶(YlCPT1)
<300>
<302>高二十碳五烯酸生产菌株
<310>WO 2006/052870
<311>2005-11-03
<312>2006-05-18
<313>(1)..(1185)
<300>
<302>高二十碳五烯酸生产菌株
<310>US 2006-0115881-A1
<311>2005-11-02
<312>2006-06-01
<313>(1)..(1185)
<400>82
atg ggc gta ttc att aaa cag gag cag ctt ccg gct ctc aag aag tac 48
Met Gly Val Phe Ile Lys Gln Glu Gln Leu Pro Ala Leu Lys Lys Tyr
1 5 10 15
aag tac tcc gcc gag gat cac tcg ttc atc tcc aac aac att ctg cgc 96
Lys Tyr Ser Ala Glu Asp His Ser Phe Ile Ser Asn Asn Ile Leu Arg
20 25 30
ccc ttc tgg cga cag ttt gtc aaa atc ttc cct ctg tgg atg gcc ccc 144
Pro Phe Trp Arg Gln Phe Val Lys Ile Phe Pro Leu Trp Met Ala Pro
35 40 45
aac atg gtg act ctg ttg ggc ttc ttc ttt gtc att gtg aac ttc atc 192
Asn Met Val Thr Leu Leu Gly Phe Phe Phe Val Ile Val Asn Phe Ile
50 55 60
acc atg ctc att gtt gat ccc acc cac gac cgc gag cct ccc aga tgg 240
Thr Met Leu Ile Val Asp Pro Thr His Asp Arg Glu Pro Pro Arg Trp
65 70 75 80
gtc tac ctc acc tac gct ctg ggt ctg ttc ctt tac cag aca ttt gat 288
Val Tyr Leu Thr Tyr Ala Leu Gly Leu Phe Leu Tyr Gln Thr Phe Asp
85 90 95
gcc tgt gac gga tcc cat gcc cga cga act ggc cag agt gga ccc ctt 336
Ala Cys Asp Gly Ser His Ala Arg Arg Thr Gly Gln Ser Gly Pro Leu
100 105 110
gga gag ctg ttt gac cac tgt gtc gac gcc atg aat acc tct ctg att 384
Gly Glu Leu Phe Asp His Cys Val Asp Ala Met Asn Thr Ser Leu Ile
115 120 125
ctc acg gtg gtg gtg tcc acc acc cat atg gga tat aac atg aag ctg 432
Leu Thr Val Val Val Ser Thr Thr His Met Gly Tyr Asn Met Lys Leu
130 135 140
ctg att gtg cag att gcc gct ctc gga aac ttc tac ctg tcg acc tgg 480
Leu Ile Val Gln Ile Ala Ala Leu Gly Asn Phe Tyr Leu Ser Thr Trp
145 150 155 160
gag acc tac cat acc gga act ctg tac ctt tct ggc ttc tct ggt cct 528
Glu Thr Tyr His Thr Gly Thr Leu Tyr Leu Ser Gly Phe Ser Gly Pro
165 170 175
gtt gaa ggt atc ttg att ctg gtg gct ctt ttc gtc ctc acc ttc ttc 576
Val Glu Gly Ile Leu Ile Leu Val Ala Leu Phe Val Leu Thr Phe Phe
180 185 190
act ggt ccc aac gtg tac gct ctg acc gtc tac gag gct ctt ccc gaa 624
Thr Gly Pro Asn Val Tyr Ala Leu Thr Val Tyr Glu Ala Leu Pro Glu
195 200 205
tcc atc act tcg ctg ctg cct gcc agc ttc ctg gac gtc acc atc acc 672
Ser Ile Thr Ser Leu Leu Pro Ala Ser Phe Leu Asp Val Thr Ile Thr
210 215 220
cag atc tac att gga ttc gga gtg ctg ggc atg gtg ttc aac atc tac 720
Gln Ile Tyr Ile Gly Phe Gly Val Leu Gly Met Val Phe Asn Ile Tyr
225 230 235 240
ggc gcc tgc gga aac gtg atc aag tac tac aac aac aag ggc aag agc 768
Gly Ala Cys Gly Asn Val Ile Lys Tyr Tyr Asn Asn Lys Gly Lys Ser
245 250 255
gct ctc ccc gcc att ctc gga atc gcc ccc ttt ggc atc ttc tac gtc 816
Ala Leu Pro Ala Ile Leu Gly Ile Ala Pro Phe Gly Ile Phe Tyr Val
260 265 270
ggc gtc ttt gcc tgg gcc cat gtt gct cct ctg ctt ctc tcc aag tac 864
Gly Val Phe Ala Trp Ala His Val Ala Pro Leu Leu Leu Ser Lys Tyr
275 280 285
gcc atc gtc tat ctg ttt gcc att ggg gct gcc ttt gcc atg caa gtc 912
Ala Ile Val Tyr Leu Phe Ala Ile Gly Ala Ala Phe Ala Met Gln Val
290 295 300
ggc cag atg att ctt gcc cat ctc gtg ctt gct ccc ttc ccc cac tgg 960
Gly Gln Met Ile Leu Ala His Leu Val Leu Ala Pro Phe Pro His Trp
305 310 315 320
aac gtg ctg ctc ttc ttc ccc ttt gtg gga ctg gca gtg cac tac att 1008
Asn Val Leu Leu Phe Phe Pro Phe Val Gly Leu Ala Val His Tyr Ile
325 330 335
gca ccc gtg ttt ggc tgg gac gcc gat atc gtg tcg gtt aac act ctc 1056
Ala Pro Val Phe Gly Trp Asp Ala Asp Ile Val Ser Val Asn Thr Leu
340 345 350
ttc acc tgt ttt ggc gcc acc ctc tcc att tac gcc ttc ttt gtg ctt 1104
Phe Thr Cys Phe Gly Ala Thr Leu Ser Ile Tyr Ala Phe Phe Val Leu
355 360 365
gag atc atc gac gag atc acc aac tac ctc gat atc tgg tgt ctg cga 1152
Glu Ile Ile Asp Glu Ile Thr Asn Tyr Leu Asp Ile Trp Cys Leu Arg
370 375 380
atc aag tac cct cag gag aag aag act gag taa 1185
Ile Lys Tyr Pro Gln Glu Lys Lys Thr Glu
385 390
<210>83
<211>394
<212>PRT
<213>解脂耶氏酵母
<400>83
Met Gly Val Phe Ile Lys Gln Glu Gln Leu Pro Ala Leu Lys Lys Tyr
1 5 10 15
Lys Tyr Ser Ala Glu Asp His Ser Phe Ile Ser Asn Asn Ile Leu Arg
20 25 30
Pro Phe Trp Arg Gln Phe Val Lys Ile Phe Pro Leu Trp Met Ala Pro
35 40 45
Asn Met Val Thr Leu Leu Gly Phe Phe Phe Val Ile Val Asn Phe Ile
50 55 60
Thr Met Leu Ile Val Asp Pro Thr His Asp Arg Glu Pro Pro Arg Trp
65 70 75 80
Val Tyr Leu Thr Tyr Ala Leu Gly Leu Phe Leu Tyr Gln Thr Phe Asp
85 90 95
Ala Cys Asp Gly Ser His Ala Arg Arg Thr Gly Gln Ser Gly Pro Leu
100 105 110
Gly Glu Leu Phe Asp His Cys Val Asp Ala Met Asn Thr Ser Leu Ile
115 120 125
Leu Thr Val Val Val Ser Thr Thr His Met Gly Tyr Asn Met Lys Leu
130 135 140
Leu Ile Val Gln Ile Ala Ala Leu Gly Asn Phe Tyr Leu Ser Thr Trp
145 150 155 160
Glu Thr Tyr His Thr Gly Thr Leu Tyr Leu Ser Gly Phe Ser Gly Pro
165 170 175
Val Glu Gly Ile Leu Ile Leu Val Ala Leu Phe Val Leu Thr Phe Phe
180 185 190
Thr Gly Pro Asn Val Tyr Ala Leu Thr ValTyr Glu Ala Leu Pro Glu
195 200 205
Ser Ile Thr Ser Leu Leu Pro Ala Ser Phe Leu Asp Val Thr Ile Thr
210 215 220
Gln Ile Tyr Ile Gly Phe Gly Val Leu Gly Met Val Phe Asn Ile Tyr
225 230 235 240
Gly Ala Cys Gly Asn Val Ile Lys Tyr Tyr Asn Asn Lys Gly Lys Ser
245 250 255
Ala Leu Pro Ala Ile Leu Gly Ile Ala Pro Phe Gly Ile Phe Tyr Val
260 265 270
Gly Val Phe Ala Trp Ala His Val Ala Pro Leu Leu Leu Ser Lys Tyr
275 280 285
Ala Ile Val Tyr Leu Phe Ala Ile Gly Ala Ala Phe Ala Met Gln Val
290 295 300
Gly Gln Met Ile Leu Ala His Leu Val Leu Ala Pro Phe Pro His Trp
305 310 315 320
Asn Val Leu Leu Phe Phe Pro Phe Val Gly Leu Ala Val His Tyr Ile
325 330 335
Ala Pro Val Phe Gly Trp Asp Ala Asp Ile Val Ser Val Asn Thr Leu
340 345 350
Phe Thr Cys Phe Gly Ala Thr Leu Ser Ile Tyr Ala Phe Phe Val Leu
355 360 365
Glu Ile Ile Asp Glu Ile Thr Asn Tyr Leu Asp Ile Trp Cys Leu Arg
370 375 380
Ile Lys Tyr Pro Gln Glu Lys Lys Thr Glu
385 390
<210>84
<211>15812
<212>DNA
<213>人工序列
<220>
<223>质粒pZKL2-5U89Gc
<400>84
gtacgttatc atttgaacag tgaaaggcta cagtaacaga agcagttgta aacttcattc 60
cgttgattct gtactacagt accccactac gccgcttccg ctgacactgt tcaacccaaa 120
aactacatct gcgtgcgctg tgtaaggcta tcatcagata catactgtag attctgtaga 180
tgcgaacctg cttgtatcat atacatcccc ctccccctga cctgcacaag caagcaatgt 240
gacattgata ttgctgctta tctagtgccg aggatgtgaa agccgagact caaacatttc 300
ttttactctc ttgttcctga ccagacctgg cggagattac gccagtatga ttcttgcagg 360
tctgagacaa gcctggaaca gccaacattt atttttcgaa gcgagaaaca tgccacaccc 420
cggcacgttc agagatgcat atgatttgtt tttcgagtaa cagtaccccc cccccccccc 480
ccaatgaaac cagtattact cacaccatcc tcattcaaag cgttacactg attacgcgcc 540
catcaacgac agcatgaggg gactgctgat ctgatctaat caaatgacta caaaaatcgc 600
aataatgaag agcaaacgac aaaaaagaaa caggttaacc aatcccgctt caatgtctca 660
ccacaatcca gcactgtttc tcattacctc ctccctctaa tttcagagtt gcatcagggt 720
ccttgatggc gcgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 780
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 840
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 900
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 960
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 1020
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 1080
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 1140
tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 1200
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 1260
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 1320
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 1380
tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 1440
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 1500
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 1560
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 1620
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 1680
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 1740
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 1800
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 1860
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 1920
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 1980
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 2040
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 2100
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 2160
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 2220
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 2280
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 2340
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 2400
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 2460
tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 2520
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 2580
gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 2640
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 2700
catttccccg aaaagtgcca cctgatgcgg tgtgaaatac cgcacagatg cgtaaggaga 2760
aaataccgca tcaggaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt 2820
gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa 2880
aagaatagac cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa 2940
agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac 3000
gtgaaccatc accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga 3060
accctaaagg gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa 3120
aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc 3180
tgcgcgtaac caccacaccc gccgcgctta atgcgccgct acagggcgcg tccattcgcc 3240
attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tattacgcca 3300
gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttttccca 3360
gtcacgacgt tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt 3420
gggcccgacg tcgcatgctg gtttcgattt gtcttagagg aacgcatata cagtaatcat 3480
agagaataaa cgatattcat ttattaaagt agatagttga ggtagaagtt gtaaagagtg 3540
ataaatagct tagataccac agacaccctc ggtgacgaag tactgcagat ggtttccaat 3600
cacattgacc tgctggagca gagtgttacc ggcagagcac tgtttattgc tctggccctg 3660
gcacatgaca acgttggaga gaggagggtg gatcaggggc cagtcaataa agacctcacc 3720
agagcagtgc tggtaaccgt cccagaaggg cacttgaggg acgatatctc ctcggtgggt 3780
gattcggtag agctttcggt ctttggacac cttggagaca tcggggttct cctggccaaa 3840
gaagagttta tcgacccagt tagcaaagcc agcgttaccg acaatgggct gaccaagagt 3900
aacaacgagg ggatcgtggc cgttaacctt gaggttgatt ccgaacagaa gggctgcagc 3960
tcctccgaga gagtgaccgg tgacagcaat ctggtagtcg ggatactgct caatcacaga 4020
gtcgagcttg gggccgatct gattgtaggt gttgttgtag gactggatga agccattgtg 4080
gacaagacag tcatcacaag tagcagtaga agagatgtta gcagcaagat caaagttaat 4140
taactcacct gcaggattga gactatgaat ggattcccgt gcccgtatta ctctactaat 4200
ttgatcttgg aacgcgaaaa tacgtttcta ggactccaaa gaatctcaac tcttgtcctt 4260
actaaatata ctacccatag ttgatggttt acttgaacag agaggacatg ttcacttgac 4320
ccaaagtttc tcgcatctct tggatatttg aacaacggcg tccactgacc gtcagttatc 4380
cagtcacaaa acccccacat tcatacattc ccatgtacgt ttacaaagtt ctcaattcca 4440
tcgtgcaaat caaaatcaca tctattcatt catcatatat aaacccatca tgtctactaa 4500
cactcacaac tccatagaaa acatcgactc agaacacacg ctccatgcgg ccgcttagga 4560
atcctgagcg tccttgacac agtgaaccac accgactttg tgcatgtact tgagggtgga 4620
aatgatgttg cccacaatgg tagggtagaa gacgtaccga actccgtgtc gttcgcaaca 4680
ctctcggaca gcttgctgca cgaagggata gtgccaagac gacattcgag gaaagaggtg 4740
atgctcgatc tggaagttga gaccgccagt aaagaacatg gcaatgggtc caccgtaggt 4800
ggaagaggtc tccacctgag ctctgtacca gtcgatctga tcggcttcaa cgtccttctc 4860
ggagctcttg accttgcagt tcttgtcggg gattcgctcc gagccatcga agttgtgaga 4920
caagatgaaa aagaaggtga ggaaggcacc ggtagcagtg ggcaccagag gaatggtgat 4980
gagcagggag gttccagtga gataccaggg caagaaggcg gttcgaaaga tgaagaaagc 5040
tcgcataacg aatgcaaggg ttcggtaccg tcgcagaaag ccgttctctc gcatggctgt 5100
gacagactcg ggaatggtgt cgttgtgctg cattcggaag atgtagagag ggttgtacac 5160
cagcgaaacg ccgtaggctc caagcacgag gtacatgtac caggcctgga atcggtgaaa 5220
ccactttcga gcagtgttgg cagcagggta gttgtggaac acaaggaatg gttctgcgga 5280
ctcggcatcc aggtcgagac catgctgatt ggtgtaggtg tgatgtcgca tgatgtgaga 5340
ctgcagccag atccatctgg acgatccaat gacgtcgatg ccgtaggcaa agagagcgtt 5400
gacccagggc tttttgctga tggcaccatg agaggcatcg tgctgaatgg acaggccgat 5460
ctgcatgtgc atgaatccag tcaagagacc ccacagcacc attccggtag tagcccagtg 5520
ccactcgcaa aaggcggtga cagcaatgat gccaacggtt cgcagccaga atccaggtgt 5580
ggcataccag ttccgacctt tcatgacctc tcgcatagtt cgcttgacgt cctgtgcaaa 5640
gggagagtcg taggtgtaga caatgtcctt ggaggttcgg tcgtgcttgc ctcgcacgaa 5700
ctgttgaagc agcttcgagt tctcgggctt gacgtaaggg tgcatggagt agaacagagg 5760
agaagcatcg gaggcaccag aagcgaggat caagtcgcct ccgggatgga ccttggcaag 5820
accttccaga tcgtagagaa tgccgtcgat ggcaaccagg tcgggtcgct cgagcagctg 5880
ctcggtagta agggagagag ccatggttgt gaattagggt ggtgagaatg gttggttgta 5940
gggaagaatc aaaggccggt ctcgggatcc gtgggtatat atatatatat atatatatac 6000
gatccttcgt tacctccctg ttctcaaaac tgtggttttt cgtttttcgt tttttgcttt 6060
ttttgatttt tttagggcca actaagcttc cagatttcgc taatcacctt tgtactaatt 6120
acaagaaagg aagaagctga ttagagttgg gctttttatg caactgtgct actccttatc 6180
tctgatatga aagtgtagac ccaatcacat catgtcattt agagttggta atactgggag 6240
gatagataag gcacgaaaac gagccatagc agacatgctg ggtgtagcca agcagaagaa 6300
agtagatggg agccaattga cgagcgaggg agctacgcca atccgacata cgacacgctg 6360
agatcgtctt ggccgggggg tacctacaga tgtccaaggg taagtgcttg actgtaattg 6420
tatgtctgag gacaaatatg tagtcagccg tataaagtca taccaggcac cagtgccatc 6480
atcgaaccac taactctcta tgatacatgc ctccggtatt attgtaccat gcgtcgcttt 6540
gttacatacg tatcttgcct ttttctctca gaaactccag aattctctct cttgagcttt 6600
tccataacaa gttcttctgc ctccaggaag tccatgggtg gtttgatcat ggttttggtg 6660
tagtggtagt gcagtggtgg tattgtgact ggggatgtag ttgagaataa gtcatacaca 6720
agtcagcttt cttcgagcct catataagta taagtagttc aacgtattag cactgtaccc 6780
agcatctccg tatcgagaaa cacaacaaca tgccccattg gacagatcat gcggatacac 6840
aggttgtgca gtatcataca tactcgatca gacaggtcgt ctgaccatca tacaagctga 6900
acaagcgctc catacttgca cgctctctat atacacagtt aaattacata tccatagtct 6960
aacctctaac agttaatctt ctggtaagcc tcccagccag ccttctggta tcgcttggcc 7020
tcctcaatag gatctcggtt ctggccgtac agacctcggc cgacaattat gatatccgtt 7080
ccggtagaca tgacatcctc aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg 7140
tcaagaccca ccccgggggt cagaataagc cagtcctcag agtcgccctt aggtcggttc 7200
tgggcaatga agccaaccac aaactcgggg tcggatcggg caagctcaat ggtctgcttg 7260
gagtactcgc cagtggccag agagcccttg caagacagct cggccagcat gagcagacct 7320
ctggccagct tctcgttggg agaggggact aggaactcct tgtactggga gttctcgtag 7380
tcagagacgt cctccttctt ctgttcagag acagtttcct cggcaccagc tcgcaggcca 7440
gcaatgattc cggttccggg tacaccgtgg gcgttggtga tatcggacca ctcggcgatt 7500
cggtgacacc ggtactggtg cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg 7560
aacaggaaga aaccgtgctt aagagcaagt tccttgaggg ggagcacagt gccggcgtag 7620
gtgaagtcgt caatgatgtc gatatgggtt ttgatcatgc acacataagg tccgacctta 7680
tcggcaagct caatgagctc cttggtggtg gtaacatcca gagaagcaca caggttggtt 7740
ttcttggctg ccacgagctt gagcactcga gcggcaaagg cggacttgtg gacgttagct 7800
cgagcttcgt aggagggcat tttggtggtg aagaggagac tgaaataaat ttagtctgca 7860
gaacttttta tcggaacctt atctggggca gtgaagtata tgttatggta atagttacga 7920
gttagttgaa cttatagata gactggacta tacggctatc ggtccaaatt agaaagaacg 7980
tcaatggctc tctgggcgtc gcctttgccg acaaaaatgt gatcatgatg aaagccagca 8040
atgacgttgc agctgatatt gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc 8100
acagcctcca acgaagaatg tatcgtcaaa gtgatccaag cacactcata gttggagtcg 8160
tactccaaag gcggcaatga cgagtcagac agatactcgt cgaccttttc cttgggaacc 8220
accaccgtca gcccttctga ctcacgtatt gtagccaccg acacaggcaa cagtccgtgg 8280
atagcagaat atgtcttgtc ggtccatttc tcaccaactt taggcgtcaa gtgaatgttg 8340
cagaagaagt atgtgccttc attgagaatc ggtgttgctg atttcaataa agtcttgaga 8400
tcagtttggc cagtcatgtt gtggggggta attggattga gttatcgcct acagtctgta 8460
caggtatact cgctgcccac tttatacttt ttgattccgc tgcacttgaa gcaatgtcgt 8520
ttaccaaaag tgagaatgct ccacagaaca caccccaggg tatggttgag caaaaaataa 8580
acactccgat acggggaatc gaaccccggt ctccacggtt ctcaagaagt attcttgatg 8640
agagcgtatc gatcgaggaa gaggacaagc ggctgcttct taagtttgtg acatcagtat 8700
ccaaggcacc attgcaagga ttcaaggctt tgaacccgtc atttgccatt cgtaacgctg 8760
gtagacaggt tgatcggttc cctacggcct ccacctgtgt caatcttctc aagctgcctg 8820
actatcagga cattgatcaa cttcggaaga aacttttgta tgccattcga tcacatgctg 8880
gtttcgattt gtcttagagg aacgcatata cagtaatcat agagaataaa cgatattcat 8940
ttattaaagt agatagttga ggtagaagtt gtaaagagtg ataaatagcg gccgctcact 9000
gaatcttttt ggctcccttg tgctttcgga cgatgtaggt ctgcacgtag aagttgagga 9060
acagacacag gacagtacca acgtagaagt agttgaaaaa ccagccaaac attctcattc 9120
catcttgtcg gtagcaggga atgttccggt acttccagac gatgtagaag ccaacgttga 9180
actgaatgat ctgcatagaa gtaatcaggg acttgggcat agggaacttg agcttgatca 9240
gtcgggtcca atagtagccg tacatgatcc agtgaatgaa gccgttgagc agcacaaaga 9300
tccaaacggc ttcgtttcgg tagttgtaga acagccacat gtccatagga gctccgagat 9360
ggtgaaagaa ctgcaaccag gtcagaggct tgcccatgag gggcagatag aaggagtcaa 9420
tgtactcgag gaacttgctg aggtagaaca gctgagtggt gattcggaag acattgttgt 9480
cgaaagcctt ctcgcagttg tcggacatga caccaatggt gtacatggcg taggccatag 9540
agaggaagga gcccagcgag tagatggaca tgagcaggtt gtagttggtg aacacaaact 9600
tcattcgaga ctgacccttg ggtccgagag gaccaagggt gaacttcagg atgacgaagg 9660
cgatggagag gtacagcacc tcgcagtgcg aggcatcaga ccagagctga gcatagtcga 9720
ccttgggaag aacctcctgg ccaatggaga cgatttcgtt cacgacctcc atggttgatg 9780
tgtgtttaat tcaagaatga atatagagaa gagaagaaga aaaaagattc aattgagccg 9840
gcgatgcaga cccttatata aatgttgcct tggacagacg gagcaagccc gcccaaacct 9900
acgttcggta taatatgtta agctttttaa cacaaaggtt tggcttgggg taacctgatg 9960
tggtgcaaaa gaccgggcgt tggcgagcca ttgcgcgggc gaatggggcc gtgactcgtc 10020
tcaaattcga gggcgtgcct caattcgtgc ccccgtggct ttttcccgcc gtttccgccc 10080
cgtttgcacc actgcagccg cttctttggt tcggacacct tgctgcgagc taggtgcctt 10140
gtgctactta aaaagtggcc tcccaacacc aacatgacat gagtgcgtgg gccaagacac 10200
gttggcgggg tcgcagtcgg ctcaatggcc cggaaaaaac gctgctggag ctggttcgga 10260
cgcagtccgc cgcggcgtat ggatatccgc aaggttccat agcgccattg ccctccgtcg 10320
gcgtctatcc cgcaacctct aaatagagcg ggaatataac ccaagcttct tttttttcct 10380
ttaacacgca cacccccaac tatcatgttg ctgctgctgt ttgactctac tctgtggagg 10440
ggtgctccca cccaacccaa cctacaggtg gatccggcgc tgtgattggc tgataagtct 10500
cctatccgga ctaattctga ccaatgggac atgcgcgcag gacccaaatg ccgcaattac 10560
gtaaccccaa cgaaatgcct acccctcttt ggagcccagc ggccccaaat ccccccaagc 10620
agcccggttc taccggcttc catctccaag cacaagcagc ccggttctac cggcttccat 10680
ctccaagcac ccctttctcc acaccccaca aaaagacccg tgcaggacat cctactgcgt 10740
gtttaaacac cactaaaacc ccacaaaata tatcttaccg aatatacaga tctactatag 10800
aggaacaatt gccccggaga agacggccag gccgcctaga tgacaaattc aacaactcac 10860
agctgacttt ctgccattgc cactaggggg gggccttttt atatggccaa gccaagctct 10920
ccacgtcggt tgggctgcac ccaacaataa atgggtaggg ttgcaccaac aaagggatgg 10980
gatggggggt agaagatacg aggataacgg ggctcaatgg cacaaataag aacgaatact 11040
gccattaaga ctcgtgatcc agcgactgac accattgcat catctaaggg cctcaaaact 11100
acctcggaac tgctgcgctg atctggacac cacagaggtt ccgagcactt taggttgcac 11160
caaatgtccc accaggtgca ggcagaaaac gctggaacag cgtgtacagt ttgtcttaac 11220
aaaaagtgag ggcgctgagg tcgagcaggg tggtgtgact tgttatagcc tttagagctg 11280
cgaaagcgcg tatggatttg gctcatcagg ccagattgag ggtctgtgga cacatgtcat 11340
gttagtgtac ttcaatcgcc ccctggatat agccccgaca ataggccgtg gcctcatttt 11400
tttgccttcc gcacatttcc attgctcggt acccacacct tgcttctcct gcacttgcca 11460
accttaatac tggtttacat tgaccaacat cttacaagcg gggggcttgt ctagggtata 11520
tataaacagt ggctctccca atcggttgcc agtctctttt ttcctttctt tccccacaga 11580
ttcgaaatct aaactacaca tcacacaatg cctgttactg acgtccttaa gcgaaagtcc 11640
ggtgtcatcg tcggcgacga tgtccgagcc gtgagtatcc acgacaagat cagtgtcgag 11700
acgacgcgtt ttgtgtaatg acacaatccg aaagtcgcta gcaacacaca ctctctacac 11760
aaactaaccc agctctccat ggtgaaggct tctcgacagg ctctgcccct cgtcatcgac 11820
ggaaaggtgt acgacgtctc cgcttgggtg aacttccacc ctggtggagc tgaaatcatt 11880
gagaactacc agggacgaga tgctactgac gccttcatgg ttatgcactc tcaggaagcc 11940
ttcgacaagc tcaagcgaat gcccaagatc aaccaggctt ccgagctgcc tccccaggct 12000
gccgtcaacg aagctcagga ggatttccga aagctccgag aagagctgat cgccactggc 12060
atgtttgacg cctctcccct ctggtactcg tacaagatct tgaccaccct gggtcttggc 12120
gtgcttgcct tcttcatgct ggtccagtac cacctgtact tcattggtgc tctcgtgctc 12180
ggtatgcact accagcaaat gggatggctg tctcatgaca tctgccacca ccagaccttc 12240
aagaaccgaa actggaataa cgtcctgggt ctggtctttg gcaacggact ccagggcttc 12300
tccgtgacct ggtggaagga cagacacaac gcccatcatt ctgctaccaa cgttcagggt 12360
cacgatcccg acattgataa cctgcctctg ctcgcctggt ccgaggacga tgtcactcga 12420
gcttctccca tctcccgaaa gctcattcag ttccaacagt actatttcct ggtcatctgt 12480
attctcctgc gattcatctg gtgtttccag tctgtgctga ccgttcgatc cctcaaggac 12540
cgagacaacc agttctaccg atctcagtac aagaaagagg ccattggact cgctctgcac 12600
tggactctca agaccctgtt ccacctcttc tttatgccct ccatcctgac ctcgatgctg 12660
gtgttctttg tttccgagct cgtcggtggc ttcggaattg ccatcgtggt cttcatgaac 12720
cactaccctc tggagaagat cggtgattcc gtctgggacg gacatggctt ctctgtgggt 12780
cagatccatg agaccatgaa cattcgacga ggcatcatta ctgactggtt ctttggaggc 12840
ctgaactacc agatcgagca ccatctctgg cccaccctgc ctcgacacaa cctcactgcc 12900
gtttcctacc aggtggaaca gctgtgccag aagcacaacc tcccctaccg aaaccctctg 12960
ccccatgaag gtctcgtcat cctgctccga tacctgtccc agttcgctcg aatggccgag 13020
aagcagcccg gtgccaaggc tcagtaagcg gccgcatgag aagataaata tataaataca 13080
ttgagatatt aaatgcgcta gattagagag cctcatactg ctcggagaga agccaagacg 13140
agtactcaaa ggggattaca ccatccatat ccacagacac aagctgggga aaggttctat 13200
atacactttc cggaataccg tagtttccga tgttatcaat gggggcagcc aggatttcag 13260
gcacttcggt gtctcggggt gaaatggcgt tcttggcctc catcaagtcg taccatgtct 13320
tcatttgcct gtcaaagtaa aacagaagca gatgaagaat gaacttgaag tgaaggaatt 13380
taaatagttg gagcaaggga gaaatgtaga gtgtgaaaga ctcactatgg tccgggctta 13440
tctcgaccaa tagccaaagt ctggagtttc tgagagaaaa aggcaagata cgtatgtaac 13500
aaagcgacgc atggtacaat aataccggag gcatgtatca tagagagtta gtggttcgat 13560
gatggcactg gtgcctggta tgactttata cggctgacta catatttgtc ctcagacata 13620
caattacagt caagcactta cccttggaca tctgtaggta ccccccggcc aagacgatct 13680
cagcgtgtcg tatgtcggat tggcgtagct ccctcgctcg tcaattggct cccatctact 13740
ttcttctgct tggctacacc cagcatgtct gctatggctc gttttcgtgc cttatctatc 13800
ctcccagtat taccaactct aaatgacatg atgtgattgg gtctacactt tcatatcaga 13860
gataaggagt agcacagttg cataaaaagc ccaactctaa tcagcttctt cctttcttgt 13920
aattagtaca aaggtgatta gcgaaatctg gaagcttagt tggccctaaa aaaatcaaaa 13980
aaagcaaaaa acgaaaaacg aaaaaccaca gttttgagaa cagggaggta acgaaggatc 14040
gtatatatat atatatatat atatacccac ggatcccgag accggccttt gattcttccc 14100
tacaaccaac cattctcacc accctaattc acaaccatgg gcgtattcat taaacaggag 14160
cagcttccgg ctctcaagaa gtacaagtac tccgccgagg atcactcgtt catctccaac 14220
aacattctgc gccccttctg gcgacagttt gtcaaaatct tccctctgtg gatggccccc 14280
aacatggtga ctctgctggg cttcttcttt gtcattgtga acttcatcac catgctcatt 14340
gttgatccca cccacgaccg cgagcctccc agatgggtct acctcaccta cgctctgggt 14400
ctgttccttt accagacatt tgatgcctgt gacggatccc atgcccgacg aactggccag 14460
agtggacccc ttggagagct gtttgaccac tgtgtcgacg ccatgaatac ctctctgatt 14520
ctcacggtgg tggtgtccac cacccatatg ggatataaca tgaagctact gattgtgcag 14580
attgccgctc tcggaaactt ctacctgtcg acctgggaga cctaccatac cggaactctg 14640
tacctttctg gcttctctgg tcctgttgaa ggtatcttga ttctggtggc tcttttcgtc 14700
ctcaccttct tcactggtcc caacgtgtac gctctgaccg tctacgaggc tcttcccgag 14760
tccatcactt cgctgctgcc tgccagcttc ctggacgtca ccatcaccca gatctacatt 14820
ggattcggag tgctgggcat ggtgttcaac atctacggcg cctgcggaaa cgtgatcaag 14880
tactacaaca acaagggcaa gagcgctctc cccgccattc tcggaatcgc cccctttggc 14940
atcttctacg tcggcgtctt tgcctgggcc catgttgctc ctctgcttct ctccaagtac 15000
gccatcgtct atctgtttgc cattggggct gcctttgcca tgcaagtcgg ccagatgatt 15060
cttgcccatc tcgtgcttgc tccctttccc cactggaacg tgctgctctt cttccccttt 15120
gtgggactgg cagtgcacta cattgcaccc gtgtttggct gggacgccga tatcgtgtcg 15180
gttaacactc tcttcacctg ttttggcgcc accctctcca tttacgcctt ctttgtgctt 15240
gagatcatcg acgagatcac caactacctc gatatctggt gtctgcgaat caagtaccct 15300
caggagaaga agaccgaata agcggccgca tggagcgtgt gttctgagtc gatgttttct 15360
atggagttgt gagtgttagt agacatgatg ggtttatata tgatgaatga atagatgtga 15420
ttttgatttg cacgatggaa ttgagaactt tgtaaacgta catgggaatg tatgaatgtg 15480
ggggttttgt gactggataa ctgacggtca gtggacgccg ttgttcaaat atccaagaga 15540
tgcgagaaac tttgggtcaa gtgaacatgt cctctctgtt caagtaaacc atcaactatg 15600
ggtagtatat ttagtaagga caagagttga gattctttgg agtcctagaa acgtattttc 15660
gcgttccaag atcaaattag tagagtaata cgggcacggg aatccattca tagtctcaat 15720
cctgcaggtg agttaattaa tcgagcttgg cgtaatcatg gtcatagctg tttcctgtgt 15780
gaaattgtta tccgctcaca attccacaca ac 15812
<210>85
<211>4313
<212>DNA
<213>人工序列
<220>
<223>质粒pZKUM
<400>85
taatcgagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc 60
acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 120
gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 180
tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 240
cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 300
gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 360
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 420
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 480
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 540
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 600
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 660
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 720
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 780
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 840
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 900
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 960
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 1020
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 1080
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 1140
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 1200
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 1260
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 1320
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 1380
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 1440
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 1500
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 1560
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 1620
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 1680
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 1740
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 1800
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 1860
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 1920
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 1980
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 2040
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 2100
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 2160
cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 2220
acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc 2280
ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct 2340
ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat 2400
ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 2460
acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc 2520
tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg 2580
atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat ttccattcgc 2640
cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc 2700
agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc 2760
agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat 2820
tgggtaccgg gccccccctc gaggtcgacg agtatctgtc tgactcgtca ttgccgcctt 2880
tggagtacga ctccaactat gagtgtgctt ggatcacttt gacgatacat tcttcgttgg 2940
aggctgtggg tctgacagct gcgttttcgg cgcggttggc cgacaacaat atcagctgca 3000
acgtcattgc tggctttcat catgatcaca tttttgtcgg caaaggcgac gcccagagag 3060
ccattgacgt tctttctaat ttggaccgat agccgtatag tccagtctat ctataagttc 3120
aactaactcg taactattac cataacatat acttcactgc cccagataag gttccgataa 3180
aaagttctgc agactaaatt tatttcagtc tcctcttcac caccaaaatg ccctcctacg 3240
aagctcgagt gctcaagctc gtggcagcca agaaaaccaa cctgtgtgct tctctggatg 3300
ttaccaccac caaggagctc attgagcttg ccgataaggt cggaccttat gtgtgcatga 3360
tcaaaaccca tatcgacatc attgacgact tcacctacgc cggcactgtg ctccccctca 3420
aggaacttgc tcttaagcac ggtttcttcc tgttcgagga cagaaagttc gcagatattg 3480
gcaacactgt caagcaccag taccggtgtc accgaatcgc cgagtggtcc gatatcacca 3540
acgcccacgg tgtacccgga accggaatcg attgctggcc tgcgagctgg tgcgtacgag 3600
gaaactgtct ctgaacagaa gaaggaggac gtctctgact acgagaactc ccagtacaag 3660
gagttcctag tcccctctcc caacgagaag ctggccagag gtctgctcat gctggccgag 3720
ctgtcttgca agggctctct ggccactggc gagtactcca agcagaccat tgagcttgcc 3780
cgatccgacc ccgagtttgt ggttggcttc attgcccaga accgacctaa gggcgactct 3840
gaggactggc ttattctgac ccccggggtg ggtcttgacg acaagggaga cgctctcgga 3900
cagcagtacc gaactgttga ggatgtcatg tctaccggaa cggatatcat aattgtcggc 3960
cgaggtctgt acggccagaa ccgagatcct attgaggagg ccaagcgata ccagaaggct 4020
ggctgggagg cttaccagaa gattaactgt tagaggttag actatggata tgtaatttaa 4080
ctgtgtatat agagagcgtg caagtatgga gcgcttgttc agcttgtatg atggtcagac 4140
gacctgtctg atcgagtatg tatgatactg cacaacctgt gtatccgcat gatctgtcca 4200
atggggcatg ttgttgtgtt tctcgatacg gagatgctgg gtacagtgct aatacgttga 4260
actacttata cttatatgag gctcgaagaa agctgacttg tgtatgactt aat 4313
<210>86
<211>1459
<212>DNA
<213>人工序列
<220>
<223>合成突变型Ura3基因,包含对耶氏酵母属Ura3编码区(GenBank保藏号AJ306421)的一个从+21至+53的33bp的缺失,一个在+376的1bp缺失,和一个从+400至403的3bp缺失
<400>86
gagtatctgt ctgactcgtc attgccgcct ttggagtacg actccaacta tgagtgtgct 60
tggatcactt tgacgataca ttcttcgttg gaggctgtgg gtctgacagc tgcgttttcg 120
gcgcggttgg ccgacaacaa tatcagctgc aacgtcattg ctggctttca tcatgatcac 180
atttttgtcg gcaaaggcga cgcccagaga gccattgacg ttctttctaa tttggaccga 240
tagccgtata gtccagtcta tctataagtt caactaactc gtaactatta ccataacata 300
tacttcactg ccccagataa ggttccgata aaaagttctg cagactaaat ttatttcagt 360
ctcctcttca ccaccaaaat gccctcctac gaagctcgag tgctcaagct cgtggcagcc 420
aagaaaacca acctgtgtgc ttctctggat gttaccacca ccaaggagct cattgagctt 480
gccgataagg tcggacctta tgtgtgcatg atcaaaaccc atatcgacat cattgacgac 540
ttcacctacg ccggcactgt gctccccctc aaggaacttg ctcttaagca cggtttcttc 600
ctgttcgagg acagaaagtt cgcagatatt ggcaacactg tcaagcacca gtaccggtgt 660
caccgaatcg ccgagtggtc cgatatcacc aacgcccacg gtgtacccgg aaccggaatc 720
gattgctggc ctgcgagctg gtgcgtacga ggaaactgtc tctgaacaga agaaggagga 780
cgtctctgac tacgagaact cccagtacaa ggagttccta gtcccctctc ccaacgagaa 840
gctggccaga ggtctgctca tgctggccga gctgtcttgc aagggctctc tggccactgg 900
cgagtactcc aagcagacca ttgagcttgc ccgatccgac cccgagtttg tggttggctt 960
cattgcccag aaccgaccta agggcgactc tgaggactgg cttattctga cccccggggt 1020
gggtcttgac gacaagggag acgctctcgg acagcagtac cgaactgttg aggatgtcat 1080
gtctaccgga acggatatca taattgtcgg ccgaggtctg tacggccaga accgagatcc 1140
tattgaggag gccaagcgat accagaaggc tggctgggag gcttaccaga agattaactg 1200
ttagaggtta gactatggat atgtaattta actgtgtata tagagagcgt gcaagtatgg 1260
agcgcttgtt cagcttgtat gatggtcaga cgacctgtct gatcgagtat gtatgatact 1320
gcacaacctg tgtatccgca tgatctgtcc aatggggcat gttgttgtgt ttctcgatac 1380
ggagatgctg ggtacagtgc taatacgttg aactacttat acttatatga ggctcgaaga 1440
aagctgactt gtgtatgac 1459
<210>87
<211>7966
<212>DNA
<213>人工序列
<220>
<223>质粒pYPS161
<400>87
aaatgtaacg aaactgaaat ttgaccagat attgtgtccg cggtggagct ccagcttttg 60
ttccctttag tgagggttaa tttcgagctt ggcgtaatca tggtcatagc tgtttcctgt 120
gtgaaattgt tatccgctca caagcttcca cacaacgtac gttctggttg gctcggatga 180
tttctgcggc cccagcgtaa ggcaggcgtt ccgtccggat cggtttgggt cggatcggct 240
ttttgattgt cgtattgtcg ctcatgttgg acctggtgtg tagttgtagt gtcagatcag 300
attcaccagc gaatgcatgt gaacttcccc acattttgag ccgaggcaga tttgggttgc 360
ttagtaagca gacgtggcgt tgcaagtaga tgtggcaaat ggggacgaag attccgaggg 420
gatatcatag ttccaagggg atgtcatcat ttgccagctt tcgccgccac ttttgacgag 480
tttttgtggg tcaaataagt ttagttgaac ttttcaaatt tcagttggca ttttgttaat 540
agaaagggtg ccggtgctgg ggggttcatt cctcgggttg cagatatcct atctgtctta 600
ggggtatctc tttcaatcga caagatgtag ttgggtaaca attatttatt aatattctct 660
ccatccagta cagtactaac atcttgacat ctcagcacaa gtgcatcttc ccaagtgttt 720
gttggagagg ttgttgggta ttacttagga aacagaacac agtacgtgga gatcttggat 780
acatcgtaca tggaggttat ccataaaaaa gaccctccag gactagttac aatgccgtta 840
gatgaggaaa tccacaaccc tgattcacta tgaacatatt atcttccccc aaacttgcga 900
tatatggccc ttgatgatag ccttgatttt acccttgatg gtacctccac gaccaaccga 960
tctgctgttt gaagagatat tttcaaattt gaagtgctca gatctactaa acatgagtcc 1020
agtaattctt tccgtctttc cgatttccga tattcccttt tttagcccga cttttcactg 1080
ctcccatgtc aaacgattag gacttgggag acaatcccac tgtcaaaatc accccgatat 1140
tctctgtaaa acaagtactt cttccacgtg atcttcaaat acctcttcca cgtgaccttc 1200
aaatacctct tcaagtacct cttccacgcg accttcaaag tcccttcaaa tacccttctc 1260
aattctcccc ttctcctcca tagtccttct ctctgactaa gcttgagaat acatgacgct 1320
aagacgaaaa cacactagag accctgagag cctgaacatg catccactct gcagttgcgc 1380
acgtgcctac agcaactatc gggtccagtg ctggatctga cactgcgtct ccctatgaag 1440
aaactgataa acagatctgc actcataaca atgatctgag cgatgaaaac gtgacctcca 1500
cagccacaag tcataatcgg cgcgccagct gcattaatga atcggccaac gcgcggggag 1560
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 1620
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1800
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1920
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1980
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 2040
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 2220
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 2280
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 2340
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 2400
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 2460
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 2520
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 2580
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 2640
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 2700
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 2760
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2820
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2880
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2940
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3060
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 3120
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 3180
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 3240
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3300
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3480
ggttccgcgc acatttcccc gaaaagtgcc acctgatgcg gtgtgaaata ccgcacagat 3540
gcgtaaggag aaaataccgc atcaggaaat tgtaagcgtt aatattttgt taaaattcgc 3600
gttaaatttt tgttaaatca gctcattttt taaccaatag gccgaaatcg gcaaaatccc 3660
ttataaatca aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag 3720
tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga 3780
tggcccacta cgtgaaccat caccctaatc aagttttttg gggtcgaggt gccgtaaagc 3840
actaaatcgg aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa 3900
cgtggcgaga aaggaaggga agaaagcgaa aggagcgggc gctagggcgc tggcaagtgt 3960
agcggtcacg ctgcgcgtaa ccaccacacc cgccgcgctt aatgcgccgc tacagggcgc 4020
gtccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 4080
ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 4140
gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta 4200
tagggcgaat tgggcccgac gtcgcatgca actattagtg aggcttcggg agtggttgtc 4260
tcggttgtct cattcagact cgttgtgttg tatctatatc tatataaaca ctcttgtccc 4320
tcaatcccac tgccatcttt tgctaaactt gccgccaata tgaaactcat ctccctcatc 4380
accgtcgcta ccaccgctct ggcggctgtc ggagacaagt acaagctgac ctataccaga 4440
tcagacgccc aatcggtcga atctctgccc gtcacctacc aagatgacct gatcaccgcc 4500
tccaccgacg gcgaacccat caccatcacc gagggcgagg gcaacacctt ctctgttaac 4560
gacatgccca tcgcctatct ggagctgcag gctttgttct ggaccggcga ctacggctac 4620
aagctccagg gctcggtctt tgacattgcc gccgatggaa cctttgagct gagagacggc 4680
cccaaggagt actactattg cactcctcac cctgagcgaa acgtcatcta cgtcatcaac 4740
agccccgact actccaagtg tcggttcaag cgtaccatca agttccacgc tgaaaagatc 4800
taagtggtaa tcgaccgact aaccattttt agctgacaaa cacttgctaa ctcctataac 4860
gaatgaatga ctaacttggc atattgttac caagtattac ttgggatata gttgagtgta 4920
accattgcta agaatccaaa ctggagcttc taaaggtctg ggagtcgccg tatgtgttca 4980
tatcgaaatc aaagaaatca taatcgcaac agaattcaaa atcaagcaga ttaatatcca 5040
ttattgtact cggatcgtga catatctgat atgatctcgg atatgatctc tgactgttta 5100
ctgggagatt tgttgaagat ttgttgaggt tatctgaaaa gtagacaata gagacaaaat 5160
gacgatatca agaactgaat cgggccgaaa tactcggtat cattcccttc agcagtaact 5220
gtattgctct atcaatgcga cgagatacct ccacaattaa tactgtatac gctctaccac 5280
tcatatctcc aatgctaaaa tatattcatg cccaggacct ctgtgcactg ctatgcagca 5340
cagtgttgtc gattgaattg gtcgtgtctg gtccctgatg ctctgtgtct cgctgactag 5400
tccttccatc cagacctcgt cattatctga taggcaacaa gttctgctct ctcacaccct 5460
gccgacacaa gggacactcg ggcttctctc tcacccattc ggaaatacag tccttaatta 5520
agttgcgaca catgtcttga tagtatcttg aattctctct cttgagcttt tccataacaa 5580
gttcttctgc ctccaggaag tccatgggtg gtttgatcat ggttttggtg tagtggtagt 5640
gcagtggtgg tattgtgact ggggatgtag ttgagaataa gtcatacaca agtcagcttt 5700
cttcgagcct catataagta taagtagttc aacgtattag cactgtaccc agcatctccg 5760
tatcgagaaa cacaacaaca tgccccattg gacagatcat gcggatacac aggttgtgca 5820
gtatcataca tactcgatca gacaggtcgt ctgaccatca tacaagctga acaagcgctc 5880
catacttgca cgctctctat atacacagtt aaattacata tccatagtct aacctctaac 5940
agttaatctt ctggtaagcc tcccagccag ccttctggta tcgcttggcc tcctcaatag 6000
gatctcggtt ctggccgtac agacctcggc cgacaattat gatatccgtt ccggtagaca 6060
tgacatcctc aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca 6120
ccccgggggt cagaataagc cagtcctcag agtcgccctt aggtcggttc tgggcaatga 6180
agccaaccac aaactcgggg tcggatcggg caagctcaat ggtctgcttg gagtactcgc 6240
cagtggccag agagcccttg caagacagct cggccagcat gagcagacct ctggccagct 6300
tctcgttggg agaggggact aggaactcct tgtactggga gttctcgtag tcagagacgt 6360
cctccttctt ctgttcagag acagtttcct cggcaccagc tcgcaggcca gcaatgattc 6420
cggttccggg tacaccgtgg gcgttggtga tatcggacca ctcggcgatt cggtgacacc 6480
ggtactggtg cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga 6540
aaccgtgctt aagagcaagt tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt 6600
caatgatgtc gatatgggtt ttgatcatgc acacataagg tccgacctta tcggcaagct 6660
caatgagctc cttggtggtg gtaacatcca gagaagcaca caggttggtt ttcttggctg 6720
ccacgagctt gagcactcga gcggcaaagg cggacttgtg gacgttagct cgagcttcgt 6780
aggagggcat tttggtggtg aagaggagac tgaaataaat ttagtctgca gaacttttta 6840
tcggaacctt atctggggca gtgaagtata tgttatggta atagttacga gttagttgaa 6900
cttatagata gactggacta tacggctatc ggtccaaatt agaaagaacg tcaatggctc 6960
tctgggcgtc gcctttgccg acaaaaatgt gatcatgatg aaagccagca atgacgttgc 7020
agctgatatt gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca 7080
acgaagaatg tatcgtcaaa gtgatccaag cacactcata gttggagtcg tactccaaag 7140
gcggcaatga cgagtcagac agatactcgt cgaccttttc cttgggaacc accaccgtca 7200
gcccttctga ctcacgtatt gtagccaccg acacaggcaa cagtccgtgg atagcagaat 7260
atgtcttgtc ggtccatttc tcaccaactt taggcgtcaa gtgaatgttg cagaagaagt 7320
atgtgccttc attgagaatc ggtgttgctg atttcaataa agtcttgaga tcagtttggc 7380
cagtcatgtt gtggggggta attggattga gttatcgcct acagtctgta caggtatact 7440
cgctgcccac tttatacttt ttgattccgc tgcacttgaa gcaatgtcgt ttaccaaaag 7500
tgagaatgct ccacagaaca caccccaggg tatggttgag caaaaaataa acactccgat 7560
acggggaatc gaaccccggt ctccacggtt ctcaagaagt attcttgatg agagcgtatc 7620
gatgagccta aaatgaaccc gagtatatct cataaaattc tcggtgagag gtctgtgact 7680
gtcagtacaa ggtgccttca ttatgccctc aaccttacca tacctcactg aatgtagtgt 7740
acctctaaaa atgaaataca gtgccaaaag ccaaggcact gagctcgtct aacggacttg 7800
atatacaacc aattaaaaca aatgaaaaga aatacagttc tttgtatcat ttgtaacaat 7860
taccctgtac aaactaaggt attgaaatcc cacaatattc ccaaagtcca cccctttcca 7920
aattgtcatg cctacaactc atataccaag cactaaccta ccgttt 7966
<210>88
<211>20
<212>DNA
<213>人工序列
<220>
<223>引物Pex-10del13’.正向
<400>88
ccaacatgag cgacaatacg 20
<210>89
<211>20
<212>DNA
<213>人工序列
<220>
<223>引物Pex-10del25’.反向
<400>89
caagttctgc tctctcacac 20
Claims (15)
1.使具有总脂质含量、总脂质级分和油级分的含油真核生物中至少一种多不饱和脂肪酸的重量百分比相对于总脂肪酸的重量百分比提高的方法,所述方法包括:
a)提供含油真核生物,所述含油真核生物包含:
1)编码功能性多不饱和脂肪酸生物合成途径的基因;和
2)在编码过氧化物酶体生物合成因子蛋白的天然基因中的破
坏,由此提供PEX破坏的生物,以及
b)使所述PEX破坏的生物在以下条件下生长:即,使得当与编码过氧化物酶体生物合成因子蛋白的天然基因未被破坏的含油真核生物中的总脂质级分或油级分中相对于总脂肪酸的重量百分比的至少一种多不饱和脂肪酸的重量百分比进行比较时,所述总脂质级分或油级分中至少一种多不饱和脂肪酸的重量百分比相对于总脂肪酸的重量百分比提高。
2.提高含油真核生物中至少一种多不饱和脂肪酸相对于干细胞重量的百分比的方法,所述方法包括:
a)提供含油真核生物,所述含油真核生物包含:
1)编码功能性多不饱和脂肪酸生物合成途径的基因;和
2)在编码过氧化物酶体生物合成因子蛋白的天然基因中的破坏,由此提供PEX破坏的生物,以及
b)使所述PEX破坏的生物在以下条件下生长:即,使得当与编码过氧化物酶体生物合成因子蛋白的天然基因未被破坏的含油真核生物中至少一种多不饱和脂肪酸相对于干细胞重量的百分比进行比较时,所述至少一种多不饱和脂肪酸相对于所述干细胞重量的百分比提高。
3.权利要求1或2的方法,其中当与编码过氧化物酶体生物合成因子蛋白的天然基因未被破坏的含油真核生物中的总脂质级分或油级分中相对于总脂肪酸的重量百分比的多不饱和脂肪酸的重量百分比进行比较时,所述至少一种多不饱和脂肪酸的重量百分比相对于总脂肪酸的重量百分比的提高为至少1.3。
4.权利要求1或2的方法,其中所述至少一种多不饱和脂肪酸选自:
亚油酸、共轭亚油酸、γ-亚麻酸、二高-γ-亚麻酸、花生四烯酸、二十二碳四烯酸、ω-6二十二碳五烯酸、α-亚麻酸、十八碳四烯酸、二十碳四烯酸、二十碳五烯酸、ω-3二十二碳五烯酸、二十碳二烯酸、二十碳三烯酸、二十二碳六烯酸、这些脂肪酸的羟基化或环氧脂肪酸、C18多不饱和脂肪酸、C20多不饱和脂肪酸、和C22多不饱和脂肪酸。
5.权利要求1的方法,其中所述至少一种多不饱和脂肪酸由多不饱和脂肪酸的组合组成,并且其中所述组合的重量百分比相对于总脂肪酸的重量百分比被提高。
6.权利要求4的方法,其中所述多不饱和脂肪酸的组合由两种或更多种选自下组的多不饱和脂肪酸的任何组合组成:
亚油酸、共轭亚油酸、γ-亚麻酸、二高-γ-亚麻酸、花生四烯酸、二十二碳四烯酸、ω-6二十二碳五烯酸、α-亚麻酸、十八碳四烯酸、二十碳四烯酸、二十碳五烯酸、ω-3二十二碳五烯酸、二十碳二烯酸、二十碳三烯酸、二十二碳六烯酸、这些脂肪酸的羟基化或环氧脂肪酸、C20多不饱和脂肪酸的组合、C20-22多不饱和脂肪酸的组合、和C22多不饱和脂肪酸的组合。
7.权利要求1或2的方法,其中当与编码过氧化物酶体生物合成因子蛋白的天然基因未被破坏的含油真核生物中的总脂质含量比较时,所述PEX破坏的生物中的总脂质含量发生改变。
8.权利要求1或2的方法,其中所述PEX破坏的生物选自:具有含油特性的耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)、油脂酵母属(Lipomyces)、被孢霉属(Mortierella)、破囊壶菌属(Thraustochytrium)、裂殖壶菌属(Schizochytrium)、和糖酵母属(Saccharomyces)。
9.权利要求1或2的方法,其中所述多不饱和脂肪酸生物合成途径包含编码选自下组的酶的基因:
Δ9去饱和酶、Δ12去饱和酶、Δ6去饱和酶、Δ5去饱和酶、Δ17去饱和酶、Δ8去饱和酶、Δ15去饱和酶、Δ4去饱和酶、C14/16延伸酶、C16/18延伸酶、C18/20延伸酶、C20/22延伸酶和Δ9延伸酶。
10.权利要求1或2的方法,其中在编码过氧化物酶体生物合成因子蛋白的天然基因中的所述破坏包括缺失,所述缺失选自:
在编码所述蛋白的C末端部分的基因部分中的缺失和基因敲除。
11.权利要求1或2的方法,其中所述过氧化物酶体生物合成因子蛋白选自:
Pex1p、Pex2p、Pex3p、Pex3Bp、Pex4p、Pex5p、Pex5Bp、Pex5Cp、Pex5/20p、Pex6p、Pex7p、Pex8p、Pex10p、Pex12p、Pex13p、Pex14p、Pex15p、Pex16p、Pex17p、Pex14/17p、Pex18p、Pex19p、Pex20p、Pex21p、Pex21Bp、Pex22p、Pex22p样和Pex26p。
12.权利要求11的方法,其中所述过氧化物酶体生物合成因子,其中所述破坏是基因敲除或在编码所述蛋白的C3HC4环状锌指基序的C末端部分的基因部分中的缺失。
13.在PEX破坏的生物中的油级分或总脂质级分,其具有至少一种多不饱和脂肪酸的重量百分比相对于总脂肪酸的重量百分比的提高,其中所述提高是通过权利要求1的方法获得的。
14.PEX破坏的生物的至少一种多不饱和脂肪酸用作食品、饲料或用于工业应用的用途,所述至少一种多不饱和脂肪酸的重量百分比相对于总脂肪酸的重量百分比已经通过权利要求1的方法得到提高。
15.PEX破坏的解脂耶氏酵母,其中所述破坏发生在编码过氧化物酶体生物合成因子蛋白的天然基因中,所述蛋白选自Pex3p、Pex10p和Pex16p。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US97717407P | 2007-10-03 | 2007-10-03 | |
US60/977174 | 2007-10-03 | ||
PCT/US2008/078671 WO2009046248A1 (en) | 2007-10-03 | 2008-10-03 | Peroxisome biogenesis factor protein (pex) disruptions for altering the content of polyunsaturated fatty acids and the total lipid content in oleaginous eukaryotic organisms |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101883843A true CN101883843A (zh) | 2010-11-10 |
CN101883843B CN101883843B (zh) | 2016-03-02 |
Family
ID=40032846
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200880118917.4A Expired - Fee Related CN101883843B (zh) | 2007-10-03 | 2008-10-03 | 破坏过氧化物酶体生物合成因子蛋白(pex)以改变含油真核生物中多不饱和脂肪酸和总脂质含量 |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP2198005B1 (zh) |
JP (1) | JP5658034B2 (zh) |
CN (1) | CN101883843B (zh) |
CA (1) | CA2701130A1 (zh) |
DK (1) | DK2198005T3 (zh) |
WO (1) | WO2009046248A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104911192A (zh) * | 2015-04-27 | 2015-09-16 | 海南大学 | 拟南芥抗逆相关基因AtPED2及其制备方法和应用 |
CN114395500A (zh) * | 2014-12-10 | 2022-04-26 | 银杏生物制品公司 | 酵母中的油酸生产 |
CN118726155A (zh) * | 2024-06-26 | 2024-10-01 | 微康益生菌(苏州)股份有限公司 | 一种防治真菌性阴道炎的植物乳植杆菌Lp769及其应用 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2009335903B2 (en) * | 2008-12-18 | 2016-04-14 | E. I. Du Pont De Nemours And Company | Reducing byproduction of malonates in a fermentation process |
AU2010260234B2 (en) | 2009-06-16 | 2015-02-26 | E. I. Du Pont De Nemours And Company | High eicosapentaenoic acid oils from improved optimized strains of Yarrowia lipolytica |
JP5893555B2 (ja) | 2009-06-16 | 2016-03-23 | イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニーE.I.Du Pont De Nemours And Company | エイコサペンタエン酸を高産生するよう改良されたヤロウイア・リポリティカ(Yarrowia lipolytica)最適化株 |
CA2763424C (en) | 2009-06-16 | 2018-06-12 | E.I. Du Pont De Nemours And Company | Improvement of long chain omega-3 and omega-6 polyunsaturated fatty acid biosynthesis by expression of acyl-coa lysophospholipid acyltransferases |
WO2013013211A1 (en) * | 2011-07-21 | 2013-01-24 | Dsm Ip Assets B.V. | Fatty acid compositions |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7214491B2 (en) * | 2003-05-07 | 2007-05-08 | E. I. Du Pont De Nemours And Company | Δ-12 desaturase gene suitable for altering levels of polyunsaturated fatty acids in oleaginous yeasts |
US7267976B2 (en) * | 2003-07-02 | 2007-09-11 | E.I. Du Pont De Nemours And Company | Acyltransferases for alteration of polyunsaturated fatty acids and oil content in oleaginous yeasts |
US7504259B2 (en) * | 2003-11-12 | 2009-03-17 | E. I. Du Pont De Nemours And Company | Δ12 desaturases suitable for altering levels of polyunsaturated fatty acids in oleaginous yeast |
WO2005047479A2 (en) * | 2003-11-12 | 2005-05-26 | E.I. Dupont De Nemours And Company | Delta-15 desaturases suitable for altering levels of polyunsaturated fatty acids in oilseed plants and oleaginous yeast |
US7273746B2 (en) * | 2004-11-04 | 2007-09-25 | E.I. Dupont De Nemours And Company | Diacylglycerol acyltransferases for alteration of polyunsaturated fatty acids and oil content in oleaginous organisms |
US7588931B2 (en) * | 2004-11-04 | 2009-09-15 | E. I. Du Pont De Nemours And Company | High arachidonic acid producing strains of Yarrowia lipolytica |
CA2624661C (en) * | 2005-11-23 | 2015-06-30 | E.I. Du Pont De Nemours And Company | Delta-9 elongases and their use in making polyunsaturated fatty acids |
-
2008
- 2008-10-03 EP EP08835263.8A patent/EP2198005B1/en not_active Not-in-force
- 2008-10-03 CA CA2701130A patent/CA2701130A1/en not_active Abandoned
- 2008-10-03 WO PCT/US2008/078671 patent/WO2009046248A1/en active Application Filing
- 2008-10-03 CN CN200880118917.4A patent/CN101883843B/zh not_active Expired - Fee Related
- 2008-10-03 JP JP2010528150A patent/JP5658034B2/ja not_active Expired - Fee Related
- 2008-10-03 DK DK08835263.8T patent/DK2198005T3/da active
Non-Patent Citations (2)
Title |
---|
LIN YUE ET AL: "The peroxisome deficient Arabidopsis mutant ssel exhibits impaired fatty acid synthesis", 《PLANT PHYSIOLOGY》 * |
SUMITA TORU ET AL: "Perosisome deficiency represses the expression of n-alkane-inducible Y1ALK1 encoding cytochrome P450ALK1 in Yarrowia lipolytica", 《FEMS MICROBIOLOGY LETTERS》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114395500A (zh) * | 2014-12-10 | 2022-04-26 | 银杏生物制品公司 | 酵母中的油酸生产 |
CN104911192A (zh) * | 2015-04-27 | 2015-09-16 | 海南大学 | 拟南芥抗逆相关基因AtPED2及其制备方法和应用 |
CN118726155A (zh) * | 2024-06-26 | 2024-10-01 | 微康益生菌(苏州)股份有限公司 | 一种防治真菌性阴道炎的植物乳植杆菌Lp769及其应用 |
Also Published As
Publication number | Publication date |
---|---|
EP2198005A1 (en) | 2010-06-23 |
EP2198005B1 (en) | 2014-05-07 |
JP2010539985A (ja) | 2010-12-24 |
CA2701130A1 (en) | 2009-04-09 |
CN101883843B (zh) | 2016-03-02 |
JP5658034B2 (ja) | 2015-01-21 |
DK2198005T3 (da) | 2014-07-21 |
WO2009046248A1 (en) | 2009-04-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101365788B (zh) | Δ-9延伸酶及其在制备多不饱和脂肪酸中的用途 | |
CN101939434B (zh) | 用于在大豆中提高种子贮藏油脂的生成和改变脂肪酸谱的来自解脂耶氏酵母的dgat基因 | |
KR102700050B1 (ko) | 조작된 내수송/외수송을 가진 미생물 숙주에서 모유 올리고당류의 생산 | |
DK2087105T3 (da) | Delta 17-desaturase og anvendelse heraf ved fremstilling af flerumættede fedtsyrer | |
DK2087106T3 (en) | MUTATING DELTA8 DESATURATION GENES CONSTRUCTED BY TARGETED MUTAGENES AND USE THEREOF IN THE MANUFACTURE OF MULTI-Saturated FAT ACIDS | |
DK2140006T3 (en) | DELTA-5 desaturases AND USE THEREOF FOR THE PRODUCTION OF polyunsaturated fatty acids | |
DK2443248T3 (en) | IMPROVEMENT OF LONG-CHAIN POLYUM Saturated OMEGA-3 AND OMEGA-6 FATTY ACID BIOS SYNTHESIS BY EXPRESSION OF ACYL-CoA LYSOPHOSPHOLIPID ACYL TRANSFERASES | |
CA2683497C (en) | .delta.8 desaturases and their use in making polyunsaturated fatty acids | |
DK2324119T3 (en) | Mutant DELTA5 Desaturases AND USE THEREOF FOR THE PRODUCTION OF polyunsaturated fatty acids | |
KR102711998B1 (ko) | 조작되고 완전-기능 맞춤 당단백질 | |
CN101646766B (zh) | △17去饱和酶及其用于制备多不饱和脂肪酸的用途 | |
KR20070085669A (ko) | 고농도의 아라키돈산을 생성하는 야로위아 리폴리티카 균주 | |
CN112204147A (zh) | 基于Cpf1的植物转录调控系统 | |
KR20140099224A (ko) | 케토-아이소발레레이트 데카르복실라제 효소 및 이의 이용 방법 | |
KR20220012327A (ko) | 피토칸나비노이드 및 피토칸나비노이드 전구체의 생산을 위한 방법 및 세포 | |
KR20130032897A (ko) | 알코올 발효 시의 알코올 에스테르의 생성 및 원위치에서의 생성물 제거 | |
BRPI0711020A2 (pt) | polinucleotìdeo isolado, construto de dna recombinente, célula, método para transformar uma célula, método para produzir uma planta trasfornanda, sementes transgênicas, método para a produção de ácidos graxos poliinsaturados de cadeia longa em uma célula vegetal, óleos ou subporudots, método para produzir pelo menos um ácido graxo poliinsaturado em uma célula vegetal de uma semente oleaginosa, plantas de semente oleoginosa, sementes transgênicas, produto alimentìcios, progênies de plantas e molécula de ácido nucléico isolada | |
KR20140092759A (ko) | 숙주 세포 및 아이소부탄올의 제조 방법 | |
KR20120099509A (ko) | 재조합 숙주 세포에서 육탄당 키나아제의 발현 | |
KR20130105649A (ko) | 피루베이트로부터 아세토락테이트로의 전환을 촉매작용시키는 폴리펩티드를 암호화하는 폴리뉴클레오티드의 통합 | |
KR20140113997A (ko) | 부탄올 생성을 위한 유전자 스위치 | |
BRPI0806354A2 (pt) | plantas oleaginosas transgências, sementes, óleos, produtos alimentìcios ou análogos a alimento, produtos alimentìcios medicinais ou análogos alimentìcios medicinais, produtos farmacêuticos, bebidas fórmulas para bebês, suplementos nutricionais, rações para animais domésticos, alimentos para aquacultura, rações animais, produtos de sementes inteiras, produtos de óleos misturados, produtos, subprodutos e subprodutos parcialmente processados | |
DK2623594T3 (da) | Antistof mod human prostaglandin-E2-receptor EP4 | |
CN101883843A (zh) | 破坏过氧化物酶体生物合成因子蛋白(pex)以改变含油真核生物中多不饱和脂肪酸和总脂质含量 | |
CN109996874A (zh) | 10-甲基硬脂酸的异源性产生 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160302 Termination date: 20191003 |
|
CF01 | Termination of patent right due to non-payment of annual fee |