CN101646766B - Δ17 desaturase and its use for preparing polyunsaturated fatty acids - Google Patents
Δ17 desaturase and its use for preparing polyunsaturated fatty acids
- Publication number
- CN101646766B CN2007800139991A CN200780013999A
- Authority
- CN
- China
- Prior art keywords
- sequence
- gene
- desaturases
- ala
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 title abstract description 54
- ZXVONLUNISGICL-UHFFFAOYSA-N 4,6-dinitro-o-cresol Chemical compound CC1=CC([N+]([O-])=O)=CC([N+]([O-])=O)=C1O ZXVONLUNISGICL-UHFFFAOYSA-N 0.000 title 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 92
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 89
- 235000020673 eicosapentaenoic acid Nutrition 0.000 claims abstract description 59
- 238000006243 chemical reaction Methods 0.000 claims abstract description 27
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 claims abstract description 10
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 claims abstract description 9
- 229960005135 eicosapentaenoic acid Drugs 0.000 claims abstract description 9
- 235000021342 arachidonic acid Nutrition 0.000 claims abstract description 5
- 229940114079 arachidonic acid Drugs 0.000 claims abstract description 5
- 108090000623 proteins and genes Proteins 0.000 claims description 194
- 238000000034 method Methods 0.000 claims description 95
- 241000235015 Yarrowia lipolytica Species 0.000 claims description 94
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 94
- 229920001184 polypeptide Polymers 0.000 claims description 90
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 90
- 239000002773 nucleotide Substances 0.000 claims description 60
- 125000003729 nucleotide group Chemical group 0.000 claims description 60
- 102000039446 nucleic acids Human genes 0.000 claims description 59
- 108020004707 nucleic acids Proteins 0.000 claims description 59
- 238000000926 separation method Methods 0.000 claims description 42
- 230000008859 change Effects 0.000 claims description 33
- 241000894006 Bacteria Species 0.000 claims description 30
- VZCCETWTMQHEPK-QNEBEIHSSA-N gamma-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCC(O)=O VZCCETWTMQHEPK-QNEBEIHSSA-N 0.000 claims description 19
- 235000020664 gamma-linolenic acid Nutrition 0.000 claims description 17
- 241000233654 Oomycetes Species 0.000 claims description 16
- 241000235070 Saccharomyces Species 0.000 claims description 15
- 230000000295 complement effect Effects 0.000 claims description 15
- 241000223252 Rhodotorula Species 0.000 claims description 14
- 241000233866 Fungi Species 0.000 claims description 13
- IQLUYYHUNSSHIY-HZUMYPAESA-N eicosatetraenoic acid Chemical compound CCCCCCCCCCC\C=C\C=C\C=C\C=C\C(O)=O IQLUYYHUNSSHIY-HZUMYPAESA-N 0.000 claims description 13
- VZCCETWTMQHEPK-UHFFFAOYSA-N gamma-Linolensaeure Natural products CCCCCC=CCC=CCC=CCCCCC(O)=O VZCCETWTMQHEPK-UHFFFAOYSA-N 0.000 claims description 10
- 241000195493 Cryptophyta Species 0.000 claims description 8
- HXWJFEZDFPRLBG-UHFFFAOYSA-N Timnodonic acid Natural products CCCC=CC=CCC=CCC=CCC=CCCCC(O)=O HXWJFEZDFPRLBG-UHFFFAOYSA-N 0.000 claims description 8
- 229960002733 gamolenic acid Drugs 0.000 claims description 8
- 241000894007 species Species 0.000 claims description 8
- 241001527609 Cryptococcus Species 0.000 claims description 6
- 241000223230 Trichosporon Species 0.000 claims description 6
- 238000004064 recycling Methods 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 5
- 241000222120 Candida <Saccharomycetales> Species 0.000 claims description 4
- 241001149698 Lipomyces Species 0.000 claims description 3
- 241001467333 Thraustochytriaceae Species 0.000 claims description 3
- 241000235013 Yarrowia Species 0.000 claims description 3
- 241000490645 Yarrowia sp. Species 0.000 claims description 2
- 241000003595 Aurantiochytrium limacinum Species 0.000 claims 1
- 241000235575 Mortierella Species 0.000 claims 1
- 241001558147 Mortierella sp. Species 0.000 claims 1
- 241000598397 Schizochytrium sp. Species 0.000 claims 1
- 241001298230 Thraustochytrium sp. Species 0.000 claims 1
- 235000020660 omega-3 fatty acid Nutrition 0.000 abstract description 35
- 239000012634 fragment Substances 0.000 abstract description 31
- 238000004519 manufacturing process Methods 0.000 abstract description 19
- 235000020665 omega-6 fatty acid Nutrition 0.000 abstract description 11
- 229940033080 omega-6 fatty acid Drugs 0.000 abstract description 11
- 235000020978 long-chain polyunsaturated fatty acids Nutrition 0.000 abstract description 3
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 abstract 1
- 108090000790 Enzymes Proteins 0.000 description 104
- 210000004027 cell Anatomy 0.000 description 102
- 102000004190 Enzymes Human genes 0.000 description 99
- 239000013612 plasmid Substances 0.000 description 89
- 108020004705 Codon Proteins 0.000 description 81
- 230000001580 bacterial effect Effects 0.000 description 69
- 150000001413 amino acids Chemical class 0.000 description 67
- 108020004414 DNA Proteins 0.000 description 62
- 150000002632 lipids Chemical class 0.000 description 59
- 230000014509 gene expression Effects 0.000 description 56
- 239000002253 acid Substances 0.000 description 54
- 108010022240 delta-8 fatty acid desaturase Proteins 0.000 description 51
- 239000000047 product Substances 0.000 description 38
- HOBAELRKJCKHQD-UHFFFAOYSA-N (8Z,11Z,14Z)-8,11,14-eicosatrienoic acid Natural products CCCCCC=CCC=CCC=CCCCCCCC(O)=O HOBAELRKJCKHQD-UHFFFAOYSA-N 0.000 description 32
- HOBAELRKJCKHQD-QNEBEIHSSA-N dihomo-γ-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCCCC(O)=O HOBAELRKJCKHQD-QNEBEIHSSA-N 0.000 description 32
- 238000013459 approach Methods 0.000 description 30
- 239000000194 fatty acid Substances 0.000 description 29
- 239000000758 substrate Substances 0.000 description 29
- 244000068988 Glycine max Species 0.000 description 27
- 235000010469 Glycine max Nutrition 0.000 description 27
- 241000233614 Phytophthora Species 0.000 description 27
- 230000012010 growth Effects 0.000 description 27
- 239000000523 sample Substances 0.000 description 27
- 230000000694 effects Effects 0.000 description 26
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 25
- 229940024606 amino acid Drugs 0.000 description 23
- 235000001014 amino acid Nutrition 0.000 description 23
- 230000009466 transformation Effects 0.000 description 23
- 206010042434 Sudden death Diseases 0.000 description 20
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 19
- 108091028043 Nucleic acid sequence Proteins 0.000 description 19
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 19
- 244000052769 pathogen Species 0.000 description 19
- 230000001717 pathogenic effect Effects 0.000 description 19
- 241000195620 Euglena Species 0.000 description 18
- 229910052799 carbon Inorganic materials 0.000 description 18
- 238000009396 hybridization Methods 0.000 description 18
- 102100034542 Acyl-CoA (8-3)-desaturase Human genes 0.000 description 17
- 108010073542 Delta-5 Fatty Acid Desaturase Proteins 0.000 description 17
- 102000004169 proteins and genes Human genes 0.000 description 17
- 238000006555 catalytic reaction Methods 0.000 description 16
- 235000018102 proteins Nutrition 0.000 description 16
- 238000004458 analytical method Methods 0.000 description 15
- 235000014113 dietary fatty acids Nutrition 0.000 description 15
- 229930195729 fatty acid Natural products 0.000 description 15
- 239000003921 oil Substances 0.000 description 15
- 108700012830 rat Lip2 Proteins 0.000 description 15
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 14
- 241000196324 Embryophyta Species 0.000 description 14
- 101100005882 Mus musculus Cel gene Proteins 0.000 description 14
- 101100289046 Mus musculus Lias gene Proteins 0.000 description 14
- 101150091094 lipA gene Proteins 0.000 description 14
- YAFQFNOUYXZVPZ-UHFFFAOYSA-N liproxstatin-1 Chemical compound ClC1=CC=CC(CNC=2C3(CCNCC3)NC3=CC=CC=C3N=2)=C1 YAFQFNOUYXZVPZ-UHFFFAOYSA-N 0.000 description 14
- 108020004999 messenger RNA Proteins 0.000 description 14
- 108091026890 Coding region Proteins 0.000 description 13
- 108010087894 Fatty acid desaturases Proteins 0.000 description 13
- 241000907999 Mortierella alpina Species 0.000 description 13
- 108700026244 Open Reading Frames Proteins 0.000 description 13
- 150000004665 fatty acids Chemical class 0.000 description 13
- 238000007429 general method Methods 0.000 description 13
- 244000005700 microbiome Species 0.000 description 13
- 102100034544 Acyl-CoA 6-desaturase Human genes 0.000 description 12
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 12
- 108010037138 Linoleoyl-CoA Desaturase Proteins 0.000 description 12
- 230000002068 genetic effect Effects 0.000 description 12
- 239000007788 liquid Substances 0.000 description 12
- 108700023863 Gene Components Proteins 0.000 description 11
- 241000233622 Phytophthora infestans Species 0.000 description 11
- 108010011713 delta-15 desaturase Proteins 0.000 description 11
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 11
- MBMBGCFOFBJSGT-KUBAVDMBSA-N docosahexaenoic acid Natural products CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 description 11
- 235000013350 formula milk Nutrition 0.000 description 11
- 238000013519 translation Methods 0.000 description 11
- 230000014616 translation Effects 0.000 description 11
- Δ12 desaturases Proteins 0.000 description 11
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 10
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 10
- 238000013461 design Methods 0.000 description 10
- 239000008103 glucose Substances 0.000 description 10
- 229940012843 omega-3 fatty acid Drugs 0.000 description 10
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 10
- 238000005809 transesterification reaction Methods 0.000 description 10
- SEHFUALWMUWDKS-UHFFFAOYSA-N 5-fluoroorotic acid Chemical compound OC(=O)C=1NC(=O)NC(=O)C=1F SEHFUALWMUWDKS-UHFFFAOYSA-N 0.000 description 9
- 102000009114 Fatty acid desaturases Human genes 0.000 description 9
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 9
- 239000000284 extract Substances 0.000 description 9
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 239000004519 grease Substances 0.000 description 9
- 238000003752 polymerase chain reaction Methods 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 230000001131 transforming effect Effects 0.000 description 9
- 238000005406 washing Methods 0.000 description 9
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 8
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 8
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 8
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 8
- 239000005642 Oleic acid Substances 0.000 description 8
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 8
- 239000001963 growth medium Substances 0.000 description 8
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 8
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 239000000203 mixture Substances 0.000 description 8
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 8
- 239000002243 precursor Substances 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 230000014621 translational initiation Effects 0.000 description 8
- 102100039239 Amidophosphoribosyltransferase Human genes 0.000 description 7
- 108010039224 Amidophosphoribosyltransferase Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 238000011156 evaluation Methods 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 108010033653 omega-3 fatty acid desaturase Proteins 0.000 description 7
- 238000004321 preservation Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 6
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 230000003570 biosynthesizing effect Effects 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010036413 histidylglycine Proteins 0.000 description 6
- 108010028295 histidylhistidine Proteins 0.000 description 6
- 239000013067 intermediate product Substances 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- 230000004060 metabolic process Effects 0.000 description 6
- 229910052757 nitrogen Inorganic materials 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 230000003362 replicative effect Effects 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 5
- 108700010070 Codon Usage Proteins 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 5
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 5
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 5
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 5
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 108091034057 RNA (poly(A)) Proteins 0.000 description 5
- 101000912235 Rebecca salina Acyl-lipid (7-3)-desaturase Proteins 0.000 description 5
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 5
- 101000877236 Siganus canaliculatus Acyl-CoA Delta-4 desaturase Proteins 0.000 description 5
- 108700005078 Synthetic Genes Proteins 0.000 description 5
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 5
- 241000144181 Thraustochytrium aureum Species 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 235000005911 diet Nutrition 0.000 description 5
- 230000037213 diet Effects 0.000 description 5
- 235000019197 fats Nutrition 0.000 description 5
- 238000000855 fermentation Methods 0.000 description 5
- 230000004151 fermentation Effects 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 230000000968 intestinal effect Effects 0.000 description 5
- 101150077696 lip-1 gene Proteins 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 238000007899 nucleic acid hybridization Methods 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 150000004671 saturated fatty acids Chemical class 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- 108010003137 tyrosyltyrosine Proteins 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 4
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 4
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 4
- 101100289888 Caenorhabditis elegans lys-5 gene Proteins 0.000 description 4
- 241000233732 Fusarium verticillioides Species 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 241000282414 Homo sapiens Species 0.000 description 4
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 4
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 4
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 4
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 4
- WQDUMFSSJAZKTM-UHFFFAOYSA-N Sodium methoxide Chemical compound [Na+].[O-]C WQDUMFSSJAZKTM-UHFFFAOYSA-N 0.000 description 4
- 102100028897 Stearoyl-CoA desaturase Human genes 0.000 description 4
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 4
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 4
- 238000009825 accumulation Methods 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 230000001588 bifunctional effect Effects 0.000 description 4
- 150000001721 carbon Chemical class 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 230000001276 controlling effect Effects 0.000 description 4
- 230000008878 coupling Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 239000003925 fat Substances 0.000 description 4
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 238000011081 inoculation Methods 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 4
- 229960004232 linoleic acid Drugs 0.000 description 4
- 229960004488 linolenic acid Drugs 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- TUNFSRHWOTWDNC-UHFFFAOYSA-N tetradecanoic acid Chemical compound CCCCCCCCCCCCCC(O)=O TUNFSRHWOTWDNC-UHFFFAOYSA-N 0.000 description 4
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 4
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 4
- 102000057234 Acyl transferases Human genes 0.000 description 3
- 108700016155 Acyl transferases Proteins 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 3
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 3
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 3
- 241000024188 Andala Species 0.000 description 3
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 3
- JGIAYNNXZKKKOW-KKUMJFAQSA-N Asn-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N JGIAYNNXZKKKOW-KKUMJFAQSA-N 0.000 description 3
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 3
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 3
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 3
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 3
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 3
- 108010051219 Cre recombinase Proteins 0.000 description 3
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 3
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 3
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 3
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 3
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 3
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 3
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 3
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 3
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 3
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 3
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 3
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 3
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 3
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 3
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 3
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 3
- 240000007049 Juglans regia Species 0.000 description 3
- 235000009496 Juglans regia Nutrition 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 3
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 3
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 3
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 3
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 3
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 3
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 3
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 3
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 3
- MNNKPHGAPRUKMW-BPUTZDHNSA-N Met-Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MNNKPHGAPRUKMW-BPUTZDHNSA-N 0.000 description 3
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 3
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 3
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 3
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 3
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 3
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 3
- 241000948155 Phytophthora sojae Species 0.000 description 3
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 3
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 3
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 3
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 3
- 101100382243 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YAT1 gene Proteins 0.000 description 3
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 3
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 3
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 3
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 3
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 3
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 3
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 3
- 241000078013 Trichormus variabilis Species 0.000 description 3
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 3
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 3
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 3
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 3
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 3
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 3
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 3
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 239000000654 additive Substances 0.000 description 3
- 230000000996 additive effect Effects 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000011130 ammonium sulphate Nutrition 0.000 description 3
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 3
- 230000008238 biochemical pathway Effects 0.000 description 3
- 239000007795 chemical reaction product Substances 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 238000013016 damping Methods 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 230000008034 disappearance Effects 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- 150000002066 eicosanoids Chemical class 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- 238000013467 fragmentation Methods 0.000 description 3
- 238000006062 fragmentation reaction Methods 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 230000008676 import Effects 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 150000002500 ions Chemical class 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 238000007834 ligase chain reaction Methods 0.000 description 3
- OYHQOLUKZRVURQ-IXWMQOLASA-N linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 description 3
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- SECPZKHBENQXJG-FPLPWBNLSA-N palmitoleic acid Chemical compound CCCCCC\C=C/CCCCCCCC(O)=O SECPZKHBENQXJG-FPLPWBNLSA-N 0.000 description 3
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- NLKNQRATVPKPDG-UHFFFAOYSA-M potassium iodide Chemical compound [K+].[I-] NLKNQRATVPKPDG-UHFFFAOYSA-M 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 229920006395 saturated elastomer Polymers 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 230000002194 synthesizing effect Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000010474 transient expression Effects 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 3
- 229940045145 uridine Drugs 0.000 description 3
- 235000020234 walnut Nutrition 0.000 description 3
- XSXIVVZCUAHUJO-AVQMFFATSA-N (11e,14e)-icosa-11,14-dienoic acid Chemical compound CCCCC\C=C\C\C=C\CCCCCCCCCC(O)=O XSXIVVZCUAHUJO-AVQMFFATSA-N 0.000 description 2
- 101150084750 1 gene Proteins 0.000 description 2
- LODRRYMGPWQCTR-UHFFFAOYSA-N 5-fluoro-2,4-dioxo-1h-pyrimidine-6-carboxylic acid;hydrate Chemical compound O.OC(=O)C=1NC(=O)NC(=O)C=1F LODRRYMGPWQCTR-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 108010023063 Bacto-peptone Proteins 0.000 description 2
- 241000222178 Candida tropicalis Species 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 241000223233 Cutaneotrichosporon cutaneum Species 0.000 description 2
- 241000235646 Cyberlindnera jadinii Species 0.000 description 2
- 241000199914 Dinophyceae Species 0.000 description 2
- 235000021297 Eicosadienoic acid Nutrition 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 230000005526 G1 to G0 transition Effects 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 2
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- ZRALSGWEFCBTJO-UHFFFAOYSA-N Guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 2
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 2
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 2
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 2
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 2
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 2
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 2
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 2
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 2
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 2
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 206010027336 Menstruation delayed Diseases 0.000 description 2
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 2
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 2
- YJNDFEWPGLNLNH-IHRRRGAJSA-N Met-Tyr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 YJNDFEWPGLNLNH-IHRRRGAJSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 241000199911 Peridinium Species 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 2
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- 241000370518 Phytophthora ramorum Species 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 229940096437 Protein S Drugs 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 2
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 2
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- HGZINTSBOUQIBU-UHFFFAOYSA-N Thr Tyr Gly Gly Chemical compound OC(=O)CNC(=O)CNC(=O)C(NC(=O)C(N)C(O)C)CC1=CC=C(O)C=C1 HGZINTSBOUQIBU-UHFFFAOYSA-N 0.000 description 2
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- GIBPOCDKBPNRJB-HSHDSVGOSA-N Thr-Met-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GIBPOCDKBPNRJB-HSHDSVGOSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 2
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 2
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 2
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 2
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 2
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 2
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- 101100215634 Yarrowia lipolytica (strain CLIB 122 / E 150) XPR2 gene Proteins 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- CKUAXEQHGKSLHN-UHFFFAOYSA-N [C].[N] Chemical compound [C].[N] CKUAXEQHGKSLHN-UHFFFAOYSA-N 0.000 description 2
- 125000002252 acyl group Chemical group 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- AHANXAKGNAKFSK-PDBXOOCHSA-N all-cis-icosa-11,14,17-trienoic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCCCC(O)=O AHANXAKGNAKFSK-PDBXOOCHSA-N 0.000 description 2
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 239000012620 biological material Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 239000006071 cream Substances 0.000 description 2
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 description 2
- 238000009795 derivation Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- DVSZKTAMJJTWFG-UHFFFAOYSA-N docosa-2,4,6,8,10,12-hexaenoic acid Chemical compound CCCCCCCCCC=CC=CC=CC=CC=CC=CC(O)=O DVSZKTAMJJTWFG-UHFFFAOYSA-N 0.000 description 2
- PRHHYVQTPBEDFE-UHFFFAOYSA-N eicosatrienoic acid Natural products CCCCCC=CCC=CCCCCC=CCCCC(O)=O PRHHYVQTPBEDFE-UHFFFAOYSA-N 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 230000032050 esterification Effects 0.000 description 2
- 238000005886 esterification reaction Methods 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 229940098330 gamma linoleic acid Drugs 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 235000011187 glycerol Nutrition 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 229910052500 inorganic mineral Inorganic materials 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 150000002617 leukotrienes Chemical class 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000012269 metabolic engineering Methods 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Natural products C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000011707 mineral Substances 0.000 description 2
- 235000010755 mineral Nutrition 0.000 description 2
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical class CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 238000003499 nucleic acid array Methods 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 2
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 150000003180 prostaglandins Chemical class 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 235000003441 saturated fatty acids Nutrition 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- YZAZXIUFBCPZGB-QZOPMXJLSA-N (z)-octadec-9-enoic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O.CCCCCCCC\C=C/CCCCCCCC(O)=O YZAZXIUFBCPZGB-QZOPMXJLSA-N 0.000 description 1
- PORPENFLTBBHSG-MGBGTMOVSA-N 1,2-dihexadecanoyl-sn-glycerol-3-phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(O)=O)OC(=O)CCCCCCCCCCCCCCC PORPENFLTBBHSG-MGBGTMOVSA-N 0.000 description 1
- IHPYMWDTONKSCO-UHFFFAOYSA-N 2,2'-piperazine-1,4-diylbisethanesulfonic acid Chemical compound OS(=O)(=O)CCN1CCN(CCS(O)(=O)=O)CC1 IHPYMWDTONKSCO-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 1
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- PIFPCDRPHCQLSJ-WYIJOVFWSA-N 4,8,12,15,19-Docosapentaenoic acid Chemical compound CC\C=C\CC\C=C\C\C=C\CC\C=C\CC\C=C\CCC(O)=O PIFPCDRPHCQLSJ-WYIJOVFWSA-N 0.000 description 1
- 101150014984 ACO gene Proteins 0.000 description 1
- 101150040074 Aco2 gene Proteins 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- VQBULXOHAZSTQY-GKCIPKSASA-N Ala-Trp-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VQBULXOHAZSTQY-GKCIPKSASA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 101100172211 Arabidopsis thaliana HAG3 gene Proteins 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 241000220243 Brassica sp. Species 0.000 description 1
- 241000195940 Bryophyta Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- WLYGSPLCNKYESI-RSUQVHIMSA-N Carthamin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1[C@@]1(O)C(O)=C(C(=O)\C=C\C=2C=CC(O)=CC=2)C(=O)C(\C=C\2C([C@](O)([C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)C(O)=C(C(=O)\C=C\C=3C=CC(O)=CC=3)C/2=O)=O)=C1O WLYGSPLCNKYESI-RSUQVHIMSA-N 0.000 description 1
- 241000208809 Carthamus Species 0.000 description 1
- 244000020518 Carthamus tinctorius Species 0.000 description 1
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 1
- PIFPCDRPHCQLSJ-UHFFFAOYSA-N Clupanodonic acid Natural products CCC=CCCC=CCC=CCCC=CCCC=CCCC(O)=O PIFPCDRPHCQLSJ-UHFFFAOYSA-N 0.000 description 1
- 244000241257 Cucumis melo Species 0.000 description 1
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 1
- UDDITVWSXPEAIQ-IHRRRGAJSA-N Cys-Phe-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UDDITVWSXPEAIQ-IHRRRGAJSA-N 0.000 description 1
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 201000004624 Dermatitis Diseases 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000195619 Euglena gracilis Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 241001149475 Gaeumannomyces graminis Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- SAHTWBLTLJWAQA-XIRDDKMYSA-N Gln-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N SAHTWBLTLJWAQA-XIRDDKMYSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 241000490472 Helianthus sp. Species 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- JWLWNCVBBSBCEM-NKIYYHGXSA-N His-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O JWLWNCVBBSBCEM-NKIYYHGXSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- VGYOLSOFODKLSP-IHPCNDPISA-N His-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 VGYOLSOFODKLSP-IHPCNDPISA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 1
- CSRRMQFXMBPSIL-SIXJUCDHSA-N His-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N CSRRMQFXMBPSIL-SIXJUCDHSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 1
- CNPNWGHRMBQHBZ-ZKWXMUAHSA-N Ile-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O CNPNWGHRMBQHBZ-ZKWXMUAHSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 241000208202 Linaceae Species 0.000 description 1
- 241000208204 Linum Species 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 241000255640 Loa loa Species 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- XFANQCRHTMOEAP-WDSOQIARSA-N Lys-Pro-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XFANQCRHTMOEAP-WDSOQIARSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- 108091027974 Mature messenger RNA Proteins 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- BQVJARUIXRXDKN-DCAQKATOSA-N Met-Asn-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BQVJARUIXRXDKN-DCAQKATOSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- WRLYTJVPSUBYST-AVGNSLFASA-N Met-His-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N WRLYTJVPSUBYST-AVGNSLFASA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 1
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 1
- NLHSFJQUHGCWSD-PYJNHQTQSA-N Met-Ile-His Chemical compound N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O NLHSFJQUHGCWSD-PYJNHQTQSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 1
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- 241001123676 Metschnikowia pulcherrima Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- CHJJGSNFBQVOTG-UHFFFAOYSA-N N-methyl-guanidine Natural products CNC(N)=N CHJJGSNFBQVOTG-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108091093105 Nuclear DNA Proteins 0.000 description 1
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 1
- 239000007990 PIPES buffer Substances 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 235000021319 Palmitoleic acid Nutrition 0.000 description 1
- 241000934740 Peridinium sp. Species 0.000 description 1
- 241000233679 Peronosporaceae Species 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- 102000001107 Phosphatidate Phosphatase Human genes 0.000 description 1
- 108010069394 Phosphatidate Phosphatase Proteins 0.000 description 1
- 244000298647 Poinciana pulcherrima Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 229920002319 Poly(methyl acrylate) Polymers 0.000 description 1
- 244000028344 Primula vulgaris Species 0.000 description 1
- 235000016311 Primula vulgaris Nutrition 0.000 description 1
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- CFVRJNZJQHDQPP-CYDGBPFRSA-N Pro-Ile-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 CFVRJNZJQHDQPP-CYDGBPFRSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 241000169446 Promethis Species 0.000 description 1
- 241000382353 Pupa Species 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241000918585 Pythium aphanidermatum Species 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 241000221523 Rhodotorula toruloides Species 0.000 description 1
- 101100501251 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ELO3 gene Proteins 0.000 description 1
- 241000172147 Saprolegnia diclina Species 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 1
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 239000004141 Sodium laurylsulphate Substances 0.000 description 1
- 241000862632 Soja Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- BGHVVGPELPHRCI-HZTRNQAASA-N Thr-Trp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N)O BGHVVGPELPHRCI-HZTRNQAASA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- XKGZEDNYGPNJAR-XIRDDKMYSA-N Trp-Asn-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N XKGZEDNYGPNJAR-XIRDDKMYSA-N 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- DTPARJBMONKGGC-IHPCNDPISA-N Trp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N DTPARJBMONKGGC-IHPCNDPISA-N 0.000 description 1
- SSNGFWKILJLTQM-QEJZJMRPSA-N Trp-Gln-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SSNGFWKILJLTQM-QEJZJMRPSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- OAZLRFLMQASGNW-PMVMPFDFSA-N Trp-His-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O)N OAZLRFLMQASGNW-PMVMPFDFSA-N 0.000 description 1
- BYSKNUASOAGJSS-NQCBNZPSSA-N Trp-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BYSKNUASOAGJSS-NQCBNZPSSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- YTZYHKOSHOXTHA-TUSQITKMSA-N Trp-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CC(C)C)C(O)=O)=CNC2=C1 YTZYHKOSHOXTHA-TUSQITKMSA-N 0.000 description 1
- YPBYQWFZAAQMGW-XIRDDKMYSA-N Trp-Lys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N YPBYQWFZAAQMGW-XIRDDKMYSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- RIKLKPANMFNREP-FDARSICLSA-N Trp-Met-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 RIKLKPANMFNREP-FDARSICLSA-N 0.000 description 1
- VUMCLPHXCBIJJB-PMVMPFDFSA-N Trp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N VUMCLPHXCBIJJB-PMVMPFDFSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 1
- WTRQBSSQBKRNKV-MNSWYVGCSA-N Trp-Thr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 WTRQBSSQBKRNKV-MNSWYVGCSA-N 0.000 description 1
- RQKMZXSRILVOQZ-GMVOTWDCSA-N Trp-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RQKMZXSRILVOQZ-GMVOTWDCSA-N 0.000 description 1
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
- IEESWNWYUOETOT-BVSLBCMMSA-N Trp-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccccc1)C(O)=O IEESWNWYUOETOT-BVSLBCMMSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- LFCQXIXJQXWZJI-BZSNNMDCSA-N Tyr-His-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O LFCQXIXJQXWZJI-BZSNNMDCSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
- XYNFFTNEQDWZNY-ULQDDVLXSA-N Tyr-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N XYNFFTNEQDWZNY-ULQDDVLXSA-N 0.000 description 1
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- FRMFMFNMGQGMNB-BVSLBCMMSA-N Tyr-Pro-Trp Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FRMFMFNMGQGMNB-BVSLBCMMSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- 101150050575 URA3 gene Proteins 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 1
- DPDMMXDBJGCCQC-UHFFFAOYSA-N [Na].[Cl] Chemical compound [Na].[Cl] DPDMMXDBJGCCQC-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 150000001335 aliphatic alkanes Chemical class 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 229920006318 anionic polymer Polymers 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 125000004429 atom Chemical group 0.000 description 1
- 208000010668 atopic eczema Diseases 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000023555 blood coagulation Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 229910052792 caesium Inorganic materials 0.000 description 1
- TVFDJXOCXUVLDH-UHFFFAOYSA-N caesium atom Chemical compound [Cs] TVFDJXOCXUVLDH-UHFFFAOYSA-N 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 230000006652 catabolic pathway Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000005660 chlorination reaction Methods 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- SECPZKHBENQXJG-UHFFFAOYSA-N cis-palmitoleic acid Natural products CCCCCCC=CCCCCCCCC(O)=O SECPZKHBENQXJG-UHFFFAOYSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- GUJOJGAPFQRJSV-UHFFFAOYSA-N dialuminum;dioxosilane;oxygen(2-);hydrate Chemical compound O.[O-2].[O-2].[O-2].[Al+3].[Al+3].O=[Si]=O.O=[Si]=O.O=[Si]=O.O=[Si]=O GUJOJGAPFQRJSV-UHFFFAOYSA-N 0.000 description 1
- SWSQBOPZIKWTGO-UHFFFAOYSA-N dimethylaminoamidine Natural products CN(C)C(N)=N SWSQBOPZIKWTGO-UHFFFAOYSA-N 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000008157 edible vegetable oil Substances 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- KAQKFAOMNZTLHT-VVUHWYTRSA-N epoprostenol Chemical compound O1C(=CCCCC(O)=O)C[C@@H]2[C@@H](/C=C/[C@@H](O)CCCCC)[C@H](O)C[C@@H]21 KAQKFAOMNZTLHT-VVUHWYTRSA-N 0.000 description 1
- 229960001123 epoprostenol Drugs 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 150000002190 fatty acyls Chemical group 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000007407 health benefit Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- YQYJSBFKSSDGFO-FWAVGLHBSA-N hygromycin A Chemical compound O[C@H]1[C@H](O)[C@H](C(=O)C)O[C@@H]1Oc1ccc(\C=C(/C)C(=O)N[C@@H]2[C@@H]([C@H]3OCO[C@H]3[C@@H](O)[C@@H]2O)O)cc1O YQYJSBFKSSDGFO-FWAVGLHBSA-N 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 229910017053 inorganic salt Inorganic materials 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 235000020667 long-chain omega-3 fatty acid Nutrition 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 239000002075 main ingredient Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 235000021281 monounsaturated fatty acids Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 235000021290 n-3 DPA Nutrition 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 235000020939 nutritional additive Nutrition 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- RQFLGKYCYMMRMC-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O.CCCCCCCCCCCCCCCCCC(O)=O RQFLGKYCYMMRMC-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- FKCRAVPPBFWEJD-XVFCMESISA-N orotidine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1C(O)=O FKCRAVPPBFWEJD-XVFCMESISA-N 0.000 description 1
- FKCRAVPPBFWEJD-UHFFFAOYSA-N orotidine Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1C(O)=O FKCRAVPPBFWEJD-UHFFFAOYSA-N 0.000 description 1
- 210000004332 phalangeal cell Anatomy 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 230000007096 poisonous effect Effects 0.000 description 1
- 239000003495 polar organic solvent Substances 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000004175 ponceau 4R Substances 0.000 description 1
- 229960004839 potassium iodide Drugs 0.000 description 1
- 235000007715 potassium iodide Nutrition 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000001273 protein sequence alignment Methods 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 230000001185 psoriatic effect Effects 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- FOGKDYADEBOSPL-UHFFFAOYSA-M rubidium(1+);acetate Chemical compound [Rb+].CC([O-])=O FOGKDYADEBOSPL-UHFFFAOYSA-M 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000003375 selectivity assay Methods 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- AWUCVROLDVIAJX-GSVOUGTGSA-N sn-glycerol 3-phosphate Chemical compound OC[C@@H](O)COP(O)(O)=O AWUCVROLDVIAJX-GSVOUGTGSA-N 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- BAZAXWOYCMUHIX-UHFFFAOYSA-M sodium perchlorate Chemical compound [Na+].[O-]Cl(=O)(=O)=O BAZAXWOYCMUHIX-UHFFFAOYSA-M 0.000 description 1
- 229910001488 sodium perchlorate Inorganic materials 0.000 description 1
- VGTPCRGMBIAPIM-UHFFFAOYSA-M sodium thiocyanate Chemical compound [Na+].[S-]C#N VGTPCRGMBIAPIM-UHFFFAOYSA-M 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- RYYVLZVUVIJVGH-UHFFFAOYSA-N trimethylxanthine Natural products CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010012567 tyrosyl-glycyl-glycyl-phenylalanyl Proteins 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 238000001291 vacuum drying Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
- C12P7/6427—Polyunsaturated fatty acids [PUFA], i.e. having two or more double bonds in their backbone
- C12P7/6432—Eicosapentaenoic acids [EPA]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8247—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0083—Miscellaneous (1.14.99)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0083—Miscellaneous (1.14.99)
- C12N9/0087—Steroid 21-monooxygenase (1.14.99.10)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6472—Glycerides containing polyunsaturated fatty acid [PUFA] residues, i.e. having two or more double bonds in their backbone
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Nutrition Science (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Fats And Perfumes (AREA)
Abstract
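The present invention relates to Δ17 desaturases, which have the ability to convert ω-6 fatty acids into their ω-3 counterparts (i.e., conversion of arachidonic acid [20:4, ARA] to eicosapentaenoic acid [20:5, EPA]). Isolated nucleic acid fragments encoding Δ17 desaturases and recombinant constructs comprising such fragments are disclosed, along with methods of making long-chain polyunsaturated fatty acids (PUFAs) using these Δ17 desaturases in plants and oleaginous yeast.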
Description
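This application claims the benefit of U.S. Provisional Patent Application No. 60/793575, filed April 20, 2006.
Field of the Invention
This invention is in the field of biotechnology. More specifically, it pertains to the identification of nucleic acid fragments encoding Δ17 fatty acid desaturases and to the use of these desaturases in making long-chain polyunsaturated fatty acids (PUFAs).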
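Background of the Invention
The importance of PUFAs is undisputed. For example, certain PUFAs are important biological components of healthy cells and are recognized as: "essential" fatty acids that cannot be synthesized de novo in mammals and instead must be obtained either in the diet or by further desaturation and elongation of linoleic acid (LA; 18:2 ω-6) or α-linolenic acid (ALA; 18:3 ω-3); constituents of the plasma membranes of cells, where they may be found in such forms as phospholipids or triacylglycerols; necessary for proper development (particularly in the developing infant brain) and for tissue formation and repair; and precursors to several biologically active eicosanoids of importance in mammals (e.g., prostacyclins, eicosanoids, leukotrienes, prostaglandins). Additionally, a high intake of long-chain ω-3 PUFAs produces cardiovascular protective effects (Dyerberg, J. et al., Amer. J. Clin. Nutr., 28:958-966 (1975); Dyerberg, J. et al., Lancet, 2(8081):117-119 (July 15, 1978); Shimokawa, H., World Rev. Nutr. Diet, 88:100-108 (2001); von Schacky, C. and Dyerberg, J., World Rev. Nutr. Diet, 88:90-99 (2001)). Moreover, numerous other studies document wide-ranging health benefits conferred by administration of ω-3 and/or ω-6 PUFAs against a variety of symptoms and diseases (e.g., asthma, psoriasis, eczema, diabetes, cancer).
A variety of different hosts, including plants, algae, fungi and yeast, are being investigated as means for commercial PUFA production. Genetic engineering has demonstrated that the natural abilities of some hosts (even those natively limited to LA and ALA fatty acid production) can be substantially altered to result in high-level production of arachidonic acid (ARA; 20:4 ω-6), eicosapentaenoic acid (EPA; 20:5 ω-3) and docosahexaenoic acid (DHA; 22:6 ω-3). Whether ω-3/ω-6 PUFA production is the result of natural abilities or of recombinant technology, both strategies may require conversion of ω-6 PUFAs into their ω-3 counterparts. Specifically, a Δ15 desaturase is responsible for the conversion of LA to ALA, while a Δ17 desaturase is responsible for the conversion of ARA to EPA (although some Δ17 desaturases can also use dihomo-γ-linolenic acid (DGLA; 20:3 ω-6) as a substrate to produce eicosatetraenoic acid (ETA; 20:4 ω-3)). Both of these enzymes have a role in the following pathways: the Δ6 desaturase/Δ6 elongase pathway (which is predominantly found in algae, mosses, fungi, nematodes and humans, and which is characterized by the production of γ-linolenic acid (GLA; 18:3 ω-6) and/or stearidonic acid (STA; 18:4 ω-3)) and the Δ9 elongase/Δ8 desaturase pathway (which operates in some organisms, such as euglenoid species, and which is characterized by the production of eicosadienoic acid (EDA; 20:2 ω-6) and/or eicosatrienoic acid (ETrA; 20:3 ω-3)) (FIG. 1).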
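The ω-6 to ω-3 conversions named in the preceding paragraph can be summarized as a small lookup table. The following is only an illustrative sketch: the dictionary layout, the enzyme labels, and the function name `omega3_counterpart` are assumptions made for this example and are not part of the disclosure.

```python
# Illustrative mapping of the omega-6 -> omega-3 conversions described above.
# Keys are omega-6 substrates; values are (enzyme, omega-3 product).
# The structure and names here are hypothetical, chosen for this sketch only.
OMEGA3_CONVERSIONS = {
    "LA": ("delta-15 desaturase", "ALA"),     # linoleic acid -> alpha-linolenic acid
    "ARA": ("delta-17 desaturase", "EPA"),    # arachidonic acid -> eicosapentaenoic acid
    "DGLA": ("delta-17 desaturase", "ETA"),   # only some delta-17 desaturases accept DGLA
}


def omega3_counterpart(substrate):
    """Return (enzyme, product) for an omega-6 substrate, or None if not listed."""
    return OMEGA3_CONVERSIONS.get(substrate.upper())
```

For example, `omega3_counterpart("ARA")` returns the Δ17 desaturase entry, while a substrate that the paragraph does not pair with an ω-3 product, such as GLA, returns `None`.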
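Because Δ17 desaturases enable efficient synthesis of ω-3 fatty acids, considerable effort has gone into identifying and characterizing these enzymes from various sources. However, only a few Δ17 desaturases are presently known. Specifically, U.S. Patent Publication No. 2003/0190733 describes the isolation and characterization of a Δ17 desaturase from Saprolegnia diclina (see also GenBank Accession No. AY373823). PCT Publication No. WO 2005/083053 provides sequences for a Phytophthora infestans "ω3 desaturase" (see also GenBank Accession No. CAJ30870), while PCT Publication No. WO 2006/100241 provides sequences for a Phytophthora sojae "ω3 desaturase"; both appear to encode Δ17 desaturases. In addition, the commonly owned, co-pending application having Provisional Application No. 60/855177 discloses amino acid and nucleic acid sequences for a Δ17 desaturase from Pythium aphanidermatum. Thus, there is a need to identify and isolate additional genes encoding Δ17 desaturases that are suitable for heterologous expression in a variety of host organisms used for the production of ω-3 fatty acids.
Applicants have solved the stated problem by isolating genes encoding Δ17 desaturases from the oomycetes Phytophthora ramorum and Phytophthora sojae.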
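Summary of the Invention
The present invention relates to new genetic constructs encoding polypeptides having Δ17 desaturase activity, and to their use in plants and yeast for the production of PUFAs, particularly ω-3 fatty acids.
Accordingly, the invention provides an isolated nucleic acid molecule selected from the group consisting of:
a) an isolated nucleotide sequence encoding a Δ17 desaturase, selected from SEQ ID NO:3 and SEQ ID NO:7; or
b) an isolated nucleotide sequence that is completely complementary to (a).
In another embodiment, the invention provides genetic constructs of the invention that are codon-optimized for expression in Yarrowia sp. Additionally, the invention provides host cells transformed with the genetic constructs of the invention for the expression and production of ω-3 fatty acids.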
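In another embodiment, the invention provides a method for the production of eicosapentaenoic acid, the method comprising:
a) providing a host cell comprising:
i) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide, selected from the group consisting of:
1) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 90.9% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:2, based on the Clustal method of alignment;
2) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 91.4% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:4, based on the Clustal method of alignment; and
3) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 89.5% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:6, based on the Clustal method of alignment; and
ii) a source of arachidonic acid;
b) growing the host cell of step (a) under conditions wherein the nucleic acid molecule encoding the Δ17 desaturase polypeptide is expressed and the arachidonic acid is converted to eicosapentaenoic acid; and
c) optionally recovering the eicosapentaenoic acid of step (b).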
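The identity thresholds in the method above (90.9%, 91.4%, 89.5%) are defined relative to a Clustal alignment. As a rough illustration of what such a percentage measures, the sketch below computes percent identity for two sequences that have already been aligned. It does not perform the Clustal alignment itself, and the choice of denominator (all aligned columns except gap-gap columns) is an assumption of this example; alignment programs differ on this convention, so the numbers need not match those a Clustal implementation reports. The function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns holding identical residues.

    Both inputs must already be aligned (equal length, gaps marked with `gap`).
    Columns where both sequences have a gap are ignored; this denominator
    convention is an assumption made for this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a, aligned_b, threshold):
    """True if the pairwise identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

With toy inputs, `percent_identity("MKV-A", "MKVLA")` counts four identical columns out of five; a candidate polypeptide would be tested against the relevant threshold with, for example, `meets_threshold(candidate, reference, 90.9)`.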
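In an alternate embodiment, the invention provides a method for the production of eicosatetraenoic acid, the method comprising:
a) providing a host cell comprising:
i) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide, selected from the group consisting of:
1) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 90.9% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:2, based on the Clustal method of alignment;
2) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 91.4% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:4, based on the Clustal method of alignment; and
3) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 89.5% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:6, based on the Clustal method of alignment; and
ii) a source of dihomo-γ-linolenic acid;
b) growing the host cell of step (a) under conditions wherein the nucleic acid molecule encoding the Δ17 desaturase polypeptide is expressed and the dihomo-γ-linolenic acid is converted to eicosatetraenoic acid; and
c) optionally recovering the eicosatetraenoic acid of step (b).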
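In another embodiment, the invention provides a method for the production of α-linolenic acid, the method comprising:
a) providing a host cell comprising:
i) an isolated nucleotide molecule encoding a bifunctional Δ17 desaturase polypeptide, selected from the group consisting of:
1) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 90.9% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:2, based on the Clustal method of alignment;
2) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide having at least 91.4% identity when compared to a polypeptide having the amino acid sequence set forth in SEQ ID NO:4, based on the Clustal method of alignment; and
ii) a source of linoleic acid;
b) growing the host cell of step (a) under conditions wherein the nucleic acid molecule encoding the bifunctional Δ17 desaturase polypeptide is expressed and the linoleic acid is converted to α-linolenic acid; and
c) optionally recovering the α-linolenic acid of step (b).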
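Biological Deposits
The following biological material has been deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, and bears the following designation, accession number and date of deposit.
Biological Material | Accession No. | Date of Deposit |
Yarrowia lipolytica Y2047 | ATCC PTA-7186 | October 26, 2005 |
The biological material listed above was deposited under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. The listed deposit will be maintained in the indicated international depository for at least 30 years and will be made available to the public upon the grant of a patent disclosing it. The availability of a deposit does not constitute a license to practice the subject invention in derogation of patent rights granted by government action.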
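Brief Description of the Drawings and Sequence Listings
FIG. 1 illustrates the ω-3/ω-6 fatty acid biosynthetic pathway.
FIG. 2 shows a comparison of the DNA sequence of the Phytophthora sojae Δ17 desaturase gene (designated "PsD17"; SEQ ID NO:1) and the synthetic gene codon-optimized for expression in Yarrowia lipolytica (designated "PsD17S"; SEQ ID NO:3).
FIG. 3 provides plasmid maps for the following: (A) pPsD17S; (B) pKUNF1-KEA; (C) pDMW214; and (D) pFmPsD17S.
FIG. 4A diagrams the development of Yarrowia lipolytica strain Y2047, which produces 11% ARA in the total lipid fraction. FIG. 4B provides a plasmid map for pKUNF12T6E, while FIG. 4C provides a plasmid map for pDMW271.
FIG. 5 provides a plasmid map for pZP3-P7U.
FIG. 6A diagrams the development of Yarrowia lipolytica strain Y4070, which produces 12% ARA in the total lipid fraction. FIG. 6B provides a plasmid map for pZKLeuN-29E3, while FIG. 6C provides a plasmid map for pY116.
FIG. 7 provides plasmid maps for the following: (A) pKO2UF8289; and (B) pZKSL-555R.
FIG. 8 shows a comparison of the DNA sequence of the Phytophthora ramorum Δ17 desaturase gene (designated "PrD17"; SEQ ID NO:5) and the synthetic gene codon-optimized for expression in Yarrowia lipolytica (designated "PrD17S"; SEQ ID NO:7).
FIG. 9 provides plasmid maps for the following: (A) pPrD17S; (B) pZUFPrD17S; and (C) pZUF17.
The invention can be more fully understood from the following detailed description and the accompanying sequence descriptions, which form a part of this application.
The following sequences comply with 37 C.F.R. §1.821-1.825 ("Requirements for Patent Applications Containing Nucleotide Sequences and/or Amino Acid Sequence Disclosures - the Sequence Rules") and are consistent with World Intellectual Property Organization (WIPO) Standard ST.25 (1998) and the sequence listing requirements of the EPO and PCT (Rules 5.2 and 49.5(a-bis), and Section 208 and Annex C of the Administrative Instructions). The symbols and format used for nucleotide and amino acid sequence data comply with the rules set forth in 37 C.F.R. §1.822.
SEQ ID NOs:1-40 are ORFs encoding genes or proteins, or plasmids, as identified in Table 1.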
表1
基因和蛋白质SEQ ID标号的概述
描述和缩写 | 核酸SEQ ID NO. | 蛋白质SEQ ID NO. |
大豆疫霉Δ17去饱和酶(“PsD17”) | 1(1092bp) | 2(363AA) |
来源于大豆疫霉、为在解脂耶氏酵母中表达而进行了密码子优化的合成Δ17去饱和酶(“PsD17S”) | 3(1086bp) | 4(361AA) |
栎树猝死病菌Δ17去饱和酶(“PrD17”) | 5(1086bp) | 6(361AA) |
来源于栎树猝死病菌、为在解脂耶氏酵母中表达而进行了密码子优化的合成Δ17去饱和酶(“PrD17S”) | 7(1086bp) | 6(361AA) |
致病疫霉Δ17去饱和酶(“PiD17”) | 8(1086bp) | 9(361AA) |
质粒pPsD17S | 10(3806bp) | -- |
质粒pKunF1-KEA | 11(6619bp) | -- |
质粒pDMW214 | 12(9513bp) | -- |
质粒pFmPsD17S | 13(8727bp) | -- |
质粒pKUNF12T6E | 14(12,649bp) | -- |
来源于金黄色破囊壶菌(Thraustochytrium aureum)(美国专利号6,677,145)、为在解脂耶氏酵母中表达而进行了密码子优化的合成C18/20延长酶(“EL2S”) | 15(819bp) | 16(272AA) |
质粒pDMW271 | 17(13,034bp) | -- |
来源于人(Homo sapiens)的合成Δ5去饱和酶(GenBank登录号NP_037534),其为在解脂耶氏酵母中表达而进行了密码子优化 | 18(1335bp) | 19(444AA) |
质粒pZP3-P7U | 20(8867bp) | -- |
大肠杆菌LoxP重组位点,由Cre重组酶识别 | 21(34bp) | -- |
质粒pZKLeuN-29E3 | 22(14,655bp) | -- |
纤细裸藻(Euglena gracilis)Δ9延长酶(美国专利申请号11/601563和11/601564)(“EgD9e”) | 31(777bp) | 24(258AA) |
来源于纤细裸藻的合成Δ9延长酶(美国专利申请号11/601563和11/601564),其为在解脂耶氏酵母中表达而进行了密码子优化(“EgD9eS”) | 23(777bp) | 24(258AA) |
来源于高山被孢霉(Mortierella alpina)ELO3的合成C16/18延长酶(美国专利申请号11/253882),其为在解脂耶氏酵母中表达而进行了密码子优化(“ME3S”) | 25(828bp) | 26(275AA) |
质粒pY116 | 27(8739bp) | -- |
质粒pKO2UF8289 | 28(15,304bp) | -- |
来源于纤细裸藻的合成突变Δ8去饱和酶(美国专利申请号11/635258)(“突变EgD8S-23”) | 29(1272bp) | 30(422AA) |
质粒pZKSL-555R | 32(13,707bp) | -- |
纤细裸藻Δ5去饱和酶(美国专利申请号60/801172)(“EgD5”) | 37(1350bp) | 34(449AA) |
来源于纤细裸藻(美国专利申请号60/801172)、为在解脂耶氏酵母中表达而进行了密码子优化的合成Δ5去饱和酶(“EgD5S”) | 33(1350bp) | 34(449AA) |
来源于多甲藻属(Peridinium sp.)CCMP626(美国专利申请号60/801119)、为在解脂耶氏酵母中表达而进行了密码子优化的合成Δ5去饱和酶(“RD5S”) | 35(1392bp) | 36(463AA) |
质粒pPrD17S | 38(3806bp) | -- |
质粒pZUF17 | 39(8165bp) | -- |
质粒pZUFPrD17S | 40(8174bp) | -- |
发明详述
本文援引的所有专利、专利申请和出版物都整体引入本文作为参考。这具体地包括以下申请人的代理人的共同未决申请:美国专利7,125,672、美国专利7,189,559、美国专利7,192,762、美国专利申请号10/840579和10/840325(2004年5月6日申请)、美国专利申请号10/869630(2004年6月16日申请)、美国专利申请号10/882760(2004年7月1日申请)、美国专利申请号10/985254和10/985691(2004年11月10日申请)、美国专利申请号10/987548(2004年11月12日申请)、美国专利申请号11/024545和11/024544(2004年12月29日申请)、美国专利申请号11/166993(2005年6月24日申请)、美国专利申请号11/183664(2005年7月8日申请)、美国专利申请号11/185301(2005年7月20日申请)、美国专利申请号11/190750(2005年7月27日申请)、美国专利申请号11/198975(2005年8月8日申请)、美国专利申请号11/225354(2005年9月13日申请)、美国专利申请号11/253882(2005年10月19日申请)、美国专利申请号11/264784和11/264737(2005年11月1日申请)、美国专利申请号11/265761(2005年11月2日申请)、美国专利申请号60/795810(2006年4月28日申请)、美国专利申请号60/793575(2006年4月20日申请)、美国专利申请号60/796637(2006年5月2日申请)、美国专利申请号60/801172和60/801119(2006年5月17日申请)、美国专利申请号60/853563(2006年10月23日申请)、美国专利申请号60/855177(2006年10月30日申请)、美国专利申请号11/601563和11/601564(2006年11月16日申请)、美国专利申请号11/635258(2006年12月7日申请)和美国专利申请号11/613420(2006年12月20日申请)。
按照本发明,申请人鉴定出新颖的卵菌门Δ17去饱和酶和编码该酶的基因,可使用该酶操控生成有益于健康的PUFA的生化途径。因此,本发明具有很多应用。通过本文公开的方法制备的PUFA或其衍生物可作为饮食代用品或添加剂,尤其是婴儿配方食品,用于进行静脉营养或者预防或治疗营养不良的患者。或者,纯化的PUFA(或其衍生物)可被掺入到食用油、脂肪或配制的人造黄油中,使得在正常使用时接受者会接受所需量的饮食补充。PUFA还可以被掺入到婴儿配方食品、营养添加剂或其它食物产品中,并可以用作抗炎或降低胆固醇的试剂。任选地,所述组合物可用于药学用途(人类的或兽医的)。
对人或动物补充通过重组方式生产的PUFA可以使所加入的PUFA及其代谢产物的水平增加。例如,用EPA治疗不仅可以使EPA水平提高,还可以提高EPA下游产物如类二十烷酸(即前列腺素、白三烯、凝血噁烷)的水平。复杂的调控机制可能使得需要组合多种PUFA或添加不同的PUFA缀合物,以便预防、控制或克服这样的机制,从而在个体中获得期望水平的特定PUFA。
定义
在本文公开内容中使用大量术语和缩写。提供以下的定义。
“可读框”缩写为ORF。
“聚合酶链式反应”缩写为PCR。
“美国典型培养物保藏中心”缩写为ATCC。
“多不饱和脂肪酸”缩写为PUFA。
“三酰甘油”缩写为TAG。
本文使用的术语“发明”或“本发明”是指如在本文的权利要求书和说明书中描述的本发明的所有方面和实施方案,并且不应被解读为限于任何具体的实施方案或方面。
术语“脂肪酸”是指可变链长的长链脂肪族酸(烷酸),其长度从约C12至C22(尽管已知有更长及更短链长的酸)。多数链长在C16和C22之间。脂肪酸的结构可以通过一个简单的符号体系“X:Y”来表示,其中X是特定脂肪酸中碳(C)原子的总数,而Y是双键的数目。涉及“饱和脂肪酸”对“不饱和脂肪酸”、“单不饱和脂肪酸”对“多不饱和脂肪酸”(或“PUFA”)以及“ω-6脂肪酸”(ω-6或n-6)对“ω-3脂肪酸”(ω-3或n-3)之间差异的其它细节提供于PCT公布号WO2004/101757。
在本文公开内容中用于描述PUFA的命名法示于以下表2。在标题为“简化符号”的列中,使用ω参照系统来指明碳原子数、双键数以及由ω碳(为此该碳被编号为1)计数最接近ω碳的双键位置。表中其余部分总结了ω-3与ω-6脂肪酸及其前体的常用名、在说明书全文中将使用的缩写及每种化合物的化学名。
表2
多不饱和脂肪酸和前体的命名
常用名 | 缩写 | 化学名 | 简化符号 |
肉豆蔻酸 | -- | 十四酸 | 14:0 |
棕榈酸 | 棕榈酸 | 十六酸 | 16:0 |
棕榈油酸 | -- | 9-十六碳烯酸 | 16:1 |
硬脂酸 | -- | 十八酸 | 18:0 |
油酸 | -- | 顺式-9-十八烯酸 | 18:1 |
亚油酸 | LA | 顺式-9,12-十八碳二烯酸 | 18:2ω-6 |
γ-亚麻酸 | GLA | 顺式-6,9,12-十八碳三烯酸 | 18:3ω-6 |
二十碳二烯酸 | EDA | 顺式-11,14-二十碳二烯酸 | 20:2ω-6 |
二均-γ-亚麻酸 | DGLA | 顺式-8,11,14-二十碳三烯酸 | 20:3ω-6 |
花生四烯酸 | ARA | 顺式-5,8,11,14-二十碳四烯酸 | 20:4ω-6 |
α-亚麻酸 | ALA | 顺式-9,12,15-十八碳三烯酸 | 18:3ω-3 |
十八碳四烯酸 | STA | 顺式-6,9,12,15-十八碳四烯酸 | 18:4ω-3 |
二十碳三烯酸 | ETrA | 顺式-11,14,17-二十碳三烯酸 | 20:3ω-3 |
二十碳四烯酸 | ETA | 顺式-8,11,14,17-二十碳四烯酸 | 20:4ω-3 |
二十碳五烯酸 | EPA | 顺式-5,8,11,14,17-二十碳五烯酸 | 20:5ω-3 |
二十二碳五烯酸 | DPA | 顺式-7,10,13,16,19-二十二碳五烯酸 | 22:5ω-3 |
二十二碳六烯酸 | DHA | 顺式-4,7,10,13,16,19-二十二碳六烯酸 | 22:6ω-3 |
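下面给出一个最小示意,说明如何解析表2中的“简化符号”,并由ω位置推算从羧基末端计数的双键位置(Δ位置)。该示意假设双键为常见的亚甲基间隔排列(相邻双键相隔3个碳),这一假设以及函数名`parse_shorthand`、`delta_positions`均为说明而设,并非原文内容。

```python
import re
from typing import List, NamedTuple, Optional


class FattyAcid(NamedTuple):
    carbons: int          # X:碳原子总数
    double_bonds: int     # Y:双键数目
    omega: Optional[int]  # 最接近ω碳的双键位置;无ω标注时为None


def parse_shorthand(text: str) -> FattyAcid:
    """解析形如 '20:5ω-3' 或 '16:0' 的简化符号。"""
    match = re.fullmatch(r"(\d+):(\d+)(?:ω-(\d+))?", text.strip())
    if match is None:
        raise ValueError(f"无法解析的简化符号: {text!r}")
    carbons, bonds, omega = match.groups()
    return FattyAcid(int(carbons), int(bonds), int(omega) if omega else None)


def delta_positions(fa: FattyAcid) -> List[int]:
    """假设双键为亚甲基间隔排列,返回从羧基末端计数的双键位置。"""
    if fa.omega is None:
        raise ValueError("缺少ω标注,无法推算双键位置")
    last = fa.carbons - fa.omega  # 离羧基最远的双键位置
    return sorted(last - 3 * i for i in range(fa.double_bonds))


if __name__ == "__main__":
    epa = parse_shorthand("20:5ω-3")
    print(delta_positions(epa))  # 与表2中EPA的化学名相对照
```

按此假设算出的位置可与表2“化学名”一列逐项核对,例如EPA的5,8,11,14,17。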
术语“三酰甘油”、“油”和“TAG”是指由3个脂肪酰基残基和其酯化的甘油分子组成的中性脂质(这些术语在本公开内容全文中可交互使用)。这些油可含有长链PUFA以及较短的饱和与不饱和脂肪酸和较长链的饱和脂肪酸。因此,“油生物合成”一般是指细胞中的TAG合成。“微生物油”或“单细胞油”是由微生物在其存活期间自然产生的那些油。
“总脂质和油质级分中的PUFA百分率(%)”是指在那些级分中PUFA相对于总脂肪酸的百分率。术语“总脂质级分”或“脂质级分”均是指产油生物中所有脂质(即中性的和极性的)的总和,因此包括位于磷脂酰胆碱(PC)级分、磷脂酰乙醇胺(PE)级分和三酰甘油(TAG或油质)级分中的那些脂质。然而,术语“脂质”和“油质”在本说明书全文中可交互使用。
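“PUFA相对于总脂肪酸的百分率”可以用如下示意来计算。这里假设输入是各脂肪酸的定量值(例如GC峰面积),函数名`fatty_acid_percentages`和示例数据均为假设,仅用于说明算法。

```python
from typing import Dict


def fatty_acid_percentages(amounts: Dict[str, float]) -> Dict[str, float]:
    """将各脂肪酸的定量值换算为占总脂肪酸的百分率。"""
    total = sum(amounts.values())
    if total <= 0:
        raise ValueError("总脂肪酸量必须为正数")
    return {name: 100.0 * value / total for name, value in amounts.items()}


if __name__ == "__main__":
    # 假设的示例数据,并非原文实验结果
    example = {"16:0": 20.0, "18:1": 30.0, "LA": 39.0, "ARA": 11.0}
    for name, pct in fatty_acid_percentages(example).items():
        print(f"{name}: {pct:.1f}%")
```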
“代谢途径”或“生物合成途径”在生物化学意义上可被看作是在细胞中发生、由酶催化的一系列化学反应,以实现由细胞使用或储存的代谢产物的形成,或启动另一个代谢途径(因而叫做流动产生步骤)。这些途径有许多是非常精细的,包括初始物质的逐步修饰,以将其定型为具有所需的精确化学结构的产物的步骤。
术语“PUFA生物合成途径”是指将油酸转变为LA、EDA、GLA、DGLA、ARA、ALA、STA、ETrA、ETA、EPA、DPA和DHA的代谢过程。该过程在文献中已被充分描述(例如参见PCT公布号WO2006/052870)。简而言之,该过程包括经由在内质网膜中存在的一系列专用的去饱和酶和延长酶(即“PUFA生物合成途径酶”)通过添加碳原子延长碳链和通过添加双键去饱和该分子。更具体地说,“PUFA生物合成途径酶”是指与PUFA的生物合成相关的下列酶中的任一种(和编码所述酶的基因),包括:Δ4去饱和酶、Δ5去饱和酶、Δ6去饱和酶、Δ12去饱和酶、Δ15去饱和酶、Δ17去饱和酶、Δ9去饱和酶、Δ8去饱和酶、Δ9延长酶、C14/16延长酶、C16/18延长酶、C18/20延长酶和/或C20/22延长酶。
术语“ω-3/ω-6脂肪酸生物合成途径”指的是一系列基因,当它们在适当条件下表达时,所编码的酶催化ω-3及ω-6这两种脂肪酸的产生。典型地,涉及ω-3/ω-6脂肪酸生物合成途径的基因编码PUFA生物合成途径酶。代表性的途径如图1所示,提供了油酸通过多种中间体向DHA的转变,该图展示了如何可以从普通来源生产ω-3和ω-6这两种脂肪酸。该途径被自然分成两部分,其中一部分产生ω-3脂肪酸,另一部分仅产生ω-6脂肪酸。仅产生ω-3脂肪酸的那部分在本文中被称为ω-3脂肪酸生物合成途径,而仅产生ω-6脂肪酸的那部分在本文中被称为ω-6脂肪酸生物合成途径。
本文在ω-3/ω-6脂肪酸生物合成途径背景中使用的术语“功能性的”是指在途径中的部分(或全部)基因表达活性酶,产生体内催化或底物转变。应当理解的是,“ω-3/ω-6脂肪酸生物合成途径”或“功能性ω-3/ω-6脂肪酸生物合成途径”并不暗示以上段落中列出的所有基因都是必需的,因为很多脂肪酸产物仅需要该途径中的部分基因表达。
术语“去饱和酶”是指可以在一种或多种脂肪酸中去饱和(即引入双键)而生成目标脂肪酸或前体的多肽。尽管在本说明书全文中使用ω-参照系统指示特定脂肪酸,但使用Δ-系统从底物的羧基末端计数指示去饱和酶活性更为便利。本文中令人关注的是:1)Δ8去饱和酶,其催化EDA转变为DGLA和/或ETrA转变为ETA;2)Δ5去饱和酶,其催化DGLA转变为ARA和/或ETA转变为EPA;3)Δ6去饱和酶,其催化LA转变为GLA和/或ALA转变为STA;4)Δ4去饱和酶,其催化DPA转变为DHA;5)Δ12去饱和酶,其催化油酸转变为LA;6)Δ15去饱和酶,其催化LA转变为ALA和/或GLA转变为STA;和7)Δ9去饱和酶,其催化棕榈酸转变为棕榈油酸(16:1)和/或硬脂酸转变为油酸(18:1)。
在本文中尤其令人感兴趣的是Δ17去饱和酶,其去饱和由分子的羧基末端编号的第17个和第18个碳原子之间的脂肪酸,并例如催化ARA转变为EPA(和任选地DGLA转变为ETA)。在本领域中,Δ17去饱和酶(还有Δ15去饱和酶)基于其将ω-6脂肪酸转变为其ω-3对应物的能力(例如分别为LA转变成ALA,或DGLA转变成ETA,以及ARA转变成EPA),有时也被称为“Ω-3去饱和酶”、“w-3去饱和酶”和/或“ω-3去饱和酶”。
某些去饱和酶对两种或更多种底物具有活性。基于该能力,针对这些酶的去饱和酶活性还可以将这些酶分类为“单功能的”或“双功能的”。在某些实施方案中,最理想的是如下凭经验确定脂肪酸去饱和酶的特异性:用脂肪酸去饱和酶的基因转化适宜的宿主,并确定其对宿主脂肪酸谱的影响。
更具体地说,Δ17去饱和酶在本文被定义为具有单功能或双功能的Δ17去饱和酶活性的那些脂肪酸去饱和酶,其中Δ17去饱和酶活性是ARA转变为EPA和/或DGLA转变为ETA。术语“单功能Δ17去饱和酶”、“单功能Δ17去饱和酶活性”或“独有的Δ17去饱和酶活性”是指能够将ARA转变为EPA和/或DGLA转变为ETA但不能将LA转变为ALA的Δ17去饱和酶。相对应地,“双功能Δ17去饱和酶”、“双功能Δ17去饱和酶活性”或“主要的Δ17去饱和酶活性”是指优先将ARA转变为EPA和/或DGLA转变为ETA但额外具有将LA转变为ALA的有限能力的Δ17去饱和酶(由此表现出主要的Δ17去饱和酶活性和有限的Δ15去饱和酶活性)。
应当指出的是,Δ17去饱和酶可具有Δ17和Δ15去饱和以外的、与本分类无关的特异性。还应当指出的是,单功能和双功能的Δ17去饱和酶之间的区别是实际应用上的区别,而不是绝对的;例如,同一种酶可能因其表达水平或生长条件的不同而表现出单功能或双功能的Δ17去饱和酶活性。
对本文目的来说,术语“PsD17”是指分离自大豆疫霉、由SEQ ID NO:1编码的Δ17去饱和酶(SEQ ID NO:2)。相对应地,术语“PsD17S”是指来源于大豆疫霉的合成的Δ17去饱和酶,其为了在解脂耶氏酵母中表达而进行了密码子优化(即SEQ ID NO:3和4)。基于本文所述的分析,PsD17和PsD17S被进一步分类为双功能Δ17去饱和酶。
同样,术语“PrD17”是指分离自栎树猝死病菌、由SEQ ID NO:5编码的Δ17去饱和酶(SEQ ID NO:6)。相对应地,术语“PrD17S”是指来源于栎树猝死病菌的合成的Δ17去饱和酶,其为了在解脂耶氏酵母中表达而进行了密码子优化(即SEQ ID NO:7和6)。基于本文所述的分析,PrD17和PrD17S被进一步分类为单功能Δ17去饱和酶。
术语“转化率”和“底物转化百分率”是指特定酶(例如去饱和酶)可将底物转化为产物的效率。转化率根据下式计算:([产物]/[底物+产物])*100,其中“产物”包括中间产物以及在途径中由中间产物衍生的所有产物。
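上述转化率公式可以用如下示意来表达。其中“产物”按定义包括直接产物以及途径中由其衍生的所有下游产物;函数名`conversion_efficiency`和示例数值为假设,仅用于演示公式本身。

```python
from typing import Iterable


def conversion_efficiency(substrate: float, product: float,
                          downstream: Iterable[float] = ()) -> float:
    """按 ([产物]/[底物+产物])*100 计算底物转化百分率。

    substrate:  剩余底物的量(例如占总脂肪酸的百分率)
    product:    直接产物的量
    downstream: 途径中由该产物衍生的各下游产物的量
    """
    total_product = product + sum(downstream)
    denominator = substrate + total_product
    if denominator == 0:
        raise ValueError("底物与产物之和不能为零")
    return 100.0 * total_product / denominator


if __name__ == "__main__":
    # 假设的数值:剩余底物6.0,直接产物3.0,下游产物1.0
    print(conversion_efficiency(6.0, 3.0, [1.0]))
```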
术语“延长酶”是指能够延长脂肪酸碳链以产生比延长酶所作用的脂肪酸底物长两个碳的酸的多肽。此延长过程是同脂肪酸合成酶相关的多步机制,描述于PCT公布号WO2004/101757。由延长酶系统催化的反应实例是GLA转变为DGLA、STA转变为ETA和EPA转变为DPA。一般而言,延长酶的底物选择性略微宽广,但以链长以及不饱和的程度和类型这二者来区分。例如,C14/16延长酶利用C14底物(例如肉豆蔻酸),C16/18延长酶利用C16底物(例如棕榈酸),C18/20延长酶(也称为Δ6延长酶,这些术语可交互使用)利用C18底物(例如GLA、STA),C20/22延长酶利用C20底物(例如EPA)。以类似的方式,Δ9延长酶能够催化LA和ALA分别向EDA和ETrA转变。要重点指出的是,某些延长酶具有广泛的特异性,因此单个酶能够催化几种延长酶反应(例如由此同时用作C16/18延长酶和C18/20延长酶)。
术语“卵菌”是指一组通常称为水霉和霜霉的异养生物。它们是丝状原生生物,必须由周围的水或土壤吸收它们的食物,或者可以侵入另一种生物的机体来进食。因此,卵菌在降解和再循环腐烂物质方面起重要作用。疫霉(Phytophthora)是一种卵菌属,包含59个不同物种。尽管卵菌通过趋同进化与真菌类似,但它们不是真菌(先前认为是);相反,卵菌是原生藻菌界的组成部分,由此与植物、真菌和动物不同。硅藻属以及金褐藻和褐藻(例如海藻)也包括在原生藻菌界中。
术语“油脂(的)(oleaginous)”是指那些倾向于以脂质形式储备它们的能量源的生物(Weete,载于:Fungal Lipid Biochemistry,第2版,Plenum,1980)。术语“油脂酵母(oleaginous yeast)”是指那些被分类为酵母的可产油微生物。通常,油脂微生物的细胞油质或TAG含量符合S形曲线,其中脂质浓度升高,至对数期晚期或生长稳定期早期达到最大,然后在稳定期晚期和死亡期当中逐渐降低(Yongmanitchai和Ward,Appl.Environ.Microbiol.,57:419-25(1991))。累积油超过其干细胞重量约25%对油脂微生物并不罕见。油脂酵母的实例包括但不限于以下属:耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)和油脂酵母属(Lipomyces)。
本文所用的“分离的核酸片段”或“分离的核酸分子”可交互使用,是指为单或双链形式并任选地含有合成、非天然或改变的核苷酸碱基的RNA或DNA聚合体。DNA聚合体形式的分离核酸片段可包括cDNA、基因组DNA或合成DNA的一个或多个区段。
当单链形式的核酸片段可在合适的温度和溶液离子强度条件下与另一个核酸片段(例如cDNA、基因组DNA或RNA分子)退火时,该核酸片段与所述另一个核酸片段是“可杂交的”。杂交和洗涤条件是众所周知的,例举于Sambrook,J.,Fritsch,E.F.和Maniatis,T.Molecular Cloning:A Laboratory Manual,第2版,Cold Spring HarborLaboratory:Cold Spring Harbor,NY(1989),尤其是第11章和其中的表11.1(完整引入本文作为参考)。温度和离子强度的条件决定了杂交的“严格性”。可调整严格条件,以筛选中度相似片段(诸如远缘相关生物的同源序列)至高度相似片段(诸如由近缘相关生物复制功能酶的基因)。杂交后洗涤决定严格条件。一组优选条件使用一系列洗涤:首先于室温用6X SSC,0.5%SDS洗涤15分钟,接着于45℃用2X SSC,0.5%SDS重复洗涤30分钟,再重复2次于50℃用0.2X SSC,0.5%SDS洗涤30分钟。更优选的严格条件组使用更高温度,其中洗涤除了最后两次用0.2X SSC,0.5%SDS洗涤30分钟的温度提高为60℃外,其它条件与上述一致。另一组优选的高度严格条件使用的最后两次洗涤是于65℃用0.1X SSC,0.1%SDS进行。另一组严格条件包括例如于65℃在0.1X SSC,0.1%SDS中杂交,用2X SSC,0.1%SDS洗涤,接着用0.1X SSC,0.1%SDS洗涤。
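上文第一组优选的杂交后洗涤条件可以整理成结构化数据,便于比较不同严格性。以下示意采用通用的换算关系1X SSC含0.15 M NaCl(该换算属一般实验室常识,并非原文给出);类名`Wash`和函数名`preferred_washes`为假设,室温以`None`表示。

```python
from dataclasses import dataclass
from typing import List, Optional

NACL_MOLAR_PER_1X_SSC = 0.15  # 通用换算,非原文内容


@dataclass(frozen=True)
class Wash:
    ssc_x: float             # SSC倍数,例如0.2表示0.2X SSC
    sds_percent: float       # SDS浓度(%)
    temp_c: Optional[float]  # 洗涤温度(℃);None表示室温
    minutes: int             # 每次洗涤时长
    repeats: int = 1         # 洗涤次数

    @property
    def nacl_molar(self) -> float:
        return self.ssc_x * NACL_MOLAR_PER_1X_SSC


def preferred_washes(final_temp_c: float = 50.0) -> List[Wash]:
    """第一组优选条件;把final_temp_c设为60即对应“更优选”的条件组。"""
    return [
        Wash(6.0, 0.5, None, 15),
        Wash(2.0, 0.5, 45.0, 30),
        Wash(0.2, 0.5, final_temp_c, 30, repeats=2),
    ]


if __name__ == "__main__":
    for wash in preferred_washes(60.0):
        print(wash, f"NaCl约{wash.nacl_molar:.3f} M")
```

盐浓度越低、温度越高,洗涤的严格性越高,这与正文的描述一致。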
杂交要求两个核酸含有互补序列,但根据杂交严格性,碱基间有可能错配。适用于杂交核酸的严格性取决于核酸长度及互补程度这些本领域众所周知的变量。两个核苷酸序列之间的相似性或同源性的程度越高,具有这些序列的核酸杂合体的Tm值就越大。核酸杂交的相对稳定性(对应于较高的Tm)按下列顺序递减:RNA:RNA、DNA:RNA、DNA:DNA。对于长度大于100个核苷酸的杂合体,已得到计算Tm的公式(参见Sambrook等,出处同上,9.50-9.51)。对于与较短的核酸(即寡核苷酸)杂交,错配的位置变得更为重要,寡核苷酸的长度决定了它的特异性(参见Sambrook等,出处同上,11.7-11.8)。在一个实施方案中,可杂交核酸的长度至少为约10个核苷酸。优选地,可杂交核酸的最小长度为至少约15个核苷酸;更优选地为至少约20个核苷酸;最优选地长度为至少约30个核苷酸。此外,熟练技术人员知晓,必要时可以根据诸如探针长度等因素来调整温度及洗涤溶液盐浓度。
氨基酸或核苷酸序列的“实质部分(substantial portion)”是指包含多肽中的足够氨基酸序列或基因中的足够核苷酸序列的部分,以根据本领域技术人员的手工序列估算或使用诸如BLAST(基本局部比对检索工具;Altschul,S.F.等,J.Mol.Biol.215:403-410(1990))的算法通过计算机自动序列对比和鉴定来推测鉴定该多肽或基因。一般而言,为了推测鉴定与已知蛋白或基因同源的多肽或核酸序列,必需10个以上连续氨基酸或30个以上核苷酸的序列。此外,对于核苷酸序列,包含20-30个连续核苷酸的基因特异性寡核苷酸探针可用于序列依赖性基因鉴定(例如DNA杂交)和分离(例如细菌菌落或噬菌斑的原位杂交)方法。另外,12-15个碱基的短寡核苷酸可用作PCR扩增引物,以获得包含所述引物的特定核酸片段。因此,核苷酸序列的“实质部分”含有足以特异性鉴定和/或分离含该序列的核酸片段的序列。本说明书公开了编码特定卵菌蛋白的完整氨基酸序列和核苷酸序列。借助本文报道的这些序列,技术人员现在可使用公开序列的全部或实质部分用于本领域技术人员已知的用途。因此,本发明包含随附序列表中报告的完整序列,以及如上定义的那些序列的实质部分。
术语“互补的”用于描述能够相互杂交的核苷酸碱基间的关系。例如,关于DNA,腺嘌呤同胸腺嘧啶互补,胞嘧啶同鸟嘌呤互补。因此,本发明也包括与随附序列表中报告的完整序列互补的分离核酸片段,以及那些显著相似的核酸序列。
术语“同源性”和“同源的”在本文可互换使用。它们是指其中一个或多个核苷酸碱基的改变不影响核酸片段介导基因表达或产生某些表型的能力的核酸片段。这些术语也指本发明中核酸片段的修饰,譬如相对于初始的、未修饰的片段基本上未改变所产生核酸片段的功能特性的一个或多个核苷酸的缺失或插入。因此,要理解的是,正如本领域的技术人员所意识到的,本发明包含比具体例举序列更多的序列。
此外,技术人员公认,本发明所包含的同源核酸序列也由其在适度严格条件下(例如0.5X SSC,0.1%SDS,60℃)与本文例举的序列、或与本文公开的和在功能上等同于本文公开的任何核酸序列的核苷酸序列的任何一部分杂交的能力来定义。
“密码子简并性”是指在不改变编码多肽的氨基酸序列的情况下允许核苷酸序列变异的遗传密码的性质。技术人员很了解特定宿主细胞在指定给定氨基酸的核苷酸密码使用方面所表现出的“密码子偏倚”。因此,当合成用于在宿主细胞中改善表达的基因时,期望所设计的基因使其密码子使用频率接近宿主细胞中优选密码子使用的频率。
与DNA序列关联的“化学合成的”是指组成核苷酸在体外组装。DNA的人工化学合成可以使用已充分确定的方法来完成,或者可使用众多市售机器中的一种进行自动化学合成。“合成基因”可以由使用本领域技术人员已知的程序化学合成的寡核苷酸结构单元来组装。连接并退火这些结构单元,以形成基因区段,随后酶促组装这些基因区段,以构建完整基因。因此,基于反映宿主细胞的密码子偏倚的核苷酸序列的优化,可使基因适于最佳基因表达。技术人员知晓在密码子使用偏向于宿主偏爱的密码子的情况下成功进行基因表达的可能性。基于对来源于宿主细胞的、其中序列信息可用的基因的调查,可以确定优选密码子。
“基因”是指表达特定蛋白的核酸片段,其可指单独编码区,或者可包括编码序列之前(5′非编码序列)和之后(3′非编码序列)的调节序列。“天然基因”是指与其自身调节序列一起天然存在的基因。“嵌合基因”是指不是天然基因的任何基因,其包含非天然一起存在的调节和编码序列。因此,嵌合基因可包含来源于不同来源的调节序列和编码序列,或者来源于相同来源但以不同于天然存在形式的方式排列的调节序列和编码序列。“内源基因”是指在生物基因组中处于其天然位置的天然基因。“外源”基因是指通过基因转移导入宿主生物中的基因。外源基因可以包含插入非天然生物中的天然基因、导入天然宿主中的新位置的天然基因或嵌合基因。“转基因”是已通过转化步骤导入基因组中的基因。“密码子优化基因”是其密码子使用频率设计成模拟宿主细胞中的优选密码子使用频率的基因。
“编码序列”是指编码特定氨基酸序列的DNA序列。“合适的调控序列”是指位于编码序列上游(5′非编码序列)、当中或下游(3′非编码序列)并影响相关编码序列的转录、RNA加工或稳定性或翻译的核苷酸序列。调节序列可以包括启动子、翻译前导序列、内含子、多聚腺苷酸识别序列、RNA加工位点、效应子结合位点和茎环结构。
“启动子”是指能够控制编码序列或功能RNA表达的DNA序列。一般而言,编码序列位于启动子序列的3′。启动子可以整体来源于天然基因,或者由来源于天然存在的不同启动子的不同元件所组成,或者甚至含有合成的DNA区段。本领域的技术人员知道,不同的启动子可以在不同的组织或细胞类型中、或在发育的不同阶段、或响应于不同的环境或生理条件指导基因表达。使基因在大多数时刻在大多数细胞类型中表达的启动子通常称为“组成型启动子”。还要认识到,由于在大多数情况下调节序列的精确边界并未完全确定,因此不同长度的DNA片段可以具有相同的启动子活性。
术语“3′非编码序列”及“转录终止子”是指位于编码序列下游的DNA序列。它包括聚腺苷酸识别序列以及编码能够影响mRNA加工或基因表达的调节信号的其它序列。聚腺苷酸信号通常以影响聚腺苷酸序列段添加至mRNA前体的3′末端为特征。3′区可影响相关编码序列的转录、RNA加工或稳定性或翻译。
“RNA转录物”是指由RNA聚合酶催化的DNA序列转录产生的产物。当RNA转录物为DNA序列的完全互补拷贝时,其被称为初级转录物,或者其可为来源于初级转录物的转录后加工的RNA序列,被称为成熟RNA。“信使RNA”或“mRNA”是指没有内含子、可被细胞翻译成蛋白的RNA。“cDNA”是指来源于mRNA并与之互补的双链DNA。“有义”RNA是指包括mRNA的RNA转录物,所以可被细胞翻译为蛋白。“反义RNA”是指同全部或部分的目标初始转录物或者mRNA互补的RNA转录物,其阻断靶基因的表达(美国专利号5,107,065;PCT公布号WO99/28508)。反义RNA的互补物可以是特定基因转录物的任一部分,即处于5′非编码序列、3′非编码序列或编码序列。“功能RNA”是指反义RNA、核酶RNA或其它未被翻译但仍对细胞过程有影响的RNA。
术语“操作性连接(的)(operably linked)”是指单个核酸片段上核酸序列连接,使得一个序列的功能受另一个序列影响。例如,启动子与编码序列操作性连接或有效连接时,启动子能够影响编码序列的表达(即编码序列处于启动子的转录控制下)。编码序列可与调节序列以有义或反义方向有效连接或操作性连接。
本文所用的术语“表达”是指来源于本发明核酸片段的有义(mRNA)或反义RNA的转录和稳定累积。表达还可以指mRNA翻译为多肽。
“成熟”蛋白是指翻译后加工的多肽,即初级翻译产物中存在的任何前肽或原肽均已被除去的多肽。“前体”蛋白是指mRNA翻译的初级产物,即仍存在前肽和原肽。前肽和原肽可以是(但不限于)胞内定位信号。
“转化”是指将核酸分子转入宿主生物中,产生遗传学上稳定的遗传。核酸分子可以是例如自主复制或可整合到宿主生物的基因组中的质粒。含有转化核酸片段的宿主生物被称为“转基因”或“重组”或“转化”生物。
术语“质粒”、“载体”和“表达盒”是指通常携带不是细胞核心代谢部分的基因的染色体外元件,通常为环状双链DNA片段的形式。这类元件可以是自主复制序列、基因组整合序列、噬菌体或核苷酸序列、线性或环状的单链或双链DNA或RNA,其来自任何来源,其中许多核苷酸序列已连接或重组为独特的构建体,它能够将用于选定基因产物的启动子片段和DNA序列与合适的3′非翻译序列一起导入细胞中。“表达盒”是指含有外源基因并在外源基因之外还具有增强该基因在外源宿主中的表达的元件的特定载体。
如本领域中所知,术语“同一性百分率”是两个或更多个多肽序列或者两个或更多个多核苷酸序列之间的关系,该关系通过比较这些序列来确定。在本领域中,“同一性”也指多肽或多核苷酸序列之间的序列相关度,视情况由这些序列的字串之间的匹配程度来确定。“同一性”和“相似性”通过已知方法可容易地算出,包括但不限于描述于以下的方法:1)Computational Molecular Biology(Lesk,A.M.编辑)Oxford University:NY(1988);2)Biocomputing:Informatics and Genome Projects(Smith,D.W.编辑)Academic:NY(1993);3)Computer Analysis of Sequence Data,Part I(Griffin,A.M.和Griffin,H.G.编辑)Humana:NJ(1994);4)Sequence Analysis in Molecular Biology(von Heijne,G.编辑)Academic(1987);和5)Sequence Analysis Primer(Gribskov,M.和Devereux,J.编辑)Stockton:NY(1991)。
用于确定同一性的优选方法设计用于在检验序列之间产生最佳匹配。用于确定同一性和相似性的方法已编程为公开可获得的计算机程序。序列比对及同一性百分率的计算可以使用LASERGENE生物信息学计算套件(DNASTAR Inc.,Madison,WI)的MegAlign™程序进行。序列的多重比对使用“Clustal比对法”进行,该方法包含几个变通算法,包括对应于标识为Clustal V(描述于Higgins和Sharp,CABIOS.5:151-153(1989);Higgins,D.G.等,(1992)Comput.Appl.Biosci.8:189-191)并存在于生物信息学计算套件(DNASTAR Inc.,Madison,WI)的MegAlign程序中的比对法的“Clustal V比对法”。对于多重比对,默认值对应于空位罚分=10和空位长度罚分=10。成对比对和使用Clustal法计算蛋白序列的同一性百分率的默认参数为KTUPLE=1、空位罚分=3、比较窗=5和DIAGONALS SAVED=5。对于核酸,这些参数为KTUPLE=2、空位罚分=5、比较窗=4和DIAGONALS SAVED=4。在使用Clustal V程序比对序列后,有可能通过查看相同程序中的“序列距离”表获得“同一性百分率”。另外,可利用“Clustal W比对法”,其对应于标识为Clustal W(描述于Higgins和Sharp,CABIOS.5:151-153(1989);Higgins,D.G.等,(1992)Comput.Appl.Biosci.8:189-191)并存在于LASERGENE生物信息学计算套件(DNASTAR Inc.,Madison,WI)的MegAlign v6.1程序中的算法。用于多重比对的默认参数为:空位罚分=10,空位长度罚分=0.2,Delay Divergent Seqs(%)=30,DNA转变权重=0.5,蛋白质权重矩阵=Gonnet Series,DNA权重矩阵=IUB。在使用Clustal W程序比对序列后,有可能通过查看相同程序中的“序列距离”表获得“同一性百分率”。
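为说明“同一性百分率”的含义,下面给出一个针对已比对序列的简化计算示意。它并不是MegAlign或Clustal的实现;这里假设分母为两条序列均无空位的比对位置数,不同程序对分母的定义可能不同。函数名`percent_identity`为假设。

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """计算两条已比对(等长、含空位符)序列的同一性百分率。

    假设:仅统计两条序列都不是空位的位置。
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("已比对序列必须等长")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # 含空位的位置不计入分母
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("没有可比较的位置")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # 假设的短肽比对,仅用于演示
    print(percent_identity("MA-TL", "MAGTV"))
```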
“BLASTN比对法”是由国立生物技术信息中心(NCBI)提供的算法,使用默认参数比较核苷酸序列。
本领域技术人员会充分理解的是,许多序列同一性水平可用于鉴定来自其它物种的多肽,其中这些多肽具有相同或类似的功能或活性。合适的核酸片段(本发明的分离多核苷酸)编码与本文所报道的氨基酸序列具有至少约70%同一性、优选至少约75%同一性、更优选至少约80%同一性的多肽。优选的核酸片段编码与本文报道的氨基酸序列具有至少约85%同一性的氨基酸序列。更优选的核酸片段编码与本文报道的氨基酸序列具有至少约90%同一性的氨基酸序列。最优选的核酸片段编码与本文报道的氨基酸序列具有至少约95%同一性的氨基酸序列。实际上,70%至100%的任何整数氨基酸同一性均可用于描述本发明,例如71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%。合适的核酸片段不仅具有上述同源性,还典型地编码具有至少50个氨基酸、优选至少100个氨基酸、更优选至少150个氨基酸、再更优选至少200个氨基酸和最优选至少250个氨基酸的多肽。
术语“序列分析软件”是指用于分析核苷酸或氨基酸序列的任何计算机算法或软件程序。“序列分析软件”可通过商业渠道或独立研发获得。典型的序列分析软件包括但不限于:1)GCG程序套件(WisconsinPackage9.0版,Genetics Computer Group(GCG),Madison,WI);2)BLASTP、BLASTN、BLASTX(Altschul等,J.Mol.Biol.,215:403-410(1990));3)DNASTAR(DNASTAR,Inc.Madison,WI);4)Sequencher(Gene Codes Corporation,Ann Arbor,MI);和5)编入Smith-Waterman算法的FASTA程序(W.R.Pearson,Comput.Methods Genome Res.,[Proc.Int.Symp.](1994),Meeting Date1992,111-20.编辑:Suhai,Sandor.Plenum:New York,NY)。在本申请的上下文中,应当理解的是,除非另有说明,否则序列分析软件均被用于分析,分析结果基于所提及程序的“默认值”。本文所用的“默认值”指软件在首次初始化时原始加载的任何数值组或参数组。
本文所用的标准重组DNA和分子克隆技术在本领域众所周知,描述于Sambrook,J.,Fritsch,E.F.和Maniatis,T.,Molecular Cloning: A Laboratory Manual,第2版,Cold Spring Harbor Laboratory:Cold Spring Harbor,NY(1989)(后文称为“Maniatis”);Silhavy,T.J.,Berman,M.L.和Enquist,L.W.,Experiments with Gene Fusions,Cold Spring Harbor Laboratory:Cold Spring Harbor,NY(1984);和Ausubel,F.M.等,Current Protocols in Molecular Biology,由Greene Publishing Assoc.和Wiley-Interscience,Hoboken,NJ(1987)出版。
综述:脂肪酸和三酰甘油的微生物生物合成
一般而言,油脂微生物中的脂类累积响应于生长培养基中的总碳氮比而被触发。该过程导致在油脂微生物中从头合成游离棕榈酸(16:0),详述于PCT公布号WO2004/101757。棕榈酸是长链饱和及不饱和脂肪酸衍生物的前体,其通过延长酶和去饱和酶的作用形成(图1)。
TAG(脂肪酸的主要储存单位)通过一系列反应形成,这些反应包括:1)经由酰基转移酶将1个酰基-CoA分子酯化为甘油-3-磷酸,产生溶血磷脂酸;2)经由酰基转移酶酯化第二个酰基-CoA分子,产生1,2-二酰基甘油磷酸(通常鉴定为磷脂酸);3)通过磷脂酸磷酸酶除去磷酸,产生1,2-二酰基甘油(DAG);和4)通过酰基转移酶的作用加入第三种脂肪酸,形成TAG。可将广谱脂肪酸掺入TAG中,包括饱和及不饱和脂肪酸以及短链和长链脂肪酸。
Ω脂肪酸的生物合成
其中油酸被转变为ω-3/ω-6脂肪酸的代谢过程包括通过添加碳原子延长碳链和通过加入双键使分子去饱和。这需要一系列存在于内质网膜中的专用去饱和酶和延长酶。然而,如在图1中所见和下文所述,经常有多个替代途径来生产特定的ω-3/ω-6脂肪酸。
具体地说,所有途径都需要通过Δ12去饱和酶将油酸初始转变为第一种ω-6脂肪酸LA。然后,使用“Δ6去饱和酶/Δ6延长酶途径”,如下形成ω-6脂肪酸:(1)通过Δ6去饱和酶将LA转变为GLA;(2)通过C18/20延长酶将GLA转变为DGLA;和(3)通过Δ5去饱和酶将DGLA转变为ARA。或者,“Δ6去饱和酶/Δ6延长酶途径”可如下用于形成ω-3脂肪酸:(1)通过Δ15去饱和酶将LA转变为第一种ω-3脂肪酸ALA;(2)通过Δ6去饱和酶将ALA转变为STA;(3)通过C18/20延长酶将STA转变为ETA;(4)通过Δ5去饱和酶将ETA转变为EPA;(5)通过C20/22延长酶将EPA转变为DPA;和(6)通过Δ4去饱和酶将DPA转变为DHA。任选地,ω-6脂肪酸可被转变为ω-3脂肪酸;例如,通过Δ17去饱和酶活性分别由DGLA和ARA产生ETA和EPA。
用于生物合成ω-3/ω-6脂肪酸的替代途径利用Δ9延长酶和Δ8去饱和酶。更具体地说,可通过Δ9延长酶将LA和ALA分别转变为EDA和ETrA;然后,Δ8去饱和酶将EDA转变为DGLA,和/或将ETrA转变为ETA。
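上面两段描述的反应可以表示成一张有向图,从而枚举由某一底物到目标脂肪酸的所有酶促路线。以下是一个最小示意,仅收录这两段文字提到的反应;数据结构`REACTIONS`和函数名`all_routes`均为假设。

```python
from typing import List, Tuple

# (酶, 底物, 产物);仅列出上文两段提到的反应
REACTIONS = [
    ("Δ12去饱和酶", "油酸", "LA"),
    ("Δ6去饱和酶", "LA", "GLA"),
    ("C18/20延长酶", "GLA", "DGLA"),
    ("Δ5去饱和酶", "DGLA", "ARA"),
    ("Δ15去饱和酶", "LA", "ALA"),
    ("Δ6去饱和酶", "ALA", "STA"),
    ("C18/20延长酶", "STA", "ETA"),
    ("Δ5去饱和酶", "ETA", "EPA"),
    ("C20/22延长酶", "EPA", "DPA"),
    ("Δ4去饱和酶", "DPA", "DHA"),
    ("Δ17去饱和酶", "DGLA", "ETA"),
    ("Δ17去饱和酶", "ARA", "EPA"),
    ("Δ9延长酶", "LA", "EDA"),
    ("Δ9延长酶", "ALA", "ETrA"),
    ("Δ8去饱和酶", "EDA", "DGLA"),
    ("Δ8去饱和酶", "ETrA", "ETA"),
]

Step = Tuple[str, str]  # (酶, 该步产物)


def all_routes(start: str, end: str) -> List[List[Step]]:
    """枚举从start到end的所有路线(该图无环,递归必然终止)。"""
    if start == end:
        return [[]]
    routes: List[List[Step]] = []
    for enzyme, substrate, product in REACTIONS:
        if substrate == start:
            for tail in all_routes(product, end):
                routes.append([(enzyme, product)] + tail)
    return routes


if __name__ == "__main__":
    for route in all_routes("LA", "EPA"):
        print(" -> ".join(f"{product}({enzyme})" for enzyme, product in route))
```

这种表示也直观地体现了正文的说法:生产特定ω-3/ω-6脂肪酸往往存在多条替代途径。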
预期需要在用于生产ω-3/ω-6脂肪酸的特定宿主生物中表达的具体功能性取决于宿主细胞(及其天然PUFA谱和/或去饱和酶/延长酶谱)、底物的可利用性和所需的终产物。本领域技术人员能够鉴定编码ω-3/ω-6脂肪酸生物合成所需的每种酶的多种候选基因。有用的去饱和酶和延长酶序列可来自任何来源,例如分离自天然来源(细菌、藻类、真菌、卵菌、植物、动物等)、经半合成途径产生或从头合成。尽管导入到宿主中的去饱和酶和延长酶基因的具体来源并不关键,但选择具有去饱和酶或延长酶活性的特定多肽的考虑因素包括:1)多肽的底物特异性;2)多肽或其组分是否为限速酶;3)去饱和酶或延长酶是否是合成所需PUFA必需的;和/或4)多肽所需的辅因子。表达的多肽优选具有与其在宿主细胞中的位置的生物化学环境相适的参数(关于其余细节,参见PCT公布号WO2004/101757)。
在其它实施方案中,考虑每个具体的去饱和酶和/或延长酶的转化效率也是有用的。更具体地说,因为每种酶很少以100%的效率将底物转变为产物,所以在宿主细胞中产生的未纯化油类物质的最终脂质谱通常为由所需的ω-3/ω-6脂肪酸组成的多种PUFA以及多种上游中间体PUFA的混合物。因此,在优化所需脂肪酸的生物合成时,每种酶的转化效率也是一个需要考虑的变量。
考虑到以上各项因素,可按照具有PUFA生产能力的生物的公开可用的文献(例如GenBank)、专利文献和实验分析鉴定具有合适的去饱和酶和延长酶活性的候选基因(例如Δ6去饱和酶、C18/20延长酶、Δ5去饱和酶、Δ17去饱和酶、Δ15去饱和酶、Δ9去饱和酶、Δ12去饱和酶、C14/16延长酶、C16/18延长酶、Δ9延长酶、Δ8去饱和酶、Δ4去饱和酶和C20/22延长酶)。这些基因适于导入到特定宿主生物中,以使生物能合成PUFA或增强生物的PUFA合成。
Δ17去饱和酶的鉴定
本发明的Δ17去饱和酶序列分离自大豆疫霉和栎树猝死病菌(P.ramorum),并如本文所述首先由美国能源部联合基因组研究所(JGI;Walnut Creek,CA)报告。然而,应当指出的是,尽管所述序列是已知的,但它们作为Δ17去饱和酶的功能在本文描述的发现之前未被阐述。申请人首先识别出这些多肽的特定Δ17去饱和酶活性。
为获取这些序列的正确活性,致病疫霉(P.infestans)去饱和酶(“PiD17”;SEQ ID NO:9)在针对JGI数据库的BLAST检索中用作查询字段。这导致鉴定出2个PiD17同源物;具体地说为本文称为“PsD17”(SEQ ID NO:1和2)的大豆疫霉(P.sojae)同源物和本文称为“PrD17”(SEQ ID NO:5和6)的栎树猝死病菌同源物。
为了优化每种酶在替代宿主中的表达,需要通过密码子优化来修饰野生型PsD17和PrD17去饱和酶。应用宿主优选密码子可显著增强编码多肽的外源基因的表达。一般而言,可通过检验蛋白质中的密码子选择(优选以最大量表达的那些),并确定哪种密码子以最高频率使用,确定在特定目标宿主物种中的宿主优选密码子。随后,可使用在宿主物种中优选的密码子,整体或部分地合成具有例如去饱和酶活性的目标多肽的编码序列。
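确定宿主优选密码子并据此重新编码的思路可以用如下示意表达。它假设使用标准遗传密码,并对每种氨基酸简单地选用出现频率最高的密码子;实际的密码子优化通常还要兼顾GC含量、起始密码子周围序列等因素。函数名`preferred_codons`、`recode`和示例序列均为假设。

```python
from collections import Counter, defaultdict
from itertools import product
from typing import Dict, Iterable

# 标准遗传密码表(按TCAG顺序排列)
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def preferred_codons(coding_seqs: Iterable[str]) -> Dict[str, str]:
    """统计一组(高表达)编码序列,返回每种氨基酸最常用的密码子。"""
    counts: Dict[str, Counter] = defaultdict(Counter)
    for seq in coding_seqs:
        seq = seq.upper()
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            aa = CODON_TABLE.get(codon)
            if aa is not None and aa != "*":  # 忽略终止密码子和非法密码子
                counts[aa][codon] += 1
    return {aa: c.most_common(1)[0][0] for aa, c in counts.items()}


def recode(protein: str, preferred: Dict[str, str]) -> str:
    """用优选密码子为给定氨基酸序列重新编码(不含终止密码子)。"""
    return "".join(preferred[aa] for aa in protein.upper())


if __name__ == "__main__":
    # 假设的短序列,仅用于演示
    pref = preferred_codons(["ATGGCCGCCGCTAAGTAA"])
    print(pref)
    print(recode("MAK", pref))
```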
野生型PsD17和PrD17去饱和酶为在解脂耶氏酵母中表达进行了密码子优化,如在表3中所示。
表3
在解脂耶氏酵母中进行ω-3生物合成的密码子优化的Δ17去饱和酶
野生型(WT)Δ17 | WTΔ17的SEQID NO | 在密码子优化的基因中修饰的总碱基 | 在密码子优化的基因中修饰的总密码子 | 由野生型至密码子优化的基因的GC含量改变 | 密码子优化的Δ17 | 密码子优化的Δ17的SEQ IDNO |
PsD17(大豆疫霉) | 1,2 | 1092bp中的175bp(16.0%) | 364个密码子中的168个(46.2%) | 65.1%-54.5% | PsD17S | 3,4 |
PrD17(栎树猝死病菌) | 5,6 | 1086bp中的168bp(15.5%) | 362个密码子中的160个(44.2%) | 64.4%-54.5% | PrD17S | 7,6 |
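表3中的几类统计量(被修饰的碱基数、被修饰的密码子数、GC含量)可按如下示意计算。该示意假设两条序列等长且按密码子对齐;若长度不同(如PsD17与PsD17S),需先进行比对。函数名和示例序列均为假设。

```python
from typing import Dict


def gc_content(seq: str) -> float:
    """返回序列的GC含量(%)。"""
    seq = seq.upper()
    if not seq:
        raise ValueError("序列不能为空")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def percent(part: int, whole: int) -> float:
    """返回保留一位小数的百分率。"""
    return round(100.0 * part / whole, 1)


def compare_aligned(wild_type: str, optimized: str) -> Dict[str, float]:
    """比较等长、按密码子对齐的野生型与密码子优化序列。"""
    wt, opt = wild_type.upper(), optimized.upper()
    if len(wt) != len(opt) or len(wt) % 3 != 0:
        raise ValueError("两条序列必须等长且长度为3的倍数")
    base_changes = sum(a != b for a, b in zip(wt, opt))
    codon_changes = sum(wt[i:i + 3] != opt[i:i + 3]
                        for i in range(0, len(wt), 3))
    return {
        "base_changes": base_changes,
        "codon_changes": codon_changes,
        "gc_wild_type": gc_content(wt),
        "gc_optimized": gc_content(opt),
    }


if __name__ == "__main__":
    print(compare_aligned("ATGGCGCTG", "ATGGCCCTC"))  # 假设的短序列
    # 用表3中的计数复核百分率
    print(percent(175, 1092), percent(168, 364))
    print(percent(168, 1086), percent(160, 362))
```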
本领域技术人员应能够使用本文的教导,基于野生型PsD17和PrD17序列,产生多种其它的适于在替代宿主(即非解脂耶氏酵母)中最优表达的密码子优化的Δ17去饱和酶。因此,本发明涉及来源于野生型PsD17和PrD17(即分别由SEQ ID NO:2和6编码)的任何密码子优化的Δ17去饱和酶。这包括但不限于示于SEQ ID NO:3和7的核苷酸序列,其编码为了在解脂耶氏酵母中表达而进行了密码子优化的合成Δ17去饱和酶蛋白(即分别示于SEQ ID NO:4和6的PsD17S和PrD17S)。
更具体地说,使用Clustal W比对算法(MegAlignTM,DNASTAR,出处同上)在361个氨基酸的长度内将PsD17S核苷酸碱基和推导的氨基酸序列与公共数据库对比揭示,大部分相似的已知序列(即PCT公布号WO2005/083053的致病疫霉“ω3去饱和酶”;另参见GenBank登录号CAJ30870)与本文报告的PsD17S氨基酸序列(SEQ ID NO:4)约91.4%相同。相比之下,致病疫霉“ω3去饱和酶”的氨基酸序列和PsD17(SEQ ID NO:2)之间具有90.9%的同一性。
使用Clustal W比对算法(出处同上)在361个氨基酸的长度内将PrD17S核苷酸碱基和推导的氨基酸序列与公共数据库对比揭示,大部分相似的已知序列(即致病疫霉“ω3去饱和酶”,出处同上)与本文报告的PrD17S的氨基酸序列(SEQ ID NO:6)约89.5%相同。
在本发明范围内,优选的氨基酸片段与本文的PsD17S和PrD17S序列至少约70%-80%相同,其中至少约85%-90%相同的序列特别合适,最优选至少约95%相同的那些序列。编码对应于本发明的ORF的核酸序列的优选的PsD17S和PrD17S是编码活性蛋白的那些,分别与本文报告的PsD17S和PrD17S的核酸序列至少约70%-80%相同,其中至少85%-90%相同的那些序列特别合适,最优选至少约95%相同的那些序列。
在鉴定上述卵菌多肽时,每个密码子优化的脂肪酸去饱和酶的特异性通过转化入合适的宿主(即解脂耶氏酵母)中并测定其对宿主脂肪酸谱的影响来确定(实施例5、8和12)。正如所料,PsD17S和PrD17S均具有Δ17去饱和酶活性,使得所述酶能够催化ARA向EPA的转变。具体地说,PsD17S的ARA向EPA的转变效率为40%和98%(基于在两个不同的解脂耶氏酵母菌株中和在不同的生长条件下确定),而PrD17S的ARA向EPA的转变效率为49%。转变效率按照下式检测:([产物]/[底物+产物])*100,其中‘产物’包括直接产物和在途径中由其衍生的所有产物。
然而,出乎意料的是,PsD17S额外具有有限的Δ15去饱和酶活性(即LA向ALA的转变效率为15%),而PrD17S没有(即LA向ALA的转变效率小于1%)。因此,大豆疫霉去饱和酶在本文被定义为双功能Δ17去饱和酶,而栎树猝死病菌去饱和酶在本文被定义为单功能Δ17去饱和酶。
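单功能与双功能的划分可以用一个简单的判定示意来表达。其中1%的LA向ALA转变效率阈值是为演示而假设的(原文仅给出15%和小于1%这两个观测值,并指出这种区分并非绝对);函数名`classify_delta17`同样为假设。

```python
def classify_delta17(ara_to_epa_pct: float, la_to_ala_pct: float,
                     delta15_threshold_pct: float = 1.0) -> str:
    """根据底物转变效率,把Δ17去饱和酶粗略归类为单功能或双功能。

    delta15_threshold_pct为假设的判定阈值,并非原文定义。
    """
    if ara_to_epa_pct <= 0:
        return "无Δ17去饱和酶活性"
    if la_to_ala_pct >= delta15_threshold_pct:
        return "双功能"
    return "单功能"


if __name__ == "__main__":
    print(classify_delta17(40.0, 15.0))  # 对应PsD17S的观测值
    print(classify_delta17(49.0, 0.5))   # 0.5为假设值,代表“小于1%”
```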
同源物的鉴定和分离
任何本发明的去饱和酶序列(即PsD17、PrD17、PsD17S、PrD17S)或其部分均可用于在相同或其它的细菌、藻类、真菌、卵菌或植物物种中使用序列分析软件检索Δ17去饱和酶同源物。一般而言,该计算机软件通过指定多种取代、缺失和其它修饰的同源程度匹配相似的序列。
或者,本发明的去饱和酶序列中的任一种或其部分还可以用作Δ17同源物鉴定中的杂交试剂。核酸杂交实验的基本组分包括探针、怀疑含有目标基因或基因片段的样品和特异性杂交方法。本发明的探针通常是单链核酸序列,其与待检测的核酸序列互补。探针与待检测的核酸序列是“可杂交的”。尽管探针长度可从5个碱基至数万个碱基不等,但通常约15个碱基至约30个碱基的探针长度是合适的。仅部分探针分子需要与待检测的核酸序列互补。此外,探针和靶序列之间不必完全互补。杂交的确在不完全互补的分子之间发生,结果是杂交区中一定部分的碱基不与正确互补的碱基配对。
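探针与靶序列“不必完全互补”这一点可以用如下示意来量化:先求靶位点的反向互补序列,再统计探针与之不一致的碱基比例。该示意只处理等长的DNA序列,不模拟真实的杂交热力学;函数名为假设。

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """返回DNA序列的反向互补序列(5'→3'方向)。"""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def mismatch_fraction(probe: str, target_site: str) -> float:
    """返回探针与靶位点之间不能正确配对的碱基比例。

    probe和target_site均按5'→3'方向书写且等长。
    """
    if len(probe) != len(target_site) or not probe:
        raise ValueError("探针与靶位点必须等长且非空")
    perfect_probe = reverse_complement(target_site)
    mismatches = sum(p != q for p, q in zip(probe.upper(), perfect_probe))
    return mismatches / len(probe)


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
    print(mismatch_fraction("GCAA", "ATGC"))
```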
杂交方法是非常成熟确定的。通常,探针和样品必须在允许核酸杂交的条件下混合。这包括在无机盐或有机盐存在下在适当的浓度和温度条件下使探针和样品接触。探针和样品核酸必须接触足够长的时间,使得可以在探针和样品核酸之间发生任何可能的杂交。混合物中探针或靶标的浓度将决定发生杂交所必需的时间。探针或靶标浓度越高,需要的杂交温育时间越短。任选地,可以加入离液剂(例如氯化胍、硫氰酸胍、硫氰酸钠、四氯乙酸锂、高氯酸钠、四氯乙酸铷、碘化钾、三氟乙酸铯)。如有需要,可以将甲酰胺加至杂交混合物,通常为30-50%(体积/体积)。
可以使用多种杂交溶液。通常,这些包含约20-60%体积、优选30%的极性有机溶剂。常用的杂交溶液使用约30-50%(体积/体积)甲酰胺、约0.15-1M氯化钠、约0.05-0.1M缓冲液(例如柠檬酸钠、Tris-HCl、PIPES或HEPES(pH范围约6-9))、约0.05-0.2%去污剂(例如十二烷基硫酸钠)或0.5-20mM EDTA、FICOLL(Pharmacia Inc.)(约300-500kdal)、聚乙烯吡咯烷酮(约250-500kdal)和血清白蛋白。典型杂交溶液中还包括约0.1-5mg/mL未标记的载体核酸、片段化核DNA(例如小牛胸腺或鲑精DNA或酵母RNA)和任选约0.5-2%(重量/体积)的甘氨酸。还可以包括其它添加剂,如体积排阻剂,其包括多种极性水溶剂或膨胀剂(例如聚乙二醇)、阴离子聚合物(例如聚丙烯酸酯或聚丙烯酸甲酯)和阴离子糖类聚合物(例如硫酸葡聚糖)。
核酸杂交适于多种测定形式。一种最适宜的形式是夹心测定形式。夹心测定尤其适于在非变性条件下杂交。夹心型测定的主要组分是固相支持体。固相支持体已吸附或共价偶联的固定化核酸探针未被标记,并与序列的一部分互补。
在其它实施方案中,本文描述的任何Δ17去饱和酶核酸片段(或鉴定的其任何同源物)都可用于分离编码相同或其它细菌、藻类、真菌、卵菌或植物物种的同源蛋白的基因。使用序列依赖性方法分离同源基因在本领域众所周知。序列依赖性方法的实例包括但不限于:1)核酸杂交方法;2)DNA和RNA扩增方法,以核酸扩增技术的多种应用为例[例如聚合酶链式反应(PCR),Mullis等,美国专利号4,683,202;连接酶链式反应(LCR),Tabor,S.等,Proc.Natl.Acad.Sci.U.S.A.,82:1074(1985);或链置换扩增(SDA),Walker等,Proc.Natl.Acad.Sci.U.S.A.,89:392(1992)];和3)通过互补进行文库构建和筛选的方法。
例如,可使用本领域技术人员众所周知的方法,通过使用本发明核酸片段的全部或部分作为DNA杂交探针筛选来自任何所需酵母、真菌或卵菌的文库(其中生产EPA[或其衍生物]的那些酵母或真菌应是优选的),可直接分离编码与本文所述的Δ17去饱和酶类似的蛋白或多肽的基因。可通过本领域已知的方法设计和合成基于本发明核酸序列的特异性寡核苷酸探针(Maniatis,出处同上)。而且,可直接使用完整序列通过技术人员已知的方法合成DNA探针(例如随机引物DNA标记、缺口翻译或末端标记技术),或使用可用的体外转录系统合成RNA探针。另外,可设计特异性引物,并用于扩增本发明序列的一部分(或全长)。所获扩增产物可在扩增反应当中直接标记或在扩增反应后标记,并用作探针,以在适宜的严格性条件下分离全长DNA片段。
通常,在PCR型扩增技术中,引物具有不同序列且彼此不互补。根据所需的测试条件,应设计引物序列,以提供既有效又可靠的靶核酸复制。PCR引物设计方法是本领域常见并众所周知的(Thein和Wallace,“The use of oligonucleotide as specific hybridization probes in the Diagnosis of Genetic Disorders”,载于Human Genetic Diseases:A Practical Approach,K.E.Davis编辑,(1986)33-50页,IRL:Herndon,VA;和Rychlik,W.,载于Methods in Molecular Biology,White,B.A.编辑,(1993)15卷,31-39页,PCR Protocols:Current Methods and Applications.Humana:Totowa,NJ)。
通常,本发明序列的两种短区段可用于PCR方案,以由DNA或RNA扩增编码同源基因的更长核酸片段。还可以对所克隆的核酸片段文库进行PCR,其中一种引物的序列来自本发明的核酸片段,另一种引物的序列利用了编码真核生物基因的mRNA前体的3′末端存在的聚腺苷酸序列段。
或者,第二种引物序列可以基于来自克隆载体的序列。例如,技术人员可以通过使用PCR扩增转录物的单个点和3′或5′末端之间区域的拷贝,按照RACE方案(Frohman等,Proc.Natl.Acad.Sci.U.S.A.,85:8998(1988))来产生cDNA。可以从本发明序列设计以3′和5′方向定向的引物。使用市售3′RACE或5′RACE系统(Gibco/BRL,Gaithersburg,MD),可以分离特异性3′或5′cDNA片段(Ohara等,Proc.Natl.Acad.Sci.U.S.A.,86:5673(1989);Loh等,Science,243:217(1989))。
在其它实施方案中,本文描述的任何Δ17去饱和酶核酸片段(或鉴定的其任何同源物)都可以用于产生新的和改进的脂肪酸去饱和酶。如本领域众所周知的,可使用体外诱变和选择、化学诱变、“基因改组”法或其它方法获得天然去饱和酶基因的突变。或者,可通过结构域交换合成改进的脂肪酸,其中来自本文所述的任何Δ17去饱和酶核酸片段的功能结构域都可与替代去饱和酶基因中的功能结构域交换,由此产生新蛋白。
生产不同ω-3和/或ω-6脂肪酸的方法
预期引入处于适当启动子控制下的、编码本文描述的Δ17去饱和酶(即PsD17、PrD17、PsD17S、PrD17S或其它突变酶、密码子优化的酶或其同源物)的嵌合基因将导致所转化的宿主生物中的EPA产量增加。因此,本发明包括直接生产PUFA的方法,其包括使脂肪酸底物(即ARA)与本文描述的去饱和酶(例如PsD17、PrD17、PsD17S、PrD17S)接触,使得底物被转变为所需脂肪酸产物(即EPA)。
更具体地说,本发明的一个目的是提供在宿主细胞(例如油脂酵母)中产生EPA的方法,其中宿主细胞包含:
a)一种编码Δ17去饱和酶多肽的分离的核苷酸分子,其选自:
i)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:2的氨基酸序列的多肽对比时具有至少90.9%的同一性;
ii)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:4的氨基酸序列的多肽对比时具有至少91.4%的同一性;
iii)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:6的氨基酸序列的多肽对比时具有至少89.5%的同一性;和
b)ARA的来源;
其中,宿主细胞在使Δ17去饱和酶基因表达和ARA转变为EPA的条件下生长,其中任选地回收EPA。
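上述三个备选项各自规定了相对于不同参照序列的同一性下限。以下示意用于判断一个候选多肽满足其中哪些下限;同一性数值应事先由Clustal比对得到,函数名`satisfied_references`和示例数值为假设。该示意只做数值比较,不构成任何权利要求解释。

```python
from typing import Dict, List

# 参照序列及其对应的最低同一性(%),取自上文列举的备选项
IDENTITY_THRESHOLDS: Dict[str, float] = {
    "SEQ ID NO:2": 90.9,
    "SEQ ID NO:4": 91.4,
    "SEQ ID NO:6": 89.5,
}


def satisfied_references(identities: Dict[str, float]) -> List[str]:
    """返回候选多肽达到同一性下限的参照序列列表。

    identities: 参照序列名称到同一性百分率的映射;缺失的参照视为未比对。
    """
    return [ref for ref, minimum in IDENTITY_THRESHOLDS.items()
            if identities.get(ref, 0.0) >= minimum]


if __name__ == "__main__":
    # 假设的同一性数值
    print(satisfied_references({"SEQ ID NO:2": 92.0, "SEQ ID NO:4": 91.0}))
```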
本领域技术人员将认识到,Δ17去饱和酶的广泛底物范围将允许使用该酶将二均γ亚麻酸转变为二十碳四烯酸。因此,本发明提供用于生产二十碳四烯酸的方法,该方法包括提供含以下的宿主细胞:
a)一种编码Δ17去饱和酶多肽的分离的核苷酸分子,其选自:
i)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:2的氨基酸序列的多肽对比时具有至少90.9%的同一性;
ii)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:4的氨基酸序列的多肽对比时具有至少91.4%的同一性;
iii)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:6的氨基酸序列的多肽对比时具有至少89.5%的同一性;和
b)二均γ亚麻酸的来源;
c)在其中表达编码Δ17去饱和酶多肽的核酸分子和二均γ亚麻酸被转变为二十碳四烯酸的条件下培养包含步骤(a)的核苷酸分子的宿主细胞;和
d)任选地回收步骤(c)的二十碳四烯酸。
在一个替代实施方案中,基于大豆疫霉Δ17去饱和酶的双功能性,本发明的一个目标是提供在宿主细胞(例如油脂酵母)中生产ALA的方法,其中宿主细胞含有:
a)一种编码双功能Δ17去饱和酶多肽的分离的核苷酸分子,其选自:
i)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:2的氨基酸序列的多肽对比时具有至少90.9%的同一性;和
ii)编码Δ17去饱和酶多肽的分离的核苷酸分子,所述Δ17去饱和酶多肽在基于Clustal比对方法与具有示于SEQ ID NO:4的氨基酸序列的多肽对比时具有至少91.4%的同一性;
b)LA的来源;
其中,宿主细胞在使双功能Δ17去饱和酶基因表达并将LA转变为ALA的条件下生长,其中任选地回收ALA。
或者,本文描述的每种Δ17去饱和酶基因及其相应的酶产物可以间接用于生产ω-3脂肪酸(参见PCT公布号WO2004/101757)。进行ω-3/ω-6PUFA的间接生产,其中脂肪酸底物通过中间步骤或途径中间体间接转化成期望的脂肪酸产物。因此,预期本文描述的Δ17去饱和酶(例如PsD17、PrD17、PsD17S、PrD17S或其它突变酶、密码子优化的酶或其同源物)可与编码PUFA生物合成途径的酶(例如Δ6去饱和酶、C18/20延长酶、Δ5去饱和酶、Δ15去饱和酶、Δ9去饱和酶、Δ12去饱和酶、C14/16延长酶、C16/18延长酶、Δ9延长酶、Δ8去饱和酶、Δ4去饱和酶、C20/22延长酶)的其它基因一起表达,导致较高水平地生产长链ω-3脂肪酸(例如EPA、DPA和DHA)。包含在具体表达盒中的具体基因取决于宿主细胞(及其PUFA谱和/或去饱和酶/延长酶谱)、底物的可利用性和所需的终产物。
在备选实施方案中,基于本文描述的完整序列、那些完整序列的互补物、那些序列的实质部分、来源于这些序列的密码子优化的去饱和酶和与这些序列基本同源的那些序列破坏宿主生物的天然Δ17去饱和酶可能有用。例如,宿主生物中Δ17去饱和酶(和任选的Δ15去饱和酶)的定向破坏产生合成ω-3脂肪酸的能力下降的突变菌株。该突变菌株可用于生产“纯的”ω-6脂肪酸(没有共合成ω-3脂肪酸)。
表达系统、表达盒和载体
可以在异源宿主细胞中表达本文描述的本发明序列的基因和基因产物。重组宿主中的表达可用于产生多种PUFA途径中间体,或者可用于调节宿主中已有的PUFA途径,用以合成迄今使用该宿主还不可能合成的新产物。
含有指导外源蛋白质高水平表达的调节序列的表达系统和表达载体是本领域技术人员众所周知的。这些表达系统和表达载体中的任一种都可以用于构建嵌合基因,用以产生本发明序列的任何基因产物。然后,这些嵌合基因可以通过转化导入适宜的宿主细胞中,以提供所编码酶的高水平表达。
用于转化适宜的宿主细胞的载体或DNA表达盒在本领域众所周知。构建体中存在的序列的具体选择取决于所需表达产物(见上文)、宿主细胞的性质和所建议的分离转化细胞与未转化细胞的方法。然而,通常,载体或表达盒含有指导相关基因转录和翻译的序列、选择性标记和允许自主复制或染色体整合的序列。适宜的载体包含控制转录起始的基因5′区(例如启动子)和控制转录终止的DNA片段的3′区(即终止子)。当两个控制区都来自转化宿主细胞的基因时是最优选的,尽管一般认为此类控制区不必来自选择作为生产宿主的特定物种的天然基因。
用于驱动本发明的ORF在期望宿主细胞中表达的起始控制区或启动子为数众多,且是本领域技术人员熟知的。实际上,能够指导这些基因在选定宿主细胞中表达的任何启动子都适于本发明。可以以瞬时或稳定的方式实现在宿主细胞中的表达。通过诱导与目的基因有效连接的可调节启动子的活性可以实现瞬时表达。通过使用与目的基因有效连接的组成型启动子可以实现稳定表达。作为实例,当宿主细胞是酵母时,提供了在酵母细胞中有功能的转录和翻译区,尤其来自宿主物种的(关于用于解脂耶氏酵母的优选转录起始调节区,例如参见美国专利申请号11/265761,其对应于PCT公布号WO2006/052870)。可以使用多种调节序列的任一种,这取决于是需要组成型转录还是诱导型转录、启动子表达目的ORF的效率、构建的容易性等。
终止区可以来自获得起始区的基因的3′区或来自不同的基因。众多的终止区是已知的,并且在多种宿主中令人满意地发挥功能(当用于与它们所来源的属和种相同或不同的属和种时)。通常选择终止区更多地是为了方便起见,而不是因为任何具体性质。终止控制区还可以来自优选宿主的多种天然基因。任选地,终止位点可为不必要的;然而,最优选包括终止位点。
如本领域技术人员所知,仅仅将基因插入到克隆载体不能确保基因将以所需水平成功表达。为响应高表达率的需要,通过操作许多不同的遗传元件已产生了许多专门的表达载体,所述遗传元件控制转录、翻译、蛋白质稳定性、氧限制和从宿主细胞分泌的方面。更具体地说,已被操作而控制基因表达的一些分子特征包括:1)相关转录启动子和终止子序列的性质;2)所克隆基因的拷贝数以及该基因是质粒携带的还是整合到宿主细胞的基因组中;3)所合成的外源蛋白的最终细胞定位;4)蛋白在宿主生物中翻译和正确折叠的效率;5)宿主细胞中所克隆基因的mRNA和蛋白质的内在稳定性;和6)所克隆基因中的密码子选择,使得其频率接近宿主细胞的优选密码子选择的频率。这些修饰类型的每一种都包括在本发明中,作为进一步优化本文所述的Δ17去饱和酶表达的手段。
宿主细胞的转化
一旦已经得到了编码适于在合适的宿主细胞中表达的多肽的DNA,就将其置于能够在宿主细胞中自主复制的质粒载体中,或者将其直接整合到宿主细胞的基因组中。表达盒的整合可以在宿主基因组内随机发生,或者可以通过使用含有宿主基因组的同源区域、足以与宿主基因座靶向重组的构建体靶向。当构建体靶向内源基因座时,通过内源基因座可以提供所有或者部分的转录和翻译调节区。
当由分开的复制载体表达两种和更多种基因时,希望每种载体都具有不同的选择方式,并且与其它构建体应当没有同源性,以保持稳定表达和防止元件在构建体间再分配。可以通过实验确定调节区的明智选择、选择方法和导入构建体的增殖方法,使得所有导入的基因都以必要的水平表达,以提供目的产物的合成。
可以通过任何标准技术将包含目标基因的构建体导入宿主细胞中。这些技术包括转化(例如乙酸锂转化[Methods in Enzymology,194:186-187(1991)])、原生质体融合、生物粒子轰击(biolistic impact)、电穿孔、微注射或者将目标基因导入宿主细胞中的任何其它方法。
为方便起见,已通过任意方法操作而吸收DNA序列(例如表达盒)的宿主细胞在本文将被称作“转化的”或“重组的”。转化宿主将具有表达构建体的至少一个拷贝,并可具有两个或更多个拷贝,这取决于所述基因是整合到基因组中、扩增还是存在于具有多个拷贝数的染色体外元件上。可以通过多种选择技术鉴定转化的宿主细胞,如在PCT公布号WO2004/101757和PCT公布号WO2005/003310中所述。
转化后,适于本发明的Δ17去饱和酶(和任选地在宿主细胞内共表达的其它PUFA酶)的底物可以由宿主天然产生或转基因产生,或者它们可以外源提供。
ω-3和/或ω-6脂肪酸生物合成的代谢工程
本发明的Δ17去饱和酶的序列信息可用于在多种宿主细胞中操作ω-3和/或ω-6脂肪酸生物合成。这可能需要在PUFA生物合成途径内直接代谢工程,或者贡献碳给PUFA生物合成途径的其它操作途径。用于上调所需生物化学途径和下调不需要的生物化学途径的方法是本领域技术人员众所周知的。例如,与ω-3和/或ω-6脂肪酸生物合成途径竞争能量或碳的生物化学途径,或者干扰特定PUFA终产物生产的天然PUFA生物合成途径酶,可通过基因破坏消除,或者可通过其它方法(例如反义mRNA)下调。
作为增加ARA、EPA或DHA的手段在PUFA生物合成途径中操作(及其相关技术)的详细讨论分别提供于PCT公布号WO2006/055322[美国专利公布号2006-0094092-A1]、PCT公布号WO2006/052870[美国专利公布号2006-0115881-A1]和PCT公布号WO2006/052871[美国专利公布号2006-0110806-A1],还提供了在TAG生物合成途径和TAG降解途径中所需的操作(及其相关技术)。
用于重组表达Δ17去饱和酶的优选宿主
用于表达本发明基因和核酸片段的宿主细胞可以包括在多种原料(包括简单或复杂的糖、脂肪酸、有机酸、油类和醇类和/或烃类)上于宽范围内温度和pH值生长的宿主。基于申请人的代理人的需要,首先分离出用于在油脂酵母(尤其是解脂耶氏酵母)中表达的本发明所述基因;然而,预计因为转录、翻译和蛋白质生物合成装置是高度保守的,所以任何细菌、酵母、藻类、卵菌和/或丝状真菌都将是用于表达本发明的核酸片段的适宜宿主。
优选宿主为油脂生物,例如油脂酵母。这些油脂生物天然地能够进行油合成和累积,其中油质可以占细胞干重的约25%以上,更优选地占细胞干重的约30%以上,最优选地占细胞干重的约40%以上。通常鉴定为油脂酵母的属包括但不限于:耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)和油脂酵母属(Lipomyces)。更具体地说,示例性油质合成酵母包括:红冬孢酵母菌(Rhodosporidium toruloides)、Lipomyces starkeyii、L. lipoferus、Candida revkaufi、铁红假丝酵母(C. pulcherrima)、热带念珠菌(C. tropicalis)、产朊假丝酵母(C. utilis)、Trichosporon pullans、皮肤毛孢子菌(T. cutaneum)、Rhodotorula glutinus、牧草红酵母(R. graminis)和解脂耶氏酵母(Yarrowia lipolytica)(以前分类为Candida lipolytica)。
最优选的是油脂酵母解脂耶氏酵母;在又一个实施方案中,最优选的是称为ATCC#76982、ATCC#20362、ATCC#8862、ATCC#18944和/或LGAMS(7)1的解脂耶氏酵母菌株(Papanikolaou S.和Aggelis G.,Bioresour.Technol.,82(1):43-9(2002))。
适用于在解脂耶氏酵母中工程化EPA和DHA的具体技术分别提供于美国专利申请号11/265761和11/264737。用于在油脂酵母(即解脂耶氏酵母)中合成和转化含有Δ17去饱和酶的表达载体的详细方法提供于PCT公布号WO2004/101757和WO2006/052870。在该酵母中表达基因的优选方法是通过将线性DNA整合入宿主基因组中;以及,在需要高水平表达基因时,整合入基因组内的多个基因座中可能尤其有用[例如在Ura3基因座(GenBank登录号AJ306421)、Leu2基因座(GenBank登录号AF260230)、Lys5基因座(GenBank登录号M34929)、Aco2基因座(GenBank登录号AJ001300)、Pox3基因座(Pox3:GenBank登录号XP_503244;或Aco3:GenBank登录号AJ001301)、Δ12去饱和酶基因座(PCT公布号WO2004/104167)、Lip1基因座(GenBank登录号Z50020)和/或Lip2基因座(GenBank登录号AJ012632)]。
用于解脂耶氏酵母的优选选择方法是针对卡那霉素、潮霉素和氨基糖苷G418的抗性,以及在没有尿嘧啶、亮氨酸、赖氨酸、色氨酸或组氨酸的培养基上生长的能力。在替代的实施方案中,使用5-氟乳清酸(5-氟尿嘧啶-6-羧酸一水合物;“5-FOA”)选择酵母Ura-突变体。该化合物对具有编码乳清酸核苷5’-一磷酸脱羧酶(OMP脱羧酶)的功能性URA3基因的酵母细胞有毒;因此,基于该毒性,5-FOA尤其可用于选择和鉴定Ura-突变酵母菌(Bartel,P.L.和Fields,S.,Yeast2-Hybrid System,Oxford University:New York,第7卷,109-147页,1997)。
其它优选微生物宿主包括油脂细菌、藻类、卵菌和其它真菌;以及,在该广泛的微生物宿主类别中,特别令人感兴趣的是合成ω-3/ω-6脂肪酸的微生物。因此,例如,用处于诱导型或调节型启动子控制下的任何本发明的Δ17去饱和酶基因转化高山被孢霉(其在商业上用于生产ARA)可产生能够合成EPA的转化生物。转化高山被孢霉(M.alpina)的方法描述于Mackenzie等(Appl.Environ.Microbiol.,66:4655(2000))。同样,用于转化破囊壶菌目(Thraustochytriales)微生物的方法公开于美国专利号7,001,772。
在替代的优选实施方案中,本发明提供用本文所述的Δ17去饱和酶转化的多种植物宿主。如此转化的植物可为单子叶植物或双子叶植物,优选地,它们属于一类被鉴定为含油的植物(例如油籽植物)。优选的油籽植物宿主的实例包括但不限于:大豆(Glycine和Soja sp.)、玉米(Zea mays)、亚麻(Linum sp.)、油菜籽(Brassica sp.)、樱草、油菜、玉米、红花(Carthamus sp.)和向日葵(Helianthus sp.)。在油籽植物中过表达脂肪酸去饱和酶的方法(例如构建表达盒、转化、选择等)描述于PCT公布号WO2005/047479。
不管选择哪种具体宿主表达本文所述的Δ17去饱和酶,都必须筛选多种转化体,以便获得展现出所需的表达水平和模式的菌株。此筛选可通过下列方式来进行:DNA印迹的DNA分析(Southern,J.Mol.Biol.,98:503(1975))、mRNA表达的RNA分析(Kroczek,J.Chromatogr.Biomed.Appl.,618(1-2):133-145(1993))、蛋白质表达的蛋白质和/或Elisa分析、PUFA产物的表型分析或GC分析。
用于Ω脂肪酸生产的发酵方法
转化的宿主细胞在优化嵌合去饱和酶基因表达并产生最大和最经济的所需PUFA产量的条件下生长。通常,可以优化的培养基条件包括碳源的类型和量、氮源的类型和量、碳-氮比率、不同矿物离子的量、氧水平、生长温度、pH、生物量生产期的长度、油累积期的长度以及细胞收获的时间和方法。解脂耶氏酵母一般生长在复杂培养基(例如酵母提取物-蛋白胨-葡萄糖培养基(YPD))中,或者生长在缺少生长必需组分的限定基本培养基中,由此强迫选择所需表达盒(例如酵母氮源(DIFCO Laboratories,Detroit,MI))。
本发明中的发酵培养基必须含有适宜的碳源。适宜的碳源讲述于PCT公布号WO2004/101757。尽管预计本发明可利用的碳源可包括各种各样的含碳来源,但优选的碳源为糖、甘油和/或脂肪酸。最优选葡萄糖和/或含有10-22个碳的脂肪酸。
可以从无机来源(例如(NH4)2SO4)或有机来源(例如尿素或谷氨酸)提供氮。除了适宜的碳源和氮源以外,发酵培养基必须还含有适宜的矿物质、盐、辅因子、缓冲剂、维生素和本领域技术人员已知的其它组分,它们适于油脂宿主的生长和促进PUFA生产必需的酶促途径。特别要关注促进脂质和PUFA合成的一些金属离子(例如Fe+2、Cu+2、Mn+2、Co+2、Zn+2、Mg+2)(Nakahara,T.等,Ind.Appl.Single Cell Oils,D.J.Kyle和R.Colin编辑,61-97页(1992))。
本发明中优选的生长培养基是通常商业制备的培养基,例如酵母氮源(DIFCO Laboratories,Detroit,MI)。还可以使用其它限定或合成生长培养基,用于转化体宿主细胞生长的适宜培养基是微生物学或发酵科学领域的技术人员已知的。发酵适宜的pH范围通常为约pH4.0至pH8.0,其中pH5.5至pH7.5优选为最初生长条件的范围。发酵可以在需氧或厌氧条件下进行,其中优选微需氧条件。
通常,油脂酵母细胞中的高水平PUFA累积需要两步法,因为代谢状态必须在脂肪的生长和合成/贮存之间“平衡”。因此,最优选地,在解脂耶氏酵母中生产PUFA必需两步发酵法。该方法描述于PCT公布号WO2004/101757,还描述了多种适宜的发酵过程设计(即分批、补料-分批和连续)和生长过程中的考虑因素。
实施例
在下面实施例中进一步限定本发明。应当理解的是,这些实施例尽管指出了本发明的优选实施方案,但是仅作为示例给出。本领域技术人员由以上讨论和这些实施例可以确定本发明的基本特征,并可以在不背离本发明的精神和范围的情况下,对本发明做出多种使其适于多种用法和条件的改变和修改。
一般方法
实施例中使用的标准重组DNA和分子克隆技术在本领域众所周知,描述于:1)Sambrook,J.,Fritsch,E.F.和Maniatis,T.Molecular Cloning:A Laboratory Manual;Cold Spring Harbor Laboratory:Cold Spring Harbor,NY(1989)(Maniatis);2)T.J.Silhavy,M.L.Berman,和L.W.Enquist,Experiments with Gene Fusions;Cold Spring Harbor Laboratory:Cold Spring Harbor,NY(1984);和3)Ausubel,F.M.等,Current Protocols in Molecular Biology,由Greene Publishing Assoc.和Wiley-Interscience,Hoboken,NJ(1987)出版。
适于维持和生长微生物培养物的材料和方法在本领域众所周知。适用于以下实施例的技术可见于Manual of Methods for General Bacteriology(Phillipp Gerhardt,R.G.E.Murray,Ralph N.Costilow,Eugene W.Nester,Willis A.Wood,Noel R.Krieg和G.Briggs Phillips编辑),American Society for Microbiology:Washington,D.C.(1994);或Thomas D.Brock,Biotechnology:A Textbook of Industrial Microbiology,第2版,Sinauer Associates:Sunderland,MA(1989)中的陈述。除非另外指出,否则用于生长和维持微生物细胞的所有试剂、限制酶和材料都得自Aldrich Chemicals(Milwaukee,WI)、DIFCO Laboratories(Detroit,MI)、GIBCO/BRL(Gaithersburg,MD)或Sigma Chemical Company(St.Louis,MO)。大肠杆菌(XL1-Blue)感受态细胞购自Stratagene Company(San Diego,CA)。大肠杆菌菌株通常在37℃于Luria Bertani(LB)板上生长。
根据标准方法(Sambrook等,出处同上)进行常规分子克隆。DNA序列在ABI自动测序仪上使用染料终止物技术(美国专利号5,366,860;EP272,007)使用载体和插入片段特异性引物的组合产生。用DNASTAR软件(DNA Star,Inc.)完成基因序列的比较。
除非另有说明,否则进行BLAST(基本局部算法检索工具;Altschul,S.F.等,J.Mol.Biol.,215:403-410(1993)和Nucleic AcidsRes.,25:3389-3402(1997))检索来鉴定与在BLAST“nr”数据库(包含所有的非冗余GenBank CDS翻译,来源于3维结构Brookhaven蛋白质数据库的序列,SWISS-PROT蛋白质序列数据库、EMBL和DDBJ数据库)中包含的序列具有相似性的分离序列。翻译所有读框中的序列,并使用由国立生物技术信息中心(NCBI)提供的BLASTX算法(Gish,W.和States,D.J.Nature Genetics,3:266-272(1993))对“nr”数据库中包含的所有公众可用的蛋白质序列比较相似性。概述了与查询序列具有最高相似性的序列的BLAST比较结果根据%同一性、%相似性和期望值报告。“%同一性”被定义为两个蛋白之间相同的氨基酸的百分率。“%相似性”被定义为两个蛋白之间相同或保守的氨基酸的百分率。“期望值”估计匹配的统计学显著性,以给定记分指定预期在该大小的数据库检索中完全偶然的匹配数。
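The "% identity" definition above can be illustrated with a minimal Python sketch. This is not how BLAST or ClustalW computes its reported values (both build their own alignments and BLAST reports identity over the aligned region only); the function name `percent_identity` is hypothetical, and the inputs are assumed to be two already-aligned sequences of equal length with "-" as the gap character. The example compares the first 16 residues of SEQ ID NO:4 and SEQ ID NO:9 from the Sequence Listing, which need no gaps.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns whose two residues are identical.

    Assumes the sequences are already aligned (equal length, '-' for gaps).
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# First 16 residues of SEQ ID NO:4 and of PiD17 (SEQ ID NO:9), one-letter code.
psd17s_start = "MATKQPYQFPTLTEIK"
pid17_start = "MATKEAYVFPTLTEIK"
print(percent_identity(psd17s_start, pid17_start))  # 13 of 16 positions match
```

"% Similarity" would additionally count conservative substitutions, which requires a substitution matrix and is deliberately left out of this sketch.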
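The meaning of abbreviations is as follows: "sec" means second(s), "min" means minute(s), "h" means hour(s), "d" means day(s), "μL" means microliter(s), "mL" means milliliter(s), "L" means liter(s), "μM" means micromolar, "mM" means millimolar, "M" means molar, "mmol" means millimole(s), "μmole" means micromole(s), "g" means gram(s), "μg" means microgram(s), "ng" means nanogram(s), "U" means unit(s), "bp" means base pair(s) and "kB" means kilobase(s).
Transformation and Cultivation of Yarrowia lipolytica
Yarrowia lipolytica strain ATCC #20362 was purchased from the American Type Culture Collection (Rockville, MD). Y. lipolytica strains were typically grown at 28°C on YPD agar (1% yeast extract, 2% bactopeptone, 2% glucose, 2% agar).
Unless otherwise specified, transformation of Y. lipolytica was performed according to the method of Chen, D. C. et al. (Appl. Microbiol. Biotechnol., 48(2):232-235 (1997)). Briefly, Yarrowia was streaked onto a YPD plate and grown at 30°C for approximately 18 h. Several large loopfuls of cells were scraped from the plate and resuspended in 1 mL of transformation buffer, which was prepared from: 2.25 mL of 50% PEG, average MW 3350; 0.125 mL of 2 M lithium acetate, pH 6.0; 0.125 mL of 2 M DTT; and 50 μg of sheared salmon sperm DNA. Then, approximately 500 ng of linearized plasmid DNA was incubated in 100 μL of resuspended cells and maintained at 39°C for 1 h, with vortex mixing at 15 min intervals. The cells were plated onto selection media plates and maintained at 30°C for 2 to 3 days.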
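As a worked check of the transformation buffer recipe, the following Python sketch applies the dilution relation C1·V1 = C2·V2 to the listed stock volumes. It assumes that the three liquid volumes are additive (2.5 mL in total) and that the salmon sperm DNA contributes negligible volume; neither assumption is stated in the text, and the helper name `final_concentration` is hypothetical.

```python
def final_concentration(stock_conc: float, stock_vol_ml: float,
                        total_vol_ml: float) -> float:
    """Solve C1*V1 = C2*V2 for C2 (same concentration unit as the stock)."""
    return stock_conc * stock_vol_ml / total_vol_ml


# Stock volumes in mL, as listed in the protocol.
volumes_ml = {"PEG": 2.25, "lithium acetate": 0.125, "DTT": 0.125}
total_ml = sum(volumes_ml.values())  # assumes additive volumes

peg_percent = final_concentration(50.0, volumes_ml["PEG"], total_ml)
liac_molar = final_concentration(2.0, volumes_ml["lithium acetate"], total_ml)
dtt_molar = final_concentration(2.0, volumes_ml["DTT"], total_ml)

print(f"total volume: {total_ml} mL")
print(f"PEG: {peg_percent}%  LiAc: {liac_molar} M  DTT: {dtt_molar} M")
```

Under these assumptions the working buffer is 45% PEG, 0.1 M lithium acetate and 0.1 M DTT.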
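For selection of transformants, minimal medium ("MM") was generally used; the composition of MM is as follows: 0.17% yeast nitrogen base (DIFCO Laboratories, Detroit, MI) without ammonium sulfate or amino acids, 2% glucose, 0.1% proline, pH 6.1. Supplements of leucine, lysine and/or uracil were added as appropriate to a final concentration of 0.01% (thereby producing "MMLe", "MMLys" and "MMU" selection media, each prepared with 20 g/L agar).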
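The MM recipe can be scaled to any batch size with a short Python sketch. It assumes that the percentages are weight per volume (1% = 10 g/L), which the text does not state explicitly; the dictionary and function names are hypothetical, and pH adjustment to 6.1 is not modeled.

```python
# Grams per liter, assuming the percentages in the text are w/v.
MM_AGAR_G_PER_L = {
    "yeast nitrogen base (no ammonium sulfate, no amino acids)": 1.7,  # 0.17%
    "glucose": 20.0,   # 2%
    "proline": 1.0,    # 0.1%
    "agar": 20.0,      # 20 g/L, plates only
}


def grams_needed(recipe_g_per_l: dict, volume_ml: float) -> dict:
    """Scale a g/L recipe to the requested batch volume."""
    return {name: g_per_l * volume_ml / 1000.0
            for name, g_per_l in recipe_g_per_l.items()}


batch = grams_needed(MM_AGAR_G_PER_L, 500)  # a 500 mL batch of MM agar
for component, grams in batch.items():
    print(f"{component}: {grams:.2f} g")
```

A supplemented medium such as MMU would add the relevant supplement at 0.01% (0.1 g/L under the same w/v assumption).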
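Alternatively, transformants were selected on 5-fluoroorotic acid ("FOA"; also 5-fluorouracil-6-carboxylic acid monohydrate) selection media, comprising: 0.17% yeast nitrogen base (DIFCO Laboratories) without ammonium sulfate or amino acids, 2% glucose, 0.1% proline, 75 mg/L uracil, 75 mg/L uridine, 900 mg/L FOA (Zymo Research Corp., Orange, CA) and 20 g/L agar.
High Glucose Media ("HGM") was prepared as follows: 6.3 g/L KH2PO4, 27 g/L K2HPO4 and 80 g/L glucose (pH 7.5).
Fatty Acid Analysis of Yarrowia lipolytica
For fatty acid analysis, cells were collected by centrifugation and lipids were extracted as described in Bligh, E. G. and Dyer, W. J. (Can. J. Biochem. Physiol., 37:911-917 (1959)). Fatty acid methyl esters were prepared by transesterification of the lipid extract with sodium methoxide (Roughan, G. and Nishida, I., Arch. Biochem. Biophys., 276(1):38-46 (1990)) and subsequently analyzed with a Hewlett-Packard 6890 GC fitted with a 30 m × 0.25 mm (i.d.) HP-INNOWAX (Hewlett-Packard) column. The oven temperature was raised from 170°C (25 min hold) to 185°C at 3.5°C/min.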
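The length of the GC oven program follows directly from the stated hold and ramp. The Python sketch below assumes the 25 min hold occurs at 170°C before the ramp begins and that there is no final hold at 185°C; the text specifies neither point, and the function name is hypothetical.

```python
def oven_program_minutes(start_c: float, hold_min: float,
                         ramp_c_per_min: float, end_c: float) -> float:
    """Initial isothermal hold plus the time needed for a linear ramp."""
    if ramp_c_per_min <= 0:
        raise ValueError("ramp rate must be positive")
    ramp_min = (end_c - start_c) / ramp_c_per_min
    return hold_min + ramp_min


total_min = oven_program_minutes(start_c=170.0, hold_min=25.0,
                                 ramp_c_per_min=3.5, end_c=185.0)
print(f"{total_min:.1f} min")  # 25 min hold + 15/3.5 min ramp
```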
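For direct base transesterification, Yarrowia culture (3 mL) was harvested, washed once in distilled water, and dried under vacuum in a Speed-Vac for 5 to 10 min. Sodium methoxide (100 μL of 1%) was added to the sample, and the sample was then vortexed and rocked for 20 min. After adding 3 drops of 1 M NaCl and 400 μL of hexane, the sample was vortexed and spun. The upper layer was removed and analyzed by GC as described above.
Example 1
Identification of a Phytophthora sojae Gene Encoding a Δ17 Desaturase
The U.S. Department of Energy's Joint Genome Institute ("JGI"; Walnut Creek, CA) created version 1.0 of the Phytophthora sojae genome (estimated genome size of 95 Mbp). This genomic sequence was generated using a whole genome shotgun strategy and comprises a total of 19,276 gene models.
Using the amino acid sequence of the Δ17 desaturase of Phytophthora infestans (GenBank Accession No. CAJ30870; designated herein as "PiD17" and corresponding to SEQ ID NO:9) as a query sequence, a TBLASTN (BLAST protein versus translated nucleotide) search was conducted against JGI's Phytophthora sojae database (using the default parameters available from JGI). One P. sojae ORF, located on scaffold 17:338148-339167, was found to share extensive homology with PiD17 (i.e., 91.8% identity and 95.6% similarity, with an Expectation value of 0). Based on this homology, the P. sojae ORF was tentatively identified as a Δ17 desaturase and was designated "PsD17". When the 1092 bp DNA sequence of PsD17 (SEQ ID NO:1) was retrieved from the database, it was found to encode a polypeptide of 363 amino acids in length (SEQ ID NO:2). Amino acid sequence alignment using a ClustalW analysis (MegAlign™ program of DNASTAR software) revealed 90.9% identity between PiD17 and PsD17; in contrast, the nucleotide sequences shared only 86.6% identity.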
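The relationship between the 1092 bp ORF and the 363-residue polypeptide can be checked with a small Python sketch. It uses the standard genetic code (an assumption; the text does not name a translation table), and the helper names `translate` and `encoded_length` are hypothetical. The input is the first 30 nucleotides of SEQ ID NO:1 as printed in the Sequence Listing.

```python
BASES = "tcag"
# Standard genetic code, codons ordered t, c, a, g at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and digits are ignored."""
    seq = "".join(ch for ch in dna.lower() if ch in BASES)
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def encoded_length(orf_bp: int) -> int:
    """Residues encoded by an ORF whose length includes one stop codon."""
    if orf_bp % 3:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3 - 1


first_30_bp = "atggcgtcca agcaggagca gccgtaccag"  # start of SEQ ID NO:1
print(translate(first_30_bp))   # one-letter codes for the first ten residues
print(encoded_length(1092))     # 364 codons minus the stop codon
```

The translated residues correspond to the start of SEQ ID NO:2 (Met Ala Ser Lys Gln Glu Gln Pro Tyr Gln).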
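PsD17 (SEQ ID NO:2) was also used as the query sequence in a protein-protein BLAST search to determine its sequence homology to all publicly available protein sequences contained in the "nr" database (see General Methods). Based on this analysis, PsD17 was found to share the most homology with the ω-3 fatty acid desaturase of Saprolegnia diclina (GenBank Accession No. AAR20444); specifically, PsD17 had 60% identity and 74% similarity with the amino acid sequence of GenBank Accession No. AAR20444, with an Expectation value of 7E-117. Additionally, PsD17 had 39% identity and 57% similarity with the amino acid sequence of the fatty acid desaturase of Anabaena variabilis ATCC #29413 (GenBank Accession No. ABA23809), with an Expectation value of 4E-57.
Example 2
Synthesis of a Codon-Optimized Δ17 Desaturase Gene ("PsD17S") for Yarrowia lipolytica
The codon usage of the Δ17 desaturase gene of Phytophthora sojae was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in U.S. Patent 7,125,672. Specifically, a codon-optimized Δ17 desaturase gene (designated "PsD17S", SEQ ID NOs:3 and 4) was designed based on the coding sequence of PsD17 (SEQ ID NOs:1 and 2), according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1-2):11-23 (2001)). In addition to modification of the translation initiation site, 175 bp of the 1092 bp coding region were modified (16.0%) and 168 codons were optimized (46.2%). The GC content was reduced from 65.1% in the wild-type gene (i.e., PsD17) to 54.5% in the synthetic gene (i.e., PsD17S). An NcoI site and a NotI site were incorporated around the translation initiation codon and after the stop codon of PsD17S (SEQ ID NO:3), respectively. FIG. 2 shows a comparison of the nucleotide sequences of PsD17 and PsD17S. At the amino acid level, PsD17S lacks the third and fourth amino acids as compared with the wild-type PsD17; thus, the total length of PsD17S is 361 amino acids (SEQ ID NO:4). The designed PsD17S gene was synthesized by GenScript Corporation (Piscataway, NJ) and cloned into pUC57 (GenBank Accession No. Y14837) to generate pPsD17S (SEQ ID NO:10; FIG. 3A).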
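The modification percentages quoted for PsD17S can be reproduced with simple arithmetic, and GC content can be computed for any sequence in the Sequence Listing. The following Python sketch is illustrative only: the function names are hypothetical, and the codon denominator of 364 (1092 bp divided by 3, stop codon included) is an inference from the quoted 46.2%, not a figure stated in the text.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, digits and line breaks (as found in a sequence listing) are ignored.
    """
    bases = [b for b in seq.lower() if b in "acgt"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "gc" for b in bases) / len(bases)


def percent(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place, as reported in the text."""
    return round(100.0 * part / whole, 1)


print(percent(175, 1092))        # modified bases in the 1092 bp coding region
print(percent(168, 1092 // 3))   # optimized codons out of 364 codons
print(gc_content("atgc"))        # toy input; pass a full sequence in practice
```

Passing the full text of SEQ ID NO:1 or SEQ ID NO:3 to `gc_content` would give the GC content of the wild-type or synthetic gene, respectively.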
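Example 3
Generation of Construct pFmPsD17S
The present Example describes the construction of plasmid pFmPsD17S, comprising a chimeric FBAINm::PsD17S::XPR gene. Plasmid pFmPsD17S (SEQ ID NO:13; FIG. 3D) was constructed by three-way ligation, using fragments from plasmids pKUNF1-KEA, pDMW214 and pPsD17S. Plasmid pFmPsD17S was utilized to test functional expression of PsD17S, as described in Example 5, infra.
Plasmid pKUNF1-KEA
pKUNF1-KEA (SEQ ID NO:11; FIG. 3B) comprises a chimeric FBAINm::EL1S::Pex20 gene. The "FBAINm" promoter within this chimeric gene (PCT Publication No. WO 05/049805; also identified as "Fba1+intron" in FIG. 3B) refers to a synthetic promoter derived from the "FBAIN" promoter. The FBAIN promoter refers to the 5' upstream untranslated region in front of the 'ATG' translation initiation codon of the fructose-bisphosphate aldolase enzyme (E.C. 4.1.2.13) encoded by the fba1 gene, which is necessary for expression, plus a portion of the 5' coding region that contains an intron of the fba1 gene. The FBAINm promoter is modified from FBAIN, in that FBAINm has a 52 bp deletion between the ATG translation initiation codon and the intron of the FBAIN promoter (thereby including only 22 amino acids of the N-terminus) and a new translation consensus motif after the intron. Furthermore, while the FBAIN promoter generates a fusion protein when fused with the coding region of a gene to be expressed, the FBAINm promoter does not generate such a fusion protein.
Table 4 summarizes the components of plasmid pKUNF1-KEA.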
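Table 4
Description of Plasmid pKUNF1-KEA (SEQ ID NO:11)
RE Sites and Nucleotides Within SEQ ID NO:11 | Description of Fragment and Chimeric Gene Components |
PmeI/PacI (360-2596) | FBAINm::EL1S::Pex20, comprising: ● FBAINm: Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805) ● EL1S: codon-optimized elongase 1 gene (PCT Publication No. WO 2004/101753), derived from Mortierella alpina (GenBank Accession No. AX464731) ● Pex20: Pex20 terminator sequence of the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
AscI/BsiWI (3387-2596) | 5' portion of the Yarrowia Ura3 gene (GenBank Accession No. AJ306421) |
PacI/SphI (6617-6095) | 3' portion of the Yarrowia Ura3 gene (GenBank Accession No. AJ306421) |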
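Plasmid pDMW214
pDMW214 (SEQ ID NO:12; FIG. 3C) is a shuttle plasmid that replicates in both E. coli and Yarrowia lipolytica. It contains the following components:
Table 5
Description of Plasmid pDMW214 (SEQ ID NO:12)
RE Sites and Nucleotides Within SEQ ID NO:12 | Description of Fragment and Chimeric Gene Components |
1150-270 | ColE1 plasmid origin of replication |
2080-1220 | Ampicillin-resistance gene (AmpR) |
2979-4256 | Yarrowia autonomous replication sequence (ARS18; GenBank Accession No. A17608) |
PmeI/SphI (6501-4256) | Yarrowia Leu2 gene (GenBank Accession No. AF260230) |
6501-1 | FBAIN::GUS::XPR, comprising: ● FBAIN: Yarrowia lipolytica FBAIN promoter (PCT Publication No. WO 2005/049805) ● GUS: E. coli gene encoding β-glucuronidase (Jefferson, R. A., Nature, 14;342:837-838 (1989)) ● XPR: ~100 bp of the 3' region of the Yarrowia Xpr gene (GenBank Accession No. M17741) |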
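Final Construction of Plasmid pFmPsD17S
The PmeI/NcoI fragment of plasmid pKUNF1-KEA (FIG. 3B; comprising the FBAINm promoter) and the NcoI/NotI fragment of plasmid pPsD17S (FIG. 3A; comprising the synthetic Δ17 desaturase gene PsD17S) were used directionally to replace the PmeI/NotI fragment of pDMW214 (FIG. 3C). This resulted in the generation of pFmPsD17S (SEQ ID NO:13; FIG. 3D), comprising a chimeric FBAINm::PsD17S::XPR gene. The components of pFmPsD17S are thus as described in Table 6 below.
Table 6
Description of Plasmid pFmPsD17S (SEQ ID NO:13)
RE Sites and Nucleotides Within SEQ ID NO:13 | Description of Fragment and Chimeric Gene Components |
5551-1305 | Yarrowia autonomous replication sequence (ARS18; GenBank Accession No. A17608) |
7769-1269 | FBAINm::PsD17S::XPR, comprising: ● FBAINm: Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805) ● PsD17S: codon-optimized Δ17 desaturase gene (SEQ ID NO:3), derived from Phytophthora sojae ● XPR: ~100 bp of the 3' region of the Yarrowia Xpr gene (GenBank Accession No. M17741) |
2418-1538 | ColE1 plasmid origin of replication |
3348-2488 | Ampicillin-resistance gene (AmpR) |
PmeI/SphI (7769-5524) | Yarrowia Leu2 gene (GenBank Accession No. AF260230) |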
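Example 4
Generation of Yarrowia lipolytica Strain Y2047 to Produce About 11% ARA of Total Lipids Via the Δ6 Desaturase/Δ6 Elongase Pathway
The present Example describes the construction of strain Y2047, derived from Yarrowia lipolytica ATCC #20362, capable of producing 11% ARA relative to the total lipids via expression of a Δ6 desaturase/Δ6 elongase pathway (FIG. 4A). Y2047 has been deposited under the terms of the Budapest Treaty and bears the ATCC designation PTA-7186. Additionally, construction of Y2047 is described in co-pending U.S. Patent Application No. 11/265761, herein incorporated by reference.
The development of strain Y2047 first required the construction of strain M4 (producing 8% DGLA).
Generation of M4 Strain to Produce About 8% DGLA of Total Lipids
Construct pKUNF12T6E (FIG. 4B; SEQ ID NO:14) was generated to integrate four chimeric genes (comprising a Δ12 desaturase, a Δ6 desaturase and two C18/20 elongases) into the Ura3 locus of wild-type Yarrowia strain ATCC #20362, to thereby enable production of DGLA. The pKUNF12T6E plasmid contained the following components: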
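Table 7
Description of Plasmid pKUNF12T6E (SEQ ID NO:14)
RE Sites and Nucleotides Within SEQ ID NO:14 | Description of Fragment and Chimeric Gene Components |
AscI/BsiWI (9420-8629) | 784 bp 5' portion of the Yarrowia Ura3 gene (GenBank Accession No. AJ306421) |
SphI/PacI (12128-1) | 516 bp 3' portion of the Yarrowia Ura3 gene (GenBank Accession No. AJ306421) |
SwaI/BsiWI (6380-8629) | FBAIN::EL1S::Pex20, comprising: ● FBAIN: Yarrowia lipolytica FBAIN promoter (PCT Publication No. WO 2005/049805) ● EL1S: codon-optimized elongase 1 gene (PCT Publication No. WO 2004/101753), derived from Mortierella alpina (GenBank Accession No. AX464731) ● Pex20: Pex20 terminator sequence from the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
BglII/SwaI (4221-6380) | TEF::Δ6S::Lip1, comprising: ● TEF: Yarrowia lipolytica TEF promoter (GenBank Accession No. AF054508) ● Δ6S: codon-optimized Δ6 desaturase gene (PCT Publication No. WO 2004/101753), derived from Mortierella alpina (GenBank Accession No. AF465281) ● Lip1: Lip1 terminator sequence from the Yarrowia Lip1 gene (GenBank Accession No. Z50020) |
PmeI/ClaI (4207-1459) | FBA::F.Δ12::Lip2, comprising: ● FBA: Yarrowia lipolytica FBA promoter (PCT Publication No. WO 2005/049805) ● F.Δ12: Fusarium moniliforme Δ12 desaturase gene (PCT Publication No. WO 2005/047485) ● Lip2: Lip2 terminator sequence from the Yarrowia Lip2 gene (GenBank Accession No. AJ012632) |
ClaI/PacI (1459-1) | TEF::EL2S::XPR, comprising: ● TEF: Yarrowia lipolytica TEF promoter (GenBank Accession No. AF054508) ● EL2S: codon-optimized elongase gene (SEQ ID NO:15), derived from Thraustochytrium aureum (U.S. Patent No. 6,677,145) ● XPR: ~100 bp of the 3' region of the Yarrowia Xpr gene (GenBank Accession No. M17741) |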
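The pKUNF12T6E plasmid was digested with AscI/SphI and then used for transformation of wild-type Y. lipolytica ATCC #20362 according to the General Methods. The transformant cells were plated onto FOA selection media plates and maintained at 30°C for 2 to 3 days. FOA-resistant colonies were picked and streaked onto MM and MMU selection plates. Colonies that could grow on MMU plates but not on MM plates were selected as Ura- strains. Single colonies of Ura- strains were then inoculated into liquid MMU at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of DGLA in the transformants containing the four chimeric genes of pKUNF12T6E, but not in the wild-type Yarrowia control strain. Most of the 32 selected Ura- strains produced about 6% DGLA of total lipids. Two strains (i.e., strains M4 and 13-8) produced about 8% DGLA of total lipids.
Generation of Y2047 Strain to Produce About 11% ARA of Total Lipids
Construct pDMW271 (FIG. 4C; SEQ ID NO:17) was generated to integrate three chimeric Δ5 desaturase genes into the Leu2 gene of Yarrowia strain M4. Plasmid pDMW271 contained the components described in Table 8 below: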
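Table 8
Description of Plasmid pDMW271 (SEQ ID NO:17)
RE Sites and Nucleotides Within SEQ ID NO:17 | Description of Fragment and Chimeric Gene Components |
AscI/BsiWI (5520-6315) | 788 bp 5' portion of the Yarrowia Leu2 gene (GenBank Accession No. AF260230) |
SphI/PacI (2820-2109) | 703 bp 3' portion of the Yarrowia Leu2 gene (GenBank Accession No. AF260230) |
SwaI/BsiWI (8960-6315) | FBAIN::MAΔ5::Pex20, comprising: ● FBAIN: Yarrowia lipolytica FBAIN promoter (PCT Publication No. WO 2005/049805) ● MAΔ5: Mortierella alpina Δ5 desaturase gene (GenBank Accession No. AF067654) ● Pex20: Pex20 terminator sequence from the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
SwaI/ClaI (8960-11055) | TEF::MAΔ5::Lip1, comprising: ● TEF: Yarrowia lipolytica TEF promoter (GenBank Accession No. AF054508) ● MAΔ5: Mortierella alpina Δ5 desaturase gene (GenBank Accession No. AF067654) ● Lip1: Lip1 terminator sequence of the Yarrowia Lip1 gene (GenBank Accession No. Z50020) |
PmeI/ClaI (12690-11055) | Yarrowia Ura3 gene (GenBank Accession No. AJ306421) |
ClaI/PacI (1-2109) | TEF::HΔ5S::Pex16, comprising: ● TEF: Yarrowia lipolytica TEF promoter (GenBank Accession No. AF054508) ● HΔ5S: codon-optimized Δ5 desaturase gene (SEQ ID NO:18), derived from Homo sapiens (GenBank Accession No. NP_037534) ● Pex16: Pex16 terminator sequence of the Yarrowia Pex16 gene (GenBank Accession No. U75433) |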
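Plasmid pDMW271 was digested with AscI/SphI and then used to transform strain M4 according to the General Methods. Following transformation, the cells were plated onto MMLe plates and maintained at 30°C for 2 to 3 days. Individual colonies grown on the MMLe plates were picked and streaked onto MM and MMLe plates. Those colonies that could grow on MMLe plates but not on MM plates were selected as Leu2- strains. Single colonies of Leu2- strains were then inoculated into liquid MMLe media at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of ARA in the pDMW271 transformants, but not in the parental M4 strain. Specifically, among the 48 selected Leu2- transformants with pDMW271, 35 strains produced less than 5% ARA of total lipids, 12 strains produced 6-8% ARA, and 1 strain produced about 11% ARA of total lipids in the engineered Yarrowia. The strain that produced 11% ARA was named "Y2047".
Example 5
Expression of the Codon-Optimized Δ17 Desaturase Gene ("PsD17S") in Yarrowia lipolytica Strain Y2047
Plasmid pFmPsD17S (FIG. 3D; Example 3) was transformed into Yarrowia lipolytica strain Y2047 (Example 4) as described in the General Methods. The transformant cells were plated onto MM selection media plates and maintained at 30°C for 2 to 3 days. Eight transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 mL of liquid MM at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
The GC results showed that about 3% ARA and 2% EPA of total lipids were produced in all eight transformants. The conversion efficiency with which PsD17S converted ARA to EPA in these eight strains averaged about 40%. The conversion efficiency was measured according to the following formula: ([product]/[substrate+product])*100, where 'product' includes the immediate product and all products in the pathway derived from it. Thus, these experimental data demonstrate that the codon-optimized Δ17 desaturase gene (SEQ ID NO:3) derived from Phytophthora sojae efficiently desaturates ARA to EPA.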
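The conversion efficiency formula can be written as a small Python function. This is a sketch of the stated formula only; the function name and argument layout are hypothetical, and the inputs are the approximate averages quoted above (about 3% ARA and 2% EPA of total lipids), with EPA treated as the only product because no downstream products are reported here.

```python
def conversion_efficiency(substrate_pct: float, product_pcts: list) -> float:
    """([product] / [substrate + product]) * 100.

    product_pcts lists the immediate product plus every downstream product
    derived from it, each as a percentage of total fatty acids or lipids.
    """
    product = sum(product_pcts)
    total = substrate_pct + product
    if total <= 0:
        raise ValueError("substrate and product cannot both be zero")
    return 100.0 * product / total


# ARA -> EPA in strain Y2047 transformants: ~3% ARA remaining, ~2% EPA formed.
print(conversion_efficiency(3.0, [2.0]))
```

With these rounded inputs the function reproduces the reported efficiency of about 40%.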
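Example 6
Generation of Construct pZP3-P7U
The present Example describes the construction of plasmid pZP3-P7U, comprising a chimeric YAT::PsD17S::Lip1 gene, which was designed for integration into the Pox3 locus (GenBank Accession No. XP_503244) of the Yarrowia genome. Plasmid pZP3-P7U was utilized to test functional expression of PsD17S, as described in Example 8, infra. The components of pZP3-P7U are described in Table 9 below.
Table 9
Description of Plasmid pZP3-P7U (SEQ ID NO:20)
RE Sites and Nucleotides Within SEQ ID NO:20 | Description of Fragment and Chimeric Gene Components |
AscI/BsiWI (4030-4800) | 770 bp 5' portion of the Yarrowia Pox3 gene (GenBank Accession No. AJ001301) |
PacI/SphI (504-1300) | 826 bp 3' portion of the Yarrowia Pox3 gene (GenBank Accession No. AJ001301) |
ClaI/SwaI (7133-4960) | YAT::PsD17S::Lip1, comprising: ● YAT: Yarrowia YAT promoter (PCT Publication No. WO 2006/052754) ● PsD17S: codon-optimized Δ17 desaturase gene (SEQ ID NO:3), derived from Phytophthora sojae ● Lip1: Lip1 terminator sequence of the Yarrowia Lip1 gene (GenBank Accession No. Z50020) |
ClaI/EcoRI (7133-1) | LoxP::Ura3::LoxP, comprising: ● LoxP sequence (SEQ ID NO:21) ● Yarrowia Ura3 gene (GenBank Accession No. AJ306421) ● LoxP sequence (SEQ ID NO:21) |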
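Example 7
Generation of Yarrowia lipolytica Strain Y4070 to Produce About 12% ARA of Total Lipids Via the Δ9 Elongase/Δ8 Desaturase Pathway
The present Example describes the construction of strain Y4070, derived from Yarrowia lipolytica ATCC #20362, capable of producing about 12% ARA relative to the total lipids via expression of a Δ9 elongase/Δ8 desaturase pathway (FIG. 6A). Strain Y4070 was utilized to test the functional expression of PsD17S in Example 8 and of PrD17S in Example 12, infra.
The development of strain Y4070 required the construction of strain Y2224 (a FOA-resistant mutant arising from a spontaneous mutation of the Ura3 gene of wild-type Yarrowia strain ATCC #20362), strain Y4001 (producing 17% EDA with a Leu- phenotype), strain Y4001U (producing 17% EDA with a Leu- and Ura- phenotype), strain Y4036 (producing 18% DGLA with a Leu- phenotype) and strain Y4036U (producing 18% DGLA with a Leu- and Ura- phenotype).
Generation of Strain Y2224
Strain Y2224 was isolated in the following manner: Yarrowia lipolytica ATCC #20362 cells from a YPD agar plate (1% yeast extract, 2% bactopeptone, 2% glucose, 2% agar) were streaked onto an MM plate (75 mg/L each of uracil and uridine, 6.7 g/L YNB with ammonium sulfate but without amino acids, and 20 g/L glucose) containing 250 mg/L 5-FOA (Zymo Research). Plates were incubated at 28°C, and four of the resulting colonies were patched separately onto MM plates containing 200 mg/mL 5-FOA and onto MM plates lacking uracil and uridine, to confirm uracil (Ura3) auxotrophy.
Generation of Strain Y4001 to Produce About 17% EDA of Total Lipids
Strain Y4001 was created via integration of construct pZKLeuN-29E3 (FIG. 6B). This construct, comprising four chimeric genes (i.e., a Δ12 desaturase, a C16/18 elongase and two Δ9 elongases), was integrated into the Leu2 locus of strain Y2224, to thereby enable production of EDA.
Construct pZKLeuN-29E3 contained the components shown in Table 10.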
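Table 10
Description of Plasmid pZKLeuN-29E3 (SEQ ID NO:22)
RE Sites and Nucleotides Within SEQ ID NO:22 | Description of Fragment and Chimeric Gene Components |
BsiWI/AscI (7797-7002) | 788 bp 3' portion of the Yarrowia Leu2 gene (GenBank Accession No. AF260230) |
SphI/PacI (4302-3591) | 703 bp 5' portion of the Yarrowia Leu2 gene (GenBank Accession No. AF260230) |
SwaI/BsiWI (10500-7797) | GPD::F.D12::Pex20, comprising: ● GPD: Yarrowia lipolytica GPD promoter (PCT Publication No. WO 2005/003310) ● F.D12: Fusarium moniliforme Δ12 desaturase gene (PCT Publication No. WO 2005/047485) ● Pex20: Pex20 terminator sequence of the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
BglII/SwaI (12526-10500) | Exp pro::EgD9E::Lip1, comprising: ● Exp pro: Yarrowia lipolytica export protein (EXP1) promoter (PCT Publication No. WO 2006/052870 and U.S. Patent Application No. 11/265761) ● EgD9E: codon-optimized Δ9 elongase (SEQ ID NO:23), derived from Euglena gracilis ("EgD9eS"; U.S. Patent Application Nos. 11/601563 and 11/601564) ● Lip1: Lip1 terminator sequence of the Yarrowia Lip1 gene (GenBank Accession No. Z50020) |
PmeI/ClaI (12544-1) | FBAINm::EgD9S::Lip2, comprising: ● FBAINm: Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805) ● EgD9S: codon-optimized Δ9 elongase gene (SEQ ID NO:23), derived from Euglena gracilis ("EgD9eS"; U.S. Patent Application Nos. 11/601563 and 11/601564) ● Lip2: Lip2 terminator sequence of the Yarrowia Lip2 gene (GenBank Accession No. AJ012632) |
ClaI/EcoRI (1-1736) | LoxP::Ura3::LoxP, comprising: ● LoxP sequence (SEQ ID NO:21) ● Yarrowia Ura3 gene (GenBank Accession No. AJ306421) ● LoxP sequence (SEQ ID NO:21) |
EcoRI/PacI (1736-3591) | NT::ME3S::Pex16, comprising: ● NT: Yarrowia lipolytica YAT1 promoter (Patent Publication No. U.S. 2006/0094102-A1) ● ME3S: codon-optimized C16/18 elongase gene (SEQ ID NO:25), derived from Mortierella alpina (U.S. Patent Application No. 11/253882; see also PCT Publication No. WO 2006/052870) ● Pex16: Pex16 terminator sequence of the Yarrowia Pex16 gene (GenBank Accession No. U75433) |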
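Plasmid pZKLeuN-29E3 was digested with AscI/SphI and then used for transformation of Y. lipolytica strain Y2224 (i.e., ATCC #20362 Ura3-) according to the General Methods. The transformant cells were plated onto MMLeu media plates and maintained at 30°C for 2 to 3 days. Colonies were picked and streaked onto MM and MMLeu selection plates. Colonies that could grow on MMLeu plates but not on MM plates were selected as Leu- strains. Single colonies of Leu- strains were then inoculated into liquid MMLeu media at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of EDA in the transformants containing the four chimeric genes of pZKLeuN-29E3, but not in the Yarrowia Y2224 control strain. Most of the 36 selected Leu- strains produced about 12-16.9% EDA of total lipids. Three strains (i.e., strains #11, #30 and #34) produced about 17.4%, 17% and 17.5% EDA of total lipids; they were designated as strains Y4001, Y4002 and Y4003, respectively.
Generation of Strain Y4001U (Leu-, Ura-) to Produce About 17% EDA of Total Lipids
Strain Y4001U was created via transient expression of the Cre recombinase from plasmid pY116 (FIG. 6C) within strain Y4001, to produce a Leu- and Ura- phenotype. Construct pY116 contained the following components: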
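Table 11
Description of Plasmid pY116 (SEQ ID NO:27)
RE Sites and Nucleotides Within SEQ ID NO:27 | Description of Fragment and Chimeric Gene Components |
1328-448 | ColE1 plasmid origin of replication |
2258-1398 | Ampicillin-resistance gene (AmpR) for selection in E. coli |
3157-4461 | Yarrowia autonomous replication sequence (ARS18; GenBank Accession No. A17608) |
PacI/SwaI (6667-4504) | Yarrowia Leu2 gene (GenBank Accession No. AF260230) |
SwaI/PmeI (6667-218) | GPAT::Cre::XPR2, comprising: ● GPAT: Yarrowia lipolytica GPAT promoter (PCT Publication No. WO 2006/031937) ● Cre: Enterobacteria phage P1 Cre gene for recombinase protein (GenBank Accession No. X03453) ● XPR2: ~100 bp of the 3' region of the Yarrowia Xpr gene (GenBank Accession No. M17741) |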
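Plasmid pY116 was used for transformation of freshly grown Y4001 cells according to the General Methods. The transformant cells were plated onto MMLeu+Ura plates (MMU plus leucine) containing 280 μg/mL sulfonylurea and maintained at 30°C for 3 to 4 days. Four colonies were picked, inoculated into 3 mL of liquid YPD media at 30°C and shaken at 250 rpm for 1 day. The cultures were diluted 1:50,000 with liquid MMLeu+Ura media, and 100 μL was plated onto new YPD plates and maintained at 30°C for 2 days. Colonies were picked and streaked onto MMLeu and MMLeu+Ura selection plates. The colonies that could grow on MMLeu+Ura plates but not on MMLeu plates were selected and analyzed by GC to confirm the presence of C20:2 (EDA). One strain, having a Leu- and Ura- phenotype, produced about 17% EDA of total lipids and was designated as Y4001U.
Generation of Y4036 Strain to Produce About 18% DGLA of Total Lipids
Construct pKO2UF8289 (FIG. 7A; SEQ ID NO:28) was generated to integrate four chimeric genes (comprising a Δ12 desaturase, one Δ9 elongase and two mutant Δ8 desaturases) into the Δ12 locus of strain Y4001U1, to thereby enable production of DGLA. Construct pKO2UF8289 contained the following components: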
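Table 12
Description of Plasmid pKO2UF8289 (SEQ ID NO:28)
RE Sites and Nucleotides Within SEQ ID NO:28 | Description of Fragment and Chimeric Gene Components |
AscI/BsiWI (10304-9567) | 5' portion of the Yarrowia Δ12 desaturase gene (PCT Publication No. WO 2004/104167) |
EcoRI/SphI (13568-13012) | 3' portion of the Yarrowia Δ12 desaturase gene (PCT Publication No. WO 2004/104167) |
SwaI/BsiWI (7055-9567) | FBAINm::EgD8M::Pex20, comprising: ● FBAINm: Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805) ● EgD8M: synthetic mutant Δ8 desaturase EgD8S-23 (SEQ ID NO:29), derived from Euglena gracilis (U.S. Patent Application No. 11/635258) ● Pex20: Pex20 terminator sequence of the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
SwaI/PmeI (7055-4581) | YAT::F.D12::OCT, comprising: ● YAT: Yarrowia lipolytica YAT1 promoter (Patent Publication No. US 2006/0094102-A1) ● F.D12: Fusarium moniliforme Δ12 desaturase gene (PCT Publication No. WO 2005/047485) ● OCT: OCT terminator sequence of the Yarrowia OCT gene (GenBank Accession No. X69988) |
PmeI/PacI (4581-2124) | EXP::EgD8M::Pex16, comprising: ● EXP: Yarrowia lipolytica export protein (EXP1) promoter (PCT Publication No. WO 2006/052870 and U.S. Patent Application No. 11/265761) ● EgD8M: synthetic mutant Δ8 desaturase EgD8S-23 (SEQ ID NO:29), derived from Euglena gracilis (U.S. Patent Application No. 11/635258) ● Pex16: Pex16 terminator of the Yarrowia Pex16 gene (GenBank Accession No. U75433) |
PmeI/ClaI (2038-1) | GPAT::EgD9e::Lip2, comprising: ● GPAT: Yarrowia lipolytica GPAT promoter (PCT Publication No. WO 2006/031937) ● EgD9e: Euglena gracilis Δ9 elongase gene (SEQ ID NO:31) (U.S. Patent Application Nos. 11/601563 and 11/601564) ● Lip2: Lip2 terminator sequence of the Yarrowia Lip2 gene (GenBank Accession No. AJ012632) |
ClaI/EcoRI (13568-1) | LoxP::Ura3::LoxP, comprising: ● LoxP sequence (SEQ ID NO:21) ● Yarrowia Ura3 gene (GenBank Accession No. AJ306421) ● LoxP sequence (SEQ ID NO:21) |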
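The pKO2UF8289 plasmid was digested with AscI/SphI and then used for transformation of strain Y4001U1 according to the General Methods. The transformant cells were plated onto MMLeu plates and maintained at 30°C for 2 to 3 days. Colonies were picked, streaked onto MMLeu selection plates and maintained at 30°C for 2 days. These cells were then inoculated into liquid MMLeu at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of DGLA in the transformants containing the four chimeric genes of pKO2UF8289, but not in the parent Y4001U1 strain. Most of the 96 selected strains produced between 7% and 13% DGLA of total lipids. Six strains (i.e., #32, #42, #60, #68, #72 and #94) produced about 15%, 13.8%, 18.2%, 13.1%, 15.6% and 13.9% DGLA of total lipids. These six strains were designated as Y4034, Y4035, Y4036, Y4037, Y4038 and Y4039, respectively.
Generation of Strain Y4036U (Leu-, Ura3-) to Produce About 18% DGLA of Total Lipids
Construct pY116 (FIG. 6C; SEQ ID NO:27) was utilized to transiently express the Cre recombinase in strain Y4036. This released the Ura3 gene flanked by LoxP sites from the genome.
Plasmid pY116 was used to transform strain Y4036 according to the General Methods. Following transformation, the cells were plated onto MMLeu+Ura plates (MMU plus leucine) and maintained at 30°C for 2 to 3 days. Individual colonies grown on the MMLeu+Ura plates were picked, inoculated into YPD liquid media at 30°C and shaken at 250 rpm for 1 day to cure the pY116 plasmid. The grown cultures were streaked onto MMLeu+Ura plates. After two days of growth at 30°C, individual colonies were re-streaked onto MMLeu+Ura, MMU and MMLeu plates. Those colonies that could grow on MMLeu+Ura but not on MMU or MMLeu plates were selected. One of these strains, having a Leu- and Ura- phenotype, was designated as Y4036U (Ura-, Leu-).
Generation of Y4070 Strain to Produce About 12% ARA of Total Lipids
Construct pZKSL-555R (FIG. 7B; SEQ ID NO:32) was generated to integrate three Δ5 desaturase genes into the Lys locus of strain Y4036U, to thereby enable production of ARA. The pZKSL-555R plasmid contained the following components: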
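Table 13
Description of Plasmid pZKSL-555R (SEQ ID NO:32)
RE Sites and Nucleotides Within SEQ ID NO:32 | Description of Fragment and Chimeric Gene Components |
AscI/BsiWI (3321-2601) | 720 bp 5' portion of the Yarrowia Lys5 gene (GenBank Accession No. M34929) |
PacI/SphI (6716-6029) | 687 bp 3' portion of the Yarrowia Lys5 gene (GenBank Accession No. M34929) |
BglII/BsiWI (15-2601) | EXP::EgD5S::Pex20, comprising: ● EXP: Yarrowia lipolytica export protein (EXP1) promoter (WO 2006/052870 and U.S. Patent Application No. 11/265761) ● EgD5S: codon-optimized Δ5 desaturase (SEQ ID NO:33), derived from Euglena gracilis (U.S. Provisional Application No. 60/801172) ● Pex20: Pex20 terminator sequence of the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
ClaI/PmeI (11243-1) | YAT::RD5S::OCT, comprising: ● YAT: Yarrowia lipolytica YAT1 promoter (Patent Publication No. US 2006/0094102-A1) ● RD5S: codon-optimized Δ5 desaturase (SEQ ID NO:35), derived from Peridinium sp. CCMP626 (U.S. Provisional Application No. 60/801119) ● OCT: OCT terminator sequence of the Yarrowia OCT gene (GenBank Accession No. X69988) |
EcoRI/PacI (9500-6716) | FBAIN::EgD5WT::Aco, comprising: ● FBAIN: Yarrowia lipolytica FBAIN promoter (WO 2005/049805) ● EgD5WT: Euglena gracilis Δ5 desaturase (SEQ ID NO:35; U.S. Provisional Application No. 60/801172), with the internal BglII, HindIII and NcoI restriction enzyme sites removed ● Aco: Aco terminator of the Yarrowia Aco gene (GenBank Accession No. AJ001300) |
EcoRI/ClaI (9500-11243) | Yarrowia Leu2 gene (GenBank Accession No. M37309) |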
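The pZKSL-555R plasmid was digested with AscI/SphI and then used for transformation of strain Y4036U according to the General Methods. The transformant cells were plated onto MMLeuLys plates (MMLeu plus lysine) and maintained at 30°C for 2 to 3 days. Single colonies were then re-streaked onto MMLeuLys plates, and these cells were subsequently inoculated into liquid MMLeuLys at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of ARA in the transformants containing the three chimeric genes of pZKSL-555R, but not in the parent Y4036 strain. Most of the 96 selected strains produced about 10% ARA of total lipids. Four strains (i.e., #57, #58, #69 and #75) produced about 11.7%, 11.8%, 11.9% and 11.7% ARA of total lipids. These four strains were designated as Y4068, Y4069, Y4070 and Y4071, respectively. Further analyses showed that the three chimeric genes of pZKSL-555R were not integrated into the Lys5 site in the Y4068, Y4069, Y4070 and Y4071 strains. All strains possessed a Lys+ phenotype. The final genotype of strain Y4070 with respect to wild-type Yarrowia lipolytica ATCC #20362 was Ura3-, Leu+, Lys+, GPD::F.D12::Pex20, YAT::F.D12::OCT, YAT::ME3S::Pex16, GPAT::EgD9e::Lip2, Exp::EgD9eS::Lip1, FBAINm::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, EXP::EgD8M::Pex16, FBAIN::EgD5WT::Aco, EXP::EgD5S::Pex20, YAT::RD5S::OCT.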
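Example 8
Expression of the Codon-Optimized Δ17 Desaturase Gene ("PsD17S") in Yarrowia lipolytica Strain Y4070
Plasmid pZP3-P7U (FIG. 5; Example 6) was digested with AscI/SphI and then used to transform Yarrowia lipolytica strain Y4070 according to the General Methods. Following transformation, the cells were plated onto MM plates and maintained at 30°C for 2 to 3 days. Twelve transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 mL of liquid MM at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, resuspended in High Glucose Media, and then grown for 5 days at 30°C with shaking at 250 rpm. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
The GC results are shown in Table 14 below. Fatty acids are identified as 16:0, 16:1, 18:0, 18:1 (oleic acid), LA (18:2), ALA, EDA (20:2), DGLA, ARA, ETrA (20:3), ETA and EPA; the composition of each is presented as a % of the total fatty acids. "Δ17 activity" was calculated according to the following formula: ([EPA]/[ARA+EPA])*100, and represents the percent substrate conversion to EPA. "Δ15 activity" was calculated according to the following formula: ([ALA]/[LA+ALA])*100, and represents the percent substrate conversion to ALA.
The GC results demonstrated that an average of 0.3% ARA and 11.5% EPA of total lipids was produced in the 12 transformants. The conversion efficiency with which PsD17S converted ARA to EPA in these 12 strains averaged about 98%. The conversion efficiency was measured according to the following formula: ([product]/[substrate+product])*100, where 'product' includes the immediate product and all products in the pathway derived from it. Thus, these experimental data demonstrate that the codon-optimized Δ17 desaturase gene (SEQ ID NO:3) derived from Phytophthora sojae efficiently desaturates ARA to EPA.
An average of 5% ALA and 29.1% LA of total lipids was produced in the 12 transformants. The conversion efficiency with which PsD17S converted LA to ALA in these 12 strains averaged about 15%. Additionally, ETrA and ETA were produced in all 12 strains, suggesting that PsD17S may convert EDA to ETrA and DGLA to ETA; however, the conversion efficiencies are difficult to calculate in these two cases, because the Δ9 elongase in strain Y4070 can convert ALA to ETrA and the Δ8 desaturase can convert ETrA to ETA.
Clearly, PsD17S possesses both Δ17 and Δ15 desaturase activity and thus functions as a bifunctional desaturase, as defined herein.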
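Example 9
Identification of a Phytophthora ramorum Gene Encoding a Δ17 Desaturase
The U.S. Department of Energy's Joint Genome Institute ("JGI"; Walnut Creek, CA) created version 1.0 of the Phytophthora ramorum genome (estimated genome size of 65 Mbp). This genomic sequence was generated using a whole genome shotgun strategy and comprises a total of 16,066 gene models.
In a manner similar to that described in Example 1, the amino acid sequence of PiD17 (SEQ ID NO:9) was used as a query sequence to perform a TBLASTN search against JGI's Phytophthora ramorum database (using the default parameters available from JGI).
Two ORFs in the genome sequence of Phytophthora ramorum were found to share extensive homology with PiD17. Specifically, ORF 80222 shared 89% identity and 94% similarity with SEQ ID NO:9, with an Expectation value of 0. Similarly, ORF 48790 shared 40% identity and 61% similarity with SEQ ID NO:9, with an Expectation value of 6E-44. Based on these results, ORF 80222 was tentatively identified as a Δ17 desaturase and was designated "PrD17".
When the 1086 bp DNA sequence of PrD17 (SEQ ID NO:5) was retrieved from the database, it was found to encode a polypeptide of 361 amino acids in length (SEQ ID NO:6). Amino acid sequence alignment using a ClustalW analysis (MegAlign™ program of DNASTAR software) revealed 89.5% identity between PiD17 and PrD17; in contrast, the nucleotide sequences shared only 85.7% identity.
PrD17 (SEQ ID NO:6) was in turn used as the query sequence in a protein-protein BLAST search to determine its sequence homology to all publicly available protein sequences contained in the "nr" database (see General Methods). The sequence showing the highest degree of similarity was the ω-3 fatty acid desaturase of Saprolegnia diclina (GenBank Accession No. AAR20444), which shared 59% identity and 74% similarity, with an Expectation value of E-124. Additionally, PrD17 had 38% identity and 57% similarity with the amino acid sequence of the fatty acid desaturase of Anabaena variabilis ATCC #29413 (GenBank Accession No. ABA23809), with an Expectation value of 6E-61.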
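Example 10
Synthesis of a Codon-Optimized Δ17 Desaturase Gene ("PrD17S") for Yarrowia lipolytica
The codon usage of the Δ17 desaturase gene of Phytophthora ramorum was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in Example 2. Specifically, a codon-optimized Δ17 desaturase gene (designated "PrD17S", SEQ ID NO:7) was designed based on the coding sequence of PrD17 (SEQ ID NOs:5 and 6), according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1-2):11-23 (2001)). In addition to modification of the translation initiation site, 168 bp of the 1086 bp coding region were modified (15.5%) and 160 codons were optimized (44.2%). The GC content was reduced from 64.4% in the wild-type gene (i.e., PrD17) to 54.5% in the synthetic gene (i.e., PrD17S). An NcoI site and a NotI site were incorporated around the translation initiation codon and after the stop codon of PrD17S (SEQ ID NO:7), respectively. FIG. 8 shows a comparison of the nucleotide sequences of PrD17 and PrD17S. None of the modifications in the codon-optimized gene changed the amino acid sequence of the encoded protein (SEQ ID NO:6). The designed PrD17S gene was synthesized by GenScript Corporation (Piscataway, NJ) and cloned into pUC57 (GenBank Accession No. Y14837) to generate pPrD17S (FIG. 9A; SEQ ID NO:38).
Example 11
Generation of Construct pZuFPrD17S
The present Example describes the construction of plasmid pZuFPrD17S (FIG. 9B), comprising a chimeric FBAIN::PrD17S::Pex20 gene. Plasmid pZuFPrD17S was utilized to test functional expression of PrD17S, as described in Example 12, infra.
Specifically, plasmid pZuFPrD17S was constructed by using the NcoI/NotI fragment of plasmid pPrD17S, which contains the PrD17S coding region, to replace the NcoI/NotI fragment of pZuF17 (FIG. 9C; SEQ ID NO:39). The components of pZuFPrD17S are thus as described in Table 15 below.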
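Table 15
Description of Plasmid pZuFPrD17S (SEQ ID NO:40)
RE Sites and Nucleotides Within SEQ ID NO:40 | Description of Fragment and Chimeric Gene Components |
EcoRI/ClaI (4203-5599) | Yarrowia autonomous replication sequence (ARS18; GenBank Accession No. A17608) |
EcoRI/BsiWI (7152-1407) | FBAIN::PrD17S::Pex20, comprising: ● FBAIN: Yarrowia lipolytica FBAIN promoter (PCT Publication No. WO 2005/049805) ● PrD17S: codon-optimized Δ17 desaturase gene (SEQ ID NO:7), derived from Phytophthora ramorum ● Pex20: Pex20 terminator sequence of the Yarrowia Pex20 gene (GenBank Accession No. AF054613) |
2443-1563 | ColE1 plasmid origin of replication |
3373-2513 | Ampicillin-resistance gene (AmpR) |
EcoRI/PacI (7130-5619) | Yarrowia Ura3 gene (GenBank Accession No. AJ306421) |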
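Example 12
Expression of the Codon-Optimized Δ17 Desaturase Gene ("PrD17S") in Yarrowia lipolytica Strain Y4070
Plasmid pZuFPrD17S (FIG. 9B; Example 11) was transformed into Yarrowia lipolytica strain Y4070 (Example 7) as described in the General Methods. The transformant cells were plated onto MM selection media plates and maintained at 30°C for 2 to 3 days. Ten transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 mL of liquid MM at 30°C and shaken at 250 rpm for 2 days. The cells were collected by centrifugation, resuspended in High Glucose Media, and then grown for 5 days at 30°C with shaking at 250 rpm. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by transesterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
The GC results are shown in Table 16 below. Fatty acids are identified as 16:0, 16:1, 18:0, 18:1 (oleic acid), LA (18:2), ALA, EDA (20:2), DGLA, ARA, ETrA (20:3), ETA and EPA; the composition of each is presented as a % of the total fatty acids. "Δ17 activity" was calculated according to the following formula: ([EPA]/[ARA+EPA])*100, and represents the percent substrate conversion to EPA. "Δ15 activity" was calculated according to the following formula: ([ALA]/[LA+ALA])*100, and represents the percent substrate conversion to ALA.
Table 16 shows that an average of 7.5% ARA and 7.2% EPA of total lipids was produced in the 10 transformants. The conversion efficiency with which PrD17S converted ARA to EPA in these 10 strains averaged about 49%. The conversion efficiency was measured as described in Example 8. Thus, these experimental data demonstrate that the codon-optimized Δ17 desaturase gene (SEQ ID NO:7) derived from Phytophthora ramorum efficiently desaturates ARA to EPA.
An average of 0.4% ALA and 39.6% LA (C18:2) of total lipids was produced in the 10 transformants. The conversion efficiency with which PrD17S converted LA to ALA in these 10 strains averaged only about 1%.
Thus, PrD17S is identified herein as a monofunctional Δ17 desaturase, which catalyzes the conversion of ARA to EPA.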
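The contrast between the bifunctional PsD17S and the monofunctional PrD17S can be illustrated by applying the Δ17 and Δ15 activity formulas to the average compositions reported in Examples 8 and 12. The Python sketch below is illustrative only: the class and field names are hypothetical, and it computes a ratio of averages, whereas the text reports averages over individual strains, so small differences (e.g., 97.5% here versus "about 98%" in Example 8) are expected.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MeanProfile:
    """Average fatty acid levels (% of total) across a set of transformants."""
    name: str
    la: float
    ala: float
    ara: float
    epa: float

    def delta17_activity(self) -> float:
        # ([EPA] / [ARA + EPA]) * 100
        return 100.0 * self.epa / (self.ara + self.epa)

    def delta15_activity(self) -> float:
        # ([ALA] / [LA + ALA]) * 100
        return 100.0 * self.ala / (self.la + self.ala)


# Averages quoted in Example 8 (PsD17S) and Example 12 (PrD17S).
profiles = [
    MeanProfile("PsD17S", la=29.1, ala=5.0, ara=0.3, epa=11.5),
    MeanProfile("PrD17S", la=39.6, ala=0.4, ara=7.5, epa=7.2),
]

for p in profiles:
    print(f"{p.name}: delta-17 {p.delta17_activity():.1f}%, "
          f"delta-15 {p.delta15_activity():.1f}%")
```

No numerical threshold for calling an enzyme bifunctional is given in this section, so the sketch reports the two activities side by side rather than classifying them.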
SEQUENCE LISTING
<110>E. I. du Pont de Nemours and Company
<120>Δ17 desaturases and their use in making polyunsaturated fatty acids
<130>CL3479PCT
<160>40
<170>PatentIn version 3.4
<210>1
<211>1092
<212>DNA
<213>Phytophthora sojae
<400>1
atggcgtcca agcaggagca gccgtaccag ttcccgacgc tgacggagat caagcgctcg 60
ctgcccagcg agtgtttcga ggcgtccgtg ccgctctcgc tctactacac ggtgcgctgc 120
ctggtgatcg ccgtgtcgct ggccttcggg ctccaccacg cgcgctcgct gcccgtggtc 180
gagggcctct gggcgctgga cgccgcgctc tgcacgggct acgtgctgct gcagggcatc 240
gtgttctggg gcttcttcac cgtgggccat gacgccggcc acggcgcctt ctcgcgctac 300
cacctgctca acttcgtgat cggcaccttc atccactcgc tcatcctgac gcccttcgag 360
tcgtggaagc tcacgcaccg ccaccaccac aagaacacgg gcaacatcga ccgcgacgag 420
atcttctacc cgcagcgcaa ggccgacgac cacccgctct cgcgtaacct catcctggcg 480
ctgggcgccg cgtggttcgc ctacctggtc gagggcttcc cgccgcgcaa ggtcaaccac 540
ttcaacccgt tcgagccgct gttcgtccgc caggtgtccg ccgtggtcat ctcgctggcc 600
gcgcacttcg gcgtggccgc gctgtccatc tacctgagcc tgcagttcgg cttcaagacc 660
atggctatct actactacgg gcccgtgttc gtgttcggca gcatgctggt catcaccacc 720
ttcctgcacc acaacgacga ggagaccccc tggtacgccg actcggagtg gacctacgtc 780
aagggcaacc tctcgtcggt cgaccgctcc tacggcgcgc tcatcgacaa cctgagccac 840
aacatcggca cgcaccagat ccaccacctc ttccccatca tcccgcacta taagctcaag 900
cgcgccaccg aggccttcca ccaggcgttc cccgagctcg tgcgcaagag cgacgagccc 960
atcattaagg ccttcttccg cgtcggccgc ctctacgcca actacggcgt cgtggactcg 1020
gacgccaagc tcttcacgct caaggaggcc aaggccgtgt ccgaggcggc gaccaagact 1080
aaggccaact ga 1092
<210>2
<211>363
<212>PRT
<213>Phytophthora sojae
<400>2
Met Ala Ser Lys Gln Glu Gln Pro Tyr Gln Phe Pro Thr Leu Thr Glu
1 5 10 15
Ile Lys Arg Ser Leu Pro Ser Glu Cys Phe Glu Ala Ser Val Pro Leu
20 25 30
Ser Leu Tyr Tyr Thr Val Arg Cys Leu Val Ile Ala Val Ser Leu Ala
35 40 45
Phe Gly Leu His His Ala Arg Ser Leu Pro Val Val Glu Gly Leu Trp
50 55 60
Ala Leu Asp Ala Ala Leu Cys Thr Gly Tyr Val Leu Leu Gln Gly Ile
65 70 75 80
Val Phe Trp Gly Phe Phe Thr Val Gly His Asp Ala Gly His Gly Ala
85 90 95
Phe Ser Arg Tyr His Leu Leu Asn Phe Val Ile Gly Thr Phe Ile His
100 105 110
Ser Leu Ile Leu Thr Pro Phe Glu Ser Trp Lys Leu Thr His Arg His
115 120 125
His His Lys Asn Thr Gly Asn Ile Asp Arg Asp Glu Ile Phe Tyr Pro
130 135 140
Gln Arg Lys Ala Asp Asp His Pro Leu Ser Arg Asn Leu Ile Leu Ala
145 150 155 160
Leu Gly Ala Ala Trp Phe Ala Tyr Leu Val Glu Gly Phe Pro Pro Arg
165 170 175
Lys Val Asn His Phe Asn Pro Phe Glu Pro Leu Phe Val Arg Gln Val
180 185 190
Ser Ala Val Val Ile Ser Leu Ala Ala His Phe Gly Val Ala Ala Leu
195 200 205
Ser Ile Tyr Leu Ser Leu Gln Phe Gly Phe Lys Thr Met Ala Ile Tyr
210 215 220
Tyr Tyr Gly Pro Val Phe Val Phe Gly Ser Met Leu Val Ile Thr Thr
225 230 235 240
Phe Leu His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser Glu
245 250 255
Trp Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr Gly
260 265 270
Ala Leu Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile His
275 280 285
His Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Lys Arg Ala Thr Glu
290 295 300
Ala Phe His Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Asp Glu Pro
305 310 315 320
Ile Ile Lys Ala Phe Phe Arg Val Gly Arg Leu Tyr Ala Asn Tyr Gly
325 330 335
Val Val Asp Ser Asp Ala Lys Leu Phe Thr Leu Lys Glu Ala Lys Ala
340 345 350
Val Ser Glu Ala Ala Thr Lys Thr Lys Ala Asn
355 360
<210>3
<211>1086
<212>DNA
<213>Phytophthora sojae
<220>
<221>misc_feature
<223>synthetic Δ17 desaturase (codon-optimized)
<400>3
atggctacca agcagcccta ccagttccct actctgaccg agatcaagcg atctctgccc 60
tccgagtgtt tcgaggcctc cgtgcctctc tctctgtact acaccgttcg atgcctggtc 120
attgctgtgt cgctcgcctt cggacttcac catgcacgat ctctgcccgt tgtcgaaggc 180
ctctgggctc tggatgccgc tctctgcacc ggttacgtgc tgctccaggg catcgtcttc 240
tggggattct ttactgttgg tcacgacgct ggacatggtg ccttctcccg ataccacctg 300
ctcaactttg tcatcggaac cttcattcac tctctcatcc ttacaccctt cgagtcctgg 360
aagctcaccc acagacacca tcacaagaac actggcaaca tcgaccgaga cgaaatcttc 420
taccctcaac gaaaggccga cgatcatcct ctgtctcgaa acctcattct ggctttgggt 480
gcagcctggt ttgcctacct ggtcgaaggc tttcctcccc gaaaggtcaa ccacttcaac 540
cccttcgagc ctctctttgt tcgacaggtc tctgccgtgg tcatttcgct ggctgcgcac 600
tttggagtgg ctgccctgtc catctacctc agcctgcagt tcggcttcaa gactatggcc 660
atctactact atggtcccgt ctttgtgttc ggatccatgc tcgtcattac tacctttctt 720
catcacaacg acgaagagac accttggtac gcagattcgg agtggaccta cgtcaaaggc 780
aacctgtcct ctgtcgaccg atcctacggt gccctcatcg acaacctttc tcacaacatc 840
ggaacccacc agattcatca cctctttccc atcattcctc actacaagct caagcgagct 900
accgaggcct tccatcaagc ctttcccgag ctggttcgaa agtccgacga acccatcatc 960
aaggcctttt tcagagtcgg ccgactctac gcaaactacg gtgtggtcga ctcggatgcc 1020
aagctgttca ctctcaagga ggccaaggct gtttccgaag ccgctaccaa gactaaggcc 1080
acctaa 1086
<210>4
<211>361
<212>PRT
<213>Phytophthora sojae
<220>
<221>misc_feature
<223>synthetic Δ17 desaturase (codon-optimized)
<400>4
Met Ala Thr Lys Gln Pro Tyr Gln Phe Pro Thr Leu Thr Glu Ile Lys
1 5 10 15
Arg Ser Leu Pro Ser Glu Cys Phe Glu Ala Ser Val Pro Leu Ser Leu
20 25 30
Tyr Tyr Thr Val Arg Cys Leu Val Ile Ala Val Ser Leu Ala Phe Gly
35 40 45
Leu His His Ala Arg Ser Leu Pro Val Val Glu Gly Leu Trp Ala Leu
50 55 60
Asp Ala Ala Leu Cys Thr Gly Tyr Val Leu Leu Gln Gly Ile Val Phe
65 70 75 80
Trp Gly Phe Phe Thr Val Gly His Asp Ala Gly His Gly Ala Phe Ser
85 90 95
Arg Tyr His Leu Leu Asn Phe Val Ile Gly Thr Phe Ile His Ser Leu
100 105 110
Ile Leu Thr Pro Phe Glu Ser Trp Lys Leu Thr His Arg His His His
115 120 125
Lys Asn Thr Gly Asn Ile Asp Arg Asp Glu Ile Phe Tyr Pro Gln Arg
130 135 140
Lys Ala Asp Asp His Pro Leu Ser Arg Asn Leu Ile Leu Ala Leu Gly
145 150 155 160
Ala Ala Trp Phe Ala Tyr Leu Val Glu Gly Phe Pro Pro Arg Lys Val
165 170 175
Asn His Phe Asn Pro Phe Glu Pro Leu Phe Val Arg Gln Val Ser Ala
180 185 190
Val Val Ile Ser Leu Ala Ala His Phe Gly Val Ala Ala Leu Ser Ile
195 200 205
Tyr Leu Ser Leu Gln Phe Gly Phe Lys Thr Met Ala Ile Tyr Tyr Tyr
210 215 220
Gly Pro Val Phe Val Phe Gly Ser Met Leu Val Ile Thr Thr Phe Leu
225 230 235 240
His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser Glu Trp Thr
245 250 255
Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr Gly Ala Leu
260 265 270
Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile His His Leu
275 280 285
Phe Pro Ile Ile Pro His Tyr Lys Leu Lys Arg Ala Thr Glu Ala Phe
290 295 300
His Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Asp Glu Pro Ile Ile
305 310 315 320
Lys Ala Phe Phe Arg Val Gly Arg Leu Tyr Ala Asn Tyr Gly Val Val
325 330 335
Asp Ser Asp Ala Lys Leu Phe Thr Leu Lys Glu Ala Lys Ala Val Ser
340 345 350
Glu Ala Ala Thr Lys Thr Lys Ala Thr
355 360
<210>5
<211>1086
<212>DNA
<213>Phytophthora ramorum
<400>5
atggcgacta agcagccgta ccagttcccg accctgacgg agatcaagcg gtcgctgccc 60
agcgagtgct ttgaggcctc ggtgccgctg tcgctctact acacggtgcg catcgtggcc 120
atcgccgtgg cgctggcgtt cggcctcaac tacgcgcgcg cgctgcccgt ggtcgagagc 180
ttgtgggcgc tggacgctgc gctctgctgc ggttacgtgc tgctgcaggg catcgtgttc 240
tggggcttct tcacggtggg ccatgacgcc ggccacggcg ccttctcgcg ttaccacctg 300
ctcaacttcg tggtgggcac cttcatccac tcgctcatcc tcacgccctt cgagtcgtgg 360
aagctcacgc accgccacca ccacaagaac acgggcaaca ttgaccgcga cgagatcttc 420
tacccgcagc gcaaggccga cgaccacccg ctgtcgcgca acctcgtgct ggcgctcggc 480
gccgcgtggt tcgcctacct ggtcgagggc ttcccgcccc gcaaggtcaa ccacttcaac 540
ccattcgagc cgctgtttgt gcgccaggtg gccgccgtcg tcatctcgct ctccgcgcac 600
ttcgccgtgt tggcgctgtc cgtgtatctg agcttccagt tcggtctcaa gaccatggcg 660
ctctactact acggccccgt cttcgtgttc ggcagcatgc ttgtgatcac caccttcctg 720
catcacaatg acgaggagac cccatggtac ggagactccg actggaccta cgtcaagggc 780
aacctgtcgt ccgtggaccg gtcctacggc gcgttcatcg acaacctgag ccacaacatc 840
ggcacgcacc agatccacca cctcttcccc atcatcccgc actacaagct caaccgcgct 900
acggcggcat tccaccaggc cttccccgag ctcgtgcgca agagcgacga gccgatcctc 960
aaggccttct ggcgcgtcgg ccgactgtac gccaactacg gcgtcgtgga cccggacgcc 1020
aagctcttca cgctcaagga ggccaaggcg gcgtccgagg cggcgaccaa gaccaaggcc 1080
acctaa 1086
<210>6
<211>361
<212>PRT
<213>Phytophthora ramorum
<400>6
Met Ala Thr Lys Gln Pro Tyr Gln Phe Pro Thr Leu Thr Glu Ile Lys
1 5 10 15
Arg Ser Leu Pro Ser Glu Cys Phe Glu Ala Ser Val Pro Leu Ser Leu
20 25 30
Tyr Tyr Thr Val Arg Ile Val Ala Ile Ala Val Ala Leu Ala Phe Gly
35 40 45
Leu Asn Tyr Ala Arg Ala Leu Pro Val Val Glu Ser Leu Trp Ala Leu
50 55 60
Asp Ala Ala Leu Cys Cys Gly Tyr Val Leu Leu Gln Gly Ile Val Phe
65 70 75 80
Trp Gly Phe Phe Thr Val Gly His Asp Ala Gly His Gly Ala Phe Ser
85 90 95
Arg Tyr His Leu Leu Asn Phe Val Val Gly Thr Phe Ile His Ser Leu
100 105 110
Ile Leu Thr Pro Phe Glu Ser Trp Lys Leu Thr His Arg His His His
115 120 125
Lys Asn Thr Gly Asn Ile Asp Arg Asp Glu Ile Phe Tyr Pro Gln Arg
130 135 140
Lys Ala Asp Asp His Pro Leu Ser Arg Asn Leu Val Leu Ala Leu Gly
145 150 155 160
Ala Ala Trp Phe Ala Tyr Leu Val Glu Gly Phe Pro Pro Arg Lys Val
165 170 175
Asn His Phe Asn Pro Phe Glu Pro Leu Phe Val Arg Gln Val Ala Ala
180 185 190
Val Val Ile Ser Leu Ser Ala His Phe Ala Val Leu Ala Leu Ser Val
195 200 205
Tyr Leu Ser Phe Gln Phe Gly Leu Lys Thr Met Ala Leu Tyr Tyr Tyr
210 215 220
Gly Pro Val Phe Val Phe Gly Ser Met Leu Val Ile Thr Thr Phe Leu
225 230 235 240
His His Asn Asp Glu Glu Thr Pro Trp Tyr Gly Asp Ser Asp Trp Thr
245 250 255
Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr Gly Ala Phe
260 265 270
Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile His His Leu
275 280 285
Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Arg Ala Thr Ala Ala Phe
290 295 300
His Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Asp Glu Pro Ile Leu
305 310 315 320
Lys Ala Phe Trp Arg Val Gly Arg Leu Tyr Ala Asn Tyr Gly Val Val
325 330 335
Asp Pro Asp Ala Lys Leu Phe Thr Leu Lys Glu Ala Lys Ala Ala Ser
340 345 350
Glu Ala Ala Thr Lys Thr Lys Ala Thr
355 360
<210>7
<211>1086
<212>DNA
<213>Phytophthora ramorum
<220>
<221>misc_feature
<223>synthetic Δ17 desaturase (codon-optimized)
<400>7
atggctacca agcagcccta ccagttccct actctgaccg agatcaagcg atctcttccc 60
tccgagtgct ttgaagcctc ggtccctctg tccttgtact acaccgtgcg aatcgtcgct 120
attgccgttg ctctggcctt cggactcaac tacgctcgag cccttcccgt ggtcgagtct 180
ctgtgggcac tcgacgctgc cctttgttgc ggttacgttc tgctccaagg cattgtcttc 240
tggggattct ttaccgtggg tcacgatgct ggacatggtg ccttctctcg ataccacctg 300
ctcaactttg tcgttggcac ctttatccac tccctcattc ttactccctt cgagtcgtgg 360
aagctcacac atcgacacca tcacaagaac accggaaaca tcgaccgaga cgaaatcttc 420
taccctcagc gaaaggccga cgatcatcct ctgtctcgaa acctcgtcct ggctctcggt 480
gccgcttggt ttgcctacct tgtcgagggc tttcctcccc gaaaggtcaa ccacttcaac 540
cccttcgaac ctctgtttgt gcgacaggtg gctgccgttg tcatttccct ctctgctcac 600
ttcgccgtcc tggcactgtc cgtgtatctg agctttcagt tcggtctcaa gacaatggct 660
ctgtactact atggacccgt cttcgtgttc ggctccatgc tcgtcattac tacctttctg 720
catcacaatg acgaggaaac tccttggtac ggagattccg actggaccta cgtcaagggc 780
aacttgtctt ccgtggaccg atcttacggt gccttcatcg acaacctctc gcacaacatt 840
ggcacacacc agatccacca tctgtttccc atcattcctc actacaagct caaccgagcc 900
accgctgcct tccaccaggc ctttcccgaa cttgtccgaa agagcgacga gcccattctc 960
aaggctttct ggagagttgg tcgactttac gccaactacg gagtcgtgga tcccgacgca 1020
aagctgttta ctctcaagga ggccaaagct gcctccgagg ctgccaccaa gaccaaggct 1080
acttaa 1086
<210>8
<211>1086
<212>DNA
<213>Phytophthora infestans (GenBank Accession No. CAJ30870)
<400>8
atggcgacga aggaggcgta tgtgttcccc actctgacgg agatcaagcg gtcgctacct 60
aaagactgtt tcgaggcttc ggtgcctctg tcgctctact acaccgtgcg ttgtctggtg 120
atcgcggtgg ctctaacctt cggtctcaac tacgctcgcg ctctgcccga ggtcgagagc 180
ttctgggctc tggacgccgc actctgcacg ggctacatct tgctgcaggg catcgtgttc 240
tggggcttct tcacggtggg ccacgatgcc ggccacggcg ccttctcgcg ctaccacctg 300
cttaacttcg tggtgggcac tttcatgcac tcgctcatcc tcacgccctt cgagtcgtgg 360
aagctcacgc accgtcacca ccacaagaac acgggcaaca ttgaccgtga cgaggtcttc 420
tacccgcaac gcaaggccga cgaccacccg ctgtctcgca acctgattct ggcgctcggg 480
gcagcgtggc tcgcctattt ggtcgagggc ttccctcctc gtaaggtcaa ccacttcaac 540
ccgttcgagc ctctgttcgt gcgtcaggtg tcagctgtgg taatctctct tctcgcccac 600
ttcttcgtgg ccggactctc catctatctg agcctccagc tgggccttaa gacgatggca 660
atctactact atggacctgt ttttgtgttc ggcagcatgc tggtcattac caccttccta 720
caccacaatg atgaggagac cccatggtac gccgactcgg agtggacgta cgtcaagggc 780
aacctctcgt ccgtggaccg atcgtacggc gcgctcattg acaacctgag ccacaacatc 840
ggcacgcacc agatccacca ccttttccct atcattccgc actacaaact caagaaagcc 900
actgcggcct tccaccaggc tttccctgag ctcgtgcgca agagcgacga gccaattatc 960
aaggctttct tccgggttgg acgtctctac gcaaactacg gcgttgtgga ccaggaggcg 1020
aagctcttca cgctaaagga agccaaggcg gcgaccgagg cggcggccaa gaccaagtcc 1080
acgtaa 1086
<210>9
<211>361
<212>PRT
<213>Phytophthora infestans
<300>
<302>Method for the transgenic production of unsaturated ω3 fatty acids
<308>CAJ30870
<309>2005-09-21
<310>WO 2005083053
<311>2005-02-23
<312>2005-09-09
<313>(1)..(361)
<400>9
Met Ala Thr Lys Glu Ala Tyr Val Phe Pro Thr Leu Thr Glu Ile Lys
1 5 10 15
Arg Ser Leu Pro Lys Asp Cys Phe Glu Ala Ser Val Pro Leu Ser Leu
20 25 30
Tyr Tyr Thr Val Arg Cys Leu Val Ile Ala Val Ala Leu Thr Phe Gly
35 40 45
Leu Asn Tyr Ala Arg Ala Leu Pro Glu Val Glu Ser Phe Trp Ala Leu
50 55 60
Asp Ala Ala Leu Cys Thr Gly Tyr Ile Leu Leu Gln Gly Ile Val Phe
65 70 75 80
Trp Gly Phe Phe Thr Val Gly His Asp Ala Gly His Gly Ala Phe Ser
85 90 95
Arg Tyr His Leu Leu Asn Phe Val Val Gly Thr Phe Met His Ser Leu
100 105 110
Ile Leu Thr Pro Phe Glu Ser Trp Lys Leu Thr His Arg His His His
115 120 125
Lys Asn Thr Gly Asn Ile Asp Arg Asp Glu Val Phe Tyr Pro Gln Arg
130 135 140
Lys Ala Asp Asp His Pro Leu Ser Arg Asn Leu Ile Leu Ala Leu Gly
145 150 155 160
Ala Ala Trp Leu Ala Tyr Leu Val Glu Gly Phe Pro Pro Arg Lys Val
165 170 175
Asn His Phe Asn Pro Phe Glu Pro Leu Phe Val Arg Gln Val Ser Ala
180 185 190
Val Val Ile Ser Leu Leu Ala His Phe Phe Val Ala Gly Leu Ser Ile
195 200 205
Tyr Leu Ser Leu Gln Leu Gly Leu Lys Thr Met Ala Ile Tyr Tyr Tyr
210 215 220
Gly Pro Val Phe Val Phe Gly Ser Met Leu Val Ile Thr Thr Phe Leu
225 230 235 240
His His Asn Asp Glu Glu Thr Pro Trp Tyr Ala Asp Ser Glu Trp Thr
245 250 255
Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr Gly Ala Leu
260 265 270
Ile Asp Asn Leu Ser His Asn Ile Gly Thr His Gln Ile His His Leu
275 280 285
Phe Pro Ile Ile Pro His Tyr Lys Leu Lys Lys Ala Thr Ala Ala Phe
290 295 300
His Gln Ala Phe Pro Glu Leu Val Arg Lys Ser Asp Glu Pro Ile Ile
305 310 315 320
Lys Ala Phe Phe Arg Val Gly Arg Leu Tyr Ala Asn Tyr Gly Val Val
325 330 335
Asp Gln Glu Ala Lys Leu Phe Thr Leu Lys Glu Ala Lys Ala Ala Thr
340 345 350
Glu Ala Ala Ala Lys Thr Lys Ser Thr
355 360
<210>10
<211>3806
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pPsD17S
<400>10
atcggatccc gggcccgtcg actgcagagg cctgcatgca agcttggcgt aatcatggtc 60
atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg 120
aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt 180
gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg 240
ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga 300
ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 360
acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 420
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 480
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 540
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 600
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 660
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 720
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 780
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 840
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 900
aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 960
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 1020
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 1080
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 1140
cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 1200
gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 1260
tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 1320
gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 1380
agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 1440
tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 1500
agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 1560
gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 1620
catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 1680
ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 1740
atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 1800
tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 1860
cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 1920
cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 1980
atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 2040
aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 2100
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 2160
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 2220
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtct 2280
cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac 2340
agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt 2400
tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac tgagagtgca 2460
ccatatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggcgcca 2520
ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggcct cttcgctatt 2580
acgccagctg gcgaaagggg gatgtgctgc aaggcgatta agttgggtaa cgccagggtt 2640
ttcccagtca cgacgttgta aaacgacggc cagtgaattc gagctcggta cctcgcgaat 2700
gcatctagat ccatggctac caagcagccc taccagttcc ctactctgac cgagatcaag 2760
cgatctctgc cctccgagtg tttcgaggcc tccgtgcctc tctctctgta ctacaccgtt 2820
cgatgcctgg tcattgctgt gtcgctcgcc ttcggacttc accatgcacg atctctgccc 2880
gttgtcgaag gcctctgggc tctggatgcc gctctctgca ccggttacgt gctgctccag 2940
ggcatcgtct tctggggatt ctttactgtt ggtcacgacg ctggacatgg tgccttctcc 3000
cgataccacc tgctcaactt tgtcatcgga accttcattc actctctcat ccttacaccc 3060
ttcgagtcct ggaagctcac ccacagacac catcacaaga acactggcaa catcgaccga 3120
gacgaaatct tctaccctca acgaaaggcc gacgatcatc ctctgtctcg aaacctcatt 3180
ctggctttgg gtgcagcctg gtttgcctac ctggtcgaag gctttcctcc ccgaaaggtc 3240
aaccacttca accccttcga gcctctcttt gttcgacagg tctctgccgt ggtcatttcg 3300
ctggctgcgc actttggagt ggctgccctg tccatctacc tcagcctgca gttcggcttc 3360
aagactatgg ccatctacta ctatggtccc gtctttgtgt tcggatccat gctcgtcatt 3420
actacctttc ttcatcacaa cgacgaagag acaccttggt acgcagattc ggagtggacc 3480
tacgtcaaag gcaacctgtc ctctgtcgac cgatcctacg gtgccctcat cgacaacctt 3540
tctcacaaca tcggaaccca ccagattcat cacctctttc ccatcattcc tcactacaag 3600
ctcaagcgag ctaccgaggc cttccatcaa gcctttcccg agctggttcg aaagtccgac 3660
gaacccatca tcaaggcctt tttcagagtc ggccgactct acgcaaacta cggtgtggtc 3720
gactcggatg ccaagctgtt cactctcaag gaggccaagg ctgtttccga agccgctacc 3780
aagactaagg ccacctaagc ggccgc 3806
<210>11
<211>6619
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pKunF1-KEA
<400>11
tttgaatcga atcgatgagc ctaaaatgaa cccgagtata tctcataaaa ttctcggtga 60
gaggtctgtg actgtcagta caaggtgcct tcattatgcc ctcaacctta ccatacctca 120
ctgaatgtag tgtacctcta aaaatgaaat acagtgccaa aagccaaggc actgagctcg 180
tctaacggac ttgatataca accaattaaa acaaatgaaa agaaatacag ttctttgtat 240
catttgtaac aattaccctg tacaaactaa ggtattgaaa tcccacaata ttcccaaagt 300
ccaccccttt ccaaattgtc atgcctacaa ctcatatacc aagcactaac ctaccgttta 360
aacagtgtac gcagatctac tatagaggaa catttaaatt gccccggaga agacggccag 420
gccgcctaga tgacaaattc aacaactcac agctgacttt ctgccattgc cactaggggg 480
gggccttttt atatggccaa gccaagctct ccacgtcggt tgggctgcac ccaacaataa 540
atgggtaggg ttgcaccaac aaagggatgg gatggggggt agaagatacg aggataacgg 600
ggctcaatgg cacaaataag aacgaatact gccattaaga ctcgtgatcc agcgactgac 660
accattgcat catctaaggg cctcaaaact acctcggaac tgctgcgctg atctggacac 720
cacagaggtt ccgagcactt taggttgcac caaatgtccc accaggtgca ggcagaaaac 780
gctggaacag cgtgtacagt ttgtcttaac aaaaagtgag ggcgctgagg tcgagcaggg 840
tggtgtgact tgttatagcc tttagagctg cgaaagcgcg tatggatttg gctcatcagg 900
ccagattgag ggtctgtgga cacatgtcat gttagtgtac ttcaatcgcc ccctggatat 960
agccccgaca ataggccgtg gcctcatttt tttgccttcc gcacatttcc attgctcggt 1020
acccacacct tgcttctcct gcacttgcca accttaatac tggtttacat tgaccaacat 1080
cttacaagcg gggggcttgt ctagggtata tataaacagt ggctctccca atcggttgcc 1140
agtctctttt ttcctttctt tccccacaga ttcgaaatct aaactacaca tcacagaatt 1200
ccgagccgtg agtatccacg acaagatcag tgtcgagacg acgcgttttg tgtaatgaca 1260
caatccgaaa gtcgctagca acacacactc tctacacaaa ctaacccagc tctggtacca 1320
tggagtccat tgctcccttc ctgccctcca agatgcctca ggacctgttc atggacctcg 1380
ccagcgctat cggtgtccga gctgctccct acgtcgatcc cctggaggct gccctggttg 1440
cccaggccga gaagtacatt cccaccattg tccatcacac tcgaggcttc ctggttgccg 1500
tggagtctcc cctggctcga gagctgcctc tgatgaaccc cttccacgtg ctcctgatcg 1560
tgctcgccta cctggtcacc gtgtttgtgg gtatgcagat catgaagaac tttgaacgat 1620
tcgaggtcaa gaccttctcc ctcctgcaca acttctgtct ggtctccatc tccgcctaca 1680
tgtgcggtgg catcctgtac gaggcttatc aggccaacta tggactgttt gagaacgctg 1740
ccgatcacac cttcaagggt ctccctatgg ctaagatgat ctggctcttc tacttctcca 1800
agatcatgga gtttgtcgac accatgatca tggtcctcaa gaagaacaac cgacagattt 1860
cctttctgca cgtgtaccac cactcttcca tcttcaccat ctggtggctg gtcaccttcg 1920
ttgctcccaa cggtgaagcc tacttctctg ctgccctgaa ctccttcatc cacgtcatca 1980
tgtacggcta ctactttctg tctgccctgg gcttcaagca ggtgtcgttc atcaagttct 2040
acatcactcg atcccagatg acccagttct gcatgatgtc tgtccagtct tcctgggaca 2100
tgtacgccat gaaggtcctt ggccgacctg gatacccctt cttcatcacc gctctgctct 2160
ggttctacat gtggaccatg ctcggtctct tctacaactt ttaccgaaag aacgccaagc 2220
tcgccaagca ggccaaggct gacgctgcca aggagaaggc cagaaagctc cagtaagcgg 2280
ccgcaagtgt ggatggggaa gtgagtgccc ggttctgtgt gcacaattgg caatccaaga 2340
tggatggatt caacacaggg atatagcgag ctacgtggtg gtgcgaggat atagcaacgg 2400
atatttatgt ttgacacttg agaatgtacg atacaagcac tgtccaagta caatactaaa 2460
catactgtac atactcatac tcgtacccgg gcaacggttt cacttgagtg cagtggctag 2520
tgctcttact cgtacagtgt gcaatactgc gtatcatagt ctttgatgta tatcgtattc 2580
attcatgtta gttgcgtacg aagtcgtcaa tgatgtcgat atgggttttg atcatgcaca 2640
cataaggtcc gaccttatcg gcaagctcaa tgagctcctt ggtggtggta acatccagag 2700
aagcacacag gttggttttc ttggctgcca cgagcttgag cactcgagcg gcaaaggcgg 2760
acttgtggac gttagctcga gcttcgtagg agggcatttt ggtggtgaag aggagactga 2820
aataaattta gtctgcagaa ctttttatcg gaaccttatc tggggcagtg aagtatatgt 2880
tatggtaata gttacgagtt agttgaactt atagatagac tggactatac ggctatcggt 2940
ccaaattaga aagaacgtca atggctctct gggcgtcgcc tttgccgaca aaaatgtgat 3000
catgatgaaa gccagcaatg acgttgcagc tgatattgtt gtcggccaac cgcgccgaaa 3060
acgcagctgt cagacccaca gcctccaacg aagaatgtat cgtcaaagtg atccaagcac 3120
actcatagtt ggagtcgtac tccaaaggcg gcaatgacga gtcagacaga tactcgtcga 3180
ccttttcctt gggaaccacc accgtcagcc cttctgactc acgtattgta gccaccgaca 3240
caggcaacag tccgtggata gcagaatatg tcttgtcggt ccatttctca ccaactttag 3300
gcgtcaagtg aatgttgcag aagaagtatg tgccttcatt gagaatcggt gttgctgatt 3360
tcaataaagt cttgagatca gtttggcgcg ccagctgcat taatgaatcg gccaacgcgc 3420
ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg 3480
ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 3540
cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 3600
gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 3660
tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 3720
ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg 3780
atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag 3840
gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt 3900
tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca 3960
cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 4020
cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt 4080
tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc 4140
cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg 4200
cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg 4260
gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta 4320
gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg 4380
gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg 4440
ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc 4500
atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 4560
agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc 4620
ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 4680
tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat 4740
ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 4800
caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 4860
gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 4920
atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg 4980
accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt 5040
aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct 5100
gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac 5160
tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat 5220
aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat 5280
ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 5340
aataggggtt ccgcgcacat ttccccgaaa agtgccacct gatgcggtgt gaaataccgc 5400
acagatgcgt aaggagaaaa taccgcatca ggaaattgta agcgttaata ttttgttaaa 5460
attcgcgtta aatttttgtt aaatcagctc attttttaac caataggccg aaatcggcaa 5520
aatcccttat aaatcaaaag aatagaccga gatagggttg agtgttgttc cagtttggaa 5580
caagagtcca ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca 5640
gggcgatggc ccactacgtg aaccatcacc ctaatcaagt tttttggggt cgaggtgccg 5700
taaagcacta aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc 5760
ggcgaacgtg gcgagaaagg aagggaagaa agcgaaagga gcgggcgcta gggcgctggc 5820
aagtgtagcg gtcacgctgc gcgtaaccac cacacccgcc gcgcttaatg cgccgctaca 5880
gggcgcgtcc attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc 5940
tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta 6000
acgccagggt tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt gtaatacgac 6060
tcactatagg gcgaattggg cccgacgtcg catgcagtgg tggtattgtg actggggatg 6120
tagttgagaa taagtcatac acaagtcagc tttcttcgag cctcatataa gtataagtag 6180
ttcaacgtat tagcactgta cccagcatct ccgtatcgag aaacacaaca acatgcccca 6240
ttggacagat catgcggata cacaggttgt gcagtatcat acatactcga tcagacaggt 6300
cgtctgacca tcatacaagc tgaacaagcg ctccatactt gcacgctctc tatatacaca 6360
gttaaattac atatccatag tctaacctct aacagttaat cttctggtaa gcctcccagc 6420
cagccttctg gtatcgcttg gcctcctcaa taggatctcg gttctggccg tacagacctc 6480
ggccgacaat tatgatatcc gttccggtag acatgacatc ctcaacagtt cggtactgct 6540
gtccgagagc gtctcccttg tcgtcaagac ccaccccggg ggtcagaata agccagtcct 6600
cagagtcgcc cttaattaa 6619
<210>12
<211>9513
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pDMW214
<400>12
ggtggagctc cagcttttgt tccctttagt gagggttaat ttcgagcttg gcgtaatcat 60
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 120
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 180
cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 240
tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 300
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 360
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 420
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 480
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 540
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 600
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 660
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 720
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 780
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 840
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 900
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 960
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 1020
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 1080
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 1140
ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 1200
atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 1260
tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac 1320
gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg 1380
ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg 1440
caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt 1500
cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct 1560
cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 1620
cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta 1680
agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 1740
tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 1800
agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac 1860
atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa 1920
ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt 1980
cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg 2040
caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat 2100
attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt 2160
agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgcgc 2220
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 2280
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 2340
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 2400
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 2460
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 2520
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 2580
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 2640
attttaacaa aatattaacg cttacaattt ccattcgcca ttcaggctgc gcaactgttg 2700
ggaagggcga tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc 2760
tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 2820
ggccagtgaa ttgtaatacg actcactata gggcgaattg ggtaccgggc cccccctcga 2880
ggtcgatggt gtcgataagc ttgatatcga attcatgtca cacaaaccga tcttcgcctc 2940
aaggaaacct aattctacat ccgagagact gccgagatcc agtctacact gattaatttt 3000
cgggccaata atttaaaaaa atcgtgttat ataatattat atgtattata tatatacatc 3060
atgatgatac tgacagtcat gtcccattgc taaatagaca gactccatct gccgcctcca 3120
actgatgttc tcaatattta aggggtcatc tcgcattgtt taataataaa cagactccat 3180
ctaccgcctc caaatgatgt tctcaaaata tattgtatga acttattttt attacttagt 3240
attattagac aacttacttg ctttatgaaa aacacttcct atttaggaaa caatttataa 3300
tggcagttcg ttcatttaac aatttatgta gaataaatgt tataaatgcg tatgggaaat 3360
cttaaatatg gatagcataa atgatatctg cattgcctaa ttcgaaatca acagcaacga 3420
aaaaaatccc ttgtacaaca taaatagtca tcgagaaata tcaactatca aagaacagct 3480
attcacacgt tactattgag attattattg gacgagaatc acacactcaa ctgtctttct 3540
ctcttctaga aatacaggta caagtatgta ctattctcat tgttcatact tctagtcatt 3600
tcatcccaca tattccttgg atttctctcc aatgaatgac attctatctt gcaaattcaa 3660
caattataat aagatatacc aaagtagcgg tatagtggca atcaaaaagc ttctctggtg 3720
tgcttctcgt atttattttt attctaatga tccattaaag gtatatattt atttcttgtt 3780
atataatcct tttgtttatt acatgggctg gatacataaa ggtattttga tttaattttt 3840
tgcttaaatt caatcccccc tcgttcagtg tcaactgtaa tggtaggaaa ttaccatact 3900
tttgaagaag caaaaaaaat gaaagaaaaa aaaaatcgta tttccaggtt agacgttccg 3960
cagaatctag aatgcggtat gcggtacatt gttcttcgaa cgtaaaagtt gcgctccctg 4020
agatattgta catttttgct tttacaagta caagtacatc gtacaactat gtactactgt 4080
tgatgcatcc acaacagttt gttttgtttt tttttgtttt ttttttttct aatgattcat 4140
taccgctatg tatacctact tgtacttgta gtaagccggg ttattggcgt tcaattaatc 4200
atagacttat gaatctgcac ggtgtgcgct gcgagttact tttagcttat gcatgctact 4260
tgggtgtaat attgggatct gttcggaaat caacggatgc tcaaccgatt tcgacagtaa 4320
taatttgaat cgaatcggag cctaaaatga acccgagtat atctcataaa attctcggtg 4380
agaggtctgt gactgtcagt acaaggtgcc ttcattatgc cctcaacctt accatacctc 4440
actgaatgta gtgtacctct aaaaatgaaa tacagtgcca aaagccaagg cactgagctc 4500
gtctaacgga cttgatatac aaccaattaa aacaaatgaa aagaaataca gttctttgta 4560
tcatttgtaa caattaccct gtacaaacta aggtattgaa atcccacaat attcccaaag 4620
tccacccctt tccaaattgt catgcctaca actcatatac caagcactaa cctaccaaac 4680
accactaaaa ccccacaaaa tatatcttac cgaatataca gtaacaagct accaccacac 4740
tcgttgggtg cagtcgccag cttaaagata tctatccaca tcagccacaa ctcccttcct 4800
ttaataaacc gactacaccc ttggctattg aggttatgag tgaatatact gtagacaaga 4860
cactttcaag aagactgttt ccaaaacgta ccactgtcct ccactacaaa cacacccaat 4920
ctgcttcttc tagtcaaggt tgctacaccg gtaaattata aatcatcatt tcattagcag 4980
ggcagggccc tttttataga gtcttataca ctagcggacc ctgccggtag accaacccgc 5040
aggcgcgtca gtttgctcct tccatcaatg cgtcgtagaa acgacttact ccttcttgag 5100
cagctccttg accttgttgg caacaagtct ccgacctcgg aggtggagga agagcctccg 5160
atatcggcgg tagtgatacc agcctcgacg gactccttga cggcagcctc aacagcgtca 5220
ccggcgggct tcatgttaag agagaacttg agcatcatgg cggcagacag aatggtggca 5280
atggggttga ccttctgctt gccgagatcg ggggcagatc cgtgacaggg ctcgtacaga 5340
ccgaacgcct cgttggtgtc gggcagagaa gccagagagg cggagggcag cagacccaga 5400
gaaccgggga tgacggaggc ctcgtcggag atgatatcgc caaacatgtt ggtggtgatg 5460
atgataccat tcatcttgga gggctgcttg atgaggatca tggcggccga gtcgatcagc 5520
tggtggttga gctcgagctg ggggaattcg tccttgagga ctcgagtgac agtctttcgc 5580
caaagtcgag aggaggccag cacgttggcc ttgtcaagag accacacggg aagagggggg 5640
ttgtgctgaa gggccaggaa ggcggccatt cgggcaattc gctcaacctc aggaacggag 5700
taggtctcgg tgtcggaagc gacgccagat ccgtcatcct cctttcgctc tccaaagtag 5760
atacctccga cgagctctcg gacaatgatg aagtcggtgc cctcaacgtt tcggatgggg 5820
gagagatcgg cgagcttggg cgacagcagc tggcagggtc gcaggttggc gtacaggttc 5880
aggtcctttc gcagcttgag gagaccctgc tcgggtcgca cgtcggttcg tccgtcggga 5940
gtggtccata cggtgttggc agcgcctccg acagcaccga gcataataga gtcagccttt 6000
cggcagatgt cgagagtagc gtcggtgatg ggctcgccct ccttctcaat ggcagctcct 6060
ccaatgagtc ggtcctcaaa cacaaactcg gtgccggagg cctcagcaac agacttgagc 6120
accttgacgg cctcggcaat cacctcgggg ccacagaagt cgccgccgag aagaacaatc 6180
ttcttggagt cagtcttggt cttcttagtt tcgggttcca ttgtggatgt gtgtggttgt 6240
atgtgtgatg tggtgtgtgg agtgaaaatc tgtggctggc aaacgctctt gtatatatac 6300
gcacttttgc ccgtgctatg tggaagacta aacctccgaa gattgtgact caggtagtgc 6360
ggtatcggct agggacccaa accttgtcga tgccgatagc gctatcgaac gtaccccagc 6420
cggccgggag tatgtcggag gggacatacg agatcgtcaa gggtttgtgg ccaactggta 6480
aataaatgat gtcgacgttt aaacagtgta cgcagtacta tagaggaaca attgccccgg 6540
agaagacggc caggccgcct agatgacaaa ttcaacaact cacagctgac tttctgccat 6600
tgccactagg ggggggcctt tttatatggc caagccaagc tctccacgtc ggttgggctg 6660
cacccaacaa taaatgggta gggttgcacc aacaaaggga tgggatgggg ggtagaagat 6720
acgaggataa cggggctcaa tggcacaaat aagaacgaat actgccatta agactcgtga 6780
tccagcgact gacaccattg catcatctaa gggcctcaaa actacctcgg aactgctgcg 6840
ctgatctgga caccacagag gttccgagca ctttaggttg caccaaatgt cccaccaggt 6900
gcaggcagaa aacgctggaa cagcgtgtac agtttgtctt aacaaaaagt gagggcgctg 6960
aggtcgagca gggtggtgtg acttgttata gcctttagag ctgcgaaagc gcgtatggat 7020
ttggctcatc aggccagatt gagggtctgt ggacacatgt catgttagtg tacttcaatc 7080
gccccctgga tatagccccg acaataggcc gtggcctcat ttttttgcct tccgcacatt 7140
tccattgctc ggtacccaca ccttgcttct cctgcacttg ccaaccttaa tactggttta 7200
cattgaccaa catcttacaa gcggggggct tgtctagggt atatataaac agtggctctc 7260
ccaatcggtt gccagtctct tttttccttt ctttccccac agattcgaaa tctaaactac 7320
acatcacaca atgcctgtta ctgacgtcct taagcgaaag tccggtgtca tcgtcggcga 7380
cgatgtccga gccgtgagta tccacgacaa gatcagtgtc gagacgacgc gttttgtgta 7440
atgacacaat ccgaaagtcg ctagcaacac acactctcta cacaaactaa cccagctctc 7500
catggcatgg atggtacgtc ctgtagaaac cccaacccgt gaaatcaaaa aactcgacgg 7560
cctgtgggca ttcagtctgg atcgcgaaaa ctgtggaatt gatcagcgtt ggtgggaaag 7620
cgcgttacaa gaaagccggg caattgctgt gccaggcagt tttaacgatc agttcgccga 7680
tgcagatatt cgtaattatg cgggcaacgt ctggtatcag cgcgaagtct ttataccgaa 7740
aggttgggca ggccagcgta tcgtgctgcg tttcgatgcg gtcactcatt acggcaaagt 7800
gtgggtcaat aatcaggaag tgatggagca tcagggcggc tatacgccat ttgaagccga 7860
tgtcacgccg tatgttattg ccgggaaaag tgtacgtatc accgtttgtg tgaacaacga 7920
actgaactgg cagactatcc cgccgggaat ggtgattacc gacgaaaacg gcaagaaaaa 7980
gcagtcttac ttccatgatt tctttaacta tgccgggatc catcgcagcg taatgctcta 8040
caccacgccg aacacctggg tggacgatat caccgtggtg acgcatgtcg cgcaagactg 8100
taaccacgcg tctgttgact ggcaggtggt ggccaatggt gatgtcagcg ttgaactgcg 8160
tgatgcggat caacaggtgg ttgcaactgg acaaggcact agcgggactt tgcaagtggt 8220
gaatccgcac ctctggcaac cgggtgaagg ttatctctat gaactgtgcg tcacagccaa 8280
aagccagaca gagtgtgata tctacccgct tcgcgtcggc atccggtcag tggcagtgaa 8340
gggcgaacag ttcctgatta accacaaacc gttctacttt actggctttg gtcgtcatga 8400
agatgcggac ttacgtggca aaggattcga taacgtgctg atggtgcacg accacgcatt 8460
aatggactgg attggggcca actcctaccg tacctcgcat tacccttacg ctgaagagat 8520
gctcgactgg gcagatgaac atggcatcgt ggtgattgat gaaactgctg ctgtcggctt 8580
taacctctct ttaggcattg gtttcgaagc gggcaacaag ccgaaagaac tgtacagcga 8640
agaggcagtc aacggggaaa ctcagcaagc gcacttacag gcgattaaag agctgatagc 8700
gcgtgacaaa aaccacccaa gcgtggtgat gtggagtatt gccaacgaac cggatacccg 8760
tccgcaagtg cacgggaata tttcgccact ggcggaagca acgcgtaaac tcgacccgac 8820
gcgtccgatc acctgcgtca atgtaatgtt ctgcgacgct cacaccgata ccatcagcga 8880
tctctttgat gtgctgtgcc tgaaccgtta ttacggatgg tatgtccaaa gcggcgattt 8940
ggaaacggca gagaaggtac tggaaaaaga acttctggcc tggcaggaga aactgcatca 9000
gccgattatc atcaccgaat acggcgtgga tacgttagcc gggctgcact caatgtacac 9060
cgacatgtgg agtgaagagt atcagtgtgc atggctggat atgtatcacc gcgtctttga 9120
tcgcgtcagc gccgtcgtcg gtgaacaggt atggaatttc gccgattttg cgacctcgca 9180
aggcatattg cgcgttggcg gtaacaagaa agggatcttc actcgcgacc gcaaaccgaa 9240
gtcggcggct tttctgctgc aaaaacgctg gactggcatg aacttcggtg aaaaaccgca 9300
gcagggaggc aaacaatgat taattaacta gagcggccgc caccgcggcc cgagattccg 9360
gcctcttcgg ccgccaagcg acccgggtgg acgtctagag gtacctagca attaacagat 9420
agtttgccgg tgataattct cttaacctcc cacactcctt tgacataacg atttatgtaa 9480
cgaaactgaa atttgaccag atattgtgtc cgc 9513
<210>13
<211>8727
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pFmPsD17S
<400>13
catggctacc aagcagccct accagttccc tactctgacc gagatcaagc gatctctgcc 60
ctccgagtgt ttcgaggcct ccgtgcctct ctctctgtac tacaccgttc gatgcctggt 120
cattgctgtg tcgctcgcct tcggacttca ccatgcacga tctctgcccg ttgtcgaagg 180
cctctgggct ctggatgccg ctctctgcac cggttacgtg ctgctccagg gcatcgtctt 240
ctggggattc tttactgttg gtcacgacgc tggacatggt gccttctccc gataccacct 300
gctcaacttt gtcatcggaa ccttcattca ctctctcatc cttacaccct tcgagtcctg 360
gaagctcacc cacagacacc atcacaagaa cactggcaac atcgaccgag acgaaatctt 420
ctaccctcaa cgaaaggccg acgatcatcc tctgtctcga aacctcattc tggctttggg 480
tgcagcctgg tttgcctacc tggtcgaagg ctttcctccc cgaaaggtca accacttcaa 540
ccccttcgag cctctctttg ttcgacaggt ctctgccgtg gtcatttcgc tggctgcgca 600
ctttggagtg gctgccctgt ccatctacct cagcctgcag ttcggcttca agactatggc 660
catctactac tatggtcccg tctttgtgtt cggatccatg ctcgtcatta ctacctttct 720
tcatcacaac gacgaagaga caccttggta cgcagattcg gagtggacct acgtcaaagg 780
caacctgtcc tctgtcgacc gatcctacgg tgccctcatc gacaaccttt ctcacaacat 840
cggaacccac cagattcatc acctctttcc catcattcct cactacaagc tcaagcgagc 900
taccgaggcc ttccatcaag cctttcccga gctggttcga aagtccgacg aacccatcat 960
caaggccttt ttcagagtcg gccgactcta cgcaaactac ggtgtggtcg actcggatgc 1020
caagctgttc actctcaagg aggccaaggc tgtttccgaa gccgctacca agactaaggc 1080
cacctaagcg gccgccaccg cggcccgaga ttccggcctc ttcggccgcc aagcgacccg 1140
ggtggacgtc tagaggtacc tagcaattaa cagatagttt gccggtgata attctcttaa 1200
cctcccacac tcctttgaca taacgattta tgtaacgaaa ctgaaatttg accagatatt 1260
gtgtccgcgg tggagctcca gcttttgttc cctttagtga gggttaattt cgagcttggc 1320
gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa 1380
catacgagcc ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac 1440
attaattgcg ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca 1500
ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc 1560
ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc 1620
aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 1680
aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 1740
gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 1800
gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 1860
tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 1920
ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 1980
ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 2040
tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 2100
tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 2160
ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 2220
aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 2280
ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 2340
tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 2400
atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 2460
aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat 2520
ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac 2580
tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg 2640
ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag 2700
tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt 2760
aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt 2820
gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt 2880
tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt 2940
cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct 3000
tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt 3060
ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac 3120
cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa 3180
actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa 3240
ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca 3300
aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct 3360
ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga 3420
atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc 3480
tgacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac 3540
cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc 3600
cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt 3660
tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg 3720
gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag 3780
tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt 3840
ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt 3900
taacgcgaat tttaacaaaa tattaacgct tacaatttcc attcgccatt caggctgcgc 3960
aactgttggg aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg 4020
ggatgtgctg caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt 4080
aaaacgacgg ccagtgaatt gtaatacgac tcactatagg gcgaattggg taccgggccc 4140
cccctcgagg tcgatggtgt cgataagctt gatatcgaat tcatgtcaca caaaccgatc 4200
ttcgcctcaa ggaaacctaa ttctacatcc gagagactgc cgagatccag tctacactga 4260
ttaattttcg ggccaataat ttaaaaaaat cgtgttatat aatattatat gtattatata 4320
tatacatcat gatgatactg acagtcatgt cccattgcta aatagacaga ctccatctgc 4380
cgcctccaac tgatgttctc aatatttaag gggtcatctc gcattgttta ataataaaca 4440
gactccatct accgcctcca aatgatgttc tcaaaatata ttgtatgaac ttatttttat 4500
tacttagtat tattagacaa cttacttgct ttatgaaaaa cacttcctat ttaggaaaca 4560
atttataatg gcagttcgtt catttaacaa tttatgtaga ataaatgtta taaatgcgta 4620
tgggaaatct taaatatgga tagcataaat gatatctgca ttgcctaatt cgaaatcaac 4680
agcaacgaaa aaaatccctt gtacaacata aatagtcatc gagaaatatc aactatcaaa 4740
gaacagctat tcacacgtta ctattgagat tattattgga cgagaatcac acactcaact 4800
gtctttctct cttctagaaa tacaggtaca agtatgtact attctcattg ttcatacttc 4860
tagtcatttc atcccacata ttccttggat ttctctccaa tgaatgacat tctatcttgc 4920
aaattcaaca attataataa gatataccaa agtagcggta tagtggcaat caaaaagctt 4980
ctctggtgtg cttctcgtat ttatttttat tctaatgatc cattaaaggt atatatttat 5040
ttcttgttat ataatccttt tgtttattac atgggctgga tacataaagg tattttgatt 5100
taattttttg cttaaattca atcccccctc gttcagtgtc aactgtaatg gtaggaaatt 5160
accatacttt tgaagaagca aaaaaaatga aagaaaaaaa aaatcgtatt tccaggttag 5220
acgttccgca gaatctagaa tgcggtatgc ggtacattgt tcttcgaacg taaaagttgc 5280
gctccctgag atattgtaca tttttgcttt tacaagtaca agtacatcgt acaactatgt 5340
actactgttg atgcatccac aacagtttgt tttgtttttt tttgtttttt ttttttctaa 5400
tgattcatta ccgctatgta tacctacttg tacttgtagt aagccgggtt attggcgttc 5460
aattaatcat agacttatga atctgcacgg tgtgcgctgc gagttacttt tagcttatgc 5520
atgctacttg ggtgtaatat tgggatctgt tcggaaatca acggatgctc aaccgatttc 5580
gacagtaata atttgaatcg aatcggagcc taaaatgaac ccgagtatat ctcataaaat 5640
tctcggtgag aggtctgtga ctgtcagtac aaggtgcctt cattatgccc tcaaccttac 5700
catacctcac tgaatgtagt gtacctctaa aaatgaaata cagtgccaaa agccaaggca 5760
ctgagctcgt ctaacggact tgatatacaa ccaattaaaa caaatgaaaa gaaatacagt 5820
tctttgtatc atttgtaaca attaccctgt acaaactaag gtattgaaat cccacaatat 5880
tcccaaagtc cacccctttc caaattgtca tgcctacaac tcatatacca agcactaacc 5940
taccaaacac cactaaaacc ccacaaaata tatcttaccg aatatacagt aacaagctac 6000
caccacactc gttgggtgca gtcgccagct taaagatatc tatccacatc agccacaact 6060
cccttccttt aataaaccga ctacaccctt ggctattgag gttatgagtg aatatactgt 6120
agacaagaca ctttcaagaa gactgtttcc aaaacgtacc actgtcctcc actacaaaca 6180
cacccaatct gcttcttcta gtcaaggttg ctacaccggt aaattataaa tcatcatttc 6240
attagcaggg cagggccctt tttatagagt cttatacact agcggaccct gccggtagac 6300
caacccgcag gcgcgtcagt ttgctccttc catcaatgcg tcgtagaaac gacttactcc 6360
ttcttgagca gctccttgac cttgttggca acaagtctcc gacctcggag gtggaggaag 6420
agcctccgat atcggcggta gtgataccag cctcgacgga ctccttgacg gcagcctcaa 6480
cagcgtcacc ggcgggcttc atgttaagag agaacttgag catcatggcg gcagacagaa 6540
tggtggcaat ggggttgacc ttctgcttgc cgagatcggg ggcagatccg tgacagggct 6600
cgtacagacc gaacgcctcg ttggtgtcgg gcagagaagc cagagaggcg gagggcagca 6660
gacccagaga accggggatg acggaggcct cgtcggagat gatatcgcca aacatgttgg 6720
tggtgatgat gataccattc atcttggagg gctgcttgat gaggatcatg gcggccgagt 6780
cgatcagctg gtggttgagc tcgagctggg ggaattcgtc cttgaggact cgagtgacag 6840
tctttcgcca aagtcgagag gaggccagca cgttggcctt gtcaagagac cacacgggaa 6900
gaggggggtt gtgctgaagg gccaggaagg cggccattcg ggcaattcgc tcaacctcag 6960
gaacggagta ggtctcggtg tcggaagcga cgccagatcc gtcatcctcc tttcgctctc 7020
caaagtagat acctccgacg agctctcgga caatgatgaa gtcggtgccc tcaacgtttc 7080
ggatggggga gagatcggcg agcttgggcg acagcagctg gcagggtcgc aggttggcgt 7140
acaggttcag gtcctttcgc agcttgagga gaccctgctc gggtcgcacg tcggttcgtc 7200
cgtcgggagt ggtccatacg gtgttggcag cgcctccgac agcaccgagc ataatagagt 7260
cagcctttcg gcagatgtcg agagtagcgt cggtgatggg ctcgccctcc ttctcaatgg 7320
cagctcctcc aatgagtcgg tcctcaaaca caaactcggt gccggaggcc tcagcaacag 7380
acttgagcac cttgacggcc tcggcaatca cctcggggcc acagaagtcg ccgccgagaa 7440
gaacaatctt cttggagtca gtcttggtct tcttagtttc gggttccatt gtggatgtgt 7500
gtggttgtat gtgtgatgtg gtgtgtggag tgaaaatctg tggctggcaa acgctcttgt 7560
atatatacgc acttttgccc gtgctatgtg gaagactaaa cctccgaaga ttgtgactca 7620
ggtagtgcgg tatcggctag ggacccaaac cttgtcgatg ccgatagcgc tatcgaacgt 7680
accccagccg gccgggagta tgtcggaggg gacatacgag atcgtcaagg gtttgtggcc 7740
aactggtaaa taaatgatgt cgacgtttaa acagtgtacg cagatctact atagaggaac 7800
atttaaattg ccccggagaa gacggccagg ccgcctagat gacaaattca acaactcaca 7860
gctgactttc tgccattgcc actagggggg ggccttttta tatggccaag ccaagctctc 7920
cacgtcggtt gggctgcacc caacaataaa tgggtagggt tgcaccaaca aagggatggg 7980
atggggggta gaagatacga ggataacggg gctcaatggc acaaataaga acgaatactg 8040
ccattaagac tcgtgatcca gcgactgaca ccattgcatc atctaagggc ctcaaaacta 8100
cctcggaact gctgcgctga tctggacacc acagaggttc cgagcacttt aggttgcacc 8160
aaatgtccca ccaggtgcag gcagaaaacg ctggaacagc gtgtacagtt tgtcttaaca 8220
aaaagtgagg gcgctgaggt cgagcagggt ggtgtgactt gttatagcct ttagagctgc 8280
gaaagcgcgt atggatttgg ctcatcaggc cagattgagg gtctgtggac acatgtcatg 8340
ttagtgtact tcaatcgccc cctggatata gccccgacaa taggccgtgg cctcattttt 8400
ttgccttccg cacatttcca ttgctcggta cccacacctt gcttctcctg cacttgccaa 8460
ccttaatact ggtttacatt gaccaacatc ttacaagcgg ggggcttgtc tagggtatat 8520
ataaacagtg gctctcccaa tcggttgcca gtctcttttt tcctttcttt ccccacagat 8580
tcgaaatcta aactacacat cacagaattc cgagccgtga gtatccacga caagatcagt 8640
gtcgagacga cgcgttttgt gtaatgacac aatccgaaag tcgctagcaa cacacactct 8700
ctacacaaac taacccagct ctggtac 8727
<210>14
<211>12649
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pKUNF12T6E
<220>
<221>misc_feature
<222>(2507)..(2507)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(2512)..(2515)
<223>n is a, c, g, or t
<400>14
taaccctcac taaagggaac aaaagctgga gctccaccgc ggacacaata tctggtcaaa 60
tttcagtttc gttacataaa tcgttatgtc aaaggagtgt gggaggttaa gagaattatc 120
accggcaaac tatctgttaa ttgctaggta cctctagacg tccacccggg tcgcttggcg 180
gccgaagagg ccggaatctc gggccgcggt ggcggccgct tagttggtct tggacttctt 240
gggcttcttc aggtaggact ggacaaagaa gttgccgaac agagcgagca gggtgatcat 300
gtacacgccg agcagctgga ccagagcctg agggtagtcg caggggaaga ggtagtcgta 360
cagggactgc accagcatag ccatgaactg ggtcatctgc agagtggtga tgtagggctt 420
gatgggcttg acgaagccga agccctgaga ggaaaagaag tagtaggcgt acatgacggt 480
gtggacgaag gagttgagga tgacggagaa gtaggcgtcg ccaccaggag cgtacttggc 540
aatagcccac cagatggcga agatggtggc atggtggtac acgtgcagga aggagacctg 600
gttgaacttc ttgcacagga tcatgatagc ggtgtccagg aactcgtagg ccttggagac 660
gtagaacacg tagacgattc gggacatgcc ctgagcgtgg gactcgttgc ccttctccat 720
gtcgttgccg aagaccttgt agccacccag gatagcctgt cggatggtct cgacgcacat 780
gtagagggac agtccgaaga ggaacaggtt gtggagcagc ttgatggtct tcagctcgaa 840
gggcttctcc atctgcttca tgatgggaat gccgaagagc agcatggcca tgtagccgac 900
ctcgaaggcg agcatggtgg agacgtccat catgggcaga ccgtcggtca gagcgtaggg 960
cttagctccg tccatccact ggtcgacacc ggtctcgact cgtccgacca cgtcgtccca 1020
gacagaggag ttggccatgg tgaatgattc ttatactcag aaggaaatgc ttaacgattt 1080
cgggtgtgag ttgacaagga gagagagaaa agaagaggaa aggtaattcg gggacggtgg 1140
tcttttatac ccttggctaa agtcccaacc acaaagcaaa aaaattttca gtagtctatt 1200
ttgcgtccgg catgggttac ccggatggcc agacaaagaa actagtacaa agtctgaaca 1260
agcgtagatt ccagactgca gtaccctacg cccttaacgg caagtgtggg aaccggggga 1320
ggtttgatat gtggggtgaa gggggctctc gccggggttg ggcccgctac tgggtcaatt 1380
tggggtcaat tggggcaatt ggggctgttt tttgggacac aaatacgccg ccaacccggt 1440
ctctcctgaa ttctgcatcg atcgaggaag aggacaagcg gctgcttctt aagtttgtga 1500
catcagtatc caaggcacca ttgcaaggat tcaaggcttt gaacccgtca tttgccattc 1560
gtaacgctgg tagacaggtt gatcggttcc ctacggcctc cacctgtgtc aatcttctca 1620
agctgcctga ctatcaggac attgatcaac ttcggaagaa acttttgtat gccattcgat 1680
cacatgctgg tttcgatttg tcttagagga acgcatatac agtaatcata gagaataaac 1740
gatattcatt tattaaagta gatagttgag gtagaagttg taaagagtga taaatagcgg 1800
ccgcgcctac ttaagcaacg ggcttgataa cagcgggggg ggtgcccacg ttgttgcggt 1860
tgcggaagaa cagaacaccc ttaccagcac cctcggcacc agcgctgggc tcaacccact 1920
ggcacatacg cgcactgcgg tacatggcgc ggatgaagcc acgaggacca tcctggacat 1980
cagcccggta gtgcttgccc atgatgggct taatggcctc ggtggcctcg tccgcgttgt 2040
agaaggggat gctgctgacg tagtggtgga ggacatgagt ctcgatgatg ccgtggagaa 2100
ggtggcggcc gatgaagccc atctcacggt caatggtagc agcggcacca cggacgaagt 2160
tccactcgtc gttggtgtag tggggaaggg tagggtcggt gtgctggagg aaggtgatgg 2220
caacgagcca gtggttaacc cagaggtagg gaacaaagta ccagatggcc atgttgtaga 2280
aaccgaactt ctgaacgagg aagtacagag cagtggccat cagaccgata ccaatatcgc 2340
tgaggacgat gagcttagcg tcactgttct cgtacagagg gctgcgggga tcgaagtggt 2400
taacaccacc gccgaggccg ttatgcttgc ccttgccgcg accctcacgc tggcgctcgt 2460
ggtagttgtg gccggtaaca ttggtgatga ggtagttggg ccagccnacg annnnctcag 2520
taagatgagc gagctcgtgg gtcatctttc cgagacgagt agcctgctgc tcgcgggttc 2580
ggggaacgaa gaccatgtca cgctccatgt tgccagtggc cttgtggtgc tttcggtggg 2640
agatttgcca gctgaagtag gggacaagga gggaagagtg aagaacccag ccagtaatgt 2700
cgttgatgat gcgagaatcg gagaaagcac cgtgaccgca ctcatgggca ataacccaga 2760
gaccagtacc gaaaagaccc tgaagaacgg tgtacacggc ccacagacca gcgcgggcgg 2820
gggtggaggg gatatattcg ggggtcacaa agttgtacca gatgctgaaa gtggtagtca 2880
ggaggacaat gtcgcggagg atataaccgt atcccttgag agcggagcgc ttgaagcagt 2940
gcttagggat ggcattgtag atgtccttga tggtaaagtc gggaacctcg aactggttgc 3000
cgtaggtgtc gagcatgaca ccatactcgg acttgggctt ggcgatatca acctcggaca 3060
tggacgagag cgatgtggaa gaggccgagt ggcggggaga gtctgaagga gagacggcgg 3120
cagactcaga atccgtcaca gtagttgagg tgacggtgcg tctaagcgca gggttctgct 3180
tgggcagagc cgaagtggac gccatggaga gctgggttag tttgtgtaga gagtgtgtgt 3240
tgctagcgac tttcggattg tgtcattaca caaaacgcgt cgtctcgaca ctgatcttgt 3300
cgtggatact cacggctcgg acatcgtcgc cgacgatgac accggacttt cgcttaagga 3360
cgtcagtaac aggcattgtg tgatgtgtag tttagatttc gaatctgtgg ggaaagaaag 3420
gaaaaaagag actggcaacc gattgggaga gccactgttt atatataccc tagacaagcc 3480
ccccgcttgt aagatgttgg tcaatgtaaa ccagtattaa ggttggcaag tgcaggagaa 3540
gcaaggtgtg ggtaccgagc aatggaaatg tgcggaaggc aaaaaaatga ggccacggcc 3600
tattgtcggg gctatatcca gggggcgatt gaagtacact aacatgacat gtgtccacag 3660
accctcaatc tggcctgatg agccaaatcc atacgcgctt tcgcagctct aaaggctata 3720
acaagtcaca ccaccctgct cgacctcagc gccctcactt tttgttaaga caaactgtac 3780
acgctgttcc agcgttttct gcctgcacct ggtgggacat ttggtgcaac ctaaagtgct 3840
cggaacctct gtggtgtcca gatcagcgca gcagttccga ggtagttttg aggcccttag 3900
atgatgcaat ggtgtcagtc gctggatcac gagtcttaat ggcagtattc gttcttattt 3960
gtgccattga gccccgttat cctcgtatct tctacccccc atcccatccc tttgttggtg 4020
caaccctacc catttattgt tgggtgcagc ccaaccgacg tggagagctt ggcttggcca 4080
tataaaaagg ccccccccta gtggcaatgg cagaaagtca gctgtgagtt gttgaatttg 4140
tcatctaggc ggcctggccg tcttctccgg ggcaattgtt cctctatagt actgcgtaca 4200
ctgtttaaac agtgtacgca gatctgcgac gacggaattc ctgcagccca tctgcagaat 4260
tcaggagaga ccgggttggc ggcgtatttg tgtcccaaaa aacagcccca attgccccaa 4320
ttgaccccaa attgacccag tagcgggccc aaccccggcg agagccccct tcaccccaca 4380
tatcaaacct cccccggttc ccacacttgc cgttaagggc gtagggtact gcagtctgga 4440
atctacgctt gttcagactt tgtactagtt tctttgtctg gccatccggg taacccatgc 4500
cggacgcaaa atagactact gaaaattttt ttgctttgtg gttgggactt tagccaaggg 4560
tataaaagac caccgtcccc gaattacctt tcctcttctt ttctctctct ccttgtcaac 4620
tcacacccga aatcgttaag catttccttc tgagtataag aatcattcac catggctgcc 4680
gctccctctg tgcgaacctt tacccgagcc gaggttctga acgctgaggc tctgaacgag 4740
ggcaagaagg acgctgaggc tcccttcctg atgatcatcg acaacaaggt gtacgacgtc 4800
cgagagttcg tccctgacca tcctggaggc tccgtgattc tcacccacgt tggcaaggac 4860
ggcaccgacg tctttgacac ctttcatccc gaggctgctt gggagactct cgccaacttc 4920
tacgttggag acattgacga gtccgaccga gacatcaaga acgatgactt tgccgctgag 4980
gtccgaaagc tgcgaaccct gttccagtct ctcggctact acgactcctc taaggcctac 5040
tacgccttca aggtctcctt caacctctgc atctggggac tgtccaccgt cattgtggcc 5100
aagtggggtc agacctccac cctcgccaac gtgctctctg ctgccctgct cggcctgttc 5160
tggcagcagt gcggatggct ggctcacgac tttctgcacc accaggtctt ccaggaccga 5220
ttctggggtg atctcttcgg agccttcctg ggaggtgtct gccagggctt ctcctcttcc 5280
tggtggaagg acaagcacaa cactcaccat gccgctccca acgtgcatgg cgaggatcct 5340
gacattgaca cccaccctct cctgacctgg tccgagcacg ctctggagat gttctccgac 5400
gtccccgatg aggagctgac ccgaatgtgg tctcgattca tggtcctgaa ccagacctgg 5460
ttctacttcc ccattctctc cttcgctcga ctgtcttggt gcctccagtc cattctcttt 5520
gtgctgccca acggtcaggc tcacaagccc tccggagctc gagtgcccat ctccctggtc 5580
gagcagctgt ccctcgccat gcactggacc tggtacctcg ctaccatgtt cctgttcatc 5640
aaggatcctg tcaacatgct cgtgtacttc ctggtgtctc aggctgtgtg cggaaacctg 5700
ctcgccatcg tgttctccct caaccacaac ggtatgcctg tgatctccaa ggaggaggct 5760
gtcgacatgg atttctttac caagcagatc atcactggtc gagatgtcca tcctggactg 5820
ttcgccaact ggttcaccgg tggcctgaac taccagatcg agcatcacct gttcccttcc 5880
atgcctcgac acaacttctc caagatccag cctgccgtcg agaccctgtg caagaagtac 5940
aacgtccgat accacaccac tggtatgatc gagggaactg ccgaggtctt ctcccgactg 6000
aacgaggtct ccaaggccac ctccaagatg ggcaaggctc agtaagcggc cgcatgagaa 6060
gataaatata taaatacatt gagatattaa atgcgctaga ttagagagcc tcatactgct 6120
cggagagaag ccaagacgag tactcaaagg ggattacacc atccatatcc acagacacaa 6180
gctggggaaa ggttctatat acactttccg gaataccgta gtttccgatg ttatcaatgg 6240
gggcagccag gatttcaggc acttcggtgt ctcggggtga aatggcgttc ttggcctcca 6300
tcaagtcgta ccatgtcttc atttgcctgt caaagtaaaa cagaagcaga tgaagaatga 6360
acttgaagtg aaggaattta aattgccccg gagaagacgg ccaggccgcc tagatgacaa 6420
attcaacaac tcacagctga ctttctgcca ttgccactag gggggggcct ttttatatgg 6480
ccaagccaag ctctccacgt cggttgggct gcacccaaca ataaatgggt agggttgcac 6540
caacaaaggg atgggatggg gggtagaaga tacgaggata acggggctca atggcacaaa 6600
taagaacgaa tactgccatt aagactcgtg atccagcgac tgacaccatt gcatcatcta 6660
agggcctcaa aactacctcg gaactgctgc gctgatctgg acaccacaga ggttccgagc 6720
actttaggtt gcaccaaatg tcccaccagg tgcaggcaga aaacgctgga acagcgtgta 6780
cagtttgtct taacaaaaag tgagggcgct gaggtcgagc agggtggtgt gacttgttat 6840
agcctttaga gctgcgaaag cgcgtatgga tttggctcat caggccagat tgagggtctg 6900
tggacacatg tcatgttagt gtacttcaat cgccccctgg atatagcccc gacaataggc 6960
cgtggcctca tttttttgcc ttccgcacat ttccattgct cggtacccac accttgcttc 7020
tcctgcactt gccaacctta atactggttt acattgacca acatcttaca agcggggggc 7080
ttgtctaggg tatatataaa cagtggctct cccaatcggt tgccagtctc ttttttcctt 7140
tctttcccca cagattcgaa atctaaacta cacatcacac aatgcctgtt actgacgtcc 7200
ttaagcgaaa gtccggtgtc atcgtcggcg acgatgtccg agccgtgagt atccacgaca 7260
agatcagtgt cgagacgacg cgttttgtgt aatgacacaa tccgaaagtc gctagcaaca 7320
cacactctct acacaaacta acccagctct ccatggagtc cattgctccc ttcctgccct 7380
ccaagatgcc tcaggacctg ttcatggacc tcgccagcgc tatcggtgtc cgagctgctc 7440
cctacgtcga tcccctggag gctgccctgg ttgcccaggc cgagaagtac attcccacca 7500
ttgtccatca cactcgaggc ttcctggttg ccgtggagtc tcccctggct cgagagctgc 7560
ctctgatgaa ccccttccac gtgctcctga tcgtgctcgc ctacctggtc accgtgtttg 7620
tgggtatgca gatcatgaag aactttgaac gattcgaggt caagaccttc tccctcctgc 7680
acaacttctg tctggtctcc atctccgcct acatgtgcgg tggcatcctg tacgaggctt 7740
atcaggccaa ctatggactg tttgagaacg ctgccgatca caccttcaag ggtctcccta 7800
tggctaagat gatctggctc ttctacttct ccaagatcat ggagtttgtc gacaccatga 7860
tcatggtcct caagaagaac aaccgacaga tttcctttct gcacgtgtac caccactctt 7920
ccatcttcac catctggtgg ctggtcacct tcgttgctcc caacggtgaa gcctacttct 7980
ctgctgccct gaactccttc atccacgtca tcatgtacgg ctactacttt ctgtctgccc 8040
tgggcttcaa gcaggtgtcg ttcatcaagt tctacatcac tcgatcccag atgacccagt 8100
tctgcatgat gtctgtccag tcttcctggg acatgtacgc catgaaggtc cttggccgac 8160
ctggataccc cttcttcatc accgctctgc tctggttcta catgtggacc atgctcggtc 8220
tcttctacaa cttttaccga aagaacgcca agctcgccaa gcaggccaag gctgacgctg 8280
ccaaggagaa ggccagaaag ctccagtaag cggccgcaag tgtggatggg gaagtgagtg 8340
cccggttctg tgtgcacaat tggcaatcca agatggatgg attcaacaca gggatatagc 8400
gagctacgtg gtggtgcgag gatatagcaa cggatattta tgtttgacac ttgagaatgt 8460
acgatacaag cactgtccaa gtacaatact aaacatactg tacatactca tactcgtacc 8520
cgggcaacgg tttcacttga gtgcagtggc tagtgctctt actcgtacag tgtgcaatac 8580
tgcgtatcat agtctttgat gtatatcgta ttcattcatg ttagttgcgt acgaagtcgt 8640
caatgatgtc gatatgggtt ttgatcatgc acacataagg tccgacctta tcggcaagct 8700
caatgagctc cttggtggtg gtaacatcca gagaagcaca caggttggtt ttcttggctg 8760
ccacgagctt gagcactcga gcggcaaagg cggacttgtg gacgttagct cgagcttcgt 8820
aggagggcat tttggtggtg aagaggagac tgaaataaat ttagtctgca gaacttttta 8880
tcggaacctt atctggggca gtgaagtata tgttatggta atagttacga gttagttgaa 8940
cttatagata gactggacta tacggctatc ggtccaaatt agaaagaacg tcaatggctc 9000
tctgggcgtc gcctttgccg acaaaaatgt gatcatgatg aaagccagca atgacgttgc 9060
agctgatatt gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca 9120
acgaagaatg tatcgtcaaa gtgatccaag cacactcata gttggagtcg tactccaaag 9180
gcggcaatga cgagtcagac agatactcgt cgaccttttc cttgggaacc accaccgtca 9240
gcccttctga ctcacgtatt gtagccaccg acacaggcaa cagtccgtgg atagcagaat 9300
atgtcttgtc ggtccatttc tcaccaactt taggcgtcaa gtgaatgttg cagaagaagt 9360
atgtgccttc attgagaatc ggtgttgctg atttcaataa agtcttgaga tcagtttggc 9420
gcgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg 9480
ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt 9540
atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa 9600
gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 9660
gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag 9720
gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt 9780
gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg 9840
aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg 9900
ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg 9960
taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac 10020
tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg 10080
gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt 10140
taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 10200
tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc 10260
tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt 10320
ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt 10380
taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag 10440
tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt 10500
cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg caatgatacc 10560
gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc 10620
cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg 10680
ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac 10740
aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg 10800
atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc 10860
tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact 10920
gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc 10980
aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat 11040
acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc 11100
ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac 11160
tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa 11220
aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact 11280
catactcttc ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg 11340
atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg 11400
aaaagtgcca cctgatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca 11460
tcaggaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag 11520
ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac 11580
cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga 11640
ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc 11700
accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg 11760
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa 11820
gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac 11880
caccacaccc gccgcgctta atgcgccgct acagggcgcg tccattcgcc attcaggctg 11940
cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tattacgcca gctggcgaaa 12000
gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttttccca gtcacgacgt 12060
tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt gggcccgacg 12120
tcgcatgcag tggtggtatt gtgactgggg atgtagttga gaataagtca tacacaagtc 12180
agctttcttc gagcctcata taagtataag tagttcaacg tattagcact gtacccagca 12240
tctccgtatc gagaaacaca acaacatgcc ccattggaca gatcatgcgg atacacaggt 12300
tgtgcagtat catacatact cgatcagaca ggtcgtctga ccatcataca agctgaacaa 12360
gcgctccata cttgcacgct ctctatatac acagttaaat tacatatcca tagtctaacc 12420
tctaacagtt aatcttctgg taagcctccc agccagcctt ctggtatcgc ttggcctcct 12480
caataggatc tcggttctgg ccgtacagac ctcggccgac aattatgata tccgttccgg 12540
tagacatgac atcctcaaca gttcggtact gctgtccgag agcgtctccc ttgtcgtcaa 12600
gacccacccc gggggtcaga ataagccagt cctcagagtc gcccttaat 12649
<210>15
<211>819
<212>DNA
<213>Thraustochytrium aureum
<220>
<221>misc_feature
<223>Synthetic elongase (codon-optimized)
<400>15
atggccaact cctctgtctg ggacgacgtg gtcggacgag tcgagaccgg tgtcgaccag 60
tggatggacg gagctaagcc ctacgctctg accgacggtc tgcccatgat ggacgtctcc 120
accatgctcg ccttcgaggt cggctacatg gccatgctgc tcttcggcat tcccatcatg 180
aagcagatgg agaagccctt cgagctgaag accatcaagc tgctccacaa cctgttcctc 240
ttcggactgt ccctctacat gtgcgtcgag accatccgac aggctatcct gggtggctac 300
aaggtcttcg gcaacgacat ggagaagggc aacgagtccc acgctcaggg catgtcccga 360
atcgtctacg tgttctacgt ctccaaggcc tacgagttcc tggacaccgc tatcatgatc 420
ctgtgcaaga agttcaacca ggtctccttc ctgcacgtgt accaccatgc caccatcttc 480
gccatctggt gggctattgc caagtacgct cctggtggcg acgcctactt ctccgtcatc 540
ctcaactcct tcgtccacac cgtcatgtac gcctactact tcttttcctc tcagggcttc 600
ggcttcgtca agcccatcaa gccctacatc accactctgc agatgaccca gttcatggct 660
atgctggtgc agtccctgta cgactacctc ttcccctgcg actaccctca ggctctggtc 720
cagctgctcg gcgtgtacat gatcaccctg ctcgctctgt tcggcaactt ctttgtccag 780
tcctacctga agaagcccaa gaagtccaag accaactaa 819
<210>16
<211>272
<212>PRT
<213>Thraustochytrium aureum
<400>16
Met Ala Asn Ser Ser Val Trp Asp Asp Val Val Gly Arg Val Glu Thr
1 5 10 15
Gly Val Asp Gln Trp Met Asp Gly Ala Lys Pro Tyr Ala Leu Thr Asp
20 25 30
Gly Leu Pro Met Met Asp Val Ser Thr Met Leu Ala Phe Glu Val Gly
35 40 45
Tyr Met Ala Met Leu Leu Phe Gly Ile Pro Ile Met Lys Gln Met Glu
50 55 60
Lys Pro Phe Glu Leu Lys Thr Ile Lys Leu Leu His Asn Leu Phe Leu
65 70 75 80
Phe Gly Leu Ser Leu Tyr Met Cys Val Glu Thr Ile Arg Gln Ala Ile
85 90 95
Leu Gly Gly Tyr Lys Val Phe Gly Asn Asp Met Glu Lys Gly Asn Glu
100 105 110
Ser His Ala Gln Gly Met Ser Arg Ile Val Tyr Val Phe Tyr Val Ser
115 120 125
Lys Ala Tyr Glu Phe Leu Asp Thr Ala Ile Met Ile Leu Cys Lys Lys
130 135 140
Phe Asn Gln Val Ser Phe Leu His Val Tyr His His Ala Thr Ile Phe
145 150 155 160
Ala Ile Trp Trp Ala Ile Ala Lys Tyr Ala Pro Gly Gly Asp Ala Tyr
165 170 175
Phe Ser Val Ile Leu Asn Ser Phe Val His Thr Val Met Tyr Ala Tyr
180 185 190
Tyr Phe Phe Ser Ser Gln Gly Phe Gly Phe Val Lys Pro Ile Lys Pro
195 200 205
Tyr Ile Thr Thr Leu Gln Met Thr Gln Phe Met Ala Met Leu Val Gln
210 215 220
Ser Leu Tyr Asp Tyr Leu Phe Pro Cys Asp Tyr Pro Gln Ala Leu Val
225 230 235 240
Gln Leu Leu Gly Val Tyr Met Ile Thr Leu Leu Ala Leu Phe Gly Asn
245 250 255
Phe Phe Val Gln Ser Tyr Leu Lys Lys Pro Lys Lys Ser Lys Thr Asn
260 265 270
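The relationship between the codon-optimized coding sequence of SEQ ID NO:15 (819 nucleotides) and the 272-residue protein of SEQ ID NO:16 can be spot-checked by translating the DNA with the standard genetic code. The following is a minimal sketch, not part of the original listing: the helper names `clean` and `translate` are hypothetical, the standard nuclear code is an assumption, and only the first and last lines of SEQ ID NO:15 are used as input.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the listing itself does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(listing_text: str) -> str:
    """Drop spaces, line breaks and position counters from listing lines."""
    return re.sub(r"[^ACGTN]", "", listing_text.upper())


def translate(dna: str) -> str:
    """Translate in frame 1 up to the first stop codon; unknown codons -> X."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line (positions 1-60) and last line (positions 781-819) of SEQ ID NO:15.
first_line = "atggccaact cctctgtctg ggacgacgtg gtcggacgag tcgagaccgg tgtcgaccag 60"
last_line = "tcctacctga agaagcccaa gaagtccaag accaactaa 819"

print(translate(clean(first_line)))  # MANSSVWDDVVGRVETGVDQ
print(translate(clean(last_line)))   # SYLKKPKKSKTN
```

The two outputs correspond to residues 1-20 (Met Ala Asn ... Val Asp Gln) and residues 261-272 (Ser Tyr Leu ... Lys Thr Asn) of SEQ ID NO:16. Position 781 begins a codon because 780 is a multiple of three, so the last line can be translated on its own.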
<210>17
<211>13034
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pDMW271
<400>17
cgatgcagaa ttcaggagag accgggttgg cggcgtattt gtgtcccaaa aaacagcccc 60
aattgcccca attgacccca aattgaccca gtagcgggcc caaccccggc gagagccccc 120
ttcaccccac atatcaaacc tcccccggtt cccacacttg ccgttaaggg cgtagggtac 180
tgcagtctgg aatctacgct tgttcagact ttgtactagt ttctttgtct ggccatccgg 240
gtaacccatg ccggacgcaa aatagactac tgaaaatttt tttgctttgt ggttgggact 300
ttagccaagg gtataaaaga ccaccgtccc cgaattacct ttcctcttct tttctctctc 360
tccttgtcaa ctcacacccg aaatcgttaa gcatttcctt ctgagtataa gaatcattca 420
ccatggatgg ctcccgaccc tgtcgctgcc gagaccgctg cccagggtcc cactccccga 480
tacttcacct gggacgaggt cgcccagcga tccggttgcg aggaacgatg gctggtcatc 540
gaccgaaagg tgtacaacat ctctgagttc acccgacgac atcccggtgg ctcccgagtg 600
atctcgcact acgctggaca ggacgccact gaccccttcg ttgcctttca cattaacaag 660
ggcctggtta agaagtacat gaactccctg ctcattggag agctgtctcc cgaacagcct 720
tcgtttgagc ctaccaagaa caaggagctg accgacgagt ttcgagagct ccgagccacc 780
gttgagcgaa tgggactgat gaaggccaac catgtcttct ttctgctcta cctgctccac 840
attcttctcc ttgacggagc tgcctggctt accctgtggg tcttcggcac ttcctttctg 900
ccctttcttc tctgcgccgt cctgctctct gccgtgcagg ctcaggctgg ttggcttcag 960
catgactttg gtcacctttc cgtgttctct acctccaagt ggaaccacct gctccatcac 1020
ttcgtgatcg gccacctcaa gggtgctcct gcctcgtggt ggaaccacat gcatttccag 1080
caccatgcca agcccaactg ttttcgaaag gatcccgaca tcaacatgca ccccttcttt 1140
ttcgctcttg gcaagatcct gtccgtcgag ctcggaaagc agaagaagaa gtacatgccc 1200
tacaaccacc agcacaagta cttcttcctg attggacctc ccgctctcct gcctctttac 1260
tttcagtggt acatctttta ctttgttatt cagcgaaaga agtgggttga tcttgcctgg 1320
atgatcacct tctacgtccg attcttcctg acctacgtcc ctctccttgg actgaaggcc 1380
tttctcggtc tgttctttat cgtccgattc ctggagtcca actggttcgt gtgggtgacc 1440
cagatgaacc acattcccat gcacattgac catgatcgaa acatggactg ggtgtcgact 1500
cagctgcagg ccacctgcaa cgttcacaag tctgctttca acgactggtt ttccggtcac 1560
ctcaactttc agattgagca ccatctgttt cccaccatgc ctcgacacaa ctaccacaag 1620
gttgctcccc tggtccagtc gctctgtgcc aagcatggca tcgagtacca gtccaagccc 1680
ctgctctctg ccttcgctga catcattcac tcgctgaagg aatctggcca gctctggctc 1740
gatgcctacc tgcaccagta agcggccgca ttgatgattg gaaacacaca catgggttat 1800
atctaggtga gagttagttg gacagttata tattaaatca gctatgccaa cggtaacttc 1860
attcatgtca acgaggaacc agtgactgca agtaatatag aatttgacca ccttgccatt 1920
ctcttgcact cctttactat atctcattta tttcttatat acaaatcact tcttcttccc 1980
agcatcgagc tcggaaacct catgagcaat aacatcgtgg atctcgtcaa tagagggctt 2040
tttggactcc ttgctgttgg ccaccttgtc cttgctgtct ggctcattct gtttcaacgc 2100
cttttaatta acggagtagg tctcggtgtc ggaagcgacg ccagatccgt catcctcctt 2160
tcgctctcca aagtagatac ctccgacgag ctctcggaca atgatgaagt cggtgccctc 2220
aacgtttcgg atgggggaga gatcggcgag cttgggcgac agcagctggc agggtcgcag 2280
gttggcgtac aggttcaggt cctttcgcag cttgaggaga ccctgctcgg gtcgcacgtc 2340
ggttcgtccg tcgggagtgg tccatacggt gttggcagcg cctccgacag caccgagcat 2400
aatagagtca gcctttcggc agatgtcgag agtagcgtcg gtgatgggct cgccctcctt 2460
ctcaatggca gctcctccaa tgagtcggtc ctcaaacaca aactcggtgc cggaggcctc 2520
agcaacagac ttgagcacct tgacggcctc ggcaatcacc tcggggccac agaagtcgcc 2580
gccgagaaga acaatcttct tggagtcagt cttggtcttc ttagtttcgg gttccattgt 2640
ggatgtgtgt ggttgtatgt gtgatgtggt gtgtggagtg aaaatctgtg gctggcaaac 2700
gctcttgtat atatacgcac ttttgcccgt gctatgtgga agactaaacc tccgaagatt 2760
gtgactcagg tagtgcggta tcggctaggg acccaaacct tgtcgatgcc gatagcatgc 2820
gacgtcgggc ccaattcgcc ctatagtgag tcgtattaca attcactggc cgtcgtttta 2880
caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc 2940
cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg 3000
cgcagcctga atggcgaatg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 3060
tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt 3120
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 3180
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg 3240
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 3300
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 3360
cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg 3420
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaatttcct 3480
gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatca ggtggcactt 3540
ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 3600
atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 3660
tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 3720
tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 3780
gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 3840
aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 3900
gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 3960
ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 4020
gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 4080
gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 4140
atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 4200
ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 4260
cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 4320
cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 4380
gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 4440
cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 4500
cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 4560
taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 4620
ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 4680
aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 4740
caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 4800
taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag ccgtagttag 4860
gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 4920
cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 4980
taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 5040
agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 5100
ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 5160
gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 5220
acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 5280
acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 5340
tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 5400
ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 5460
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 5520
gcgcccactg agctcgtcta acggacttga tatacaacca attaaaacaa atgaaaagaa 5580
atacagttct ttgtatcatt tgtaacaatt accctgtaca aactaaggta ttgaaatccc 5640
acaatattcc caaagtccac ccctttccaa attgtcatgc ctacaactca tataccaagc 5700
actaacctac caaacaccac taaaacccca caaaatatat cttaccgaat atacagtaac 5760
aagctaccac cacactcgtt gggtgcagtc gccagcttaa agatatctat ccacatcagc 5820
cacaactccc ttcctttaat aaaccgacta cacccttggc tattgaggtt atgagtgaat 5880
atactgtaga caagacactt tcaagaagac tgtttccaaa acgtaccact gtcctccact 5940
acaaacacac ccaatctgct tcttctagtc aaggttgcta caccggtaaa ttataaatca 6000
tcatttcatt agcagggcag ggcccttttt atagagtctt atacactagc ggaccctgcc 6060
ggtagaccaa cccgcaggcg cgtcagtttg ctccttccat caatgcgtcg tagaaacgac 6120
ttactccttc ttgagcagct ccttgacctt gttggcaaca agtctccgac ctcggaggtg 6180
gaggaagagc ctccgatatc ggcggtagtg ataccagcct cgacggactc cttgacggca 6240
gcctcaacag cgtcaccggc gggcttcatg ttaagagaga acttgagcat catggcggca 6300
gacagaatgg tggcgtacgc aactaacatg aatgaatacg atatacatca aagactatga 6360
tacgcagtat tgcacactgt acgagtaaga gcactagcca ctgcactcaa gtgaaaccgt 6420
tgcccgggta cgagtatgag tatgtacagt atgtttagta ttgtacttgg acagtgcttg 6480
tatcgtacat tctcaagtgt caaacataaa tatccgttgc tatatcctcg caccaccacg 6540
tagctcgcta tatccctgtg ttgaatccat ccatcttgga ttgccaattg tgcacacaga 6600
accgggcact cacttcccca tccacacttg cggccgctta gctgcctact cttccttggg 6660
acggagtcca agaacacgca agtgctccaa atgtgaagca aatgcttgcc aaaacgtatc 6720
cttgacaagg tatggaacct tgtactcgct gcaggtgttc ttgatgatgg ccagaatatc 6780
gggataatgg tgctgcgaca cgttggggaa cagatggtgc acagcctggt agttcaagct 6840
gccagtgatg ctggtccaga ggtgcgaatc gtgtgcgtaa tcctgcgtag tctcgacctg 6900
catagctgcc cagtcctttt ggatgatccc gttctcgtca ggcaacggcc actgaacttc 6960
ctcaacaacg tggttcgcct ggaaggtcag cgccagccag taagacgaca ccatgtccgc 7020
gaccgtgaac aagagcagca ccttgcccag gggcagatac tgcaggggaa caatcaggcg 7080
ataccagaca aagaaagcct tgccgcccca gaacatcaca gtgtgccatg tcgagatggg 7140
attgacacga atagcgtcat tggtcttgac aaagtacaaa atgttgatgt cctgaatgcg 7200
caccttgaac gccagcagtc cgtacaggaa aggaacaaac atgtgctggt tgatgtggtt 7260
gacaaaccac ttttggttgg gcttgatacg acgaacatcg ggctcagacg tcgacacgtc 7320
gggatctgct ccagcaatgt tggtgtaggg gtgatggccg agcatatgtt ggtacatcca 7380
caccaggtac gatgctccgt tgaaaaagtc gtgcgtggct cccagaatct tccagacagt 7440
ggggttgtgg gtcactgaaa agtgagacgc atcatgaaga gggttgagtc cgacttgtgc 7500
gcacgcaaat cccatgatga ttgcaaacac cacctgaagc catgtgcgtt cgacaacgaa 7560
aggcacaaag agctgcgcgt agtaggaagc gatcaaggat ccaaagataa gagcgtatcg 7620
tccccagatc tctggtctat tcttgggatc aatgttccga tccgtaaagt agccctcgac 7680
tctcgtcttg atggttttgt ggaacaccgt tggctccggg aagatgggca gctcattcga 7740
gaccagtgta ccgacatagt acttcttcat aatggcatct gcagccccaa acgcgtgata 7800
catctcaaag accggagtaa catctcggcc agctccgagc aggagagtgt ccactccacc 7860
aggatggcgg ctcaagaact ttgtgacatc gtacaccctg ccgcggatgg ccaagagtag 7920
gtcgtccttg gtgttatggg ccgccagctc ttcccaggtg aaggtttttc cttggtccgt 7980
tcccatggag agctgggtta gtttgtgtag agagtgtgtg ttgctagcga ctttcggatt 8040
gtgtcattac acaaaacgcg tcgtctcgac actgatcttg tcgtggatac tcacggctcg 8100
gacatcgtcg ccgacgatga caccggactt tcgcttaagg acgtcagtaa caggcattgt 8160
gtgatgtgta gtttagattt cgaatctgtg gggaaagaaa ggaaaaaaga gactggcaac 8220
cgattgggag agccactgtt tatatatacc ctagacaagc cccccgcttg taagatgttg 8280
gtcaatgtaa accagtatta aggttggcaa gtgcaggaga agcaaggtgt gggtaccgag 8340
caatggaaat gtgcggaagg caaaaaaatg aggccacggc ctattgtcgg ggctatatcc 8400
agggggcgat tgaagtacac taacatgaca tgtgtccaca gaccctcaat ctggcctgat 8460
gagccaaatc catacgcgct ttcgcagctc taaaggctat aacaagtcac accaccctgc 8520
tcgacctcag cgccctcact ttttgttaag acaaactgta cacgctgttc cagcgttttc 8580
tgcctgcacc tggtgggaca tttggtgcaa cctaaagtgc tcggaacctc tgtggtgtcc 8640
agatcagcgc agcagttccg aggtagtttt gaggccctta gatgatgcaa tggtgtcagt 8700
cgctggatca cgagtcttaa tggcagtatt cgttcttatt tgtgccattg agccccgtta 8760
tcctcgtatc ttctaccccc catcccatcc ctttgttggt gcaaccctac ccatttattg 8820
ttgggtgcag cccaaccgac gtggagagct tggcttggcc atataaaaag gcccccccct 8880
agtggcaatg gcagaaagtc agctgtgagt tgttgaattt gtcatctagg cggcctggcc 8940
gtcttctccg gggcaattta aattccttca cttcaagttc attcttcatc tgcttctgtt 9000
ttactttgac aggcaaatga agacatggta cgacttgatg gaggccaaga acgccatttc 9060
accccgagac accgaagtgc ctgaaatcct ggctgccccc attgataaca tcggaaacta 9120
cggtattccg gaaagtgtat atagaacctt tccccagctt gtgtctgtgg atatggatgg 9180
tgtaatcccc tttgagtact cgtcttggct tctctccgag cagtatgagg ctctctaatc 9240
tagcgcattt aatatctcaa tgtatttata tatttatctt ctcatgcggc cgcttagctg 9300
cctactcttc cttgggacgg agtccaagaa cacgcaagtg ctccaaatgt gaagcaaatg 9360
cttgccaaaa cgtatccttg acaaggtatg gaaccttgta ctcgctgcag gtgttcttga 9420
tgatggccag aatatcggga taatggtgct gcgacacgtt ggggaacaga tggtgcacag 9480
cctggtagtt caagctgcca gtgatgctgg tccagaggtg cgaatcgtgt gcgtaatcct 9540
gcgtagtctc gacctgcata gctgcccagt ccttttggat gatcccgttc tcgtcaggca 9600
acggccactg aacttcctca acaacgtggt tcgcctggaa ggtcagcgcc agccagtaag 9660
acgacaccat gtccgcgacc gtgaacaaga gcagcacctt gcccaggggc agatactgca 9720
ggggaacaat caggcgatac cagacaaaga aagccttgcc gccccagaac atcacagtgt 9780
gccatgtcga gatgggattg acacgaatag cgtcattggt cttgacaaag tacaaaatgt 9840
tgatgtcctg aatgcgcacc ttgaacgcca gcagtccgta caggaaagga acaaacatgt 9900
gctggttgat gtggttgaca aaccactttt ggttgggctt gatacgacga acatcgggct 9960
cagacgtcga cacgtcggga tctgctccag caatgttggt gtaggggtga tggccgagca 10020
tatgttggta catccacacc aggtacgatg ctccgttgaa aaagtcgtgc gtggctccca 10080
gaatcttcca gacagtgggg ttgtgggtca ctgaaaagtg agacgcatca tgaagagggt 10140
tgagtccgac ttgtgcgcac gcaaatccca tgatgattgc aaacaccacc tgaagccatg 10200
tgcgttcgac aacgaaaggc acaaagagct gcgcgtagta ggaagcgatc aaggatccaa 10260
agataagagc gtatcgtccc cagatctctg gtctattctt gggatcaatg ttccgatccg 10320
taaagtagcc ctcgactctc gtcttgatgg ttttgtggaa caccgttggc tccgggaaga 10380
tgggcagctc attcgagacc agtgtaccga catagtactt cttcataatg gcatctgcag 10440
ccccaaacgc gtgatacatc tcaaagaccg gagtaacatc tcggccagct ccgagcagga 10500
gagtgtccac tccaccagga tggcggctca agaactttgt gacatcgtac accctgccgc 10560
ggatggccaa gagtaggtcg tccttggtgt tatgggccgc cagctcttcc caggtgaagg 10620
tttttccttg gtccgttccc atggtgaatg attcttatac tcagaaggaa atgcttaacg 10680
atttcgggtg tgagttgaca aggagagaga gaaaagaaga ggaaaggtaa ttcggggacg 10740
gtggtctttt atacccttgg ctaaagtccc aaccacaaag caaaaaaatt ttcagtagtc 10800
tattttgcgt ccggcatggg ttacccggat ggccagacaa agaaactagt acaaagtctg 10860
aacaagcgta gattccagac tgcagtaccc tacgccctta acggcaagtg tgggaaccgg 10920
gggaggtttg atatgtgggg tgaagggggc tctcgccggg gttgggcccg ctactgggtc 10980
aatttggggt caattggggc aattggggct gttttttggg acacaaatac gccgccaacc 11040
cggtctctcc tgatcgatgg gctgcaggaa ttctacaata cgtgagtcag aagggctgac 11100
ggtggtggtt cccaaggaaa aggtcgacga gtatctgtct gactcgtcat tgccgccttt 11160
ggagtacgac tccaactatg agtgtgcttg gatcactttg acgatacatt cttcgttgga 11220
ggctgtgggt ctgacagctg cgttttcggc gcggttggcc gacaacaata tcagctgcaa 11280
cgtcattgct ggctttcatc atgatcacat ttttgtcggc aaaggcgacg cccagagagc 11340
cattgacgtt ctttctaatt tggaccgata gccgtatagt ccagtctatc tataagttca 11400
actaactcgt aactattacc ataacatata cttcactgcc ccagataagg ttccgataaa 11460
aagttctgca gactaaattt atttcagtct cctcttcacc accaaaatgc cctcctacga 11520
agctcgagct aacgtccaca agtccgcctt tgccgctcga gtgctcaagc tcgtggcagc 11580
caagaaaacc aacctgtgtg cttctctgga tgttaccacc accaaggagc tcattgagct 11640
tgccgataag gtcggacctt atgtgtgcat gatcaaaacc catatcgaca tcattgacga 11700
cttcacctac gccggcactg tgctccccct caaggaactt gctcttaagc acggtttctt 11760
cctgttcgag gacagaaagt tcgcagatat tggcaacact gtcaagcacc agtaccggtg 11820
tcaccgaatc gccgagtggt ccgatatcac caacgcccac ggtgtacccg gaaccggaat 11880
cattgctggc ctgcgagctg gtgccgagga aactgtctct gaacagaaga aggaggacgt 11940
ctctgactac gagaactccc agtacaagga gttcctagtc ccctctccca acgagaagct 12000
ggccagaggt ctgctcatgc tggccgagct gtcttgcaag ggctctctgg ccactggcga 12060
gtactccaag cagaccattg agcttgcccg atccgacccc gagtttgtgg ttggcttcat 12120
tgcccagaac cgacctaagg gcgactctga ggactggctt attctgaccc ccggggtggg 12180
tcttgacgac aagggagacg ctctcggaca gcagtaccga actgttgagg atgtcatgtc 12240
taccggaacg gatatcataa ttgtcggccg aggtctgtac ggccagaacc gagatcctat 12300
tgaggaggcc aagcgatacc agaaggctgg ctgggaggct taccagaaga ttaactgtta 12360
gaggttagac tatggatatg taatttaact gtgtatatag agagcgtgca agtatggagc 12420
gcttgttcag cttgtatgat ggtcagacga cctgtctgat cgagtatgta tgatactgca 12480
caacctgtgt atccgcatga tctgtccaat ggggcatgtt gttgtgtttc tcgatacgga 12540
gatgctgggt acagtgctaa tacgttgaac tacttatact tatatgaggc tcgaagaaag 12600
ctgacttgtg tatgacttat tctcaactac atccccagtc acaataccac cactgcacta 12660
ccactacacc agatctgcgt acactgttta aacggtaggt tagtgcttgg tatatgagtt 12720
gtaggcatga caatttggaa aggggtggac tttgggaata ttgtgggatt tcaatacctt 12780
agtttgtaca gggtaattgt tacaaatgat acaaagaact gtatttcttt tcatttgttt 12840
taattggttg tatatcaagt ccgttagacg agctcagtgc cttggctttt ggcactgtat 12900
ttcattttta gaggtacact acattcagtg aggtatggta aggttgaggg cataatgaag 12960
gcaccttgta ctgacagtca cagacctctc accgagaatt ttatgagata tactcgggtt 13020
cattttaggc tcat 13034
<210>18
<211>1335
<212>DNA
<213>Homo sapiens
<220>
<221>misc_feature
<223>Synthetic Δ-5 desaturase (codon-optimized)
<400>18
atggctcccg accctgtcgc tgccgagacc gctgcccagg gtcccactcc ccgatacttc 60
acctgggacg aggtcgccca gcgatccggt tgcgaggaac gatggctggt catcgaccga 120
aaggtgtaca acatctctga gttcacccga cgacatcccg gtggctcccg agtgatctcg 180
cactacgctg gacaggacgc cactgacccc ttcgttgcct ttcacattaa caagggcctg 240
gttaagaagt acatgaactc cctgctcatt ggagagctgt ctcccgaaca gccttcgttt 300
gagcctacca agaacaagga gctgaccgac gagtttcgag agctccgagc caccgttgag 360
cgaatgggac tgatgaaggc caaccatgtc ttctttctgc tctacctgct ccacattctt 420
ctccttgacg gagctgcctg gcttaccctg tgggtcttcg gcacttcctt tctgcccttt 480
cttctctgcg ccgtcctgct ctctgccgtg caggctcagg ctggttggct tcagcatgac 540
tttggtcacc tttccgtgtt ctctacctcc aagtggaacc acctgctcca tcacttcgtg 600
atcggccacc tcaagggtgc tcctgcctcg tggtggaacc acatgcattt ccagcaccat 660
gccaagccca actgttttcg aaaggatccc gacatcaaca tgcacccctt ctttttcgct 720
cttggcaaga tcctgtccgt cgagctcgga aagcagaaga agaagtacat gccctacaac 780
caccagcaca agtacttctt cctgattgga cctcccgctc tcctgcctct ttactttcag 840
tggtacatct tttactttgt tattcagcga aagaagtggg ttgatcttgc ctggatgatc 900
accttctacg tccgattctt cctgacctac gtccctctcc ttggactgaa ggcctttctc 960
ggtctgttct ttatcgtccg attcctggag tccaactggt tcgtgtgggt gacccagatg 1020
aaccacattc ccatgcacat tgaccatgat cgaaacatgg actgggtgtc gactcagctg 1080
caggccacct gcaacgttca caagtctgct ttcaacgact ggttttccgg tcacctcaac 1140
tttcagattg agcaccatct gtttcccacc atgcctcgac acaactacca caaggttgct 1200
cccctggtcc agtcgctctg tgccaagcat ggcatcgagt accagtccaa gcccctgctc 1260
tctgccttcg ctgacatcat tcactcgctg aaggaatctg gccagctctg gctcgatgcc 1320
tacctgcacc agtaa 1335
<210>19
<211>444
<212>PRT
<213>Homo sapiens
<400>19
Met Ala Pro Asp Pro Val Ala Ala Glu Thr Ala Ala Gln Gly Pro Thr
1 5 10 15
Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln Arg Ser Gly Cys Glu
20 25 30
Glu Arg Trp Leu Val Ile Asp Arg Lys Val Tyr Asn Ile Ser Glu Phe
35 40 45
Thr Arg Arg His Pro Gly Gly Ser Arg Val Ile Ser His Tyr Ala Gly
50 55 60
Gln Asp Ala Thr Asp Pro Phe Val Ala Phe His Ile Asn Lys Gly Leu
65 70 75 80
Val Lys Lys Tyr Met Asn Ser Leu Leu Ile Gly Glu Leu Ser Pro Glu
85 90 95
Gln Pro Ser Phe Glu Pro Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe
100 105 110
Arg Glu Leu Arg Ala Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn
115 120 125
His Val Phe Phe Leu Leu Tyr Leu Leu His Ile Leu Leu Leu Asp Gly
130 135 140
Ala Ala Trp Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe
145 150 155 160
Leu Leu Cys Ala Val Leu Leu Ser Ala Val Gln Ala Gln Ala Gly Trp
165 170 175
Leu Gln His Asp Phe Gly His Leu Ser Val Phe Ser Thr Ser Lys Trp
180 185 190
Asn His Leu Leu His His Phe Val Ile Gly His Leu Lys Gly Ala Pro
195 200 205
Ala Ser Trp Trp Asn His Met His Phe Gln His His Ala Lys Pro Asn
210 215 220
Cys Phe Arg Lys Asp Pro Asp Ile Asn Met His Pro Phe Phe Phe Ala
225 230 235 240
Leu Gly Lys Ile Leu Ser Val Glu Leu Gly Lys Gln Lys Lys Lys Tyr
245 250 255
Met Pro Tyr Asn His Gln His Lys Tyr Phe Phe Leu Ile Gly Pro Pro
260 265 270
Ala Leu Leu Pro Leu Tyr Phe Gln Trp Tyr Ile Phe Tyr Phe Val Ile
275 280 285
Gln Arg Lys Lys Trp Val Asp Leu Ala Trp Met Ile Thr Phe Tyr Val
290 295 300
Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu
305 310 315 320
Gly Leu Phe Phe Ile Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp
325 330 335
Val Thr Gln Met Asn His Ile Pro Met His Ile Asp His Asp Arg Asn
340 345 350
Met Asp Trp Val Ser Thr Gln Leu Gln Ala Thr Cys Asn Val His Lys
355 360 365
Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu
370 375 380
His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Lys Val Ala
385 390 395 400
Pro Leu Val Gln Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser
405 410 415
Lys Pro Leu Leu Ser Ala Phe Ala Asp Ile Ile His Ser Leu Lys Glu
420 425 430
Ser Gly Gln Leu Trp Leu Asp Ala Tyr Leu His Gln
435 440
<210>20
<211>8867
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pZP3-P7U
<400>20
aattcaggag agaccgggtt ggcggcgtat ttgtgtccca aaaaacagcc ccaattgccc 60
caattgaccc caaattgacc cagtagcggg cccaaccccg gcgagagccc ccttcacccc 120
acatatcaaa cctcccccgg ttcccacact tgccgttaag ggcgtagggt actgcagtct 180
ggaatctacg cttgttcaga ctttgtacta gtttctttgt ctggccatcc gggtaaccca 240
tgccggacgc aaaatagact actgaaaatt tttttgcttt gtggttggga ctttagccaa 300
gggtataaaa gaccaccgtc cccgaattac ctttcctctt cttttctctc tctccttgtc 360
aactcacacc cgaaatcgtt aagcatttcc ttctgagtat aagaatcatt caccatggac 420
ttcctggagg cagaagaact tgttatggaa aagctcaaga gagagaagcc aagatactat 480
caagacatgt gtcgcaactt aattaagatg acgacatttg cgagctggac gaggaataga 540
tggagcgtgt gttctgagtc gatgttttct atggagttgt gagtgttagt agacatgatg 600
ggtttatata tgatgaatga atagatgtga ttttgatttg cacgatggaa ttgagaactt 660
tgtaaacgta catgggaatg tatgaatgtg ggggttttgt gactggataa ctgacggtca 720
gtggacgccg ttgttcaaat atccaagaga tgcgagaaac tttgggtcaa gtgaacatgt 780
cctctctgtt caagtaaacc atcaactatg ggtagtatat ttagtaagga caagagttga 840
gattctttgg agtcctagaa acgtattttc gcgttccaag atcaaattag tagagtaata 900
cgggcacggg aatccattca tagtctcaat tttcccatag gtgtgctaca aggtgttgag 960
atgtggtaca gtaccaccat gattcgaggt aaagagccca gaagtcattg atgaggtcaa 1020
gaaatacaca gatctacagc tcaatacaat gaatatcttc tttcatattc ttcaggtgac 1080
accaagggtg tctattttcc ccagaaatgc gtgaaaaggc gcgtgtgtag cgtggagtat 1140
gggttcggtt ggcgtatcct tcatatatcg acgaaatagt agggcaagag atgacaaaaa 1200
gtatctatat gtagacagcg tagaatatgg atttgattgg tataaattca tttattgcgt 1260
gtctcacaaa tactctcgat aagttggggt taaactggag atggaacaat gtcgatatct 1320
cgacgcatgc gacgtcgggc ccaattcgcc ctatagtgag tcgtattaca attcactggc 1380
cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 1440
agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc 1500
ccaacagttg cgcagcctga atggcgaatg gacgcgccct gtagcggcgc attaagcgcg 1560
gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct 1620
cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta 1680
aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa 1740
cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct 1800
ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc 1860
aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg 1920
ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt 1980
acaatttcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatca 2040
ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat 2100
tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa 2160
aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt 2220
tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag 2280
ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt 2340
tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg 2400
gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag 2460
aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta 2520
agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg 2580
acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta 2640
actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac 2700
accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt 2760
actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca 2820
cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag 2880
cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta 2940
gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag 3000
ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt 3060
tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 3120
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 3180
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 3240
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 3300
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 3360
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 3420
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 3480
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 3540
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 3600
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 3660
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 3720
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 3780
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 3840
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 3900
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 3960
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 4020
tgcagctggc gcgccaccaa tcacaattct gaaaagcaca tcttgatctc ctcattgcgg 4080
ggagtccaac ggtggtctta ttcccccgaa tttcccgctc aatctcgttc cagaccgacc 4140
cggacacagt gcttaacgcc gttccgaaac tctaccgcag atatgctcca acggactggg 4200
ctgcatagat gtgatcctcg gcttggagaa atggataaaa gccggccaaa aaaaaagcgg 4260
aaaaaagcgg aaaaaaagag aaaaaaaatc gcaaaatttg aaaaataggg ggaaaagacg 4320
caaaaacgca aggagggggg agtatatgac actgataagc aagctcacaa cggttcctct 4380
tatttttttc ctcatcttct gcctaggttc ccaaaatccc agatgcttct ctccagtgcc 4440
aaaagtaagt accccacagg ttttcggccg aaaattccac gtgcagcaac gtcgtgtggg 4500
gtgttaaaat gtgggggggg ggaaccagga caagaggctc ttgtgggagc cgaatgagag 4560
cacaaagcgg gcgggtgtga taagggcatt tttgcccatt ttcccttctc ctgtctctcc 4620
gacggtgatg gcgttgtgcg tcctctattt ctttttattt ctttttgttt tatttctctg 4680
actaccgatt tggtttgatt tcctcaaccc cacacaaata agctcgggcc gaggaatata 4740
tatatacacg gacacagtcg ccctgtggac aacacgtcac tacctctacg atacacaccg 4800
tacgttgtgt ggaagcttgt gagcggataa caatttcaca caggaaacag ctatgaccat 4860
gattacgcca agctcgaaat taaccctcac taaagggaac aaaagctgga gctccaccgc 4920
ggacacaata tctggtcaaa tttcagtttc gttacattta aattccttca cttcaagttc 4980
attcttcatc tgcttctgtt ttactttgac aggcaaatga agacatggta cgacttgatg 5040
gaggccaaga acgccatttc accccgagac accgaagtgc ctgaaatcct ggctgccccc 5100
attgataaca tcggaaacta cggtattccg gaaagtgtat atagaacctt tccccagctt 5160
gtgtctgtgg atatggatgg tgtaatcccc tttgagtact cgtcttggct tctctccgag 5220
cagtatgagg ctctctaatc tagcgcattt aatatctcaa tgtatttata tatttatctt 5280
ctcatgcggc cgcttaggtg gccttagtct tggtagcggc ttcggaaaca gccttggcct 5340
ccttgagagt gaacagcttg gcatccgagt cgaccacacc gtagtttgcg tagagtcggc 5400
cgactctgaa aaaggccttg atgatgggtt cgtcggactt tcgaaccagc tcgggaaagg 5460
cttgatggaa ggcctcggta gctcgcttga gcttgtagtg aggaatgatg ggaaagaggt 5520
gatgaatctg gtgggttccg atgttgtgag aaaggttgtc gatgagggca ccgtaggatc 5580
ggtcgacaga ggacaggttg cctttgacgt aggtccactc cgaatctgcg taccaaggtg 5640
tctcttcgtc gttgtgatga agaaaggtag taatgacgag catggatccg aacacaaaga 5700
cgggaccata gtagtagatg gccatagtct tgaagccgaa ctgcaggctg aggtagatgg 5760
acagggcagc cactccaaag tgcgcagcca gcgaaatgac cacggcagag acctgtcgaa 5820
caaagagagg ctcgaagggg ttgaagtggt tgacctttcg gggaggaaag ccttcgacca 5880
ggtaggcaaa ccaggctgca cccaaagcca gaatgaggtt tcgagacaga ggatgatcgt 5940
cggcctttcg ttgagggtag aagatttcgt ctcggtcgat gttgccagtg ttcttgtgat 6000
ggtgtctgtg ggtgagcttc caggactcga agggtgtaag gatgagagag tgaatgaagg 6060
ttccgatgac aaagttgagc aggtggtatc gggagaaggc accatgtcca gcgtcgtgac 6120
caacagtaaa gaatccccag aagacgatgc cctggagcag cacgtaaccg gtgcagagag 6180
cggcatccag agcccagagg ccttcgacaa cgggcagaga tcgtgcatgg tgaagtccga 6240
aggcgagcga cacagcaatg accaggcatc gaacggtgta gtacagagag agaggcacgg 6300
aggcctcgaa acactcggag ggcagagatc gcttgatctc ggtcagagta gggaactggt 6360
agggctgctt ggtagccatg gttgtgaatt agggtggtga gaatggttgg ttgtagggaa 6420
gaatcaaagg ccggtctcgg gatccgtggg tatatatata tatatatata tatacgatcc 6480
ttcgttacct ccctgttctc aaaactgtgg tttttcgttt ttcgtttttt gctttttttg 6540
atttttttag ggccaactaa gcttccagat ttcgctaatc acctttgtac taattacaag 6600
aaaggaagaa gctgattaga gttgggcttt ttatgcaact gtgctactcc ttatctctga 6660
tatgaaagtg tagacccaat cacatcatgt catttagagt tggtaatact gggaggatag 6720
ataaggcacg aaaacgagcc atagcagaca tgctgggtgt agccaagcag aagaaagtag 6780
atgggagcca attgacgagc gagggagcta cgccaatccg acatacgaca cgctgagatc 6840
gtcttggccg gggggtacct acagatgtcc aagggtaagt gcttgactgt aattgtatgt 6900
ctgaggacaa atatgtagtc agccgtataa agtcatacca ggcaccagtg ccatcatcga 6960
accactaact ctctatgata catgcctccg gtattattgt accatgcgtc gctttgttac 7020
atacgtatct tgcctttttc tctcagaaac tccagacttt ggctattggt cgagataagc 7080
ccggaccata gtgagtcttt cacactctac atttctccct tgctccaact atcgattgtt 7140
gtctactaac tatcgtacga taacttcgta tagcatacat tatacgaagt tatcgcgtcg 7200
acgagtatct gtctgactcg tcattgccgc ctttggagta cgactccaac tatgagtgtg 7260
cttggatcac tttgacgata cattcttcgt tggaggctgt gggtctgaca gctgcgtttt 7320
cggcgcggtt ggccgacaac aatatcagct gcaacgtcat tgctggcttt catcatgatc 7380
acatttttgt cggcaaaggc gacgcccaga gagccattga cgttctttct aatttggacc 7440
gatagccgta tagtccagtc tatctataag ttcaactaac tcgtaactat taccataaca 7500
tatacttcac tgccccagat aaggttccga taaaaagttc tgcagactaa atttatttca 7560
gtctcctctt caccaccaaa atgccctcct acgaagctcg agctaacgtc cacaagtccg 7620
cctttgccgc tcgagtgctc aagctcgtgg cagccaagaa aaccaacctg tgtgcttctc 7680
tggatgttac caccaccaag gagctcattg agcttgccga taaggtcgga ccttatgtgt 7740
gcatgatcaa aacccatatc gacatcattg acgacttcac ctacgccggc actgtgctcc 7800
ccctcaagga acttgctctt aagcacggtt tcttcctgtt cgaggacaga aagttcgcag 7860
atattggcaa cactgtcaag caccagtacc ggtgtcaccg aatcgccgag tggtccgata 7920
tcaccaacgc ccacggtgta cccggaaccg gaatcattgc tggcctgcga gctggtgccg 7980
aggaaactgt ctctgaacag aagaaggagg acgtctctga ctacgagaac tcccagtaca 8040
aggagttcct agtcccctct cccaacgaga agctggccag aggtctgctc atgctggccg 8100
agctgtcttg caagggctct ctggccactg gcgagtactc caagcagacc attgagcttg 8160
cccgatccga ccccgagttt gtggttggct tcattgccca gaaccgacct aagggcgact 8220
ctgaggactg gcttattctg acccccgggg tgggtcttga cgacaaggga gacgctctcg 8280
gacagcagta ccgaactgtt gaggatgtca tgtctaccgg aacggatatc ataattgtcg 8340
gccgaggtct gtacggccag aaccgagatc ctattgagga ggccaagcga taccagaagg 8400
ctggctggga ggcttaccag aagattaact gttagaggtt agactatgga tatgtaattt 8460
aactgtgtat atagagagcg tgcaagtatg gagcgcttgt tcagcttgta tgatggtcag 8520
acgacctgtc tgatcgagta tgtatgatac tgcacaacct gtgtatccgc atgatctgtc 8580
caatggggca tgttgttgtg tttctcgata cggagatgct gggtacagtg ctaatacgtt 8640
gaactactta tacttatatg aggctcgaag aaagctgact tgtgtatgac ttattctcaa 8700
ctacatcccc agtcacaata ccaccactgc actaccacta caccaaaacc atgatcaaac 8760
cacccatgga cttcctggag gcagaagaac ttgttatgga aaagctcaag agagagatca 8820
taacttcgta tagcatacat tatacgaagt tatcctgcag gtaaagg 8867
<210>21
<211>34
<212>DNA
<213>Escherichia coli
<400>21
ataacttcgt ataatgtatg ctatacgaag ttat 34
<210>22
<211>14655
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pZKLeuN-29E3
<220>
<221>misc_feature
<222>(8822)..(8822)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(8827)..(8830)
<223>n is a, c, g, or t
<400>22
cgattgttgt ctactaacta tcgtacgata acttcgtata gcatacatta tacgaagtta 60
tcgcgtcgac gagtatctgt ctgactcgtc attgccgcct ttggagtacg actccaacta 120
tgagtgtgct tggatcactt tgacgataca ttcttcgttg gaggctgtgg gtctgacagc 180
tgcgttttcg gcgcggttgg ccgacaacaa tatcagctgc aacgtcattg ctggctttca 240
tcatgatcac atttttgtcg gcaaaggcga cgcccagaga gccattgacg ttctttctaa 300
tttggaccga tagccgtata gtccagtcta tctataagtt caactaactc gtaactatta 360
ccataacata tacttcactg ccccagataa ggttccgata aaaagttctg cagactaaat 420
ttatttcagt ctcctcttca ccaccaaaat gccctcctac gaagctcgag ctaacgtcca 480
caagtccgcc tttgccgctc gagtgctcaa gctcgtggca gccaagaaaa ccaacctgtg 540
tgcttctctg gatgttacca ccaccaagga gctcattgag cttgccgata aggtcggacc 600
ttatgtgtgc atgatcaaaa cccatatcga catcattgac gacttcacct acgccggcac 660
tgtgctcccc ctcaaggaac ttgctcttaa gcacggtttc ttcctgttcg aggacagaaa 720
gttcgcagat attggcaaca ctgtcaagca ccagtaccgg tgtcaccgaa tcgccgagtg 780
gtccgatatc accaacgccc acggtgtacc cggaaccgga atcattgctg gcctgcgagc 840
tggtgccgag gaaactgtct ctgaacagaa gaaggaggac gtctctgact acgagaactc 900
ccagtacaag gagttcctag tcccctctcc caacgagaag ctggccagag gtctgctcat 960
gctggccgag ctgtcttgca agggctctct ggccactggc gagtactcca agcagaccat 1020
tgagcttgcc cgatccgacc ccgagtttgt ggttggcttc attgcccaga accgacctaa 1080
gggcgactct gaggactggc ttattctgac ccccggggtg ggtcttgacg acaagggaga 1140
cgctctcgga cagcagtacc gaactgttga ggatgtcatg tctaccggaa cggatatcat 1200
aattgtcggc cgaggtctgt acggccagaa ccgagatcct attgaggagg ccaagcgata 1260
ccagaaggct ggctgggagg cttaccagaa gattaactgt tagaggttag actatggata 1320
tgtaatttaa ctgtgtatat agagagcgtg caagtatgga gcgcttgttc agcttgtatg 1380
atggtcagac gacctgtctg atcgagtatg tatgatactg cacaacctgt gtatccgcat 1440
gatctgtcca atggggcatg ttgttgtgtt tctcgatacg gagatgctgg gtacagtgct 1500
aatacgttga actacttata cttatatgag gctcgaagaa agctgacttg tgtatgactt 1560
attctcaact acatccccag tcacaatacc accactgcac taccactaca ccaaaaccat 1620
gatcaaacca cccatggact tcctggaggc agaagaactt gttatggaaa agctcaagag 1680
agagatcata acttcgtata gcatacatta tacgaagtta tcctgcaggt aaaggaattc 1740
tggagtttct gagagaaaaa ggcaagatac gtatgtaaca aagcgacgca tggtacaata 1800
ataccggagg catgtatcat agagagttag tggttcgatg atggcactgg tgcctggtat 1860
gactttatac ggctgactac atatttgtcc tcagacatac aattacagtc aagcacttac 1920
ccttggacat ctgtaggtac cccccggcca agacgatctc agcgtgtcgt atgtcggatt 1980
ggcgtagctc cctcgctcgt caattggctc ccatctactt tcttctgctt ggctacaccc 2040
agcatgtctg ctatggctcg ttttcgtgcc ttatctatcc tcccagtatt accaactcta 2100
aatgacatga tgtgattggg tctacacttt catatcagag ataaggagta gcacagttgc 2160
ataaaaagcc caactctaat cagcttcttc ctttcttgta attagtacaa aggtgattag 2220
cgaaatctgg aagcttagtt ggccctaaaa aaatcaaaaa aagcaaaaaa cgaaaaacga 2280
aaaaccacag ttttgagaac agggaggtaa cgaaggatcg tatatatata tatatatata 2340
tatacccacg gatcccgaga ccggcctttg attcttccct acaaccaacc attctcacca 2400
ccctaattca caaccatgga gtctggaccc atgcctgctg gcattccctt ccctgagtac 2460
tatgacttct ttatggactg gaagactccc ctggccatcg ctgccaccta cactgctgcc 2520
gtcggtctct tcaaccccaa ggttggcaag gtctcccgag tggttgccaa gtcggctaac 2580
gcaaagcctg ccgagcgaac ccagtccgga gctgccatga ctgccttcgt ctttgtgcac 2640
aacctcattc tgtgtgtcta ctctggcatc accttctact acatgtttcc tgctatggtc 2700
aagaacttcc gaacccacac actgcacgaa gcctactgcg acacggatca gtccctctgg 2760
aacaacgcac ttggctactg gggttacctc ttctacctgt ccaagttcta cgaggtcatt 2820
gacaccatca tcatcatcct gaagggacga cggtcctcgc tgcttcagac ctaccaccat 2880
gctggagcca tgattaccat gtggtctggc atcaactacc aagccactcc catttggatc 2940
tttgtggtct tcaactcctt cattcacacc atcatgtact gttactatgc cttcacctct 3000
atcggattcc atcctcctgg caaaaagtac ctgacttcga tgcagattac tcagtttctg 3060
gtcggtatca ccattgccgt gtcctacctc ttcgttcctg gctgcatccg aacacccggt 3120
gctcagatgg ctgtctggat caacgtcggc tacctgtttc ccttgaccta tctgttcgtg 3180
gactttgcca agcgaaccta ctccaagcga tctgccattg ccgctcagaa aaaggctcag 3240
taagcggccg cattgatgat tggaaacaca cacatgggtt atatctaggt gagagttagt 3300
tggacagtta tatattaaat cagctatgcc aacggtaact tcattcatgt caacgaggaa 3360
ccagtgactg caagtaatat agaatttgac caccttgcca ttctcttgca ctcctttact 3420
atatctcatt tatttcttat atacaaatca cttcttcttc ccagcatcga gctcggaaac 3480
ctcatgagca ataacatcgt ggatctcgtc aatagagggc tttttggact ccttgctgtt 3540
ggccaccttg tccttgctgt ctggctcatt ctgtttcaac gccttttaat taacggagta 3600
ggtctcggtg tcggaagcga cgccagatcc gtcatcctcc tttcgctctc caaagtagat 3660
acctccgacg agctctcgga caatgatgaa gtcggtgccc tcaacgtttc ggatggggga 3720
gagatcggcg agcttgggcg acagcagctg gcagggtcgc aggttggcgt acaggttcag 3780
gtcctttcgc agcttgagga gaccctgctc gggtcgcacg tcggttcgtc cgtcgggagt 3840
ggtccatacg gtgttggcag cgcctccgac agcaccgagc ataatagagt cagcctttcg 3900
gcagatgtcg agagtagcgt cggtgatggg ctcgccctcc ttctcaatgg cagctcctcc 3960
aatgagtcgg tcctcaaaca caaactcggt gccggaggcc tcagcaacag acttgagcac 4020
cttgacggcc tcggcaatca cctcggggcc acagaagtcg ccgccgagaa gaacaatctt 4080
cttggagtca gtcttggtct tcttagtttc gggttccatt gtggatgtgt gtggttgtat 4140
gtgtgatgtg gtgtgtggag tgaaaatctg tggctggcaa acgctcttgt atatatacgc 4200
acttttgccc gtgctatgtg gaagactaaa cctccgaaga ttgtgactca ggtagtgcgg 4260
tatcggctag ggacccaaac cttgtcgatg ccgatagcat gcgacgtcgg gcccaattcg 4320
ccctatagtg agtcgtatta caattcactg gccgtcgttt tacaacgtcg tgactgggaa 4380
aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 4440
aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 4500
tggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 4560
ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 4620
ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat 4680
ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 4740
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 4800
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 4860
tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 4920
ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc ctgatgcggt attttctcct 4980
tacgcatctg tgcggtattt cacaccgcat caggtggcac ttttcgggga aatgtgcgcg 5040
gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 5100
aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc 5160
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa 5220
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac 5280
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga 5340
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag 5400
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca 5460
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca 5520
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa 5580
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc 5640
tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa 5700
cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag 5760
actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct 5820
ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac 5880
tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa 5940
ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt 6000
aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat 6060
ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg 6120
agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc 6180
ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg 6240
tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag 6300
cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact 6360
ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg 6420
gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc 6480
ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg 6540
aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg 6600
cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag 6660
ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc 6720
gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct 6780
ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc 6840
ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc 6900
gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac 6960
cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcgcgcccac tgagctcgtc 7020
taacggactt gatatacaac caattaaaac aaatgaaaag aaatacagtt ctttgtatca 7080
tttgtaacaa ttaccctgta caaactaagg tattgaaatc ccacaatatt cccaaagtcc 7140
acccctttcc aaattgtcat gcctacaact catataccaa gcactaacct accaaacacc 7200
actaaaaccc cacaaaatat atcttaccga atatacagta acaagctacc accacactcg 7260
ttgggtgcag tcgccagctt aaagatatct atccacatca gccacaactc ccttccttta 7320
ataaaccgac tacacccttg gctattgagg ttatgagtga atatactgta gacaagacac 7380
tttcaagaag actgtttcca aaacgtacca ctgtcctcca ctacaaacac acccaatctg 7440
cttcttctag tcaaggttgc tacaccggta aattataaat catcatttca ttagcagggc 7500
agggcccttt ttatagagtc ttatacacta gcggaccctg ccggtagacc aacccgcagg 7560
cgcgtcagtt tgctccttcc atcaatgcgt cgtagaaacg acttactcct tcttgagcag 7620
ctccttgacc ttgttggcaa caagtctccg acctcggagg tggaggaaga gcctccgata 7680
tcggcggtag tgataccagc ctcgacggac tccttgacgg cagcctcaac agcgtcaccg 7740
gcgggcttca tgttaagaga gaacttgagc atcatggcgg cagacagaat ggtggcgtac 7800
gcaactaaca tgaatgaata cgatatacat caaagactat gatacgcagt attgcacact 7860
gtacgagtaa gagcactagc cactgcactc aagtgaaacc gttgcccggg tacgagtatg 7920
agtatgtaca gtatgtttag tattgtactt ggacagtgct tgtatcgtac attctcaagt 7980
gtcaaacata aatatccgtt gctatatcct cgcaccacca cgtagctcgc tatatccctg 8040
tgttgaatcc atccatcttg gattgccaat tgtgcacaca gaaccgggca ctcacttccc 8100
catccacact tgcggccgcg cctacttaag caacgggctt gataacagcg gggggggtgc 8160
ccacgttgtt gcggttgcgg aagaacagaa cacccttacc agcaccctcg gcaccagcgc 8220
tgggctcaac ccactggcac atacgcgcac tgcggtacat ggcgcggatg aagccacgag 8280
gaccatcctg gacatcagcc cggtagtgct tgcccatgat gggcttaatg gcctcggtgg 8340
cctcgtccgc gttgtagaag gggatgctgc tgacgtagtg gtggaggaca tgagtctcga 8400
tgatgccgtg gagaaggtgg cggccgatga agcccatctc acggtcaatg gtagcagcgg 8460
caccacggac gaagttccac tcgtcgttgg tgtagtgggg aagggtaggg tcggtgtgct 8520
ggaggaaggt gatggcaacg agccagtggt taacccagag gtagggaaca aagtaccaga 8580
tggccatgtt gtagaaaccg aacttctgaa cgaggaagta cagagcagtg gccatcagac 8640
cgataccaat atcgctgagg acgatgagct tagcgtcact gttctcgtac agagggctgc 8700
ggggatcgaa gtggttaaca ccaccgccga ggccgttatg cttgcccttg ccgcgaccct 8760
cacgctggcg ctcgtggtag ttgtggccgg taacattggt gatgaggtag ttgggccagc 8820
cnacgannnn ctcagtaaga tgagcgagct cgtgggtcat ctttccgaga cgagtagcct 8880
gctgctcgcg ggttcgggga acgaagacca tgtcacgctc catgttgcca gtggccttgt 8940
ggtgctttcg gtgggagatt tgccagctga agtaggggac aaggagggaa gagtgaagaa 9000
cccagccagt aatgtcgttg atgatgcgag aatcggagaa agcaccgtga ccgcactcat 9060
gggcaataac ccagagacca gtaccgaaaa gaccctgaag aacggtgtac acggcccaca 9120
gaccagcgcg ggcgggggtg gaggggatat attcgggggt cacaaagttg taccagatgc 9180
tgaaagtggt agtcaggagg acaatgtcgc ggaggatata accgtatccc ttgagagcgg 9240
agcgcttgaa gcagtgctta gggatggcat tgtagatgtc cttgatggta aagtcgggaa 9300
cctcgaactg gttgccgtag gtgtcgagca tgacaccata ctcggacttg ggcttggcga 9360
tatcaacctc ggacatggac gagagcgatg tggaagaggc cgagtggcgg ggagagtctg 9420
aaggagagac ggcggcagac tcagaatccg tcacagtagt tgaggtgacg gtgcgtctaa 9480
gcgcagggtt ctgcttgggc agagccgaag tggacgccat ggttgatgtg tgtttaattc 9540
aagaatgaat atagagaaga gaagaagaaa aaagattcaa ttgagccggc gatgcagacc 9600
cttatataaa tgttgccttg gacagacgga gcaagcccgc ccaaacctac gttcggtata 9660
atatgttaag ctttttaaca caaaggtttg gcttggggta acctgatgtg gtgcaaaaga 9720
ccgggcgttg gcgagccatt gcgcgggcga atggggccgt gactcgtctc aaattcgagg 9780
gcgtgcctca attcgtgccc ccgtggcttt ttcccgccgt ttccgccccg tttgcaccac 9840
tgcagccgct tctttggttc ggacaccttg ctgcgagcta ggtgccttgt gctacttaaa 9900
aagtggcctc ccaacaccaa catgacatga gtgcgtgggc caagacacgt tggcggggtc 9960
gcagtcggct caatggcccg gaaaaaacgc tgctggagct ggttcggacg cagtccgccg 10020
cggcgtatgg atatccgcaa ggttccatag cgccattgcc ctccgtcggc gtctatcccg 10080
caacctctaa atagagcggg aatataaccc aagcttcttt tttttccttt aacacgcaca 10140
cccccaacta tcatgttgct gctgctgttt gactctactc tgtggagggg tgctcccacc 10200
caacccaacc tacaggtgga tccggcgctg tgattggctg ataagtctcc tatccggact 10260
aattctgacc aatgggacat gcgcgcagga cccaaatgcc gcaattacgt aaccccaacg 10320
aaatgcctac ccctctttgg agcccagcgg ccccaaatcc ccccaagcag cccggttcta 10380
ccggcttcca tctccaagca caagcagccc ggttctaccg gcttccatct ccaagcaccc 10440
ctttctccac accccacaaa aagacccgtg caggacatcc tactgcgtcg acatcattta 10500
aattccttca cttcaagttc attcttcatc tgcttctgtt ttactttgac aggcaaatga 10560
agacatggta cgacttgatg gaggccaaga acgccatttc accccgagac accgaagtgc 10620
ctgaaatcct ggctgccccc attgataaca tcggaaacta cggtattccg gaaagtgtat 10680
atagaacctt tccccagctt gtgtctgtgg atatggatgg tgtaatcccc tttgagtact 10740
cgtcttggct tctctccgag cagtatgagg ctctctaatc tagcgcattt aatatctcaa 10800
tgtatttata tatttatctt ctcatgcggc cgctcactga atctttttgg ctcccttgtg 10860
cttcctgacg atatacgttt gcacatagaa attcaagaac aaacacaaga ctgtgccaac 10920
ataaaagtaa ttgaagaacc agccaaacat cctcatccca tcttggcgat aacagggaat 10980
gttcctgtac ttccagacaa tgtagaaacc aacattgaat tgaatgatct gcattgatgt 11040
aatcagggat tttggcatgg ggaacttcag cttgatcaat ctggtccaat aataaccgta 11100
catgatccag tggatgaaac cattcaacag cacaaaaatc caaacagctt catttcggta 11160
attatagaac agccacatat ccatcggtgc ccccaaatga tggaagaatt gcaaccaggt 11220
cagaggcttg cccatcagtg gcaaatagaa ggagtcaata tactccagga acttgctcaa 11280
atagaacaac tgcgtggtga tcctgaagac gttgttgtca aaagccttct cgcagttgtc 11340
agacataaca ccgatggtgt acatggcata tgccattgag aggaatgatc ccaacgaata 11400
aatggacatg agaaggttgt aattggtgaa aacaaacttc atacgagact gaccttttgg 11460
accaaggggg ccaagagtga acttcaagat gacaaatgcg atggacaagt aaagcacctc 11520
acagtgactg gcatcactcc agagttgggc ataatcaact ggttgggtaa aacttcctgc 11580
ccaattgaga ctatttcatt caccacctcc atggccattg ctgtagatat gtcttgtgtg 11640
taagggggtt ggggtggttg tttgtgttct tgacttttgt gttagcaagg gaagacgggc 11700
aaaaaagtga gtgtggttgg gagggagaga cgagccttat atataatgct tgtttgtgtt 11760
tgtgcaagtg gacgccgaaa cgggcaggag ccaaactaaa caaggcagac aatgcgagct 11820
taattggatt gcctgatggg caggggttag ggctcgatca atgggggtgc gaagtgacaa 11880
aattgggaat taggttcgca agcaaggctg acaagacttt ggcccaaaca tttgtacgcg 11940
gtggacaaca ggagccaccc atcgtctgtc acgggctagc cggtcgtgcg tcctgtcagg 12000
ctccacctag gctccatgcc actccataca atcccactag tgtaccgcta ggccgctttt 12060
agctcccatc taagaccccc ccaaaacctc cactgtacag tgcactgtac tgtgtggcga 12120
tcaagggcaa gggaaaaaag gcgcaaacat gcacgcatgg aatgacgtag gtaaggcgtt 12180
actagactga aaagtggcac atttcggcgt gccaaagggt cctaggtgcg tttcgcgagc 12240
tgggcgccag gccaagccgc tccaaaacgc ctctccgact ccctccagcg gcctccatat 12300
ccccatccct ctccacagca atgttgttaa gccttgcaaa cgaaaaaata gaaaggctaa 12360
taagcttcca atattgtggt gtacgctgca taacgcaaca atgagcgcca aacaacacac 12420
acacacagca cacagcagca ttaaccacga tgaacagcat gacattacag gtgggtgtgt 12480
aatcagggcc ctgattgctg gtggtgggag cccccatcat gggcagatct gcgtacactg 12540
tttaaacagt gtacgcagat ctactataga ggaacattta aattgccccg gagaagacgg 12600
ccaggccgcc tagatgacaa attcaacaac tcacagctga ctttctgcca ttgccactag 12660
gggggggcct ttttatatgg ccaagccaag ctctccacgt cggttgggct gcacccaaca 12720
ataaatgggt agggttgcac caacaaaggg atgggatggg gggtagaaga tacgaggata 12780
acggggctca atggcacaaa taagaacgaa tactgccatt aagactcgtg atccagcgac 12840
tgacaccatt gcatcatcta agggcctcaa aactacctcg gaactgctgc gctgatctgg 12900
acaccacaga ggttccgagc actttaggtt gcaccaaatg tcccaccagg tgcaggcaga 12960
aaacgctgga acagcgtgta cagtttgtct taacaaaaag tgagggcgct gaggtcgagc 13020
agggtggtgt gacttgttat agcctttaga gctgcgaaag cgcgtatgga tttggctcat 13080
caggccagat tgagggtctg tggacacatg tcatgttagt gtacttcaat cgccccctgg 13140
atatagcccc gacaataggc cgtggcctca tttttttgcc ttccgcacat ttccattgct 13200
cgatacccac accttgcttc tcctgcactt gccaacctta atactggttt acattgacca 13260
acatcttaca agcggggggc ttgtctaggg tatatataaa cagtggctct cccaatcggt 13320
tgccagtctc ttttttcctt tctttcccca cagattcgaa atctaaacta cacatcacag 13380
aattccgagc cgtgagtatc cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat 13440
gacacaatcc gaaagtcgct agcaacacac actctctaca caaactaacc cagctctggt 13500
accatggagg tcgtgaacga aatcgtctcc attggccagg aggttcttcc caaggtcgac 13560
tatgctcagc tctggtctga tgcctcgcac tgcgaggtgc tgtacctctc catcgccttc 13620
gtcatcctga agttcaccct tggtcctctc ggacccaagg gtcagtctcg aatgaagttt 13680
gtgttcacca actacaacct gctcatgtcc atctactcgc tgggctcctt cctctctatg 13740
gcctacgcca tgtacaccat tggtgtcatg tccgacaact gcgagaaggc tttcgacaac 13800
aatgtcttcc gaatcaccac tcagctgttc tacctcagca agttcctcga gtacattgac 13860
tccttctatc tgcccctcat gggcaagcct ctgacctggt tgcagttctt tcaccatctc 13920
ggagctccta tggacatgtg gctgttctac aactaccgaa acgaagccgt ttggatcttt 13980
gtgctgctca acggcttcat tcactggatc atgtacggct actattggac ccgactgatc 14040
aagctcaagt tccctatgcc caagtccctg attacttcta tgcagatcat tcagttcaac 14100
gttggcttct acatcgtctg gaagtaccgg aacattccct gctaccgaca agatggaatg 14160
agaatgtttg gctggttttt caactacttc tacgttggta ctgtcctgtg tctgttcctc 14220
aacttctacg tgcagaccta catcgtccga aagcacaagg gagccaaaaa gattcagtga 14280
gcggccgcat gtacatacaa gattatttat agaaatgaat cgcgatcgaa caaagagtac 14340
gagtgtacga gtaggggatg atgataaaag tggaagaagt tccgcatctt tggatttatc 14400
aacgtgtagg acgatacttc ctgtaaaaat gcaatgtctt taccataggt tctgctgtag 14460
atgttattaa ctaccattaa catgtctact tgtacagttg cagaccagtt ggagtataga 14520
atggtacact taccaaaaag tgttgatggt tgtaactacg atatataaaa ctgttgacgg 14580
gatccccgct gatatgccta aggaacaatc aaagaggaag atattaattc agaatgctag 14640
tatacagtta gggat 14655
<210>23
<211>777
<212>DNA
<213>Euglena gracilis
<220>
<221>misc_feature
<223>Synthetic Δ-9 elongase (codon-optimized)
<400>23
atggaggtcg tgaacgaaat cgtctccatt ggccaggagg ttcttcccaa ggtcgactat 60
gctcagctct ggtctgatgc ctcgcactgc gaggtgctgt acctctccat cgccttcgtc 120
atcctgaagt tcacccttgg tcctctcgga cccaagggtc agtctcgaat gaagtttgtg 180
ttcaccaact acaacctgct catgtccatc tactcgctgg gctccttcct ctctatggcc 240
tacgccatgt acaccattgg tgtcatgtcc gacaactgcg agaaggcttt cgacaacaat 300
gtcttccgaa tcaccactca gctgttctac ctcagcaagt tcctcgagta cattgactcc 360
ttctatctgc ccctcatggg caagcctctg acctggttgc agttctttca ccatctcgga 420
gctcctatgg acatgtggct gttctacaac taccgaaacg aagccgtttg gatctttgtg 480
ctgctcaacg gcttcattca ctggatcatg tacggctact attggacccg actgatcaag 540
ctcaagttcc ctatgcccaa gtccctgatt acttctatgc agatcattca gttcaacgtt 600
ggcttctaca tcgtctggaa gtaccggaac attccctgct accgacaaga tggaatgaga 660
atgtttggct ggtttttcaa ctacttctac gttggtactg tcctgtgtct gttcctcaac 720
ttctacgtgc agacctacat cgtccgaaag cacaagggag ccaaaaagat tcagtga 777
<210>24
<211>258
<212>PRT
<213>Euglena gracilis
<220>
<221>misc_feature
<222>(1)..(258)
<223>Δ-9 elongase (EgD9e)
<400>24
Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro
1 5 10 15
Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val
20 25 30
Leu Tyr Leu Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro
35 40 45
Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr
50 55 60
Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala
65 70 75 80
Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala
85 90 95
Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser
100 105 110
Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys
115 120 125
Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp
130 135 140
Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val
145 150 155 160
Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr
165 170 175
Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser
180 185 190
Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr
195 200 205
Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp
210 215 220
Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn
225 230 235 240
Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys
245 250 255
Ile Gln
<210>25
<211>828
<212>DNA
<213>Mortierella alpina
<220>
<221>CDS
<222>(1)..(828)
<223>Synthetic C16/18 elongase (codon-optimized)
<400>25
atg gag tct gga ccc atg cct gct ggc att ccc ttc cct gag tac tat 48
Met Glu Ser Gly Pro Met Pro Ala Gly Ile Pro Phe Pro Glu Tyr Tyr
1 5 10 15
gac ttc ttt atg gac tgg aag act ccc ctg gcc atc gct gcc acc tac 96
Asp Phe Phe Met Asp Trp Lys Thr Pro Leu Ala Ile Ala Ala Thr Tyr
20 25 30
act gct gcc gtc ggt ctc ttc aac ccc aag gtt ggc aag gtc tcc cga 144
Thr Ala Ala Val Gly Leu Phe Asn Pro Lys Val Gly Lys Val Ser Arg
35 40 45
gtg gtt gcc aag tcg gct aac gca aag cct gcc gag cga acc cag tcc 192
Val Val Ala Lys Ser Ala Asn Ala Lys Pro Ala Glu Arg Thr Gln Ser
50 55 60
gga gct gcc atg act gcc ttc gtc ttt gtg cac aac ctc att ctg tgt 240
Gly Ala Ala Met Thr Ala Phe Val Phe Val His Asn Leu Ile Leu Cys
65 70 75 80
gtc tac tct ggc atc acc ttc tac tac atg ttt cct gct atg gtc aag 288
Val Tyr Ser Gly Ile Thr Phe Tyr Tyr Met Phe Pro Ala Met Val Lys
85 90 95
aac ttc cga acc cac aca ctg cac gaa gcc tac tgc gac acg gat cag 336
Asn Phe Arg Thr His Thr Leu His Glu Ala Tyr Cys Asp Thr Asp Gln
100 105 110
tcc ctc tgg aac aac gca ctt ggc tac tgg ggt tac ctc ttc tac ctg 384
Ser Leu Trp Asn Asn Ala Leu Gly Tyr Trp Gly Tyr Leu Phe Tyr Leu
115 120 125
tcc aag ttc tac gag gtc att gac acc atc atc atc atc ctg aag gga 432
Ser Lys Phe Tyr Glu Val Ile Asp Thr Ile Ile Ile Ile Leu Lys Gly
130 135 140
cga cgg tcc tcg ctg ctt cag acc tac cac cat gct gga gcc atg att 480
Arg Arg Ser Ser Leu Leu Gln Thr Tyr His His Ala Gly Ala Met Ile
145 150 155 160
acc atg tgg tct ggc atc aac tac caa gcc act ccc att tgg atc ttt 528
Thr Met Trp Ser Gly Ile Asn Tyr Gln Ala Thr Pro Ile Trp Ile Phe
165 170 175
gtg gtc ttc aac tcc ttc att cac acc atc atg tac tgt tac tat gcc 576
Val Val Phe Asn Ser Phe Ile His Thr Ile Met Tyr Cys Tyr Tyr Ala
180 185 190
ttc acc tct atc gga ttc cat cct cct ggc aaa aag tac ctg act tcg 624
Phe Thr Ser Ile Gly Phe His Pro Pro Gly Lys Lys Tyr Leu Thr Ser
195 200 205
atg cag att act cag ttt ctg gtc ggt atc acc att gcc gtg tcc tac 672
Met Gln Ile Thr Gln Phe Leu Val Gly Ile Thr Ile Ala Val Ser Tyr
210 215 220
ctc ttc gtt cct ggc tgc atc cga aca ccc ggt gct cag atg gct gtc 720
Leu Phe Val Pro Gly Cys Ile Arg Thr Pro Gly Ala Gln Met Ala Val
225 230 235 240
tgg atc aac gtc ggc tac ctg ttt ccc ttg acc tat ctg ttc gtg gac 768
Trp Ile Asn Val Gly Tyr Leu Phe Pro Leu Thr Tyr Leu Phe Val Asp
245 250 255
ttt gcc aag cga acc tac tcc aag cga tct gcc att gcc gct cag aaa 816
Phe Ala Lys Arg Thr Tyr Ser Lys Arg Ser Ala Ile Ala Ala Gln Lys
260 265 270
aag gct cag taa 828
Lys Ala Gln
275
<210>26
<211>275
<212>PRT
<213>Mortierella alpina
<400>26
Met Glu Ser Gly Pro Met Pro Ala Gly Ile Pro Phe Pro Glu Tyr Tyr
1 5 10 15
Asp Phe Phe Met Asp Trp Lys Thr Pro Leu Ala Ile Ala Ala Thr Tyr
20 25 30
Thr Ala Ala Val Gly Leu Phe Asn Pro Lys Val Gly Lys Val Ser Arg
35 40 45
Val Val Ala Lys Ser Ala Asn Ala Lys Pro Ala Glu Arg Thr Gln Ser
50 55 60
Gly Ala Ala Met Thr Ala Phe Val Phe Val His Asn Leu Ile Leu Cys
65 70 75 80
Val Tyr Ser Gly Ile Thr Phe Tyr Tyr Met Phe Pro Ala Met Val Lys
85 90 95
Asn Phe Arg Thr His Thr Leu His Glu Ala Tyr Cys Asp Thr Asp Gln
100 105 110
Ser Leu Trp Asn Asn Ala Leu Gly Tyr Trp Gly Tyr Leu Phe Tyr Leu
115 120 125
Ser Lys Phe Tyr Glu Val Ile Asp Thr Ile Ile Ile Ile Leu Lys Gly
130 135 140
Arg Arg Ser Ser Leu Leu Gln Thr Tyr His His Ala Gly Ala Met Ile
145 150 155 160
Thr Met Trp Ser Gly Ile Asn Tyr Gln Ala Thr Pro Ile Trp Ile Phe
165 170 175
Val Val Phe Asn Ser Phe Ile His Thr Ile Met Tyr Cys Tyr Tyr Ala
180 185 190
Phe Thr Ser Ile Gly Phe His Pro Pro Gly Lys Lys Tyr Leu Thr Ser
195 200 205
Met Gln Ile Thr Gln Phe Leu Val Gly Ile Thr Ile Ala Val Ser Tyr
210 215 220
Leu Phe Val Pro Gly Cys Ile Arg Thr Pro Gly Ala Gln Met Ala Val
225 230 235 240
Trp Ile Asn Val Gly Tyr Leu Phe Pro Leu Thr Tyr Leu Phe Val Asp
245 250 255
Phe Ala Lys Arg Thr Tyr Ser Lys Arg Ser Ala Ile Ala Ala Gln Lys
260 265 270
Lys Ala Gln
275
<210>27
<211>8739
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pY116
<400>27
ggccgccacc gcggcccgag attccggcct cttcggccgc caagcgaccc gggtggacgt 60
ctagaggtac ctagcaatta acagatagtt tgccggtgat aattctctta acctcccaca 120
ctcctttgac ataacgattt atgtaacgaa actgaaattt gaccagatat tgtgtccgcg 180
gtggagctcc agcttttgtt ccctttagtg agggtttaaa cgagcttggc gtaatcatgg 240
tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa cgtacgagcc 300
ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 360
ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 420
ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact 480
gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta 540
atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag 600
caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 660
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 720
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 780
ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc 840
tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 900
gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 960
ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 1020
aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 1080
aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 1140
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 1200
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 1260
gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 1320
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 1380
gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 1440
tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 1500
gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 1560
ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 1620
actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 1680
ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg 1740
tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 1800
cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 1860
ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 1920
ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 1980
tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 2040
agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 2100
atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 2160
gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 2220
aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 2280
tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 2340
aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgcgccc 2400
tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 2460
gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 2520
ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 2580
cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 2640
tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 2700
ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 2760
ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 2820
tttaacaaaa tattaacgct tacaatttcc attcgccatt caggctgcgc aactgttggg 2880
aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg 2940
caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg 3000
ccagtgaatt gtaatacgac tcactatagg gcgaattggg taccgggccc cccctcgagg 3060
tcgatggtgt cgataagctt gatatcgaat tcatgtcaca caaaccgatc ttcgcctcaa 3120
ggaaacctaa ttctacatcc gagagactgc cgagatccag tctacactga ttaattttcg 3180
ggccaataat ttaaaaaaat cgtgttatat aatattatat gtattatata tatacatcat 3240
gatgatactg acagtcatgt cccattgcta aatagacaga ctccatctgc cgcctccaac 3300
tgatgttctc aatatttaag gggtcatctc gcattgttta ataataaaca gactccatct 3360
accgcctcca aatgatgttc tcaaaatata ttgtatgaac ttatttttat tacttagtat 3420
tattagacaa cttacttgct ttatgaaaaa cacttcctat ttaggaaaca atttataatg 3480
gcagttcgtt catttaacaa tttatgtaga ataaatgtta taaatgcgta tgggaaatct 3540
taaatatgga tagcataaat gatatctgca ttgcctaatt cgaaatcaac agcaacgaaa 3600
aaaatccctt gtacaacata aatagtcatc gagaaatatc aactatcaaa gaacagctat 3660
tcacacgtta ctattgagat tattattgga cgagaatcac acactcaact gtctttctct 3720
cttctagaaa tacaggtaca agtatgtact attctcattg ttcatacttc tagtcatttc 3780
atcccacata ttccttggat ttctctccaa tgaatgacat tctatcttgc aaattcaaca 3840
attataataa gatataccaa agtagcggta tagtggcaat caaaaagctt ctctggtgtg 3900
cttctcgtat ttatttttat tctaatgatc cattaaaggt atatatttat ttcttgttat 3960
ataatccttt tgtttattac atgggctgga tacataaagg tattttgatt taattttttg 4020
cttaaattca atcccccctc gttcagtgtc aactgtaatg gtaggaaatt accatacttt 4080
tgaagaagca aaaaaaatga aagaaaaaaa aaatcgtatt tccaggttag acgttccgca 4140
gaatctagaa tgcggtatgc ggtacattgt tcttcgaacg taaaagttgc gctccctgag 4200
atattgtaca tttttgcttt tacaagtaca agtacatcgt acaactatgt actactgttg 4260
atgcatccac aacagtttgt tttgtttttt tttgtttttt ttttttctaa tgattcatta 4320
ccgctatgta tacctacttg tacttgtagt aagccgggtt attggcgttc aattaatcat 4380
agacttatga atctgcacgg tgtgcgctgc gagttacttt tagcttatgc atgctacttg 4440
ggtgtaatat tgggatctgt tcggaaatca acggatgctc aaccgatttc gacagtaatt 4500
aattaatttg aatcgaatcg gagcctaaaa tgaacccgag tatatctcat aaaattctcg 4560
gtgagaggtc tgtgactgtc agtacaaggt gccttcatta tgccctcaac cttaccatac 4620
ctcactgaat gtagtgtacc tctaaaaatg aaatacagtg ccaaaagcca aggcactgag 4680
ctcgtctaac ggacttgata tacaaccaat taaaacaaat gaaaagaaat acagttcttt 4740
gtatcatttg taacaattac cctgtacaaa ctaaggtatt gaaatcccac aatattccca 4800
aagtccaccc ctttccaaat tgtcatgcct acaactcata taccaagcac taacctacca 4860
aacaccacta aaaccccaca aaatatatct taccgaatat acagtaacaa gctaccacca 4920
cactcgttgg gtgcagtcgc cagcttaaag atatctatcc acatcagcca caactccctt 4980
cctttaataa accgactaca cccttggcta ttgaggttat gagtgaatat actgtagaca 5040
agacactttc aagaagactg tttccaaaac gtaccactgt cctccactac aaacacaccc 5100
aatctgcttc ttctagtcaa ggttgctaca ccggtaaatt ataaatcatc atttcattag 5160
cagggcaggg ccctttttat agagtcttat acactagcgg accctgccgg tagaccaacc 5220
cgcaggcgcg tcagtttgct ccttccatca atgcgtcgta gaaacgactt actccttctt 5280
gagcagctcc ttgaccttgt tggcaacaag tctccgacct cggaggtgga ggaagagcct 5340
ccgatatcgg cggtagtgat accagcctcg acggactcct tgacggcagc ctcaacagcg 5400
tcaccggcgg gcttcatgtt aagagagaac ttgagcatca tggcggcaga cagaatggtg 5460
gcaatggggt tgaccttctg cttgccgaga tcgggggcag atccgtgaca gggctcgtac 5520
agaccgaacg cctcgttggt gtcgggcaga gaagccagag aggcggaggg cagcagaccc 5580
agagaaccgg ggatgacgga ggcctcgtcg gagatgatat cgccaaacat gttggtggtg 5640
atgatgatac cattcatctt ggagggctgc ttgatgagga tcatggcggc cgagtcgatc 5700
agctggtggt tgagctcgag ctgggggaat tcgtccttga ggactcgagt gacagtcttt 5760
cgccaaagtc gagaggaggc cagcacgttg gccttgtcaa gagaccacac gggaagaggg 5820
gggttgtgct gaagggccag gaaggcggcc attcgggcaa ttcgctcaac ctcaggaacg 5880
gagtaggtct cggtgtcgga agcgacgcca gatccgtcat cctcctttcg ctctccaaag 5940
tagatacctc cgacgagctc tcggacaatg atgaagtcgg tgccctcaac gtttcggatg 6000
ggggagagat cggcgagctt gggcgacagc agctggcagg gtcgcaggtt ggcgtacagg 6060
ttcaggtcct ttcgcagctt gaggagaccc tgctcgggtc gcacgtcggt tcgtccgtcg 6120
ggagtggtcc atacggtgtt ggcagcgcct ccgacagcac cgagcataat agagtcagcc 6180
tttcggcaga tgtcgagagt agcgtcggtg atgggctcgc cctccttctc aatggcagct 6240
cctccaatga gtcggtcctc aaacacaaac tcggtgccgg aggcctcagc aacagacttg 6300
agcaccttga cggcctcggc aatcacctcg gggccacaga agtcgccgcc gagaagaaca 6360
atcttcttgg agtcagtctt ggtcttctta gtttcgggtt ccattgtgga tgtgtgtggt 6420
tgtatgtgtg atgtggtgtg tggagtgaaa atctgtggct ggcaaacgct cttgtatata 6480
tacgcacttt tgcccgtgct atgtggaaga ctaaacctcc gaagattgtg actcaggtag 6540
tgcggtatcg gctagggacc caaaccttgt cgatgccgat agcgctatcg aacgtacccc 6600
agccggccgg gagtatgtcg gaggggacat acgagatcgt caagggtttg tggccaactg 6660
gtatttaaat gtagctaacg gtagcaggcg aactactggt acatacctcc cccggaatat 6720
gtacaggcat aatgcgtatc tgtgggacat gtggtcgttg cgccattatg taagcagcgt 6780
gtactcctct gactgtccat atggtttgct ccatctcacc ctcatcgttt tcattgttca 6840
caggcggcca caaaaaaact gtcttctctc cttctctctt cgccttagtc tactcggacc 6900
agttttagtt tagcttggcg ccactggata aatgagacct caggccttgt gatgaggagg 6960
tcacttatga agcatgttag gaggtgcttg tatggataga gaagcaccca aaataataag 7020
aataataata aaacaggggg cgttgtcatt tcatatcgtg ttttcaccat caatacacct 7080
ccaaacaatg cccttcatgt ggccagcccc aatattgtcc tgtagttcaa ctctatgcag 7140
ctcgtatctt attgagcaag taaaactctg tcagccgata ttgcccgacc cgcgacaagg 7200
gtcaacaagg tggtgtaagg ccttcgcaga agtcaaaact gtgccaaaca aacatctaga 7260
gtctctttgg tgtttctcgc atatatttwa tcggctgtct tacgtatttg cgcctcggta 7320
ccggactaat ttcggatcat ccccaatacg ctttttcttc gcagctgtca acagtgtcca 7380
tgatctatcc acctaaatgg gtcatatgag gcgtataatt tcgtggtgct gataataatt 7440
cccatatatt tgacacaaaa cttccccccc tagacataca tctcacaatc tcacttcttg 7500
tgcttctgtc acacatctcc tccagctgac ttcaactcac acctctgccc cagttggtct 7560
acagcggtat aaggtttctc cgcatagagg tgcaccactc ctcccgatac ttgtttgtgt 7620
gacttgtggg tcacgacata tatatctaca cacattgcgc caccctttgg ttcttccagc 7680
acaacaaaaa cacgacacgc taaccatggc caatttactg accgtacacc aaaatttgcc 7740
tgcattaccg gtcgatgcaa cgagtgatga ggttcgcaag aacctgatgg acatgttcag 7800
ggatcgccag gcgttttctg agcatacctg gaaaatgctt ctgtccgttt gccggtcgtg 7860
ggcggcatgg tgcaagttga ataaccggaa atggtttccc gcagaacctg aagatgttcg 7920
cgattatctt ctatatcttc aggcgcgcgg tctggcagta aaaactatcc agcaacattt 7980
gggccagcta aacatgcttc atcgtcggtc cgggctgcca cgaccaagtg acagcaatgc 8040
tgtttcactg gttatgcggc ggatccgaaa agaaaacgtt gatgccggtg aacgtgcaaa 8100
acaggctcta gcgttcgaac gcactgattt cgaccaggtt cgttcactca tggaaaatag 8160
cgatcgctgc caggatatac gtaatctggc atttctgggg attgcttata acaccctgtt 8220
acgtatagcc gaaattgcca ggatcagggt taaagatatc tcacgtactg acggtgggag 8280
aatgttaatc catattggca gaacgaaaac gctggttagc accgcaggtg tagagaaggc 8340
acttagcctg ggggtaacta aactggtcga gcgatggatt tccgtctctg gtgtagctga 8400
tgatccgaat aactacctgt tttgccgggt cagaaaaaat ggtgttgccg cgccatctgc 8460
caccagccag ctatcaactc gcgccctgga agggattttt gaagcaactc atcgattgat 8520
ttacggcgct aaggatgact ctggtcagag atacctggcc tggtctggac acagtgcccg 8580
tgtcggagcc gcgcgagata tggcccgcgc tggagtttca ataccggaga tcatgcaagc 8640
tggtggctgg accaatgtaa atattgtcat gaactatatc cgtaacctgg atagtgaaac 8700
aggggcaatg gtgcgcctgc tggaagatgg cgattaagc 8739
<210>28
<211>15304
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pK02UF8289
<220>
<221>misc_feature
<222>(5601)..(5601)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(5606)..(5609)
<223>n is a, c, g, or t
<400>28
cgatcgagga agaggacaag cggctgcttc ttaagtttgt gacatcagta tccaaggcac 60
cattgcaagg attcaaggct ttgaacccgt catttgccat tcgtaacgct ggtagacagg 120
ttgatcggtt ccctacggcc tccacctgtg tcaatcttct caagctgcct gactatcagg 180
acattgatca acttcggaag aaacttttgt atgccattcg atcacatgct ggtttcgatt 240
tgtcttagag gaacgcatat acagtaatca tagagaataa acgatattca tttattaaag 300
tagatagttg aggtagaagt tgtaaagagt gataaatagc ggccgctcac tgaatctttt 360
tggctccctt gtgcttcctg acgatatacg tttgcacata gaaattcaag aacaaacaca 420
agactgtgcc aacataaaag taattgaaga accagccaaa catcctcatc ccatcttggc 480
gataacaggg aatgttcctg tacttccaga caatgtagaa accaacattg aattgaatga 540
tctgcattga tgtaatcagg gattttggca tggggaactt cagcttgatc aatctggtcc 600
aataataacc gtacatgatc cagtggatga aaccattcaa cagcacaaaa atccaaacag 660
cttcatttcg gtaattatag aacagccaca tatccatcgg tgcccccaaa tgatggaaga 720
attgcaacca ggtcagaggc ttgcccatca gtggcaaata gaaggagtca atatactcca 780
ggaacttgct caaatagaac aactgcgtgg tgatcctgaa gacgttgttg tcaaaagcct 840
tctcgcagtt gtcagacata acaccgatgg tgtacatggc atatgccatt gagaggaatg 900
atcccaacga ataaatggac atgagaaggt tgtaattggt gaaaacaaac ttcatacgag 960
actgaccttt tggaccaagg gggccaagag tgaacttcaa gatgacaaat gcgatggaca 1020
agtaaagcac ctcacagtga ctggcatcac tccagagttg ggcataatca actggttggg 1080
taaaacttcc tgcccaattg agactatttc attcaccacc tccatggtta gcgtgtcgtg 1140
tttttgttgt gctggaagaa ccaaagggtg gcgcaatgtg tgtagatata tatgtcgtga 1200
cccacaagtc acacaaacaa gtatcgggag gagtggtgca cctctatgcg gagaaacctt 1260
ataccgctgt agaccaactg gggcagaggt gtgagttgaa gtcagctgga ggagatgtgt 1320
gacagaagca caagaagtga gattgtgaga tgtatgtcta gggggggaag ttttgtgtca 1380
aatatatggg aattattatc agcaccacga aattatacgc ctcatatgac ccatttaggt 1440
ggatagatca tggacactgt tgacagctgc gaagaaaaag cgtattgggg atgatccgaa 1500
attagtccgg taccgaggcg caaatacgta agacagccga twaaatatat gcgagaaaca 1560
ccaaagagac tctagatgtt tgtttggcac agttttgact tctgcgaagg ccttacacca 1620
ccttgttgac ccttgtcgcg ggtcgggcaa tatcggctga cagagtttta cttgctcaat 1680
aagatacgag ctgcatagag ttgaactaca ggacaatatt ggggctggcc acatgaaggg 1740
cattgtttgg aggtgtattg atggtgaaaa cacgatatga aatgacaacg ccccctgttt 1800
tattattatt cttattattt tgggtgcttc tctatccata caagcacctc ctaacatgct 1860
tcataagtga cctcctcatc acaaggcctg aggtctcatt tatccagtgg cgccaagcta 1920
aactaaaact ggtccgagta gactaaggcg aagagagaag gagagaagac agtttttttg 1980
tggccgcctg tgaacaatga aaacgatgag ggtgagatgg agcaaaccat atggtttaaa 2040
cagtcagagg agtacacgct gcttacataa tggcgcaacg accacatgtc ccacagatac 2100
gcatcgattc gattcaaatt aattaaaagg cgttgaaaca gaatgagcca gacagcaagg 2160
acaaggtggc caacagcaag gagtccaaaa agccctctat tgacgagatc cacgatgtta 2220
ttgctcatga ggtttccgag ctcgatgctg ggaagaagaa gtgatttgta tataagaaat 2280
aaatgagata tagtaaagga gtgcaagaga atggcaaggt ggtcaaattc tatattactt 2340
gcagtcactg gttcctcgtt gacatgaatg aagttaccgt tggcatagct gatttaatat 2400
ataactgtcc aactaactct cacctagata taacccatgt gtgtgtttcc aatcatcaat 2460
gcggccgctt actgagcctt ggcaccgggc tgcttctcgg ccattcgagc gaactgggac 2520
aggtatcgga gcaggatgac gagaccttca tggggcagag ggtttcggta ggggaggttg 2580
tgcttctggc acagctgttc cacctggtag gaaacggcag tgaggttgtg tcgaggcagg 2640
gtgggccaga gatggtgctc gatctggtag ttcaggcctc caaagaacca gtcagtaatg 2700
atgcctcgtc gaatgttcat ggtctcatgg atctgaccca cagagaagcc atgtccgtcc 2760
cagacggaat caccgatctt ctccagaggg tagtggttca tgaagaccac gatggcaatt 2820
ccgaagccac cgacgagctc ggaaacaaag aacaccagca tcgaggtcag gatggagggc 2880
ataaagaaga ggtggaacag ggtcttgaga gtccagtgca gagcgagtcc aatggcctct 2940
ttcttgtact gagatcggta gaactggttg tctcggtcct tgagggatcg aacggtcagc 3000
acagactgga aacaccagat gaatcgcagg agaatacaga tgaccaggaa atagtactgt 3060
tggaactgaa tgagctttcg ggagatggga gaagctcgag tgacatcgtc ctcggaccag 3120
gcgagcagag gcaggttatc aatgtcggga tcgtgaccct gaacgttggt agcagaatga 3180
tgggcgttgt gtctgtcctt ccaccaggtc acggagaagc cctggagtcc gttgccaaag 3240
accagaccca ggacgttatt ccagtttcgg ttcttgaagg tctggtggtg gcagatgtca 3300
tgagacagcc atcccatttg ctggtagtgc ataccgagca cgagagcacc aatgaagtac 3360
aggtggtact ggaccagcat gaagaaggca agcacgccaa gacccagggt ggtcaagatc 3420
ttgtacgagt accagagggg agaggcgtca aacatgccag tggcgatcag ctcttctcgg 3480
agctttcgga aatcctcctg agcttcgttg acggcagcct ggggaggcag ctcggaagcc 3540
tggttgatct tgggcattcg cttgagcttg tcgaaggctt cctgagagtg cataaccatg 3600
aaggcgtcag tagcatctcg tccctggtag ttctcaatga tttcagctcc accagggtgg 3660
aagttcaccc aagcggagac gtcgtacacc tttccgtcga tgacgagggg cagagcctgt 3720
cgagaagcct tcaccatggc cattgctgta gatatgtctt gtgtgtaagg gggttggggt 3780
ggttgtttgt gttcttgact tttgtgttag caagggaaga cgggcaaaaa agtgagtgtg 3840
gttgggaggg agagacgagc cttatatata atgcttgttt gtgtttgtgc aagtggacgc 3900
cgaaacgggc aggagccaaa ctaaacaagg cagacaatgc gagcttaatt ggattgcctg 3960
atgggcaggg gttagggctc gatcaatggg ggtgcgaagt gacaaaattg ggaattaggt 4020
tcgcaagcaa ggctgacaag actttggccc aaacatttgt acgcggtgga caacaggagc 4080
cacccatcgt ctgtcacggg ctagccggtc gtgcgtcctg tcaggctcca cctaggctcc 4140
atgccactcc atacaatccc actagtgtac cgctaggccg cttttagctc ccatctaaga 4200
cccccccaaa acctccactg tacagtgcac tgtactgtgt ggcgatcaag ggcaagggaa 4260
aaaaggcgca aacatgcacg catggaatga cgtaggtaag gcgttactag actgaaaagt 4320
ggcacatttc ggcgtgccaa agggtcctag gtgcgtttcg cgagctgggc gccaggccaa 4380
gccgctccaa aacgcctctc cgactccctc cagcggcctc catatcccca tccctctcca 4440
cagcaatgtt gttaagcctt gcaaacgaaa aaatagaaag gctaataagc ttccaatatt 4500
gtggtgtacg ctgcataacg caacaatgag cgccaaacaa cacacacaca cagcacacag 4560
cagcattaac cacgatgttt aaacagtgta cgcagatccc gtcaacagtt ttatatatcg 4620
tagttacaac catcaacact ttttggtaag tgtaccattc tatactccaa ctggtctgca 4680
actgtacaag tagacatgtt aatggtagtt aataacatct acagcagaac ctatggtaaa 4740
gacattgcat ttttacagga agtatcgtcc tacacgttga taaatccaaa gatgcggaac 4800
ttcttccact tttatcatca tcccctactc gtacactcgt actctttgtt cgatcgcgat 4860
tcatttctat aaataatctt gtatgtacat gcggccgcgc ctacttaagc aacgggcttg 4920
ataacagcgg ggggggtgcc cacgttgttg cggttgcgga agaacagaac acccttacca 4980
gcaccctcgg caccagcgct gggctcaacc cactggcaca tacgcgcact gcggtacatg 5040
gcgcggatga agccacgagg accatcctgg acatcagccc ggtagtgctt gcccatgatg 5100
ggcttaatgg cctcggtggc ctcgtccgcg ttgtagaagg ggatgctgct gacgtagtgg 5160
tggaggacat gagtctcgat gatgccgtgg agaaggtggc ggccgatgaa gcccatctca 5220
cggtcaatgg tagcagcggc accacggacg aagttccact cgtcgttggt gtagtgggga 5280
agggtagggt cggtgtgctg gaggaaggtg atggcaacga gccagtggtt aacccagagg 5340
tagggaacaa agtaccagat ggccatgttg tagaaaccga acttctgaac gaggaagtac 5400
agagcagtgg ccatcagacc gataccaata tcgctgagga cgatgagctt agcgtcactg 5460
ttctcgtaca gagggctgcg gggatcgaag tggttaacac caccgccgag gccgttatgc 5520
ttgcccttgc cgcgaccctc acgctggcgc tcgtggtagt tgtggccggt aacattggtg 5580
atgaggtagt tgggccagcc nacgannnnc tcagtaagat gagcgagctc gtgggtcatc 5640
tttccgagac gagtagcctg ctgctcgcgg gttcggggaa cgaagaccat gtcacgctcc 5700
atgttgccag tggccttgtg gtgctttcgg tgggagattt gccagctgaa gtaggggaca 5760
aggagggaag agtgaagaac ccagccagta atgtcgttga tgatgcgaga atcggagaaa 5820
gcaccgtgac cgcactcatg ggcaataacc cagagaccag taccgaaaag accctgaaga 5880
acggtgtaca cggcccacag accagcgcgg gcgggggtgg aggggatata ttcgggggtc 5940
acaaagttgt accagatgct gaaagtggta gtcaggagga caatgtcgcg gaggatataa 6000
ccgtatccct tgagagcgga gcgcttgaag cagtgcttag ggatggcatt gtagatgtcc 6060
ttgatggtaa agtcgggaac ctcgaactgg ttgccgtagg tgtcgagcat gacaccatac 6120
tcggacttgg gcttggcgat atcaacctcg gacatggacg agagcgatgt ggaagaggcc 6180
gagtggcggg gagagtctga aggagagacg gcggcagact cagaatccgt cacagtagtt 6240
gaggtgacgg tgcgtctaag cgcagggttc tgcttgggca gagccgaagt ggacgccatg 6300
gttgtgaatt agggtggtga gaatggttgg ttgtagggaa gaatcaaagg ccggtctcgg 6360
gatccgtggg tatatatata tatatatata tatacgatcc ttcgttacct ccctgttctc 6420
aaaactgtgg tttttcgttt ttcgtttttt gctttttttg atttttttag ggccaactaa 6480
gcttccagat ttcgctaatc acctttgtac taattacaag aaaggaagaa gctgattaga 6540
gttgggcttt ttatgcaact gtgctactcc ttatctctga tatgaaagtg tagacccaat 6600
cacatcatgt catttagagt tggtaatact gggaggatag ataaggcacg aaaacgagcc 6660
atagcagaca tgctgggtgt agccaagcag aagaaagtag atgggagcca attgacgagc 6720
gagggagcta cgccaatccg acatacgaca cgctgagatc gtcttggccg gggggtacct 6780
acagatgtcc aagggtaagt gcttgactgt aattgtatgt ctgaggacaa atatgtagtc 6840
agccgtataa agtcatacca ggcaccagtg ccatcatcga accactaact ctctatgata 6900
catgcctccg gtattattgt accatgcgtc gctttgttac atacgtatct tgcctttttc 6960
tctcagaaac tccagacttt ggctattggt cgagataagc ccggaccata gtgagtcttt 7020
cacactctac atttctccct tgctccaact atttaaattg ccccggagaa gacggccagg 7080
ccgcctagat gacaaattca acaactcaca gctgactttc tgccattgcc actagggggg 7140
ggccttttta tatggccaag ccaagctctc cacgtcggtt gggctgcacc caacaataaa 7200
tgggtagggt tgcaccaaca aagggatggg atggggggta gaagatacga ggataacggg 7260
gctcaatggc acaaataaga acgaatactg ccattaagac tcgtgatcca gcgactgaca 7320
ccattgcatc atctaagggc ctcaaaacta cctcggaact gctgcgctga tctggacacc 7380
acagaggttc cgagcacttt aggttgcacc aaatgtccca ccaggtgcag gcagaaaacg 7440
ctggaacagc gtgtacagtt tgtcttaaca aaaagtgagg gcgctgaggt cgagcagggt 7500
ggtgtgactt gttatagcct ttagagctgc gaaagcgcgt atggatttgg ctcatcaggc 7560
cagattgagg gtctgtggac acatgtcatg ttagtgtact tcaatcgccc cctggatata 7620
gccccgacaa taggccgtgg cctcattttt ttgccttccg cacatttcca ttgctcggta 7680
cccacacctt gcttctcctg cacttgccaa ccttaatact ggtttacatt gaccaacatc 7740
ttacaagcgg ggggcttgtc tagggtatat ataaacagtg gctctcccaa tcggttgcca 7800
gtctcttttt tcctttcttt ccccacagat tcgaaatcta aactacacat cacagaattc 7860
cgagccgtga gtatccacga caagatcagt gtcgagacga cgcgttttgt gtaatgacac 7920
aatccgaaag tcgctagcaa cacacactct ctacacaaac taacccagct ctggtaccat 7980
ggtgaaggct tctcgacagg ctctgcccct cgtcatcgac ggaaaggtgt acgacgtctc 8040
cgcttgggtg aacttccacc ctggtggagc tgaaatcatt gagaactacc agggacgaga 8100
tgctactgac gccttcatgg ttatgcactc tcaggaagcc ttcgacaagc tcaagcgaat 8160
gcccaagatc aaccaggctt ccgagctgcc tccccaggct gccgtcaacg aagctcagga 8220
ggatttccga aagctccgag aagagctgat cgccactggc atgtttgacg cctctcccct 8280
ctggtactcg tacaagatct tgaccaccct gggtcttggc gtgcttgcct tcttcatgct 8340
ggtccagtac cacctgtact tcattggtgc tctcgtgctc ggtatgcact accagcaaat 8400
gggatggctg tctcatgaca tctgccacca ccagaccttc aagaaccgaa actggaataa 8460
cgtcctgggt ctggtctttg gcaacggact ccagggcttc tccgtgacct ggtggaagga 8520
cagacacaac gcccatcatt ctgctaccaa cgttcagggt cacgatcccg acattgataa 8580
cctgcctctg ctcgcctggt ccgaggacga tgtcactcga gcttctccca tctcccgaaa 8640
gctcattcag ttccaacagt actatttcct ggtcatctgt attctcctgc gattcatctg 8700
gtgtttccag tctgtgctga ccgttcgatc cctcaaggac cgagacaacc agttctaccg 8760
atctcagtac aagaaagagg ccattggact cgctctgcac tggactctca agaccctgtt 8820
ccacctcttc tttatgccct ccatcctgac ctcgatgctg gtgttctttg tttccgagct 8880
cgtcggtggc ttcggaattg ccatcgtggt cttcatgaac cactaccctc tggagaagat 8940
cggtgattcc gtctgggacg gacatggctt ctctgtgggt cagatccatg agaccatgaa 9000
cattcgacga ggcatcatta ctgactggtt ctttggaggc ctgaactacc agatcgagca 9060
ccatctctgg cccaccctgc ctcgacacaa cctcactgcc gtttcctacc aggtggaaca 9120
gctgtgccag aagcacaacc tcccctaccg aaaccctctg ccccatgaag gtctcgtcat 9180
cctgctccga tacctgtccc agttcgctcg aatggccgag aagcagcccg gtgccaaggc 9240
tcagtaagcg gccgcaagtg tggatgggga agtgagtgcc cggttctgtg tgcacaattg 9300
gcaatccaag atggatggat tcaacacagg gatatagcga gctacgtggt ggtgcgagga 9360
tatagcaacg gatatttatg tttgacactt gagaatgtac gatacaagca ctgtccaagt 9420
acaatactaa acatactgta catactcata ctcgtacccg ggcaacggtt tcacttgagt 9480
gcagtggcta gtgctcttac tcgtacagtg tgcaatactg cgtatcatag tctttgatgt 9540
atatcgtatt cattcatgtt agttgcgtac gggtgaagct tccactggtc ggcgtggtag 9600
tggggcagag tggggtcggt gtgctgcagg taggtgatgg ccacgagcca gtggttgacc 9660
cacaggtagg ggatcaggta gtagagggtg acggaagcca ggccccatcg gttgatggag 9720
tatgcgatga cggacatggt gataccaata ccgacgttag agatccagat gttgaaccag 9780
tccttcttct caaacagcgg ggcgttgggg ttgaagtggt tgacagccca tttgttgagc 9840
ttggggtact tctgtccggt aacgtaagac agcagataca gaggccatcc aaacacctgc 9900
tgggtgatga ggccgtagag ggtcatgagg ggagcgtcct cagcaagctc agaccagtca 9960
tgggcgcctc ggttctccat aaactccttt cggtccttgg gcacaaacac catatcacgg 10020
gtgaggtgac cagtggactt gtggtgcatg gagtgggtca gcttccaggc gtagtaaggg 10080
accagcatgg aggagtgcag aacccatccg gtgacgttgt tgacggtgtt agagtcggag 10140
aaagcagagt ggccacactc gtgggcaaga acccacagac cggtgccaaa cagaccctgg 10200
acaatggagt acatggccca ggccacagct cggccggaag ccgagggaat aagaggcagg 10260
tacgcgtagg ccatgtaggc aaaaacggcg ataaagaagc aggcgcgcca gctgcattaa 10320
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 10380
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 10440
gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 10500
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 10560
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 10620
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 10680
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 10740
catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 10800
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 10860
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 10920
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 10980
actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 11040
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 11100
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 11160
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 11220
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 11280
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 11340
gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 11400
atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 11460
ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 11520
cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 11580
agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 11640
cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 11700
tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 11760
agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 11820
gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 11880
gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 11940
ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 12000
tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 12060
tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 12120
gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 12180
caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 12240
atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgat 12300
gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcagga aattgtaagc 12360
gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa 12420
taggccgaaa tcggcaaaat cccttataaa tcaaaagaat agaccgagat agggttgagt 12480
gttgttccag tttggaacaa gagtccacta ttaaagaacg tggactccaa cgtcaaaggg 12540
cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac catcacccta atcaagtttt 12600
ttggggtcga ggtgccgtaa agcactaaat cggaacccta aagggagccc ccgatttaga 12660
gcttgacggg gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg 12720
ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg taaccaccac acccgccgcg 12780
cttaatgcgc cgctacaggg cgcgtccatt cgccattcag gctgcgcaac tgttgggaag 12840
ggcgatcggt gcgggcctct tcgctattac gccagctggc gaaaggggga tgtgctgcaa 12900
ggcgattaag ttgggtaacg ccagggtttt cccagtcacg acgttgtaaa acgacggcca 12960
gtgaattgta atacgactca ctatagggcg aattgggccc gacgtcgcat gcttgaatct 13020
acaagtagga gggttggagt gattaagtga aacttcttta acggctctat gccagttcta 13080
ttgatatccg aaacatcagt atgaaggtct gataagggtg acttcttccc acagattcgt 13140
atcagtacga gtacgagacc ggtacttgta acagtattga tactaaaggg aaactacaac 13200
ggttgtcagc gtaatgtgac ttcgcccatg aacgcagaca cgcagtgccg agtgcggtga 13260
tatcgcctac tcgttacgtc catggactac acaacccctc ggcttcgctt ggcttagcct 13320
cgggctcggt gctgttcagt taaaacacaa tcaaataaca tttctacttt ttagaaggca 13380
ggccgtcagg agcaactccg actccattga cgtttctaaa catctgaatg ccttccttac 13440
cttcaacaaa ctggcaggtt cgggcgacag tgtaaagaga cttgatgaag ttggtgtcgt 13500
cgtgtcggta gtgcttgccc atgaccttct tgatcttctc agtggcgatt cgggcgttgt 13560
agaagggaat tcctttacct gcaggataac ttcgtataat gtatgctata cgaagttatg 13620
atctctctct tgagcttttc cataacaagt tcttctgcct ccaggaagtc catgggtggt 13680
ttgatcatgg ttttggtgta gtggtagtgc agtggtggta ttgtgactgg ggatgtagtt 13740
gagaataagt catacacaag tcagctttct tcgagcctca tataagtata agtagttcaa 13800
cgtattagca ctgtacccag catctccgta tcgagaaaca caacaacatg ccccattgga 13860
cagatcatgc ggatacacag gttgtgcagt atcatacata ctcgatcaga caggtcgtct 13920
gaccatcata caagctgaac aagcgctcca tacttgcacg ctctctatat acacagttaa 13980
attacatatc catagtctaa cctctaacag ttaatcttct ggtaagcctc ccagccagcc 14040
ttctggtatc gcttggcctc ctcaatagga tctcggttct ggccgtacag acctcggccg 14100
acaattatga tatccgttcc ggtagacatg acatcctcaa cagttcggta ctgctgtccg 14160
agagcgtctc ccttgtcgtc aagacccacc ccgggggtca gaataagcca gtcctcagag 14220
tcgcccttag gtcggttctg ggcaatgaag ccaaccacaa actcggggtc ggatcgggca 14280
agctcaatgg tctgcttgga gtactcgcca gtggccagag agcccttgca agacagctcg 14340
gccagcatga gcagacctct ggccagcttc tcgttgggag aggggactag gaactccttg 14400
tactgggagt tctcgtagtc agagacgtcc tccttcttct gttcagagac agtttcctcg 14460
gcaccagctc gcaggccagc aatgattccg gttccgggta caccgtgggc gttggtgata 14520
tcggaccact cggcgattcg gtgacaccgg tactggtgct tgacagtgtt gccaatatct 14580
gcgaactttc tgtcctcgaa caggaagaaa ccgtgcttaa gagcaagttc cttgaggggg 14640
agcacagtgc cggcgtaggt gaagtcgtca atgatgtcga tatgggtttt gatcatgcac 14700
acataaggtc cgaccttatc ggcaagctca atgagctcct tggtggtggt aacatccaga 14760
gaagcacaca ggttggtttt cttggctgcc acgagcttga gcactcgagc ggcaaaggcg 14820
gacttgtgga cgttagctcg agcttcgtag gagggcattt tggtggtgaa gaggagactg 14880
aaataaattt agtctgcaga actttttatc ggaaccttat ctggggcagt gaagtatatg 14940
ttatggtaat agttacgagt tagttgaact tatagataga ctggactata cggctatcgg 15000
tccaaattag aaagaacgtc aatggctctc tgggcgtcgc ctttgccgac aaaaatgtga 15060
tcatgatgaa agccagcaat gacgttgcag ctgatattgt tgtcggccaa ccgcgccgaa 15120
aacgcagctg tcagacccac agcctccaac gaagaatgta tcgtcaaagt gatccaagca 15180
cactcatagt tggagtcgta ctccaaaggc ggcaatgacg agtcagacag atactcgtcg 15240
acgcgataac ttcgtataat gtatgctata cgaagttatc gtacgatagt tagtagacaa 15300
caat 15304
<210>29
<211>1272
<212>DNA
<213>Artificial Sequence
<220>
<223>Mutant EgD8S-23
<220>
<221>misc_feature
<222>(2)..(1270)
<223>Mutant EgD8S-23 Δ-8 desaturase CDS
<400>29
catggtgaag gcttctcgac aggctctgcc cctcgtcatc gacggaaagg tgtacgacgt 60
ctccgcttgg gtgaacttcc accctggtgg agctgaaatc attgagaact accagggacg 120
agatgctact gacgccttca tggttatgca ctctcaggaa gccttcgaca agctcaagcg 180
aatgcccaag atcaaccagg cttccgagct gcctccccag gctgccgtca acgaagctca 240
ggaggatttc cgaaagctcc gagaagagct gatcgccact ggcatgtttg acgcctctcc 300
cctctggtac tcgtacaaga tcttgaccac cctgggtctt ggcgtgcttg ccttcttcat 360
gctggtccag taccacctgt acttcattgg tgctctcgtg ctcggtatgc actaccagca 420
aatgggatgg ctgtctcatg acatctgcca ccaccagacc ttcaagaacc gaaactggaa 480
taacgtcctg ggtctggtct ttggcaacgg actccagggc ttctccgtga cctggtggaa 540
ggacagacac aacgcccatc attctgctac caacgttcag ggtcacgatc ccgacattga 600
taacctgcct ctgctcgcct ggtccgagga cgatgtcact cgagcttctc ccatctcccg 660
aaagctcatt cagttccaac agtactattt cctggtcatc tgtattctcc tgcgattcat 720
ctggtgtttc cagtctgtgc tgaccgttcg atccctcaag gaccgagaca accagttcta 780
ccgatctcag tacaagaaag aggccattgg actcgctctg cactggactc tcaagaccct 840
gttccacctc ttctttatgc cctccatcct gacctcgatg ctggtgttct ttgtttccga 900
gctcgtcggt ggcttcggaa ttgccatcgt ggtcttcatg aaccactacc ctctggagaa 960
gatcggtgat tccgtctggg acggacatgg cttctctgtg ggtcagatcc atgagaccat 1020
gaacattcga cgaggcatca ttactgactg gttctttgga ggcctgaact accagatcga 1080
gcaccatctc tggcccaccc tgcctcgaca caacctcact gccgtttcct accaggtgga 1140
acagctgtgc cagaagcaca acctccccta ccgaaaccct ctgccccatg aaggtctcgt 1200
catcctgctc cgatacctgt cccagttcgc tcgaatggcc gagaagcagc ccggtgccaa 1260
ggctcagtaa gc 1272
<210>30
<211>422
<212>PRT
<213>Artificial Sequence
<220>
<223>Mutant EgD8S-23, comprising mutation sites M1, M2, M3, M8, M12, M15, M16, M18, M21, M26, M45,
M46, M68 and M70
<400>30
Met Val Lys Ala Ser Arg Gln Ala Leu Pro Leu Val Ile Asp Gly Lys
1 5 10 15
Val Tyr Asp Val Ser Ala Trp Val Asn Phe His Pro Gly Gly Ala Glu
20 25 30
Ile Ile Glu Asn Tyr Gln Gly Arg Asp Ala Thr Asp Ala Phe Met Val
35 40 45
Met His Ser Gln Glu Ala Phe Asp Lys Leu Lys Arg Met Pro Lys Ile
50 55 60
Asn Gln Ala Ser Glu Leu Pro Pro Gln Ala Ala Val Asn Glu Ala Gln
65 70 75 80
Glu Asp Phe Arg Lys Leu Arg Glu Glu Leu Ile Ala Thr Gly Met Phe
85 90 95
Asp Ala Ser Pro Leu Trp Tyr Ser Tyr Lys Ile Leu Thr Thr Leu Gly
100 105 110
Leu Gly Val Leu Ala Phe Phe Met Leu Val Gln Tyr His Leu Tyr Phe
115 120 125
Ile Gly Ala Leu Val Leu Gly Met His Tyr Gln Gln Met Gly Trp Leu
130 135 140
Ser His Asp Ile Cys His His Gln Thr Phe Lys Asn Arg Asn Trp Asn
145 150 155 160
Asn Val Leu Gly Leu Val Phe Gly Asn Gly Leu Gln Gly Phe Ser Val
165 170 175
Thr Trp Trp Lys Asp Arg His Asn Ala His His Ser Ala Thr Asn Val
180 185 190
Gln Gly His Asp Pro Asp Ile Asp Asn Leu Pro Leu Leu Ala Trp Ser
195 200 205
Glu Asp Asp Val Thr Arg Ala Ser Pro Ile Ser Arg Lys Leu Ile Gln
210 215 220
Phe Gln Gln Tyr Tyr Phe Leu Val Ile Cys Ile Leu Leu Arg Phe Ile
225 230 235 240
Trp Cys Phe Gln Ser Val Leu Thr Val Arg Ser Leu Lys Asp Arg Asp
245 250 255
Asn Gln Phe Tyr Arg Ser Gln Tyr Lys Lys Glu Ala Ile Gly Leu Ala
260 265 270
Leu His Trp Thr Leu Lys Thr Leu Phe His Leu Phe Phe Met Pro Ser
275 280 285
Ile Leu Thr Ser Met Leu Val Phe Phe Val Ser Glu Leu Val Gly Gly
290 295 300
Phe Gly Ile Ala Ile Val Val Phe Met Asn His Tyr Pro Leu Glu Lys
305 310 315 320
Ile Gly Asp Ser Val Trp Asp Gly His Gly Phe Ser Val Gly Gln Ile
325 330 335
His Glu Thr Met Asn Ile Arg Arg Gly Ile Ile Thr Asp Trp Phe Phe
340 345 350
Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu Trp Pro Thr Leu Pro
355 360 365
Arg His Asn Leu Thr Ala Val Ser Tyr Gln Val Glu Gln Leu Cys Gln
370 375 380
Lys His Asn Leu Pro Tyr Arg Asn Pro Leu Pro His Glu Gly Leu Val
385 390 395 400
Ile Leu Leu Arg Tyr Leu Ser Gln Phe Ala Arg Met Ala Glu Lys Gln
405 410 415
Pro Gly Ala Lys Ala Gln
420
<210>31
<211>777
<212>DNA
<213>Euglena gracilis
<220>
<221>misc_feature
<223>Δ-9 elongase
<400>31
atggaggtgg tgaatgaaat agtctcaatt gggcaggaag ttttacccaa agttgattat 60
gcccaactct ggagtgatgc cagtcactgt gaggtgcttt acttgtccat cgcatttgtc 120
atcttgaagt tcactcttgg cccccttggt ccaaaaggtc agtctcgtat gaagtttgtt 180
ttcaccaatt acaaccttct catgtccatt tattcgttgg gatcattcct ctcaatggca 240
tatgccatgt acaccatcgg tgttatgtct gacaactgcg agaaggcttt tgacaacaac 300
gtcttcagga tcaccacgca gttgttctat ttgagcaagt tcctggagta tattgactcc 360
ttctatttgc cactgatggg caagcctctg acctggttgc aattcttcca tcatttgggg 420
gcaccgatgg atatgtggct gttctataat taccgaaatg aagctgtttg gatttttgtg 480
ctgttgaatg gtttcatcca ctggatcatg tacggttatt attggaccag attgatcaag 540
ctgaagttcc ccatgccaaa atccctgatt acatcaatgc agatcattca attcaatgtt 600
ggtttctaca ttgtctggaa gtacaggaac attccctgtt atcgccaaga tgggatgagg 660
atgtttggct ggttcttcaa ttacttttat gttggcacag tcttgtgttt gttcttgaat 720
ttctatgtgc aaacgtatat cgtcaggaag cacaagggag ccaaaaagat tcagtga 777
<210>32
<211>13707
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pZKSL-555R
<400>32
aaacagtgta cgcagatctg cccatgatgg gggctcccac caccagcaat cagggccctg 60
attacacacc cacctgtaat gtcatgctgt tcatcgtggt taatgctgct gtgtgctgtg 120
tgtgtgtgtt gtttggcgct cattgttgcg ttatgcagcg tacaccacaa tattggaagc 180
ttattagcct ttctattttt tcgtttgcaa ggcttaacaa cattgctgtg gagagggatg 240
gggatatgga ggccgctgga gggagtcgga gaggcgtttt ggagcggctt ggcctggcgc 300
ccagctcgcg aaacgcacct aggacccttt ggcacgccga aatgtgccac ttttcagtct 360
agtaacgcct tacctacgtc attccatgcg tgcatgtttg cgcctttttt cccttgccct 420
tgatcgccac acagtacagt gcactgtaca gtggaggttt tgggggggtc ttagatggga 480
gctaaaagcg gcctagcggt acactagtgg gattgtatgg agtggcatgg agcctaggtg 540
gagcctgaca ggacgcacga ccggctagcc cgtgacagac gatgggtggc tcctgttgtc 600
caccgcgtac aaatgtttgg gccaaagtct tgtcagcctt gcttgcgaac ctaattccca 660
attttgtcac ttcgcacccc cattgatcga gccctaaccc ctgcccatca ggcaatccaa 720
ttaagctcgc attgtctgcc ttgtttagtt tggctcctgc ccgtttcggc gtccacttgc 780
acaaacacaa acaagcatta tatataaggc tcgtctctcc ctcccaacca cactcacttt 840
tttgcccgtc ttcccttgct aacacaaaag tcaagaacac aaacaaccac cccaaccccc 900
ttacacacaa gacatatcta cagcaatggc catggctctc tcccttacta ccgagcagct 960
gctcgagcga cccgacctgg ttgccatcga cggcattctc tacgatctgg aaggtcttgc 1020
caaggtccat cccggaggcg acttgatcct cgcttctggt gcctccgatg cttctcctct 1080
gttctactcc atgcaccctt acgtcaagcc cgagaactcg aagctgcttc aacagttcgt 1140
gcgaggcaag cacgaccgaa cctccaagga cattgtctac acctacgact ctccctttgc 1200
acaggacgtc aagcgaacta tgcgagaggt catgaaaggt cggaactggt atgccacacc 1260
tggattctgg ctgcgaaccg ttggcatcat tgctgtcacc gccttttgcg agtggcactg 1320
ggctactacc ggaatggtgc tgtggggtct cttgactgga ttcatgcaca tgcagatcgg 1380
cctgtccatt cagcacgatg cctctcatgg tgccatcagc aaaaagccct gggtcaacgc 1440
tctctttgcc tacggcatcg acgtcattgg atcgtccaga tggatctggc tgcagtctca 1500
catcatgcga catcacacct acaccaatca gcatggtctc gacctggatg ccgagtccgc 1560
agaaccattc cttgtgttcc acaactaccc tgctgccaac actgctcgaa agtggtttca 1620
ccgattccag gcctggtaca tgtacctcgt gcttggagcc tacggcgttt cgctggtgta 1680
caaccctctc tacatcttcc gaatgcagca caacgacacc attcccgagt ctgtcacagc 1740
catgcgagag aacggctttc tgcgacggta ccgaaccctt gcattcgtta tgcgagcttt 1800
cttcatcttt cgaaccgcct tcttgccctg gtatctcact ggaacctccc tgctcatcac 1860
cattcctctg gtgcccactg ctaccggtgc cttcctcacc ttctttttca tcttgtctca 1920
caacttcgat ggctcggagc gaatccccga caagaactgc aaggtcaaga gctccgagaa 1980
ggacgttgaa gccgatcaga tcgactggta cagagctcag gtggagacct cttccaccta 2040
cggtggaccc attgccatgt tctttactgg cggtctcaac ttccagatcg agcatcacct 2100
ctttcctcga atgtcgtctt ggcactatcc cttcgtgcag caagctgtcc gagagtgttg 2160
cgaacgacac ggagttcggt acgtcttcta ccctaccatt gtgggcaaca tcatttccac 2220
cctcaagtac atgcacaaag tcggtgtggt tcactgtgtc aaggacgctc aggattccta 2280
agcggccgca agtgtggatg gggaagtgag tgcccggttc tgtgtgcaca attggcaatc 2340
caagatggat ggattcaaca cagggatata gcgagctacg tggtggtgcg aggatatagc 2400
aacggatatt tatgtttgac acttgagaat gtacgataca agcactgtcc aagtacaata 2460
ctaaacatac tgtacatact catactcgta cccgggcaac ggtttcactt gagtgcagtg 2520
gctagtgctc ttactcgtac agtgtgcaat actgcgtatc atagtctttg atgtatatcg 2580
tattcattca tgttagttgc gtacgctgtg ttgttgtatg tggtgaagct tgacaatgga 2640
tggtgtgtcg tatcaggctg gggaacaatt gtgcttaagt atgctgcagt tgagtaagag 2700
tcatcgctcc accaaaataa agtttgccat tagggttgga gagagagatg gtggctggaa 2760
gaattaaatg acatcaagct gaggattgtg ggtgtgcaat aacacatgtt aggggtgacc 2820
tgtggctcga aatctgataa ttattttgta actttatgat tattcttaga ttttttaata 2880
ttcctctata taacacataa gtagctgtcg tctagttgtt catagcctga ctcctgcaat 2940
agattagtgc agagtgattt tgtgcaattg agagccacgg ttgagtcaag tgactttgtg 3000
tgtgaagtca tcttacgttt caagtctcac aggttactca attggttggt tgtctgccct 3060
ttacagatat ttacagtacc tgagcgtaaa gtcgttcatc cacggaatga ctgttcctgt 3120
cacgcagtca tgatcatgga tgtggctggt caggaaccat tttggatagg agacttaggg 3180
attggactat tattgaaaaa actgagccga atatgatata gttctatttg aatgcagaac 3240
ttctgatggt caattcactt atttcaggca tatcggtcat ggtggcagct gccacgatgt 3300
tatctcgttg gaaacctcgg cgcgccagct gcattaatga atcggccaac gcgcggggag 3360
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 3420
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 3480
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 3540
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 3600
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 3660
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 3720
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 3780
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 3840
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 3900
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 3960
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 4020
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 4080
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 4140
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 4200
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 4260
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 4320
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 4380
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 4440
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 4500
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 4560
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 4620
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 4680
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 4740
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 4800
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 4860
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 4920
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 4980
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 5040
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 5100
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 5160
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 5220
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 5280
ggttccgcgc acatttcccc gaaaagtgcc acctgatgcg gtgtgaaata ccgcacagat 5340
gcgtaaggag aaaataccgc atcaggaaat tgtaagcgtt aatattttgt taaaattcgc 5400
gttaaatttt tgttaaatca gctcattttt taaccaatag gccgaaatcg gcaaaatccc 5460
ttataaatca aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag 5520
tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga 5580
tggcccacta cgtgaaccat caccctaatc aagttttttg gggtcgaggt gccgtaaagc 5640
actaaatcgg aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa 5700
cgtggcgaga aaggaaggga agaaagcgaa aggagcgggc gctagggcgc tggcaagtgt 5760
agcggtcacg ctgcgcgtaa ccaccacacc cgccgcgctt aatgcgccgc tacagggcgc 5820
gtccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 5880
ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 5940
gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta 6000
tagggcgaat tgggcccgac gtcgcatgca ttccatagcc acacctttgc ctatggcttc 6060
acaaccgaag gcaattcgag aggtcgcgct tatggaatcg actcgtataa agctgaaggg 6120
aaagggagac gttccgagcg ctcagatgca atagtcgtcc agctaatgtg gattcaaaaa 6180
caaccccaac agtaatcttg aaaatttgaa cggatcaatc tgaacactct tgctccaggt 6240
cattcttcta acgcacatcc ccagagtcta gagggagttg tgttgtgaac atcctaataa 6300
acaatgcaat ggattcggga tatcttctgt ctcgccccct actcgatgtc gagtaaaccg 6360
atcaccaact aacaatactc ctccgcgttc tgccattgac tctcaaacag acatcgctat 6420
caacggaaca gcatatttta gcttcttagg acaataaata ttgataatgc cggctctccc 6480
tcggtatatt aagcaatcca ttcatacact cattcatcag gttaatttta tatatataat 6540
ttgtctattc aaacaccgta aattactggt accatcatct cctccttttc aaatacacgt 6600
ctatttgcat taatgaaatt actcgccaat tcgcagaacg tgtttgtcga acagagcctt 6660
agctcgggtc cagacaggag cagtgtctcg ctgaggaagc tgcaggagag ttaattaact 6720
cacctgcagg attgagacta tgaatggatt cccgtgcccg tattactcta ctaatttgat 6780
cttggaacgc gaaaatacgt ttctaggact ccaaagaatc tcaactcttg tccttactaa 6840
atatactacc catagttgat ggtttacttg aacagagagg acatgttcac ttgacccaaa 6900
gtttctcgca tctcttggat atttgaacaa cggcgtccac tgaccgtcag ttatccagtc 6960
acaaaacccc cacattcata cattcccatg tacgtttaca aagttctcaa ttccatcgtg 7020
caaatcaaaa tcacatctat tcattcatca tatataaacc catcatgtct actaacactc 7080
acaactccat agaaaacatc gactcagaac acacgctcca tgcggccgct taggaatcct 7140
gtgcgtcctt cacgcagtgg acgacaccca ccttatgcat gtacttcagg gtggagatga 7200
tgttgccgac gatggtaggg tagaaaacat atcgcactcc atgtcgttcg caacactccc 7260
ggaccgcctg ctggacgaag gggtagtgcc aagacgacat ccggggaaag aggtggtgct 7320
cgatctggaa attgagaccg ccagtgaaga acatggcgat ggggccaccg tatgtggagg 7380
acgtctccac ctgcgcccga taccagtcaa tttggtcagc ctcaacgtcc ttctcagatc 7440
gcttaacctt gcagttcttg tcggggatcc gttcggagcc atcaaaattg tgggacaaaa 7500
tgaagaagaa cgtcaagaag gcaccagttg cggtgggcac cagaggaatg gtgatcagca 7560
atgaggtccc agtgaggtac cagggcaaga atgcggtccg gaagatgaag aaagctcgca 7620
tcacgaatgc aagtgtgcgg tagcgccgca gaaagccatt ttcccgcatg gccgtgacag 7680
actctgggat ggtgtcattg tgctgcatcc ggaaaatgta gagcgggttg tacaccagcg 7740
ataccccgta tgcccccagc acaaggtaca tgtaccaagc ctggaagcgg tggaaccact 7800
ttcgggcggt gtttgcggcg gggtagttgt ggaacaccag gaacggctct gccgactccg 7860
catccaggtc gaggccgtgc tggttggtgt aggtgtggtg ccgcatgatg tgcgactgca 7920
gccaaatcca ccgggacgat ccgatgacgt caatgccgta ggcgaagagg gcgttgaccc 7980
aaggcttctt gctgatggcc ccgtgggacg catcatgctg gatggataag ccgatctgca 8040
tgtgcatgaa tccagtcaac aggccccaca gcaccatccc cgtggtagcc cagtgccact 8100
cgcaaaaggc cgtcacggcg atgatcccaa cggtgcgcag ccagaagcca ggggttgcgt 8160
accagttcct ccctttcatc acctcgcgca ttgtccgctt aacgtcttgt gcgaagggag 8220
aatcatacgt gtagacaatg tccttcgagg tgcggtcatg cttccctcgg acgaactgtt 8280
gaagcaattt ggagttctcc ggtttgacgt atggatgcat tgaataaaag agaggggagg 8340
catcagaggc accagaagcg agaatcaaat ctcctcctgg atgaactttg gcaagccctt 8400
caaggtcgta gaggatgcca tcaatcgcaa ccaaatcagg gcgttctaac agctgttctg 8460
tggtaagact gagagccatg gagagctggg ttagtttgtg tagagagtgt gtgttgctag 8520
cgactttcgg attgtgtcat tacacaaaac gcgtcgtctc gacactgatc ttgtcgtgga 8580
tactcacggc tcggacatcg tcgccgacga tgacaccgga ctttcgctta aggacgtcag 8640
taacaggcat tgtgtgatgt gtagtttaga tttcgaatct gtggggaaag aaaggaaaaa 8700
agagactggc aaccgattgg gagagccact gtttatatat accctagaca agccccccgc 8760
ttgtaagatg ttggtcaatg taaaccagta ttaaggttgg caagtgcagg agaagcaagg 8820
tgtgggtacc gagcaatgga aatgtgcgga aggcaaaaaa atgaggccac ggcctattgt 8880
cggggctata tccagggggc gattgaagta cactaacatg acatgtgtcc acagaccctc 8940
aatctggcct gatgagccaa atccatacgc gctttcgcag ctctaaaggc tataacaagt 9000
cacaccaccc tgctcgacct cagcgccctc actttttgtt aagacaaact gtacacgctg 9060
ttccagcgtt ttctgcctgc acctggtggg acatttggtg caacctaaag tgctcggaac 9120
ctctgtggtg tccagatcag cgcagcagtt ccgaggtagt tttgaggccc ttagatgatg 9180
caatggtgtc agtcgctgga tcacgagtct taatggcagt attcgttctt atttgtgcca 9240
ttgagccccg ttatcctcgt atcttctacc ccccatccca tccctttgtt ggtgcaaccc 9300
tacccattta ttgttgggtg cagcccaacc gacgtggaga gcttggcttg gccatataaa 9360
aaggcccccc cctagtggca atggcagaaa gtcagctgtg agttgttgaa tttgtcatct 9420
aggcggcctg gccgtcttct ccggggcaat tggggctgtt ttttgggaca caaatacgcc 9480
gccaacccgg tctctcctga attccgtcgt cgcctgagtc gacatcattt atttaccagt 9540
tggccacaaa cccttgacga tctcgtatgt cccctccgac atactcccgg ccggctgggg 9600
tacgttcgat agcgctatcg gcatcgacaa ggtttgggtc cctagccgat accgcactac 9660
ctgagtcaca atcttcggag gtttagtctt ccacatagca cgggcaaaag tgcgtatata 9720
tacaagagcg tttgccagcc acagattttc actccacaca ccacatcaca catacaacca 9780
cacacatcca caatggaacc cgaaactaag aagaccaaga ctgactccaa gaagattgtt 9840
cttctcggcg gcgacttctg tggccccgag gtgattgccg aggccgtcaa ggtgctcaag 9900
tctgttgctg aggcctccgg caccgagttt gtgtttgagg accgactcat tggaggagct 9960
gccattgaga aggagggcga gcccatcacc gacgctactc tcgacatctg ccgaaaggct 10020
gactctatta tgctcggtgc tgtcggaggc gctgccaaca ccgtatggac cactcccgac 10080
ggacgaaccg acgtgcgacc cgagcagggt ctcctcaagc tgcgaaagga cctgaacctg 10140
tacgccaacc tgcgaccctg ccagctgctg tcgcccaagc tcgccgatct ctcccccatc 10200
cgaaacgttg agggcaccga cttcatcatt gtccgagagc tcgtcggagg tatctacttt 10260
ggagagcgaa aggaggatga cggatctggc gtcgcttccg acaccgagac ctactccgtt 10320
cctgaggttg agcgaattgc ccgaatggcc gccttcctgg cccttcagca caacccccct 10380
cttcccgtgt ggtctcttga caaggccaac gtgctggcct cctctcgact ttggcgaaag 10440
actgtcactc gagtcctcaa ggacgaactc ccccagctcg agctcaacca ccagctgatc 10500
gactcggccg ccatgatcct catcaagcag ccctccaaga tgaatggtat catcatcacc 10560
accaacatgt ttggcgatat catctccgac gaggcctccg tcatccccgg ttctctgggt 10620
ctgctgccct ccgcctctct ggcttctctg cccgacacca acgaggcgtt cggtctgtac 10680
gagccctgtc acggatctgc ccccgatctc ggcaagcaga aggtcaaccc cattgccacc 10740
attctgtctg ccgccatgat gctcaagttc tctcttaaca tgaagcccgc cggtgacgct 10800
gttgaggctg ccgtcaagga gtccgtcgag gctggtatca ctaccgccga tatcggaggc 10860
tcttcctcca cctccgaggt cggagacttg ttgccaacaa ggtcaaggag ctgctcaaga 10920
aggagtaagt cgtttctacg acgcattgat ggaaggagca aactgacgcg cctgcgggtt 10980
ggtctaccgg cagggtccgc tagtgtataa gactctataa aaagggccct gccctgctaa 11040
tgaaatgatg atttataatt taccggtgta gcaaccttga ctagaagaag cagattgggt 11100
gtgtttgtag tggaggacag tggtacgttt tggaaacagt cttcttgaaa gtgtcttgtc 11160
tacagtatat tcactcataa cctcaatagc caagggtgta gtcggtttat taaaggaagg 11220
gagttgtggc tgatgtggat atcgatagtt ggagcaaggg agaaatgtag agtgtgaaag 11280
actcactatg gtccgggctt atctcgacca atagccaaag tctggagttt ctgagagaaa 11340
aaggcaagat acgtatgtaa caaagcgacg catggtacaa taataccgga ggcatgtatc 11400
atagagagtt agtggttcga tgatggcact ggtgcctggt atgactttat acggctgact 11460
acatatttgt cctcagacat acaattacag tcaagcactt acccttggac atctgtaggt 11520
accccccggc caagacgatc tcagcgtgtc gtatgtcgga ttggcgtagc tccctcgctc 11580
gtcaattggc tcccatctac tttcttctgc ttggctacac ccagcatgtc tgctatggct 11640
cgttttcgtg ccttatctat cctcccagta ttaccaactc taaatgacat gatgtgattg 11700
ggtctacact ttcatatcag agataaggag tagcacagtt gcataaaaag cccaactcta 11760
atcagcttct tcctttcttg taattagtac aaaggtgatt agcgaaatct ggaagcttag 11820
ttggccctaa aaaaatcaaa aaaagcaaaa aacgaaaaac gaaaaaccac agttttgaga 11880
acagggaggt aacgaaggat cgtatatata tatatatata tatataccca cggatcccga 11940
gaccggcctt tgattcttcc ctacaaccaa ccattctcac caccctaatt cacaaccatg 12000
gctcccgacg ccgacaagct gcgacagcga aaggctcagt ccatccagga cactgccgat 12060
tctcaggcta ccgagctcaa gattggcacc ctgaagggtc tccaaggcac cgagatcgtc 12120
attgatggcg acatctacga catcaaagac ttcgatcacc ctggaggcga atccatcatg 12180
acctttggtg gcaacgacgt tactgccacc tacaagatga ttcatcccta ccactcgaag 12240
catcacctgg agaagatgaa aaaggtcggt cgagtgcccg actacacctc cgagtacaag 12300
ttcgatactc ccttcgaacg agagatcaaa caggaggtct tcaagattgt gcgaagaggt 12360
cgagagtttg gaacacctgg ctacttcttt cgagccttct gctacatcgg tctcttcttt 12420
tacctgcagt atctctgggt taccactcct accactttcg cccttgctat cttctacggt 12480
gtgtctcagg ccttcattgg cctgaacgtc cagcacgacg ccaaccacgg agctgcctcc 12540
aaaaagccct ggatcaacaa tttgctcggc ctgggtgccg actttatcgg aggctccaag 12600
tggctctgga tgaaccagca ctggacccat cacacttaca ccaaccatca cgagaaggat 12660
cccgacgccc tgggtgcaga gcctatgctg ctcttcaacg actatccctt gggtcacccc 12720
aagcgaaccc tcattcatca cttccaagcc ttctactatc tgtttgtcct tgctggctac 12780
tgggtgtctt cggtgttcaa ccctcagatc ctggacctcc agcaccgagg tgcccaggct 12840
gtcggcatga agatggagaa cgactacatt gccaagtctc gaaagtacgc tatcttcctg 12900
cgactcctgt acatctacac caacattgtg gctcccatcc agaaccaagg cttttcgctc 12960
accgtcgttg ctcacattct tactatgggt gtcgcctcca gcctgaccct cgctactctg 13020
ttcgccctct cccacaactt cgagaacgca gatcgggatc ccacctacga ggctcgaaag 13080
ggaggcgagc ctgtctgttg gttcaagtcg caggtggaaa cctcctctac ttacggtggc 13140
ttcatttccg gttgccttac aggcggactc aactttcagg tcgagcatca cctgtttcct 13200
cgaatgtcct ctgcctggta cccctacatc gctcctaccg ttcgagaggt ctgcaaaaag 13260
cacggcgtca agtacgccta ctatccctgg gtgtggcaga acctcatctc gaccgtcaag 13320
tacctgcatc agtccggaac tggctcgaac tggaagaacg gtgccaatcc ctactctggc 13380
aagctgtaag cggccgcatg tacatacaag attatttata gaaatgaatc gcgatcgaac 13440
aaagagtacg agtgtacgag taggggatga tgataaaagt ggaagaagtt ccgcatcttt 13500
ggatttatca acgtgtagga cgatacttcc tgtaaaaatg caatgtcttt accataggtt 13560
ctgctgtaga tgttattaac taccattaac atgtctactt gtacagttgc agaccagttg 13620
gagtatagaa tggtacactt accaaaaagt gttgatggtt gtaactacga tatataaaac 13680
tgttgacggg atctgcgtac actgttt 13707
<210>33
<211>1350
<212>DNA
<213>Euglena gracilis
<220>
<221>misc_feature
<223>Synthetic Δ-5 desaturase (codon-optimized) for Yarrowia lipolytica
<400>33
atggctctct cccttactac cgagcagctg ctcgagcgac ccgacctggt tgccatcgac 60
ggcattctct acgatctgga aggtcttgcc aaggtccatc ccggaggcga cttgatcctc 120
gcttctggtg cctccgatgc ttctcctctg ttctactcca tgcaccctta cgtcaagccc 180
gagaactcga agctgcttca acagttcgtg cgaggcaagc acgaccgaac ctccaaggac 240
attgtctaca cctacgactc tccctttgca caggacgtca agcgaactat gcgagaggtc 300
atgaaaggtc ggaactggta tgccacacct ggattctggc tgcgaaccgt tggcatcatt 360
gctgtcaccg ccttttgcga gtggcactgg gctactaccg gaatggtgct gtggggtctc 420
ttgactggat tcatgcacat gcagatcggc ctgtccattc agcacgatgc ctctcatggt 480
gccatcagca aaaagccctg ggtcaacgct ctctttgcct acggcatcga cgtcattgga 540
tcgtccagat ggatctggct gcagtctcac atcatgcgac atcacaccta caccaatcag 600
catggtctcg acctggatgc cgagtccgca gaaccattcc ttgtgttcca caactaccct 660
gctgccaaca ctgctcgaaa gtggtttcac cgattccagg cctggtacat gtacctcgtg 720
cttggagcct acggcgtttc gctggtgtac aaccctctct acatcttccg aatgcagcac 780
aacgacacca ttcccgagtc tgtcacagcc atgcgagaga acggctttct gcgacggtac 840
cgaacccttg cattcgttat gcgagctttc ttcatctttc gaaccgcctt cttgccctgg 900
tatctcactg gaacctccct gctcatcacc attcctctgg tgcccactgc taccggtgcc 960
ttcctcacct tctttttcat cttgtctcac aacttcgatg gctcggagcg aatccccgac 1020
aagaactgca aggtcaagag ctccgagaag gacgttgaag ccgatcagat cgactggtac 1080
agagctcagg tggagacctc ttccacctac ggtggaccca ttgccatgtt ctttactggc 1140
ggtctcaact tccagatcga gcatcacctc tttcctcgaa tgtcgtcttg gcactatccc 1200
ttcgtgcagc aagctgtccg agagtgttgc gaacgacacg gagttcggta cgtcttctac 1260
cctaccattg tgggcaacat catttccacc ctcaagtaca tgcacaaagt cggtgtggtt 1320
cactgtgtca aggacgctca ggattcctaa 1350
<210>34
<211>449
<212>PRT
<213>Euglena gracilis
<400>34
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu
1 5 10 15
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val
20 25 30
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser
35 40 45
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp
65 70 75 80
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr
85 90 95
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe
100 105 110
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp
115 120 125
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe
130 135 140
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly
145 150 155 160
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile
165 170 175
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met
180 185 190
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr
210 215 220
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val
225 230 235 240
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe
245 250 255
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg
260 265 270
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg
275 280 285
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly
290 295 300
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala
305 310 315 320
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu
325 330 335
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val
340 345 350
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser
355 360 365
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe
370 375 380
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro
385 390 395 400
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg
405 410 415
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys
420 425 430
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445
Ser
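As a quick consistency check on the two entries above, the following Python sketch translates the first 60 nucleotides of SEQ ID NO:33 and compares the result with the first 20 residues of SEQ ID NO:34. It is not part of the patent: the helper name `translate` and the compact standard-genetic-code table are assumptions introduced here for illustration. The sketch also checks that the declared lengths agree, since a 1350-nt coding sequence corresponds to 449 codons plus one stop codon.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (illustrative table; not taken from the patent).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter amino acids."""
    seq = "".join(dna.split()).lower()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 1-60 and 1321-1350 of SEQ ID NO:33, copied from the listing.
first_60 = "atggctctct cccttactac cgagcagctg ctcgagcgac ccgacctggt tgccatcgac"
last_30 = "cactgtgtca aggacgctca ggattcctaa"

# Residues 1-20 of SEQ ID NO:34 in one-letter code.
expected_start = "MALSLTTEQLLERPDLVAID"

print(translate(first_60) == expected_start)
print(translate(last_30))  # the final '*' marks the stop codon

# Declared lengths: <211>1350 (DNA) versus <211>449 (protein).
print(1350 == 3 * (449 + 1))
```

The same length relation holds for SEQ ID NO:35 and SEQ ID NO:36 (1392 nt for 463 residues plus a stop codon).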
<210>35
<211>1392
<212>DNA
<213>Peridinium sp. CCMP626
<220>
<221>misc_feature
<223>Synthetic Δ-5 desaturase (codon-optimized) for Yarrowia lipolytica
<400>35
atggctcccg acgccgacaa gctgcgacag cgaaaggctc agtccatcca ggacactgcc 60
gattctcagg ctaccgagct caagattggc accctgaagg gtctccaagg caccgagatc 120
gtcattgatg gcgacatcta cgacatcaaa gacttcgatc accctggagg cgaatccatc 180
atgacctttg gtggcaacga cgttactgcc acctacaaga tgattcatcc ctaccactcg 240
aagcatcacc tggagaagat gaaaaaggtc ggtcgagtgc ccgactacac ctccgagtac 300
aagttcgata ctcccttcga acgagagatc aaacaggagg tcttcaagat tgtgcgaaga 360
ggtcgagagt ttggaacacc tggctacttc tttcgagcct tctgctacat cggtctcttc 420
ttttacctgc agtatctctg ggttaccact cctaccactt tcgcccttgc tatcttctac 480
ggtgtgtctc aggccttcat tggcctgaac gtccagcacg acgccaacca cggagctgcc 540
tccaaaaagc cctggatcaa caatttgctc ggcctgggtg ccgactttat cggaggctcc 600
aagtggctct ggatgaacca gcactggacc catcacactt acaccaacca tcacgagaag 660
gatcccgacg ccctgggtgc agagcctatg ctgctcttca acgactatcc cttgggtcac 720
cccaagcgaa ccctcattca tcacttccaa gccttctact atctgtttgt ccttgctggc 780
tactgggtgt cttcggtgtt caaccctcag atcctggacc tccagcaccg aggtgcccag 840
gctgtcggca tgaagatgga gaacgactac attgccaagt ctcgaaagta cgctatcttc 900
ctgcgactcc tgtacatcta caccaacatt gtggctccca tccagaacca aggcttttcg 960
ctcaccgtcg ttgctcacat tcttactatg ggtgtcgcct ccagcctgac cctcgctact 1020
ctgttcgccc tctcccacaa cttcgagaac gcagatcggg atcccaccta cgaggctcga 1080
aagggaggcg agcctgtctg ttggttcaag tcgcaggtgg aaacctcctc tacttacggt 1140
ggcttcattt ccggttgcct tacaggcgga ctcaactttc aggtcgagca tcacctgttt 1200
cctcgaatgt cctctgcctg gtacccctac atcgctccta ccgttcgaga ggtctgcaaa 1260
aagcacggcg tcaagtacgc ctactatccc tgggtgtggc agaacctcat ctcgaccgtc 1320
aagtacctgc atcagtccgg aactggctcg aactggaaga acggtgccaa tccctactct 1380
ggcaagctgt aa 1392
<210>36
<211>463
<212>PRT
<213>Peridinium sp. CCMP626
<400>36
Met Ala Pro Asp Ala Asp Lys Leu Arg Gln Arg Lys Ala Gln Ser Ile
1 5 10 15
Gln Asp Thr Ala Asp Ser Gln Ala Thr Glu Leu Lys Ile Gly Thr Leu
20 25 30
Lys Gly Leu Gln Gly Thr Glu Ile Val Ile Asp Gly Asp Ile Tyr Asp
35 40 45
Ile Lys Asp Phe Asp His Pro Gly Gly Glu Ser Ile Met Thr Phe Gly
50 55 60
Gly Asn Asp Val Thr Ala Thr Tyr Lys Met Ile His Pro Tyr His Ser
65 70 75 80
Lys His His Leu Glu Lys Met Lys Lys Val Gly Arg Val Pro Asp Tyr
85 90 95
Thr Ser Glu Tyr Lys Phe Asp Thr Pro Phe Glu Arg Glu Ile Lys Gln
100 105 110
Glu Val Phe Lys Ile Val Arg Arg Gly Arg Glu Phe Gly Thr Pro Gly
115 120 125
Tyr Phe Phe Arg Ala Phe Cys Tyr Ile Gly Leu Phe Phe Tyr Leu Gln
130 135 140
Tyr Leu Trp Val Thr Thr Pro Thr Thr Phe Ala Leu Ala Ile Phe Tyr
145 150 155 160
Gly Val Ser Gln Ala Phe Ile Gly Leu Asn Val Gln His Asp Ala Asn
165 170 175
His Gly Ala Ala Ser Lys Lys Pro Trp Ile Asn Asn Leu Leu Gly Leu
180 185 190
Gly Ala Asp Phe Ile Gly Gly Ser Lys Trp Leu Trp Met Asn Gln His
195 200 205
Trp Thr His His Thr Tyr Thr Asn His His Glu Lys Asp Pro Asp Ala
210 215 220
Leu Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Gly His
225 230 235 240
Pro Lys Arg Thr Leu Ile His His Phe Gln Ala Phe Tyr Tyr Leu Phe
245 250 255
Val Leu Ala Gly Tyr Trp Val Ser Ser Val Phe Asn Pro Gln Ile Leu
260 265 270
Asp Leu Gln His Arg Gly Ala Gln Ala Val Gly Met Lys Met Glu Asn
275 280 285
Asp Tyr Ile Ala Lys Ser Arg Lys Tyr Ala Ile Phe Leu Arg Leu Leu
290 295 300
Tyr Ile Tyr Thr Asn Ile Val Ala Pro Ile Gln Asn Gln Gly Phe Ser
305 310 315 320
Leu Thr Val Val Ala His Ile Leu Thr Met Gly Val Ala Ser Ser Leu
325 330 335
Thr Leu Ala Thr Leu Phe Ala Leu Ser His Asn Phe Glu Asn Ala Asp
340 345 350
Arg Asp Pro Thr Tyr Glu Ala Arg Lys Gly Gly Glu Pro Val Cys Trp
355 360 365
Phe Lys Ser Gln Val Glu Thr Ser Ser Thr Tyr Gly Gly Phe Ile Ser
370 375 380
Gly Cys Leu Thr Gly Gly Leu Asn Phe Gln Val Glu His His Leu Phe
385 390 395 400
Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr Ile Ala Pro Thr Val Arg
405 410 415
Glu Val Cys Lys Lys His Gly Val Lys Tyr Ala Tyr Tyr Pro Trp Val
420 425 430
Trp Gln Asn Leu Ile Ser Thr Val Lys Tyr Leu His Gln Ser Gly Thr
435 440 445
Gly Ser Asn Trp Lys Asn Gly Ala Asn Pro Tyr Ser Gly Lys Leu
450 455 460
<210>37
<211>1350
<212>DNA
<213>Euglena gracilis
<220>
<221>misc_feature
<222>(1)..(1350)
<223>Δ-5 desaturase
<400>37
atggctctca gtcttaccac agaacagctg ttagaacgcc ctgatttggt tgcgattgat 60
ggcatcctct acgaccttga agggcttgcc aaagttcatc caggaggaga tttgattctc 120
gcttctggtg cctctgatgc ctcccctctc ttttattcaa tgcatccata cgtcaaaccg 180
gagaattcca aattgcttca acagttcgtc cgagggaagc atgaccgcac ctcgaaggac 240
attgtctaca cgtatgattc tcccttcgca caagacgtta agcggacaat gcgcgaggtg 300
atgaaaggga ggaactggta cgcaacccct ggcttctggc tgcgcaccgt tgggatcatc 360
gccgtgacgg ccttttgcga gtggcactgg gctaccacgg ggatggtgct gtggggcctg 420
ttgactggat tcatgcacat gcagatcggc ttatccatcc agcatgatgc gtcccacggg 480
gccatcagca agaagccttg ggtcaacgcc ctcttcgcct acggcattga cgtcatcgga 540
tcgtcccggt ggatttggct gcagtcgcac atcatgcggc accacaccta caccaaccag 600
cacggcctcg acctggatgc ggagtcggca gagccgttcc tggtgttcca caactacccc 660
gccgcaaaca ccgcccgaaa gtggttccac cgcttccaag cttggtacat gtaccttgtg 720
ctgggggcat acggggtatc gctggtgtac aacccgctct acattttccg gatgcagcac 780
aatgacacca tcccagagtc tgtcacggcc atgcgggaga atggctttct gcggcgctac 840
cgcacacttg cattcgtgat gcgagctttc ttcatcttcc ggaccgcatt cttgccctgg 900
tacctcactg ggacctcatt gctgatcacc attcctctgg tgcccactgc aactggtgcc 960
ttcttgacgt tcttcttcat tttgtcccac aattttgatg gctccgaacg gatccccgac 1020
aagaactgca aggttaagag ctctgagaag gacgttgagg ctgaccaaat tgactggtat 1080
cgggcgcagg tggagacgtc ctccacatac ggtggcccca tcgccatgtt cttcactggc 1140
ggtctcaatt tccagatcga gcaccacctc tttccccgga tgtcgtcttg gcactacccc 1200
ttcgtccagc aggcggtccg ggagtgttgc gaacgccatg gagtgcgata tgttttctac 1260
cctaccatcg tcggcaacat catctccacc ctgaagtaca tgcataaggt gggtgtcgtc 1320
cactgcgtga aggacgcaca ggattcctga 1350
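SEQ ID NO:37 is the native Euglena gracilis Δ-5 desaturase coding sequence, and SEQ ID NO:33 is annotated as its codon-optimized counterpart for Yarrowia lipolytica. The Python sketch below compares the first 20 codons of the two and confirms that the changed codons are synonymous in this window. It is an illustration only: the names `codons`, `native` and `optimized` and the standard-genetic-code table are assumptions introduced here, not part of the patent.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (illustrative table; not taken from the patent).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def codons(dna: str) -> list[str]:
    """Strip whitespace and split a DNA string into complete 3-nt codons."""
    seq = "".join(dna.split()).lower()
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


# Nucleotides 1-60 of each sequence, copied from the listing.
native = "atggctctca gtcttaccac agaacagctg ttagaacgcc ctgatttggt tgcgattgat"     # SEQ ID NO:37
optimized = "atggctctct cccttactac cgagcagctg ctcgagcgac ccgacctggt tgccatcgac"  # SEQ ID NO:33

pairs = list(zip(codons(native), codons(optimized)))
changed = [(i + 1, n, o) for i, (n, o) in enumerate(pairs) if n != o]
synonymous = all(CODON_TABLE[n] == CODON_TABLE[o] for n, o in pairs)

print(f"{len(changed)} of {len(pairs)} codons differ; all synonymous: {synonymous}")
for pos, n, o in changed:
    print(f"codon {pos}: {n} -> {o} ({CODON_TABLE[n]})")
```

Running the same comparison over the full 1350-nt sequences would show whether the encoded protein is unchanged throughout. Only the first 60 nucleotides are checked here.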
<210>38
<211>3806
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pPrD17S
<400>38
ggccgcatcg gatcccgggc ccgtcgactg cagaggcctg catgcaagct tggcgtaatc 60
atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac acaacatacg 120
agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac tcacattaat 180
tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg 240
aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 300
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 360
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 420
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 480
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 540
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 600
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 660
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 720
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 780
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 840
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 900
tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 960
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 1020
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 1080
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 1140
aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 1200
atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 1260
gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat 1320
acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc 1380
ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc 1440
tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag 1500
ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg 1560
ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg 1620
atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag 1680
taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt 1740
catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 1800
atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc 1860
acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc 1920
aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc 1980
ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc 2040
cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 2100
atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat 2160
ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 2220
ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 2280
tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 2340
ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 2400
gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag 2460
agtgcaccat atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag 2520
gcgccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc 2580
gctattacgc cagctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc 2640
agggttttcc cagtcacgac gttgtaaaac gacggccagt gaattcgagc tcggtacctc 2700
gcgaatgcat ctagatccat ggctaccaag cagccctacc agttccctac tctgaccgag 2760
atcaagcgat ctcttccctc cgagtgcttt gaagcctcgg tccctctgtc cttgtactac 2820
accgtgcgaa tcgtcgctat tgccgttgct ctggccttcg gactcaacta cgctcgagcc 2880
cttcccgtgg tcgagtctct gtgggcactc gacgctgccc tttgttgcgg ttacgttctg 2940
ctccaaggca ttgtcttctg gggattcttt accgtgggtc acgatgctgg acatggtgcc 3000
ttctctcgat accacctgct caactttgtc gttggcacct ttatccactc cctcattctt 3060
actcccttcg agtcgtggaa gctcacacat cgacaccatc acaagaacac cggaaacatc 3120
gaccgagacg aaatcttcta ccctcagcga aaggccgacg atcatcctct gtctcgaaac 3180
ctcgtcctgg ctctcggtgc cgcttggttt gcctaccttg tcgagggctt tcctccccga 3240
aaggtcaacc acttcaaccc cttcgaacct ctgtttgtgc gacaggtggc tgccgttgtc 3300
atttccctct ctgctcactt cgccgtcctg gcactgtccg tgtatctgag ctttcagttc 3360
ggtctcaaga caatggctct gtactactat ggacccgtct tcgtgttcgg ctccatgctc 3420
gtcattacta cctttctgca tcacaatgac gaggaaactc cttggtacgg agattccgac 3480
tggacctacg tcaagggcaa cttgtcttcc gtggaccgat cttacggtgc cttcatcgac 3540
aacctctcgc acaacattgg cacacaccag atccaccatc tgtttcccat cattcctcac 3600
tacaagctca accgagccac cgctgccttc caccaggcct ttcccgaact tgtccgaaag 3660
agcgacgagc ccattctcaa ggctttctgg agagttggtc gactttacgc caactacgga 3720
gtcgtggatc ccgacgcaaa gctgtttact ctcaaggagg ccaaagctgc ctccgaggct 3780
gccaccaaga ccaaggctac ttaagc 3806
<210>39
<211>8165
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pZUF17
<400>39
gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 60
ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 120
taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 180
tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 240
aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 300
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 360
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 420
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 480
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 540
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 600
tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 660
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 720
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 780
tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 840
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 900
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 960
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 1020
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 1080
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 1140
tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 1200
acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 1260
tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 1320
ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 1380
agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 1440
tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 1500
acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 1560
agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 1620
actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 1680
tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 1740
gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 1800
ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 1860
tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 1920
aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 1980
tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 2040
tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 2100
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 2160
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 2220
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 2280
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 2340
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 2400
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 2460
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 2520
aacgcgaatt ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca 2580
actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg 2640
gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta 2700
aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc 2760
ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct 2820
tcgcctcaag gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat 2880
taattttcgg gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat 2940
atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc 3000
gcctccaact gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag 3060
actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt 3120
acttagtatt attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa 3180
tttataatgg cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat 3240
gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca 3300
gcaacgaaaa aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag 3360
aacagctatt cacacgttac tattgagatt attattggac gagaatcaca cactcaactg 3420
tctttctctc ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct 3480
agtcatttca tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca 3540
aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc 3600
tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt 3660
tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt attttgattt 3720
aattttttgc ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta 3780
ccatactttt gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga 3840
cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg 3900
ctccctgaga tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta 3960
ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat 4020
gattcattac cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca 4080
attaatcata gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca 4140
tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg 4200
acagtaatta attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt 4260
agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc 4320
cattggacag atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag 4380
gtcgtctgac catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca 4440
cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca 4500
gccagccttc tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc 4560
tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg 4620
ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc 4680
ctcagagtcg cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga 4740
tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga 4800
cagctcggcc agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa 4860
ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt 4920
ttcctcggca ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt 4980
ggtgatatcg gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc 5040
aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt 5100
gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat 5160
catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac 5220
atccagagaa gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc 5280
aaaggcggac ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag 5340
gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa 5400
gtatatgtta tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg 5460
ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa 5520
aatgtgatca tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg 5580
cgccgaaaac gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat 5640
ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata 5700
ctcgtcgact caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc 5760
gggttggcgg cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca 5820
ggccgcctag atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg 5880
ggggcctttt tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata 5940
aatgggtagg gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg 6000
gggctcaatg gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga 6060
caccattgca tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca 6120
ccacagaggt tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa 6180
cgctggaaca gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg 6240
gtggtgtgac ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag 6300
gccagattga gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata 6360
tagccccgac aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg 6420
tacccacacc ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca 6480
tcttacaagc ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc 6540
cagtctcttt tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat 6600
gcctgttact gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc 6660
cgtgagtatc cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc 6720
gaaagtcgct agcaacacac actctctaca caaactaacc cagctctcca tggctgagga 6780
taagaccaag gtcgagttcc ctaccctgac tgagctgaag cactctatcc ctaacgcttg 6840
ctttgagtcc aacctcggac tctcgctcta ctacactgcc cgagcgatct tcaacgcatc 6900
tgcctctgct gctctgctct acgctgcccg atctactccc ttcattgccg ataacgttct 6960
gctccacgct ctggtttgcg ccacctacat ctacgtgcag ggtgtcatct tctggggttt 7020
ctttaccgtc ggtcacgact gtggtcactc tgccttctcc cgataccact ccgtcaactt 7080
catcattggc tgcatcatgc actctgccat tctgactccc ttcgagtcct ggcgagtgac 7140
ccaccgacac catcacaaga acactggcaa cattgataag gacgagatct tctaccctca 7200
tcggtccgtc aaggacctcc aggacgtgcg acaatgggtc tacaccctcg gaggtgcttg 7260
gtttgtctac ctgaaggtcg gatatgctcc tcgaaccatg tcccactttg acccctggga 7320
ccctctcctg cttcgacgag cctccgctgt catcgtgtcc ctcggagtct gggctgcctt 7380
cttcgctgcc tacgcctacc tcacatactc gctcggcttt gccgtcatgg gcctctacta 7440
ctatgctcct ctctttgtct ttgcttcgtt cctcgtcatt actaccttct tgcatcacaa 7500
cgacgaagct actccctggt acggtgactc ggagtggacc tacgtcaagg gcaacctgag 7560
ctccgtcgac cgatcgtacg gagctttcgt ggacaacctg tctcaccaca ttggcaccca 7620
ccaggtccat cacttgttcc ctatcattcc ccactacaag ctcaacgaag ccaccaagca 7680
ctttgctgcc gcttaccctc acctcgtgag acgtaacgac gagcccatca ttactgcctt 7740
cttcaagacc gctcacctct ttgtcaacta cggagctgtg cccgagactg ctcagatttt 7800
caccctcaaa gagtctgccg ctgcagccaa ggccaagagc gactaagcgg ccgcaagtgt 7860
ggatggggaa gtgagtgccc ggttctgtgt gcacaattgg caatccaaga tggatggatt 7920
caacacaggg atatagcgag ctacgtggtg gtgcgaggat atagcaacgg atatttatgt 7980
ttgacacttg agaatgtacg atacaagcac tgtccaagta caatactaaa catactgtac 8040
atactcatac tcgtacccgg gcaacggttt cacttgagtg cagtggctag tgctcttact 8100
cgtacagtgt gcaatactgc gtatcatagt ctttgatgta tatcgtattc attcatgtta 8160
gttgc 8165
<210>40
<211>8174
<212>DNA
<213>Artificial Sequence
<220>
<223>Plasmid pZUFPrD17S
<400>40
catggctacc aagcagccct accagttccc tactctgacc gagatcaagc gatctcttcc 60
ctccgagtgc tttgaagcct cggtccctct gtccttgtac tacaccgtgc gaatcgtcgc 120
tattgccgtt gctctggcct tcggactcaa ctacgctcga gcccttcccg tggtcgagtc 180
tctgtgggca ctcgacgctg ccctttgttg cggttacgtt ctgctccaag gcattgtctt 240
ctggggattc tttaccgtgg gtcacgatgc tggacatggt gccttctctc gataccacct 300
gctcaacttt gtcgttggca cctttatcca ctccctcatt cttactccct tcgagtcgtg 360
gaagctcaca catcgacacc atcacaagaa caccggaaac atcgaccgag acgaaatctt 420
ctaccctcag cgaaaggccg acgatcatcc tctgtctcga aacctcgtcc tggctctcgg 480
tgccgcttgg tttgcctacc ttgtcgaggg ctttcctccc cgaaaggtca accacttcaa 540
ccccttcgaa cctctgtttg tgcgacaggt ggctgccgtt gtcatttccc tctctgctca 600
cttcgccgtc ctggcactgt ccgtgtatct gagctttcag ttcggtctca agacaatggc 660
tctgtactac tatggacccg tcttcgtgtt cggctccatg ctcgtcatta ctacctttct 720
gcatcacaat gacgaggaaa ctccttggta cggagattcc gactggacct acgtcaaggg 780
caacttgtct tccgtggacc gatcttacgg tgccttcatc gacaacctct cgcacaacat 840
tggcacacac cagatccacc atctgtttcc catcattcct cactacaagc tcaaccgagc 900
caccgctgcc ttccaccagg cctttcccga acttgtccga aagagcgacg agcccattct 960
caaggctttc tggagagttg gtcgacttta cgccaactac ggagtcgtgg atcccgacgc 1020
aaagctgttt actctcaagg aggccaaagc tgcctccgag gctgccacca agaccaaggc 1080
tacttaagcg gccgcaagtg tggatgggga agtgagtgcc cggttctgtg tgcacaattg 1140
gcaatccaag atggatggat tcaacacagg gatatagcga gctacgtggt ggtgcgagga 1200
tatagcaacg gatatttatg tttgacactt gagaatgtac gatacaagca ctgtccaagt 1260
acaatactaa acatactgta catactcata ctcgtacccg ggcaacggtt tcacttgagt 1320
gcagtggcta gtgctcttac tcgtacagtg tgcaatactg cgtatcatag tctttgatgt 1380
atatcgtatt cattcatgtt agttgcgtac gagccggaag cataaagtgt aaagcctggg 1440
gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 1500
cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 1560
tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 1620
tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 1680
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 1740
ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 1800
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 1860
gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 1920
ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 1980
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 2040
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 2100
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 2160
tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 2220
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 2280
ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 2340
ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 2400
gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 2460
aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 2520
aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 2580
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 2640
ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 2700
cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 2760
ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 2820
ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 2880
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 2940
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 3000
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 3060
ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 3120
gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 3180
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 3240
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 3300
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 3360
aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 3420
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 3480
gcacatttcc ccgaaaagtg ccacctgacg cgccctgtag cggcgcatta agcgcggcgg 3540
gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt 3600
tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc 3660
gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg 3720
attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga 3780
cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc 3840
ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa 3900
aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa 3960
tttccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc 4020
gctattacgc cagctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc 4080
agggttttcc cagtcacgac gttgtaaaac gacggccagt gaattgtaat acgactcact 4140
atagggcgaa ttgggtaccg ggccccccct cgaggtcgat ggtgtcgata agcttgatat 4200
cgaattcatg tcacacaaac cgatcttcgc ctcaaggaaa cctaattcta catccgagag 4260
actgccgaga tccagtctac actgattaat tttcgggcca ataatttaaa aaaatcgtgt 4320
tatataatat tatatgtatt atatatatac atcatgatga tactgacagt catgtcccat 4380
tgctaaatag acagactcca tctgccgcct ccaactgatg ttctcaatat ttaaggggtc 4440
atctcgcatt gtttaataat aaacagactc catctaccgc ctccaaatga tgttctcaaa 4500
atatattgta tgaacttatt tttattactt agtattatta gacaacttac ttgctttatg 4560
aaaaacactt cctatttagg aaacaattta taatggcagt tcgttcattt aacaatttat 4620
gtagaataaa tgttataaat gcgtatggga aatcttaaat atggatagca taaatgatat 4680
ctgcattgcc taattcgaaa tcaacagcaa cgaaaaaaat cccttgtaca acataaatag 4740
tcatcgagaa atatcaacta tcaaagaaca gctattcaca cgttactatt gagattatta 4800
ttggacgaga atcacacact caactgtctt tctctcttct agaaatacag gtacaagtat 4860
gtactattct cattgttcat acttctagtc atttcatccc acatattcct tggatttctc 4920
tccaatgaat gacattctat cttgcaaatt caacaattat aataagatat accaaagtag 4980
cggtatagtg gcaatcaaaa agcttctctg gtgtgcttct cgtatttatt tttattctaa 5040
tgatccatta aaggtatata tttatttctt gttatataat ccttttgttt attacatggg 5100
ctggatacat aaaggtattt tgatttaatt ttttgcttaa attcaatccc ccctcgttca 5160
gtgtcaactg taatggtagg aaattaccat acttttgaag aagcaaaaaa aatgaaagaa 5220
aaaaaaaatc gtatttccag gttagacgtt ccgcagaatc tagaatgcgg tatgcggtac 5280
attgttcttc gaacgtaaaa gttgcgctcc ctgagatatt gtacattttt gcttttacaa 5340
gtacaagtac atcgtacaac tatgtactac tgttgatgca tccacaacag tttgttttgt 5400
ttttttttgt tttttttttt tctaatgatt cattaccgct atgtatacct acttgtactt 5460
gtagtaagcc gggttattgg cgttcaatta atcatagact tatgaatctg cacggtgtgc 5520
gctgcgagtt acttttagct tatgcatgct acttgggtgt aatattggga tctgttcgga 5580
aatcaacgga tgctcaatcg atttcgacag taattaatta agtcatacac aagtcagctt 5640
tcttcgagcc tcatataagt ataagtagtt caacgtatta gcactgtacc cagcatctcc 5700
gtatcgagaa acacaacaac atgccccatt ggacagatca tgcggataca caggttgtgc 5760
agtatcatac atactcgatc agacaggtcg tctgaccatc atacaagctg aacaagcgct 5820
ccatacttgc acgctctcta tatacacagt taaattacat atccatagtc taacctctaa 5880
cagttaatct tctggtaagc ctcccagcca gccttctggt atcgcttggc ctcctcaata 5940
ggatctcggt tctggccgta cagacctcgg ccgacaatta tgatatccgt tccggtagac 6000
atgacatcct caacagttcg gtactgctgt ccgagagcgt ctcccttgtc gtcaagaccc 6060
accccggggg tcagaataag ccagtcctca gagtcgccct taggtcggtt ctgggcaatg 6120
aagccaacca caaactcggg gtcggatcgg gcaagctcaa tggtctgctt ggagtactcg 6180
ccagtggcca gagagccctt gcaagacagc tcggccagca tgagcagacc tctggccagc 6240
ttctcgttgg gagaggggac taggaactcc ttgtactggg agttctcgta gtcagagacg 6300
tcctccttct tctgttcaga gacagtttcc tcggcaccag ctcgcaggcc agcaatgatt 6360
ccggttccgg gtacaccgtg ggcgttggtg atatcggacc actcggcgat tcggtgacac 6420
cggtactggt gcttgacagt gttgccaata tctgcgaact ttctgtcctc gaacaggaag 6480
aaaccgtgct taagagcaag ttccttgagg gggagcacag tgccggcgta ggtgaagtcg 6540
tcaatgatgt cgatatgggt tttgatcatg cacacataag gtccgacctt atcggcaagc 6600
tcaatgagct ccttggtggt ggtaacatcc agagaagcac acaggttggt tttcttggct 6660
gccacgagct tgagcactcg agcggcaaag gcggacttgt ggacgttagc tcgagcttcg 6720
taggagggca ttttggtggt gaagaggaga ctgaaataaa tttagtctgc agaacttttt 6780
atcggaacct tatctggggc agtgaagtat atgttatggt aatagttacg agttagttga 6840
acttatagat agactggact atacggctat cggtccaaat tagaaagaac gtcaatggct 6900
ctctgggcgt cgcctttgcc gacaaaaatg tgatcatgat gaaagccagc aatgacgttg 6960
cagctgatat tgttgtcggc caaccgcgcc gaaaacgcag ctgtcagacc cacagcctcc 7020
aacgaagaat gtatcgtcaa agtgatccaa gcacactcat agttggagtc gtactccaaa 7080
ggcggcaatg acgagtcaga cagatactcg tcgactcagg cgacgacgga attcctgcag 7140
cccatctgca gaattcagga gagaccgggt tggcggcgta tttgtgtccc aaaaaacagc 7200
cccaattgcc ccggagaaga cggccaggcc gcctagatga caaattcaac aactcacagc 7260
tgactttctg ccattgccac tagggggggg cctttttata tggccaagcc aagctctcca 7320
cgtcggttgg gctgcaccca acaataaatg ggtagggttg caccaacaaa gggatgggat 7380
ggggggtaga agatacgagg ataacggggc tcaatggcac aaataagaac gaatactgcc 7440
attaagactc gtgatccagc gactgacacc attgcatcat ctaagggcct caaaactacc 7500
tcggaactgc tgcgctgatc tggacaccac agaggttccg agcactttag gttgcaccaa 7560
atgtcccacc aggtgcaggc agaaaacgct ggaacagcgt gtacagtttg tcttaacaaa 7620
aagtgagggc gctgaggtcg agcagggtgg tgtgacttgt tatagccttt agagctgcga 7680
aagcgcgtat ggatttggct catcaggcca gattgagggt ctgtggacac atgtcatgtt 7740
agtgtacttc aatcgccccc tggatatagc cccgacaata ggccgtggcc tcattttttt 7800
gccttccgca catttccatt gctcggtacc cacaccttgc ttctcctgca cttgccaacc 7860
ttaatactgg tttacattga ccaacatctt acaagcgggg ggcttgtcta gggtatatat 7920
aaacagtggc tctcccaatc ggttgccagt ctcttttttc ctttctttcc ccacagattc 7980
gaaatctaaa ctacacatca cacaatgcct gttactgacg tccttaagcg aaagtccggt 8040
gtcatcgtcg gcgacgatgt ccgagccgtg agtatccacg acaagatcag tgtcgagacg 8100
acgcgttttg tgtaatgaca caatccgaaa gtcgctagca acacacactc tctacacaaa 8160
ctaacccagc tctc 8174
Claims (13)
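1. An isolated nucleic acid molecule selected from:
a) an isolated nucleotide sequence encoding a Δ17 desaturase, as set forth in SEQ ID NO:7; and
b) an isolated nucleotide sequence that is fully complementary to (a).
2. A chimeric gene comprising the isolated nucleic acid molecule of claim 1 operably linked to suitable regulatory sequences.
3. A transformed Yarrowia sp. comprising the isolated nucleic acid molecule of claim 1.
4. The transformed Yarrowia sp. of claim 3, selected from Yarrowia lipolytica ATCC #20362, Yarrowia lipolytica ATCC #8862, Yarrowia lipolytica ATCC #18944 and Yarrowia lipolytica ATCC #76982.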
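5. A method for producing eicosapentaenoic acid, the method comprising:
a) providing a host cell comprising:
i) an isolated nucleotide molecule encoding a Δ17 desaturase polypeptide as set forth in SEQ ID NO:6; and
ii) a source of arachidonic acid;
b) growing the host cell of step (a) under conditions wherein the nucleotide molecule encoding the Δ17 desaturase polypeptide is expressed and the arachidonic acid is converted to eicosapentaenoic acid; and
c) optionally recovering the eicosapentaenoic acid of step (b).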
6.一种生产二十碳四烯酸的方法,所述方法包括:
a)提供一种宿主细胞,所述宿主细胞包含:
i)编码SEQ ID NO:6所示的Δ17去饱和酶多肽的分离的核苷酸分子;和
ii)二均γ亚麻酸的来源;
b)在其中表达编码Δ17去饱和酶多肽的核苷酸分子和二均γ亚麻酸转变为二十碳四烯酸的条件下培养步骤(a)的宿主细胞;和
c)任选地回收步骤(b)的二十碳四烯酸。
7.权利要求5或6的方法,其中:
a)所述分离的核酸分子如SEQ ID NO:7的核酸序列所示;和
b)所述宿主细胞为解脂耶氏酵母。
8.权利要求5或6的方法,其中所述宿主细胞选自:藻类、细菌、卵菌和真菌。
9.权利要求5或6的方法,其中所述宿主细胞是酵母。
10.权利要求8的方法,其中所述宿主细胞为选自以下的真菌:破囊壶菌属物种(Thraustochytrium sp.)、裂殖壶菌属物种(Schizochytrium sp.)和被孢霉菌属物种(Mortierella sp.)。
11.权利要求9的方法,其中所述酵母为油脂酵母。
12.权利要求11的方法,其中所述油脂酵母选自:耶氏酵母属(Yarrowia)、假丝酵母属(Candida)、红酵母属(Rhodotorula)、红冬孢酵母属(Rhodosporidium)、隐球酵母属(Cryptococcus)、丝孢酵母属(Trichosporon)和油脂酵母属(Lipomyces)。
13.权利要求12的方法,其中所述耶氏酵母选自:解脂耶氏酵母ATCC #20362、解脂耶氏酵母ATCC #8862、解脂耶氏酵母ATCC#18944和解脂耶氏酵母ATCC #76982。
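Two operations recur in the claims above: taking the fully complementary strand of a nucleotide sequence (claim 1(b)), and the Δ17 desaturation that turns arachidonic acid into eicosapentaenoic acid (claim 5) or dihomo-γ-linolenic acid into eicosatetraenoic acid (claim 6). The sketch below models both in a deliberately simplified way. The names `reverse_complement`, `FattyAcid`, and `delta17_desaturate` are illustrative assumptions, and a fatty acid is reduced to its chain length and its double-bond positions counted from the carboxyl end; nothing here describes the enzyme's actual mechanism or the patent's experimental procedure.

```python
from dataclasses import dataclass

# Watson-Crick pairing for lowercase DNA, as used in the sequence listing.
_PAIRS = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the fully complementary strand, written 5'->3'."""
    return seq.translate(_PAIRS)[::-1]


@dataclass(frozen=True)
class FattyAcid:
    name: str
    carbons: int
    double_bonds: frozenset  # Δ positions, counted from the carboxyl end


def delta17_desaturate(substrate: FattyAcid, product_name: str) -> FattyAcid:
    """Add a double bond at position Δ17 of a C20 substrate (simplified model)."""
    if substrate.carbons != 20:
        raise ValueError("this sketch only models C20 substrates")
    if 17 in substrate.double_bonds:
        raise ValueError(f"{substrate.name} is already desaturated at Δ17")
    return FattyAcid(product_name, 20, substrate.double_bonds | {17})


# Substrates named in claims 5 and 6.
ARA = FattyAcid("ARA", 20, frozenset({5, 8, 11, 14}))   # arachidonic acid
DGLA = FattyAcid("DGLA", 20, frozenset({8, 11, 14}))    # dihomo-γ-linolenic acid

EPA = delta17_desaturate(ARA, "EPA")    # claim 5: ARA -> EPA
ETA = delta17_desaturate(DGLA, "ETA")   # claim 6: DGLA -> ETA

print(sorted(EPA.double_bonds))      # [5, 8, 11, 14, 17]
print(sorted(ETA.double_bonds))      # [8, 11, 14, 17]
print(reverse_complement("atgcct"))  # aggcat
```

In both conversions the new double bond sits three carbons from the methyl end of a 20-carbon chain, which is why the products are ω-3 fatty acids while the substrates are ω-6.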
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US79357506P | 2006-04-20 | 2006-04-20 | |
US60/793,575 | 2006-04-20 | ||
PCT/US2007/009572 WO2007123999A2 (en) | 2006-04-20 | 2007-04-19 | Δ17 desaturases and their use in making polyunsaturated fatty acids |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101646766A (zh) | 2010-02-10 |
CN101646766B (zh) | 2013-03-27 |
Family
ID=38625589
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2007800139991A Expired - Fee Related CN101646766B (zh) | 2006-04-20 | 2007-04-19 | Δ17 desaturases and their use in making polyunsaturated fatty acids |
Country Status (9)
Country | Link |
---|---|
US (1) | US7465793B2 (zh) |
EP (1) | EP2010648B1 (zh) |
JP (1) | JP2009534032A (zh) |
CN (1) | CN101646766B (zh) |
AT (1) | ATE542892T1 (zh) |
AU (1) | AU2007240741B2 (zh) |
DK (1) | DK2010648T3 (zh) |
MX (1) | MX2008013306A (zh) |
WO (1) | WO2007123999A2 (zh) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2659993C (en) * | 2006-08-24 | 2016-01-05 | Basf Plant Science Gmbh | Isolation and characterization of a novel pythium omega 3 desaturase with specificity to all omega 6 fatty acids longer than 18 carbon chains |
CA2663807C (en) | 2006-10-30 | 2018-05-01 | E.I. Du Pont De Nemours And Company | .delta.17 desaturase and its use in making polyunsaturated fatty acids |
US7709239B2 (en) * | 2006-12-07 | 2010-05-04 | E.I. Du Pont De Nemours And Company | Mutant Δ8 desaturase genes engineered by targeted mutagenesis and their use in making polyunsaturated fatty acids |
US8119860B2 (en) * | 2007-04-16 | 2012-02-21 | E. I. Du Pont De Nemours And Company | Delta-9 elongases and their use in making polyunsaturated fatty acids |
WO2010045324A1 (en) * | 2008-10-14 | 2010-04-22 | Monsanto Technology Llc | Utilization of fatty acid desaturases from hemiselmis spp. |
PL2376638T3 (pl) | 2008-12-12 | 2014-01-31 | Basf Plant Science Gmbh | Desaturases and method for producing polyunsaturated fatty acids in transgenic organisms |
WO2012027676A1 (en) * | 2010-08-26 | 2012-03-01 | E. I. Du Pont De Nemours And Company | Mutant delta-9 elongases and their use in making polyunsaturated fatty acids |
US20160208297A1 (en) | 2013-08-27 | 2016-07-21 | Kyoto University | Omega3 unsaturated fatty acid enzyme and method for producing eicosapentaenoic acid |
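WO2016104607A1 (ja) * | 2014-12-25 | 2016-06-30 | Kyoto University | Novel ω3 unsaturated fatty acid enzyme and method for producing eicosapentaenoic acid |
WO2017097218A1 (zh) * | 2015-12-09 | 2017-06-15 | Jiangnan University | ω-3 fatty acid desaturase from Phytophthora parasitica for synthesizing polyunsaturated fatty acids, vector containing the fatty acid desaturase, recombinant microorganism, and uses thereof |
CN105368851B (zh) * | 2015-12-09 | 2019-05-07 | Jiangnan University | ω-3 desaturase from Phytophthora parasitica, vector containing the desaturase, recombinant microorganism, and uses thereof |
CN106916796B (zh) * | 2015-12-28 | 2021-06-29 | Wilmar (Shanghai) Biotechnology Research & Development Center Co., Ltd. | Schizochytrium delta-12 fatty acid desaturase-related sequences and uses thereof |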
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1305532A (zh) * | 1998-06-11 | 2001-07-25 | E. I. Du Pont De Nemours And Company | Genes for desaturases to alter lipid profiles in corn |
WO2005083053A2 (de) * | 2004-02-27 | 2005-09-09 | Basf Plant Science Gmbh | Method for producing unsaturated omega-3 fatty acids in transgenic organisms |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
US6635451B2 (en) | 2001-01-25 | 2003-10-21 | Abbott Laboratories | Desaturase genes and uses thereof |
US7125672B2 (en) * | 2003-05-07 | 2006-10-24 | E. I. Du Pont De Nemours And Company | Codon-optimized genes for the production of polyunsaturated fatty acids in oleaginous yeasts |
US7238482B2 (en) | 2003-05-07 | 2007-07-03 | E. I. Du Pont De Nemours And Company | Production of polyunsaturated fatty acids in oleaginous yeasts |
EP2169052B1 (de) | 2003-08-01 | 2016-01-06 | BASF Plant Science GmbH | Method for producing polyunsaturated fatty acids in transgenic organisms |
WO2005047479A2 (en) | 2003-11-12 | 2005-05-26 | E.I. Dupont De Nemours And Company | Delta-15 desaturases suitable for altering levels of polyunsaturated fatty acids in oilseed plants and oleaginous yeast |
EP4219670A3 (de) | 2004-02-27 | 2023-08-09 | BASF Plant Science GmbH | Method for producing polyunsaturated fatty acids in transgenic plants |
US7192762B2 (en) | 2004-11-04 | 2007-03-20 | E. I. Du Pont De Nemours And Company | Mortierella alpina glycerol-3-phosphate o-acyltransferase for alteration of polyunsaturated fatty acids and oil content in oleaginous organisms |
US7189559B2 (en) | 2004-11-04 | 2007-03-13 | E. I. Du Pont De Nemours And Company | Mortierella alpina lysophosphatidic acid acyltransferase homolog for alteration of polyunsaturated fatty acids and oil content in oleaginous organisms |
US7588931B2 (en) | 2004-11-04 | 2009-09-15 | E. I. Du Pont De Nemours And Company | High arachidonic acid producing strains of Yarrowia lipolytica |
DE102005013779A1 (de) | 2005-03-22 | 2006-09-28 | Basf Plant Science Gmbh | Method for producing polyunsaturated C20 and C22 fatty acids having at least four double bonds in transgenic plants |
2007
- 2007-04-18: US application US11/787,772, publication US7465793B2 (not active: Expired - Fee Related)
- 2007-04-19: CN application CN2007800139991A, publication CN101646766B (not active: Expired - Fee Related)
- 2007-04-19: DK application DK07775772.2T, publication DK2010648T3 (active)
- 2007-04-19: WO application PCT/US2007/009572, publication WO2007123999A2 (active: Application Filing)
- 2007-04-19: AT application AT07775772T, publication ATE542892T1 (active)
- 2007-04-19: JP application JP2009506580A, publication JP2009534032A (not active: Ceased)
- 2007-04-19: EP application EP07775772A, publication EP2010648B1 (not active: Not-in-force)
- 2007-04-19: MX application MX2008013306A, publication MX2008013306A (not active: Application Discontinuation)
- 2007-04-19: AU application AU2007240741A, publication AU2007240741B2 (not active: Ceased)
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1305532A (zh) * | 1998-06-11 | 2001-07-25 | E. I. Du Pont De Nemours And Company | Genes for desaturases to alter lipid profiles in corn |
WO2005083053A2 (de) * | 2004-02-27 | 2005-09-09 | Basf Plant Science Gmbh | Method for producing unsaturated omega-3 fatty acids in transgenic organisms |
Non-Patent Citations (1)
Title |
---|
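Xian Mo, et al. Synthesis of γ-linolenic acid catalyzed by Δ6 desaturase. Chemical Journal of Chinese Universities (高等学校化学学报). 2002, Vol. 23 (No. 4), 624-627. * |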
Also Published As
Publication number | Publication date |
---|---|
US20070249026A1 (en) | 2007-10-25 |
ATE542892T1 (de) | 2012-02-15 |
EP2010648B1 (en) | 2012-01-25 |
JP2009534032A (ja) | 2009-09-24 |
WO2007123999A3 (en) | 2008-04-24 |
AU2007240741B2 (en) | 2013-01-31 |
WO2007123999A2 (en) | 2007-11-01 |
MX2008013306A (es) | 2008-10-27 |
EP2010648A2 (en) | 2009-01-07 |
US7465793B2 (en) | 2008-12-16 |
DK2010648T3 (da) | 2012-05-21 |
AU2007240741A1 (en) | 2007-11-01 |
CN101646766A (zh) | 2010-02-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2087105T3 (da) | Delta-17 desaturase and use thereof in making polyunsaturated fatty acids | |
CN101646766B (zh) | Δ17 desaturases and their use in making polyunsaturated fatty acids | |
CN101365788B (zh) | Δ-9 elongases and their use in making polyunsaturated fatty acids | |
RU2763170C2 (ru) | Production of human milk oligosaccharides in microbial hosts with modified import/export | |
CN101939434B (zh) | DGAT genes from Yarrowia lipolytica for increased seed storage lipid production and altered fatty acid profiles in soybean | |
KR102319845B1 (ko) | CRISPR-Cas system for an algal host cell | |
CN101437953B (zh) | Diacylglycerol acyltransferases for altering polyunsaturated fatty acids and oil content in oleaginous organisms | |
CA2683497C (en) | .delta.8 desaturases and their use in making polyunsaturated fatty acids | |
DK2140006T3 (en) | DELTA-5 desaturases AND USE THEREOF FOR THE PRODUCTION OF polyunsaturated fatty acids | |
KR102147005B1 (ko) | FAD2 performance loci and corresponding target site-specific binding proteins capable of inducing targeted breaks | |
KR102711998B1 (ko) | Engineered and fully functional customized glycoproteins | |
DK2087106T3 (en) | MUTATING DELTA8 DESATURATION GENES CONSTRUCTED BY TARGETED MUTAGENES AND USE THEREOF IN THE MANUFACTURE OF MULTI-Saturated FAT ACIDS | |
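DK2623594T3 (da) | Antibody against human prostaglandin E2 receptor EP4 | |
KR20220012327A (ko) | Methods and cells for the production of phytocannabinoids and phytocannabinoid precursors | |
CN101815432A (zh) | Methods for modifying plant root architecture involving genes encoding nucleoside diphosphate kinase (NDK) polypeptides and homologs thereof | |
CN101827938A (zh) | Plants having altered root architecture, and related constructs and methods involving RT1 genes | |
KR20130138760A (ko) | Recombinant microbial host cells for high eicosapentaenoic acid production | |
CN109996874A (zh) | Heterologous production of 10-methylstearic acid | |
CN101883843A (zh) | Disruption of peroxisome biogenesis factor protein (PEX) to alter polyunsaturated fatty acid and total lipid content in oleaginous eukaryotic organisms | |
CN101918560B (zh) | Plants having altered agronomic characteristics under nitrogen-limiting conditions, and related constructs and methods involving genes encoding LNT2 polypeptides and homologs thereof | |
CN115927299A (zh) | Methods and compositions for increasing double-stranded RNA production | |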
DK2935601T3 (en) | RECOMBINANT MICROBELL CELLS PRODUCING AT LEAST 28% EICOSAPENTAIC ACID AS DRY WEIGHT | |
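CN101868545B (zh) | Plants having altered root architecture, and related constructs and methods involving genes encoding leucine-rich repeat kinase (LLRK) polypeptides and homologs thereof | |
KR20140000711A (ko) | Use of the Saccharomyces cerevisiae SUC2 gene in Yarrowia lipolytica for sucrose utilization | |
CN101848931B (zh) | Plants having altered root architecture, and related constructs and methods involving genes encoding exostosin family polypeptides and homologs thereof |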
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130327 Termination date: 20140419 |