CN1491282A - 异戊二烯醇的制备方法 - Google Patents
异戊二烯醇的制备方法 Download PDFInfo
- Publication number
- CN1491282A CN1491282A CNA018227147A CN01822714A CN1491282A CN 1491282 A CN1491282 A CN 1491282A CN A018227147 A CNA018227147 A CN A018227147A CN 01822714 A CN01822714 A CN 01822714A CN 1491282 A CN1491282 A CN 1491282A
- Authority
- CN
- China
- Prior art keywords
- gene
- recombinant
- dna
- hmg1
- mutant
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- ASUAYTHWZCLXAN-UHFFFAOYSA-N prenol Chemical compound CC(C)=CCO ASUAYTHWZCLXAN-UHFFFAOYSA-N 0.000 title claims abstract description 70
- 238000002360 preparation method Methods 0.000 title description 37
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 273
- 108020004414 DNA Proteins 0.000 claims abstract description 165
- 238000000034 method Methods 0.000 claims abstract description 108
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims abstract description 74
- 230000014509 gene expression Effects 0.000 claims abstract description 64
- 108020004511 Recombinant DNA Proteins 0.000 claims abstract description 48
- 230000010354 integration Effects 0.000 claims abstract description 27
- 238000012258 culturing Methods 0.000 claims abstract description 21
- 239000001963 growth medium Substances 0.000 claims abstract description 9
- 101150091750 HMG1 gene Proteins 0.000 claims description 144
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 123
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 123
- 239000002609 medium Substances 0.000 claims description 111
- 230000004927 fusion Effects 0.000 claims description 82
- CBIDRCWHNCKSTO-UHFFFAOYSA-N prenyl diphosphate Chemical compound CC(C)=CCO[P@](O)(=O)OP(O)(O)=O CBIDRCWHNCKSTO-UHFFFAOYSA-N 0.000 claims description 74
- 239000002773 nucleotide Substances 0.000 claims description 69
- 125000003729 nucleotide group Chemical group 0.000 claims description 69
- 241000588724 Escherichia coli Species 0.000 claims description 61
- 239000000243 solution Substances 0.000 claims description 60
- 108010007508 Farnesyltranstransferase Proteins 0.000 claims description 51
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 49
- 108090000895 Hydroxymethylglutaryl CoA Reductases Proteins 0.000 claims description 48
- 239000000203 mixture Substances 0.000 claims description 47
- 239000008103 glucose Substances 0.000 claims description 44
- 102100028501 Galanin peptides Human genes 0.000 claims description 37
- 238000013518 transcription Methods 0.000 claims description 34
- 230000035897 transcription Effects 0.000 claims description 34
- 244000005700 microbiome Species 0.000 claims description 33
- 101150094690 GAL1 gene Proteins 0.000 claims description 32
- 101100121078 Homo sapiens GAL gene Proteins 0.000 claims description 31
- 230000037361 pathway Effects 0.000 claims description 28
- 235000000346 sugar Nutrition 0.000 claims description 28
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 claims description 24
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 claims description 24
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 claims description 24
- 239000012527 feed solution Substances 0.000 claims description 23
- OJISWRZIEWCUBN-QIRCYJPOSA-N (E,E,E)-geranylgeraniol Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CO OJISWRZIEWCUBN-QIRCYJPOSA-N 0.000 claims description 22
- XWRJRXQNOHXIOX-UHFFFAOYSA-N geranylgeraniol Natural products CC(C)=CCCC(C)=CCOCC=C(C)CCC=C(C)C XWRJRXQNOHXIOX-UHFFFAOYSA-N 0.000 claims description 22
- OJISWRZIEWCUBN-UHFFFAOYSA-N geranylnerol Natural products CC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCO OJISWRZIEWCUBN-UHFFFAOYSA-N 0.000 claims description 22
- 230000002103 transcriptional effect Effects 0.000 claims description 22
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 20
- 150000001413 amino acids Chemical group 0.000 claims description 19
- 229910052799 carbon Inorganic materials 0.000 claims description 18
- 108010026318 Geranyltranstransferase Proteins 0.000 claims description 15
- 108010041601 histidyl-aspartyl-glutamyl-leucine Proteins 0.000 claims description 15
- 108090000769 Isomerases Proteins 0.000 claims description 13
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 claims description 12
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 claims description 12
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 claims description 10
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 claims description 10
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 claims description 10
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 claims description 10
- 150000003863 ammonium salts Chemical class 0.000 claims description 10
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 claims description 9
- 235000011114 ammonium hydroxide Nutrition 0.000 claims description 9
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 claims description 8
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 claims description 7
- 229910052921 ammonium sulfate Inorganic materials 0.000 claims description 7
- 235000011130 ammonium sulphate Nutrition 0.000 claims description 7
- 239000007788 liquid Substances 0.000 claims description 7
- 108010006229 Acetyl-CoA C-acetyltransferase Proteins 0.000 claims description 6
- 210000001840 diploid cell Anatomy 0.000 claims description 6
- 229910052943 magnesium sulfate Inorganic materials 0.000 claims description 6
- 235000019341 magnesium sulphate Nutrition 0.000 claims description 6
- 239000004094 surface-active agent Substances 0.000 claims description 6
- 101710183613 Diphosphomevalonate decarboxylase Proteins 0.000 claims description 5
- 108700040132 Mevalonate kinases Proteins 0.000 claims description 5
- 229910021293 PO 4 Inorganic materials 0.000 claims description 5
- 240000008042 Zea mays Species 0.000 claims description 5
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 5
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 5
- 235000005822 corn Nutrition 0.000 claims description 5
- 108091000116 phosphomevalonate kinase Proteins 0.000 claims description 5
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 claims description 4
- 101000642268 Homo sapiens Speckle-type POZ protein Proteins 0.000 claims description 4
- 108010000775 Hydroxymethylglutaryl-CoA synthase Proteins 0.000 claims description 4
- 102100036422 Speckle-type POZ protein Human genes 0.000 claims description 4
- 239000001110 calcium chloride Substances 0.000 claims description 4
- 229910001628 calcium chloride Inorganic materials 0.000 claims description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 230000000977 initiatory effect Effects 0.000 claims description 3
- 238000011084 recovery Methods 0.000 claims description 3
- 239000012266 salt solution Substances 0.000 claims description 2
- -1 alkenyl diphosphate Chemical compound 0.000 abstract description 9
- RRHGJUQNOFWUDK-UHFFFAOYSA-N Isoprene Chemical compound CC(=C)C=C RRHGJUQNOFWUDK-UHFFFAOYSA-N 0.000 abstract description 8
- 239000001177 diphosphate Substances 0.000 abstract description 4
- 235000011180 diphosphates Nutrition 0.000 abstract description 4
- 239000013612 plasmid Substances 0.000 description 153
- 101100178203 Arabidopsis thaliana HMGB3 gene Proteins 0.000 description 131
- 108700010013 HMGB1 Proteins 0.000 description 131
- 101150021904 HMGB1 gene Proteins 0.000 description 131
- 102100037907 High mobility group protein B1 Human genes 0.000 description 131
- 239000012634 fragment Substances 0.000 description 118
- 238000004519 manufacturing process Methods 0.000 description 100
- 238000003752 polymerase chain reaction Methods 0.000 description 96
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 87
- 101150080339 BTS1 gene Proteins 0.000 description 81
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 79
- 239000013615 primer Substances 0.000 description 78
- 239000013598 vector Substances 0.000 description 61
- 101710165761 (2E,6E)-farnesyl diphosphate synthase Proteins 0.000 description 58
- 101710156207 Farnesyl diphosphate synthase Proteins 0.000 description 58
- 101710125754 Farnesyl pyrophosphate synthase Proteins 0.000 description 58
- 101710089428 Farnesyl pyrophosphate synthase erg20 Proteins 0.000 description 58
- 101710150389 Probable farnesyl diphosphate synthase Proteins 0.000 description 58
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 55
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical class CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 description 53
- 210000004027 cell Anatomy 0.000 description 50
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 45
- 108090000790 Enzymes Proteins 0.000 description 42
- 238000012217 deletion Methods 0.000 description 39
- 230000037430 deletion Effects 0.000 description 38
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 36
- 101150084072 ERG20 gene Proteins 0.000 description 35
- 239000013604 expression vector Substances 0.000 description 35
- 230000000694 effects Effects 0.000 description 33
- 125000003275 alpha amino acid group Chemical group 0.000 description 30
- 230000015572 biosynthetic process Effects 0.000 description 30
- 238000009396 hybridization Methods 0.000 description 29
- 239000000523 sample Substances 0.000 description 27
- 102000004190 Enzymes Human genes 0.000 description 26
- 239000000306 component Substances 0.000 description 26
- 230000035772 mutation Effects 0.000 description 26
- 238000003786 synthesis reaction Methods 0.000 description 25
- 238000000246 agarose gel electrophoresis Methods 0.000 description 24
- 108010034529 leucyl-lysine Proteins 0.000 description 23
- 238000006467 substitution reaction Methods 0.000 description 23
- 108091034117 Oligonucleotide Proteins 0.000 description 22
- CABVTRNMFUVUDM-VRHQGPGLSA-N (3S)-3-hydroxy-3-methylglutaryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@@](O)(CC(O)=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CABVTRNMFUVUDM-VRHQGPGLSA-N 0.000 description 21
- 241000880493 Leptailurus serval Species 0.000 description 21
- 239000000047 product Substances 0.000 description 21
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 20
- 238000006243 chemical reaction Methods 0.000 description 20
- 238000010367 cloning Methods 0.000 description 20
- 239000012528 membrane Substances 0.000 description 20
- 241000282326 Felis catus Species 0.000 description 19
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 19
- 102000004169 proteins and genes Human genes 0.000 description 19
- 108090000854 Oxidoreductases Proteins 0.000 description 18
- OFBQJSOFQDEBGM-UHFFFAOYSA-N Pentane Chemical compound CCCCC OFBQJSOFQDEBGM-UHFFFAOYSA-N 0.000 description 18
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 18
- 230000001939 inductive effect Effects 0.000 description 18
- 102100035111 Farnesyl pyrophosphate synthase Human genes 0.000 description 17
- 101150064873 ispA gene Proteins 0.000 description 17
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 16
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 16
- 108010068380 arginylarginine Proteins 0.000 description 16
- 238000010586 diagram Methods 0.000 description 16
- 101100397224 Bacillus subtilis (strain 168) isp gene Proteins 0.000 description 15
- 101100052502 Shigella flexneri yciB gene Proteins 0.000 description 15
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- 230000001965 increasing effect Effects 0.000 description 14
- OINNEUNVOZHBOX-QIRCYJPOSA-N 2-trans,6-trans,10-trans-geranylgeranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP(O)(=O)OP(O)(O)=O OINNEUNVOZHBOX-QIRCYJPOSA-N 0.000 description 13
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 13
- 238000001514 detection method Methods 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 102000004196 processed proteins & peptides Human genes 0.000 description 13
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 12
- 241000894006 Bacteria Species 0.000 description 12
- 101150106356 FPS gene Proteins 0.000 description 12
- OINNEUNVOZHBOX-XBQSVVNOSA-N Geranylgeranyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)O OINNEUNVOZHBOX-XBQSVVNOSA-N 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- 108010005233 alanylglutamic acid Proteins 0.000 description 12
- 239000002299 complementary DNA Substances 0.000 description 12
- 229920001184 polypeptide Polymers 0.000 description 12
- 108091008146 restriction endonucleases Proteins 0.000 description 12
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 12
- VWFJDQUYCIWHTN-FBXUGWQNSA-N Farnesyl diphosphate Natural products CC(C)=CCC\C(C)=C/CC\C(C)=C/COP(O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-FBXUGWQNSA-N 0.000 description 11
- 101100390535 Mus musculus Fdft1 gene Proteins 0.000 description 11
- 101100390536 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) erg-6 gene Proteins 0.000 description 11
- 238000000636 Northern blotting Methods 0.000 description 11
- 101150116391 erg9 gene Proteins 0.000 description 11
- INOZZBHURUDQQR-AJNGGQMLSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 INOZZBHURUDQQR-AJNGGQMLSA-N 0.000 description 10
- VWFJDQUYCIWHTN-YFVJMOTDSA-N 2-trans,6-trans-farnesyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-YFVJMOTDSA-N 0.000 description 10
- 229920001817 Agar Polymers 0.000 description 10
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 10
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 10
- QMNFFXRFOJIOKZ-UHFFFAOYSA-N Cycloguanyl Natural products CC1(C)N=C(N)N=C(N)N1C1=CC=C(Cl)C=C1 QMNFFXRFOJIOKZ-UHFFFAOYSA-N 0.000 description 10
- 102100039371 ER lumen protein-retaining receptor 1 Human genes 0.000 description 10
- 102100039291 Geranylgeranyl pyrophosphate synthase Human genes 0.000 description 10
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 10
- 101000812437 Homo sapiens ER lumen protein-retaining receptor 1 Proteins 0.000 description 10
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 10
- 239000008272 agar Substances 0.000 description 10
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 10
- 239000003550 marker Substances 0.000 description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- 102000004286 Hydroxymethylglutaryl CoA Reductases Human genes 0.000 description 9
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 9
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- 239000002033 PVDF binder Substances 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 9
- 238000002105 Southern blotting Methods 0.000 description 9
- 108010087924 alanylproline Proteins 0.000 description 9
- 108010062796 arginyllysine Proteins 0.000 description 9
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 9
- 238000004925 denaturation Methods 0.000 description 9
- 230000036425 denaturation Effects 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 9
- 239000013605 shuttle vector Substances 0.000 description 9
- 239000003549 soybean oil Substances 0.000 description 9
- 235000012424 soybean oil Nutrition 0.000 description 9
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 8
- 101150064299 AUR1 gene Proteins 0.000 description 8
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 8
- 239000005695 Ammonium acetate Substances 0.000 description 8
- 101000626964 Cyanophora paradoxa Uncharacterized 24.3 kDa protein in psbH-rpl11 intergenic region Proteins 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 238000012408 PCR amplification Methods 0.000 description 8
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 8
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 8
- 235000019257 ammonium acetate Nutrition 0.000 description 8
- 229940043376 ammonium acetate Drugs 0.000 description 8
- 238000013461 design Methods 0.000 description 8
- 238000010790 dilution Methods 0.000 description 8
- 239000012895 dilution Substances 0.000 description 8
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 8
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- 230000009466 transformation Effects 0.000 description 8
- QVVDVENEPNODSI-BTNSXGMBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylidene Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QVVDVENEPNODSI-BTNSXGMBSA-N 0.000 description 7
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 7
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 7
- 101150045041 ERG8 gene Proteins 0.000 description 7
- 101100297762 Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) PKS12 gene Proteins 0.000 description 7
- 101150056978 HMGS gene Proteins 0.000 description 7
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 7
- 239000006180 TBST buffer Substances 0.000 description 7
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 7
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 7
- 229930182830 galactose Natural products 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- 238000011160 research Methods 0.000 description 7
- 229940031439 squalene Drugs 0.000 description 7
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 7
- 150000003505 terpenes Chemical class 0.000 description 7
- 238000001262 western blot Methods 0.000 description 7
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 6
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 6
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 6
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 6
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 6
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 6
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 6
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 6
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 6
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 6
- GURLOFOJBHRPJN-AAEUAGOBSA-N Asn-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GURLOFOJBHRPJN-AAEUAGOBSA-N 0.000 description 6
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 6
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 6
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 6
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 6
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 6
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 6
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 6
- KCSDYJSCUWLILX-BJDJZHNGSA-N Cys-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N KCSDYJSCUWLILX-BJDJZHNGSA-N 0.000 description 6
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 6
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 6
- 101150051269 ERG10 gene Proteins 0.000 description 6
- 101150082479 GAL gene Proteins 0.000 description 6
- 101100025321 Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) ERG19 gene Proteins 0.000 description 6
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 6
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 6
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 6
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 6
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 6
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 6
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 6
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 6
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 6
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 6
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 6
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 6
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 6
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 6
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 6
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 6
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 6
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 6
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 6
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 6
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 6
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 6
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 6
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 6
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 6
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 6
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 6
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 6
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 6
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 6
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 6
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 6
- 101100445407 Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) erg10B gene Proteins 0.000 description 6
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 6
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 6
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 6
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 6
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 6
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 6
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 6
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 6
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 6
- 239000013614 RNA sample Substances 0.000 description 6
- 241000235070 Saccharomyces Species 0.000 description 6
- 101100011891 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ERG13 gene Proteins 0.000 description 6
- 101100025327 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MVD1 gene Proteins 0.000 description 6
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 6
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 6
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 6
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 6
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 6
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 6
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 6
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 6
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 6
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 6
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 210000004899 c-terminal region Anatomy 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 108020001507 fusion proteins Proteins 0.000 description 6
- 102000037865 fusion proteins Human genes 0.000 description 6
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 6
- 101150066555 lacZ gene Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 238000002741 site-directed mutagenesis Methods 0.000 description 6
- 235000002639 sodium chloride Nutrition 0.000 description 6
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 108010009962 valyltyrosine Proteins 0.000 description 6
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 5
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 5
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 5
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 5
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 5
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 5
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 5
- 101100480861 Caldanaerobacter subterraneus subsp. tengcongensis (strain DSM 15242 / JCM 11007 / NBRC 100824 / MB4) tdh gene Proteins 0.000 description 5
- 101100447466 Candida albicans (strain WO-1) TDH1 gene Proteins 0.000 description 5
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 5
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 5
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 5
- 101150071502 ERG12 gene Proteins 0.000 description 5
- GVVPGTZRZFNKDS-YFHOEESVSA-N Geranyl diphosphate Natural products CC(C)=CCC\C(C)=C/COP(O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-YFHOEESVSA-N 0.000 description 5
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 5
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 5
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 5
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 5
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 5
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 5
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 5
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 5
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 5
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 5
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 5
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 5
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 5
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 5
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 5
- KZKVVWBOGDKHKE-QTKMDUPCSA-N Met-Thr-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 KZKVVWBOGDKHKE-QTKMDUPCSA-N 0.000 description 5
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 5
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 5
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 5
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 5
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 5
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 5
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 5
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 5
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 238000000137 annealing Methods 0.000 description 5
- 102220396429 c.235T>G Human genes 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 239000007795 chemical reaction product Substances 0.000 description 5
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 5
- CPJRRXSHAYUTGL-UHFFFAOYSA-N isopentenyl alcohol Chemical compound CC(=C)CCO CPJRRXSHAYUTGL-UHFFFAOYSA-N 0.000 description 5
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 5
- 238000013507 mapping Methods 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 239000002243 precursor Substances 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 230000008439 repair process Effects 0.000 description 5
- 239000006152 selective media Substances 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 101150088047 tdh3 gene Proteins 0.000 description 5
- KJIOQYGWTQBHNH-UHFFFAOYSA-N undecanol Chemical compound CCCCCCCCCCCO KJIOQYGWTQBHNH-UHFFFAOYSA-N 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- XDIYNQZUNSSENW-NUVHGKSTSA-N (2r,3s,4s,5r)-2,3,4,5,6-pentahydroxyhexanal;(2r,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanal Chemical compound OC[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)C=O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O XDIYNQZUNSSENW-NUVHGKSTSA-N 0.000 description 4
- AJPADPZSRRUGHI-RFZPGFLSSA-L 1-deoxy-D-xylulose 5-phosphate(2-) Chemical compound CC(=O)[C@@H](O)[C@H](O)COP([O-])([O-])=O AJPADPZSRRUGHI-RFZPGFLSSA-L 0.000 description 4
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 4
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 4
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 4
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 4
- 241000726103 Atta Species 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 4
- 108010022535 Farnesyl-Diphosphate Farnesyltransferase Proteins 0.000 description 4
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 4
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 4
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 4
- 101150009006 HIS3 gene Proteins 0.000 description 4
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 4
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 4
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 4
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 4
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 4
- 108010065958 Isopentenyl-diphosphate Delta-isomerase Proteins 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 4
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 4
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 4
- 108091036060 Linker DNA Proteins 0.000 description 4
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 4
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 4
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 4
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 4
- 102000004316 Oxidoreductases Human genes 0.000 description 4
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 4
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 4
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 4
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 4
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 4
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 4
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 4
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 150000001298 alcohols Chemical class 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 229940041514 candida albicans extract Drugs 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 239000001301 oxygen Substances 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 230000000717 retained effect Effects 0.000 description 4
- 102220196434 rs771442299 Human genes 0.000 description 4
- 238000011218 seed culture Methods 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 238000009423 ventilation Methods 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 239000012138 yeast extract Substances 0.000 description 4
- 239000007218 ym medium Substances 0.000 description 4
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 3
- QWTLUPDHBKBULE-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QWTLUPDHBKBULE-UHFFFAOYSA-N 0.000 description 3
- SNBCLPGEMZEWLU-QXFUBDJGSA-N 2-chloro-n-[[(2r,3s,5r)-3-hydroxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methyl]acetamide Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CNC(=O)CCl)[C@@H](O)C1 SNBCLPGEMZEWLU-QXFUBDJGSA-N 0.000 description 3
- 108010036211 5-HT-moduline Proteins 0.000 description 3
- MKPCNMXYTMQZBE-UHFFFAOYSA-N 7h-purin-6-amine;sulfuric acid;dihydrate Chemical compound O.O.OS(O)(=O)=O.NC1=NC=NC2=C1NC=N2.NC1=NC=NC2=C1NC=N2 MKPCNMXYTMQZBE-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 3
- RCQRKPUXJAGEEC-ZLUOBGJFSA-N Ala-Cys-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RCQRKPUXJAGEEC-ZLUOBGJFSA-N 0.000 description 3
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 3
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 3
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 3
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 3
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 3
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 3
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 3
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 3
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 3
- 241000203069 Archaea Species 0.000 description 3
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 3
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- PQBHGSGQZSOLIR-RYUDHWBXSA-N Arg-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PQBHGSGQZSOLIR-RYUDHWBXSA-N 0.000 description 3
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 3
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 3
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 3
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 3
- 241000235349 Ascomycota Species 0.000 description 3
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 3
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 3
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 3
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 3
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 3
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 3
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 3
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 3
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 3
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 3
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 3
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 3
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 3
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 3
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 3
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 3
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 3
- 102100030981 Beta-alanine-activating enzyme Human genes 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 3
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 3
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 3
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 3
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 3
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 3
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 3
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 3
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 3
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 3
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 3
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 3
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 3
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical class CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 3
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 3
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 3
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 3
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 3
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 3
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 3
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 3
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 3
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 3
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 3
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 3
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 3
- 101000773364 Homo sapiens Beta-alanine-activating enzyme Proteins 0.000 description 3
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 3
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 3
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 3
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 3
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 3
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 3
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 3
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 3
- WMDZARSFSMZOQO-DRZSPHRISA-N Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WMDZARSFSMZOQO-DRZSPHRISA-N 0.000 description 3
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 3
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 3
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 3
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 3
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 3
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 3
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 3
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 3
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 3
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 3
- AZLASBBHHSLQDB-GUBZILKMSA-N Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C AZLASBBHHSLQDB-GUBZILKMSA-N 0.000 description 3
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 3
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 3
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 3
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 3
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 3
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 3
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 3
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 3
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 3
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 3
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 3
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 3
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 3
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 3
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 3
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 3
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 3
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 3
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 3
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 3
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 3
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 3
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 3
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 3
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 3
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 3
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 3
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 3
- CNTNPWWHFWAZGA-JYJNAYRXSA-N Met-Met-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CNTNPWWHFWAZGA-JYJNAYRXSA-N 0.000 description 3
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 3
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 3
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 3
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- 239000004677 Nylon Substances 0.000 description 3
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 3
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 3
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 3
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 3
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 3
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 3
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 3
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 3
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 3
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 3
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 3
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 3
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 3
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 3
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 3
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 3
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 3
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 3
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 3
- 101000897764 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) Hexaprenyl pyrophosphate synthase Proteins 0.000 description 3
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 3
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 3
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 3
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 3
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 3
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 101150114468 TUB1 gene Proteins 0.000 description 3
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 3
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 3
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 3
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 3
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 3
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 3
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 3
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 3
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 3
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 3
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 3
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 3
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 3
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 3
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 3
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 3
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 3
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 3
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 3
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 3
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 3
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 3
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 3
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 3
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 3
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 3
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 3
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 3
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 description 3
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 229960005091 chloramphenicol Drugs 0.000 description 3
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 3
- 230000004186 co-expression Effects 0.000 description 3
- 210000004748 cultured cell Anatomy 0.000 description 3
- 108010025198 decaglycine Proteins 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000003517 fume Substances 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 239000012533 medium component Substances 0.000 description 3
- 230000037353 metabolic pathway Effects 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 229920001778 nylon Polymers 0.000 description 3
- 239000003921 oil Substances 0.000 description 3
- 235000019198 oils Nutrition 0.000 description 3
- 150000007524 organic acids Chemical class 0.000 description 3
- 235000005985 organic acids Nutrition 0.000 description 3
- 239000003960 organic solvent Substances 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- SHUZOJHMOBOZST-UHFFFAOYSA-N phylloquinone Natural products CC(C)CCCCC(C)CCC(C)CCCC(=CCC1=C(C)C(=O)c2ccccc2C1=O)C SHUZOJHMOBOZST-UHFFFAOYSA-N 0.000 description 3
- 239000002504 physiological saline solution Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 102220340464 rs1555191603 Human genes 0.000 description 3
- 102220225547 rs754043874 Human genes 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 238000003756 stirring Methods 0.000 description 3
- 230000009469 supplementation Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- 108700004896 tripeptide FEG Proteins 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- ACWBQPMHZXGDFX-QFIPXVFZSA-N valsartan Chemical compound C1=CC(CN(C(=O)CCCC)[C@@H](C(C)C)C(O)=O)=CC=C1C1=CC=CC=C1C1=NN=NN1 ACWBQPMHZXGDFX-QFIPXVFZSA-N 0.000 description 3
- 108700026220 vif Genes Proteins 0.000 description 3
- FQTLCLSUCSAZDY-UHFFFAOYSA-N (+) E(S) nerolidol Natural products CC(C)=CCCC(C)=CCCC(C)(O)C=C FQTLCLSUCSAZDY-UHFFFAOYSA-N 0.000 description 2
- LSLXWOCIIFUZCQ-SRVKXCTJSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-3-methyl-1-oxobutyl]amino]-3-methyl-1-oxobutyl]amino]-3-methylbutanoic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O LSLXWOCIIFUZCQ-SRVKXCTJSA-N 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- YTPMCWYIRHLEGM-BQYQJAHWSA-N 1-[(e)-2-propylsulfonylethenyl]sulfonylpropane Chemical compound CCCS(=O)(=O)\C=C\S(=O)(=O)CCC YTPMCWYIRHLEGM-BQYQJAHWSA-N 0.000 description 2
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 2
- FPIPGXGPPPQFEQ-UHFFFAOYSA-N 13-cis retinol Natural products OCC=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-UHFFFAOYSA-N 0.000 description 2
- 101150111160 ALA1 gene Proteins 0.000 description 2
- 101150118692 ALS5 gene Proteins 0.000 description 2
- 102000013563 Acid Phosphatase Human genes 0.000 description 2
- 108010051457 Acid Phosphatase Proteins 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 2
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 241000221198 Basidiomycota Species 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- 101100025317 Candida albicans (strain SC5314 / ATCC MYA-2876) MVD gene Proteins 0.000 description 2
- 102100021391 Cationic amino acid transporter 3 Human genes 0.000 description 2
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 2
- 101100098873 Chondrus crispus TUBB gene Proteins 0.000 description 2
- 239000004971 Cross linker Substances 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- PJWIPBIMSKJTIE-DCAQKATOSA-N Cys-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N PJWIPBIMSKJTIE-DCAQKATOSA-N 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- 102100021736 Galectin-1 Human genes 0.000 description 2
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 2
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 2
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 2
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 2
- ORZGPQXISSXQGW-IHRRRGAJSA-N His-His-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O ORZGPQXISSXQGW-IHRRRGAJSA-N 0.000 description 2
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 2
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 2
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 2
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- JERJIYYCOGBAIJ-OBAATPRFSA-N Ile-Tyr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JERJIYYCOGBAIJ-OBAATPRFSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 102100027665 Isopentenyl-diphosphate Delta-isomerase 1 Human genes 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 2
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- 101150079299 MVD1 gene Proteins 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- ABSPRNADVQNDOU-UHFFFAOYSA-N Menaquinone 1 Natural products C1=CC=C2C(=O)C(CC=C(C)C)=C(C)C(=O)C2=C1 ABSPRNADVQNDOU-UHFFFAOYSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 2
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 2
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- FQTLCLSUCSAZDY-ATGUSINASA-N Nerolidol Chemical compound CC(C)=CCC\C(C)=C\CC[C@](C)(O)C=C FQTLCLSUCSAZDY-ATGUSINASA-N 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 239000001888 Peptone Substances 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 2
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 2
- 101100068851 Rattus norvegicus Glra1 gene Proteins 0.000 description 2
- 108091006230 SLC7A3 Proteins 0.000 description 2
- 101100299589 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PTH1 gene Proteins 0.000 description 2
- 241000235343 Saccharomycetales Species 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- BXLYSRPHVMCOPS-ACZMJKKPSA-N Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO BXLYSRPHVMCOPS-ACZMJKKPSA-N 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 101100177443 Spinacia oleracea HEMB gene Proteins 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- 101150025182 TUBB1 gene Proteins 0.000 description 2
- 102000004243 Tubulin Human genes 0.000 description 2
- 108090000704 Tubulin Proteins 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 2
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 2
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 2
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 2
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 2
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 2
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 238000005273 aeration Methods 0.000 description 2
- 238000013019 agitation Methods 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- XXROGKLTLUQVRX-UHFFFAOYSA-N allyl alcohol Chemical compound OCC=C XXROGKLTLUQVRX-UHFFFAOYSA-N 0.000 description 2
- HGLAYHSJJRYIBI-UHFFFAOYSA-N allyl diphosphate Chemical compound OP(O)(=O)OP(O)(=O)OCC=C HGLAYHSJJRYIBI-UHFFFAOYSA-N 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 239000008346 aqueous phase Substances 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 239000002585 base Substances 0.000 description 2
- 239000011648 beta-carotene Substances 0.000 description 2
- 235000013734 beta-carotene Nutrition 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000004440 column chromatography Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 239000008121 dextrose Substances 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000004821 distillation Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000012224 gene deletion Methods 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 238000009776 industrial production Methods 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 150000007522 mineralic acids Chemical class 0.000 description 2
- 125000000896 monocarboxylic acid group Chemical group 0.000 description 2
- 235000019796 monopotassium phosphate Nutrition 0.000 description 2
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 2
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 2
- WASNIKZYIWZQIP-AWEZNQCLSA-N nerolidol Natural products CC(=CCCC(=CCC[C@@H](O)C=C)C)C WASNIKZYIWZQIP-AWEZNQCLSA-N 0.000 description 2
- JPXMTWWFLBLUCD-UHFFFAOYSA-N nitro blue tetrazolium(2+) Chemical compound COC1=CC(C=2C=C(OC)C(=CC=2)[N+]=2N(N=C(N=2)C=2C=CC=CC=2)C=2C=CC(=CC=2)[N+]([O-])=O)=CC=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=C([N+]([O-])=O)C=C1 JPXMTWWFLBLUCD-UHFFFAOYSA-N 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 2
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 2
- 235000019175 phylloquinone Nutrition 0.000 description 2
- 239000011772 phylloquinone Substances 0.000 description 2
- MBWXNTAXLNYFJB-NKFFZRIASA-N phylloquinone Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CCC[C@H](C)CCC[C@H](C)CCCC(C)C)=C(C)C(=O)C2=C1 MBWXNTAXLNYFJB-NKFFZRIASA-N 0.000 description 2
- 229960001898 phytomenadione Drugs 0.000 description 2
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- BDERNNFJNOPAEC-UHFFFAOYSA-N propan-1-ol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 2
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 102200155719 rs121918454 Human genes 0.000 description 2
- 102220053979 rs727505279 Human genes 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- CRDAMVZIKSXKFV-UHFFFAOYSA-N trans-Farnesol Natural products CC(C)=CCCC(C)=CCCC(C)=CCO CRDAMVZIKSXKFV-UHFFFAOYSA-N 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 229940057402 undecyl alcohol Drugs 0.000 description 2
- 108010021199 valyl-valyl-valine Proteins 0.000 description 2
- 239000000341 volatile oil Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- OENHQHLEOONYIE-JLTXGRSLSA-N β-Carotene Chemical compound CC=1CCCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C OENHQHLEOONYIE-JLTXGRSLSA-N 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- CRDAMVZIKSXKFV-FBXUGWQNSA-N (2-cis,6-cis)-farnesol Chemical compound CC(C)=CCC\C(C)=C/CC\C(C)=C/CO CRDAMVZIKSXKFV-FBXUGWQNSA-N 0.000 description 1
- 239000000260 (2E,6E)-3,7,11-trimethyldodeca-2,6,10-trien-1-ol Substances 0.000 description 1
- KQRHTCDQWJLLME-XUXIUFHCSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-aminopropanoyl]amino]-4-methylpentanoyl]amino]propanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)N KQRHTCDQWJLLME-XUXIUFHCSA-N 0.000 description 1
- JEOQACOXAOEPLX-WCCKRBBISA-N (2s)-2-amino-5-(diaminomethylideneamino)pentanoic acid;1,3-thiazolidine-4-carboxylic acid Chemical compound OC(=O)C1CSCN1.OC(=O)[C@@H](N)CCCN=C(N)N JEOQACOXAOEPLX-WCCKRBBISA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 125000003903 2-propenyl group Chemical group [H]C([*])([H])C([H])=C([H])[H] 0.000 description 1
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 1
- 240000006409 Acacia auriculiformis Species 0.000 description 1
- 241001480832 Actibacterium Species 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- LJTZPXOCBZRFBH-CIUDSAMLSA-N Ala-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N LJTZPXOCBZRFBH-CIUDSAMLSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- HCZXHQADHZIEJD-CIUDSAMLSA-N Ala-Leu-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HCZXHQADHZIEJD-CIUDSAMLSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- 235000019489 Almond oil Nutrition 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- 241001244729 Apalis Species 0.000 description 1
- 101100190282 Arabidopsis thaliana PHE1 gene Proteins 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 241000193764 Brevibacillus brevis Species 0.000 description 1
- 101100268670 Caenorhabditis elegans acc-3 gene Proteins 0.000 description 1
- 101100000858 Caenorhabditis elegans act-3 gene Proteins 0.000 description 1
- 101100492805 Caenorhabditis elegans atm-1 gene Proteins 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- 101100129088 Caenorhabditis elegans lys-2 gene Proteins 0.000 description 1
- 101100298222 Caenorhabditis elegans pot-1 gene Proteins 0.000 description 1
- 101100298225 Caenorhabditis elegans pot-2 gene Proteins 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 240000005209 Canarium indicum Species 0.000 description 1
- 235000004322 Canarium indicum Nutrition 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 241000235646 Cyberlindnera jadinii Species 0.000 description 1
- WKELHWMCIXSVDT-UBHSHLNASA-N Cys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WKELHWMCIXSVDT-UBHSHLNASA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 1
- XWTGTTNUCCEFJI-UBHSHLNASA-N Cys-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N XWTGTTNUCCEFJI-UBHSHLNASA-N 0.000 description 1
- RIONIAPMMKVUCX-IHPCNDPISA-N Cys-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CC=C(O)C=C1 RIONIAPMMKVUCX-IHPCNDPISA-N 0.000 description 1
- ZFHXNNXMNLWKJH-HJPIBITLSA-N Cys-Tyr-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZFHXNNXMNLWKJH-HJPIBITLSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 102100033072 DNA replication ATP-dependent helicase DNA2 Human genes 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 101150023788 ERG19 gene Proteins 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 101150038242 GAL10 gene Proteins 0.000 description 1
- 108010001498 Galectin 1 Proteins 0.000 description 1
- 102100024637 Galectin-10 Human genes 0.000 description 1
- 229930191978 Gibberellin Natural products 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- 101100175482 Glycine max CG-3 gene Proteins 0.000 description 1
- OOFLZRMKTMLSMH-UHFFFAOYSA-N H4atta Chemical compound OC(=O)CN(CC(O)=O)CC1=CC=CC(C=2N=C(C=C(C=2)C=2C3=CC=CC=C3C=C3C=CC=CC3=2)C=2N=C(CN(CC(O)=O)CC(O)=O)C=CC=2)=N1 OOFLZRMKTMLSMH-UHFFFAOYSA-N 0.000 description 1
- 241000205062 Halobacterium Species 0.000 description 1
- 101100231525 Haloferax volcanii (strain ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB 2012 / VKM B-1768 / DS2) hmgA gene Proteins 0.000 description 1
- 244000043261 Hevea brasiliensis Species 0.000 description 1
- 102100022128 High mobility group protein B2 Human genes 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 1
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- 101000927313 Homo sapiens DNA replication ATP-dependent helicase DNA2 Proteins 0.000 description 1
- 101000952691 Homo sapiens Dephospho-CoA kinase Proteins 0.000 description 1
- 101001045791 Homo sapiens High mobility group protein B2 Proteins 0.000 description 1
- 101001081533 Homo sapiens Isopentenyl-diphosphate Delta-isomerase 1 Proteins 0.000 description 1
- 101000926206 Homo sapiens Putative glutathione hydrolase 3 proenzyme Proteins 0.000 description 1
- HEFNNWSXXWATRW-UHFFFAOYSA-N Ibuprofen Chemical compound CC(C)CC1=CC=C(C(C)C(O)=O)C=C1 HEFNNWSXXWATRW-UHFFFAOYSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 240000006024 Lactobacillus plantarum Species 0.000 description 1
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 208000002720 Malnutrition Diseases 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- DZMGFGQBRYWJOR-YUMQZZPRSA-N Met-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O DZMGFGQBRYWJOR-YUMQZZPRSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- 241000202974 Methanobacterium Species 0.000 description 1
- 101100202339 Mus musculus Slc6a13 gene Proteins 0.000 description 1
- 241001467460 Myxogastria Species 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 101100005280 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cat-3 gene Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 102220484783 Norrin_S75P_mutation Human genes 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- YEEFZOKPYOUXMX-KKUMJFAQSA-N Phe-Gln-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YEEFZOKPYOUXMX-KKUMJFAQSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- 101100271190 Plasmodium falciparum (isolate 3D7) ATAT gene Proteins 0.000 description 1
- RVGRUAULSDPKGF-UHFFFAOYSA-N Poloxamer Chemical compound C1CO1.CC1CO1 RVGRUAULSDPKGF-UHFFFAOYSA-N 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589776 Pseudomonas putida Species 0.000 description 1
- 102100034060 Putative glutathione hydrolase 3 proenzyme Human genes 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 101100202330 Rattus norvegicus Slc6a11 gene Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 101100174613 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TDH3 gene Proteins 0.000 description 1
- 101900183287 Saccharomyces cerevisiae Acetyl-CoA acetyltransferase Proteins 0.000 description 1
- 101900281549 Saccharomyces cerevisiae Diphosphomevalonate decarboxylase Proteins 0.000 description 1
- 101900354897 Saccharomyces cerevisiae Mevalonate kinase Proteins 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 241000235345 Schizosaccharomycetaceae Species 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- HZNFKPJCGZXKIC-DCAQKATOSA-N Ser-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N HZNFKPJCGZXKIC-DCAQKATOSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 229930182558 Sterol Natural products 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000205101 Sulfolobus Species 0.000 description 1
- 241000205098 Sulfolobus acidocaldarius Species 0.000 description 1
- 101150010219 TEF2 gene Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 241000223257 Thermomyces Species 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 229920004894 Triton X-305 Polymers 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- FPIPGXGPPPQFEQ-BOOMUCAASA-N Vitamin A Natural products OC/C=C(/C)\C=C\C=C(\C)/C=C/C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-BOOMUCAASA-N 0.000 description 1
- 229930003448 Vitamin K Natural products 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010084094 alanyl-alanyl-alanyl-alanine Proteins 0.000 description 1
- 108010054982 alanyl-leucyl-alanyl-leucine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 239000012670 alkaline solution Substances 0.000 description 1
- OENHQHLEOONYIE-UKMVMLAPSA-N all-trans beta-carotene Natural products CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C OENHQHLEOONYIE-UKMVMLAPSA-N 0.000 description 1
- 239000008168 almond oil Substances 0.000 description 1
- 101150087698 alpha gene Proteins 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 230000015807 ascospore-type prospore assembly Effects 0.000 description 1
- 238000000211 autoradiogram Methods 0.000 description 1
- 238000012365 batch cultivation Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- TUPZEYHYWIEDIH-WAIFQNFQSA-N beta-carotene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCCC1(C)C)C=CC=C(/C)C=CC2=CCCCC2(C)C TUPZEYHYWIEDIH-WAIFQNFQSA-N 0.000 description 1
- 229960002747 betacarotene Drugs 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 239000012159 carrier gas Substances 0.000 description 1
- 230000034303 cell budding Effects 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 229930002875 chlorophyll Natural products 0.000 description 1
- 235000019804 chlorophyll Nutrition 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 229910000365 copper sulfate Inorganic materials 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 239000012531 culture fluid Substances 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- GVJHHUAWPYXKBD-UHFFFAOYSA-N d-alpha-tocopherol Natural products OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000013530 defoamer Substances 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 108010030074 endodeoxyribonuclease MluI Proteins 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 150000002085 enols Chemical class 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- DNVPQKQSNYMLRS-APGDWVJJSA-N ergosterol Chemical compound C1[C@@H](O)CC[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@H](C)/C=C/[C@H](C)C(C)C)CC[C@H]33)C)C3=CC=C21 DNVPQKQSNYMLRS-APGDWVJJSA-N 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 229930002886 farnesol Natural products 0.000 description 1
- 229940043259 farnesol Drugs 0.000 description 1
- 108091005640 farnesylated proteins Proteins 0.000 description 1
- 239000011790 ferrous sulphate Substances 0.000 description 1
- 235000003891 ferrous sulphate Nutrition 0.000 description 1
- 235000021323 fish oil Nutrition 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 238000010359 gene isolation Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000002686 geranylgeranyl group Chemical group [H]C([*])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])C([H])=C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000003448 gibberellin Substances 0.000 description 1
- IXORZMNAPKEEDV-OBDJNFEBSA-N gibberellin A3 Chemical class C([C@@]1(O)C(=C)C[C@@]2(C1)[C@H]1C(O)=O)C[C@H]2[C@]2(C=C[C@@H]3O)[C@H]1[C@]3(C)C(=O)O2 IXORZMNAPKEEDV-OBDJNFEBSA-N 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 229940072205 lactobacillus plantarum Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010049589 leucyl-leucyl-leucine Proteins 0.000 description 1
- 238000005567 liquid scintillation counting Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- GVALZJMUIHGIMD-UHFFFAOYSA-H magnesium phosphate Chemical compound [Mg+2].[Mg+2].[Mg+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O GVALZJMUIHGIMD-UHFFFAOYSA-H 0.000 description 1
- 239000004137 magnesium phosphate Substances 0.000 description 1
- 229960002261 magnesium phosphate Drugs 0.000 description 1
- 229910000157 magnesium phosphate Inorganic materials 0.000 description 1
- 235000010994 magnesium phosphates Nutrition 0.000 description 1
- 229940099596 manganese sulfate Drugs 0.000 description 1
- 239000011702 manganese sulphate Substances 0.000 description 1
- 235000007079 manganese sulphate Nutrition 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 239000013028 medium composition Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 230000000696 methanogenic effect Effects 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000011259 mixed solution Substances 0.000 description 1
- 239000003147 molecular marker Substances 0.000 description 1
- 229920003052 natural elastomer Polymers 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 229920001194 natural rubber Polymers 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 235000018343 nutrient deficiency Nutrition 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 239000002304 perfume Substances 0.000 description 1
- 239000002831 pharmacologic agent Substances 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 235000011056 potassium acetate Nutrition 0.000 description 1
- 108091005629 prenylated proteins Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 150000004053 quinones Chemical group 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000007670 refining Methods 0.000 description 1
- 229960003471 retinol Drugs 0.000 description 1
- 235000020944 retinol Nutrition 0.000 description 1
- 239000011607 retinol Substances 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 102200009646 rs104895219 Human genes 0.000 description 1
- 102220095194 rs876660470 Human genes 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 229930004725 sesquiterpene Natural products 0.000 description 1
- 150000004354 sesquiterpene derivatives Chemical class 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000028070 sporulation Effects 0.000 description 1
- 239000007362 sporulation medium Substances 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 235000003702 sterols Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 235000010384 tocopherol Nutrition 0.000 description 1
- 229960001295 tocopherol Drugs 0.000 description 1
- 229930003799 tocopherol Natural products 0.000 description 1
- 239000011732 tocopherol Substances 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 230000005758 transcription activity Effects 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 108010036320 valylleucine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 235000019155 vitamin A Nutrition 0.000 description 1
- 239000011719 vitamin A Substances 0.000 description 1
- 235000019168 vitamin K Nutrition 0.000 description 1
- 239000011712 vitamin K Substances 0.000 description 1
- 150000003721 vitamin K derivatives Chemical class 0.000 description 1
- 229940045997 vitamin a Drugs 0.000 description 1
- 229940046010 vitamin k Drugs 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- GVJHHUAWPYXKBD-IEOSBIPESA-N α-tocopherol Chemical compound OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
一种制备异戊二烯醇的方法,其特征在于该方法包括:通过将一个表达重组DNA或一个用于基因组整合的DNA导入宿主内而构建一个重组体,这两种DNA均含有异戊二烯基二磷酸合酶基因或其变体基因;培养得到的重组体;然后从获得的培养基中收集异戊二烯醇。
Description
技术领域
本发明涉及异戊二烯醇的制备方法。
背景技术
萜类化合物(类异戊二烯)的生物合成,开始于香叶基二磷酸(GPP;C10),法呢基二磷酸(FPP;C15)和香叶基香叶基二磷酸(GGPP;C20)的合成,这三种物质都属于直链异戊二烯基二磷酸,顺次经由异戊烯基二磷酸(IPP;C5)与一个烯丙基二磷酸底物的缩合反应进行(图1)。图1中,方框中的缩写和名称表示酶。特定的,hmgR表示羟甲基戊二酰基辅酶A(HMG辅酶A)还原酶;GGPS表示GGPP合酶;FPS表示FPP合酶。
在各种异戊二烯基二磷酸中,FPP是生物合成中最重要的中间体,也是许多种类萜类化合物合成的前体,例如,包括麦角固醇(维生素原D2)的类固醇,醌类物质的侧链(维生素K;VK),倍半萜烯,(角)鲨烯(SQ),法呢基化蛋白质的锚着分子,天然橡胶,等等。
GGPP也是萜类化合物生物合成的一个重要中间体,并对一些化合物的生物合成而言是必不可少的,诸如:视黄醇(维生素A;VA),β-胡罗卜素(维生素原A),叶绿醌(维生素K1;VK1),香叶基香叶基蛋白质的锚着分子,叶绿素的侧链,赤霉素,以及古细菌的醚脂。
法呢醇(FOH;C15)和香叶基香叶醇(GGOH;C20),分别是FPP和GGPP的醇衍生物,它们的异构体如橙花叔醇(NOH;C15)是用于制造香水的香精油中的芳香物质。它们也在各种不同的化合物,包括上述作为药理剂的维生素的合成中作为重要的起始物质(图1)。
因此,希望能够建立一个体系,可大量制备所谓活性型异戊二烯醇的纯产物,而不是顺和反((Z)-和(E)-)两种异构体的混合物。
尽管业界普遍认为所有IPP的生物合成均通过甲羟戊酸途径完成(该途径中,IPP由乙酰辅酶A经由甲羟戊酸而合成),但是M.Rohmer等人于80年代末阐明了利用细菌合成IPP的新途径。该途径被称为非甲羟戊酸途径或DXP(1-脱氧木酮糖5-磷酸)途径,该途径中,IPP由甘油醛-3-磷酸盐合成,丙酮酸经由1-脱氧木酮糖5-磷酸合成。
目前,GGOH是采用化学合成的方法制备的(例如,见日本未审查专利公开No.8-133999)。可是,化学法合成GGOH所需的步骤多于化学合成短碳链的FOH或NOH所需的步骤,因此也需要更高的成本。此外,尽管化学方法合成的GGOH和天然存在的GGOH相比,具有相同的碳骨架结构,但是获得的却是(E)-型(反式)和(Z)-型(顺式)、两种双键模式的混合物。(E,E,E)-GGOH(以下缩写为(全-E)-GGOH)是生物体代谢途径中合成的构型,具有工业价值。为获得单一构型的(全-E)-GGOH,需采用柱层析精炼,高精度蒸馏等方法。不过,由于GGOH是一种热不稳定的烯丙醇,因此很难采用高精度蒸馏的方法。同样,采用柱层析法精炼的方法也不适用于工业生产实践,因为该方法需要大量的溶剂和柱填料,还需要分析、回收连续洗脱馏分,及去除溶剂的复杂操作;因此,该方法不仅操作复杂而且需要很高的成本。目前情况下,希望能够建立一种通过控制顺反两种几何异构体的产生,或利用某些特性,如反应产物的重复结构来进行(全-E)-GGOH生物合成的方法。不过,这样的方法还没有建立起来。GGOH合成的底物由细胞,如芽殖的啤酒糖酵母细胞中的甲羟戊酸途径提供。然而,即使采用被认为是GGOH合成中的关键酶HMG辅酶A还原酶,其作用也仅增加了通过合成FPP来合成鲨烯的能力。(日本未审查专利公开No.5-192184;Donald et al.,(1997)Appl.Environ Microbiol.63,3341-3344)。此外,即使培养一种获得了固醇吸收能力的鲨烯合酶基因缺乏的特殊芽殖酵母菌株,也仅在培养液中显示出每升1.3mg的FOH积累量。(Chambon et al.,(1990)Curr.Genet.18,41-46):而NOH的生物合成方法还不知道。关于GGOH的生物合成,在日本未审查专利公开No.9-238692中,通过培养植物细胞可得到每升培养液0.66-3.25mg的产率。不过,这一方法需要昂贵的、不适用于工业应用的植物细胞培养基,以及需要对培养细胞进行光照。因此,这一方法即使同传统的从天然产物,如香精油中制备GGOH的方法相比,都是不实用的。已经知道的方法中还没有适用于工业化的生物合成GGOH的方法,例如,通过培养微生物来生物合成GGOH。
发明内容
本发明的一个目标是提供一种通过培养重组DNA转化的重组体来制备异戊二烯醇的方法,该重组DNA含有一个异戊二烯基二磷酸合酶基因。
针对上述难题的解决方法进行了大量广泛而且集中的研究后,本发明人努力开发了异戊二烯醇的制备体系,即将与合成异戊二烯基二磷酸相关的酶基因导入宿主。作为宿主,采用的是那些很早就已经被广泛应用于发酵工业,可通过甲羟戊酸途径或DXP途径合成异戊二烯基二磷酸,且使用易于进行多种遗传工程技术改造的微生物,例如:单细胞真核生物(特别是酵母)或原核生物(如细菌,特别是大肠杆菌)。为了在酵母内构建与异戊二烯基二磷酸合成相关的酶基因(例如,以HMG辅酶A还原酶基因,IPPΔ-异构酶基因,多种异戊二烯基二磷酸合成酶基因,或其突变体或融合基因为代表的甲羟戊酸途径相关的酶基因)在宿主细胞内人工表达的体系,构建了含有组成型(永久表达型)或诱导型表达启动子、及各种营养缺陷标记的表达穿梭载体。然后,将一个关注的基因或其突变体整合到载体中,再将该载体导入宿主细胞。发明人成功地在得到的重组体的培养物中获得异戊二烯醇(特别是香叶基香叶醇),达成上述目标。本发明也因而得以完成。当细菌,特别是大肠杆菌,被用作宿主时,采用了一种常规载体,将与异戊二烯基二磷酸合成相关的一种酶基因(例如:FPP合酶基因的突变体,或IPPΔ异构酶基因)导入宿主细胞。培养该重组体,脱磷酸化后,从得到的培养物中获得了香叶基香叶醇。这就达成了上述目标,本发明也得以完成。
本发明总结如下:
(1)一种制备异戊二烯醇(例如:香叶基香叶醇)的方法,包括:通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收异戊二烯醇。
(2)一种制备异戊二烯醇的方法,包括:通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有羟甲基戊二酰辅酶A还原酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收异戊二烯醇。
(3)一种制备香叶基香叶醇的方法,包括:通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有异戊烯基二磷酸Δ异构酶基因的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收香叶基香叶醇。
(4)异戊二烯基二磷酸合酶基因可选自下列基因(a)和(b)以及融合基因(c)和(d):
(a)法呢基二磷酸合酶基因或其突变体
(b)香叶基香叶基二磷酸合酶基因或其突变体
(c)由法呢基二磷酸合酶基因或其突变体和香叶基香叶基二磷酸合酶基因或其突变体组成的融合基因
(d)添加一个编码His Asp Glu Leu氨基酸序列的核苷酸序列的上述基因(a)或(b)或融合基因(c)。
法呢基二磷酸合酶基因的特定实例包括:一个编码如SEQ ID NO:2或4中所示氨基酸序列的基因;香叶基香叶基二磷酸合酶基因的特定实例包括:一个编码如SEQ ID NO:6中所示氨基酸序列的基因。
(5)一种制备香叶基香叶醇的方法,包括:通过将均含有羟甲基戊二酰辅酶A还原酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收香叶基香叶醇。
(6)一种制备香叶基香叶醇的方法,包括:通过将均含有羟甲基戊二酰辅酶A还原酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及含有选自下列(e)到(j)的基因的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体:
(e)异戊烯基二磷酸Δ-异构酶基因
(f)甲羟戊酸激酶基因
(g)乙酰辅酶A乙酰基转移酶基因
(h)羟甲基戊二酰辅酶A合酶基因
(i)磷酸甲羟戊酸激酶基因
(j)二磷酸甲羟戊酸脱羧酶基因
培养得到的重组体;及从得到的培养物中回收香叶基香叶醇。
(7)根据上述的方法,可制备出浓度至少为0.05mg/L的香叶基香叶醇。在这些方法中可采用宿主的特定实例包括酵母(例如:啤酒糖酵母)和大肠杆菌。在这些方法中可采用的优选啤酒糖酵母株包括:A451株,YPH499株,YPH500株,W303-1A株和W303-1B株,或这些菌株的任意一种衍生的菌株。
(8)一个用于表达的重组DNA,含有选自上述基因(a)、基因(b)、融合基因(c)和融合基因(d)中的任一基因,以及转录启动子和转录终止子。
(9)转录启动子可选自ADH1启动子、TDH3(GAP)启动子、TEF2启动子、GAL1启动子和tac启动子的任意一个;转录终止子可以是CYC1终止子。
(10)一种重组体,该重组体是通过将上述重组DNA导入宿主而获得的。宿主的特定实例如上所述。
(11)一种制备异戊二烯醇的方法,包括:采用含下列组分(i)-(vi)之一的培养基,培养具有产生异戊二烯醇能力的微生物:
(i)糖
(ii)醇
(11i)氨气,氨水和/或铵盐
(iv)氢氧化钠和硫酸的混合物
(v)KH2PO4、硫酸镁、硫酸铵、玉米浸渍液、氯化钙和表面活性剂的混合物
(vi)上述组分(i)-(v)中的两种或以上的混合物;
并且从得到的培养物中回收异戊二烯醇。
在上文(11)中所描述的方法中,可以使用补料液培养微生物,该补料液含有下列组分(i)、(ii)或(iii)或这些组分中的两种或两种以上的混合物:
(i)糖
(ii)醇
(iii)氨气、氨水和/或铵盐。
补料液可含有下述的组分,也可按下述方式添加到培养基中,例如:
简言之,补料液的碳源组分从培养开始到培养开始后12-24小时之间仅由葡萄糖组成,在此之后碳源组分更换为含乙醇的组分。这一更换可按照如下方式进行,即在培养开始12-24小时后,乙醇与补料液中总碳源组分的比率为50%或以上。作为选择,在培养开始12-24小时后,补料液的碳源组分可仅由乙醇组成。
术语“补料”指:在培养过程中,将一种特定的溶液或组分以任意方式补充或添加进培养液中。将特定组分补充或添加进发酵罐的培养方法被称为“补料分批培养”。
异戊二烯醇在培养物中积累的浓度至少为0.1g/L或更高,优选地为1g/L或更高。香叶基香叶醇可作为异戊二烯醇的一个特定实例;微生物的特定实例包括酵母,如啤酒糖酵母。本发明可采用啤酒糖酵母A451株、YPH499株、W303-1A株或W303-1B株,或任一上述菌株的衍生菌株。
此外,在上文(11)描述的方法中,优选的微生物是重组体。这样的重组体的特定实例,可以是下面的(a)或(b):
a)通过将均含有甲羟戊酸途径相关基因或其突变体,或异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建的重组体。
b)通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有甲羟戊酸途径相关基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建的重组体。
宿主的特定实例包括啤酒糖酵母。更为明确地可采用啤酒糖酵母A451株、YPH499株、YPH500株、W303-1A株或W303-1B株,或任一上述菌株的衍生菌株。
甲羟戊酸途径相关基因的特定实例包括羟甲基戊二酰辅酶A还原酶基因(即HMG1基因)。
异戊二烯基二磷酸合酶基因的特定实例,包括选自下列基因(a)、基因(b)、融合基因(c)和融合基因(d)的任意基团:
(a)法呢基二磷酸合酶基因或其突变体
(b)香叶基香叶基二磷酸合酶基因或其突变体
(c)由法呢基二磷酸合酶基因或其突变体和香叶基香叶基二磷酸合酶基因或其突变体组成的融合基因
(d)添加一个编码His Asp Glu Leu氨基酸序列的核苷酸序列的上述基因(a)或(b)或融合基因(c)。
此外,本发明采用的微生物可以是一种原养型微生物、一种双倍体细胞,或者既是原养型微生物,也是双倍体细胞。
此外,本发明以培养基pH控制为特征。pH控制是通过采用如氨气、铵盐溶液、氢氧化钠溶液或硫酸来实现的。
下文中,将对本发明进行更为具体的描述。本说明含公开于日本专利申请No.2000-403067的说明书和/或附图中的内容,基于此专利申请,本申请要求优先权。
利用代谢工程技术,本发明人试图建立一个制备活性异戊二烯醇,尤其是(全-E)-香叶基香叶醇(以下称为“GGOH”)的体系。
业界相信GGOH是以香叶基香叶基二磷酸为前体(GGPP)合成而来。通常,简单地提高GGPP合酶活性将仅引起从异戊烯基二磷酸(IPP)和3,3-二甲烯丙基焦磷酸(DMAPP)合成GGPP的加速,而且不可预知这种提高是否会导致GGOH的产生(图1)。另外,已知在体内,GGPP仅作为类胡罗卜素和异戊二烯化的蛋白质等多种终产物合成的前体(图1)。这样,即使在GGPP合成加速、这些终产物水平也可望提高的情况下,仍不可预知是否能够建立一种具有工业应用价值的GGPP合成体系。即使通过提高HMG辅酶A还原酶的酶活性或上文列于(e)-(j)中的酶的活性,从而提高了HMG辅酶A还原酶(一种甲羟戊酸途径中的关键酶)的表达水平,仍不可预知哪种合成(即FPP合成或GGPP合成)会加强;因此,不能期望上述表达水平的提高对GGPP的合成是有效果的。此外,根据迄今为止积累的资料,也不能预期FPP合酶基因的表达(FPP合酶催化法呢基二磷酸(FPP)的合成,是FOH的前体)会对GGOH的制备有效(图1)。
本发明中,通过构建用于将异戊二烯基二磷酸合酶基因、HMG辅酶A还原酶基因和/或IPPΔ-异构酶基因导入宿主细胞的各种重组DNA,及用这些重组DNA构建重组体,发明人从而发展了大量制备异戊二烯醇、特别是GGOH的体系。
1.用于表达的重组DNA或用于基因组整合的DNA片段的制备
本发明中,通过将一个转录启动子DNA和一个转录终止子DNA连接或插入到一个异戊二烯基二磷酸合酶基因内,可获得用于转化宿主细胞的一个用于表达的重组DNA的实例。预先制备一个含有异戊二烯基二磷酸合酶基因的基因表达盒也是可能的,该异戊二烯基二磷酸合酶基因已连接了一个转录启动子和一个转录终止子,然后将该基因表达盒整合到一个载体内。启动子和终止子的连接或插入可按任意顺序进行,但优选的是将启动子和终止子分别连接到异戊二烯基二磷酸合酶基因的上游和下游。作为选择,本发明中,一个异戊二烯基二磷酸合酶基因,一个转录启动子和一个转录终止子可被顺序掺入一个适当的DNA,如一个载体中。如果正确地考虑转录方向,就可按任意顺序完成掺入。
异戊二烯基二磷酸合酶基因的特定实例包括:法呢基二磷酸合酶基因(称为“FPP合酶基因”)和香叶基香叶基二磷酸合酶基因(称为“GGPP合酶基因”)。FPP合酶基因的特定实例包括:啤酒糖酵母ERG20(SEQ ID NO:1)、大肠杆菌ispA(SEQ ID NO:3)和嗜热脂肪芽孢杆菌来源的FPP合酶基因(日本未审查专利公开No.5-219961;美国专利No.5,786,192)。GGPP合酶基因的特定实例包括啤酒糖酵母BTS1(SEQ ID NO:5),嗜酸热硫化叶菌(日本未审查专利公开No.7-308913;美国专利No.5,773,273)和栖热嗜热菌Tth(日本未审查专利公开No.9-107974;美国专利No.6,107,072)。这些基因可通过常规基因分离方法或使用商业试剂盒而获得。本发明中,也可能采用一个FPP合酶基因的突变体或一个GGPP合酶基因的突变体。
此外,本发明中,可构建一个载体,该载体含有一个由GGPP合酶基因或其突变体和FPP合酶基因或其突变体组成的融合基因,使得经GGPP合酶基因和FPP合酶基因表达产生的多肽具有融合蛋白的形式。本发明中,这样一种用两种或两种以上基因构建、使融合蛋白作为一种表达产物而产生出来的基因,被称为“融合基因”。为制备融合基因,可采用下述方法,即用一种适当的限制性内切酶消化一个DNA,然后将用同一种酶作用下预消化的另一个DNA以特定的方式连接到前一个DNA,该方式中后者DNA编码的蛋白质的氨基酸序列的读码框架不发生移码。
此外,本发明中,为添加内质网(ER)过渡信号(氨基酸序列表现为His Asp Glu Leu(SEQ ID NO:24);下文参见“HDEL序列”)到异戊二烯基二磷酸合酶基因或其突变体或上述融合基因表达产生的蛋白质的C-端上,编码该氨基酸序列的核苷酸序列可被添加到异戊二烯基二磷酸合酶基因或融合基因上,从而构建出一种修饰的基因。
此外,本发明中,通过将单独的羟甲基戊二酰辅酶A(HMG辅酶A)还原酶基因(SEQ ID NO:7)或其突变体,或含上述异戊二烯基二磷酸合酶基因(包括其突变体)的融合基因导入宿主中,并表达基因,也可能制备异戊二烯醇(特别是GGOH)。作为选择,也可将异戊二烯基二磷酸合酶基因或其突变体和HMG辅酶A还原酶基因或其突变体一起导入宿主中,从而共表达两种基因。HMG辅酶A还原酶基因的特定实例包括啤酒糖酵母HMG1和HMG2。
上述异戊二烯基二磷酸合酶基因和HMG辅酶A还原酶基因的突变体可以是缺失突变体基因,即有一部分区域的缺失(例如:HMG辅酶A还原酶基因最多可缺失2217个核苷酸),或者是在野生型基因或上述缺失突变基因的核苷酸序列中具有缺失、替代或添加1个或几个至10个核苷酸的突变基因。因此,由这样的突变基因编码的氨基酸序列可能有一个(或多个)突变。特定的,野生型异戊二烯基二磷酸合酶的氨基酸序列(FPP合酶:SEQ ID NO:2或4;GGPP合酶:SEQ ID NO:6)或野生型MMG辅酶A还原酶的氨基酸序列(SEQ ID NO:8)可能有一个(或多个)突变,如缺失、替代或添加一个或几个氨基酸(例如:1-10个,优选的是1-3个)。野生型HMG辅酶A还原酶的氨基酸序列(SEQ ID NO:8)最多有739个氨基酸的缺失,且这种缺失突变型酶还可能进一步发生一个(或几个)突变,例如缺失、添加、替代或插入了一个或几个氨基酸(例如:1-10个,优选的是1-3个)。特定的,本发明可采用图2B中阐明的野生型HMG辅酶A还原酶基因或其缺失突变体,且由于核苷酸替代的原因,由它们编码的氨基酸序列可能会有1-10个位点特异性替代,可参见图2A。此外,当一个野生型异戊二烯基二磷酸合酶(例如:SEQ ID NO:1,3或5)或野生型HMG辅酶A还原酶基因(SEQ ID NO:7)经PCR(聚合酶链反应)扩增后,由于DNA聚合酶,如Taq DNA聚合酶的低准确性,使得核苷酸的替代突变发生在得到的DNA片段上,被称为“PCR错误”。例如在本发明中可用到的一种HMG辅酶A还原酶基因所编码的多肽具有替代突变,该突变是由于以野生型HMG辅酶A还原酶基因(SEQ ID NO:7)为模板,由于PCR错误造成的核苷酸替代引起的;该HMG辅酶A还原酶基因被命名为“HMG1”。在以野生型HMG辅酶A还原酶基因(SEQ ID NO:7)为模板,由于PCR错误造成的核苷酸替代的实施方案见图2A。HMG1’具有如SEQ ID NO:9中所示的核苷酸序列,由其编码的氨基酸序列如SED ID NO:10中所示。图2A中,核苷酸突变按如下顺序表示:替代前的相关核苷酸(以一个字母代码表示),按照在野生型HMG辅酶A还原酶基因的起始密码子中第一个核苷酸的位置为1进行记数的该核苷酸的位置,替代后的核苷酸(以一个字母代码表示)。包含在PCR错误型HMG辅酶A还原酶的氨基酸序列中的氨基酸突变,按如下顺序表达:替代前的相关氨基酸(以一个字母代码表示),该氨基酸在HMG辅酶A还原酶中的位置,替代后的氨基酸(以一个字母代码表示)。此外,上述的PCR错误型核苷酸序列可通过诸如定点诱变的技术进行部分修饰。这种修饰过的HMG辅酶A还原酶基因也可用于本发明。PCR错误引起的核苷酸替代的实施方案如图2A所示。此外,上述的PCR错误类型核苷酸序列可通过诸如定点诱变形成技术进行部分修饰。编码该修饰型HMG辅酶A还原酶(SEQ ID NO:12)的基因(SEQ ID NO:11)也可用于本发明。
此外,PCR错误型HMG辅酶A还原酶基因HMG1’的缺失突变体HMG1Δ基因(Fig 2B)可作为HMG辅酶A还原酶基因(包括PCR错误型)的一个实例,该基因编码了预测跨膜域缺失的缺失突变体。该图中最高一行表示没有缺失的HMG1’。由细实线(一)指出的部分表示缺失区域。下表1显示在每个缺失突变体基因中,HMG1’基因(SEQ ID NO:9)缺失了哪一个区域。根据缺失模式,HMG1’缺失突变体表示为“HMG1Δxxy”,“xx”表示缺失模式,“y”是一个工作数字(任意数字)。图2B中,“Δ026”即HMG1Δ02y的一个实例。(其它缺失模式实例也以相同方式表达)
表1
缺失的实施方式
预测跨膜
缺失突变体 域的缺失
的命名 引物1 引物2 质粒 (s) 缺失区域 缺失后序列
HMG1Δ02y HMG1(558-532) HMG1(799-825) pYHMG02X #2-#3 Nucleotide Positions 559-798 SEQ ID NO:13
HMG1Δ04y HMG1(1191-1165) HMG1(1267-1293) pYHMG04X #6 Nucleotide Positions 1192-1266 SEQ ID NO:14
HMG1Δ05y HMG1(1380-1354) HMG1(1573-1599) pYHMG05X #7 Nucleotide Positions 1381-1572 SEQ ID NO:15
HMG1Δ06y HMG1(558-532) HMG1(1267-1293) pYHMG06X #2-#6 Nucleotide Positions 559-1266 SEQ ID NO:16
HMG1Δ07y HMG1(558-532) HMG1(1573-1599) pYHMG07X #2-#7 Nucleotide Positions 559-1572 SEQ ID NO:17
HMG1Δ08y HMG1(27-1) HMG1(1573-1599) pYHMG08X #1-#7 Nucleotide Positions 27-1572 SEQ ID NO:18
HMG1Δ10y HMG1(27-1) HMG1(1816-1842) pYHMG10X #1-#7(-605aa) Nucleotide Positions 27-1815 SEQ ID NO:19
HMG1Δ11y HMG1(27-1) HMG1(1891-1917) pYHMG11X #1-#7(-631aa) Nucleotide Positions 27-1890 SEQ ID NO:20
HMG1Δ12y HMG1(27-1) HMG1(1990-2016) pYHMG12X #1-#7(-663aa) Nucleotide Positions 27-1989 SEQ ID NO:21
HMG1Δ13y HMG1(27-1) HMG1(2218-2244) pYHMG13X #1-#7(-739aa) Nucleotide Positions 27-2217 SEQ ID NO:22
引物序列
HMG1(27-1) 5′TTT CAG TCC CTT GAA TAG CGG CGG CAT 3′ SEQ ID NO:77
HMG1(558-532) 5′GTC TGC TTG GGT TAC ATT TTC TGA AAA 3′ SEQ ID NO:61
HMG1(799-825) 5′CAC AAA ATC AAG ATT GCC CAG TAT GCC 3′ SEQ ID NO:78
HMG1(1191-1165) 5′AGA AGA TAC GGA TTT CTT TTC TGC TTT 3′ SEQ ID NO:79
HMG1(1267-1293) 5′AAC TTT GGT GCA AAT TGG GTC AAT GAT 3′ SEQ ID NO:80
HMG1(1380-1354) 5′TTG CTC TTT AAA GTT TTC AGA GGC ATT 3′ SEQ ID NO:81
HMG1(1573-1599) 5′CAT ACC AGT TAT ACT GCA GAC CAA TTG 3′ SEQ ID NO:62
HMG1(1816-1842) 5′GCA TTA TTA AGT AGT GGA AAT ACA AAA 3′ SEQ ID NO:82
HMG1(1891-1917) 5′CCT TTG TAC GCT TTG GAG AAA AAA TTA 3′ SEQ ID NO:83
HMG1(1990-2016) 5′TCT GAT CGT TTA CCA TAT AAA AAT TAT 3′ SEQ ID NO:84
HMG1(2218-2244) 5′AAG GAT GGT ATG ACA AGA GGC CCA GTA 3′ SEQ ID NO:85
此外,本发明中,通过将一个异戊烯基二磷酸Δ-异构酶(IPPΔ-异构酶)基因和上述异戊二烯基二磷酸合酶基因或其突变体一起导入宿主中而制备异戊二烯醇,尤其是GGOH,也是可能的。IPPΔ-异构酶基因的特定实例包括大肠杆菌来源的idi(SEQ ID NO:32)。异戊二烯基二磷酸合酶基因的特定实例包括大肠杆菌来源的ispAm突变基因(Y79M:SEQ ID NO:37;Y79E;SEQ ID NO:35;Y79D:SEQ ID NO:33)和嗜热脂肪芽孢杆菌来源的fpsm(Y81M:SEQ ID NO:39)。从ispA来源的突变基因编码突变酶,其中位于野生型FPP合酶(SEQ IDNO:4)79位上的氨基酸残基Tyr经替代突变被改变为Asp(SEQ IDNO:34)、Glu(SEQ ID NO:36)或Met(SEQ ID NO:38)。
本发明用到的DNA只要可在宿主细胞中遗传保留,就不用特定限制。可采用DNA的特定实例包括:质粒DNA、噬菌体、逆转录转座子DNA、酵母人工染色体DNA(YAC:酵母人工染色体)等。至于用于基因组整合的DNA片段,这些片段不需要复制能力。这样就可采用PCR或化学合成的方法制备DNA片段。
可用质粒DNA的特定实例包括YCp型大肠杆菌-酵母穿梭载体,如:pRS413、pRS414、pRS415、pRS416、YCp50、pAUR112或pAUR123;YEp型大肠杆菌-酵母穿梭载体,如:pYBS2或YEp13;YIp型大肠杆菌-酵母穿梭载体,如:pRS403,pRS404,pRS405,pRS406,pAUR101或pAUR135;大肠杆菌衍生质粒(例如:ColE质粒,如:pBR322,pBR325,pUC18,pUC19,pUC118,pUC119,pTV118N,pTV119N,pBluescript,pHSG298,pHSG396或pTrc99A;p15A质粒,如:pACYC177或pACYC184;和pSC101质粒,如:pMW118,pMW119,pMW218或pMW219)和枯草芽孢杆菌衍生质粒(例如:pUB110,pTP5)。可用的噬菌体DNA的特定实例包括λ噬菌体(Charon4A,Charon21A,EMBL3,EMBL4,λgt10,λgt11,λZAP),φX174,M13mp18和M13mp19。可用的逆转录转座子的特定实例包括Ty因子。YAC载体的特定实例包括pYACC2。
当将各种重组DNA导入宿主时,很多情况下会采用选择性标记基因。不过,如果有一种合适的分析方法,采用标记基因并不是必要的。
组成型(永久表达型)启动子或诱导型启动子都可作为转录启动子。术语“组成型启动子”意指一个涉及主要代谢途径的基因的转录启动子。这样的启动子在很多生长条件下都具有转录能力。“诱导型启动子”意指仅在特定生长条件下才具有转录活性的启动子,且其转录活性在其他生长条件下受到抑制。
只要在诸如酵母的宿主内具有转录活性,任何转录启动子都可被采用。例如:GAL1启动子、GAL10启动子、TDH3(GAP)启动子、ADH1启动子、TEF2启动子或相似的启动子都可用于酵母内的表达。用于大肠杆菌表达的启动子,可采用trp启动子、lac启动子、trc启动子、tac启动子等。
此外,如需要,重组DNA可包含顺式元件,如增强子,剪切信号,聚腺苷酸附加信号,选择性标记等。可用的选择性标记的特定实例包括标记基因,如:指示物是非营养缺陷型表型的URA3,LEU2,TRP1和HIS3,和抗生素抗性基因,如:Ampr,Tetr,Cmr,Kmr和AUR1-C。
只要在诸如酵母的宿主内具有活性,就可采用来源于任何基因的转录终止子。对于酵母中表达,可采用ADH1终止子,CYC1终止子等终止子。对于大肠杆菌中表达,可采用rrnB终止子。为在细菌细胞中表达出感兴趣的基因,也可将一个SD序列(代表性的是:5’-AGGAGG-3’)作为核糖体结合位点整合到基因起始密码子的上游,用于有效翻译。
可通过在所用质粒名称之后指出相关基因名称来命名和识别本发明制备的表达载体,这些表达载体是作为用于基因转移的重组DNA。表2显示的是采用pRS435GAP质粒时,表达载体的名称和组成之间的关系。采用pRS434、pRS444、pRS445质粒和上述启动子组合时,也可以相同于pRS435GAP的方式描述所述关系。
表2
表达载体命名 | 组成 |
PRS435GG | 连接了GGPP合酶基因BTS1的pRS435GAP质粒 |
pRS435F | 连接了FPP合酶基因ERG20的pRS435GAP质粒 |
pRS435GGF | 连接了一个融合基因的pRS435GAP质粒,该融合基因按顺序连接GGPP合酶基因BTS1和FPP合酶基因ERG20 |
pRS435FGG | 连接了一个融合基因的pRS435GAP质粒,该融合基因按顺序连接FPP合酶基因ERG20和GGPP合酶基因BTS1 |
pRS435GGHDEL | 连接了一个编码HDEL序列的核苷酸序列的pRS435GG质粒 |
pRS435FHDEL | 连接了一个编码HDEL序列的核苷酸序列的pRS435F质粒 |
pRS435FGGHDEL | 连接了一个编码HDEL序列的核苷酸序列的pRS435FGG质粒 |
pRS435GGFHDEL | 连接了一个编码HDEL序列的核苷酸序列的pRS435GGF质粒 |
当HMG1基因被连接到pRS434GAP质粒时,得到的载体表达为“pRS434GAP-HMG1”。表3显示的是采用pRS434GAP质粒时,表达载体的名称和其组成之间的关系。当采用pRS435GAP、pRS445GAP或类似的质粒时,名称与组成之间的关系也可以相同方式描述。
表3
表达载体名称 | 组成 |
pRS434GAP-HMG1 连接了HMG辅酶A还原酶基因HMG1的pRS434GAP质粒 | |
pRS434GAP-HMG1Δ 连接了HMG辅酶A还原酶基因HMG1的缺失突变基因HMG1Δ的pRS434GAP质粒 |
2.重组体的制备
通过将本发明的各种重组DNA以特定方式导入宿主,可以获得本发明的重组体,该方式即:能够表达不同异戊二烯基二磷酸合酶基因或其融合基因、和/或HMG辅酶A还原酶基因(包括这些基因的突变体;无另外说明,也适用于本发明的其余部分)、或IPPΔ异构酶基因。本发明采用的宿主类型无需特别限定。只要是可产生异戊二烯醇的宿主,就可采用。优选的宿主是酵母或大肠杆菌。
本发明中,将包含一个转录启动子、一个转录终止子、一个异戊二烯基二磷酸合酶基因、HMG辅酶A还原酶基因、IPPΔ异构酶基因或上文中(e)-(j)所列基因之一的重组DNA导入包括单细胞真核生物如酵母在内的真菌、原核生物、动物细胞、植物细胞等;从而获得重组体。
本发明可采用的真菌包括:粘菌、藻状菌、子囊菌、担子菌和半知菌。在真菌当中,一些单细胞真核生物如酵母在工业应用上扮演重要角色。可以列举出属于子囊菌、担子菌、或半知菌的酵母。本发明可采用酵母的特定实例包括属于子囊菌的酵母,尤其是芽殖酵母,如啤酒糖酵母(称作Baker’s酵母)、产朊假丝酵母或毕赤巴斯德酵母;裂殖酵母,如粟酒裂殖酵母。本发明可采用的酵母菌株只要可产生异戊二烯醇就无需特别限定。在采用啤酒糖酵母的情况下,可采用菌株的特定实例包括如下所示的A451、EUG5、EUG8、EUG12、EUG27、YPH499、YPH500、W303-1A、W303-1B、ATCC28382、AURGG101、AURGG102、AH1和YH1。就将重组DNA导入酵母中的方法而言,可应用电穿孔方法、原生质球法、或乙酸锂方法。
A451(ATCC200589,MATa can1 leu2 trp1 ura3 aro7)
YPH499(ATCC76625,MATa ura3-52 lys2-801 ade2-101 trp1-Δ63 his3-Δ200leu2-Δ1,Stratagene,La Jolla,CA)
YPH500(ATCC76626,MATa ura3-52 lys2-801 ade2-101 trp1-Δ63 his3-Δ200leu2-Δ1,Stratagene)
W303-1A(MATa leu2-3 leu2-112 his3-11 ade2-1 ura3-1 trp1-1 can1-100)
W303-1B(MATa leu2-3 leu2-112 his3-11 ade2-1 ura3-1 trp1-1 can1-100)
AURGG101(A451,aur1::AUR1-C):本发明构建的A451衍生菌株;整合有AUR1-C基因。
AURGG102(A451,aur1::GAL1p-BTS1&AUR1-C):本发明构建的A451衍生菌株;含有GAL1启动子,BTS1和CYC1终止子及AUR1位点上的AUR1-C基因。
EUG5,EUG8(A451,ERG9p::URA3-GAL1p):本发明构建的A451衍生菌株;含有鲨烯合酶基因ERG9,转化株选择标记基因URA3和转录启动子GAL1p。
EUG12(YPH499,ERG9p::URA3-GAL1p):本发明构建的YPH499衍生菌株;含有ERG9,URA3和GAL1p。
EUG27(YPH500,ERG9p::URA3-GAL1p):本发明构建的YPH500衍生菌株;含有ERG9,URA3和GAL1p。
AH1菌株(pRS434GAP-HMG1/A451):本发明构建的A451衍生菌株;A451菌株中导入有pRS434GAP-HMG1。
YH1菌株(pRS434GAP-HMG1/YPH499):本发明构建的YPH499衍生菌株;YPH499菌株中导入有pRS434GAP-HMG1。
本发明可采用的原核生物包括古生菌和细菌。就古生菌而言,可列举出:产甲烷微生物,如甲烷杆菌;嗜盐微生物,如嗜盐杆菌;和嗜温嗜酸微生物,如硫化叶菌。就细菌而言,可列举出多种在工业或科学研究领域具有非常高应用价值的革兰氏阴性或革兰氏阳性细菌,如:埃希氏菌,如大肠杆菌;杆菌,如枯草芽孢杆菌或短芽孢杆菌;假单胞菌,如恶臭假单胞菌;土壤杆菌,如根癌土壤杆菌或发根土壤杆菌;棒状杆菌,如谷氨酸棒杆菌;乳酸菌,如胚芽乳杆菌和放射菌,如放线菌或链霉菌。
当一种细菌,如大肠杆菌被用做宿主时,本发明优选的重组DNA可在宿主内自主复制,且由一个转录启动子,一个如核糖体RNA结合位点的SD序列,和本发明的基因组成。该重组DNA中也可正确插入一个转录终止子,该DNA还可包含控制启动子的基因。本发明采用的大肠杆菌菌株的特定实例包括:BL21,DH5α,HB101,JM101,JM109,MV1184,TH2,XL1-Blue和Y-1088,但不仅限于这些菌株。就转录启动子而言,只要可在宿主,如大肠杆菌中,指导本发明基因的表达,就可以采用。例如,可采用一种大肠杆菌或噬菌体来源的启动子,如trp启动子,lac启动子,PL启动子或PR启动子。也可采用人工改变过结构的启动子。就导入重组载体到细菌的方法而言,可采用将DNA转移入细菌的方法。如,可采用利用钙离子、电穿孔等方法。
可运用PCR(聚合酶链反应)法或DNA印迹杂交法验证本发明的基因是否被导入宿主细胞内。例如,从得到的重组体制备DNA,采用一对特异于转移动DNA的引物,进行PCR。然后,将扩增产物经由琼脂糖凝胶电泳,聚丙烯酰胺凝胶电泳或毛细管电泳纯化,继而由溴化乙啶、SYBR Green溶液等进行染色,或用紫外检测仪进行DNA的检测。通过检测扩增产物是一条谱带或谱峰,就可验证转移的DNA。作为选择,可通过带荧光染料或类似物的标记引物进行PCR操作,由此检测扩增产物。
3.异戊二烯醇的制备
本发明通过培养上述重组体,该重组体含有一个异戊二烯基二磷酸合酶基因或其突变体(包括融合基因)、和/或一个HMG辅酶A还原酶基因或其突变体、或一个选自上文(e)-(j)的甲羟戊酸途径相关酶基因;及从得到的培养物中进行回收,从而可获得异戊二烯醇。此处所用术语“培养物”指下列任一物质:培养上清液、培养细胞或微生物本身、或来自培养细胞或微生物中的破裂产物。本发明重组体的培养采用培养其宿主的传统方法。GGOH可作为异戊二烯醇的一个特定实例。异戊二烯醇可单独或以混合物形式在培养物中积累。
就培养得自微生物宿主的重组体的培养基而言,无论天然或合成的培养基,只要含有可被微生物同化的碳源、氮源和无机盐,并可有效培养重组体,都可被采用。就碳源而言,可列举出:碳水化合物,如葡萄糖、半乳糖、果糖、蔗糖、棉子糖、淀粉;有机酸,如乙酸、丙酸;醇类,如乙醇和丙醇。就氮源而言,可列举:氨水;无机或有机酸的铵盐,如氯化铵、硫酸铵、乙酸铵、磷酸铵;其它含氮化合物;蛋白胨;肉提取物;玉米浸渍液,多种氨基酸,等。就无机物而言,可列举出:磷酸二氢钾、磷酸氢二钾、磷酸镁、硫酸镁、氯化钠、硫酸亚铁、硫酸锰、硫酸铜、碳酸钙,等等。通常,重组体在通氧、26-42℃的条件下进行培养。但宿主为啤酒糖酵母时,优选的是在30℃培养重组体2-7天;当宿主为大肠杆菌时,在37℃培养重组体12-18小时。pH的调节是利用无机或有机酸、碱溶液等物质来进行的。
在培养一个整合了含有可诱导的转录启动子的表达载体的重组体时,可在需要时向培养基中添加一种诱导剂。例如,在载体中用到GAL1启动子时,可采用半乳糖作为碳源。在培养用含有一个可用异丙基-β-D-硫代半乳糖苷(IPTG)诱导的启动子的表达载体转化的微生物时,可在培养基中添加IPTG。
在上述条件下进行培养,宿主可高产量地产出异戊二烯醇。为大量制备异戊二烯醇,可应用一种发酵罐培养设备。尤其是当宿主为YPH499啤酒糖酵母,且导入的质粒DNA为pRS435GGF或pRS434GAP-HMG1时,重组体可产出每升培养基1.5mg甚至更多的异戊二烯醇;根据培养条件,重组体可产出128mg甚至更多的异戊二烯醇。
本发明中,通过添加诸如萜类化合物、油类、或表面活性剂之类的物质,有可能提高异戊二烯醇的产出率。这些添加物的特定实例包括如下物质:
萜类化合物:鲨烯、生育酚、IPP、DMAPP
油类:大豆油、鱼油、杏仁油、橄榄油
表面活性剂:Tergitol,Triton X-305,Span85,ADEKANOL LG109(Asahi Denka),ADEKANOL LG294(Asahi Denka),ADEKANOL LG295S(Asahi Denka),ADEKANOL LG297(Asahi Denka),ADEKANOL B-3009A(Asahi Denka),ADEKA PLURONIC L-61(Asahi Denka)
油类浓度为0.01%或大于0.01%,优选为1-3%。表面活性剂的浓度为0.005-1%,优选为0.05-0.5%。萜类化合物浓度为0.01%或大于0.01%,优选为1-3%。
此外,本发明中,采用含有下列化合物(i)-(vi)中任意一种的培养基,并从得到的培养物中回收异戊二烯醇,也可培养具有产出异戊二烯醇能力的微生物。进一步采用含下述化合物(i)-(v)中任意一种的补料液可以进行补料-批式培养。
(i)糖
(ii)醇
(iii)氨气、氨水和/或铵盐
(iv)氢氧化钠和硫酸的混合物
(v)磷酸二氢钾、硫酸镁、硫酸铵、玉米漫渍液、氯化钙和一种表面活性剂的混合物
(vi)两种或两种以上上述化合物(i)-(v)的混合物
上述的糖的特定实例包括:葡萄糖、蔗糖、半乳糖和乳糖。上述醇组合的特定实例包括:甲醇、乙醇、丙醇、异丙醇和丁醇。
就补料液中的碳源组分而言,优选的是葡萄糖和乙醇的组合,更为优选的是在培养基中添加用于控制pH的氨气或一种铵盐(例如:乙酸铵)。就补料方法而言,优选的是在培养开始到开始后12-24小时采用碳源仅为葡萄糖的补料液,然后就更换为乙醇作为另一种碳源组分的补料液。作为选择,葡萄糖可以是贯穿整个培养过程的单一碳源物质。乙醇在补料液中与总碳源中可以是任何比例。可以是50%或更多,甚至100%。
无需在培养基中补充特定营养物质就能增殖的菌株被称为原养型。通常,原养型是在营养需求上同相应的野生型菌株具有相同表型的菌株。另一方面,为构建重组体,营养缺陷型(营养缺陷型的突变菌株)经常被用做宿主菌株。可通过补充营养缺陷突变,将该营养缺陷型的表型改造成与相应原养型相同的表型。简言之,就是将对应于导致营养缺陷型突变的突变基因的一种野生型基因转移至营养缺陷体。当与导致营养缺陷型突变的突变基因相比,野生型基因更占优势时,也可采用具有野生型基因的菌株,通过交配或接合营养缺陷型来补充突变。本发明中,例如,通过将一些在产GGOH克隆(YH1菌株含一个由GGPP合酶基因与FPP合酶基因组成的融合基因)的基因组引起营养需求的突变基因替换为相应的野生型基因,然后用一种与保留的突变基因相比,具有优势的野生型基因的YPH500衍生克隆与上述得到的克隆杂交,可获得一种原养型。本发明中,优选的是采用是二倍体细胞,同时也是原养型的微生物。
培养后,如果微生物或细胞内产生醇,可采用诸如匀浆器处理破坏微生物或细胞,从而回收异戊二烯醇。作为选择,采用有机溶剂可无需破坏细胞直接提取出醇类物质。如果本发明的异戊二烯醇在微生物或细胞外产生,培养液将直接利用,或通过离心等方法除去微生物或细胞。因此,可通过例如有机溶剂抽提法,从培养物中提取出目的异戊二烯醇。如有必要,可通过多种类型的色谱法或类似方法提纯和分离异戊二烯醇。
本发明中,宿主菌株和载体的优选组合,及这些组合与异戊二烯醇产量之间的关系在下表4中阐明。
表4
启动子 | 基因 | 宿主 | 添加试剂 | GGOH产量(mg/l)1 | GGOH产量(mg/l)2 | GGOH产量(mg/l)3 | |
宿主 | ---- | ---- | Sc A451AURGG101Sc YPH499Ec JM109 | ---- | (0.00-0.02)(0.00-0.02)(0.00)(0.00) | ||
HMG1 | GAPADH1GAL1GAL1GAPGAPGAP | HMG1HMG1HMG1HMG1HMG1HMG1HMG1 | Sc A451Sc A451Sc A451Sc AURGG101Sc EUG8Sc EUG12Sc EUG27 | ------- | 0.070.050.050.050.050.050.05 | 0.07-0.350.05-0.140.05-0.10.05-2.20.05-0.160.05-1.030.05-0.63 | ------- |
HMG1Δ | GAL1 HMG1Δ044 Sc A451 - 0.05 - - |
GAL1 HMG1Δ056 Sc A451 - 0.05 0.05-0.07 -GAL1 HMG1Δ062 Sc A451 - 0.05 0.05-0.07 -GAL1 HMG1Δ078 Sc A451 - 0.05 - -GAL1 HMG1Δ081 Sc A451 - 0.05 - -GAL1 HMG1Δ112 Sc A451 - 0.05 0.05-0.06 -GAL1 HMG1Δ122 Sc A451 - 0.05 0.05-0.06 -GAL1 HMG1Δ044 Sc AURGG101 - 0.05 0.05-2.2 2.2-7.9GAL1 HMG1Δ062 Sc AURGG101 - 0.05 0.05-0.06 -GAL1 HMG1Δ075 Sc AURGG101 - 0.05 0.05-0.06 -GAL1 HMG1Δ081 Sc AURGG101 - 0.05 - -GAP HMGΔ044 Sc EUG5 - 0.05 0.05-0.09 -GAP HMGΔ056 Sc EUG5 - 0.05 0.05-0.11 -GAP HMGΔ062 Sc EUG5 - 0.05 0.05-0.13 -GAP HMGΔ076 Sc EUG5 - 0.05 0.05-0.15 -GAP HMGΔ081 Sc EUG5 - 0.05 0.05-0.14 -GAP HMGΔ100 Sc EUG5 - 0.05 0.05-0.18 -GAP HMGΔ112 Sc EUG5 - 0.05 0.05-0.34 -GAP HMGΔ122 Sc EUG5 - 0.05 0.05-0.13 -GAP HMGΔ133 Sc EUG5 - 0.05 0.05-0.71 -GAP HM6Δ026 Sc EUG12 - 0.05 0.05-0.63 -GAP HMGΔ044 Sc EUG12 - 0.05 0.05-0.44 -GAP HMGΔ056 Sc EUG12 - 0.05 0.05-0.40 -GAP HMGΔ062 Sc EUG12 - 0.05 0.05-0.45 -GAP HMGΔ076 Sc EUG12 - 0.05 0.05-0.55 -GAP HMGΔ081 Sc EUG12 - 0.05 0.05-0.49 -GAP HMGΔ100 Sc EUG12 - 0.05 0.05-0.44 -GAP HMGΔ112 Sc EUG12 - 0.05 0.05-0.53 -GAP HMGΔ122 Sc EUG12 - 0.05 0.05-0.50 -GAP HMGΔ133 Sc EUG12 - 0.05 0.05-0.44 - |
HMG1+HMG1Δ | GAL1,GAP HMG1Δ044,HMG1 Sc AURGG101 - 0.27 0.27-0.93 - |
FPS基因 | GAP ERG20 Sc A451 - 0.05 0.05-0.07 -lac fpsm Ec JM109 IPP&DMAPP 6.9 6.9-16 -tac ispAm(Y79D) Ec JM109 IPP&DMAPP 0.06 0.06-0.12 -tac ispAm(Y79E) Ec JM109 IPP&DMAPP 0.14 0.14-0.26 -tac ispAm Ec JM109 IPP&DMAPP 6.0 6.0-22 - |
BTS1(YIp) | GAL1 整合了BTS1 Sc A451(AURGG102) - 0.05 0.05-0.07 -GAL1 整合了BTS1 Sc YPH499(AURGG703) - 0.05 0.05-0.07 - |
BTS1(YEp) | GAP BTS1 Sc A451 - 0.10 0.10-0.58 -GAP BTS1 Sc YPH499 - 0.05 0.05-0.15 -GAP BTS1 Sc EUG8 - 0.05 0.05-1.4 -GAP BTS1 Sc EUG12 - 0.05 0.05-1.6 -GAP BTS1 Sc EUG27 - 0.05 0.05-1.5 - |
HMG1Δ+FPS基因 | GAL1,GAP HMG1Δ044,ERG20 Sc AURGG101 - 0.38 0.38-11.3 -GAL1,GAP HMG1Δ044,ispA Sc AURGG101 - 0.11 0.11-1.64 - |
HMG1+BTS1(YEp) | GAP,GAP HMG1,BTS1 Sc YPH499 - 0.14 0.14-0.58 -TEF,GAP HMG1,BTS1 Sc YPH499 - 0.13 0.13-0.63 - |
HMG1+BTS1(YIp) | GAP,GAL1 HMG1,BTS1 Sc AURGG102 - 0.05 0.05-1.3 -GAP,GAL1 HMG1,BTS1 Sc AURGG703 - 0.05 0.05-0.46 - |
HMG1Δ+BTS1(YEp) | GAL1,GAP HMG1Δ044,BTS1 Sc AURGG101 - 0.46 0.46-9.8 - |
GAP,GAL1 HMG1Δ027,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.08 -GAP,GAL1 HMG1Δ044,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.42 -GAP,GAL1 HMG1Δ045,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.61 -GAP,GAL1 HMG1Δ059,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.10 -GAP,GAL1 HMG1Δ062,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.42 -GAP,GAL1 HMG1Δ063,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.11 -GAP,GAL1 HMG1Δ075,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.40 -GAP,GAL1 HMG1Δ083,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.21 -GAP,GAL1 HMG1Δ094,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.14 -GAP,GAL1 HMG1Δ106,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.12 -GAP,GAL1 HMG1Δ122,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.16 -GAP,GAL1 HMG1Δ123,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.09 -GAP,GAL1 HMG1Δ134,整合了BTS1 Sc AURGG102 - 0.05 0.05-0.11 -GAP,GAL1 HMG1Δ044,整合了BTS1 Sc AURGG703 - 0.12 0.12-0.20 -GAP,GAL1 HMG1Δ062,整合了BTS1 Sc AURGG703 - 0.16 0.16-0.24 - |
HMG1Δ+MVN基因 | GAL1,GAP HMG1Δ044,HMGS Sc AURGG101 - 0.20 0.20-1.3 -GAL1,GAP HMG1Δ044,ERG12 Sc AURGG101 - 0.21 0.21-1.0 -GAL1,GAP HMG1Δ044,ERG8 Sc AURGG101 - 0.12 0.12-2.7 -GAL1,GAP HMG1Δ044,ERG10 Sc AURGG101 - 0.17 0.17-1.22 -GAL1,GAP HMG1Δ044,ERG19 Sc AURGG101 - 0.05 0.05-1.89 -GAL1,GAP HMG1Δ044,ID11 Sc AURGG101 - 0.05 0.05-0.79 -GAL1,GAP HMG1Δ044,idi Sc AURGG101 - 0.43 0.43-1.2 - |
FPS基因+idi | tac,idi ispAm,idi Ec JM109 - 0.07 - - |
FGG融合体 | GAP ERG20-BTS1 Sc YPH499 - 0.06 0.06-0.27 -GAP ERG20-BTS1-HDEL Sc YPH499 - 0.05 0.05-0.12 - |
GGF融合体 | GAP BTS1-ERG20 Sc A451 - 0.05 0.05-0.35 0.46-0.98GAP BTS1-ERG20 Sc YPH499 - 0.17 0.17-0.46 1.3-2.5GAP BTS1-ERG20 Sc EUG5 - 0.05 0.05-2.9 5.1-6.6GAP BTS1-ERG20 Sc EUG12 - 0.07 0.07-5.4 2.0-3.8GAP BTS1-ERG20-HDEL Sc A451 - 0.05 0.05-0.07 0.07-0.56GAP BTS1-ERG20-HDEL Sc YPH499 - 0.44 0.44-0.80 1.6-1.9GAP BTS1-ERG20-HDEL Sc EUG5 - 0.21 0.21-0.35 5.5-6.5GAP BTS1-ERG20-HDEL Sc EUG12 - 1.3 1.3-5.6 - |
HMG1+BTSHDEL | GAP,GAP HMG1,BTS1-HDEL Sc YPH499 - 0.14 0.14-0.23 - |
HMG1+FGG融合体 | GAP,GAP HMG1,ERG20-BTS Sc YPH499 - 0.20 0.20-0.46 -GAP,GAP HMG1,ERG20-BTS1-HDEL Sc YPH499 - 0.11 0.11-0.29 - |
HMG1+GGF融合体 | GAP,GAP HMG1,BTS1-ERG20 Sc A451 - 0.05 0.05-4.1 -GAP,GAP HMG1,BTS1-ERG20 Sc YPH499 - 0.46 0.46-2.1 2.1-128GAP,GAP HMG1,BTS1-ERG20-HDEL Sc A451 - 0.05 0.05-5.1 -GAP,GAP HMG1,BTS1-ERG20-HDEL Sc YPH499 - 1.0 1.0-1.9 2.2-5.6 |
在“GGOH产量”一栏中,标“1”的一栏显示的是下限,标“2”的一栏显示的是优选范围,标“3”的一栏显示的是更为优选的范围
“宿主”一栏中,“Sc”表示啤酒糖酵母,“Ec”表示大肠杆菌
“fps”表示嗜热脂肪芽胞杆菌FPS基因
“fpsm”表示嗜热脂肪芽胞杆菌FPSm(Y81M)基因
“idi”表示大肠杆菌IPP异构酸基因。/质粒为p3-47-13
“ispAm”表示大肠杆菌ispAm(Y79M)基因。/质粒为pALispA16m
(i)当质粒pRS445GG被导入宿主A451或YPH499菌株时,GGOH产量增加(大约平均为0.4mg/L),该质粒是通过将GGPP合酶基因BTS1整合到pRS445GAP而得到的。
(ii)当通过将一个由GGPP合酶基因BTS1和FPP合酶基因ERG20组成的融合基因整合到质粒pRS435GAP或pRS445GAP而制备质粒pRS435FGG、pRS445FGG、pRS435GGF、pRS445GGF并将其导入宿主A451或YPH499菌株时;或当制备含有编码连接到上述融合基因一个末端的HDEL序列的核苷酸序列的质粒(即含有一个修饰基因的质粒,它使得HDEL序列被添加到通过融合基因表达产生的多肽的C-端)pRS435FGGHDEL、pRS445FGGHDEL或pRS435GGHDEL并将其导入A451或YPH499菌株时,具有ERG20-BTS1融合基因的菌株,能平均产出0.2mg/L的GGOH;具有BTS1-ERG20融合基因的菌株,能平均产出0.39mg/L的GGOH;具有BTS1-ERG20-HDEL融合基因的菌株,能平均产出0.62mg/L的GGOH。
(iii)当将质粒pRS434GAP-HMG1(整合了HMG辅酶A还原酶基因HMG1的pRS434GAP)和含有上述融合基因的质粒pRS435GGF导入宿主YPH499,且共表达时,该重组体平均产出1.55mg/L GGOH。当在YMO培养基(补充有大豆油等物质的YM培养基)中30℃培养该重组体7天,可产出5.61mg/L的GGOH。
(iv)当pRS435GGFHDEL和pRS434GAP-HMG1都被导入宿主YPH499中,并共表达,该重组体平均产出1.50mg/L的GGOH。当在YMO培养基中30℃培养该重组体7天,可产出5.64mg/L的GGOH。
(v)当pRS435GGF和pRS434GAP-HMG1都被导入宿主YPH499中,并在发酵罐中培养得到的重组体109小时,可产出128mg/L的GGOH。
(vi)当HMG辅酶A还原酶基因和GGF融合基因被共表达时,大多数重组克隆产出100mg/L或更多的GGOH,最多可产出189mg/L的GGOH。
(vii)当补料-分批培养pRS435GGF/YH3克隆时,该克隆是通过将一种共表达HMG辅酶A还原酶基因和GGF融合基因的克隆转化到一种原养型微生物中,并随后二倍化而获得的,培养开始20小时后采用500g/L的葡萄糖溶液作为补料液,该克隆产生0.47g/L的GGOH;采用400g/L乙醇溶液作为补料液时,该克隆产生1.16g/L的GGOH。
(viii)当补料-分批培养pRS435GGF/YH3克隆时,下列条件下,该克隆能够产生2.5g/L的GGOH:培养开始21小时后乙醇占补料液中总碳源比例为71%,且补料液中添加有乙酸铵。
附图简述
图1为一幅显示甲羟戊酸途径相关酶代谢途径的图表。
图2A为一个显示一种替代突变模式的表框。
图2B为一幅缺失型HMG1基因的构建示意图。
图3为一幅显示质粒pRS414的示意图。
图4为一幅显示质粒pYES2的示意图。
图5显示ADH1启动子和终止子的序列。
图6A为一幅显示质粒pRS414PTadh的示意图。
图6B为一幅显示质粒pRS414TPadh的示意图。
图7A为一幅显示质粒pRS434GAP的示意图。
图7B为一幅显示质粒pRS434TEF的示意图。
图7C为一幅显示质粒pRS435GAP的示意图。
图7D为一幅显示质粒pRS444GAP的示意图。
图7E为一幅显示质粒pRS444TEF的示意图。
图7F为一幅显示质粒pRS445GAP的示意图。
图8为一幅显示每种插入pT7载体的甲羟戊酸途径相关酶的方向示意图。
图9为一幅质粒Palhmg106的物理图谱。
图10为一幅显示在ORF182片段上的限制性酶识别位点的示意图。
图11为一幅显示用于嗜热脂肪芽孢杆菌FPP合酶突变基因(Y81M)的表达载体的示意图。
图12呈出显示DNA印迹杂交结果的照片。
图13呈出显示PCR图谱结果的照片。
图14呈出显示RNA印迹结果的照片。
图15A呈出显示异戊二烯基二磷酸合酶在粗酶溶液中的比活性曲线图。
图15B呈出显示异戊二烯基二磷酸合酶在粗酶溶液中的比活性曲线图。
图16为一幅显示通过将连接有一个组成型启动子的HMG1基因导入A451,从而获得的重组体的GGOH产量的曲线图。
图17为一幅显示以A451或AURGG101(均保留有含GAL1p-HMG1的YEp表达载体)为宿主获得的每种重组体的GGOH产量的曲线图。
图18为一幅显示通过将质粒pYES2-HMG导入AURGG102或AURGG703,从而获得的每种重组体的GGOH产量的曲线图。
图19为一幅显示当将一个缺失型HMG1’基因插入含GAL1p的pYES2载体时,每种重组体的GGOH产量的曲线图。
图20为一幅显示当将一个缺失型HMG1’基因插入含GAL1p的pYES2载体时,每种重组体的GGOH产量的曲线图。
图21为一幅显示当将一个缺失型HMG1’基因插入含GAL1p的pYES2载体时,每种重组体的GGOH产量的曲线图。
图22为一幅显示当将一个缺失型HMG1’基因插入含GAL1p的pYES2载体时,每种重组体的GGOH产量的曲线图。
图23为一幅显示在含有IPP和DMAPP的培养基中培养保留有p4M、p16M等的重组大肠杆菌时的GGOH产量的曲线图。
图24为一幅显示用于构建BTS1-ERG20融合基因的引物,及这些引物的位点和方向的示意图。
图25为一幅显示将ERG20-BTS1融合基因导入A451衍生的克隆时GGOH产量的检测结果曲线图。
图26为一幅显示将ERG20-BTS1融合基因导入YPH499衍生的克隆时GGOH产量的检测结果曲线图。
图27为一幅显示经TEF2p-HMG1转化的YPH499衍生的克隆中GGOH产量的检测结果曲线图。
图28为一幅显示经TDH3p-HMG1转化的YPH499衍生的克隆中GGOH产量的检测结果曲线图。
图29A为一幅显示在A451衍生的克隆中GGOH产量的检测结果曲线图。
图29B为一幅显示在A451衍生的克隆中GGOH产量的检测结果曲线图。
图30A为一幅显示在YPH499衍生的克隆中GGOH产量的检测结果曲线图。
图30B为一幅显示在YPH499衍生的克隆中GGOH产量的检测结果曲线图。
图31A为一幅显示经pRS435GGF或pRS435GGFHDEL转化的A451菌株在有指出的糖组合物的条件下培养2天的GGOH产量的曲线图。
图31B为一幅显示经pRS435GGF或pRS435GGFHDEL转化的A451菌株在有指出的糖组合物的条件下培养4天的GGOH产量的曲线图。
图31C为一幅显示经pRS435GGF或pRS435GGFHDEL转化的A451菌株在有指出的糖组合物的条件下培养7天的GGOH产量的曲线图。
图32A为一幅显示经pRS435GGF或pRS435GGFHDEL转化的AH1菌株在有指出的糖组合物的条件下培养2天的GGOH产率曲线图。
图32B为一幅显示经pRS435GGF或pRS435GGFHDEL转化的AH1菌株在有指出的糖组合物的条件下培养4天的GGOH产量的曲线图。
图32C为一幅显示经pRS435GGF或pRS435GGFHDEL转化的AH1菌株在有指出的糖组合物的条件下培养7天的GGOH产量的曲线图。
图33A为一幅显示经pRS435GGF或pRS435GGFHDEL转化的EUG5菌株在有指出的糖组合物的条件下培养2天的GGOH产量的曲线图。
图33B为一幅显示经pRS435GGF或pRS435GGFHDEL转化的EUG5菌株在有指出的糖组合物的条件下培养4天的GGOH产量的曲线图。
图33C为一幅显示经pRS435GGF或pRS435GGFHDEL转化的EUG5菌株在有指出的糖组合物的条件下培养7天的GGOH产量的曲线图。
图34A为一幅显示经pRS435GGF或pRS435GGFHDEL转化的YPH499菌株在有指出的糖组合物的条件下培养2天的GGOH产量的曲线图。
图34B为一幅显示经pRS435GGF或pRS435GGFHDEL转化的YPH499菌株在有指出的糖组合物的条件下培养4天的GGOH产量的曲线图。
图34C为一幅显示经pRS435GGF或pRS435GGFHDEL转化的YPH499菌株在有指出的糖组合物的条件下培养7天的GGOH产量的曲线图。
图35A为一幅显示经pRS435GGF或pRS435GGFHDEL转化的YH1菌株在有指出的糖组合物的条件下培养2天的GGOH产量的曲线图。
图35B为一幅显示经pRS435GGF或pRS435GGFHDEL转化的YH1菌株在有指出的糖组合物的条件下培养4天的GGOH产量的曲线图。
图35C为一幅显示经pRS435GGF或pRS435GGFHDEL转化的YH1菌株在有指出的糖组合物的条件下培养7天的GGOH产量的曲线图。
图36A为一幅显示经pRS435GGF或pRS435GGFHDEL转化的EUG12菌株在有指出的糖组合物的条件下培养2天的GGOH产量的曲线图。
图36B为一幅显示经pRS435GGF或pRS435GGFHDEL转化的EUG12菌株在有指出的糖组合物的条件下培养4天的GGOH产量的曲线图。
图36C为一幅显示经pRS435GGF或pRS435GGFHDEL转化的EUG12菌株在有指出的糖组合物的条件下培养7天的GGOH产量的曲线图。
图37为一幅显示在含有大豆油培养基的发酵罐内培养pRS435GGF/YH1(pRS435GAP-HMG1/YPH499)克隆时的异戊二烯醇产量的检测结果曲线图。
图38为一幅显示在含大豆油培养基的发酵罐内培养15-2克隆时的异戊二烯醇产量的检测结果曲线图。
图39呈出显示用于验证融合基因表达的RNA印迹杂交法检测结果的照片。
图40呈出显示蛋白质印迹法检测结果的照片。
图41A呈出显示共表达HMG辅酶A还原酶基因和一个由GGPP合酶基因和FPP合酶基因组成的融合基因的克隆的GGOH产量的检测结果曲线图。
图41B呈出显示在共表达HMG辅酶A还原酶基因和一个由GGPP合酶基因和FPP合酶基因组成的融合基因的克隆的GGOH产量的检测结果曲线图。
图41C呈出显示在双表达HMG辅酶A还原酶基因和一个由GGPP合酶基因和FPP合酶基因组成的融合基因的克隆的GGOH产量的检测结果曲线图。
图42为一幅显示一种补料液的流速条件的曲线图。
发明的优选实施方案
下文中,本发明将引用下列实施例对本发明进行更为明确的描述。不过,本发明的技术范围不仅限于这些实例。
实施例包含下列内容:
(1)表达载体,如pRS435GAP,是采用pRS载体(Stratagene),pYES载体(Invitrogen),啤酒糖酵母YPH499来源的基因组DNA等制备的。
(2)甲羟戊酸途径相关酶基因的克隆
克隆乙酰辅酶A乙酰基转移酶基因、HMG辅酶A还原酶基因或其突变体、甲羟戊酸激酶基因、磷酸甲羟戊酸激酶基因、二磷酸甲羟戊酸脱羧酶基因、异戊烯基二磷酸Δ异构酶基因、法呢基二磷酸合酶基因或其替代突变体、及香叶基香叶基二磷酸合酶基因后,制备它们的表达载体。
(3)将含甲羟戊酸途径相关酶基因的质粒导入A451、YPH499等宿主。同样,通过将A451或YPH499基因组DNA中的ERG9转录启动子替换为pYES2衍生的GAL1转录启动子,构建突变菌株(EUG菌株),并将之作为宿主,用于导入异戊二烯基二磷酸合酶基因。
(4)AURGG101是一种整合有AUR1-C基因的A451衍生菌株;AURGG102是一种含GAL1启动子、BTS1和CYC1终止子,且其AUR1基因座有AUR1-C基因的A451衍生的菌株,构建这两种菌株,作为宿主,用于导入基因。
(5)将异戊二烯基二磷酸合酶基因表达载体导入宿主,然后在YM7培养基(pH用NaOH调至7的YM培养基)、YMO培养基、IPP和含DMAPP培养基等培养基中分别培养该宿主。将每种培养液进行分离和提取,并定量测定其中的异戊二烯醇(特别是GGOH)。为大量获得异戊二烯醇,在发酵罐中对已显示良好结果的重组体进行长期培养。此外,分析了当重组体被更换为原养型微生物,并二倍体化后对GGOH产量的影响。同样,分析了添加进培养基中的化合物(如乙醇或氨气)对GGOH产量的影响。
[实施例1]表达载体的结构
(1)大肠杆菌-啤酒糖酵母穿梭载体
质粒pRS404、pRS404和pRS414(图3)购自Stratagene。质粒pAUR123购自Takara,质粒pYES2(图4)购自Invitrogen(Carlsbad,CA)。
(2)基因组DNA
采用购自Takara的Dr.GenTLETM试剂盒(一种用于酵母的基因组DNA制备试剂盒),并根据试剂盒所附说明从啤酒糖酵母YPH499制备啤酒糖酵母基因组DNA。
按如下步骤从大肠杆菌JM109(Takara)制备大肠杆菌基因组DNA。简言之,在1.5ml 2xYT培养基中培养JM109细胞,并通过离心收获。将567μl TE(pH8.0),3μl 20mg/ml蛋白酶K(BoehringerMannheim,Mannheim,Germany)和30μl 10%SDS加入这些细胞中。将得到的混合物37℃保温1小时后,加入100μl 5M NaCl,使之混合。再加入80μl CTAB/NaCl溶液(10%CTAB,0.7MNaCl),65℃加热得到的混合物10分钟。用700μl氯仿/异戊醇(24∶1)抽提该混合物,进一步用600μl苯酚/氯仿/异戊醇(25∶24∶1)抽提水相。抽提后,将0.6体积的异丙醇添加到水相,而后离心水相。用70%乙醇洗涤沉淀,干燥,继而将之溶解于100μl TE(pH8.0)中,获得大肠杆菌基因组DNA溶液。测定DNA的OD260值,并定量测定DNA。然后,将TE加入溶液中,获得一种1μg/μL浓度的DNA。
采用Wizard PureFection质粒DNA纯化系统(Promega,Madison,WI)制备大肠杆菌质粒DNA。
(3)ADH1p-ADH1t片段插入到pRS414
用Nael和PvuII消化pRS414质粒(图3),从而获得一个长度为4.1kbp的无flori和LacZ部分的片段。经琼脂糖凝胶电泳纯化该片段。分别用BamHI消化、Klenow酶钝端化质粒pAUR123。然后,经琼脂糖凝胶电泳纯化一个长度为1.0kbp、含ADH1转录启动子(ADH1p)和ADH1转录终止子(ADH1t)(图5;SEQ ID NO:23)的片段。来源于pRS414的长度为4.1kbp的片段仍保留了大肠杆菌和酵母的复制起点,用于大肠杆菌的转化标记Ampγ,和用于酵母营养缺陷型标记TRP1。另一方面,来源于pAUR123的长度为1.0kbp的片段含ADH1p、ADH1t和一个位于它们中间的克隆位点。通过DNA连接试剂盒(Takara)将这两个片段彼此连接,并转化进大肠杆菌SURE2超感受态细胞(Stratagene,La Jolla,CA)。
从得到的重组体中制备质粒DNA。用SalI和ScaI得到的DNA图谱显示出:ADH1p-ADHt片段以两个方向被插入pRS414中,从而产生出两种质粒pRS414PTadh(图6A)和pRS414TPadh(图6B)。
(4)CYC1t片段插入到pRS载体
由PCR制备CYC1t(CYC1转录终止子)片段。以下列各种寡聚DNA、Xhol-Tcyc1FW和ApaI-Tcyc1RV为PCR引物。模板采用pYES2。
XhoI-Tcyc1FW:5′-TGC ATC TCG AGG GCC GCA TCA TGT AAT TAG-3′(SEQID NO:40)
ApaI-Tcyc1RV:5′-CAT TAG GGC CCG GCC GCA AAT TAA AGC CTT CG-3′(SEQ ID NO:41)
简言之,制备50μl的反应溶液,该溶液含0.1μg pYES2、50pmol每种引物DNA、含MgSO4的1x Pfu缓冲液(Promega,Madison,WI)、10nmol dNTPs、1.5单位的PfuDNA聚合酶(Promega)和1μl完全匹配聚合酶增强子(Stratagene)。反应条件如下:95℃第一次变性2分钟;95℃变性45秒,共30次循环,60℃退火30秒,72℃延伸1分钟;和72℃最终延伸5分钟。反应完成后,于4℃储存该溶液。用XhoI和ApaI消化扩增DNA,经琼脂糖凝胶电泳纯化得到的260bpDNA片段,从而获得CYC1t-XA。
将CyC1t-XA插入到pRS404和pRS406的XhoI-ApaI位点,从而分别获得pRS404Tcyc和pRS406Tcyc。
(5)转录启动子的制备
以pAUR123或酵母基因组DNA为模板,通过PCR制备含转录启动子的DNA片段。可采用的引物DNA如下:
SacI-Ptdh3FW:5′-CAC GGA GCT CCA GTT CGA GTT TAT CAT TAT CAA-3′
(SEQ ID NO:42)
SacII-Ptdh3RV:5′-CTC TCC GCG GTT TGT TTG TTT ATG TGT GTT TAT TC-3′
(SEQ ID NO:43)
SacI-Ptef2FW:5′-CCG CGA GCT CTT ACC CAT AAG GTT GTT TGT GAC G-3′
(SEQ ID NO:44)
SacII-Ptef2RV:5′-CTT TCC GCG GGT TTA GTT AAT TAT AGT TCG TTG
ACC-3′(SEQ ID NO:45)
就扩增ADH1转录启动子(ADH1p)而言,以SacI-Padh1FW和SacII-Padh1RV为PCR引物,以pAUR123为模板。就扩增TDH3(GAP)转录启动子(TDH3p(GAPp))而言,以SacI-Ptdh3FW和SacII-Ptdh3RV为PCR引物;就扩增TEF2转录启动子(TEF2p)而言,以SacI-Ptef2FW和SacII-Ptef2RV为PCR引物。对这些启动子而言,以酵母基因组DNA为模板。制备100μl反应溶液,含0.1μg pAUR123或0.46μg酵母基因组DNA、100pmol每种引物DNA、1x ExTaq缓冲液(Takara)、20nmoldNTPs、0.5U的ExTaqDNA聚合酶(Takara)和1μl完全匹配聚合酶增强子(Stratagene)。反应条件如下:95℃第一次变性2分钟;95℃变性45秒,共30次循环,60℃1分钟,72℃延伸2分钟,72℃延伸4分钟。反应完成后,于4℃储存该溶液。用SacI和SacII消化扩增的4种类型DNA,且经琼脂糖凝胶电泳纯化分离出得到的620bp、680bp、710bp和400bpDNA片段,从而分别获得TDH3p和TEF2p。
(6)2μDNA复制起点区域的制备
用SspI和NheI消化一种YEp载体,pYES2。经琼脂糖凝胶电泳纯化得到的长度为1.5kbp、含2μDNA复制起点(2μori)的片段,然后将其钝端化。该DNA片段被命名为2μOriSN。
(7)YEp型表达载体的制备
将2μOriSN插入pRS404Tcyc和pRS406Tcyc的NaeI位点,该位点事先经过BAP(细菌碱性磷酸酯酶:Takara)的预处理。将得到的质粒转化到大肠杆菌SURE2后,制备质粒DNA。用DraIII、和EcoRI、HapI或PstI、和PvuII消化质粒DNA;继而经琼脂糖凝胶电泳检测2μori的插入情况及方向。将2μori分别插入pRS404Tcyc和pRS405Tcyc,插入的方向与2μori插入pYES2的方向相同,获得的质粒被分别命名为pRS434Tcyc2μOri和pRS435Tcyc2μOri。将2μori插入pRS404Tcyc和pRS405Tcyc,插入的方向与2μori插入pYES2的方向相反,获得的相应质粒被分别命名为pRS444Tcyc2μOri和pRS445Tcyc2μOri。
将含ADH1p、TDH3p(GAPp)、PGK1p或TEF2p片段的转录启动子插入到上述四种质粒,即pRS434Tcyc2μOri、pRS435Tcyc2μOri、pRS444Tcyc2μOri、pRS445Tcyc2μOri的SacI-SacII位点,克隆DNA。获得下列四种质粒:(i)从pRS445Tcyc2μOri获得pRS434GAP和pRS434TEF;(ii)从pRS435Tcyc2μOri获得pRS435GAP;(iii)从pRS444Tcyc2μOri获得pRS444GAP和pRS444TEF;(iv)从pRS445Tcyc2μOri获得pRS445GAP(图7A-7F)。
本发明制备的表达载体总结于下表5中。
表5
载体 | 类型 | 标记和方向* | 启动子、终止子和方向* | ori和方向* |
pRS414PTadhpRS414TPadhPRS434GAPpRS434TEFpRS435GAPpRS444GAPpRS444TEFpRS445GAP | YCpYCpYEpYEpYEpYEpYEpYEp | TRP1 +TRP1 +TRP1 +TRP1 +LEU2 +TRP1 +TRP1 +LEU2 + | ADH1 ADH1 +ADH1 ADH1 -TDH3 CYC1 -TEF2 CYC1 -TDH3 CYC1 -TDH3 CYC1 -TEF2 CYC1 -TDH3 CYC1 - | ARS4&CEN6 +ARS4&CEN6 +2μ +2μ +2μ +2μ -2μ -2μ - |
*出现在标记和基因表达转录单位后的“+”和“-”标记分别表示下游方向和上游方向。出现在ori之后的“+”标记表示ori被插入的方向与其插入pRS(用于YCp载体)或pYES(用于YEp载体)的方向相同;出现在ori之后的“-”标记表示ori被插入的方向与其插入pRS(用于YCp载体)或pYES(用于YEp载体)的方向相反。
(8)YEp型表达载体在酵母中的导入
为了检查已制备的YEp型表达载体的DNA复制区域是否发挥作用,采用Frozen-EZ酵母转化II(Zymo Research,Orange,CA)将大约40ng的每种YEp型表达载体导入YPH499菌株。(步骤遵照试剂盒所附说明)。然后,检测30℃下、在SD-W(DOB+CMS(-Trp);BI0101,Vista,CA)琼脂平板上生长的菌落。其结果如下表6所示:
表6
GAP | TEF | |
pRS 434 | >1000 | >1000 |
435 | >1000 | - |
444 | >1000 | >1000 |
445 | >1000 | - |
表6中的结果显示:本发明制备的各种YEp型载体通常作为载体被保留。
【实施例2】甲羟戊酸途径相关酶基因的克隆
采用一种购自Clontech(Palo Alto,CA)的啤酒糖酵母DBY746来源的cDNA库“Quick-Clone cDNA″对源自酵母cDNA的基因进行克隆。
(1)法呢基二磷酸合酶基因的克隆
(1-1)啤酒糖酵母来源的FPP合酶基因ERG20:
以上述cDNA为模板,PCR(聚合酶链反应)扩增一个编码啤酒糖酵母FPP合酶基因ERG20(SEQ ID NO:1)、长度大约为0.9kbp的DNA片段。采用的PCR引物如下:
引物1(SCFPS1):5’-ATG GCT TCA GAA AAA GAA ATT AG-3’
(SEQ ID NO:46)
引物2(SCFPS2):5’-CTA TTT GCT TCT CTT GTA AAC TT-3’
(SEQ ID NO:47)
10x Ex Taq缓冲液(Takara) 5μl
2.5mM dNTP混合物 4μl
5U/μl Ex Taq(Takara) 1μl
10pmol 引物1
10pmol 引物2
0.5ng cDNA
共计50μl
在上述反应溶液中进行PCR反应,即30个循环过程,每个循环包括:94℃45秒,55℃1分钟和72℃2分钟。
如无其它说明,以下实施例的PCR反应条件均与上述条件相同。
经琼脂糖凝胶电泳纯化扩增后的片段,然后通过T/A连接将其克隆到pT7Blue-T(Novagen,Madison,WI)。发现ERG20被插入pT7Blue-T的方向与lacZ在该质粒上的方向相同(图8)。检测克隆后片段的核苷酸序列,并将其同记录于SGD(酵母基因组数据库,http://genome-www.stanford.edu/Saccharomyces/)中相应的核苷酸序列比较。结果没有在1-300和610-1059的核苷酸位置上发现PCR错误。
制备的质粒DNA被命名为pT7ERG20。
(1-2)大肠杆菌来源的FPP合酶基因ispA:
以大肠杆菌基因组DNA为模板、以下列合成寡聚DNA为引物,PCR克隆大肠杆菌来源的FPP合酶基因ispA(SEQ ID NO:3)。
ISPA1:5′-TGA GGC ATG CAA TTT CCG CAG CAA CTC G-3′
(SEQ ID NO:48)
ISPA2:5′-TC AGA ATT CAT CAG GGG CCT ATT AAT AC-3′
(SEQ ID NO:49)
在100μl反应溶液中进行PCR反应,该反应溶液含有1x EXTaq缓冲液、0.5mM dNTPs、100pmol ISPA1、100pmol ISPA2、0.2μg大肠杆菌基因组DNA和5单位ExTaq,反应经过30个循环过程,每个循环包括94℃1分钟、55℃1分钟、72℃1.5分钟。PCR产物由EcoRI和SphI消化后,经琼脂糖凝胶电泳纯化,得到一个长度为1.0kbp的DNA片段。将该片段插入pALTER-Ex2(Promega)的EcoRI-SphI位点后,再将其导入大肠杆菌JM109,用于克隆基因。结果获得质粒pALispA4、pALispA8、pALispA15、pALispA16、pALispA18;采用限制性内切酶EcoRI、SphI、NdeI、SmaI和BamHI得到的图谱证实ispA基因被正确地导入这些质粒中。
(1-3)嗜热脂肪芽孢杆菌来源的FPP合酶基因
用NotI和SmaI消化公开于日本未审查专利公开No.5-219961中的pFE15,经纯化,得到含一个长度为2.9kbp转录单位的FPP合酶基因片段。将该基因片段插入到pACYC177(Nippon Gene)的ScaI位点,从而制备出一个含有嗜热脂肪芽孢杆菌来源的FPP合酶基因fps(SEQ ID NO:25)的表达载体。
(2)香叶基香叶基二磷酸合酶基因的克隆
啤酒糖酵母来源的GGPP合酶基因BTS1(SEQ ID NO:5)克隆方法如下文。
简言之,即基于记录在GenBank(http://www.ncbi.nlm.nih.gov/Genbank/index.html/)(A.N.U31632)(Y.Jiang,et al.,J.Biol.Chem.270(37),21793-21799(1995))中关于啤酒糖酵母来源的GGPP合酶基因的资料,构建一对引物,该引物与基因编码的蛋白质的N-末端和C-末端匹配。以这些引物和一个酵母cDNA库(Clontech;No.CL7220-1)为模板,进行PCR。
N-末端引物:5’-ATG GAG GCC AAG ATA GAT GAG CT-3’
(SEQ ID NO:50)
C-末端引物:5,-TCA CAA TTC GGA TAA GTG GTC TA-3’
(SEQ ID NO:51)
采用完全匹配聚合酶增强子(Stratagene)进行PCR,需进行30个循环过程,每个循环包括:94℃变性45秒、55℃退火1分钟、和72℃延伸2分钟。
感兴趣的片段(长度大约为1.0kbp)已经得到证实。将该BTS1片段克隆到可TA克隆的pT7Blue T载体中,然后对BTS1的全部区域进行测序。结果显示该基因的核苷酸序列与记录于GenBank(SEQ IDNO:5)中相应的核苷酸序列完全相同。这就证实:该基因是啤酒糖酵母来源的GGPP合酶基因。
(3)乙酰辅酶A乙酰转移酶基因的克隆
采用ExTaq DNA聚合酶,PCR扩增一个长度大约为1.2kbp的基因组DNA片段,该片段编码啤酒糖酵母乙酰辅酶A乙酰转移酶基因ERG10(SEQ ID NO:26)。将得到的片段克隆到pRS435GAP和pRS445GAP的SacII-XbaI位点。采用的PCR引物如下:
引物1(SacII-ERG10):5’-TCC
CCG CGG ATG TCT CAG AAC GTTTAC ATT GT-3’(SEQ ID NO:52)
引物2(XbaI-ERG10):5’-TGC
TCT AGA TCA TAT CTT TTC AAT GACAAT GGA-3’(SEQ ID NO:53)
(下划线部分表示限制性内切酶识别位点。)
在完全匹配聚合酶增强子的共存在下进行PCR操作,需经过30个循环过程,每个循环包括95℃45秒、60℃1分钟、和72℃2分钟。由SmaI、ScaI、NcoI和BamHI识别位点图谱检验得到的质粒是否与设计的一致。成功制备的质粒被分别命名为pRS435GAP-ERG10和pRS445GAP-ERG10。
(4)HMG辅酶A合酶基因的克隆
以cDNA为模板,采用PCR扩增一个长度大约为1.5kbp的片段,该片段编码啤酒糖酵母HMG辅酶A合酶基因HMGS(SEQ ID NO:27)。退火温度采用50℃。采用的PCR引物如下:
引物1(HMGS-1-2):5’-ATG AAA CTC TCA ACT AAA CTT TGT T-3’
(SEQ ID NO:54)
引物2(scHMGS-15):5’-GTT CAG CAA GAT GCA ATC GAT GGG G-3’
(SEQ ID NO:55)
用琼脂糖凝胶电泳纯化PCR片段,再经T/A连接将其克隆到pT7Blue-T。结果发现HMGS被插入pT7Blue的方向与lacZ在该质粒上的方向相反(图8)。检测该克隆片段的核苷酸序列。与SGD中相应的核苷酸序列比较的结果显示:由于PCR错误,39位上的核苷酸A(以起始密码子ATG的第一个核苷酸A为第1位)被改变为G(A39G;下文中的PCR错误以相同方式表达)。
此外,也发现了另外5个错误:T144C、T223C、T1038C、C1122T和A1370G。在这些PCR错误中,T223C和A1370G引起编码氨基酸的改变。T223C将75位的Ser改变为Pro(S75P;下文中,氨基酸序列错误以相同方式表示),A1370G引起另一个氨基酸序列错误K457R。
得到的质粒被命名为pT7HMGS。
(5)HMG辅酶A还原酶基因的克隆
克隆啤酒糖酵母来源的HMG辅酶A还原酶基因HMG1的方法描述如下。
简言之,即基于记录在GenBank关于啤酒糖酵母来源的HMG辅酶A还原酶基因HMG1(A.N.M22002)(M.E.Basson,et al.,Mol.Cell.Biol.8,3797-3808(1988):SEQ ID NO:7)的资料,设计一对引物,该引物匹配了由该基因编码蛋白质的N-末端和C-末端。以这两种引物和酵母cDNA库(Clontech)为模板,进行PCR。
N-末端引物:5’-ATG CCG CCG CTA TTC AAG GGA CT-3’
(SEQ ID NO:56)
C-末端引物:5’-TTA GGA TTT AAT GCA GGT GAC GG-3’
(SEQ ID NO:57)
采用完全匹配聚合酶增强子进行PCR,需进行30个循环过程,每个循环包括94℃变性45秒、55℃退火1分钟、和72℃延伸2分钟。
感兴趣的片段(长度大约为3.2kbp)已经得到证实。将该片段(HMG1)克隆到可TA克隆的pT7BlueT载体中,从而获得pT7-HMG1。检测该克隆HMG1的核苷酸序列。结果显示获得了如SEQ ID NO:9所示的核苷酸序列和如SEQ ID NO:10所示的氨基酸序列。由于PCR错误,检测得到的核苷酸序列与记录于GenBank中相应的核苷酸序列有部分不同(图2A)。这个包含PCR错误的突变型HMG辅酶A还原酶基因被命名为HMG1’。
(6)HMG辅酶A还原酶基因PCR错误的纠正
通过来自pT7HMG1的HMG1片段的亚克隆纠正PCR错误,而在HMG1区域纠正这些错误会引起氨基酸替代突变。
简言之,即从含有HMG1’,HMG辅酶A还原酶基因HMG1的PCR错误型DNA的质粒pT7HMG1亚克隆一个HMG1’基因片段。然后,通过定点诱变纠正在HMG1区域的PCR错误所引起的氨基酸替代突变,从而得到pALHMG106。该制备的具体方法描述如下。
pT7HMG1质粒被用做克隆的HMG1。就用于导入定点突变的载体而言,可采用购买的pALTER-1(Promega)。
根据Promega出版的“Protocols and Application Guide,3rdedition,1996 Promega,ISBN1-882274-57-1”中描述的步骤进行定点诱变。就用于导入突变的寡聚核苷酸而言,可采用下列三种化学合成的寡聚核苷酸。
HMG1(190-216):5′-CCAAATAAAGACTCCAACACTCTATTT-3′
(SEQ ID NO:58)
HMG1(1807-1833):5′-GAATTAGAAGCATTATTAAGTAGTGGA-3′
(SEQ ID NO:59)
HMG1(2713-2739):5′-GGATTTAACGCACATGCAGCTAATTTA-3′
(SEQ ID NO:60)
首先,用SmaI、ApaLI、SalI消化pT7HMG1,然后经琼脂糖凝胶电泳纯化,得到一个长度为3.2kbp的HMG1片段。将该片段插入到pALTER-1的SmaI-SalI位点,从而制备了pALHMG1。经碱变性处理该质粒后,将上述用于导入突变的寡聚核苷酸、Amp修复寡聚核苷酸(Promega)作为修复寡聚核苷酸、和Tet剔除寡聚核苷酸(Promega)作为剔除寡聚核苷酸进行退火。将得到的质粒导入大肠杆菌ES1301(Promega)。然后,将保留有已导入定点突变的质粒转化体采用125μg/ml的氨苄青霉素进行富集培养,继而制备质粒DNA。由具有如下序列的引物检测得到的质粒DNA的核苷酸序列。结果显示所有对应于HMG1(190-216)、HMG1(1807-1833)和HMG1(2713-2739)的序列已被纠正为指定序列(SEQ ID NO:11)。由已纠正核苷酸序列(SEQID NO:12)编码的氨基酸序列与由HMG1’(SEQ ID NO:10)(沉默突变体)编码的氨基酸序列相一致。
HMG1(558-532)5′-GTCTGCTTGGGTTACATTTTCTGAAAA-3′
(SEQ ID NO:61)
HMG1(1573-1599)5′-CATACCAGTTATACTGCAGACCAATTG-3′
(SEQ ID NO:62)
HMG1(2458-2484)5′-GAATACTCATTAAAGCAAATGGTAGAA-3′
(SEQ ID NO:63)
已纠正HMG1区域内序列的质粒被命名为pALHMG106(图9)。
(7)甲羟戊酸激酶基因的克隆
以cDNA为模板,采用PCR扩增一个长度大约为1.3kbp的片段,该片段编码啤酒糖酵母甲羟戊酸激酶基因ERG12(SEQ ID NO:28)。采用的PCR引物如下所示。
引物1(ATM-1):5’-AA
C TGC AGA TGT CAT TAC CGT TCT TAA CTTC-3’
(SEQ ID NO:64)
引物2(ATM-2):5’-CC
G AGC TCT TAT GAA GTC CAT GGT AAA TTCG-3’
(SEQ ID NO:65)
(下划线部分表示限制性内切酶识别位点。)
用PstI和SacI消化得到的片段,经琼脂糖凝胶电泳纯化后,将其克隆到pT7Blue的PstI-SacI位点。通过这些步骤,ERG12插入pT7Blue的方向与lacZ在该质粒的方向相反(图8)。将克隆片段的核苷酸序列进行测定,并与记录在SGD中的相应的序列相对照。结果没有发现PCR错误。
得到的质粒DNA被命名为pT7ERG12。
(8)磷酸甲羟戊酸激酶基因的克隆
以cDNA为模板,采用PCR扩增一个长度大约为1.3kbp的片段,该片段编码啤酒糖酵母ERG8(SEQ ID NO:29)。采用的PCR引物如下所示。
引物1(YSCE-1):5’-AAC TGC AGA TGT CAT TAC CGT TCT TAACTT C-3’
(SEQ ID NO:66)
引物2(YSCE-2):5’-CCG AGC TCT TAT GAA GTC CAT GGT AAATTC G-3’
(SEQ ID NO:67)
琼脂糖凝胶电泳纯化PCR扩增片段后,通过T/A连接将其克隆到pT7Blue-T中。经过这些步骤,ERG8插入pT7Blue-T中的方向与lacZ在该质粒中的方向相反(图8)。检测克隆片段的核苷酸序列,并与记录于SGD中的相应序列相对照。结果发现下面的PGR错误:A70C、A72G、G146A、C171G、G224C、A306G、T387C、G574T、C637G、G638C、G729A、G739A、T759A、A879G和A1222G。在这些错误中,A70C和A72G引起T24P的一个氨基酸错误;G146A引起G49E的一个氨基酸错误;G224C引起S75T的一个氨基酸错误;G574T引起A192S的一个氨基酸错误;C637G和G638C引起R213A的一个氨基酸错误;G739A引起D247N的一个氨基酸错误;A1222G引起T408A的一个氨基酸错误。
得到的质粒DNA被命名为pT7ERG8。
(9)二磷酸甲羟戊酸脱羧酶基因的克隆
以cDNA为模板,采用PCR扩增一个长度大约为1.2kbp的片段,该片段编码啤酒糖酵母二磷酸甲羟戊酸脱羧酶基因ERG19(MVD1)(SEQID NO:30)。采用的PCR引物如下所示。
引物1(SCU-1):5’-AA
C TGC AGA TGA CCG TTT ACA GAG CAT CCGT-3’
(SEQ ID NO:68)
引物2(SCU-2):5’-CG
G AAT TCT TAT TCC TTT GGT AGA CCA GTCT-3’
(SEQ ID NO:69)
(下划线表示限制性内切酶识别位点)
用PstI和EcoRI消化扩增片段,经琼脂糖凝胶电泳纯化后,将其克隆到pT7Blue的PstI-EcoRI位点。通过这些步骤,ERG19(MVD1)插入pT7Blue的方向与lacZ在该质粒中的方向相反(图8)。将克隆片段的核苷酸序列进行测定,并与记录在SGD中的相应的序列对照。结果没有发现PCR错误。
得到的质粒DNA被命名为pT7ERG19。
(10)异戊烯基二磷酸Δ异构酶基因的克隆
(10-1)啤酒糖酵母来源的IPPΔ异构酶基因IDII
以cDNA为模板,采用PCR扩增一个长度大约为0.9kbp的片段,该片段编码啤酒糖酵母IDII基因(SEQ ID NO:31)。采用的PCR引物为引物1(SCIPP-1)和引物2(SCIPP-2)。
引物1(SCIPP-1):5’-ATG ACT GCC GAC AAC AAT AGT AT-3’
(SEQ ID NO:70)
引物2(SCIPP-2):5’-TTA TAG CAT TCT ATG AAT TTG CC-3’
(SEQ ID NO:71)
琼脂糖凝胶电泳纯化PCR扩增片段后,通过T/A连接将其克隆到pT7Blue-T。经过这些步骤,IDII插入pT7Blue-T的方向与lacZ在该质粒中的方向相反(图8)。将克隆片段的核苷酸序列进行测定,并与记录在SGD中的相应的序列相对照。结果没有发现PCR错误。
得到的质粒DNA被命名为pT7IDII。
(10-2)大肠杆菌来源的IPPΔ异构酶基因idi
以质粒p3-47-13(Hemmi et al.,(1998)J.Biochem.123,1088-1096)为模板,将含有大肠杆菌ORF182(预期可以编码一个同源于IPPΔ异构酶的多肽的开放读码框架;基因名称:idi)的基因组DNA克隆到该质粒中,PCR扩增一个长度大约为0.55kbp的ORF182片段。在完全匹配聚合酶增强子的共存在下进行30个循环过程以完成PCR反应,每个循环包括95℃45秒、60℃1分钟、和72℃2分钟。采用的PCR引物如下所示。
引物1(SacII-ORF182(1-23)):5’-TCC
CCG CGG ATG CAA ACGGAA CAC GTC ATT TT-3’
(SEQ ID NO:72)
引物2(XbaI-ORF182(549-525)):5’-TGC TCT AGA TTA TTTAAG CTG GGT AAA TGC AGA-3’
(SEQ ID NO:73)
(下划线部分表示限制性内切酶识别位点)
用SpeI、DraIII和AluI消化PCR产物后,经琼脂糖凝胶电泳纯化。结果获得如图10所示的物理图谱,该物理图谱与EcoGene(
http://bmb.med.miami.edu/EcoGene/EcoWeb/)的ORF182片段(idi)的核苷酸序列数据(SEQ ID NO:32)一致。然后,用SacII和XbaI消化扩增后得到的长度为0.55kbp的片段,经琼脂糖凝胶电泳纯化后,将其克隆到pRS435GAP和pRS445GAP的SacII-XbaI位点。得到的质粒被分别命名为pRS435GAP-ORF182和pRS445GAP-ORF182。
大肠杆菌IPPΔ异构酶基因(SEQ ID NO:32)曾被称为OPF182(根据NCBI BLAST检索;GenBank Accession No.AE000372),不过Hahn等人,(1999)J.Bacteriol.,181:4499-4504命名这个基因为idi。就被克隆进idi的质粒而言,本发明用到了在Hemmi等人,(1998)J.Biochem.123:1088-1096中描述的p3-47-11和p3-47-13。
【实施例3】突变基因的克隆
(1)大肠杆菌FPP合酶基因被转化为GGPP合酶基因
(FPP合酶基因突变体的克隆)
采用实施例2(1-2)节中获得的pALispA4、pALispA8、pALispA15、pALispA16和pALispA18,根据Promega出版的“Protocols and Applications Guide,3rd edition,1996 Promega,ISBN 1-882274-57-1”中描述的方法,通过替代突变修饰编码多肽79位的氨基酸残基Tyr的密码子,该多肽由大肠杆菌ispA编码。下列用于导入突变的寡聚核苷酸(有时为“突变寡聚核苷酸”)是通过化学合成法制备的。
ISPA-D:5′-ATC ATG AAT TAA TGA
GTC AGC GTG G
AT GCA TTC AAC GGC
GGC AGC-3′(SEQ ID NO:74)
ISPA-E:5′-ATC ATG AAT TAA TGA
TTC AGC GTG G
AT GCA TTC AAC GGC
GGC AGC-3′(SEQ ID NO:75)
ISPA-M:5′-ATC ATG AAT TAA TGA
CAT AGC GTG G
AT GCA TTC AAC GGC
GGC AGC-3′(SEQ ID NO:76)
在上述突变寡聚核苷酸ISPA-M中,在16-18位的核苷酸(有下划线的3个核苷酸)相当于编码野生型FPP合酶79位的氨基酸残基Tyr的密码子;设计这三个核苷酸以使该密码子编码Met。同样,设计突变寡聚核苷酸ISPA-D和IDPA-E以使这个密码子分别编码Asp和Glu。设计在上述突变寡聚核苷酸中的26-31位上的核苷酸(下划线的6个核苷酸)以使一个EcoT22I(NsiI)位点由于替代突变的原因而重新形成。带有该位点的突变基因可轻易通过限制性内切酶图谱被辨别出。这些突变寡聚核苷酸经由T4多核苷酸激酶(Promega)在其5’末端的磷酸化后,在使用前需通过Nick Column(Pharmacia Biotech,Uppsala,Sweden)的凝胶过滤,使其纯化。在突变体的导入过程中,Cm修复寡聚核苷酸(Promega)被用作修复寡聚核苷酸,Tet剔除寡聚核苷酸(Promega)被用作剔除寡聚核苷酸。Cm修复寡聚核苷酸,Tet剔除寡聚核苷酸和突变寡聚核苷酸被退火到经碱变性处理的pALispA16后,将其转化到大肠杆菌ES1301 mutS(Promega)。从生长在有20μg/ml Cm(氨霉素)存在的环境中的大肠杆菌菌落制备质粒DNA并转化到大肠杆菌JM109。然后,从生长在含有20μg/ml氯霉素的琼脂平板上的菌落中制备出质粒DNA。这些含有替代突变型ispA(称为“ispAm”)的质粒分别被命名为p4D、p4E和p4M,ispAm是通过以pALispA4为模板,以ISPA-D、ISPA-E或ISPA-M为突变寡聚核苷酸构建的。同样,通过以pALisp8为模板制备的那些质粒分别被命名为p8D、p8E和p8M;通过以pALisp15为模板制备的那些质粒分别被命名为p15D、p15E和p15M;通过以pALisp16为模板制备的那些质粒分别被命名为p16D、p16E和p16M;通过以pALisp18为模板制备的那些质粒分别被命名为p18D、p18E和p18M。
编码Y79D突变型氨基酸序列(SEQ ID NO:34)的基因如SEQ IDNO:33所示;编码Y79E突变型氨基酸序列(SEQ ID NO:36)的基因如SEQ ID NO:35所示;编码T79M突变型氨基酸序列(SEQ ID NO:38)的基因如SEQ ID NO:37所示。这样,获得的质粒就可被恰当地选择和利用。
(2)嗜热脂肪芽孢杆菌FPP合酶基因突变体的克隆
从公开在Ohnuma et al.,(1996)J.Biol.Chem.,271,30748-30754中的pFPS(Y81M)制备出含嗜热脂肪芽孢杆菌FPP合酶基因(fps:SEQ ID NO:39)的一个替代突变体基因的表达载体。
pFPS是一种在pTV118N(Takara)的lac启动子下游整合了fps的质粒,并且在IPTG存在的环境中,该质粒可在大肠杆菌中表达嗜热脂肪芽孢杆菌FPP合酶基因。首先,通过定点诱变将Y81M突变(即替代突变,该突变将被FPP合酶基因编码的氨基酸序列的81位上的Tyr改变为Met)导入FPP合酶基因[从而获得pFPS(Y81M)]。Y81M突变体的导入使得被编码在pEPS(Y81M)上的酶的反应产物特异性被改变;在没有降低编码酶的比活性的情况下,FPP合酶基因被修饰成为一个GGPP合酶基因。随后,由PshBI消化Pfps(Y81M),并由Klenow酶对其进行钝端化。接着,纯化含转录单位的一个长度为2.7Kbp的片段,并将其插入到pACYC177中Amp’基因的HincII位点。以与Amp’基因相反的方向插入突变fps基因片段得到的质粒被命名为pFPS21m;以与Amp’基因相反的方向插入突变fps基因片段的质粒被命名为pFPS31m(图11)。
(3)HMG辅酶A还原酶基因缺失突变体的克隆
均含有一个组成型启动子ADH1p的载体pRS414PTadh和Prs414TPadh经限制性内切酶消化后,插入HMG1,从而制备出质粒pRS414PTadh-HMG1和PRS414TPadh-HMG1。
用BamHI、SalI和ScaI消化实施例2(5)中制备的pT7-HMG1,获得有PCR错误的HMG1’基因。将该基因导入到pYES2(Invitrogen,Carlsbad,CA)的BamHI-XhoI位点,从而获得一个重组载体pYES-HMG1。该载体中的核苷酸序列经证实是SEQ ID NO:3的核苷酸序列。pYES是一个用于在酵母中表达的穿梭载体,该酵母有作为复制始点的酵母2μmDNA的ori和可被半乳糖诱导的GAL1启动子(图4)。
为制备用于HMG辅酶A还原酶基因缺失突变体的表达载体,该缺失突变体有与HMG辅酶A还原酶跨膜区域相应的区域缺失,以上文中制备的pYES-HMG1为模板进行PCR反应,从而产生缺失部分HMG1编码区域的DNA片段(包括载体部分)。用Klenow酶使获得片段钝端化,通过自连接环化后,导入大肠杆菌JM109。然后制备质粒DNA。用作引物的合成DNA序列及其组合如上表1中所示。
就获得的每种质粒DNA而言,373A DNA测序仪(Perkin Elmer,Foster City,CA)证实在HMG1的缺失区域上游与下游之间的氨基酸的读码框架上没有发生偏移,在结合位点周围也没有因PCR错误引起的氨基酸替代。这样就获得了在结合位点周围没有因PCR错误引起的氨基酸替代,且在读码框架没有任何偏移的情况下可以缺失一部分基因的下列质粒。根据缺失模式将HMG1基因的缺失突变体表示为,如“Δ02y”(y代表任意工作数字),包含有Δ02y的pYES2载体可表示为,如pYHMG026。(这一表示方式也适用于其它缺失突变体。)
HMG1Δ026:SEQ ID NO:13
HMG1Δ044:SEQ ID NO:14
HMG1Δ056:SEQ ID NO:15
HMG1Δ062:SEQ ID NO:16
HMG1Δ076:SEQ ID NO:17
HMG1Δ081:SEQ ID NO:18
HMG1Δ100:SEQ ID NO:19
HMG1Δ112:SEQ ID NO:20
HMG1Δ122:SEQ ID NO:21
HMG1Δ133:SEQ ID NO:22
质粒:
pYHMG026,pYHMG027,pYHMG044,pYHMG045,pYHMG059,pYHMG062,
pYHMG063,pYHMG065,pYHMG076,pYHMG081,pYHMG083,pYHMG085,
pYHMG094,pYHMG100,pYHMG106,pYHMG107,pYHMG107,pYHMG109,
pYHMG112,pYHMG122,pYHMG123,pYHMG125,pYHMG133和pYHMG134,
【实施例4】基因亚克隆到载体中
对具有一个组成型转录启动子的大肠杆菌-啤酒糖酵母YEp穿梭载体而言,采用实施例1中制备的pRS载体。
(1)FPP合酶基因型的亚克隆
(1-1)啤酒糖酵母来源的FPP合酶基因ERG20:
用XbaI和BamHI消化实施例2(1-1)节描述的pT7ERG20,经琼脂糖凝胶电泳纯化获得长度为1.1kbp的ERG20基因片段。将该片段插入到pRS435GAP和pRS445GAP的XbaI-BamHI位点,从而分别获得pRS435GAP-ERG20和pRS445GAP-ERG20。
(1-2)大肠杆菌来源的FPP合酶基因ispA:
用SphI和EcoRI消化实施例2(1-2)节描述的pALispA4,经琼脂糖凝胶电泳纯化得到长度为1.0kbp的ispA基因片段。将SphI-SacII接头DNA(5’-pTTT CCG CGG AAA CAT G-3’;SEQ ID NO:86)和EcoRI-Eco52I接头DNA(5’-pAAT TGA CGG CCG TC-3’;SEQ IDNO:87)连接到该片段。然后,经SacII和Eco52I消化该片段。为了亚克隆,将得到的长度为1.0kbp的SacII-Eco52I片段插入pRS435GAP和pRS445GAP的SacII-EGo52I位点。就每种亚克隆质粒而言,得到了SacI、SacII、NdeI、NsiI(EcoT22I)、Aor51HI、XbaI、SmaI、BamHI、PstI、NdeI、PvuII和EcoT14I的识别位点图谱,然后挑选出按设计构建的质粒。挑选出的质粒分别被命名为pRS435GAP-ispA和pRS445GAP-ispA。
(1-3)嗜热脂肪芽孢杆菌来源的FPP合酶基因
将嗜热脂肪芽孢杆菌来源的FPP合酶基因从一个染色体PCR片段直接克隆到一个载体中。
(2)GGPP合酶基因或其突变体的亚克隆
(2-1)啤酒糖酵母来源的GGPP合酶基因BTS1:
用BamHI和SalI消化实施例2(2)中描述的pT7Blue-T载体,获得一个编码BTS1的片段,将其导入到pYES2(Invitrogen)的BamHI-XhoI位点。得到的重组载体被命名为pYESGGPS。
用BamHI和MluI消化pYESGGPS后,经琼脂糖凝胶电泳纯化,得到一个长度为1.3kbp的片段。将该片段插入到pRS435GAP和pRS445GAP的BamHI-MluI位点,从而分别获得pRS435GAP-BTS1和pRS445GAP-BTS1。
(2-2)大肠杆菌来源的GGPP合酶基因(替代突变型FPP合酶基因)ispAm:
用SphI和EcoRI消化实施例3(1)中描述的p16M后,经琼脂糖凝胶电泳纯化,得到一个长度为1.0kbp的片段,该片段编码ispAm基因。将本实施例(1-2)中描述的SphI-SacII接头DNA和EcoRI-Eco52I接头DNA连接到该片段,然后,经SacII和Eco52I消化。为了亚克隆,将得到的SacII-Eco52I片段(1.0kbp)插入到pRS435GAP和pRS445GAP的SacII-Eco52I位点。对每种亚克隆质粒而言,得到SacI、SacII、NdeI、NsiI(EcoT22I)、Aor51HI、XbaI、SmaI、BamHI、PstI、PvuII和EcoT14I的识别位点图谱,然后挑选出按设计构建的质粒。在这些识别位点中,NsiI(EcoT22I)识别位点是当导入一个替代突变体到ispA中时被新导入的位点。如质粒可被该限制性内切酶切割,就证实质粒中的基因是ispA突变基因ispAm。挑选出的质粒被分别命名为pRS435GAP-ispAm和pRS445GAP-ispAm。
(3)乙酰辅酶A乙酰转移酶基因的亚克隆
将乙酰辅酶A乙酰转移酶基因ERG10从一个基因组PCR片段直接克隆到pRS载体中。
(4)HMG辅酶A合酶基因的亚克隆
从实施例2(4)中描述的pT7HMGS制备一个长度为1.5kbp编码HMGS基因的BamHI-SalI片段,将其插入到pRS435GAP和pRS445GAP的BamHI-SalI位点。通过KpnI限制位点图谱检测HMGS-亚克隆质粒后,选择按设计构建的质粒。挑选出的质粒分别被命名为pRS435GAP-HMGS和pRS445GAP-HMGS。
(5)HMG辅酶A还原酶基因或其突变体的亚克隆
用BamHI、SalI和ScaI消化实施例2(5)中描述的pT7Blue-T载体,从而切除编码一个PGR错误型突变HMG辅酶A还原酶的HMG1’基因。将该基因插入到pYES2(Invitrogen)的BamHI-XhoI位点。得到的质粒被命名为pYES-HMG1。
用SmaI和SalI消化载体pRS414PTadh和pRS414TPadh,这两种载体都含有一个组成型ADH1p。将HMG1基因插入消化后的两种载体,从而制备出pRS414PTadh-HMG1和pRS414TPadh-HMG1。
进一步用SmaI和SalI消化在实施例2(6)中描述的pALHMG106(图9),经琼脂糖凝胶电泳纯化得到一个长度为3.2kbp的片段,该片段编码已纠正PCR错误的HMG1基因。将该片段插入到pRS434GAP、pRS444GAP、pRS434TEF、pRS444TEF、pRS434PGK、pRS444PGK的SmaI-SalI位点。通过使用XhoI、SpeI、NaeI和SphI得到的限制性内切酶图谱,及确认插入的长度为3.2kbp的HMG1基因的边缘区域的核苷酸序列,来检验HMG1-亚克隆质粒的物理图谱。然后,挑选出精确按设计构建的质粒,并分别命名为pRS434GAP-HMG1、pRS444GAP-HMG1、pRS434TEF-HMG1、pRS444TEF-HMG1、pRS434PGK-HMG1、pRS444PGK-HMG1。
从pYES2衍生的质粒中获得HMG辅酶A还原酶基因的缺失突变体,该衍生质粒掺入了实施例3描述的HMG1的相应缺失突变体,以前面章节中描述的方式将HMG辅酶A还原酶基因的缺失突变体克隆到pRS434GAP中。
(6)甲羟戊酸激酶基因的亚克隆
从实施例2(7)中描述的pT7ERG12制备一个长度为1.3kbp的SmaI-SalI片段,该片段编码ERG12基因,将其插入到pRS435GAP和pRS445GAP的SmaI-SalI位点。经KpnI识别位点图谱检测ERG12-亚克隆质粒,继而挑选出精确按设计构建的质粒,并分别命名为pRS435GAP-ERG12和pRS445GAP-ERG12。
(7)磷酸甲羟戊酸激酶基因的亚克隆
从实施例2(8)描述的pT7ERG8制备一个长度为1.3kbp BamI-SalI片段,该片段编码ERG8基因,并将其插入到pRS435GAP和pRS445GAP的SmaI-SalI位点。经XbaI识别位点图谱检验ERG8-亚克隆的质粒,继而挑选出精确按设计构建的质粒,并分别命名为pRS435GAP-ERG8和pRS445GAP-ERG8。
(8)二磷酸甲羟戊酸脱羧酶基因的亚克隆
用BamHI和SalI消化实施例1(9)中描述的pT7ERG19,经琼脂糖凝胶电泳纯化,得到一个长度为1.5kbp的BamHI-SalI片段,该片段编码ERG19基因。将该片段插入到pRS435GAP和pRS445GAP的BamHI-SalI位点。挑选出精确按设计构建的质粒,并分别命名为pRS435GAP-ERG19和pRS445GAP-ERG19。
(9)异戊烯基二磷酸Δ异构酶基因的亚克隆
(9-1)啤酒糖酵母来源的IPPΔ异构酶基因IDI1:
从实施例2(10-1)中描述的pT7IDI1制备一个长度为0.9kbp的BamHI-SalI片段,并将其插入到pRS435GAP和pRS445GAP的BamHI-SalI位点。用NcoI和BamHI识别位点图谱检测亚克隆质粒,挑选出精确按设计构建的质粒,并分别命名为pRS435GAP-IDI1和pRS445GAP-IDI1。
(9-2)大肠杆菌来源的IPPΔ异构酶基因ORF182(idi):
按实施例2(10-2)的描述将ORF182(idi)从一个基因组PGR片段直接克隆到pRS载体。
【实施例5】AURGG101、AURGG102和AURGG703的制备
以实施例4(2-1)中描述的pYESGGPS为模板,并采用下列引物PYES2(1-27)和PYES2(861-835),经PCR制备一个长度为1.9kbp的SalI片段,该片段具有GAL1启动子=BTS1=CYC1终止子(GAL1p-BTS1-CYC1t)一级结构。
PYES2(1-27):5′-GGC CGC AAA TTA AAG CCT TCG AGC GTC-3′
(SEQ ID NO:88)
PYES2(861-835):5′-ACG GAT TAG AAG CCG CCG AGC GGG TGA-3′
(SEQ ID NO:89)
将该片段插入到pAUR101(Takara)的SalI位点,从而获得pAURGG115。经DNA测序证实:在pAURGG115中的BTS1基因没有PCR错误。
用Eco065I线性化pAURGG115质粒,通过乙酸锂法将该质粒导入A451菌株和YPH499菌株。然后,挑选出能够在30℃、含有1μg/ml金担子素的YPD琼脂平板(1%酵母提取物、2%蛋白胨、2%葡萄糖、2%琼脂)上生长的菌落,将其作为转化体。
在金担子素选择性平板上重新培养该转化体,以挑选出单菌落。
结果获得A451衍生的重组体的两种菌株:AURGG101和AURGG102。同样获得作为YPH499衍生的重组体的菌株AURGG703。下述DNA印迹杂交(图12)和PCR图谱(图13)结果显示:BTS1基因没有被整合到AURGG101中,并且在该菌株中,AURI被一个标记基因AUR1-C所替代。另一方面,发现GAL1启动子=BTS1=CYC1终止子被整合在AURGG102的AUR1基因座。
【实施例6】EUG菌株的构建
从SGD获得鲨烯合酶基因ERG9附近的一个基因图谱。根据该图谱,设计用于扩增DNA片段的各种PCR引物DNA,扩增片段被用于替代ERG9转录启动子(ERG9p)。另一方面,以pYES2Δ为模板,该模板是通过用NaeI和NheI消化pYES2,再由Klenow酶将该质粒钝端化,通过自连接删除2μori而获得的,采用PCR扩增法制备一个长度为1.8kbp的DNA片段,该片段含一个转化体选择标记基因URA3和一个转录启动于GALIp。
采用的PCR扩增引物如下所示:
E-MCSf:5′-GCC GTT
GAC AGA GG
G TCC GAG CIC GGT ACC AAG-3′
(SEQ ID NO:90)
E-URA3r:5′-CAT ACT
GAC CCA TT
G TCA ATG GGT AAT AAC TGA T-3′
(SEQ ID NO:91)
在上述每种引物中,都加入一个Eaml105I识别位点(下划线部分),使得可以通过T/A连接,将一个长度为0.7kbp、含有YHR189W的一个下游部分的DNA片段和一个长度为0.9kbp、含有ERG9的一个上游部分的DNA片段连接到一个长度为1.8kbp片段上。YHR189W片段是以YHR189Wf和YHR189Wr为引物,以YPH499基因组DNA为模板,经PCR扩增制备的。ERG9片段是以ERG9f和ERG9r为引物,以YPH499基因组DNA为模板,经PCR扩增制备的。YPH499染色体DNA是用一个酵母基因组DNA制备试剂盒“Dr.GenTLETM”(Takara)制备的。
YHR189Wf:5′-TGT CCG GTA AAT GGA GAC-3′(SEQ ID NO:92)
YHR189Wr:5′-TGT TCT CGC TGC TCG TTT-3′(SEQ ID NO:93)
ERG9f:5′-ATG GGA AAG CTA TTA CAA T-3′(SEQ ID NO:94)
ERG9r:5′-CAA GGT TGC AAT GGC CAT-3′(SEQ ID NO:95)
简言之,用Eaml105I消化长度为1.8kbp的DNA片段后,将其连接到长度为0.7kbp的DNA片段上。以得到的片段为模板,采用上述引物YHR189Wf和E-MCSf进行第二次PCR扩增。再用Eaml105I消化经上述步骤得到的长度为2.5kbp的DNA片段,将其连接到长度为0.9kbp的DNA片段上。以得到的片段为模板,采用下述引物YHR189W-3f和ERG9-2r进行第三次PCR扩增。结果得到长度为3.4kbp的DNA片段。该DNA片段将用于转化。
YHR189W-3f:5’-CAA TGT AGG GCT ATA TAT G-3’(SEQ ID NO:96)
ERG9-2r:5’-AAC TTG GGG AAT GGC ACA-3’(SEQ ID NO:97)
采用购自Zymo Research(Orange,CA)的Frozen EZ酵母转化II试剂盒将载体导入酵母菌株。将获得的重组体培养在特定琼脂培养基(称为SGR(-URA)培养基)上,培养温度为30℃,该培养基是通过将CSM(-URA)(购自EIO 101,Vista,CA)和硫酸腺嘌呤(终浓度为40mg/L)加入到SGR培养基中而获得的。将生长于该培养基上的菌落再次涂布于相同培养基上,可得到分离的单菌落。
获得的重组体被命名为EUG(ERG9p::URA3-GAL1p)克隆。在这些克隆中,A451衍生的克隆被命名为EUG1-EUG10;YPH499衍生的克隆被命名为EUG11-EUG20;YPH500衍生的克隆被命名为EUG21-EUG30;W303-1A衍生的克隆被命名为EUG31-EUG50;W303-1B衍生的克隆被命名为EUG51-EUG70。
可挑选出那些显示出生长速率下降的克隆,生长速率下降是由于SD培养基中的葡萄糖抑制了ERG9表达而造成的。结果从A451中获得EUG5和EUG8;从YPH499中获得EUG12;从YPH500中获得EUG27。
采用Dr.GenTLETM,从EUG5、EUG8、EUG12和EUG27制备基因组DNA,以基因组DNA为模板,进行PCR扩增。结果证实:长度为1.8kbp、含有URA3和GAL1p的PCR片段被整合到每个基因组的ERG9编码区域的上游。
【实施例7】基因和酶活性分析
本实施例中,采用各种不同的技术,包括异戊二烯基二磷酸合酶的酶活测定、RNA印迹杂交、DNA印迹杂交、PCR作图和异戊二烯醇产量测定,来分析所制备的不同重组酵母克隆中(就其制备而言,见描述异戊二烯醇制备的实施例8-13)的基因表达情况。
(1)DNA印迹法
采用酵母DNA纯化试剂盒Dr.GenTLETM,根据试剂盒所附说明制备酵母DNA。
制备的酵母DNA经NdeI和StuI消化后,采用0.8%琼脂糖凝胶电泳,每泳道DNA上样量为3μg。分子量标记采用0.5μg的1kb的序列梯和0.5μg的λ/HindIII(均购自Promega,Madison,WI)。电泳完成后,根据传统方法,将DNA碱变性、中和、并通过20x SSC的毛细管印迹转移到Hybond N尼龙膜(Amersham,Buckinghamshire,England)。在最适交联条件下,用UV交联仪(Stratagene)对尼龙膜进行照射,以固定DNA到膜上。
(2)RNA印迹法
根据Current Protocols in Molecular Biology,John Wiley &Sons,pp.13.12.2-13.12.3中描述的方法,并稍加修改而制备RNA。该修改即:将已制备的RNA样品进一步用Dnase I处理。
经甲醛变性的琼脂糖凝胶电泳分离RNA后,根据传统方法,将RNA通过20x SSC的毛细管印迹转移到Hybond N尼龙膜。每泳道电泳上样量为5μg总RNA。分子量标记采用20ng的DIG-RNA标记I。在最适交联条件下用UV交联仪(Stratagene)对得到的膜进行紫外照射,以固定RNA到膜上。
(3)PCR作图
为检测一个源自pAURGG115(实施例5中制备的一个YIp载体)的片段是如何被整合进染色体的,以上述已制备的0.3-0.6μg的酵母DNA为模板,采用合成寡聚核苷酸引物AUR-FWc和AUR-RVc,或AUR-SAL1和AUR-SAL2的组合为引物进行PCR扩增。PCR条件如下:30个循环过程,每个循环包括94℃变性30秒,55℃退火1分钟,72℃延伸3分钟。
AUR-FWc:5′-TCT CGA AAA AGG GTT TGC CAT-3′(SEQ ID NO:98)
AUR-RVc:5′-TCA CTA GGT GTA AAG AGG GCT-3′(SEQ ID NO:99)
AUR-SAL1:5′-TGT TGA AGC TTG CAT GCC TGC-3′(SEQ ID NO:100)
AUR-SAL2:5′-TTG TAA AAC GAC GGC CAG TGA-3′(SEQ ID NO:101)
(4)DIG标记的探针DNA的制备
制备探针I、II、III和V(表7),作为杂交探针。
表7.杂交探针
探针编号 | 基因 | 模板 | 引物1 | 引物2 |
IIIIIIV | ERG20BTS1HMG1AUR1 | pT7ERG20pYES2-GGPS6pYHMG1pAUR123 | SCFPS1BTS1(1-21)HMG1(1267-1293)AUR-RV | SCFPS2BTS1(1008-982)HMG1(2766-2740)AUR-FW |
探针I:
以实施例2(1-1)中制备的pT7ERG20为模板,以SCEPS1和SCEPS2为引物,用PCR DIG探针合成试剂盒(Roche Diagnostics,MannheimGermany)合成一种DIG标记的探针DNA。实验条件与试剂盒所附制造商的说明一致。PCR需进行30个循环过程,每个循环包括94℃30秒、58℃1分钟、72℃3分钟。继而经琼脂糖凝胶电泳检测得到的DIG标记的探针DNA的合成状态。
探针II:
一种DIG标记的探针DNA,其合成方式与合成探针I描述的方式相同,引物是合成寡聚核苷酸BSTS1(1-21)和BTS1(1008-988),模板为(见实施例4(2-1))pYESGGPS。
BTS1(1-21):5′-ATG GAG GCC AAG ATA GAT GAG-3′(SEQ ID NO:102)
BTS1(1008-988):5′-TCA CAA TTC GGA TAA GTG GTC-3′(SEQ ID NO:103)
探针III:
一种DIG标记的探针DNA,其合成方式与合成探针I描述的方式相同,采用的引物是合成寡聚核苷酸HMG1(1267-1293)和HMG1(2766-2740),模板为(见实施例3(3))pYES-HMG1。
HMG1(1267-1293):5′-AAC TTT GGT GCA AAT TGG GTC AAT GAT-3′
(SEQ ID NO:80)
HMG1(2766-2740):5′-TCC TAA TGC CAA GAA AAC AGC TGT CAC-3′
(SEQ ID NO:104)
探针V:
一种DIG标记的探针DNA,其合成方式与合成探针I描述的方式相同,采用的引物是合成寡聚核苷酸AUR-FW和AUR-RV,模板为pAUR123(Takara)。
AUR-FW:5′-ATG GCA AAC CCT TTT TCG AGA-3′(SEQ ID NO:105)
AUR-RV:5′-AGC CCT CTT TAC ACC TAG TGA-3′(SEQ ID NO:106)
(5)探针的杂交和检测
在探针浓度为20ng/ml、42℃条件下、采用DIG Easy Hyb(Roche)进行持续时间24小时的DNA印迹杂交。在探针浓度为100ng/ml、50℃下、采用DIG Easy Hyb进行持续时间24小时的RNA印迹杂交。每次杂交前,在DIG Easy Hyb溶液中先进行持续时间24小时的预杂交,预杂交温度与各次杂交的温度相同。杂交完成后,于65℃、用2x SSC、0.1%SDS洗涤膜3次,每次持续时间为10分钟;然后于65℃、用2xSSC、0.1%SDS洗涤膜2次,每次持续时间为15-20分钟。接着,采用DIG发光检测试剂盒(Roche)使膜上的DIG标记探针可以化学发光,再将印迹曝光到X射线胶片上,使其显影。
(6)酶活性的测定
在制备的重组体中,下列宿主菌株和重组体被应用在本实验。根据Current Protocols in Molecular Biology,John Wiley &Sons,Inc.,pp.13.7.1-13.7.2中描述的乙酸锂法,或者采用Frozen EZ酵母转化II试剂盒(Zymo Research,Orange,CA)(操作步骤与试剂盒所附说明一致),将各个载体导入宿主。在下列清单中,通过将pYES-HMG1导入到A451中获得克隆1-2;通过将pYHMG044导入到A451中获得克隆3-2;通过将pYES-HMG1导入到AURGG101中获得克隆13-2;通过将pYHMG044导入到AURGG101中获得克隆15-2。
No.1宿主菌株:A451
No.2 GAL1p-BTS1(YIp):AURGG101(A451,aur1::AUR1-C)
No.3 GAL1p-BTS1(YIp):AURGG102(A451,aur1::BTS1-AUR1-C)
No.4 GAL1p-HMG1(YEp):1-2(pYES-HMG1/A451)
No.5 GAL1p-HMG1Δ(YEp):3-2(pYHMG044/A451)
No.6 GAL1p-HMG1(YEp)&GAL1p-BTS1(YIp):13-2(pYES-HMG1/AURGG101)
No.7 GAL1p-HMG1Δ(YEp)&GAL1p-BTS1(YIp):15-2(pYHMG044/AURGG101)
No.8 GAL1p-HMG1(YEp)&GAL1p-BTS1(YIp):24-1(pYES-HMG1/AURGG102)
No.9 GAL1p-HMG1Δ(YEp)&GAL1p-BTS1(YIp):27-2(pYHMG045/AURGG102)
No.10 GAL1p-HMG1Δ(YEp)&GAL1p-BTS1(YIp):31-2(pYHMG076/AURGG102)
在26℃分别预培养No.1-10的菌株/克隆。用生理盐水洗涤1毫升的预培养物,将其加入到300ml Erlenmeyer烧瓶中的100ml培养液中,于26℃、120次/分钟的振荡条件下培养。该培养采用SD或SG培养基(将SD培养基中的葡萄糖组分替换为半乳糖)。保留URA3标记的重组体被培养在SD-U[加CSM(-URA)的SD培养基]或SG-U[加CSM(-URA)的SG培养基]。AURGG菌株则培养在含1μg/ml金担子素的培养基中。
测定细胞的OD600值,以监测细胞生长。当OD600值达到3-4(培养开始后23-52小时)时停止培养。将培养物在冰中冷却后,制备DNA、RNA和粗酶溶液,方法描述如下。
通过离心从每种培养液中收获细胞后,采用与制备RNA相同的方式,在4℃用玻璃珠破碎细胞。然后,将细胞悬浮在无菌水中,制成菌悬液。再用冷冻微离心机以12,000rpm转速、离心10分钟该菌悬液,回收得到的上清作为粗酶提取物。以BSA为标准蛋白质,由Bio-Rad蛋白测定(Bio-Rad,Hercules,CA)检测该粗酶提取物中的蛋白质浓度。简言之,即37℃下,在200μl、含下列物质的反应混合物中反应10μg的粗酶提取物40分钟。
0.125mM[14C]IPP(185 GBq/mol)
0.125mM香叶基二磷酸(Sigma Chemical,St.Louis,MO)
100mM Tris·HCl(pH7.0)
10mM NaF,5mM MgCl2
5mM 2-巯基乙醇
0.05%Triton X-100
0.005%BSA
反应完毕,用水饱和的丁醇提取延伸的异戊二烯基二磷酸。经由液体闪烁计数检测一份异戊二烯基二磷酸的放射活性。用马铃薯酸磷酸酶将剩余样品脱去磷酸,再用薄层层析法[板:LKC18(Whatman,Clifton,NJ);显影剂∶H2O/丙酮=1∶19]显影,接着根据Koyama等人的方法(Koyama T.,Fujii,H.and Ogura,K.,1985,Meth.Enzymol.110:153-155)用Bio图像分析仪BAS2000(Fuji Flim)进行放射自显影成像,以检测相对放射活性。
(7)观察结果
(7-1)DNA印迹杂交和PCR图谱
DNA印迹杂交结果如图12所示。AUR1附近的PCR图谱结果如图13所示。在图12和图13中,1-10泳道对应的是上文(6)中采用的菌株/克隆(No.1-No.7)的编号。
出现在每泳道编号下的“N”表示DNA被NdeI消化;“S”表示DNA被StuI消化。自下列菌株/克隆制备出每泳道上样的各种DNA。
泳道1:A451;泳道2:AURGG101;泳道3:AURGG102;泳道4:pYES-HMG1/A451;泳道5:pYHMG044/A451;泳道6:pYES-HMG1/AURGG101;泳道7:pYHMG044/AURGG101;泳道8:pYES-HMG1/AURGG102;泳道9:pYHMG045/AURGG102;和泳道10:pYHMG076/AURGG102
发现检测的所有菌株/克隆中ERG20(FPP合酶基因)都是相同的,并且在每个菌株/克隆基因组中的ERG20附近区域没有变化(图12)。
当以BTS1(GGPP合酶基因)和AUR1为探针时,发现BTS1被整合进AURGG102的AUR1区域,而AURGG101中出现的条带与在宿主菌株A451中出现的条带相同。在AURGG101中,只有AUR1基因被pAUR101来源的AUR1-C基因所替代;发现GAL1-BTS1片段没有被整合进该菌株的基因组中。当由PCR检测因染色体整合引起的AUR1位点复制时,仅在AURGG102中检测到预期的条带,在AURGG101中没有检测到预期条带(图13)。
图12中,当以HMG1为探针时,在NdeI消化的各种DNA(泳道4-7)中出现一个质粒来源的条带。在StuI消化的各种DNA中,原先预期能够如在克隆1-2(No.4)的情况一样,出现一个长度为8.2kbp、来源自质粒的条带(重叠了一个长度为8.3kbp、来源自基因组的条带)。然而,由于基因组中的HMG1临近区域与导入质粒发生重组,使得在克隆13-2(No.6)和克隆15-2(No.7)中观察到一个条带偏移。
从DNA印迹杂交和PCR图谱的结果中,可以总结本实施例采用的菌株/克隆基因型如下表8中所示。该表中,“AUR”指一种加入了金担子素的培养基。“培养基1”指用于预培养的培养基,“培养基2”指用于主培养的培养基。
表8
菌株/克隆No. | 命名 | 整合基因 | 质粒中基因 | 培养基1 | 培养基2 |
12345678910 | A451AURGG101AURGG1021-23-213-215-224-127-231-2 | --BTS1----BTS1BTS1BTS1 | ---HMG1HMG1Δ044HMG1HMG1Δ044HMG1HMG1Δ045HMG1Δ076 | SDSD-AURSD-AURSD-USD-USD-U-AURSD-U-AURSD-U-AURSD-U-AURSD-U-AUR | SGSG-AURSG-AURSG-USG-USG-U-AURSG-U-AURSG-U-AURSG-U-AURSG-U-AUR |
(7-2)RNA印迹杂交
RNA印迹杂交结果如图14所示。杂交时用到了表7中所列的探针I、II、III和V。
图14中,泳道1-10所用的菌株/克隆同图12中所用的菌株/克隆相同。标记“-”表示在SD培养基中的转录物,标记“+”表示在SG培养基中的转录物。
当SG培养基诱导GAL1p转录时,在克隆13-2(No.6)和克隆15-2(No.7)中,ERG20转录物显示出减少的趋势。
当在GAL1转录启动子控制下,SG培养基诱导基因的转录时,BTS1转录物仅在特定的菌株中增加,即AURGG102(No.3),该菌株中,染色体整合了GAL1p-BTS1片段。
然而,当同HMG1转录物比较时,发现BTS1的转录诱导程度比较低。当由SG培养基诱导转录时,在克隆No.4-No.7中HMG1转录物有非常显著的增加,这些克隆都通过质粒导入了GAL1p-HMG1片段。
(7-3)异戊二烯基二磷酸合酶活性
以香叶基二磷酸(GPP)和14C标记的IPP为烯丙基二磷酸底物,测定在粗酶组分中异戊二烯基二磷酸合酶活性。
简言之,将以(GPP)和[14C]IPP为底物合成的各个异戊二烯基二磷酸脱磷酸,经TLC显影,继而检测每个斑点的放射活性。结果是,FPP合酶活性最高,其次是HexPP(六异戊二烯基二磷酸)合酶活性,并发现该酶活性远高于GGPP合酶活性。然后,从放射自显影图中推算出反应产物的相对含量,继而推算总蛋白质的比活性。结果如图15所示。图15A中,上图显示FPP合酶(FPS)活性,下图显示GGPP合酶(GGPS)活性。图15B中,上图显示HexPP合酶(HexPS)活性,下图显示PT酶(总异戊二烯基二磷酸合酶)活性。阴影柱显示在SD培养基中的结果,空白柱显示在SG培养基中的结果。总异戊二烯基二磷酸合酶活性的大部分是FPP合酶活性。因SG培养基造成的该活性的增加是可以观察到的。尤其是FPP合酶活性在克隆13-2(No.6)和15-2(No.7)中有显著的增加。总体上,当GPP被作为烯丙基底物时,GGPP合酶活性大约为FPP合酶活性的1/20000,大约是HexPP合酶活性的1/300。在SG培养基中,HexPP合酶活性是降低的。
[实施例8]DNA导入宿主及宿主的培养
(通过啤酒糖酵母表达)
该实施例中,为构建一个甲羟戊酸途径相关酶基因在啤酒糖酵母中永久表达的体系,通过将啤酒糖酵母来源的基因导入含有一个组成型启动子和多种不同营养缺陷型标记的表达穿梭载体,从而制备用于甲羟戊酸途径相关酶基因的表达载体。然后评估这些基因高表达对异戊二烯醇制备的效果。
(1)酵母的转化
将得到的用于每个甲羟戊酸途径相关酶基因的表达载体导入宿主。在新导入的载体里,每个甲羟戊酸途径相关酶基因被插入到TDH3转录启动子TDH3p(=GAPp)的下游。就宿主而言,可采用下列菌株/克隆。
A451
AURGG101
YPH499
AURGG703
YPH500
W303-1A
W303-1B
EUG5(来源自A451)
EUG8(来源自A451)
EUG12(来源自YPH499)
EUG24(来源自YPH500)
EUG27(来源自YPH500)
15-2(pYHMG044/AURGG101)(来源自AURGG101)
(2)酵母的培养
根据采用的标记基因,在SD选择性培养基中,预培养每个导入甲羟戊酸途径相关酶基因的酵母克隆。然后,将25μl的预培养液加入到2.5ml的YM或SG(SD的葡萄糖组分被半乳糖替代)培养基中,以130rpm、26℃条件下反复振荡培养4天。在将预培养液加入SG培养基前,预先用生理盐水洗涤细胞,以免葡萄糖组分被带入培养基。当采用YPH499来源的克隆时,在培养基中加入硫酸腺嘌呤,浓度为40μg/ml。
(3)戊烷抽提
在甲羟戊酸途径相关酶基因转化的酵母克隆的培养完成后,检测培养物的30倍稀释液的OD600值。然后,加入2.5ml甲醇到该稀释液中,并使其混合。再加入5ml戊烷,并剧烈振荡。继而将得到的混合物静置后,转移戊烷层到一个干净的玻璃试管中。将该试管置于通风橱内,使戊烷蒸发以浓缩溶质。随后,以10μl 1.0ml/L十一醇作为内部标准物加入其中,从而制备出用于GC/MS的样品。
(4)GC/MS分析
用HP6890/5973 GC/MS系统(Hewlett-Packard,Wilmington,DE)将戊烷抽提组分进行分离、鉴定和定量检测。采用的色谱柱是HP-5MS(0.25mm×30m;膜厚0.25μm)。分析条件如下所示。相同的条件也适用于本说明书中所有的GC/MS分析。
入口温度:250℃
检测器温度:260℃
[MS区域温度]
MS四极杆温度:150℃
MS离子源温度:230℃
质量扫描范围:35-200
[进样参数]
自动进样模式
样品容量:2μl
甲醇洗涤3遍,正己烷洗涤2遍
分流比:1/20
载气:氦气1.0ml/分钟
溶剂延迟:2分钟
[炉加热条件]
115℃90秒
以70℃/分钟的速度加热至250℃,并维持2分钟
以70℃/分钟的速度加热至300℃,并维持7分钟
0小时后
内部标准:0.01μl 1-十一醇于乙醇中
可信标准:(全E)-橙花叔醇(Eisai)
(全E)-法呢醇(Sigma)
(全E)-香叶基香叶醇(Eisai)
鲨烯(Tokyo Kasei Kogyo)
(5)结果
基因、表达载体、宿主、培养条件(培养基、温度及培养时间)与最大GGOH产量之间的关系总结于下表9中。
表9.最大GGOH产量
基因 导入的DNA | 宿主 | 培养基 | 温度(℃) | 培养时间(hr) | GGOH产量(mg/l) |
HMG1pRS434GAP-HMG1pRS444GAP-HMG1pRS444TEF-HMG1pYES-HMG1pYES-HMG1pRS414TPadh-HMG1 | Sc A451Sc A451Sc A451Sc A451Sc AURGG101Sc YPH499 | YMYMYMSGSGYM* | 262630262628 | 969696484896 | 0.3480.1280.0690.1002.200.136 |
HMG1ΔpYHMG044pYHMG056pYHMG062pYHMG076pYHMG081pYHMG112pYHMG122pYHMG044pYHMG044pYHMG044pYHMG062pYHMG076pYHMG081 | Sc A451Sc A451Sc A451Sc A451Sc A451Sc A451Sc A451Sc AURGG101Sc AURGG101Sc AURGG101Sc AURGG101Sc AURGG101Sc AURGG101 | SGSGSGSGSGSGSGSGSGSGSGSGSG | 26262626262626262630262626 | 48484848484848489696484848 | 0.0530.0700.0650.0500.0510.0640.0622.200.7297.950.0610.0620.052 |
HMG1+HMG1ΔpYHMG044+pRS434GAP-HMG1pYHMG044+pRS444GAP-HMG1 | Sc AURGG101Sc AURGG101 | SGSG | 2626 | 9696 | 0.9270.739 |
ERG20(YEp)pRS435GAP-ERG20pRS445GAP-ERG20 | Sc A451Sc A451 | YMYM | 2626 | 9696 | 0.0670.073 |
BTS1(基因组)pAURGG115(Eoo0651消化的)pAURGG115(Eoo0651消化的) | Sc A451(AURGG102)Sc YPH499(AURGG703) | SGSG | 3028 | 9898 | 0.0750.093 |
BTS1(Yep vector)pRS445GAp-BTS1pRS435GAP-BTS1pRS435GAP-BTS1(=pRS435GG)pRS445GAP-BTS1(=pRS445GG)pRS435GAP-BTS1(=pRS435GG)pRS445GAP-BTS1(=pRS445GG) | Sc A451Sc YPH499Sc W303-1ASc W303-1ASc W303-1BSc W303-1B | YMYMYMYMYMYM | 262830303030 | 969696969696 | 0.5850.2040.7260.1890.8530.254 |
HMG1Δ(YEp)+ERG20(YEp)pYHMG044+pRS435GAP-ERG20pYHMG044+pRS445GAP-ERG20 | Sc AURGG101Sc AURGG101 | SGSG | 2626 | 9696 | 11.31.24 |
HMG1Δ(YEp)+ispA(YEp)pYHMG044+pRS435GAP-ispApYHMG044+pRS445GAP-ispA | Sc AURGG101Sc AURGG101 | SGSG | 2626 | 9696 | 1.640.900 |
HMG1(YEp)+BTS1(YEp)pRS434GAP-HMG1(=pRS435GG)pRS434GAP-HMG1(=pRS445GG)pRS434TEF-HMG1(=pRS435GG)pRS434TEF-HMG1(=pRS445GG) | Sc YPH499Sc YPH499Sc YPH499Sc YPH499 | YMYMYMYM | 26262626 | 96969696 | 0.5810.3500.5090.630 |
HMG1(YEp)+BTS1(基因组)pYES-HMG1pYES-HMG1pYES-HMG1 | Sc AURGG102Sc AURGG102Sc AURGG703 | SGSGSG | 262626 | 489696 | 0.0901.2800.462 |
HMG1Δ(YEp)+BTS1(YEp)pYHMG044(=pRS435GG)pYHMG044(=pRS445GG) | Sc AURGG101Sc AURGG101 | SGSG | 2626 | 9696 | 9.768.82 |
HMG1Δ(YEp)→BTS1(基因组)
pYHMG027 Sc AURGG102 SG 26 48 0.078
pYHMG044 Sc AURGG102 SG 26 48 0.120
PYHMG044 Sc AURGG102 SG 26 96 0.415
pYHMG045 Sc AURGG102 SG 26 48 0.610
pYHMG059 Sc AURGG102 SG 26 48 0.099
pYHMG062 Sc AURGG102 SG 26 48 0.120
pYHMG062 Sc AURGG102 SG 26 48 0.418
pYHMG063 Sc AURGG102 SG 26 48 0.110
pYHMG076 Sc AURGG102 SG 26 48 0.400
pYHMG083 Sc AURGG102 SG 26 48 0.210
pYHMG094 Sc AURGG102 SG 26 48 0.170
pYHMG108 Sc AURGG102 SG 26 48 0.120
pYHMG122 Sc AURGG102 SG 26 48 0.160
pYHMG123 Sc AURGG102 SG 26 48 0.097
pYHMG134 Sc AURGG102 SG 26 48 0.110
pYHMG044 Sc AURGG703 SG 26 96 0.201
pYHMG062 Sc AURGG703 SG 26 96 0.243HMG1Δ(YEp)+ispAm(YEp)
pYHMG044+pRS435GAP-lspAm Sc AURGG101 SG 26 96 1.36
HMG1Δ(YEp)+ORF182(YEp)
pYHMG044+pRS435GAP-ORF182 Sc AURGG101 SG 26 96 0.626
pYHMG044+pRS445GAP-ORF182 Sc AURGG101 SG 26 96 1.16
HMG1Δ(YEp)+HMGS(YEp)
pYHMG044+pRS435GAP-HMGS Sc AURGG101 SG 26 96 1.30
pYHMG044+pRS445GAP-HMGS Sc AURGG101 SG 26 96 0.883
HMG1Δ(YEp)+ERG12(YEp)
pYHMG044+pRS435GAP-ERG12 Sc AURGG101 SG 26 96 0.702
pYHMG044+pRS445GAP-ERG12 Sc AURGG101 SG 26 96 1.01
HMG1Δ(YEp)+ERG8(YEp)
pYHMG044+pRS435GAP-ERG8 Sc AURGG101 SG 26 96 0.700
pYHMG044+pRS445GAP-FRG8 Sc AURGG101 SG 26 96 2.72
HMG1Δ(YEp)+ERG10(YEp)
pYHMG044+pRS435GAP-ERG10 Sc AURGG101 SG 26 96 1.16
pYHMG044+pRS445GAP-ERG10 Sc AURGG101 SG 26 96 1.22
HMG1Δ(YEp)+ERG19(YEp)
pYHMG044+pRS435GAP-ERG19 Sc AURGG101 SG 26 96 1.89
pYHMG044+pRS446GAP-FRG19 Sc AURGG101 SG 26 96 1.02
fpsm(Y81M)
pFPSm21 Ec JM109 2xYT*** 37 16 16.1
ispAm(Y79M) p16M Ec JM109 2xYT*** 37 16 21.9
ispAm(Y79D)
p15D Ec JM109 2xYT*** 37 16 0.12
ispAm(Y79E)
p4E Ec JM109 2xYT*** 37 16 0.26
ispAm(Y79M)+idi
pALispA16m+p3-47-13 Ec JM109 2xYT 37 16 0.07
- Sc A451** 0.02
- Sc AURGG101 0.02
- Sc YPH499 0.00
- Sc YPH500 0.00
- Sc W303-1A 0.00
- Sc W303-1B 0.00
Ec JM109 0.00
HMG1
pRS434GAP-HMG1 Sc EUG8 YM 30 96 0.16
pRS444GAP-HMG1 Sc EUG8 YM 30 96 0.12
pRS434GAP-HMG1 Sc EUG12 YM 30 96 1.03
pRS444GAP-HMG1 Sc EUG12 YM 30 96 1.02
pRS434GAP-HMG1 Sc EUG12 YM 30 96 0.55
pRS434GAP-HMG1 Sc EUG27 YM 30 96 0.55
pRS434GAP-HMG1 Sc EUG27 YM 30 96 0.63
HMG1Δ
pYHMG044 Sc AURGG101 YMO 26 157 3.58
pRS434GAP-HMG026 Sc EUG5 YM 30 96 0.09
pRS434GAP-HMG044 Sc EUG5 YM 30 96 0.09
pRS434GAP-HMG056 Sc EUG5 YM 30 96 0.11
pRS434GAP-HMG062 Sc EUG5 YM 30 96 0.13
pRS434GAP-HMG076 Sc EUG5 YM 30 96 0.15
pRS434GAP-HMG081 Sc EUG5 YM 30 96 0.14
pRS434GAP-HMG100 Sc EUG5 YM 30 96 0.18
pRS434GAP-HMG112 Sc EUG5 YM 30 96 0.34
pRS434GAP-HMG122 Sc EUG5 YM 30 96 0.13
pRS434GAP-HMG133 Sc EUG5 YM 30 96 0.71
pRS434GAP-HMG026 Sc EUG12 YM 30 96 0.63
pRS434GAP-HMG044 Sc EUG12 YM 30 98 0.44
pRS434GAP-HMG056 Sc EUG12 YM 30 96 0.4
pRS434GAP-HMG062 Sc EUG12 YM 30 96 0.45
pRS434GAP-HMG076 Sc EUG12 YM 30 96 0.55
PRS434GAP-HMG081 Sc EUG12 YM 30 96 0.49
pRS434GAP-HMG100 Sc EUG12 YM 30 96 0.44
pRS434GAP-HMG112 Sc EUG12 YM 30 96 0.53
pRS434GAP-HMG122 Sc EUG12 YM 30 96 0.5
pRS434GAP-HMG133 Sc EUG12 YM 30 96 0.44BTS1(载体)
pRS435GG Sc EUG8 YM 30 96 1.4
pRS435GG Sc EUG12 YM 30 96 1.58
pRS435GG Sc EUG27 YM 30 96 1.53FPS基因
pFPSm21 Ec JM109 2xYT*** 37 16 16.1
pFPSm31 Ec JM109 2xYT*** 37 16 6.9
p4D Ec JM109 2xYT*** 37 16 0.09
p4E Ec JM109 2xYT*** 37 16 0.26
p4M Ec JM109 2xYT*** 37 16 15.5
p8M Ec JM109 2xYT*** 37 16 0.31
p15D Ec JM109 2xYT*** 37 16 0.12
p15E Ec JM109 2xYT*** 37 16 0.21
p16D Ec JM109 2xYT*** 37 18 0.06
p16E Ec JM109 2xYT*** 37 16 0.88
p16M Ec JM109 2xYT*** 37 16 21.9
p18E Ec JM109 2xYT*** 37 16 0.14
p16M Ec JM109 2xYT*** 37 16 6FPS基因+idi
p16M+p3-47-13 Ec JM109 2xYT 37 16 0.07GGHDEL
pRS445GGHDEL Sc YPH499 YM 30 96 0.23FGG融合体
pRS435FGG Sc YPH499 YM 30 96 0.46
pRS435FGGHDEL Sc YPH499 YM 30 96 0.29
GGF融合体
pRS435GGF Sc A451 YM 30 96 0.28
pRS435GGF Sc A451 YMO 30 96 0.48
pRS435GGF Sc A451 YM 30 168 0.28
pRS435GGF Sc A451 YMO 30 168 1.01
pRS435GGF Sc YPH499 YM 30 98 2.1
pRS435GGF Sc YPH499 YMO 30 96 1.49
pRS435GGF Sc YPH499 YM 30 168 0.37
pRS435GGF Sc YPH499 YMO 30 168 2.92
pRS435GGF Sc EUG5 YM 30 96 5.2
pRS435GGF Sc EUG5 YMO 30 96 4.2
pRS435GGF Sc EUG5 YM(100) 30 96 3.5
pRS435GGF Sc EUG5 YM 30 168 7.32
pRS435GGF Sc EUG5 YMO 30 168 10.1
pRS435GGF Sc EUG12 YM 30 96 0.47
pRS435GGF Sc EUG12 YMO 30 96 2.38
pRS435GGF Sc EUG12 YM(20) 30 96 7.04
pRS435GGF Sc EUG12 YM 30 168 1.43
pRS435GGF Sc EUG12 YMO 30 168 5.78
pRS435GGFHDEL Sc A451 YMO 30 96 0.13
pRS435GGFHDEL Sc YPH499 YM 30 96 1.9
pRS435GGFHDEL Sc YPH499 YMO 30 96 1.69
pRS435GGFHDEL Sc YPH499 YM 30 168 0.54
pRS435GGFHDEL Sc YPH499 YMO 30 168 2.5
pRS435GGFHDEL Sc EUG5 YM 30 96 5.78
pRS435GGFHDEL Sc EUG5 YMO 30 96 3.97
pRS435GGFHDEL Sc EUG5 YM(75) 30 96 3.94
pRS435GGFHDEL Sc EUG5 YM 30 168 6.99
pRS435GGFHDEL Sc EUG5 YMO 30 168 10.8
pRS435GGFHDEL Sc EUG12 YM 30 96 0.6
pRS435GGFHDEL Sc EUG12 YMO 30 96 2.33
pRS435GGFHDEL Sc EUG12 YM(20) 30 98 8.01
pRS435GGFHDEL Sc EUG12 YM 30 188 1.18
pRS435GGFHDEL Sc EUG12 YMO 30 168 5.78HMG1+GGF融合体
pRS434GAP-HMG1+pRS435GGF Sc A451 YM 30 98 0.55
pRS434GAP-HMG1+pRS435GGF Sc A451 YMO 30 96 0.36
pRS434GAP-HMG1+pRS435GGF Sc A451 YM 30 168 0.93
pRS434GAP-HMG1+pRS435GGF Sc A451 YMO 30 168 1.01
pRS434GAP-HMG1+pRS435GGF Sc A451 YM(50) 30 16B 4.54
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YM 30 96 0.78
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YMO 30 96 2.46
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YM(50) 30 98 2.25
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YMO 33 109 128
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YM 30 168 1.28
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YMO 30 168 5.66
pRS434GAP-HMG1+pRS435GGF Sc YPH499 YM(100) 30 168 2.5
pRS434GAP-HMG1+pRS435GGFHDEL Sc A451 YM 30 96 0.54
pRS434GAP-HMG1+pRS435GGFHDEL Sc A451 YMO 30 96 0.41
pRS434GAP-HMG1+pRS435GGFHDEL Sc A451 YM 30 168 0.76
pRS434GAP-HMG1+pRS435GGFHDEL Sc A451 YMO 30 188 3.49
pRS434GAP-HMG1+pRS435GGFHDEL Sc A451 YM(75) 30 168 5.74
pRS434GAP-HMG1+pRS435GGFHDEL Sc YPH499 YM 30 96 1
pRS434GAP-HMG1+pRS435GGFHDEL Sc YPH499 YMO 30 96 2.6
pRS434GAP-HMG1+pRS435GGFHDEL Sc YPH499 YM(100) 30 168 2.85
pRS434GAP-HMG1+pRS435GGFHDEL Sc YPH499 YM 30 168 2.45
pRS434GAP-HMG1+pRS435GGFHDEL Sc YPH499 YMO 30 68 6.16
*:加入了0.1%ADEKANOL+5%Glc。
**:当培养宿主本身时,在任何培养条件下都不产生GGOH。
***:加入了IPP和DMAPP
在“基因”一栏中,“YEp″意指采用YEp载体转移的基因,“基因组”意指通过基因组整合转移的基因。
在上表中,在宿主一栏中的“Ec”意指大肠杆菌,“Sc”意指啤酒糖酵母。在“培养基”一栏中,“YM(20)”意指该YM培养基含20%Glc-80%Gal的初始糖成分,并且在2天的培养中,Glc被进一步加进该培养基中,达到5%的终浓度。培养基的其它组分及数值有相同的意义。
(5-1)通过ERG20表达制备GGOH
当pRS435GAP-ERG或pRS445GAP-ERG被导入A451时,重组体高效产出GGOH。当采用pRS445GAP-ERG时,可产出0.73mg/L的GGOH(表9)。
(5-2)通过BTS1表达制备GGOH
当pRS435GAP-BTS1或pRS445GAP-BTS1被导入A451或YPH499时,GGOH产量大幅度增加(表9)。当宿主为A451时,重组体平均产出0.10-0.11mg/L的GGOH,最多为0.585mg/L(表9)。此外,当pRS435GAP-BTS1或pRS445GAP-BTS1被导入W303-1A或W303-1B时,最多产出0.19-0.85mg/L的GGOH(表9)。
(5-3)通过HMG辅酶A还原酶基因或其突变体表达制备GGOH
(i)当组成型启动子连接的HMG1基因被导入A451时GGOH的制备
(表达为“组成型启动子;HMG1;A451”;该表示方式也应用于下文描述的其它重组体)
GGOH产量的测定结果如图16所示。在图16中,434和444分别表示采用pRS434GAP和pRS444GAP载体时的结果。该图表中的右柱显示的是培养基因导入前的宿主(A451)所得到的结果。
这些结果显示:在pRS343GAP-HMG/A451中,GGOH产率有改善,平均产出0.105mg/L的GGOH,并且根据菌落的不同,仅活化了HMG1基因的转录,最多就可产出0.348mg/L的GGOH(表9)。这样,该重组体被认为对GGOH的制备是有效的。
(ii)诱导型启动子;HMG1;A451 & AURGG101
将质粒pYES2-HMG导入到A451和AURGG101(A451,aur1::AUR1-C)中,该质粒是通过将一个HMG1基因(HMG1’,一个PCR错误型HMG1)插入到含诱导型启动子GAL1p的载体pYES2而获得。
结果获得GGOH高产克隆。AURGG101衍生的克隆的GGOH产量平均达到1.1mg/L,最多达到2.2mg/L(图17)。
(iii)诱导型启动子;HMG1 &BTS1;AURGG102 &AURGG703
将质粒pYES-HMG导入到A451衍生的AURGG102和YPH499衍生的AURGG703(该宿主染色体中整合有BTS1)中,该质粒是通过将一个HMG1’插入到含诱导型启动子GAL1p的载体pYES2而获得。
结果,当以AURGG102或AURGG703为宿主时,只要采用GAP1p就可获得GGOH高产克隆(图18)。AURGG102衍生的克隆最多可产出1.28mg/L的GGOH。
(iv)诱导型启动子;HMG1Δ;A451
将下列质粒分别导入A451中,这些质粒均通过将HMG1’基因的缺失突变体插入到含诱导型启动子GAL1p的载体pYES2而获得。
pYHMG026
pYHMG044
pYHMG056
pYHMG062
pYHMG076
pYHMG081
pYHMG100
pYHMG112
pYHMG122
在SG培养基中培养得到的重组体,继而测定GGOH产量(图19)。图19中,“HMG1Δ026”表示当把pYHMG026导入A451时的结果。其它基因的导入也以相同方式表示。
当用诱导型启动子表达HMG1基因的缺失突变体时,获得GGOH高产克隆。HMG1Δ056和HMG1Δ062对GGOH制备是有效的。(HMG062/A451克隆平均产出0.063mg/L的GGOH。)
(v)诱导型启动子;HMG1Δ;AURGG101
将下列质粒分别导入AURGG101中,这些质粒均通过将HMG1’基因的缺失突变体插入到含诱导型启动子GAL1p的载体pYES2中而获得。
pYHMG026
pYHMG044
pYHMG056
pYHMG062
pYHMG076
pYHMG081
pYHMG100
pYHMG112
pYHMG122
pYHMG133
在SG培养基中培养得到的重组体,继而测定GGOH产量(图20)。结果在HMG1Δ044中测到大约3.1mg的GGOH产量。在图20中,最右边的柱表示基因导入前的宿主AURGG101的GGOH产量。
(vi)诱导型启动子;HMG1Δ&BTS1;AURGG102
将下列质粒分别导入AURGG102中,这些质粒均通过将HMG1’基因的缺失突变体插入到含诱导型启动子GAL1p的载体pYES2而获得。
pYHMG027
pYHMG044
pYHMG045
pYHMG059
pYHMG062
pYHMG063
pYHMG076
pYHMG083
pYHMG094
pYHMG106
pYHMG112
pYHMG123
pYHMG134
在SG培养基中培养得到的重组体,继而测定GGOH产量。
结果当导入pYHMG045时,获得平均产出0.36mg/L GGOH的克隆(图21)。
(vii)诱导型启动子;HMG1Δ&BTS1;AURGG703
将质粒pYHMG044和pYHMG062分别导入AURGG703中,这两种质粒均通过将HMG1’基因的缺失突变体插入到含诱导型启动子GAL1p的载体pYES2而获得。在SG培养基中培养得到的重组体,继而测定GGOH产量。
结果是导入pYHMG062的克隆平均产出0.21mg/L GGOH的克隆(图22)。
(5-4)通过BTS1和HMG辅酶A还原酶基因共表达制备GGOH
将pRS435GAP-HMG1或pRS445GAP-HMG1同BTS1一起导入YPH499中,测定得到的重组体的GGOH产量。结果当导入pRS435GAP-HMG1时,获得一个最多产出0.58mg/L GGOH的克隆(表9)。
(5-5)通过ispAm、0RF182(idi)、HMGS、ERG8、ERG10或ERG19与HMG辅酶A还原酶基因的一个缺失突变体(HMG1Δ)共表达制备GGOH
将ispAm、ORF182(idi)、HMGS、ERG8、ERG10或ERG19与HMG1Δ一起导入AURGG101中,继而测定得到的重组体的GGOH产量。结果获得最多可产出0.6-2.7mg/L GGOH的克隆。
[实施例9]DNA导入宿主及宿主的培养
(在大肠杆菌中的表达)
将下列载体导入大肠杆菌JM109中,用大肠杆菌FPP合酶基因ispA的表达载体:pALisp4、pALisp15、pALisp16和pALisp18;用作通过转化自ispA的GGPP合酶基因ispA(Y79D)、ispA(Y79E)和ispA(Y79M)的替代突变的表达载体:p4D、p4E、p4M、p8M、p15D、p15E、p16D、p16E、p16M、p18E和p18M;以及用作嗜热脂肪芽孢杆菌FPP合酶基因fps的Y81M突变体的表达载体:pFPSm21和pFPSm31。预培养得到的重组体。将0.5ml的预培养液加入到装有50ml含2xYT和1mM IPTG的培养基的300m1烧瓶中。如有需要,还可加入抗生素(氨苄青霉素和氯霉素)、5mM(大约0.12%(w/v))IPP和5mM DMAPP,37℃下振荡培养细胞16小时。
培养完成后,将马铃薯酸磷酸酶加入到培养液上清中,经超声波破碎产生的沉淀,继而用有机溶剂戊烷将异戊二烯醇抽提出来。然后,由GC/MS对异戊二烯醇进行鉴别和定量测定。进一步地,为确定在没有IPP和DMAPP加入的情况下,是否能够产出异戊二烯醇,将实施例3(1)中获得的质粒p16M(命名为pALispA16m)和实施例2(10-2)中获得的保留有IPPΔ异构酶基因idi的p3-47-13导入大肠杆菌JM109中,然后预培养。将0.5ml的预培养液加入到装有50ml含2xYT和1mM IPTG的培养基的300ml烧瓶中。如有需要,还可加入抗生素(氨苄青霉素和氯霉素)。然后,37℃下振荡培养细胞16小时。
当加入IPP和DMAPP到培养基中时,GGOH产量结果如下。当导入突变体fps(图23的pFPSm21和pFPSm31)时,GGOH产量为16.1mg/L和6.9mg/L。当导入突变体ispA时,保留p4M、p16M和p18M的JM109细胞分别产出15.5mg/L、21.9mg/L和6.0mg/L的GGOH(图23)。在导入Y79M突变的p4M和p16M中,验证出其沉淀组分中有高GGOH活性。从这些结果,可以认为pALispA4和pALispA16在大肠杆菌细胞中能表达具有活性的FPP合酶基因,且其替代突变型质粒p4M和p16M也有足够的表达活性。
当没有加入IPP和DMAPP到培养基中时,在保留pALispA16m的JM109中,GGOH产量为0.07mg/L。当pALispA16m和p3-47-13(保留IPPΔ异构酶基因)被共表达时,以GGOH计算,异戊二烯醇的产率为0.12mg/L。
[实施例10]通过融合基因表达制备异戊二烯醇
假定被啤酒糖酵母BTS1编码的GGPP合酶倾向以FPP而非DMAPP(二甲基烯丙基二磷酸)为起始底物。可以认为,在加强FPP合成能力的同时,需要加强从IPP合成GGPP(GGOH的前体)的能力。
考虑到这一点,该实施例中试图构建由BTS1和ERG20组成的融合基因,以在啤酒糖酵母细胞中将其表达,并确定GGOH的产率是否改善。进一步也试图掺入在BTS1、ERG20或其融合基因的下游编码一个ER过渡信号的核苷酸序列,并检测其对异戊二烯醇制备的影响。
(1)质粒DNA的制备
以pYESGGPS和pT7ER20为模板进行PCR反应,这两种质粒,前者是掺入GGPP合酶基因BTS1的pYES2质粒,后者是掺入FPP合酶基因ERG20的pT7质粒。PCR引物如下所示。
SacII-BTS1:5′-TCC
CCG CGG ATG GAG GCC AAG ATA GAY-3′
(SEQ ID NO:107)
BTS1-XhoI:5′-CAA
CTC GAG TCA CAA TTC GGA TAA GTG-3′
(SEQ ID NO:108)
ERG20HDEL-XbaI:5′-GC
T CTA GAG TTC GTC GTG TTT GCT TCT CTT GTA
AAC TT-3′(SEQ ID NO:109)
BTS1HDEL-XhoI:5′-TAT
CTC GAG TCA CAA TTC GTC ATG TAA ATT GG-3′
(SEQ ID NO:110)
BTSI-109I:5′-GCA
GGG ACC CCA ATT CGG ATA AGT GGT C-3′
(SEQ ID NO:111)
109I-BTS1:5′-GTA
GGG TCC CTG GAG GCC AAG ATA GAT G-3′
(SEQ ID NO:112)
ERG20-109I:5′-GCA
GGG ACC CTT TGC TTC TCT TGT AAA CT-3′
(SEQ ID NO:113)
109I-ERG20:5′-GTA
GGG TCC TCA GAA AAA GAA ATT AGG AG-3′
(SEQ ID NO:114)
-21:5′-TGT AAA ACG ACG GCC AGT-3′(SEQ ID NO:115)
T7:5′-TAA TAC GAC TCA CTA TAG GG-3′(SEQ ID NO:116)
ERG20HDEL-XbaI:5′-GC
T CTA GAG TTC GTC GTG TTT GCT TCT CTT GTA
AAC TT-3′(SEQ ID NO:117)
BTSlHDEL-XhoI:5′-TAT
CTC GAG TCA CAA TTC GTC ATG TAA ATT GG-3′
(SEQ ID NO:118)
ERG20HDEL-XbaI的3-8位的核苷酸与BTS1HDEL-XhoI4-9位的核苷酸(下划线部分)表示用于载体连接的SacII、XhoI或XbaI识别位点。BTS1-109I、109I-BTS1、ERG20-109I和109I-ERG20的4-10位的核苷酸(下划线部分)分别表示用于融合基因制备的Eco0109I识别位点。
在下述反应溶液中进行PCR。
1xKOD-Plus缓冲液(Toyobo)
0.2mM dNTPs
0.25mM MgSO4
15pmol引物1
15pmol引物2
0.01-0.1μg模板DNA
1单位KOD-Plus DNA聚合酶(Toyobo)
共计:50μl
KOD-Plus含1.6μg/μl的KOD抗体。接着在94℃下初始变性2分钟,再进行30个循环过程完成PCR,每个循环包括94℃15秒、55℃30秒和68℃1分钟。然后,将溶液置于68℃2分钟。
如表10和图24所示的方法采用模板和引物(引物1+引物2)的组合进行第一次PCR。PCR产物的命名如表10和图24所示。在图24中,最终质粒的命名如最左边的列所示。以灰色字母书写的序列表示氨基酸序列。其中,GS被导入融合基因的结合序列,HDEL被作为ER过渡信号插入。箭头指示用于PCR的各个引物的位置及方向。
表10
模板 | 引物1 | 引物2 | PCR产物 |
pT7ERG20 | SacII-BTS1 | BTS1HDEL-XhoI | #6 |
pYESGGPS | SacII-ERG20 | ERG20HDEL-XbaI | #7 |
pYESGGPS | SacII-BTS1 | BTS1-109I | #9 |
pT7ERG20 | T7 | ERG20-109I | #10 |
pT7ERG20 | 109I-ERG20 | -21 | #11 |
pYESGGPS | 109-BTS1 | BTS1-XhoI | #12 |
pT7ERG20 | 109I-ERG20 | ERG20HDEL-XbaI | #13 |
pYESGGPS | 109I-BTS1 | BTS1HDEL-XhoI | #14 |
用限制性酶Eco0109I消化PCR产物#9、#10、#11、#12、#13和#14。然后,将#9和#11、#10和#12、#9和#13、及#10和#14彼此单独连接。以得到的连接溶液为模板,以SacII-BTS1和-21的组合、T7和BTS1-XhoI的组合、SacII-BTS1和ERG20HDEL-XbaI的组合、及T7和BTS1HDEL-XhoI的组合为引物1和引物2,进行第二次PCR,反应条件与进行第一次PCR的条件相同。结果获得第二次PCR的产物#9-#11、#10-#12、#9-#13和#10-#14。
用SacII和BamHI消化产物#9-#11,并将其插入到pRS435GAP和pRS445GAP的SacII-BamHI位点,从而分别获得pRS435GGF和pRS445GGF。
用XbaI和XhoI消化产物#10-#12,并将其插入到pRS435GAP和pRS445GAP的XbaI-XhoI位点,从而分别获得pRS435FGG和pRS445FGG。
用SacII和XbaI消化产物#9-#13,并将其插入到pRS435GAP的SacII-XbaI位点,从而获得pRS435GGFHDEL。
用XbaI和XhoI消化产物#10-#14,并将其插入到pRS435GAP和pRS445GAP的XbaI-XhoI位点,从而分别获得pRS435FGGHDEL和pRS445FGGHDEL。
用SacII和XbaI消化产物#7,并将其插入到pRS435GAP和pRS445GAP的SacII-XbaI位点,从而分别获得pRS435FHDEL和pRS445FHDEL。
用BamHI和XhoI消化产物#6,并将其插入到pRS435GAP和pRS445GAP的BamHI-XhoI位点,从而分别获得pRS435GGHDEL和pRS445GGHDEL。
通过DNA测序证实每一种得到的质粒DNA都具有与设计精确一致的核苷酸序列。
就用于分别表达非融合BTS1和ERG20基因的质粒而言,采用了pRS435GAP-BTS1(称为pRS435GG)、pRS445GAP-BTS1(称为pRS445GG)、pRS435GAP-ERG20(称为pRS435F)和pRS445GAP-ERG20(称为pRS445F)。就用于表达HMG1的质粒而言,采用的是pRS434TEF-HMG1和pRS434GAP-HMG1。
(2)重组体的制备
用Frozen EZ酵母转化试剂盒(Zymo Research,Orange,CA)将上述步骤制备的质粒导入宿主,从而制备出重组体。就宿主而言,可以采用A451、YPH499、AH1(pRS434GAP-HMG1/A451)、YH1(pRS434GAP-HMG1/YPH499)、EUG5和EUG12。
(3)异戊二烯醇产量的测定
将除EUG菌株以外的重组体接种到SD(合成右旋糖)选择性液体培养基中。EUG菌株被接种到SGR培养基(即将SD培养基中的葡萄糖组分用半乳糖和棉籽糖替代后得到的培养基)中。然后全部在30℃下培养,以制备预培养液。将10或25μl的预培养液加入到1.0或2.5ml的YM7+ade培养基(YM,pH7,40μg/ml硫酸腺嘌呤)或YMO培养基[YM7+ade,1%大豆油,0.1%ADEKANOL LG-109(Asahi Denka Kogyo,Tokyo,Japan)]中,然后以130rpm,30℃下反复振荡培养4天或7天。
培养完成后,将等体积的甲醇加入到培养液中,将之混合。再将大约2体积的戊烷加入到该混合液中,剧烈振荡后静置。将得到的相应戊烷层转移到一个干净的玻璃试管中,然后置于通风橱内。戊烷在通风橱内挥发后即可浓缩溶质组分。随后,通过GC/MS鉴定异戊二烯醇,并以十一醇为内部标准定量检测。与此同时,可以将20μl培养液用水稀释30倍后,在600nm测其吸光度,以考查细胞生长的程度。
就GC/MS分析法而言,采用PH6890/5973 GC/MS系统(Hewlett-Packard,Wilmington,DE)。
(4)观察结果
通过表达融合基因获得的最大GGOH产量如表11所列。
表11通过融合基因表达的最大GGOH产量
基因 | 表达载体 | 宿主 | 培养基 | 温度(℃) | 培养时间 | GGOH产量(mg/l) |
435FHDEL | pRS435FHDELpRS445FHDELpRS435FHDELpRS445FHDEL | Sc EUG5Sc EUG5Sc EUG12Sc EUG12 | YMYMYMYM | 30303030 | 96969696 | 0.1710.1060.0900.056 |
435GGHDEL | pRS445GGHDEL+pRS434GAP-HMG1pRS435GGHDELpRS445GGHDELpRS435GGHDELpRS445GGHDEL | Sc YPH499Sc EUG5Sc EUG5Sc EUG12Sc EUG12 | YMYMYMYMYM | 3030303030 | 9696969696 | 0.2270.1680.3970.7330.825 |
435FGG | pRS435FGGpRS445FGGpRS435FGG+pRS434GAP-HMG1pRS445FGG+pRS434GAP-HMG1pRS435FGGpRS445FGGpRS435FGGpRS445FGG | Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc EUG5Sc EUG5Sc EUG12Sc EUG12 | YMYMYMYMYMYMYMYM | 3030303030303030 | 9696969696969696 | 0.2710.1580.4620.6482.462.174.833.65 |
435GGF | pR5435GGFpRS435GGFpRS435GGFpRS435GGFpRS435GGF+pRS434GAP-HMG1pRS435GGF+pRS434GAP-HMG1pRS435GGF+pRS434GAP-HMG1pRS435GGF+pRS434GAP-HMG1pRS435GGFpRS435GGFpRS435GGFpRS435GGFpRS445GGFpRS435GGF+pRS434GAP-HMG1pRS435GGF+pRS434GAP-HMG1pRS435GGF+pRS434GAP-HMG1pRS435GGF+pRS434GAP-HMG1pRS445GGF+pRS434GAP-HMG1pRS435GGFpRS435GGFpRS435GGFpRS435GGFpRS445GGFpRS435GGFpRS435GGFpRS435GGFpRS435GGFpRS445GGF | Sc A451Sc A451Sc A451Sc A451Sc A451Sc A451Sc A451Sc A451Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc YPH499Sc EUG5Sc EUG5Sc EUG5Sc EUG5Sc EUG5Sc EUG12Sc EUG12Sc EUG12Sc EUG12Sc EUG12 | YMYMYMOYMOYMYMYMOYMOYMYMYMOYMOYMYMYMYMOYMOYMYMYMYMOYMOYMYMYMYMOYMOYM | 30303030303030303030303030303030303030303030303030303030 | 96168981689616896168961689616896961689616896961689616898961689616896 | 0.3540.2830.4751.000.5460.9280.3621.010.4580.3711.492.920.3172.101.282.465.861.015.207.321.2010.10.8614.671.182.385.023.25 |
435FGGHDEL | pRS435FGGHDELpRS445FGGHDELpRS435FGGHDEL+pRS434GAP-HMG1pRS445FGGHDEL+pRS434GAP-HMG1pRS435FGGHDELpRS445FGGHDELpRS435FGGHDELpRS435FGGHDELpRS445FGGHDEL | Sc YPH499Sc YPH488Sc YPH498Sc YPH499Sc EUG5Sc EUG5Sc EUG12Sc EUG12Sc EUG12 | YMYMYMYMYMYMYMYMYM | 303030303030303030 | 9696969696989816896 | 0.1210.0860.2940.3850.7860.5042.0411.430.521 |
435GGFHDEL pRS435GGFHDEL Sc A451 YM 30 96 0.072
pRS435GGFHDEL Sc A451 YMO 30 96 0.126
pRS435GGFHDEL+pRS434GAP-HMG1 Sc A451 YM 30 96 0.540
pRS435GGFHDEL+pRS434GAP-HMG1 Sc A451 YM 30 168 0.760
pRS435GGFHDEL+PRS434GAP-HMG1 Sc A451 YMO 30 96 0.414
pRS435GGFHDEL+pRS434GAP-HMG1 Sc A451 YMO 30 168 3.49
pRS435GGFHDEL Sc YPH499 YM 30 96 0.805
pRS435GGFHDEL Sc YPH499 YM 30 168 0.541
pRS435GGFHDEL Sc YPH499 YMO 30 96 1.69
pRS435GGFHDEL Sc YPH499 YMO 30 168 2.50
pRS435GGFHDEL+pRS434GAP-HMG1 Sc YPH499 YM 30 96 1.90
pRS435GGFHDEL+pRS434GAP-HMG1 Sc YPH499 YM 30 168 2.45
pRS435GGFHDEL+pRS434GAP-HMG1 Sc YPH499 YMO 30 96 2.60
pRS435GGFHDEL Sc EUG5 YM 30 95 5.78
pRS435GGFHDEL Sc EUG5 YM 30 168 6.99
pRS435GGFHDEL Sc EUG5 YMO 30 96 3.97
pRS435GGFHDEL Sc EUG5 YMO 30 168 10.6
pRS435GGFHDEL Sc EUG12 YM 30 96 3.00
pRS435GGFHDEL Sc EUG12 YM 30 168 1.43
pRS435GGFHDEL Sc EUG12 YMO 30 96 2.33
pRS435GGFHDEL Sc EUG12 YMO 30 168 5.78
EUG(ERG9p::URA3-GAL1p)
- Sc EUG5 YM 30 96 0.179
- Sc EUG5 YM 30 168 0.230
- Sc EUG5 YMO 30 96 0.232
HMG1Δ pRS434GAP-HMG026 Sc EUG5 YM 30 96 0.093
pRS434GAP-HMG044 Sc EUG5 YM 30 96 0.087
pRS434GAP-HMG056 Sc EUG5 YM 30 96 0.108
pRS434GAP-HMG062 Sc ELG5 YM 30 96 0.132
pRS434GAP-HMG076 Sc EUG5 YM 30 96 0.148
pRS434GAP-HMG081 Sc EUG5 YM 30 96 0.140
pRS434GAP-HMG100 Sc ELG5 YM 30 96 0.184
pRS434GAP-1MG112 Sc EUG5 YM 30 96 0.340
pRS434GAP-HMG122 Sc EUG5 YM 30 96 0.127
pRS434GAP-HMG133 Sc EUG5 YM 30 96 0.714
HMG1 pRS434GAP-HMG1 Sc EUG8 YM 30 96 0.074
BTS1 pRS435GAP-BTS1 Sc EUG8 YM 30 96 1.42
pRS445GAP-BTS1 Sc EUG8 YM 30 96 0.067
- Sc EUG12 YM 30 72 0.081
- Sc EUG12 YM 30 96 0.194
- Sc EUG12 YM 30 168 0.335
HMG1 pRS434GAP-HMG1 Sc EUG12 YM 30 96 0.705
pRS444GAP-HMG1 Sc EUG12 YM 30 96 2.05
ERG20 pRS435GAP-ERG20 Sc EUG12 YM 30 72 6.63
pRS435GAP-ERG20 Sc EUG12 YM 30 96 0.260
pRS445GAP-ERG20 Sc EUG12 YM 30 96 0.381
BTS1 pRS435GAP-BTS1 Sc EUG12 YM 30 96 1.75
pRS445GAP-BTS1 Sc EUG12 YM 30 96 3.70
HMG1Δ pRS434GAP-HMG026 Sc EUG12 YM 30 96 0.629
pRS434GAP-HMG044 Sc EUG12 YM 30 96 0.428
pRS434GAP-HMG056 Sc EUG12 YM 30 96 0.402
pRS434GAP-HMG062 Sc EUG12 YM 30 96 0.445
pRS434GAP-HMG076 Sc EUG12 YM 30 96 0.479
pRS434GAP-HMG081 Sc EUG12 YM 30 96 0.488
pRS434GAP-HMG100 Sc EUG12 YM 30 96 0.440
pRS434GAP-HMG112 Sc EUG12 YM 30 96 0.534
pRS434GAP-HMG122 Sc EUG12 YM 30 96 0.499
pRS434GAP-HMG133 Sc EUG12 YM 30 96 0.440
- Sc EUG27 YM 30 96 0.063
HMG1 pRS434GAP-HMG1 Sc EUG27 YM 30 96 0.723
pRS444GAP-HMG1 Sc EUG27 YM 30 96 0.205
ERG20 pRS435GAP-ERG20 Sc EUG27 YM 30 96 0.661
pRS445GAP-ERG20 Sc EUG27 YM 30 96 0.297
BTS1 pRS435GAP-BTS1 Sc EUG27 YM 30 96 0.761
pRS445GAP-BTS1 Sc EUG27 YM 30 96 0.595
(4-2)在A451中ERG20和BTS1
在A451中表达融合基因而引起的异戊二烯醇产量的变化如图25所示。在图25中,“435GGF”代表pRS435GGF,“435GGFHDEL”代表pRS435GGFHDEL(在以下的图中,这些术语具有相同的意义)。OD600表示在600nm处的吸光度。图25也显示了当整合了非融合BTS1的表达载体被导入A451(435GG)中时的结果。
甚至导入pRS445GAP-BTS1(在该图中表示为“445GG/A451”)时,都能检测到平均0.44mg/L的GGOH产量。
(4-3)在YPH499中ERG20和BTS1的表达
在YPH499中表达融合基因所引起的异戊二烯醇产量的变化如图26所示。在图26中,“499”代表YPH499;“435GGF”代表pRS435GGF;“445GGFHDEL”代表pRS445GGFHDEL(在以下的图中,这些术语具有相同的意义)。图26也显示了当整合了非融合BTS1的表达载体被导入YPH499中时的结果。
当导入的是pRS435GAP-BTS1(该图中表示为“435GG/499”)时,能检测到平均0.11mg/L的GGOH产量。当导入的是整合有ERG20-BTS1融合基因的pRS435FGG(该图中表示为“435FGG/499”)时,能检测到平均0.20mg/L的GGOH产量。当导入的是pRS435GGF(该图中表示为“435GGF/499”)时,能检测到平均0.39mg/L的GGOH产量。当导入的是pRS35GGFHDEL(该图中表示为“435GGFHDEL/499”)时,能检测到平均0.62mg/L的GGOH产量。这就验证:融合基因和HDEL序列对改进GGOH产率是有效的。
(4-4)在YPH499中HMG1、ERG20和BTS1的表达
本发明者认为,通过在克隆pRS434GAP-HMG1/YPH499(YH1)和pRS434TEF-HMG1/YPH499中共表达HMG1和其它基因,是有可能从而获得具有更高GGOH产率的克隆的。克隆pRS434GAP-HMG1/YPH499(YH1)和pRS434TEF-HMG1/YPH499是通过导入一个HMG1表达载体到YPH499中而获得的。
图27显示的是:当把一个非融合EPG20或BTS1整合的表达载体进一步导入宿主克隆时,异戊二烯醇的产量,该宿主克隆是通过将pRS434TEF-HMG1导入到YPH499中而获得的。图27中,“434TEFp-HMG1”表示导入pRS434TEF-HMG1的克隆。TEFp是TEF2基因的转录启动子。“499”表示YPH499。“435F”表示pRS435F,“445F”表示pRS445F。(这些术语在下面的其它图中也具有相同的意义)当单独把BTS1导入YPH499中时,GGOH产量仅为0.11mg/L。另一方面,当把BTS1表达载体导入TEF2p-HMG1-转化的克隆中时,GGOH产量平均为0.40mg/L(见图27中的“435GG&434TEFp-HMG1/499”);当把BTS1表达载体导入GAPp-HMG1-转化的克隆中时,产出GGOH为0.49mg/L(见图27中的“435GG&434GAPp-HMG1/499”)。这就显示出通过HMG辅酶A还原酶基因和一个异戊二烯基二磷酸合酶基因的共表达,构建一个大量制备异戊二烯醇体系的可能性。
随后,以GAPp-HMG1转化的YH1(pRS434GAP-HMG1/YPH499)为宿主,本发明制备的ERG20-BTS1融合基因或含HDEL信号的基因与TDH3转录启动子GAPp(TDH3p)一起在该宿主中得以表达。测定所得克隆的异戊二烯醇的产量。
结果如图28所示。图28中,“434GAPp-HMG1”表示导入pRS434GAP-HMG1的克隆。GAPp是TDH3基因的转录启动子。当一个连接有HDEL信号的异戊二烯基二磷酸合酶基因和HMG1共表达时,GGOH产率可以提高。通过导入ERG20-BTS1融合基因可以进一步提高产率。尤其是pRS435GGF转化克隆与pRS435GGFHDEL转化克隆表现出显著的提高。它们分别平均产出1.55mg/L和1.50mg/L的GGOH。(见图28中“435GGF & 434GAPp-HMG1/499”和“435GGFHEDL &434GAPp-HMG1/499”)
(4-5)在含大豆油的培养基中的异戊二烯醇产量
ERG20-BTS1融合基因转化的克隆,即本发明构建的产GGOH重组体,在YM7(YM,pH7)培养基和YMO(YM7,0.1%ADEKANOL LG109,1%大豆油)培养基中各培养4-7天,继而测定异戊二烯醇产量。以A451衍生的克隆为宿主得到的结果如图29A和29B所示。以YPH499衍生的克隆为宿主得到的结果如图30A和30B所示。在图29A和29B中,“AH1”代表pRS434GAPp-HMG1/451,“GGFHDEL”代表pRS435GGFHDEL。“-1”代表培养4天后的产量,“-2”代表培养7天后的产量。由于细胞悬浮在YMO培养基中的大豆油里,细胞量以细胞的个数表示。“103细胞/μl”意指每微升的细胞个数除以1000。
在YM7培养基中培养7天的pRS435GGF/A451平均产出0.26mg/L的GGOH(图29A;上图;GGF/A451-2),在YMO培养基中产量增加到平均0.98mg/L的GGOH(图29B;上图;GGF/A451-2)。同样,当以AH1(pRS434GAP-HMG1转化的A451)为宿主时,pRS435GGFHDEL转化的重组体平均产出2.5mg/L的GGOH(图29B;中图;GGFHDEL/AH1-2),在YMO培养基中最多产出3.5mg/L的GGOH(表11;见基因名称“435GGFHDEL”下面一行)。甚至当以EUG5为宿主时,该宿主的获得是通过将A451的ERG9转录启动子替换为GAL1启动子(见实施例6),pRS435GGF转化的重组体在YMO培养基中也能产出更多的GGOH。当在YM7培养基中培养7天该重组体,可以平均产出6.6mg/L的GGOH(图29A;下图;GGF-EUG5-2),而在YMO培养基中培养7天该重组体,GGOH产量可增加到平均9.6mg/L(图29B;下图;GGF/EUG5-2)。
当以YPH499衍生的克隆为宿主时,也可观测到YMO培养基的使用对GGOH产率的促进作用(图30A和30B)。在YM7培养基培养7天的pRS435GGF转化的YPH499可以产出平均0.19mg/L的GGOH(图30A;上图;GGF/YPH499-2),而在YMO培养基中培养7天可以产出平均2.5mg/L的GGOH(图30B;上图;GGF/YPH499-2)。进一步地,将pRS435GGF或pRS435GGFHDEL导入共表达HMG1的YH1中,并在YMO培养基中培养7天后,两种重组体均产出平均5.6mg/L的GGOH(图30B;中图;GGF/YH1-2和GGFHDEL/YH1-2)。当以EUG12为宿主时,该宿主构建自YPH499,且构建方式与构建EUG5的方式相同(见实施例6),pRS435GGF转化的重组体或pRS435GGFHDEL转化的重组体产出大约3.7-4.0mg/L的GGOH。因此建议:可用HMG1和一个异戊二烯基二磷酸合酶基因的组合提高YPH499衍生的克隆的GGOH产率。
[实施例11]培养基中不同葡萄糖-半乳糖组分对异戊二烯醇制备的作用
(1)载体导入宿主及宿主的培养
该实施例中,检测了在芽殖酵母中异戊二烯醇的产出如何随着不同葡萄糖-半乳糖(Glc-Gal)组成比例变化。另外,也检测了BTS1-ERG20融合基因的表达对异戊二烯醇制备的作用。
采用购自Zymo Research(Orange,CA)的Frozen EZ酵母转化II型试剂盒将载体导入酵母宿主。就用于BTS1-ERG20融合基因的表达载体而言,采用pRS435GGF和pRS435GGFHDEL。就宿主而言,可用A451、YPH499、AH1、EUG5和EUG12。将每一种得到的转化体分别培养在以适当的营养缺陷体为指示因子的基于SGR的选择性培养基的琼脂平板上。为克隆,进行2次在选择性培养基琼脂平板上的培养。
将制备的转化体预培养在SGR选择性培养基上。然后,把0.01-0.05ml的预培养液加入到1-5ml的YM7培养基中,在18mm直径的试管中,30℃、130rpm反复振荡培养。预先制备五种有下列糖组分(Glc-Gal组成比例)的YM7培养基:0%Glc-100%Gal;20%Glc-80%Gal;50%Glc-50%Gal;75%Glc-25%Gal;和100%Glc-0%Gal。首先,在这些培养基中30℃、130rpm反复振荡培养细胞。培养两天后,向每种培养基中进一步加入Glc,以达到5%的终浓度(w/v)。继续培养细胞到第七天。
(2)结果
(2-1)通过A451制备GGOH
当pRS435GGF和pRS435GGFHDEL被分别导入A451时的GGOH产量如图31A-31C所示。在两种情况中,检测出很少的GGOH。典型地,当初始葡萄糖比例为20%时,培养4天的pRS435GGFHDEL/A451显示出最高的GGOH产量(平均为0.56mg/L)。
(2-2)通过AH1制备GGOH
当pRS435GGF和pRS435GGFHDEL被分别导入AH1时的GGOH产量如图33A-32C所示。当初始葡萄糖比例为20%时,培养2-4天的BTS1-ERG20融合基因转化的AH1克隆也显示出最高的GGOH产量(3.32mg/L)。当初始葡萄糖比例为50-80%时,培养7天的这些克隆显示出最高的GGOH产量(平均4.13mg/L)。
(2-3)通过EUG5制备GGOH
当pRS435GGF和pRS435GGFHDEL被分别导入EUG5时的GGOH产量如图33A-33C所示。当初始葡萄糖比例为20-80%时,培养2-4天的pRS435GGF转化克隆和pRS435GGFHDEL转化克隆中获得了好的结果。
(2-4)通过YPH499制备GGOH
当pRS435GGF和pRS435GGFHDEL被分别导入YPH499时的GGOH产量如图34A-34C所示。同通过A451制备GGOH的情况一样,检测出很少的GGOH。
(2-5)通过YH1制备GGOH
当pRS435GGF和pRS435GGFHDEL被分别导入YH1时的GGOH产量如图35A-35C所示。当初始葡萄糖比例为100%时,培养7天的重组体克隆获得了高的GGOH产量。
(2-6)通过EUG12制备
当pRS435GGF和pRS435GGFHDEL被分别导入EUG12时的GGOH产量如图36A-36C所示。当初始葡萄糖比例为20%时,两种重组体克隆都显示出高的异戊二烯醇产率。当pRS435GGF/EUG12在初始葡萄糖比例为20%时培养4天,尽管细胞量相当于OD600=1.1,该克隆仍产出7.6mg/L的FOH及5.4mg/L的GGOH。考虑到细胞平均产量,这些制备结果被认为是非常有效。
[实施例12]pRS435GGF/YH1和15-2的发酵罐培养
(1)pRS435GGF/YH1
为大量制备GGOH,将可以优先产出5.6mg/LGGOH(见实施例10)的pRS435GGF/YH1克隆按下文描述的条件在发酵罐中培养。
<发酵罐培养基>
5%葡萄糖(YM本身含1%葡萄糖。因此,葡萄糖的终浓度为6%)
YM肉汤(Difco)
3%大豆油(Nacalai Tesque)
0.1%ADEKANOL LG109(Asahi Denka)
<操作条件>
培养设备:MSJ-U 10L培养设备(B.E.Marubishi)
培养基体积:5L
培养温度:33℃
通气速率:1vvm
搅拌速率:300rpm
pH:用4N的氢氧化钠溶液和2N的盐酸溶液按下列参数适当地控制:
比例区 1.00
不敏感区 0.15
控制周期 16秒
全冲程 1秒
最小冲程 0秒
结果是,培养RS435GGF/YH1克隆115小时产出128mg/L的GGOH。与此同时,鲨烯(SQ)、FOH和橙花叔醇(NOH)的产量分别为15mg/L、5mg/L和几乎为0。因此,发明者成功构建了一个通过发酵大量单独制备GGOH的体系(图37)。
(2)15-2克隆
将实施例7中描述的15-2克隆(保留pYHMG044的AURGG101)从斜面接种到GSM-URA(BIO101)+DOB(BI0101)培养基(200ml,置于一个配有一个阀门的500ml三颈瓶),并于30℃、130rpm培养2天。随后,重复3次离心(1500rpm,5分钟,4℃)和用灭菌的生理盐水洗涤的操作,以完全将培养液中所含的葡萄糖除去。接着,与上文(1)中培养pRS435GGF/YH1所用的条件相同,在发酵罐中培养50ml的该预培养液。然而,采用的培养基如下所示,培养温度为26℃。
<发酵罐培养基>
5%半乳糖(YM含1%葡萄糖。因此,糖的终浓度为6%)
无氨基酸YMB(Difco)
1%大豆油(Nacalai Tesque)
0.1%ADEKANOL LG109(Asahi Denka)
结果是,培养150小时的15-2克隆可以产出3mg/L的GGOH。(图38)
[实施例13]融合基因的表达
为了验证pRS435GGF转化的细胞和pRS435GGFHDEL转化的细胞是否表达各自的融合基因产物,需要通过RNA印迹杂交和蛋白质印迹杂交分析转录产物和翻译产物。
(1)RNA印迹杂交
图39显示的是:对YPH499衍生重组体和YH1衍生重组体的转录产物进行RNA印迹杂交分析的结果,YPH499衍生重组体中独立导入了pRS435GGF和pRS435GGFHDEL,YH1衍生重组体中独立导入了pRS435GGF和pRS435GGFHDEL。就探针而言,采用的是位于编码α微管蛋白的TUB1基因编码区的一个DNA片段和ERG20探针,BTS1探针和HMG1探针,后三种探针列于实施例7的表7中。TUB1探针的制备方式与实施例7描述的方式相同,即采用下文介绍的寡聚核苷酸TUB1f-2和TUB1r-2。RNA的制备及RNA印迹杂交以实施例7描述的方式进行。
TUB1f-2:5’-ACG GTA AGA AAT CCA AGC-3’(SEQ ID NO:119)
TUB1r-2:5’-TAT GAG TCG GCA CCC ACT-3’(SEQ ID NO:120)
图39中,“-”指来自那些没有导入含异戊二烯基二磷酸合酶基因质粒的细胞的RNA样品;“GGF”指来自pRS435GGF转化重组体的RNA样品;“HDEL”指来自pRS435GGFHDEL转化重组体的RNA样品;“HMG1”指来自YH1衍生的重组体的RNA样品。从采用探针TUB1(微管蛋白α基因)获得的结果看,可以认定从每一份制备样本中得到了几乎等量的信使RNA。当采用探针ERG20和探针BTS1时,检测出一个过表达、共有的长度为3.1kbp的条带,这说明融合基因被有效转录了。采用探针ERG20检测出长度为1.8kb的条带被认为是基因组中的野生型ERG20基因的转录产物。采用探针HMG1获得的结果显示:在所有预先导入pRS434GAP-HMG1的克隆(即YH1衍生的克隆;图39中标有“HMG1”的泳道中都大量检测出长度为4.1kbp的RNA(质粒来源HMG1的转录产物)。这说明即使异戊二烯基二磷酸合酶表达质粒被作为第二质粒进一步导入,HMG1的转录仍有效地进行。
(2)蛋白质印迹杂交
根据被ERG20和BTS1编码多肽的C端序列,化学合成具有下述氨基酸序列的多肽。以这些多肽为抗原,通过传统方法(描述在常见的实验手册如:F.M.Ausubel et al.Eds,Short Protocols inMolocular Biology,4th Edition,(1999)John Wiley & Sons,Inc.,New York)制备鼠抗体。将下列肽各2mg交联到KLH(KeyholeLimpet Hemocyanin),并用作抗原。
BTS1-C:NH2 Cys Tyr Ile Ile Asp His Leu Ser Glu Leu COOH(SEQ ID NO:121)
ERG20-C:NH2 Cys Leu Asn Lys Val Tyr Lys Arg Ser Lys COOH(SEQ ID NO:
122)
从6种菌株/克隆:YPH499、pRS435F/YPH499、pRS435GGF/YPH499、pRS435FGG/YPH499、pRS435GGFHDEL/YPH499和pRS435GGF/YH1中制备蛋白质的方法如下,由蛋白质印迹分析制备的蛋白质。简言之,将每种菌株/克隆的预培养液(检测600nm处的吸光度,并且每种培养掖需经生理盐水稀释以获得等量的细胞)接种至一种选择性培养基[对YPH499而言,采用SD培养基DOB(dropout base:碳源为葡萄糖的基本培养基),其中加入了可作为氨基酸或核酸组分的CSM(完全的补充混合物);对pRS435F/YPH499而言,采用SD-L培养基(缺少亮氨酸的SD培养基);对pRS435GGF/YPH499而言,采用SD-L培养基;对pRS435GG/YPH499而言,采用SD-L培养基;对pRS435GGFHDEL/YPH499而言,采用SD-L培养基;对pRS435GGF/YH1而言,采用SD-LW培养基(缺少亮氨酸和色氨酸的SD培养基)],并在30℃、130rpm振荡培养4天。离心收获细胞后,每1g细胞加入2ml的Y-PER(PIERCE,Rockford,IL),并于室温下剧烈摇动1小时以制备总蛋白溶液。20μg得到的总蛋白通过SDS-聚丙烯酰胺凝胶电泳(SDS-PAGE;操作步骤见Short Protocols in Molecular Biology,4th Edition,(1999)John Wiley & Sons,Inc.,New York)以分子量为基础分离开,再通过蛋白质印迹(见Short Protocols inMolecular Biology,4th Edition,(1999)John Wiley & Sons,Inc.,New York)检测导入得到的重组体的基因的表达情况。本节用到的蛋白质印迹技术在下列几点有部分改动。
1)用第一抗体处理PVDF膜的条件
常规步骤中,在TBST中的第一抗体的10-1000倍稀释液中振荡PVDF膜30-60分钟。
改动步骤中,在TBST中的上文提到的抗肽的2000倍稀释液中振荡PVDF膜60分钟。
2)PVDF膜的洗涤条件
常规步骤中,用200mlTBST溶液洗涤膜4次,每次15分钟。
改动步骤中,用80ml的TBST溶液洗涤膜5次,每次5分钟。
3)用第二抗体对PVDF膜的处理
常规步骤中,用封闭液将抗IgG(H+L)碱性磷酸酶结合物稀释200-2000倍,然后将PVDF膜浸泡其中并振荡30-60分钟。
改动步骤中,将PVDF膜浸泡在TBST中的抗IgG(H+L)碱性磷酸酶结合物4000倍稀释液中,并振荡30分钟。
4)检测抗原蛋白质条带的方法
常规步骤中,用TBST和TBS洗涤的PVDF膜浸泡在BCIP(4-溴-4-氯-3-吲哚氧基-磷酸盐)/NBT(氮蓝四唑)混合物中30分钟,以上色。
改动步骤中,用TBST和TBS洗涤的PVDF膜浸泡在含有稳定的底物小鼠溶液(Promega,Madison,WI)的ProtoBlot II AP System中20-40秒以上色。
蛋白质印迹分析结果如图40所示。在图40中,“M”代表分子标记泳道。“F”、“GGF”、“GGFHDEL”和“GGF/YH1”代表分别来自pRS435F/YPH499、pRS435GGF/YPH499、pRS435FGG/YPH499、pRS435GGFHDEL/YPH499和pRS435GGF/YH1的蛋白质电泳后的泳道。
当采用可检测被BTS1编码多肽的抗BTS1-C小鼠抗体时,可检测出相当于大约79kDa融合蛋白(分别为GGF、FGG和GGFHDEL)的多肽(见图40,空心三角标记表示迁移率的条带)。从这些结果中,发现来源自ERG20-BTS1融合基因的FPP合酶-GGPP合酶融合蛋白实际上在pRS435GGF转化重组体和pRS435GGFHDEL转化重组体中被表达。同样,预期由于在来自YPH499的蛋白质中没有检测出条带,在非重组细胞中被BTS1基因编码的GGPP合酶几乎没有表达。
当采用可检测被ERG20编码多肽的抗ERG20-C小鼠抗体时,结果显示分子量(大约40kDa)相当于被ERG20编码的FPP合酶的蛋白质的表达水平在pRS435F转化克隆中有提高(泳道“F”)(实心三角标记表示迁移率的条带)。在pRS435GGF转化克隆(泳道“GGF”)和pRS435GGFHDEL转化克隆(泳道“GGFHDEL”)中,检测出了相当于融合蛋白(GGF和GGFHDEL)的多肽(显示了对应于79kDa迁移率的条带,以空心三角标记)。用抗ERG20-C抗体没有检测出应该在pRS435FGG转化克隆中表达的融合基因。相信这种情况的发生是因为在融合酶中GGPP合酶被融合到的FPP合酶C端,使得可以识别FPP合酶的C端部分的抗ERG20-C抗体不能很好地识别融合酶。在采用抗ERG20-C抗体检测出的其它条带中,一个大约45kDa的条带被认为是一个非特异性检测的蛋白质。小于40kDa的条带被认为是非特异性检测的蛋白质或是含有被ERG20基因编码氨基酸序列的蛋白质的降解产物。
[实施例14]当HMG辅酶A还原酶基因和GGF融合基因共表达时制备GGOH
为了确定增强HMG辅酶A还原酶基因和编码GGPP合酶-FPP合酶融合蛋白的BTS1-ERG20融合基因的表达是否也有可能从除YPH499以外的菌株(ATCC76625、MATa ura 3-52 lys2-801 ade2-101 trp1Δ63his3Δ200 leu2Δ1)中获得有利于工业生产GGOH的克隆,采用pRS434GAP-HMG1、pRS435GGF和pRS435GGFHDEL转化下列菌株。
INVSc1(MATa/MATa ura3-52/ura3-52 trp1-289/trp1-289 his3Δ1/his3Δ1 leu2/leu2)
YPH500(ATCC76626,MATa ura3-52 lys2-801 ade2-101 trp1Δ63 his3Δ200 leu2Δ1)
YPH501(ATCC76627,MATa/MATa ura3-52/ura3-52 lys2-801/lys2-801 ade2-101/
ade2-101 trp1Δ63/trp1-Δ63 his3Δ200/his3Δ200 leu2Δ1/leu2Δ1)
W303-1A(ATCC208352,MATa leu2-3 leu2-112 his3-11 ade2-1 ura3-1 trp1-1 can1-100)
W303-1B(ATCC208353,MATa leu2-3 leu2-112 his3-11 ade2-1 ura3-1 trp1-1 can1-100)
简言之,将pRS434GAP-HMG1导入INVSc1、YPH500、YPH501、W303-1A和W303-1B,从而分别获得IH1、YH2、YH3、WH1和WH2。以这些重组体为宿主,再导入pRS435GGF,获得GGF/IH1、GGF/YH2、GGF/YH3、GGF/WH1和GGF/WH2。同样,导入pRS435GGFHDEL到这些宿主内,可获得HDEL/IH1、HDEL/YH2、HDEL/YH3、HDEL/WH1和HDEL/WH2。在YPHO7富集培养基里培养这些重组体,继而测定异戊二烯醇的产率。结果如图41所示。每种重组体都优先产出GGOH,并且它们中的大部分都可产出100mg/L或更多的GGOH。尤其是HDEL/YH2克隆可最多产出189mg/L的GGOH。
[实施例15]当共表达HMG-Co还原酶基因和GGF融合基因的克隆被转化成原养型并二倍体化后GGOH的制备
通过用相应的野生型基因替换导致营养缺陷体的突变基因,使产GGOH克隆GGF/YH1被转化为原养型微生物(一种无需在培养基中补充特定营养物质也可以生长的菌株),再通过与一个YPH500衍生的克隆杂交进行二倍体化,从而获得GGF/YH3-AHKU克隆。制备步骤描述如下。
(1)导入HIS3和ADE2到GGF/YH1及导入LYS2和URA3到YPH500
以用PvuII和XhoI消化的pRS403GAP为模板,以寡聚核苷酸HIS3-L(5’TTT TAA GAG CTT GGT GAG CGC 3’(SEQ ID NO:123))和HIS3-R(5’TCG AGT TCA AGA GAA AAA AAA 3’(SEQ ID NO:124))为引物DNA,按如下条件PCR制备一个HIS3片段。
0.1μg/L模板DNA 1μL
100pmol引物DNA1 1μL
100pmol引物DNA2 1μL
10×Pyrobest缓冲液 10μL
2mM dNTPmix 8μL
5u/μLPyrobest DNA聚合酶 0.5μL
H2O 78.5μL
采用相同的方式,即以经PvuII和XhoI消化的pRS406GAP为模板,以寡聚核苷酸URA3-L(5’TTC AAT TCA TCA TTT TTT TTT 3’(SEQID NO:125))和URA3-R(5’GGG TAA TAA CTG ATA TAA TTA 3’(SEQID NO:126))为模板DNA,制备URA3片段。
A451染色体DNA为模板,以ADE-1(5’ATG GAT TCT AGA ACAGTT GGT 3’(SEQ ID NO:127))和ADE-2(5’TTA CTT GTT TTC TAGATA AGC 3’(SEQ ID NO:128))或LYS-1(5’ATG ACT AAC GAA AAGGTC TGG 3’(SEQ ID NO:129))和LYS-2(5’TTA AGC TGC TGC GGAGCT TCC 3’(SEQ ID NO:130))为引物DNA,在相同的反应条件下制备ADE2片段和LYS2片段。将得到的HIS3片段和ADE2片段相继导入pRS435GGF/YH1,从而获得pRS435GGF/YH1-AH,它表现出无组氨酸需求和无腺嘌呤需求的表型。另一方面,将LYS2片段和URA3片段相继导入YPH500,从而获得YPHS00-KU,该重组体表现出无赖氨酸需求和无尿嘧啶需求的表型。
(2)杂交
在YM培养基中30℃培养pRS435GGF/YH1-AH和YPH500-KU,并在DOB(dropout base)琼脂平板培养基上划线,使得两种克隆彼此交错。然后,在30℃下培养细胞3天。挑出出现在平板上的菌落,并将之培养在前孢子形成平板培养基上(含1.6g酵母提取物、0.6g聚蛋白胨、100ml 20%葡萄糖和4g/L琼脂),继而在孢子形成平板培养基上(含2g乙酸钾、0.2g酵母提取物、500μl 20%葡萄糖和4g/L琼脂)培养。通过显微镜观察证实有孢子形成。因为可以在DOB平板(一种基本培养基)上生长,可以证实该克隆已转化为原养型,并且因为可以在孢子形成培养基上形成孢子,也证实该克隆已二倍体化。该克隆被命名为GGF/YH3-AHKU。
[实施例16]通过补料-分批培养制备GGOH(1)
在下述条件下对GGF/YH3-AHKU进行补料-分批培养,并测定GGOH产量。
(1)种子前培养
培养基组分如下:酵母提取物5g/L、麦芽提取物5g/L、Bacto蛋白胨10g/L、葡萄糖5g/L。
该培养基的pH未调节。将50ml该培养基置于一个500mlSakaguchi烧瓶中,并于120℃下灭菌20分钟。从斜面刮下一满接种环的GGF/YH3-AHKU,于120rpm、30℃下反复振荡培养24小时。OD(于562nm处测26倍的稀释液)达到0.4。
(2)种子培养
培养基组分如下:葡萄糖20g/L、MAMENO(Ajinomoto Co.,Inc.)310mg/L(以总氮量计算)、KH2PO43g/L、MgSO40.5g/L、硫酸铵5g/L、CaCl20.5g/L、和消泡剂0.1ml/L。如果MAMENO的总氮浓度为63g/L,则加入到培养基中的MAMENO的量为4.9ml/L。
完全各种溶解培养基组分后,用KOH溶液将每一组分的pH值调节到5.0。调整液体体积后,将每一组分在120℃下灭菌20分钟。
种子培养采用1升的小型发酵罐。将300ml的培养基置于发酵罐内,再接种0.06-1ml的种子前培养物。接种前,将培养基的pH调节到5.5。通气率为1/2vvm,温度设置为30℃。用氨水控制pH为5.5。搅拌,开始时为500rpm,然后设置为级联控制,使得溶解氧(DO)>20%。在pH值上升的时间点终止种子培养。OD(562nm处测51倍稀释液)达到0.18-0.2。上述的溶解氧量是以饱和量为100%进行计算。
(3)主培养
培养基组成如下表12所示。区块A、区块B、区块C和区块D的液体体积与主培养物总体积的比分别为20%、20%、30%和20%。用硫酸处理玉米漫渍液(CSL),使其pH值调整为2.0,然后在80℃预灭菌1小时。列于表12中的CSL浓度是根据总氮量表示的。加入了31.4g/L的CSL本身才达到该浓度。将每一区块在120℃下灭菌20分钟后,混合在一起。调整液体体积后,将混合液置于容器中。
表12
组分 浓度 区块 |
葡萄糖 2g/l 区块A |
MgSO4 1.7g/l 区块B硫酸铵 3g/lKH2PO4 10g/l |
CSL 2.3g/l 区块C消泡剂 0.26g/lH2SO4 pH=2.0预灭菌 80℃×60分钟 |
CaCl2 0.7g/l 区块D |
主培养采用1升的小型发酵罐。将10%种子培养液加入到上面的混合液后,罐内主培养总体积为300ml。由于灭菌后培养基的pH值在2.6左右,在接种前需要提高。通气速率为1/2vvm,温度设置为30℃。用氨水控制pH为5.5。搅拌,开始时为500rpm,然后设置为级联控制,使得溶解氧(DO)>20%。葡萄糖的补料开始于培养2小时后,以流速逐渐增加的方式进行,如图42所示。最大流速为3.5ml/小时,这相当于大约5.8g葡萄糖/L/小时。当通过搅拌难以保证足够的溶解氧(DO>20%)时,就增加通气体积。大约培养20小时后,补充预先确定量的补料液。与此同时,OD(562nm处测101倍的稀释液)达到0.9-1.0。
其后,根据培养条件改变补料和其它参数。
(1)培养条件的检验
在相同条件下培养细胞20小时后,检测葡萄糖和乙醇对GGOH生产的影响。
简言之,将400g/L的乙醇溶液(区块1)和500g/L的葡萄糖溶液(区块2)补料到培养液中。进一步将区块1中的乙醇溶液和区块2中的葡萄糖溶液1∶1混合以制备区块3。补料液的流速最大设置为3.5ml/小时,并且需控制得使培养液中的底物浓度为1.0g/L或更低。累积GGOH量如表13所示。发现通过补料可作为碳源的乙醇,增加了GGOH的累积量。
表13
GGOH累积量(g/l) | |
区块1 | 1.16 |
区块2 | 0.47 |
区块3 | 0.58 |
[实施例17]通过补料-分批培养制备GGOH(2)
将GGF/YH3-AHKU克隆接种至200ml的DOB(dropout base)葡萄糖基本培养基(Q BIOgene,Carlsbad,CA),在30℃下旋转培养3天。随后,将得到的培养液全部接种至3.35L的含0.09%葡萄糖、0.075%KH2PO4、0.14%硫酸镁、0.45%硫酸铵、5.4%玉米浸渍液、0.031%氯化钙和0.15%ADEKANOL LG109(Asahi Denka)的培养基(用氨水预调为pH5.5),并在下述条件下培养。采用罐1、罐2和罐3进行分批培养。
培养设备:MSJ-U2W(10L发酵罐)(B.E.Marubishi,Chiyoda-ku,Tokyo)
培养温度:33℃
通气速率:0.74vvm
搅拌速率:900rpm
pH5.5(用4N的氢氧化钠溶液和2N的盐酸溶液调节)
培养4小时后,开始补充40%的葡萄糖溶液。培养21小时后,将罐2的补料液改变为40%葡萄糖+3.3%乙酸铵溶液;罐3的补料液改变为1.65%乙酸铵+50%乙醇+20%葡萄糖溶液。然后,进一步培养细胞。以实施例8中所述方式将无菌操作取出的培养液样品进行异戊二烯醇的分析和定量检测。为保持培养基中糖浓度为0.1%或低于0.1%,需调整补料速率(最大为18.7g/h)。
结果显示该方法使FOH的产生最小化,并可通过微生物有效制备GGOH(表14)。通过补充除葡萄糖以及乙醇和乙酸铵,使培养基中的GGOH浓度达到2.5g/L。
表14
罐1(补料溶液:40%葡萄糖)
培养时间(小时) | 0 | 21 | 45 | 69 | 93 | 117 | 165 |
FOH(mg/L) | 0.0 | 2.0 | 1.0 | 8.0 | 19 | 24 | 23 |
GGOH(mg/L) | 0.0 | 27 | 35 | 190 | 660 | 840 | 890 |
OD600 | 0.0 | 52 | 86 | 93 | 120 | 120 | 110 |
罐2 (补料溶液:40%葡萄糖、3.3%乙酸铵)
培养时间(小时) | 0 | 21 | 45 | 69 | 93 | 117 | 165 |
FOH(mg/L) | 0.0 | 1.0 | 1.0 | 1.0 | 0.3 | 0.8 | 5.0 |
GGOH(mg/L) | 0.0 | 26 | 56 | 120 | 37 | 100 | 710 |
OD600 | 0.0 | 49 | 91 | 120 | 120 | 120 | 170 |
罐3(补料溶液:20%葡萄糖、1.65%乙酸铵、50%乙醇)
培养时间(小时) | 0 | 21 | 45 | 69 | 93 | 117 | 165 |
FOH(mg/L) | 0.0 | 2.0 | 4.0 | 17 | 33 | 38 | 77 |
GGOH(mg/L) | 0.0 | 30 | 210 | 550 | 830 | 1000 | 2500 |
OD600 | 0.0 | 63 | 160 | 160 | 120 | 140 | 120 |
此处引述的所有出版物、专利和专利申请在此引用作为参考。
工业实用性
本发明提供了制备异戊二醇烯醇的方法。既然有可能根据本发明提供的方法大量获得异戊二烯醇(尤其是香叶基香叶醇)的,因此可以用于生产生物体内重要的物质,及用作试剂,以发现活性异戊二烯醇新的生理学活性。这样,本发明的方法就具有应用价值。
序列表文本
SEQ ID NO:24:合成肽
SEQ ID NOS:25-120:合成DNA
SEQ ID NO:121:合成肽
SEQ ID NO:122:合成肽
SEQ ID NOS:123-130:合成DNA
序列表<110>丰田自动车株式会社<120>异戊二烯醇的制备方法<130>PH-1413-PCT<150>JP2000-403067<151>2000-12-28<160>130<170>PatentIn Ver.2.0<210>1<211>1059<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<220><221>CDS<222>(1)..(1056)<400>1atg gct tca gaa aaa gaa att agg aga gag aga ttc ttg aac gtt ttc 48Met Ala Ser Glu Lys Glu Ile Arg Arg Glu Arg Phe Leu Asn Val Phe1 5 10 15cct aaa tta gta gag gaa ttg aac gca tcg ctt ttg gct tac ggt atg 96Pro Lys Leu Val Glu Glu Leu Asn Ala Ser Leu Leu Ala Tyr Gly Met
20 25 30cct aag gaa gca tgt gac tgg tat gcc cac tca ttg aac tac aac act 144Pro Lys Glu Ala Cys Asp Trp Tyr Ala His Ser Leu Asn Tyr Asn Thr
35 40 45cca ggc ggt aag cta aat aga ggt ttg tcc gtt gtg gac acg tat gct 192Pro Gly Gly Lys Leu Asn Arg Gly Leu Ser Val Val Asp Thr Tyr Ala
50 55 60att ctc tcc aac aag acc gtt gaa caa ttg ggg caa gaa gaa tac gaa 240Ile Leu Ser Asn Lys Thr Val Glu Gln Leu Gly Gln Glu Glu Tyr Glu65 70 75 80aag gtt gcc att cta ggt tgg tgc att gag ttg ttg cag gct tac ttc 288Lys Val Ala Ile Leu Gly Trp Cys Ile Glu Leu Leu Gln Ala Tyr Phe
85 90 95ttg gtc gcc gat gat atg atg gac aag tcc att acc aga aga ggc caa 336Leu Val Ala Asp Asp Met Met Asp Lys Ser Ile Thr Arg Arg Gly Gln
100 105 110cca tgt tgg tac aag gtt cct gaa gtt ggg gaa att gcc atc aat gac 384Pro Cys Trp Tyr Lys Val Pro Glu Val Gly Glu Ile Ala Ile Asn Asp
115 120 125gca ttc atg tta gag gct gct atc tac aag ctt ttg aaa tct cac ttc 432Ala Phe Met Leu Glu Ala Ala Ile Tyr Lys Leu Leu Lys Ser His Phe
130 135 140aga aac gaa aaa tac tac ata gat atc acc gaa ttg ttc cat gag gtc 480Arg Asn Glu Lys Tyr Tyr Ile Asp Ile Thr Glu Leu Phe His Glu Val145 150 155 160acc ttc caa acc gaa ttg ggc caa ttg atg gac tta atc act gca cct 528Thr Phe Gln Thr Glu Leu Gly Gln Leu Met Asp Leu Ile Thr Ala Pro
165 170 175gaa gac aaa gtc gac ttg agt aag ttc tcc cta aag aag cac tcc ttc 576Glu Asp Lys Val Asp Leu Ser Lys Phe Ser Leu Lys Lys His Ser Phe
180 185 190ara gtt act ttc aag act gct tac tat tct ttc tac ttg cct gtc gca 624Ile Val Thr Phe Lys Thr Ala Tyr Tyr Ser Phe Tyr Leu Pro Val Ala
195 200 205ttg gcc atg tac gtt gcc ggt atc acg gat gaa aag gat ttg aaa caa 672Leu Ala Met Tyr Val Ala Gly Ile Thr Asp Glu Lys Asp Leu Lys Gln
210 215 220gcc aga gat gtc ttg att cca ttg ggt gaa tac ttc caa att caa gat 720Ala Arg Asp Val Leu Ile Pro Leu Gly Glu Tyr Phe Gln Ile Gln Asp225 230 235 240gac tac tta gac tgc ttc ggt acc cca gaa cag atc ggt aag atc ggt 768Asp Tyr Leu Asp Cys Phe Gly Thr Pro Glu Gln Ile Gly Lys Ile Gly
245 250 255aca gat atc caa gat aac aaa tgt tct tgg gta atc aac aag gca ttg 816Thr Asp Ile Gln Asp Asn Lys Cys Ser Trp Val Ile Asn Lys Ala Leu
260 265 270gaa ctt gct tcc gca gaa caa aga aag act tta gac gaa aat tac ggt 864Glu Leu Ala Ser Ala Glu Gln Arg Lys Thr Leu Asp Glu Asn Tyr Gly
275 280 285aag aag gac tca gtc gca gaa gcc aaa tgc aaa aag att ttc aat gac 912Lys Lys Asp Ser Val Ala Glu Ala Lys Cys Lys Lys Ile Phe Asn Asp
290 295 300ttg aaa att gaa cag cta tac cac gaa tat gaa gag tct att gcc aag 960Leu Lys Ile Glu Gln Leu Tyr His Glu Tyr Glu Glu Ser Ile Ala Lys305 310 315 320gat ttg aag gcc aaa att tct cag gtc gat gag tct cgt ggc ttc aaa 1008Asp Leu Lys Ala Lys Ile Ser Gln Val Asp Glu Ser Arg Gly Phe Lys
325 330 335gct gat gtc tta act gcg ttc ttg aac aaa gtt tac aag aga agc aaa 1056Ala Asp Val Leu Thr Ala Phe Leu Asn Lys Val Tyr Lys Arg Ser Lys
340 345 350tag 1059<210>2<211>352<212>PRT<213>啤酒糖酵母(saccharomyces cerevisiae)<400>2Met Ala Ser Glu Lys Glu Ile Arg Arg Glu Arg Phe Leu Asn Val Phe1 5 10 15Pro Lys Leu Val Glu Glu Leu Asn Ala Ser Leu Leu Ala Tyr Gly Met
20 25 30Pro Lys Glu Ala Cys Asp Trp Tyr Ala His Ser Leu Asn Tyr Asn Thr
35 40 45Pro Gly Gly Lys Leu Asn Arg Gly Leu Ser Val Val Asp Thr Tyr Ala
50 55 60Ile Leu Ser Asn Lys Thr Val Glu Gln Leu Gly Gln Glu Glu Tyr Glu65 70 75 80Lys Val Ala Ile Leu Gly Trp Cys Ile Glu Leu Leu Gln Ala Tyr Phe
85 90 95Leu Val Ala Asp Asp Met Met Asp Lys Ser Ile Thr Arg Arg Gly Gln
100 105 110Pro Cys Trp Tyr Lys Val Pro Glu Val Gly Glu Ile Ala Ile Asn Asp
115 120 125Ala Phe Met Leu Glu Ala Ala Ile Tyr Lys Leu Leu Lys Ser His Phe
130 135 140Arg Asn Glu Lys Tyr Tyr Ile Asp Ile Thr Glu Leu Phe His Glu Val145 150 155 160Thr Phe Gln Thr Glu Leu Gly Gln Leu Met Asp Leu Ile Thr Ala Pro
165 170 175Glu Asp Lys Val Asp Leu Ser Lys Phe Ser Leu Lys Lys His Ser Phe
180 185 190Ile Val Thr Phe Lys Thr Ala Tyr Tyr Ser Phe Tyr Leu Pro Val Ala
195 200 205Leu Ala Met Tyr Val Ala Gly Ile Thr Asp Glu Lys Asp Leu Lys Gln
210 215 220Ala Arg Asp Val Leu Ile Pro Leu Gly Glu Tyr Phe Gln Ile Gln Asp225 230 235 240Asp Tyr Leu Asp Cys Phe Gly Thr Pro Glu Gln Ile Gly Lys Ile Gly
245 250 255Thr Asp Ile Gln Asp Asn Lys Cys Ser Trp Val Ile Asn Lys Ala Leu
260 265 270Glu Leu Ala Ser Ala Glu Gln Arg Lys Thr Leu Asp Glu Asn Tyr Gly
275 280 285Lys Lys Asp Ser Val Ala Glu Ala Lys Cys Lys Lys Ile Phe Asn Asp
290 295 300Leu Lys Ile Glu Gln Leu Tyr His Glu Tyr Glu Glu Ser Ile Ala Lys305 310 315 320Asp Leu Lys Ala Lys Ile Ser Gln Val Asp Glu Ser Arg Gly Phe Lys
325 330 335Ala Asp Val Leu Thr Ala Phe Leu Asn Lys Val Tyr Lys Arg Ser Lys
340 345 350<210>3<211>900<212>DNA<213>大肠杆菌(Escherichia coli)<220><221>CDS<222>(1)..(897)<400>3atg gac ttt ccg cag caa ctc gaa gcc tgc gtt aag cag gcc aac cag 48Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15gcg ctg agc cgt ttt atc gcc cca ctg ccc ttt cag aac act ccc gtg 96Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30gtc gaa acc atg cag tat ggc gca tta tta ggt ggt aag cgc ctg cga 144Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45cct ttc ctg gtt tat gcc acc ggt cat atg ttc ggc gtt agc aca aac 192Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60acg ctg gac gca ccc gct gcc gcc gtt gag tgt atc cac gct tac tca 240Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Tyr Ser65 70 75 80tta att cat gat gat tta ccg gca atg gat gat gac gat ctg cgt cgc 288Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95ggt ttg cca acc tgc cat gtg aag ttt ggc gaa gca aac gcg att ctc 336Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110gct ggc gac gct tta caa acg ctg gcg ttc tcg att tta agc gat gcc 384Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125gat atg ccg gaa gtg tcg gac cgc gac aga att tcg atg att tct gaa 432Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140ctg gcg agc gcc agt ggt att gcc gga atg tgc ggt ggt cag gca tta 480Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160gat tta gac gcg gaa ggc aaa cac gta cct ctg gac gcg ctt gag cgt 528Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175att cat cgt cat aaa acc ggc gca ttg att cgc gcc gcc gtt cgc ctt 576Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190ggt gca tta agc gcc gga gat aaa gga cgt cgt gct ctg ccg gta ctc 624Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205gac aag tat gca gag agc atc ggc ctt gcc ttc cag gtt cag gat gac 672Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220atc ctg gat gtg gtg gga gat act gca acg ttg gga aaa cgc cag ggt 720Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240gcc gac cag caa ctt ggt aaa agt acc tac cct gca ctt ctg ggt ctt 768Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255gag caa gcc cgg aag aaa gcc cgg gat ctg atc gac gat gcc cgt cag 816Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270tcg ctg aaa caa ctg gct gaa cag tca ctc gat acc tcg gca ctg gaa 864Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285gcg cta gcg gac tac atc atc cag cgt aat aaa taa 900Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>4<211>299<212>PRT<213>大肠杆菌(Escherichia coli)<400>4Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Tyr Ser65 70 75 80Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110Ala Gly Asp AIa Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>5<211>1008<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<220><221>CDS<222>(1)..(1005)<400>5atg gag gcc aag ata gat gag ctg atc aat aat gat cct gtt tgg tcc 48Met Glu Ala Lys Ile Asp Glu Leu Ile Asn Asn Asp Pro Val Trp Serl 5 10 15agc caa aat gaa agc ttg att tca aaa cct tat aat cac atc ctt ttg 96Ser Gln Asn Glu Ser Leu Ile Ser Lys Pro Tyr Asn His Ile Leu Leu
20 25 30aaa cct ggc aag aac ttt aga cta aat tta ata gtt caa att aac aga l44Lys Pro Gly Lys Asn Phe Arg Leu Asn Leu Ile Val Gln Ile Asn Arg
35 40 45gtt atg aat ttg ccc aaa gac cag ctg gcc ata gtt tcg caa att gtt 192Val Met Asn Leu Pro Lys Asp Gln Leu Ala Ile Val Ser Gln Ile Val
50 55 60gag ctc ttg cat aat tcc agc ctt tta atc gac gat ata gaa gat aat 240Glu Leu Leu His Asn Ser Ser Leu Leu Ile Asp Asp Ile Glu Asp Asn65 70 75 80gct ccc ttg aga agg gga cag acc act tct cac tta atc ttc ggt gta 288Ala Pro Leu Arg Arg Gly Gln Thr Thr Ser His Leu Ile Phe Gly Val
85 90 95ccc tcc act ata aac acc gca aat tat atg tat ttc aga gcc atg caa 336Pro Ser Thr Ile Asn Thr Ala Asn Tyr Met Tyr Phe Arg Ala Met Gln
100 105 110ctt gta tcg cag cta acc aca aaa gag cct ttg tat cat aat ttg att 384Leu Val Ser Gln Leu Thr Thr Lys Glu Pro Leu Tyr His Asn Leu Ile
115 120 125acg att ttc aac gaa gaa ttg atc aat cta cat agg gga caa ggc ttg 432Thr IIe Phe Asn Glu Glu Leu Ile Asn Leu His Arg Gly Gln Gly Leu
130 135 140gat ata tac tgg aga gac ttt ctg cct gaa atc ata cct act cag gag 480Asp Ile Tyr Trp Arg Asp Phe Leu Pro Glu Ile Ile Pro Thr Gln Glu145 150 155 160atg tat ttg aat atg gtt atg aat aaa aca ggc ggc ctt ttc aga tta 528Met Tyr Leu Asn Met Val Met Asn Lys Thr Gly Gly Leu Phe Arg Leu
165 170 175acg ttg aga ctc atg gaa gcg ctg tct cct tcc tca cac cac ggc cat 576Thr Leu Arg Leu Met Glu Ala Leu Ser Pro Ser Ser His His Gly His
180 185 190tcg ttg gtt cct ttc ata aat ctt ctg ggt att att tat cag att aga 624Ser Leu Val Pro Phe Ile Asn Leu Leu Gly Ile Ile Tyr Gln Ile Arg
195 200 205gat gat tac ttg aat ttg aaa gat ttc caa atg tcc agc gaa aaa ggc 672Asp Asp Tyr Leu Asn Leu Lys Asp Phe Gln Met Ser Ser Glu Lys Gly
210 215 220ttt gct gag gac att aca gag ggg aag tta tct ttt ccc atc gtc cac 720Phe Ala Glu Asp Ile Thr Glu Gly Lys Leu Ser Phe Pro Ile Val His225 230 235 240gcc ctt aac ttc act aaa acg aaa ggt caa act gag caa cac aat gaa 768Ala Leu Asn Phe Thr Lys Thr Lys Gly Gln Thr Glu Gln His Asn Glu
245 250 255att cta aga att ctc ctg ttg agg aca agt gat aaa gat ata aaa cta 816Ile Leu Arg Ile Leu Leu Leu Arg Thr Ser Asp Lys Asp Ile Lys Leu
260 265 270aag ctg att caa ata ctg gaa ttc gac acc aat tca ttg gcc tac acc 864Lys Leu Ile Gln Ile Leu Glu Phe Asp Thr Asn Ser Leu Ala Tyr Thr
275 280 285aaa aat ttt att aat caa tta gtg aat atg ata aaa aat gat aat gaa 912Lys Asn Phe Ile Asn Gln Leu Val Asn Met Ile Lys Asn Asp Asn Glu
290 295 300aat aag tat tta cct gat ttg gct tcg cat tcc gac acc gcc acc aat 960Asn Lys Tyr Leu Pro Asp Leu Ala Ser His Ser Asp Thr Ala Thr Asn305 310 315 320tta cat gac gaa ttg tta tat ata ata gac cac tta tcc gaa ttg tga 1008Leu His Asp Glu Leu Leu Tyr Ile Ile Asp His Leu Ser Glu Leu
325 330 335<210>6<211>335<212>PRT<213>啤酒糖酵母(saccharomyces cerevisiae)<400>6Met Glu Ala Lys Ile Asp Glu Leu Ile Asn Asn Asp Pro Val Trp Ser1 5 10 15Ser Gln Asn Glu Ser Leu IIe Ser Lys Pro Tyr Asn His Ile Leu Leu
20 25 30Lys Pro Gly Lys Asn Phe Arg Leu Asn Leu Ile Val Gln Ile Asn Arg
35 40 45Val Met Asn Leu Pro Lys Asp Gln Leu Ala Ile Val Ser Gln Ile Val
50 55 60Glu Leu Leu His Asn Ser Ser Leu Leu Ile Asp Asp Ile Glu Asp Asn65 70 75 80Ala Pro Leu Arg Arg Gly Gln Thr Thr Ser His Leu Ile Phe Gly Val
85 90 95Pro Ser Thr Ile Asn Thr Ala Asn Tyr Met Tyr Phe Arg Ala Met Gln
100 105 110Leu Val Ser Gln Leu Thr Thr Lys Glu Pro Leu Tyr His Asn Leu Ile
115 120 125Thr Ile Phe Asn Glu Glu Leu Ile Asn Leu His Arg Gly Gln Gly Leu
130 135 140Asp Ile Tyr Trp Arg Asp Phe Leu Pro Glu Ile Ile Pro Thr Gln Glu145 150 155 160Met Tyr Leu Asn Met Val Met Asn Lys Thr Gly Gly Leu Phe Arg Leu
165 170 175Thr Leu Arg Leu Met Glu Ala Leu Ser Pro Ser Ser His His Gly His
180 185 190Ser Leu Val Pro Phe IIe Asn Leu Leu Gly Ile Ile Tyr Gln Ile Arg
195 200 205Asp Asp Tyr Leu Asn Leu Lys Asp Phe Gln Met Ser Ser Glu Lys Gly
210 215 220Phe Ala Glu Asp Ile Thr Glu Gly Lys Leu Ser Phe Pro Ile Val His225 230 235 240Ala Leu Asn Phe Thr Lys Thr Lys Gly Gln Thr Glu Gln His Asn Glu
245 250 255Ile Leu Arg IIe Leu Leu Leu Arg Thr Ser Asp Lys Asp Ile Lys Leu
260 265 270Lys Leu Ile Gln Ile Leu Glu Phe Asp Thr Asn Ser Leu Ala Tyr Thr
275 280 285Lys Asn Phe Ile Asn Gln Leu Val Asn Met Ile Lys Asn Asp Asn Glu
290 295 300Asn Lys Tyr Leu Pro Asp Leu Ala Ser His Ser Asp Thr Ala Thr Asn305 310 315 320Leu His Asp Glu Leu Leu Tyr Ile Ile Asp His Leu Ser Glu Leu
325 330 335<210>7<211>3165<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<220><221>CDS<222>(1)..(3162)<400>7atg ccg ccg cta ttc aag gga ctg aaa cag atg gca aag cca att gcc 48Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro IIe Ala1 5 10 15tat gtt tca aga ttt tcg gcg aaa cga cca att cat ata ata ctt ttt 96Tyr Val Ser Arg Phe Ser Ala Lys Arg Pro Ile His Ile Ile Leu Phe
20 25 30tct cta atc ata tcc gca ttc gct tat cta tcc gtc att cag tat tac 144Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr
35 40 45ttc aat ggt tgg caa cta gat tca aat agt gtt ttt gaa act gct cca 192Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro
50 55 60aat aaa gac tcc aac act cta ttt caa gaa tgt tcc cat tac tac aga 240Asn Lys Asp Ser Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg65 70 75 80gat tcc tct cta gat ggt tgg gta tca atc acc gcg cat gaa gct agt 288Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser
85 90 95gag tta cca gcc cca cac cat tac tat cta tta aac ctg aac ttc aat 336Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Asn
100 105 110agt cct aat gaa act gac tcc att cca gaa cta gct aac acg gtt ttt 384Ser Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe
115 120 125gag aaa gat aat aca aaa tat att ctg caa gaa gat ctc agt gtt tcc 432Glu Lys Asp Asn Thr Lys Tyr Ile Leu Gln Glu Asp Leu Ser Val Ser
130 135 140aaa gaa att tct tct act gat gga acg aaa tgg agg tta aga agt gac 480Lys Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp145 150 155 160aga aaa agt ctt ttc gac gta aag acg tta gca tat tct ctc tac gat 528Arg Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp
165 170 175gta ttt tca gaa aat gta acc caa gca gac ccg ttt gac gtc ctt att 576Val Phe Set Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile
180 185 190atg gtt act gcc tac cta atg atg ttc tac acc ata ttc ggc ctc ttc 624Met Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe
195 200 205aat gac atg agg aag acc ggg tca aat ttt tgg ttg agc gcc tct aca 672Asn Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr
210 215 220gtg gtc aat tct gca tca tca ctt ttc tta gca ttg tat gtc acc caa 720Val Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln225 230 235 240tgt att cta ggc aaa gaa gtt tcc gca tta act ctt ttt gaa ggt ttg 768Cys Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu
245 250 255cct ttc att gta gtt gtt gtt ggt ttc aag cac aaa atc aag att gcc 816Pro Phe Ile Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala
260 265 270cag tat gcc ctg gag aaa ttt gaa aga gtc ggt tta tct aaa agg att 864Gln Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile
275 280 285act acc gat gaa atc gtt ttt gaa tcc gtg agc gaa gag ggt ggt cgt 912Thr Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg
290 295 300ttg att caa gac cat ttg ctt tgt att ttt gcc ttt atc gga tgc tct 960Leu Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser305 310 315 320atg tat gct cac caa ttg aag act ttg aca aac ttc tgc ata tta tca 1008Met Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Cys Ile Leu Ser
325 330 335gca ttt atc cta att ttt gaa ttg att tta act cct aca ttt tat tct 1056Ala Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser
340 345 350gct atc tta gcg ctt aga ctg gaa atg aat gtt atc cac aga tct act 1104Ala Ile Leu Ala Leu Arg Leu Glu Met Asn Val Ile His Arg Ser Thr
355 360 365att atc aag caa aca tta gaa gaa gac ggt gtt gtt cca tct aca gca 1152Ile Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala
370 375 380aga atc att tct aaa gca gaa aag aaa tcc gta tct tct ttc tta aat 1200Arg Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn385 390 395 400ctc agt gtg gtt gtc att atc atg aaa ctc tct gtc ata ctg ttg ttt 1248Leu Ser Val Val Val Ile Ile Met Lys Leu Ser Val Ile Leu Leu Phe
405 410 415gtc ttc atc aac ttt tat aac ttt ggt gca aat tgg gtc aat gat gcc 1296Val Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala
420 425 430ttc aat tca ttg tacttc gat aag gaa cgt gtt tct cta cca gat ttt 1344Phe Asn Ser Leu Tyr Phe Asp Lys Glu Arg Val Ser Leu Pro Asp Phe
435 440 445att acc tcg aat gcc tct gaa aac ttt aaa gag caa gct att gtt agt 1392Ile Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile Val Ser
450 455 460gtc acc cca tta tta tat tac aaa ccc att aag tcc tac caa cgc att 1440Val Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile465 470 475 480gag gat atg gtt ctt cta ttg ctt cgt aat gtc agt gtt gcc att cgt 1488Glu Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser Val Ala Ile Arg
485 490 495gat agg ttc gtc agt aaa tta gtt ctt tcc gcc tta gta tgc agt gct 1536Asp Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala
500 505 510gtc atc aat gtg tat tta ttg aat gct gct aga att cat acc agt tat 1584Val Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr
515 520 525act gca gac caa ttg gtg aaa act gaa gtc acc aag aag tct ttt act 1632Thr Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
530 535 540gct cct gta caa aag gct tct aca cca gtt tta acc aat aaa aca gtc 1680Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val545 550 555 560att tct gga tcg aaa gtc aaa agt tta tca tct gcg caa tcg agc tca 1728Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
565 570 575tca gga cct tca tca tct agt gag gaa gat gat tcc cgc gat att gaa 1776Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
580 585 590agc ttg gat aag aaa ata cgt cct tta gaa gaa tta gaa gca tta tta 1824Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu
595 600 605agt agt gga aat aca aaa caa ttg aag aac aaa gag gtc gct gcc ttg 1872Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
610 615 620gtt att cac ggt aag tta cct ttg tac gct ttg gag aaa aaa tta ggt 1920Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly625 630 635 640gat act acg aga gcg gtt gcg gta cgt agg aag gct ctt tca att ttg 1968Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
645 650 655gca gaa gct cct gta tta gca tct gat cgt tta cca tat aaa aat tat 2016Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
660 665 670gac tac gac cgc gta ttt ggc gct tgt tgt gaa aat gtt ata ggt tac 2064Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
675 680 685atg cct ttg ccc gtt ggt gtt ata ggc ccc ttg gtt atc gat ggt aca 2112Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
690 695 700tct tat cat ata cca atg gca act aca gag ggt tgt ttg gta gct tct 2160Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser705 710 715 720gcc atg cgt ggc tgt aag gca atc aat gct ggc ggt ggt gca aca act 2208Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
725 730 735gtt tta act aag gat ggt atg aca aga ggc cca gta gtc cgt ttc cca 2256Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro
740 745 750act ttg aaa aga tct ggt gcc tgt aag ata tgg tta gac tca gaa gag 2304Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
755 760 765gga caa aac gca att aaa aaa gct ttt aac tct aca tca aga ttt gca 2352Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
770 775 780cgt ctg caa cat att caa act tgt cta gca gga gat tta ctc ttc atg 2400Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met785 790 795 800aga ttt aga aca act act ggt gac gca atg ggt atg aat atg att tct 2448Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
805 810 815aaa ggt gtc gaa tac tca tta aag caa atg gta gaa gag tat ggc tgg 2496Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
820 825 830gaa gat atg gag gtt gtc tcc gtt tct ggt aac tac tgt acc gac aaa 2544Glu Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys
835 840 845aaa cca gct gcc atc aac tgg atc gaa ggt cgt ggt aag agt gtc gtc 2592Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val
850 855 860gca gaa gct act att cct ggt gat gtt gtc aga aaa gtg tta aaa agt 2640Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser865 870 875 880gat gtt tcc gca ttg gtt gag ttg aac att gct aag aat ttg gtt gga 2688Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
885 890 895tct gca atg gct ggg tct gtt ggt gga ttt aac gca cat gca gct aat 2736Ser Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn
900 905 910tta gtg aca gct gtt ttc ttg gca tta gga caa gat cct gca caa aat 2784Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
915 920 925gtt gaa agt tcc aac tgt ata aca ttg atg aaa gaa gtg gac ggt gat 2832Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
930 935 940ttg aga att tcc gta tcc atg cca tcc atc gaa gta ggt acc atc ggt 2880Leu Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly945 950 955 960ggt ggt act gtt cta gaa cca caa ggt gcc atg ttg gac tta tta ggt 2928Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
965 970 975gta aga ggc ccg cat gct acc gct cct ggt acc aac gca cgt caa tta 2976Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
980 985 990gca aga ata gtt gcc tgt gcc gtc ttg gca ggt gaa tta tcc tta tgt 3024Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
995 1000 1005gct gcc cta gca gcc ggc cat ttg gtt caa agt cat atg acc cac aac 3072Ala Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn1010 1015 1020agg aaa cct gct gaa cca aca aaa cct aac aat ttg gac gcc act gat 3120Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp1025 1030 1035 1040ata aat cgt ttg aaa gat ggg tcc gtc acc tgc att aaa tcc taa 3165Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
1045 1050<210>8<21l>1054<212>PRT<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>8Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro Ile Ala1 5 10 15Tyr Val Ser Arg Phe Ser Ala Lys Arg Pro Ile His Ile Ile Leu Phe
20 25 30Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr
35 40 45Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro
50 55 60Asn Lys Asp Ser Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg65 70 75 80Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser
85 90 95Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Asn
100 105 110Ser Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe
115 120 125Glu Lys Asp Asn Thr Lys Tyr Ile Leu Gln Glu Asp Leu Ser Val Ser
130 135 140Lys Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp145 150 155 160Arg Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp
165 170 175Val Phe Ser Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile
180 185 190Met Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe
195 200 205Asn Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr
210 215 220Val Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln225 230 235 240Cys Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu
245 250 255Pro Phe Ile Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala
260 265 270Gln Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile
275 280 285Thr Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg
290 295 300Leu Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser305 310 315 320Met Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Cys Ile Leu Ser
325 330 335Ala Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser
340 345 350Ala Ile Leu Ala Leu Arg Leu Glu Met Asn Val Ile His Arg Ser Thr
355 360 365Ile Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala
370 375 380Arg Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn385 390 395 400Leu Ser Val Val Val Ile Ile Met Lys Leu Ser Val Ile Leu Leu Phe
405 410 415Val Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala
420 425 430Phe Asn Ser Leu Tyr Phe Asp Lys Glu Arg Val Ser Leu Pro Asp Phe
435 440 445Ile Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile Val Ser
450 455 460Val Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile465 470 475 480Glu Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser Val Ala Ile Arg
485 490 495Asp Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala
500 505 510Val Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr
515 520 525Thr Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
530 535 540Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val545 550 555 560Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
565 570 575Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
580 585 590Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu
595 600 605Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
610 615 620Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly625 630 635 640Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
645 650 655Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
660 665 670Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
675 680 685Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
690 695 700Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser705 710 715 720Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
725 730 735Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro
740 745 750Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
755 760 765Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
770 775 780Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met785 790 795 800Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
805 810 815Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
820 825 830Glu Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys
835 840 845Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val850 855 860Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser865 870 875 880Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
885 890 895Ser Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn
900 905 910Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
915 920 925Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
930 935 940Leu Arg Ile 5er Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly945 950 955 960Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
965 970 975Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
980 985 990Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
995 1000 1005Ala Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn1010 1015 1020Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp1025 1030 1035 1040Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
1045 1050<210>9<211>3165<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<220><22l>CDS<222>(1)..(3162)<400>9atg ccg ccg cta ttc aag gga ctg aaa cag atg gca aag cca att gcc 48Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro Ile Ala1 5 10 15tat gtt tca aga ttt tcg gcg aaa cga cca att cat ata ata ctt ttt 96Tyr Val Ser Arg Phe Ser Ala Lys Arg Pro Ile His Ile Ile Leu Phe
20 25 30tct cta atc ata tcc gca ttc gct tat cta tcc gtc att cag tat tac 144Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr
35 40 45ttc aat ggt tgg caa cta gat tca aat agt gtt ttt gaa act gct cca 192Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro
50 55 60aat aaa gac ttc aac act cta ttt caa gaa tgt tcc cat tac tac aga 240Asn Lys Asp Phe Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg65 70 75 80gat tcc tct cta gat ggt tgg gta tca atc acc gcg cat gaa gct agt 288Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser
85 90 95gag tta cca gcc cca cac cat tac tat cta tta aac ctg aac ttc aat 336Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Asn
100 105 110agt cct aat gaa act gac tcc att cca gaa cta gct aac acg gtt ttt 384Ser Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe
115 120 125gag aaa gat aat aca aaa tat att ctg caa gaa gat ctc agc gtt tcc 432Glu Lys Asp Asn Thr Lys Tyr Ile Leu Gln Glu Asp Leu Ser Val Ser
130 135 140aaa gaa att tct tct act gat gga acg aaa tgg agg tta aga agt gac 480Lys Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp145 150 155 160aga aaa agt ctt ttc gac gta aag acg tta gca tat tct ctc tac gat 528Arg Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp
165 170 175gta ttt tca gaa aat gta acc caa gca gac ccg ttt gac gtc ctt att 576Val Phe Ser Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile
180 185 190atg gtt act gcc tac cta atg atg ttc tac acc ata ttc ggc ctc ttc 624Met Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe
195 200 205aat gac atg agg aag acc ggg tca aat ttt tgg ttg agc gcc tct aca 672Asn Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr
210 215 220gtg gtc aat tct gca tca tca ctt ttc tta gca ttg tat gtc acc caa 720Val Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln225 230 235 240tgt att cta ggc aaa gaa gtt tcc gca tta act ctt ttt gaa ggt ttg 768Cys Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu
245 250 255cct ttc att gta gtt gtt gtt ggt ttc aag cac aaa atc aag att gcc 816Pro Phe Ile Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala
260 265 270cag tat gcc ctg gag aaa ttt gaa aga gtc ggt tta tct aaa agg att 864Gln Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile
275 280 285act acc gat gaa atc gtt ttt gaa tcc gtg agc gaa gag ggt ggt cgt 912Thr Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg
290 295 300ttg att caa gac cat ttg ctt tgt att ttt gcc ttt atc gga tgc tct 960Leu Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser305 310 315 320atg tat gct cac caa ttg aag act ttg aca aac ttc tgc ata tta tca 1008Met Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Cys Ile Leu Ser
325 330 335gca ttt atc cta att ttc gaa ttg att tta act cct aca ttt tat tct 1056Ala Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser
340 345 350gct atc tta gcg ctt aga ctg gaa atg aat gtt atc cac aga tct act 1104Ala Ile Leu Ala Leu Arg Leu Glu Met Asn Val Ile His Arg Ser Thr
355 360 365att atc aag caa aca tta gaa gaa gac ggt gtt gtt cca tct aca gca 1152Ile Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala
370 375 380aga atc att tct aag gca gaa aag aaa tcc gta tct tct ttc tta aat 1200Arg Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn385 390 395 400ctc agt gtg gtt gtc att atc atg aaa ctc tct gtc ata ctg ttg ttc 1248Leu Ser Val Val Val Ile Ile Met Lys Leu Ser Val Ile Leu Leu Phe
405 410 415gtc ttc atc aac ttt tat aac ttt ggt gca aat tgg gtc aat gat gcc 1296Val Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala
420 425 430ttc aat tca ttg tac ttc gat aag gaa cgt gtt tct cta cca gat ttt 1344Phe Asn Ser Leu Tyr Phe Asp Lys Glu Arg ValSer Leu Pro Asp Phe
435 440 445att acc tcg aat gcc tct gaa aac ttt aaa gag caa gct att gtt agt 1392Ile Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile Val Ser
450 455 460gtc acc cca tta tta tat tac aaa ccc att aag tcc tac caa cgc att 1440Val Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile465 470 475 480gag gat atg gtt ctt cta ttg ctt cgt aat gtc agt gtt gcc att cgt 1488Glu Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser Val Ala Ile Arg
485 490 495gat agg ttc gtc agt aaa tta gtt ctt tcc gcc tta gta tgc agt gct 1536Asp Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala
500 505 510gtc atc aat gtg tat tta tta aat gct gct aga att cat acc agt tat 1584Val Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr
515 520 525act gca gac caa ttg gtg aag act gaa gtc acc aag aag tct ttt act 1632Thr Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
530 535 540gct cct gta caa aag gct tct aca cca gtt tta acc aat aaa aca gtc 1680Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val545 550 555 560att tct gga tcg aaa gtc aaa agt tta tca tct gcg caa tcg agc tca 1728Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
565 570 575tca gga cct tca tca tct agt gag gaa gat gat tcc cgc gat att gaa 1776Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
580 585 590agc ttg gat aag aaa ata cgt cct tta gaa gaa tta gaa gca tca tta 1824Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Ser Leu
595 600 605agt agt gga aat aca aaa caa ttg aag aac aaa gag gtc gct gcc ttg 1872Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
610 615 620gtt att cac ggt aag tta cct ttg tac gct ttg gag aaa aaa tta ggt 1920Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly625 630 635 640gat act acg aga gcg gtt gcg gta cgt agg aag gct ctt tca att ttg 1968Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
645 650 655gca gaa gct cct gta tta gca tct gat cgt tta cca tat aaa aat tat 2016Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
660 665 670gac tac gac cgc gta ttt ggc gct tgt tgt gaa aat gtt ata ggt tac 2064Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
675 680 685atg cct ttg ccc gtt ggt gtt ata ggc ccc ttg gtt atc gat ggt aca 2112Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
690 695 700tct tat cat ata cca atg gca act aca gag ggt tgt ttg gta gct tct 2160Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser705 710 715 720gcc atg cgt ggc tgt aag gca atc aat gct ggc ggt ggt gca aca act 2208Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
725 730 735gtt tta act aag gat ggt atg aca aga ggc cca gta gtc cgt ttc cca 2256Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phs Pro
740 745 750act ttg aaa aga tct ggt gcc tgt aag ata tgg tta gac tca gaa gag 2304Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
755 760 765gga caa aac gca att aaa aaa gct ttt aac tct aca tca aga ttt gca 2352Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
770 775 780cgt ctg caa cat att caa act tgt cta gca gga gat tta ctc ttc atg 2400Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met785 790 795 800aga ttt aga aca act act ggt gac gca atg ggt atg aat atg att tct 2448Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
805 810 815aag ggt gtc gaa tac tca tta aag caa atg gta gaa gag tat ggc tgg 2496Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
820 825 830gaa gat atg gag gtt gtc tcc gtt tct ggt aac tac tgt acc gac aaa 2544Glu Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys
835 840 845aaa cca gct gcc atc aac tgg atc gaa ggt cgt ggt aag agt gtc gtc 2592Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val
850 855 860gca gaa gct act att cct ggt gat gtt gtc aga aaa gtg tta aaa agt 2640Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser865 870 875 880gat gtt tcc gca ttg gtt gag ttg aac att gct aag aat ttg gtt gga 2688Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
885 890 895tct gca atg gct ggg tct gtt ggt gga ttt aac gca cgt gca gct aat 2736Ser Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala Arg Ala Ala Asn
900 905 910tta gtg aca gct gtt ttc ttg gca tta gga caa gat cct gca caa aat 2784Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
915 920 925gtc gaa agt tcc aac tgt ata aca ttg atg aaa gaa gtg gac ggt gat 2832Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
930 935 940ttg aga att tcc gta tcc atg cca tcc atc gaa gta ggt acc atc ggt 2880Leu Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly945 950 955 960ggt ggt act gtt cta gaa cca caa ggt gcc atg ttg gac tta tta ggt 2928Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
965 970 975gta aga ggc cca cat gct acc gct cct ggt acc aac gca cgt caa tta 2976Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
980 985 990gca aga ata gtt gcc tgt gcc gtc ttg gca ggt gaa tta tcc tta tgt 3024Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
995 1000 1005gct gcc cta gca gcc ggc cat ttg gtt caa agt cat atg acc cac aac 3072Ala Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn1010 1015 1020agg aaa cct gct gaa cca aca aaa cct aac aat ttg gac gcc act gat 3120Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp1025 1030 1035 1040ata aat cgt ttg aaa gat ggg tcc gtc acc tgc att aaa tcc taa 3165Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
1045 1050<210>10<211>1054<212>PRT<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>10Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro Ile Ala1 5 10 15Tyr Val Ser Arg Phe Ser Ala Lys Arg Pro Ile His Ile Ile Leu Phe
20 25 30Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr
35 40 45Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro
50 55 60Asn Lys Asp Phe Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg65 70 75 80Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser
85 90 95Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Asn
100 105 110Ser Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe
115 120 125Glu Lys Asp Asn Thr Lys Tyr Ile Leu Gln Glu Asp Leu Ser Val Ser
130 135 140Lys Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp145 150 155 160Arg Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp
165 170 175Val Phe Ser Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile
180 185 190Met Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe
195 200 205Asn Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr
210 215 220Val Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln225 230 235 240Cys Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu
245 250 255Pro Phe Ile Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala
260 265 270Gln Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile
275 280 285Thr Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg
290 295 300Leu Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser305 310 315 320Met Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Cys Ile Leu Ser
325 330 335Ala Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser
340 345 350Ala Ile Leu Ala Leu Arg Leu Glu Met Asn Val Ile His Arg Ser Thr
355 360 365Ile Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala
370 375 380Arg Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn385 390 395 400Leu Ser Val Val Val Ile Ile Met Lys Leu Ser Val Ile Leu Leu Phe
405 410 415Val Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala
420 425 430Phe Asn Ser Leu Tyr Phe Asp Lys Glu Arg Val Ser Leu Pro Asp Phe
435 440 445Ile Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile Val Ser
450 455 460Val Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile465 470 475 480Glu Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser Val Ala Ile Arg
485 490 495Asp Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala
500 505 510Val Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr
515 520 525Thr Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
530 535 540Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val545 550 555 560Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
565 570 575Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
580 585 590Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Ser Leu
595 600 605Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
610 615 620Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly625 630 635 640Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
645 650 655Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
660 665 670Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
675 680 685Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
690 695 700Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser705 710 715 720Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
725 730 735Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro
740 745 750Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
755 760 765Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
770 775 780Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met785 790 795 800Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
805 810 815Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
820 825 830Glu Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys
835 840 845Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val
850 855 860Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser865 870 875 880Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
885 890 895Ser Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala Arg Ala Ala Asn
900 905 910Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
915 920 925Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
930 935 940Leu Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly945 950 955 960Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
965 970 975Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
980 985 990Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
995 1000 1005Ala Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn1010 1015 1020Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu AspAla Thr Asp1025 1030 1035 1040Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
1045 1050<210>11<211>3165<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<220><221>CDS<222>(1)..(3162)<400>11atg ccg ccg cta ttc aag gga ctg aaa cag atg gca aag cca att gcc 48Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro Ile Alal 5 10 15tat gtt tca aga ttt tcg gcg aaa cga cca att cat ata ata ctt ttt 96Tyr Val Ser Arg Phe Ser Ala Lys Arg Pro Ile His Ile Ile Leu Phe
20 25 30tct cta atc ata tcc gca ttc gct tat cta tcc gtc att cag tat tac 144Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr
35 40 45ttc aat ggt tgg caa cta gat tca aat agt gtt ttt gaa act gct cca 192Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro
50 55 60aat aaa gac tcc aac act cta ttt caa gaa tgt tcc cat tac tac aga 240Asn Lys Asp Ser Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg65 70 75 80gat tcc tct cta gat ggt tgg gta tca atc acc gcg cat gaa gct agt 288Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser
85 90 95gag tta cca gcc cca cac cat tac tat cta tta aac ctg aac ttc aat 336Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Asn
100 105 1l0agt cct aat gaa act gac tcc att cca gaa cta gct aac acg gtt ttt 384Ser Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe
115 120 125gag aaa gat aat aca aaa tat att ctg caa gaa gat ctc agc gtt tcc 432Glu Lys Asp Asn Thr Lys Tyr Ile Leu Gln Glu Asp Leu Ser Val Ser
130 135 140aaa gaa att tct tct act gat gga acg aaa tgg agg tta aga agt gac 480Lys Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp145 150 155 160aga aaa agt ctt ttc gac gta aag acg tta gca tat tct ctc tac gat 528Arg Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp
165 170 175gta ttt tca gaa aat gta acc caa gca gac ccg ttt gac gtc ctt att 576Val Phe Ser Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile
180 185 190atg gtt act gcc tac cta atg atg ttc tac acc ata ttc ggc ctc ttc 624Met Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe
195 200 205aat gac atg agg aag acc ggg tca aat ttt tgg ttg agc gcc tct aca 672Asn Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr
210 215 220gtg gtc aat tct gca tca tca ctt ttc tta gca ttg tat gtc acc caa 720Val Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln225 230 235 240tgt att cta ggc aaa gaa gtt tcc gca tta act ctt ttt gaa ggt ttg 768Cys Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu
245 250 255cct ttc att gta gtt gtt gtt ggt ttc aag cac aaa atc aag att gcc 816Pro Phe Ile Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala
260 265 270cag tat gcc ctg gag aaa ttt gaa aga gtc ggt tta tct aaa agg att 864Gln Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile
275 280 285act acc gat gaa atc gtt ttt gaa tcc gtg agc gaa gag ggt ggt cgt 912Thr Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg
290 295 300ttg att caa gac cat ttg crt tgt att ttt gcc ttt atc gga tgc tct 960Leu Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser305 310 315 320atg tat gct cac caa ttg aag act ttg aca aac ttc tgc ata tta tca 1008Met Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Cys Ile Leu Ser
325 330 335gca ttt atc cta att ttc gaa ttg att tta act cct aca ttt tat tct 1056Ala Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser
340 345 350gct atc tta gcg ctt aga ctg gaa atg aat gtt atc cac aga tct act 1104Ala Ile Leu Ala Leu Arg Leu Glu Met Asn Val Ile His Arg Ser Thr
355 360 365att atc aag caa aca tta gaa gaa gac ggt gtt gtt cca tct aca gca 1152Ile Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala
370 375 380aga atc att tct aag gca gaa aag aaa tcc gta tct tct ttc tta aat 1200Arg Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn385 390 395 400ctc agt gtg gtt gtc att atc atg aaa ctc tct gtc ata ctg ttg ttc 1248Leu Ser Val Val Val Ile Ile Met Lys Leu Ser Val Ile Leu Leu Phe
405 410 415gtc ttc atc aac ttt tat aac ttt ggt gca aat tgg gtc aat gat gcc 1296Val Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala
420 425 430ttc aat tca ttg tacttc gat aag gaa cgt gtt tct cta cca gat ttt 1344Phe Asn Ser Leu Tyr Phe Asp Lys Glu Arg Val Ser Leu Pro Asp Phe
435 440 445att acc tcg aat gcc tct gaa aac ttt aaa gag caa gct att gtt agt 1392Ile Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile ValSer
450 455 460gtc acc cca tta tta tat tac aaa ccc att aag tcc tac caa cgc att 1440Val Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile465 470 475 480gag gat atg gtt ctt cta ttg ctt cgt aat gtc agt gtt gcc att cgt 1488Glu Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser ValAla Ile Arg
485 490 495gat agg ttc gtc agt aaa tta gtt ctt tcc gcc tta gta tgc agt gct 1536Asp Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala
500 505 510gtc atc aat gtg tat tta tta aat gct gct aga att cat acc agt tat 1584Val Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr
515 520 525act gca gac caa ttg gtg aag act gaa gtc acc aag aag tct ttt act 1632Thr Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
530 535 540gct cct gta caa aag gct tct aca cca gtt tta acc aat aaa aca gtc 1680Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asa Lys Thr Val545 550 555 560att tct gga tcg aaa gtc aaa agt tta tca tct gcg caa tcg agc tca 1728Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
565 570 575tca gga cct tca tca tct agt gag gaa gat gat tcc cgc gat att gaa 1776Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
580 585 590agc ttg gat aag aaa ata cgt cct tta gaa gaa tta gaa gca tta tta 1824Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu
595 600 605agt agt gga aat aca aaa caa ttg aag aac aaa gag gtc gct gcc ttg 1872Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
610 615 620gtt att cac ggt aag tta cct ttg tac gct ttg gag aaa aaa tta ggt 1920Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly625 630 635 640gat act acg aga gcg gtt gcg gta cgt agg aag gct ctt tca att ttg 1968Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
645 650 655gca gaa gct cct gta tta gca tct gat cgt tta cca tat aaa aat tat 2016Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
660 665 670gac tac gac cgc gta ttt ggc gct tgt tgt gaa aat gtt ata ggt tac 2064Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
675 680 685atg cct ttg ccc gtt ggt gtt ata ggc ccc ttg gtt atc gat ggt aca 2112Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
690 695 700tct tat cat ata cca atg gca act aca gag ggt tgt ttg gta gct tct 2160Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser705 710 715 720gcc atg cgt ggc tgt aag gca atc aat gct ggc ggt ggt gca aca act 2208Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
725 730 735gtt tta act aag gat ggt atg aca aga ggc cca gta gtc cgt ttc cca 2256Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro
740 745 750act ttg aaa aga tct ggt gcc tgt aag ata tgg tta gac tca gaa gag 2304Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
755 760 765gga caa aac gca att aaa aaa gct ttt aac tct aca tca aga ttt gca 2352Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
770 775 780cgt ctg caa cat att caa act tgt cta gca gga gat tta ctc ttc atg 2400Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met785 790 795 800aga ttt aga aca act act ggt gac gca atg ggt atg aat atg att tct 2448Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
805 810 815aag ggt gtc gaa tac tca tta aag caa atg gta gaa gag tat ggc tgg 2496Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
820 825 830gaa gat atg gag gtt gtc tcc gtt tct ggt aac tac tgt acc gac aaa 2544Glu Asp Met Glu Val Val ger Val Ser Gly Asn Tyr Cys Thr Asp Lys
835 840 845aaa cca gct gcc atc aac tgg atc gaa ggt cgt ggt aag agt gtc gtc 2592Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val
850 855 860gca gaa gct act att cct ggt gat gtt gtc aga aaa gtg tta aaa agt 2640Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser865 870 875 880gat gtt tcc gca ttg gtt gag ttg aac att gct aag aat ttg gtt gga 2688Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
885 890 895tct gca atg gct ggg tct gtt ggt gga ttt aac gca cat gca gct aat 2736Ser Ala Mer Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn
900 905 910tta gtg aca gct gtt ttc ttg gca tta gga caa gat cct gca caa aat 2784Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
915 920 925gtc gaa agt tcc aac tgt ata aca ttg atg aaa gaa gtg gac ggt gat 2832Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
930 935 940ttg aga att tcc gta tcc atg cca tcc atc gaa gta ggt acc atc ggt 2880Leu Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly945 950 955 960ggt ggt act gtt cta gaa cca caa ggt gcc atg ttg gac tta tta ggt 2928Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
965 970 975gta aga ggc cca cat gct acc gct cct ggt acc aac gca cgt caa tta 2976Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
980 985 990gca aga ata gtt gcc tgt gcc gtc ttg gca ggt gaa tta tcc tta tgt 3024Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
995 1000 1005gct gcc cta gca gcc ggc cat ttg gtt caa agt cat atg acc cac aac 3072Ala Ala Leu Ala Ala Gly His Leu Val Gin Ser His Met Thr His Asn1010 1015 1020agg aaa cct gct gaa cca aca aaa cct aac aat ttg gac gcc act gat 3120Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp1025 1030 1035 1040ata aat cgt ttg aaa gat ggg tcc gtc acc tgc att aaa tcc taa 3165Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
1045 1050<210>12<21l>1054<212>PRT<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>12Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro Ile Ala1 5 10 15Tyr Val Ser Arg Phe Ser Ala Lys Arg Pro Ile His Ile Ile Leu Phe
20 25 30Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr
35 40 45Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro
50 55 60Asn Lys Asp Ser Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg65 70 75 80Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser
85 90 95Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Asn
100 105 110Ser Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe
115 120 125Glu Lys Asp Asn Thr Lys Tyr Ile Leu Gln Glu Asp Leu Ser Val Ser
130 135 140Lys Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp145 150 155 160Arg Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp
165 170 175Val Phe Ser Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile
180 185 190Met Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe
195 200 205Asn Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr
210 215 220Val Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln225 230 235 240Cys Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu
245 250 255Pro Phe Ile Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala
260 265 270Gln Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile
275 280 285Thr Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg
290 295 300Leu Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser305 310 315 320Met Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Cys Ile Leu Ser
325 330 335Ala Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser
340 345 350Ala Ile Leu Ala Leu Arg Leu Glu Met Asn Val Ile His Arg Ser Thr
355 360 365Ile Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala
370 375 380Arg Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn385 390 395 400Leu Ser Val Val Val Ile Ile Met Lys Leu Ser Val Ile Leu Leu Phe
405 410 415Val Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala
420 425 430Phe Asn Ser Leu Tyr Phe Asp Lys Glu Arg Val Ser Leu Pro Asp Phe
435 440 445Ile Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile Val Ser
450 455 460Val Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile465 470 475 480Glu Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser Val Ala Ile Arg
485 490 495Asp Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala
500 505 510Val Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr
515 520 525Thr Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
530 535 540Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val545 550 555 560Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
565 570 575Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
580 585 590Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu
595 600 605Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
610 615 620Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly625 630 635 640Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
645 650 655Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
660 665 670Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
675 680 685Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
690 695 700Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu ValAla Ser705 710 715 720Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
725 730 735Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro
740 745 750Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
755 760 765Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
770 775 780Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met785 790 795 800Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
805 810 815Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
820 825 830Glu Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys
835 840 845Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val
850 855 860Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser865 870 875 880Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
885 890 895Ser Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn
900 905 910Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
915 920 925Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
930 935 940Leu Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly945 950 955 960Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
965 970 975Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
980 985 990Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
995 1000 1005Ala Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn1010 1015 1020Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp1025 1030 1035 1040Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
1045 1050<210>13<211>2925<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>13atgccgccgc tattcaaggg actgaaacag atggcaaagc caattgccta tgtttcaaga 60ttttcggcga aacgaccaat tcatataata cttttttctc taatcatatc cgcattcgct 120tatctatccg tcattcagta ttacttcaat ggttggcaac tagattcaaa tagtgttttt 180gaaactgctc caaataaaga cttcaacact ctatttcaag aatgttccca ttactacaga 240gattcctctc tagatggttg ggtatcaatc accgcgcatg aagctagtga gttaccagcc 300ccacaccatt actatctatt aaacctgaac ttcaatagtc ctaatgaaac tgactccatt 360ccagaactag ctaacacggt ttttgagaaa gataatacaa aatatattct gcaagaagat 420ctcagcgttt ccaaagaaat ttcttctact gatggaacga aatggaggtt aagaagtgac 480agaaaaagtc ttttcgacgt aaagacgtta gcatattctc tctacgatgt attttcagaa 540aatgtaaccc aagcagacca caaaatcaag attgcccagt atgccctgga gaaatttgaa 600agagtcggtt tatctaaaag gattactacc gatgaaatcg tttttgaatc cgtgagcgaa 660gagggtggtc gtttgattca agaccatttg ctttgtattt ttgcctttat cggatgctct 720atgtatgctc accaattgaa gactttgaca aacttctgca tattatcagc atttatccta 780attttcgaat tgattttaac tcctacattt tattctgcta tcttagcgct tagactggaa 840atgaatgtta tccacagatc tactattatc aagcaaacat tagaagaaga cggtgttgtt 900ccatctacag caagaatcat ttctaaggca gaaaagaaat ccgtatcttc tttcttaaat 960ctcagtgtgg ttgtcattat catgaaactc tctgtcatac tgttgttcgt cttcatcaac 1020ttttataact ttggtgcaaa ttgggtcaat gatgccttca attcattgta cttcgataag 1080gaacgtgttt ctctaccaga ttttattacc tcgaatgcct ctgaaaactt taaagagcaa 1140gctattgtta gtgtcacccc attattatat tacaaaccca ttaagtccta ccaacgcatt 1200gaggatatgg ttcttctatt gcttcgtaat gtcagtgttg ccattcgtga taggttcgtc 1260agtaaattag ttctttccgc cttagtatgc agtgctgtca tcaatgtgta tttattaaat 1320gctgctagaa ttcataccag ttatactgca gaccaattgg tgaagactga agtcaccaag 1380aagtctttta ctgctcctgt acaaaaggct tctacaccag ttttaaccaa taaaacagtc 1440atttctggat cgaaagtcaa aagtttatca tctgcgcaat cgagctcatc aggaccttca 1500tcatctagtg aggaagatga ttcccgcgat attgaaagct tggataagaa aatacgtcct 1560ttagaagaat tagaagcatc attaagtagt ggaaatacaa aacaattgaa gaacaaagag 1620gtcgctgcct tggttattca cggtaagtta cctttgtacg ctttggagaa aaaattaggt 1680gatactacga gagcggttgc ggtacgtagg aaggctcttt caattttggc agaagctcct 1740gtattagcat ctgatcgttt accatataaa aattatgact acgaccgcgt atttggcgct 1800tgttgtgaaa atgttatagg ttacatgcct ttgcccgttg gtgttatagg ccccttggtt 1860atcgatggta catcttatca tataccaatg gcaactacag agggttgttt ggtagcttct 1920gccatgcgtg gctgtaaggc aatcaatgct ggcggtggtg caacaactgt tttaactaag 1980gatggtatga caagaggccc agtagtccgt ttcccaactt tgaaaagatc tggtgcctgt 2040aagatatggt tagactcaga agagggacaa aacgcaatta aaaaagcttt taactctaca 2100tcaagatttg cacgtctgca acatattcaa acttgtctag caggagattt actcttcatg 2160agatttagaa caactactgg tgacgcaatg ggtatgaata tgatttctaa gggtgtcgaa 2220tactcattaa agcaaatggt agaagagtat ggctgggaag atatggaggt tgtctccgtt 2280tctggtaact actgtaccga caaaaaacca gctgccatca actggatcga aggtcgtggt 2340aagagtgtcg tcgcagaagc tactattcct ggtgatgttg tcagaaaagt gttaaaaagt 2400gatgtttccg cattggttga gttgaacatt gctaagaatt tggttggatc tgcaatggct 2460gggtctgttg gtggatttaa cgcacgtgca gctaatttag tgacagctgt tttcttggca 2520ttaggacaag atcctgcaca aaatgtcgaa agttccaact gtataacatt gatgaaagaa 2580gtggacggtg atttgagaat ttccgtatcc atgccatcca tcgaagtagg taccatcggt 2640ggtggtactg ttctagaacc acaaggtgcc atgttggact tattaggtgt aagaggccca 2700catgctaccg ctcctggtac caacgcacgt caattagcaa gaatagttgc ctgtgccgtc 2760ttggcaggtg aattatcctt atgtgctgcc ctagcagccg gccatttggt tcaaagtcat 2820atgacccaca acaggaaacc tgctgaacca acaaaaccta acaatttgga cgccactgat 2880ataaatcgtt tgaaagatgg gtccgtcacc tgcattaaat cctaa 2925<210>14<211>3090<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>14atgccgccgc tattcaaggg actgaaacag atggcaaagc caattgccta tgtttcaaga 60ttttcggcga aacgaccaat tcatataata cttttttctc taatcatatc cgcattcgct 120tatctatccg tcattcagta ttacttcaat ggttggcaac tagattcaaa tagtgttttt 180gaaactgctc caaataaaga cttcaacact ctatttcaag aatgttccca ttactacaga 240gattcctctc tagatggttg ggtatcaatc accgcgcatg aagctagtga gttaccagcc 300ccacaccatt actatctatt aaacctgaac ttcaatagtc ctaatgaaac tgactccatt 360ccagaactag ctaacacggt ttttgagaaa gataatacaa aatatattct gcaagaagat 420ctcagcgttt ccaaagaaat ttcttctact gatggaacga aatggaggtt aagaagtgac 480agaaaaagtc ttttcgacgt aaagacgtta gcatattctc tctacgatgt attttcagaa 540aatgtaaccc aagcagaccc gtttgacgtc cttattatgg ttactgccta cctaatgatg 600ttctacacca tattcggcct cttcaatgac atgaggaaga ccgggtcaaa tttttggttg 660agcgcctcta cagtggtcaa ttctgcatca tcacttttct tagcattgta tgtcacccaa 720tgtattctag gcaaagaagt ttccgcatta actctttttg aaggtttgcc tttcattgta 780gttgttgttg gtttcaagca caaaatcaag attgcccagt atgccctgga gaaatttgaa 840agagtcggtt tatctaaaag gattactacc gatgaaatcg tttttgaatc cgtgagcgaa 900gagggtggtc gtttgattca agaccatttg ctttgtattt ttgcctttat cggatgctct 960atgtatgctc accaattgaa gactttgaca aacttctgca tattatcagc atttatccta 1020attttcgaat tgattttaac tcctacattt tattctgcta tcttagcgct tagactggaa 1080atgaatgtta tccacagatc tactattatc aagcaaacat tagaagaaga cggtgttgtt 1140ccatctacag caagaatcat ttctaaggca gaaaagaaat ccgtatcttc taactttggt 1200gcaaattggg tcaatgatgc cttcaattca ttgtacttcg ataaggaacg tgtttctcta 1260ccagatttta ttacctcgaa tgcctctgaa aactttaaag agcaagctat tgttagtgtc 1320accccattat tatattacaa acccattaag tcctaccaac gcattgagga tatggttctt 1380ctattgcttc gtaatgtcag tgttgccatt cgtgataggt tcgtcagtaa attagttctt 1440tccgccttag tatgcagtgc tgtcatcaat gtgtatttat taaatgctgc tagaattcat 1500accagttata ctgcagacca attggtgaag actgaagtca ccaagaagtc ttttactgct 1560cctgtacaaa aggcttctac accagtttta accaataaaa cagtcatttc tggatcgaaa 1620gtcaaaagtt tatcatctgc gcaatcgagc tcatcaggac cttcatcatc tagtgaggaa 1680gatgattccc gcgatattga aagcttggat aagaaaatac gtcctttaga agaattagaa 1740gcatcattaa gtagtggaaa tacaaaacaa ttgaagaaca aagaggtcgc tgccttggtt 1800attcacggta agttaccttt gtacgctttg gagaaaaaat taggtgatac tacgagagcg 1860gttgcggtac gtaggaaggc tctttcaatt ttggcagaag ctcctgtatt agcatctgat 1920cgtttaccat ataaaaatta tgactacgac cgcgtatttg gcgcttgttg tgaaaatgtt 1980ataggttaca tgcctttgcc cgttggtgtt ataggcccct tggttatcga tggtacatct 2040tatcatatac caatggcaac tacagagggt tgtttggtag cttctgccat gcgtggctgt 2100aaggcaatca atgctggcgg tggtgcaaca actgttttaa ctaaggatgg tatgacaaga 2160ggcccagtag tccgtttccc aactttgaaa agatctggtg cctgtaagat atggttagac 2220tcagaagagg gacaaaacgc aattaaaaaa gcttttaact ctacatcaag atttgcacgt 2280ctgcaacata ttcaaacttg tctagcagga gatttactct tcatgagatt tagaacaact 2340actggtgacg caatgggtat gaatatgatt tctaagggtg tcgaatactc attaaagcaa 2400atggtagaag agtatggctg ggaagatatg gaggttgtct ccgtttctgg taactactgt 2460accgacaaaa aaccagctgc catcaactgg atcgaaggtc gtggtaagag tgtcgtcgca 2520gaagctacta ttcctggtga tgttgtcaga aaagtgttaa aaagtgatgt ttccgcattg 2580gttgagttga acattgctaa gaatttggtt ggatctgcaa tggctgggtc tgttggtgga 2640tttaacgcac gtgcagctaa tttagtgaca gctgttttct tggcattagg acaagatcct 2700gcacaaaatg tcgaaagttc caactgtata acattgatga aagaagtgga cggtgatttg 2760agaatttccg tatccatgcc atccatcgaa gtaggtacca tcggtggtgg tactgttcta 2820gaaccacaag gtgccatgtt ggacttatta ggtgtaagag gcccacatgc taccgctcct 2880ggtaccaacg cacgtcaatt agcaagaata gttgcctgtg ccgtcttggc aggtgaatta 2940tccttatgtg ctgccctagc agccggccat ttggttcaaa gtcatatgac ccacaacagg 3000aaacctgctg aaccaacaaa acctaacaat ttggacgcca ctgatataaa tcgtttgaaa 3060gatgggtccg tcacctgcat taaatcctaa 3090<210>15<211>2973<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>15atgccgccgc tattcaaggg actgaaacag atggcaaagc caattgccta tgtttcaaga 60ttttcggcga aacgaccaat tcatataata cttttttctc taatcatatc cgcattcgct 120tatctatccg tcattcagta ttacttcaat ggttggcaac tagattcaaa tagtgttttt 180gaaactgctc caaataaaga cttcaacact ctatttcaag aatgttccca ttactacaga 240gattcctctc tagatggttg ggtatcaatc accgcgcatg aagctagtga gttaccagcc 300ccacaccatt actatctatt aaacctgaac ttcaatagtc ctaatgaaac tgactccatt 360ccagaactag ctaacacggt ttttgagaaa gataatacaa aatatattct gcaagaagat 420ctcagcgttt ccaaagaaat ttcttctact gatggaacga aatggaggtt aagaagtgac 480agaaaaagtc ttttcgacgt aaagacgtta gcatattctc tctacgatgt attttcagaa 540aatgtaaccc aagcagaccc gtttgacgtc cttattatgg ttactgccta cctaatgatg 600ttctacacca tattcggcct cttcaatgac atgaggaaga ccgggtcaaa tttttggttg 660agcgcctcta cagtggtcaa ttctgcatca tcacttttct tagcattgta tgtcacccaa 720tgtattctag gcaaagaagt ttccgcatta actctttttg aaggtttgcc tttcattgta 780gttgttgttg gtttcaagca caaaatcaag attgcccagt atgccctgga gaaatttgaa 840agagtcggtt tatctaaaag gattactacc gatgaaatcg tttttgaatc cgtgagcgaa 900gagggtggtc gtttgattca agaccatttg ctttgtattt ttgcctttat cggatgctct 960atgtatgctc accaattgaa gactttgaca aacttctgca tattatcagc atttatccta 1020attttcgaat tgattttaac tcctacattt tattctgcta tcttagcgct tagactggaa 1080atgaatgtta tccacagatc tactattatc aagcaaacat tagaagaaga cggtgttgtt 1140ccatctacag caagaatcat ttctaaggca gaaaagaaat ccgtatcttc tttcttaaat 1200ctcagtgtgg ttgtcattat catgaaactc tctgtcatac tgttgttcgt cttcatcaac 1260ttttataact ttggtgcaaa ttgggtcaat gatgccttca attcattgta cttcgataag 1320gaacgtgttt ctctaccaga ttttattacc tcgaatgcct ctgaaaactt taaagagcaa 1380cataccagtt atactgcaga ccaattggtg aagactgaag tcaccaagaa gtcttttact 1440gctcctgtac aaaaggcttc tacaccagtt ttaaccaata aaacagtcat ttctggatcg 1500aaagtcaaaa gtttatcatc tgcgcaatcg agctcatcag gaccttcatc atctagtgag 1560gaagatgatt cccgcgatat tgaaagcttg gataagaaaa tacgtccttt agaagaatta 1620gaagcatcat taagtagtgg aaatacaaaa caattgaaga acaaagaggt cgctgccttg 1680gttattcacg gtaagttacc tttgtacgct ttggagaaaa aattaggtga tactacgaga 1740gcggttgcgg tacgtaggaa ggctctttca attttggcag aagctcctgt attagcatct 1800gatcgtttac catataaaaa ttatgactac gaccgcgtat ttggcgcttg ttgtgaaaat 1860gttataggtt acatgccttt gcccgttggt gttataggcc ccttggttat cgatggtaca 1920tcttatcata taccaatggc aactacagag ggttgtttgg tagcttctgc catgcgtggc 1980tgtaaggcaa tcaatgctgg cggtggtgca acaactgttt taactaagga tggtatgaca 2040agaggcccag tagtccgttt cccaactttg aaaagatctg gtgcctgtaa gatatggtta 2100gactcagaag agggacaaaa cgcaattaaa aaagctttta actctacatc aagatttgca 2160cgtctgcaac atattcaaac ttgtctagca ggagatttac tcttcatgag atttagaaca 2220actactggtg acgcaatggg tatgaatatg atttctaagg gtgtcgaata ctcattaaag 2280caaatggtag aagagtatgg ctgggaagat atggaggttg tctccgtttc tggtaactac 2340tgtaccgaca aaaaaccagc tgccatcaac tggatcgaag gtcgtggtaa gagtgtcgtc 2400gcagaagcta ctattcctgg tgatgttgtc agaaaagtgt taaaaagtga tgtttccgca 2460ttggttgagt tgaacattgc taagaatttg gttggatctg caatggctgg gtctgttggt 2520ggatttaacg cacgtgcagc taatttagtg acagctgttt tcttggcatt aggacaagat 2580cctgcacaaa atgtcgaaag ttccaactgt ataacattga tgaaagaagt ggacggtgat 2640ttgagaattt ccgtatccat gccatccatc gaagtaggta ccatcggtgg tggtactgtt 2700ctagaaccac aaggtgccat gttggactta ttaggtgtaa gaggcccaca tgctaccgct 2760cctggtacca acgcacgtca attagcaaga atagttgcct gtgccgtctt ggcaggtgaa 2820ttatccttat gtgctgccct agcagccggc catttggttc aaagtcatat gacccacaac 2880aggaaacctg ctgaaccaac aaaacctaac aatttggacg ccactgatat aaatcgtttg 2940aaagatgggt ccgtcacctg cattaaatcc taa 2973<210>16<211>2457<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>16atgccgccgc tattcaaggg actgaaacag atggcaaagc caattgccta tgtttcaaga 60ttttcggcga aacgaccaat tcatataata cttttttctc taatcatatc cgcattcgct 120tatctatccg tcattcagta ttacttcaat ggttggcaac tagattcaaa tagtgttttt 180gaaactgctc caaataaaga cttcaacact ctatttcaag aatgttccca ttactacaga 240gattcctctc tagatggttg ggtatcaatc accgcgcatg aagctagtga gttaccagcc 300ccacaccatt actatctatt aaacctgaac ttcaatagtc ctaatgaaac tgactccatt 360ccagaactag ctaacacggt ttttgagaaa gataatacaa aatatattct gcaagaagat 420ctcagcgttt ccaaagaaat ttcttctact gatggaacga aatggaggtt aagaagtgac 480agaaaaagtc ttttcgacgt aaagacgtta gcatattctc tctacgatgt attttcagaa 540aatgtaaccc aagcagacaa ctttggtgca aattgggtca atgatgcctt caattcattg 600tacttcgata aggaacgtgt ttctctacca gattttatta cctcgaatgc ctctgaaaac 660tttaaagagc aagctattgt tagtgtcacc ccattattat attacaaacc cattaagtcc 720taccaacgca ttgaggatat ggttcttcta ttgcttcgta atgtcagtgt tgccattcgt 780gataggttcg tcagtaaatt agttctttcc gccttagtat gcagtgctgt catcaatgtg 840tatttattaa atgctgctag aattcatacc agttatactg cagaccaatt ggtgaagact 900gaagtcacca agaagtcttt tactgctcct gtacaaaagg cttctacacc agttttaacc 960aataaaacag tcatttctgg atcgaaagtc aaaagtttat catctgcgca atcgagctca 1020tcaggacctt catcatctag tgaggaagat gattcccgcg atattgaaag cttggataag 1080aaaatacgtc ctttagaaga attagaagca tcattaagta gtggaaatac aaaacaattg 1140aagaacaaag aggtcgctgc cttggttatt cacggtaagt tacctttgta cgctttggag 1200aaaaaattag gtgatactac gagagcggtt gcggtacgta ggaaggctct ttcaattttg 1260gcagaagctc ctgtattagc atctgatcgt ttaccatata aaaattatga ctacgaccgc 1320gtatttggcg cttgttgtga aaatgttata ggttacatgc ctttgcccgt tggtgttata 1380ggccccttgg ttatcgatgg tacatcttat catataccaa tggcaactac agagggttgt 1440ttggtagctt ctgccatgcg tggctgtaag gcaatcaatg ctggcggtgg tgcaacaact 1500gttttaacta aggatggtat gacaagaggc ccagtagtcc gtttcccaac tttgaaaaga 1560tctggtgcct gtaagatatg gttagactca gaagagggac aaaacgcaat taaaaaagct 1620tttaactcta catcaagatt tgcacgtctg caacatattc aaacttgtct agcaggagat 1680ttactcttca tgagatttag aacaactact ggtgacgcaa tgggtatgaa tatgatttct 1740aagggtgtcg aatactcatt aaagcaaatg gtagaagagt atggctggga agatatggag 1800gttgtctccg tttctggtaa ctactgtacc gacaaaaaac cagctgccat caactggatc 1860gaaggtcgtg gtaagagtgt cgtcgcagaa gctactattc ctggtgatgt tgtcagaaaa 1920gtgttaaaaa gtgatgtttc cgcattggtt gagttgaaca ttgctaagaa tttggttgga 1980tctgcaatgg ctgggtctgt tggtggattt aacgcacgtg cagctaattt agtgacagct 2040gttttcttgg cattaggaca agatcctgca caaaatgtcg aaagttccaa ctgtataaca 2100ttgatgaaag aagtggacgg tgatttgaga atttccgtat ccatgccatc catcgaagta 2160ggtaccatcg gtggtggtac tgttctagaa ccacaaggtg ccatgttgga cttattaggt 2220gtaagaggcc cacatgctac cgctcctggt accaacgcac gtcaattagc aagaatagtt 2280gcctgtgccg tcttggcagg tgaattatcc ttatgtgctg ccctagcagc cggccatttg 2340gttcaaagtc atatgaccca caacaggaaa cctgctgaac caacaaaacc taacaatttg 2400gacgccactg atataaatcg tttgaaagat gggtccgtca cctgcattaa atcctaa 2457<210>17<211>2151<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>17atgccgccgc tattcaaggg actgaaacag atggcaaagc caattgccta tgtttcaaga 60ttttcggcga aacgaccaat tcatataata cttttttctc taatcatatc cgcattcgct 120tatctatccg tcattcagta ttacttcaat ggttggcaac tagattcaaa tagtgttttt 180gaaactgctc caaataaaga cttcaacact ctatttcaag aatgttccca ttactacaga 240gattcctctc tagatggttg ggtatcaatc accgcgcatg aagctagtga gttaccagcc 300ccacaccatt actatctatt aaacctgaac ttcaatagtc ctaatgaaac tgactccatt 360ccagaactag ctaacacggt ttttgagaaa gataatacaa aatatattct gcaagaagat 420ctcagcgttt ccaaagaaat ttcttctact gatggaacga aatggaggtt aagaagtgac 480agaaaaagtc ttttcgacgt aaagacgtta gcatattctc tctacgatgt attttcagaa 540aatgtaaccc aagcagacca taccagttat actgcagacc aattggtgaa gactgaagtc 600accaagaagt cttttactgc tcctgtacaa aaggcttcta caccagtttt aaccaataaa 660acagtcattt ctggatcgaa agtcaaaagt ttatcatctg cgcaatcgag ctcatcagga 720ccttcatcat ctagtgagga agatgattcc cgcgatattg aaagcttgga taagaaaata 780cgtcctttag aagaattaga agcatcatta agtagtggaa atacaaaaca attgaagaac 840aaagaggtcg ctgccttggt tattcacggt aagttacctt tgtacgcttt ggagaaaaaa 900ttaggtgata ctacgagagc ggttgcggta cgtaggaagg ctctttcaat tttggcagaa 960gctcctgtat tagcatctga tcgtttacca tataaaaatt atgactacga ccgcgtattt 1020ggcgcttgtt gtgaaaatgt tataggttac atgcctttgc ccgttggtgt tataggcccc 1080ttggttatcg atggtacatc ttatcatata ccaatggcaa ctacagaggg ttgtttggta 1140gcttctgcca tgcgtggctg taaggcaatc aatgctggcg gtggtgcaac aactgtttta 1200actaaggatg gtatgacaag aggcccagta gtccgtttcc caactttgaa aagatctggt 1260gcctgtaaga tatggttaga ctcagaagag ggacaaaacg caattaaaaa agcttttaac 1320tctacatcaa gatttgcacg tctgcaacat attcaaactt gtctagcagg agatttactc 1380ttcatgagat ttagaacaac tactggtgac gcaatgggta tgaatatgat ttctaagggt 1440gtcgaatact cattaaagca aatggtagaa gagtatggct gggaagatat ggaggttgtc 1500tccgtttctg gtaactactg taccgacaaa aaaccagctg ccatcaactg gatcgaaggt 1560cgtggtaaga gtgtcgtcgc agaagctact attcctggtg atgttgtcag aaaagtgtta 1620aaaagtgatg tttccgcatt ggttgagttg aacattgcta agaatttggt tggatctgca 1680atggctgggt ctgttggtgg atttaacgca cgtgcagcta atttagtgac agctgttttc 1740ttggcattag gacaagatcc tgcacaaaat gtcgaaagtt ccaactgtat aacattgatg 1800aaagaagtgg acggtgattt gagaatttcc gtatccatgc catccatcga agtaggtacc 1860atcggtggtg gtactgttct agaaccacaa ggtgccatgt tggacttatt aggtgtaaga 1920ggcccacatg ctaccgctcc tggtaccaac gcacgtcaat tagcaagaat agttgcctgt 1980gccgtcttgg caggtgaatt atccttatgt gctgccctag cagccggcca tttggttcaa 2040agtcatatga cccacaacag gaaacctgct gaaccaacaa aacctaacaa tttggacgcc 2100actgatataa atcgtttgaa agatgggtcc gtcacctgca ttaaatccta a 2151<210>18<211>1620<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>18atgccgccgc tattcaaggg actgaaacat accagttata ctgcagacca attggtgaag 60actgaagtca ccaagaagtc ttttactgct cctgtacaaa aggcttctac accagtttta 120accaataaaa cagtcatttc tggatcgaaa gtcaaaagtt tatcatctgc gcaatcgagc 180tcatcaggac cttcatcatc tagtgaggaa gatgattccc gcgatattga aagcttggat 240aagaaaatac gtcctttaga agaattagaa gcatcattaa gtagtggaaa tacaaaacaa 300ttgaagaaca aagaggtcgc tgccttggtt attcacggta agttaccttt gtacgctttg 360gagaaaaaat taggtgatac tacgagagcg gttgcggtac gtaggaaggc tctttcaatt 420ttggcagaag ctcctgtatt agcatctgat cgtttaccat ataaaaatta tgactacgac 480cgcgtatttg gcgcttgttg tgaaaatgtt ataggttaca tgcctttgcc cgttggtgtt 540ataggcccct tggttatcga tggtacatct tatcatatac caatggcaac tacagagggt 600tgtttggtag cttctgccat gcgtggctgt aaggcaatca atgctggcgg tggtgcaaca 660actgttttaa ctaaggatgg tatgacaaga ggcccagtag tccgtttccc aactttgaaa 720agatctggtg cctgtaagat atggttagac tcagaagagg gacaaaacgc aattaaaaaa 780gcttttaact ctacatcaag atttgcacgt ctgcaacata ttcaaacttg tctagcagga 840gatttactct tcatgagatt tagaacaact actggtgacg caatgggtat gaatatgatt 900tctaagggtg tcgaatactc attaaagcaa atggtagaag agtatggctg ggaagatatg 960gaggttgtct ccgtttctgg taactactgt accgacaaaa aaccagctgc catcaactgg 1020atcgaaggtc gtggtaagag tgtcgtcgca gaagctacta ttcctggtga tgttgtcaga 1080aaagtgttaa aaagtgatgt ttccgcattg gttgagttga acattgctaa gaatttggtt 1140ggatctgcaa tggctgggtc tgttggtgga tttaacgcac gtgcagctaa tttagtgaca 1200gctgttttct tggcattagg acaagatcct gcacaaaatg tcgaaagttc caactgtata 1260acattgatga aagaagtgga cggtgatttg agaatttccg tatccatgcc atccatcgaa 1320gtaggtacca tcggtggtgg tactgttcta gaaccacaag gtgccatgtt ggacttatta 1380ggtgtaagag gcccacatgc taccgctcct ggtaccaacg cacgtcaatt agcaagaata 1440gttgcctgtg ccgtcttggc aggtgaatta tccttatgtg ctgccctagc agccggccat 1500ttggttcaaa gtcatatgac ccacaacagg aaacctgctg aaccaacaaa acctaacaat 1560ttggacgcca ctgatataaa tcgtttgaaa gatgggtccg tcacctgcat taaatcctaa 1620<210>19<211>1377<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>19atgccgccgc tattcaaggg actgaaagca tcattaagta gtggaaatac aaaacaattg 60aagaacaaag aggtcgctgc cttggttatt cacggtaagt tacctttgta cgctttggag 120aaaaaattag gtgatactac gagagcggtt gcggtacgta ggaaggctct ttcaattttg 180gcagaagctc ctgtattagc atctgatcgt ttaccatata aaaattatga ctacgaccgc 240gtatttggcg cttgttgtga aaatgttata ggttacatgc ctttgcccgt tggtgttata 300ggccccttgg ttatcgatgg tacatcttat catataccaa tggcaactac agagggttgt 360ttggtagctt ctgccatgcg tggctgtaag gcaatcaatg ctggcggtgg tgcaacaact 420gttttaacta aggatggtat gacaagaggc ccagtagtcc gtttcccaac tttgaaaaga 480tctggtgcct gtaagatatg gttagactca gaagagggac aaaacgcaat taaaaaagct 540tttaactcta catcaagatt tgcacgtctg caacatattc aaacttgtct agcaggagat 600ttactcttca tgagatttag aacaactact ggtgacgcaa tgggtatgaa tatgatttct 660aagggtgtcg aatactcatt aaagcaaatg gtagaagagt atggctggga agatatggag 720gttgtctccg tttctggtaa ctactgtacc gacaaaaaac cagctgccat caactggatc 780gaaggtcgtg gtaagagtgt cgtcgcagaa gctactattc ctggtgatgt tgtcagaaaa 840gtgttaaaaa gtgatgtttc cgcattggtt gagttgaaca ttgctaagaa tttggttgga 900tctgcaatgg ctgggtctgt tggtggattt aacgcacgtg cagctaattt agtgacagct 960gttttcttgg cattaggaca agatcctgca caaaatgtcg aaagttccaa ctgtataaca 1020ttgatgaaag aagtggacgg tgatttgaga atttccgtat ccatgccatc catcgaagta 1080ggtaccatcg gtggtggtac tgttctagaa ccacaaggtg ccatgttgga cttattaggt 1140gtaagaggcc cacatgctac cgctcctggt accaacgcac gtcaattagc aagaatagtt 1200gcctgtgccg tcttggcagg tgaattatcc ttatgtgctg ccctagcagc cggccatttg 1260gttcaaagtc atatgaccca caacaggaaa cctgctgaac caacaaaacc taacaatttg 1320gacgccactg atataaatcg tttgaaagat gggtccgtca cctgcattaa atcctaa 1377<210>20<211>1302<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>20atgccgccgc tattcaaggg actgaaacct ttgtacgctt tggagaaaaa attaggtgat 60actacgagag cggttgcggt acgtaggaag gctctttcaa ttttggcaga agctcctgta 120ttagcatctg atcgtttacc atataaaaat tatgactacg accgcgtatt tggcgcttgt 180tgtgaaaatg ttataggtta catgcctttg cccgttggtg ttataggccc cttggttatc 240gatggtacat cttatcatat accaatggca actacagagg gttgtttggt agcttctgcc 300atgcgtggct gtaaggcaat caatgctggc ggtggtgcaa caactgtttt aactaaggat 360ggtatgacaa gaggcccagt agtccgtttc ccaactttga aaagatctgg tgcctgtaag 420atatggttag actcagaaga gggacaaaac gcaattaaaa aagcttttaa ctctacatca 480agatttgcac gtctgcaaca tattcaaact tgtctagcag gagatttact cttcatgaga 540tttagaacaa ctactggtga cgcaatgggt atgaatatga tttctaaggg tgtcgaatac 600tcattaaagc aaatggtaga agagtatggc tgggaagata tggaggttgt ctccgtttct 660ggtaactact gtaccgacaa aaaaccagct gccatcaact ggatcgaagg tcgtggtaag 720agtgtcgtcg cagaagctac tattcctggt gatgttgtca gaaaagtgtt aaaaagtgat 780gtttccgcat tggttgagtt gaacattgct aagaatttgg ttggatctgc aatggctggg 840tctgttggtg gatttaacgc acgtgcagct aatttagtga cagctgtttt cttggcatta 900ggacaagatc ctgcacaaaa tgtcgaaagt tccaactgta taacattgat gaaagaagtg 960gacggtgatt tgagaatttc cgtatccatg ccatccatcg aagtaggtac catcggtggt 1020ggtactgttc tagaaccaca aggtgccatg ttggacttat taggtgtaag aggcccacat 1080gctaccgctc ctggtaccaa cgcacgtcaa ttagcaagaa tagttgcctg tgccgtcttg 1140gcaggtgaat tatccttatg tgctgcccta gcagccggcc atttggttca aagtcatatg 1200acccacaaca ggaaacctgc tgaaccaaca aaacctaaca atttggacgc cactgatata 1260aatcgtttga aagatgggtc cgtcacctgc attaaatcct aa 1302<210>21<211>1203<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>21atgccgccgc tattcaaggg actgaaatct gatcgtttac catataaaaa ttatgactac 60gaccgcgtat ttggcgcttg ttgtgaaaat gttataggtt acatgccttt gcccgttggt 120gttataggcc ccttggttat cgatggtaca tcttatcata taccaatggc aactacagag 180ggttgtttgg tagcttctgc catgcgtggc tgtaaggcaa tcaatgctgg cggtggtgca 240acaactgttt taactaagga tggtatgaca agaggcccag tagtccgttt cccaactttg 300aaaagatctg gtgcctgtaa gatatggtta gactcagaag agggacaaaa cgcaattaaa 360aaagctttta actctacatc aagatttgca cgtctgcaac atattcaaac ttgtctagca 420ggagatttac tcttcatgag atttagaaca actactggtg acgcaatggg tatgaatatg 480atttctaagg gtgtcgaata ctcattaaag caaatggtag aagagtatgg ctgggaagat 540atggaggttg tctccgtttc tggtaactac tgtaccgaca aaaaaccagc tgccatcaac 600tggatcgaag gtcgtggtaa gagtgtcgtc gcagaagcta ctattcctgg tgatgttgtc 660agaaaagtgt taaaaagtga tgtttccgca ttggttgagt tgaacattgc taagaatttg 720gttggatctg caatggctgg gtctgttggt ggatttaacg cacgtgcagc taatttagtg 780acagctgttt tcttggcatt aggacaagat cctgcacaaa atgtcgaaag ttccaactgt 840ataacattga tgaaagaagt ggacggtgat ttgagaattt ccgtatccat gccatccatc 900gaagtaggta ccatcggtgg tggtactgtt ctagaaccac aaggtgccat gttggactta 960ttaggtgtaa gaggcccaca tgctaccgct cctggtacca acgcacgtca attagcaaga 1020atagttgcct gtgccgtctt ggcaggtgaa ttatccttat gtgctgccct agcagccggc 1080catttggttc aaagtcatat gacccacaac aggaaacctg ctgaaccaac aaaacctaac 1140aatttggacg ccactgatat aaatcgtttg aaagatgggt ccgtcacctg cattaaatcc 1200taa 1203<210>22<211>975<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>22atgccgccgc tattcaaggg actgaaaaag gatggtatga caagaggccc agtagtccgt 60ttcccaactt tgaaaagatc tggtgcctgt aagatatggt tagactcaga agagggacaa 120aacgcaatta aaaaagcttt taactctaca tcaagatttg cacgtctgca acatattcaa 180acttgtctag caggagattt actcttcatg agatttagaa caactactgg tgacgcaatg 240ggtatgaata tgatttctaa gggtgtcgaa tactcattaa agcaaatggt agaagagtat 300ggctgggaag atatggaggt tgtctccgtt tctggtaact actgtaccga caaaaaacca 360gctgccatca actggatcga aggtcgtggt aagagtgtcg tcgcagaagc tactattcct 420ggtgatgttg tcagaaaagt gttaaaaagt gatgtttccg cattggttga gttgaacatt 480gctaagaatt tggttggatc tgcaatggct gggtctgttg gtggatttaa cgcacgtgca 540gctaatttag tgacagctgt tttcttggca ttaggacaag atcctgcaca aaatgtcgaa 600agttccaact gtataacatt gatgaaagaa gtggacggtg atttgagaat ttccgtatcc 660atgccatcca tcgaagtagg taccatcggt ggtggtactg ttctagaacc acaaggtgcc 720atgttggact tattaggtgt aagaggccca catgctaccg ctcctggtac caacgcacgt 780caattagcaa gaatagttgc ctgtgccgtc ttggcaggtg aattatcctt atgtgctgcc 840ctagcagccg gccatttggt tcaaagtcat atgacccaca acaggaaacc tgctgaacca 900acaaaaccta acaatttgga cgccactgat ataaatcgtt tgaaagatgg gtccgtcacc 960tgcattaaat cctaa 975<210>23<211>992<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>23ggatcctcta gctccctaac atgtaggtgg cggaggggag atatacaata gaacagatac 60cagacaagac ataatgggct aaacaagact acaccaatta cactgcctca ttgatggtgg 120tacataacga actaatactg tagccctaga cttgatagcc atcatcatat cgaagtttca 180ctaccctttt tccatttgcc atctattgaa gtaataatag gcgcatgcaa cttcttttct 240ttttttttct tttctctctc ccccgttgtt gtctcaccat atccgcaatg acaaaaaaat 300gatggaagac actaaaggaa aaaattaacg acaaagacag caccaacaga tgtcgttgtt 360ccagagctga tgaggggtat ctcgaagcac acgaaacttt ttccttcctt cattcacgca 420cactactctc taatgagcaa cggtatacgg ccttccttcc agttacttga atttgaaata 480aaaaaagttt gctgtcttgc tatcaagtat aaatagacct gcaattatta atcttttgtt 540tcctcgtcat tgttctcgtt ccctttcttc cttgtttctt tttctgcaca atatttcaag 600ctataccaag catacaatca actggtaccc gggtcgactc gagctctaga ggttaactaa 660gcgaatttct tatgatttat gatttttatt attaaataag ttataaaaaa aataagtgta 720tacaaatttt aaagtgactc ttaggtttta aaacgaaaat tcttattctt gagtaactct 780ttcctgtagg tcaggttgct ttctcaggta tagcatgagg tcgctcttat tgaccacatc 840tctaccggca tgccgagcaa atgcctgcaa atcgctcccc atttcaccca attgtagata 900tgctaactcc agcaatgagt tgatgaatct cggtgtgtat tttatgtcct cagaggacaa 960cacctgttgt aatcgttctt ccacacggat cc 992<210>24<211>4<212>PRT<213>人工序列<220><223>人工序列说明:合成肽<400>24His Asp Glu Leu1<210>25<211>894<212>DNA<213>嗜热脂肪芽孢杆菌(Bacillus stearothermophilus)<400>25gtggcgcagc tttcagttga acagtttctc aacgagcaaa aacaggcggt ggaaacagcg 60ctctcccgtt atatagagcg cttagaaggg ccggcgaagc tgaaaaaggc gatggcgtac 120tcattggagg ccggcggcaa acgaatccgt ccgttgctgc ttctgtccac cgttcgggcg 180ctcggcaaag acccggcggt cggattgccc gtcgcctgcg cgattgaaat gatccatacg 240tactctttga tccatgatga tttgccgagc atggacaacg atgatttgcg gcgcggcaag 300ccgacgaacc ataaagtgtt cggcgaggcg atggccatct tggcggggga cgggttgttg 360acgtacgcgt ttcaattgat caccgaaatc gacgatgagc gcatccctcc ttccgtccgg 420cttcggctca tcgaacggct ggcgaaagcg gccggtccgg aagggatggt cgccggtcag 480gcagccgata tggaaggaga ggggaaaacg ctgacgcttt cggagctcga atacattcat 540cggcataaaa ccgggaaaat gctgcaatac agcgtgcacg ccggcgcctt gatcggcggc 600gctgatgccc ggcaaacgcg ggagcttgac gaattcgccg cccatctagg ccttgccttt 660caaattcgcg atgatattct cgatattgaa ggggcagaag aaaaaatcgg caagccggtc 720ggcagcgacc aaagcaacaa caaagcgacg tatccagcgt tgctgtcgct tgccggcgcg 780aaggaaaagt tggcgttcca tatcgaggcg gcgcagcgcc atttacggaa cgccgacgtt 840gacggcgccg cgctcgccta tatttgcgaa ctggtcgccg cccgcgacca ttaa 894<210>26<211>1197<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>26atgtctcaga acgtttacat tgtatcgact gccagaaccc caattggttc attccagggt 60tctctatcct ccaagacagc agtggaattg ggtgctgttg ctttaaaagg cgccttggct 120aaggttccag aattggatgc atccaaggat tttgacgaaa ttatttttgg taacgttctt 180tctgccaatt tgggccaagc tccggccaga caagttgctt tggctgccgg tttgagtaat 240catatcgttg caagcacagt taacaaggtc tgtgcatccg ctatgaaggc aatcattttg 300ggtgctcaat ccatcaaatg tggtaatgct gatgttgtcg tagctggtgg ttgtgaatct 360atgactaacg caccatacta catgccagca gcccgtgcgg gtgccaaatt tggccaaact 420gttcttgttg atggtgtcga aagagatggg ttgaacgatg cgtacgatgg tctagccatg 480ggtgtacacg cagaaaagtg tgcccgtgat tgggatatta ctagagaaca acaagacaat 540tttgccatcg aatcctacca aaaatctcaa aaatctcaaa aggaaggtaa attcgacaat 600gaaattgtac ctgttaccat taagggattt agaggtaagc ctgatactca agtcacgaag 660gacgaggaac ctgctagatt acacgttgaa aaattgagat ctgcaaggac tgttttccaa 720aaagaaaacg gtactgttac tgccgctaac gcttctccaa tcaacgatgg tgctgcagcc 780gtcatcttgg tttccgaaaa agttttgaag gaaaagaatt tgaagccttt ggctattatc 840aaaggttggg gtgaggccgc tcatcaacca gctgatttta catgggctcc atctcttgca 900gttccaaagg ctttgaaaca tgctggcatc gaagacatca attctgttga ttactttgaa 960ttcaatgaag ccttttcggt tgtcggtttg gtgaacacta agattttgaa gctagaccca 1020tctaaggtta atgtatatgg tggtgctgtt gctctaggtc acccattggg ttgttctggt 1080gctagagtgg ttgttacact gctatccatc ttacagcaag aaggaggtaa gatcggtgtt 1140gccgccattt gtaatggtgg tggtggtgct tcctctattg tcattgaaaa gatatga 1197<210>27<211>1476<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>27atgaaactct caactaaact ttgttggtgt ggtattaaag gaagacttag gccgcaaaag 60caacaacaat tacacaatac aaacttgcaa atgactgaac taaaaaaaca aaagaccgct 120gaacaaaaaa ccagacctca aaatgtcggt attaaaggta tccaaattta catcccaact 180caatgtgtca accaatctga gctagagaaa tttgatggcg tttctcaagg taaatacaca 240attggtctgg gccaaaccaa catgtctttt gtcaatgaca gagaagatat ctactcgatg 300tccctaactg ttttgtctaa gttgatcaag agttacaaca tcgacaccaa caaaattggt 360agattagaag tcggtactga aactctgatt gacaagtcca agtctgtcaa gtctgtcttg 420atgcaattgt ttggtgaaaa cactgacgtc gaaggtattg acacgcttaa tgcctgttac 480ggtggtacca acgcgttgtt caactctttg aactggattg aatctaacgc atgggatggt 540agagacgcca ttgtagtttg cggtgatatt gccatctacg ataagggtgc cgcaagacca 600accggtggtg ccggtactgt tgctatgtgg atcggtcctg atgctccaat tgtatttgac 660tctgtaagag cttcttacat ggaacacgcc tacgattttt acaagccaga tttcaccagc 720gaatatcctt acgtcgatgg tcatttttca ttaacttgtt acgtcaaggc tcttgatcaa 780gtttacaaga gttattccaa gaaggctatt tctaaagggt tggttagcga tcccgctggt 840tcggatgctt tgaacgtttt gaaatatttc gactacaacg ttttccatgt tccaacctgt 900aaattggtca caaaatcata cggtagatta ctatataacg atttcagagc caatcctcaa 960ttgttcccag aagttgacgc cgaattagct actcgcgatt atgacgaatc tttaaccgat 1020aagaacattg aaaaaacttt tgttaatgtt gctaagccat tccacaaaga gagagttgcc 1080caatctttga ttgttccaac aaacacaggt aacatgtaca ccgcatctgt ttatgccgcc 1140tttgcatctc tattaaacta tgttggatct gacgacttac aaggcaagcg tgttggttta 1200ttttcttacg gttccggttt agctgcatct ctatattctt gcaaaattgt tggtgacgtc 1260caacatatta tcaaggaatt agatattact aacaaattag ccaagagaat caccgaaact 1320ccaaaggatt acgaagctgc catcgaattg agagaaaatg cccatttgaa gaagaacttc 1380aaacctcaag gttccattga gcatttgcaa agtggtgttt actacttgac caacatcgat 1440gacaaattta gaagatctta cgatgttaaa aaataa 1476<210>28<211>1332<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>28atgtcattac cgttcttaac ttctgcaccg ggaaaggtta ttatttttgg tgaacactct 60gctgtgtaca acaagcctgc cgtcgctgct agtgtgtctg cgttgagaac ctacctgcta 120ataagcgagt catctgcacc agatactatt gaattggact tcccggacat tagctttaat 180cataagtggt ccatcaatga tttcaatgcc atcaccgagg atcaagtaaa ctcccaaaaa 240ttggccaagg ctcaacaagc caccgatggc ttgtctcagg aactcgttag tcttttggat 300ccgttgttag ctcaactatc cgaatccttc cactaccatg cagcgttttg tttcctgtat 360atgtttgttt gcctatgccc ccatgccaag aatattaagt tttctttaaa gtctacttta 420cccatcggtg ctgggttggg ctcaagcgcc tctatttctg tatcactggc cttagctatg 480gcctacttgg gggggttaat aggatctaat gacttggaaa agctgtcaga aaacgataag 540catatagtga atcaatgggc cttcataggt gaaaagtgta ttcacggtac cccttcagga 600atagataacg ctgtggccac ttatggtaat gccctgctat ttgaaaaaga ctcacataat 660ggaacaataa acacaaacaa ttttaagttc ttagatgatt tcccagccat tccaatgatc 720ctaacctata ctagaattcc aaggtctaca aaagatcttg ttgctcgcgt tcgtgtgttg 780gtcaccgaga aatttcctga agttatgaag ccaattctag atgccatggg tgaatgtgcc 840ctacaaggct tagagatcat gactaagtta agtaaatgta aaggcaccga tgacgaggct 900gtagaaacta ataatgaact gtatgaacaa ctattggaat tgataagaat aaatcatgga 960ctgcttgtct caatcggtgt ttctcatcct ggattagaac ttattaaaaa tctgagcgat 1020gatttgagaa ttggctccac aaaacttacc ggtgctggtg gcggcggttg ctctttgact 1080ttgttacgaa gagacattac tcaagagcaa attgacagct tcaaaaagaa attgcaagat 1140gattttagtt acgagacatt tgaaacagac ttgggtggga ctggctgctg tttgttaagc 1200gcaaaaaatt tgaataaaga tcttaaaatc aaatccctag tattccaatt atttgaaaat 1260aaaactacca caaagcaaca aattgacgat ctattattgc caggaaacac gaatttacca 1320tggacttcat aa 1332<210>29<211>1356<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>29atgtcagagt tgagagcctt cagtgcccca gggaaagcgt tactagctgg tggatattta 60gttttagata caaaatatga agcatttgta gtcggattat cggcaagaat gcatgctgta 120gcccatcctt acggttcatt gcaagggtct gataagtttg aagtgcgtgt gaaaagtaaa 180caatttaaag atggggagtg gctgtaccat ataagtccta aaagtggctt cattcctgtt 240tcgataggcg gatctaagaa ccctttcatt gaaaaagtta tcgctaacgt atttagctac 300tttaaaccta acatggacga ctactgcaat agaaacttgt tcgttattga tattttctct 360gatgatgcct accattctca ggaggatagc gttaccgaac atcgtggcaa cagaagattg 420agttttcatt cgcacagaat tgaagaagtt cccaaaacag ggctgggctc ctcggcaggt 480ttagtcacag ttttaactac agctttggcc tccttttttg tatcggacct ggaaaataat 540gtagacaaat atagagaagt tattcataat ttagcacaag ttgctcattg tcaagctcag 600ggtaaaattg gaagcgggtt tgatgtagcg gcggcagcat atggatctat cagatataga 660agattcccac ccgcattaat ctctaatttg ccagatattg gaagtgctac ttacggcagt 720aaactggcgc atttggttga tgaagaagac tggaatatta cgattaaaag taaccattta 780ccttcgggat taactttatg gatgggcgat attaagaatg gttcagaaac agtaaaactg 840gtccagaagg taaaaaattg gtatgattcg catatgccag aaagcttgaa aatatataca 900gaactcgatc atgcaaattc tagatttatg gatggactat ctaaactaga tcgcttacac 960gagactcatg acgattacag cgatcagata tttgagtctc ttgagaggaa tgactgtacc 1020tgtcaaaagt atcctgaaat cacagaagtt agagatgcag ttgccacaat tagacgttcc 1080tttagaaaaa taactaaaga atctggtgcc gatatcgaac ctcccgtaca aactagctta 1140ttggatgatt gccagacctt aaaaggagtt cttacttgct taatacctgg tgctggtggt 1200tatgacgcca ttgcagtgat tactaagcaa gatgttgatc ttagggctca aaccgctaat 1260gacaaaagat tttctaaggt tcaatggctg gatgtaactc aggctgactg gggtgttagg 1320aaagaaaaag atccggaaac ttatcttgat aaataa 1356<210>30<211>1191<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>30atgaccgttt acacagcatc cgttaccgca cccgtcaaca tcgcaaccct taagtattgg 60gggaaaaggg acacgaagtt gaatctgccc accaattcgt ccatatcagt gactttatcg 120caagatgacc tcagaacgtt gacctctgcg gctactgcac ctgagtttga acgcgacact 180ttgtggttaa atggagaacc acacagcatc gacaatgaaa gaactcaaaa ttgtctgcgc 240gacctacgcc aattaagaaa ggaaatggaa tcgaaggacg cctcattgcc cacattatct 300caatggaaac tccacattgt ctccgaaaat aactttccta cagcagctgg tttagcttcc 360tccgctgctg gctttgctgc attggtctct gcaattgcta agttatacca attaccacag 420tcaacttcag aaatatctag aatagcaaga aaggggtctg gttcagcttg tagatcgttg 480tttggcggat acgtggcctg ggaaatggga aaagctgaag atggtcatga ttccatggca 540gtacaaatcg cagacagctc tgactggcct cagatgaaag cttgtgtcct agttgtcagc 600gatattaaaa aggatgtgag ttccactcag ggtatgcaat tgaccgtggc aacctccgaa 660ctatttaaag aaagaattga acatgtcgta ccaaagagat ttgaagtcat gcgtaaagcc 720attgttgaaa aagatttcgc cacctttgca aaggaaacaa tgatggattc caactctttc 780catgccacat gtttggactc tttccctcca atattctaca tgaatgacac ttccaagcgt 840atcatcagtt ggtgccacac cattaatcag ttttacggag aaacaatcgt tgcatacacg 900tttgatgcag gtccaaatgc tgtgttgtac tacttagctg aaaatgagtc gaaactcttt 960gcatttatct ataaattgtt tggctctgtt cctggatggg acaagaaatt tactactgag 1020cagcttgagg ctttcaacca tcaatttgaa tcatctaact ttactgcacg tgaattggat 1080cttgagttgc aaaaggatgt tgccagagtg attttaactc aagtcggttc aggcccacaa 1140gaaacaaacg aatctttgat tgacgcaaag actggtctac caaaggaata a 1191<210>31<211>867<212>DNA<213>啤酒糖酵母(Saccharomyces cerevisiae)<400>31atgactgccg acaacaatag tatgccccat ggtgcagtat ctagttacgc caaattagtg 60caaaaccaaa cacctgaaga cattttggaa gagtttcctg aaattattcc attacaacaa 120agacctaata cccgatctag tgagacgtca aatgacgaaa gcggagaaac atgtttttct 180ggtcatgatg aggagcaaat taagttaatg aatgaaaatt gtattgtttt ggattgggac 240gataatgcta ttggtgccgg taccaagaaa gtttgtcatt taatggaaaa tattgaaaag 300ggtttactac atcgtgcatt ctccgtcttt attttcaatg aacaaggtga attactttta 360caacaaagag ccactgaaaa aataactttc cctgatcttt ggactaacac atgctgctct 420catccactat gtattgatga cgaattaggt ttgaagggta agctagacga taagattaag 480ggcgctatta ctgcggcggt gagaaaacta gatcatgaat taggtattcc agaagatgaa 540actaagacaa ggggtaagtt tcacttttta aacagaatcc attacatggc accaagcaat 600gaaccatggg gtgaacatga aattgattac atcctatttt ataagatcaa cgctaaagaa 660aacttgactg tcaacccaaa cgtcaatgaa gttagagact tcaaatgggt ttcaccaaat 720gatttgaaaa ctatgtttgc tgacccaagt tacaagttta cgccttggtt taagattatt 780tgcgagaatt acttattcaa ctggtgggag caattagatg acctttctga agtggaaaat 840gacaggcaaa ttcatagaat gctataa 867<210>32<211>549<212>DNA<213>大肠杆菌(Escherichia coli)<400>32atgcaaacgg aacacgtcat tttattgaat gcacagggag ttcccacggg tacgctggaa 60aagtatgccg cacacacggc agacacccgc ttacatctcg cgttctccag ttggctgttt 120aatgccaaag gacaattatt agttacccgc cgcgcactga gcaaaaaagc atggcctggc 180gtgtggacta actcggtttg tgggcaccca caactgggag aaagcaacga agacgcagtg 240atccgccgtt gccgttatga gcttggcgtg gaaattacgc ctcctgaatc tatctatcct 300gactttcgct accgcgccac cgatccgagt ggcattgtgg aaaatgaagt gtgtccggta 360tttgccgcac gcaccactag tgcgttacag atcaatgatg atgaagtgat ggattatcaa 420tggtgtgatt tagcagatgt attacacggt attgatgcca cgccgtgggc gttcagtccg 480tggatggtga tgcaggcgac aaatcgcgaa gccagaaaac gattatctgc atttacccag 540cttaaataa 549<210>33<211>900<212>DNA<213>大肠杆菌(Escherichia coli)<220><22l>CDS<222>(1)..(897)<400>33atg gac ttt ccg cag caa ctc gaa gcc tgc gtt aag cag gcc aac cag 48Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15gcg ctg agc cgt ttt atc gcc cca ctg ccc ttt cag aac act ccc gtg 96Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30gtc gaa acc atg cag tat ggc gca tta tta ggt ggt aag cgc ctg cga 144Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45cct ttc ctg gtt tat gcc acc ggt cat atg ttc ggc gtt agc aca aac 192Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60acg ctg gac gca ccc gct gcc gcc gtt gaa tgc atc cac gct gac tca 240Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Asp Ser65 70 75 80tta att cat gat gat tta ccg gca atg gat gat gac gat ctg cgt cgc 288Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95ggt ttg cca acc tgc cat gtg aag ttt ggc gaa gca aac gcg att ctc 336Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110gct ggc gaa gct tta caa acg ctg gcg ttc tcg att tta agc gat gcc 384Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125gat atg ccg gaa gtg tcg gac cgc gac aga att tcg atg att tct gaa 432Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140ctg gcg agc gcc agt ggt att gcc gga atg tgc ggt ggt cag gca tta 480Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160gat tta gac gcg gaa ggc aaa cac gta cct ctg gac gcg ctt gag cgt 528Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175att cat cgt cat aaa acc ggc gca ttg att cgc gcc gcc gtt cgc ctt 576Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190ggt gca tta agc gcc gga gat aaa gga cgt cgt gct ctg ccg gta ctc 624Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205gac aag tat gca gag agc atc ggc ctt gcc ttc cag gtt cag gat gac 672Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220atc ctg gat gtg gtg gga gat act gca acg ttg gga aaa cgc cag ggt 720Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240gcc gac cag caa ctt ggt aaa agt acc tac cct gca ctt ctg ggt ctt 768Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255gag caa gcc cgg aag aaa gcc cgg gat ctg atc gac gat gcc cgt cag 816Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270tcg ctg aaa caa ctg gct gaa cag tca ctc gat acc tcg gca ctg gaa 864Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285gcg cta gcg gac tac atc atc cag cgt aat aaa taa 900Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>34<211>299<212>PRT<213>大肠杆菌(Escherichia coli)<400>34Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Glnl 5 10 15Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Asp Ser65 70 75 80Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp210 215 220Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>35<211>900<212>DNA<213>大肠杆菌(Escherichia coli)<220><221>CDS<222>(1)..(897)<400>35atg gac ttt ccg cag caa ctc gaa gcc tgc gtt aag cag gcc aac cag 48Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15gcg ctg agc cgt ttt atc gcc cca ctg ccc ttt cag aac act ccc gtg 96Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30gtc gaa acc atg cag tat ggc gca tta tta ggt ggt aag cgc ctg cga 144Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45cct ttc ctg gtt tat gcc acc ggt cat atg ttc ggc gtt agc aca aac 192Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60acg ctg gac gca ccc gct gcc gcc gtt gaa tgc atc cac gct gaa tca 240Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Glu Ser65 70 75 80tta att cat gat gat tta ccg gca atg gat gat gac gat ctg cgt cgc 288Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95ggt ttg cca acc tgc cat gtg aag ttt ggc gaa gca aac gcg att ctc 336Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110gct ggc gac gct tta caa acg ctg gcg ttc tcg att tta agc gat gcc 384Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125gat atg ccg gaa gtg tcg gac cgc gac aga att tcg atg att tct gaa 432Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140ctg gcg agc gcc agt ggt att gcc gga atg tgc ggt ggt cag gca tta 480Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160gat tta gac gcg gaa ggc aaa cac gta cct ctg gac gcg ctt gag cgt 528Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175att cat cgt cat aaa acc ggc gca ttg att cgc gcc gcc gtt cgc ctt 576Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190ggt gca tta agc gcc gga gat aaa gga cgt cgt gct ctg ccg gta ctc 624Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205gac aag tat gca gag agc atc ggc ctt gcc ttc cag gtt cag gat gac 672Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220atc ctg gat gtg gtg gga gat act gca acg ttg gga aaa cgc cag ggt 720Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240gcc gac cag caa ctt ggt aaa agt acc tac cct gca ctt ctg ggt ctt 768Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255gag caa gcc cgg aag aaa gcc cgg gat ctg atc gac gat gcc cgt cag 816Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270tcg ctg aaa caa ctg gct gaa cag tca ctc gat acc tcg gca ctg gaa 864Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285gcg cta gcg gac tac atc atc cag cgt aat aaa taa 900Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>36<211>299<212>PRT<213>大肠杆菌(Escherichia coli)<400>36Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Glu Ser 65 70 75 80Leu Ile His Asp Asp Leu Pro AlaMet Asp Asp Asp Asp Leu Arg Arg
85 90 95Gly Leu Pro Thr Cys His ValLys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>37<211>900<212>DNA<213>大肠杆菌(Escherichia coli)<220><221>CDS<222>(1)..(897)<400>37atg gac ttt ccg cag caa ctc gaa gcc tgc gtt aag cag gcc aac cag 48Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15gcg ctg agc cgt ttt atc gcc cca ctg ccc ttt cag aac act ccc gtg 96Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30gtc gaa acc atg cag tat ggc gca tta tta ggt ggt aag cgc ctg cga 144Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45cct ttc ctg gtt tat gcc acc ggt cat atg ttc ggc gtt agc aca aac 192Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60acg ctg gac gca ccc gct gcc gcc gtt gaa tgc atc cac gct atg tca 240Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Met Ser65 70 75 80tta att cat gat gat tta ccg gca atg gat gat gac gat ctg cgt cgc 288Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95ggt ttg cca acc tgc cat gtg aag ttt ggc gaa gca aac gcg att ctc 336Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110gct ggc gac gct tta caa acg ctg gcg ttc tcg att tta agc gat gcc 384Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125gat atg ccg gaa gtg tcg gac cgc gac aga att tcg atg att tct gaa 432Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140ctg gcg agc gcc agt ggt att gcc gga atg tgc ggt ggt cag gca tta 480Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160gat tta gac gcg gaa ggc aaa cac gta cct ctg gac gcg ctt gag cgt 528Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175att cat cgt cat aaa acc ggc gca ttg att cgc gcc gcc gtt cgc ctt 576Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190ggt gca tta agc gcc gga gat aaa gga cgt cgt gct ctg ccg gta ctc 624Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205gac aag tat gca gag agc atc ggc ctt gcc ttc cag gtt cag gat gac 672Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220atc ctg gat gtg gtg gga gat act gca acg ttg gga aaa cgc cag ggt 720Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240gcc gac cag caa ctt ggt aaa agt acc tac cct gca ctt ctg ggt ctt 768Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255gag caa gcc cgg aag aaa gcc cgg gat ctg atc gac gat gcc cgt cag 816Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270tcg ctg aaa caa ctg gct gaa cag tca ctc gat acc tcg gca ctg gaa 864Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285gcg cta gcg gac tac atc atc cag cgt aat aaa taa 900Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>38<211>299<212>PRT<213>大肠杆菌(Escherichia coli)<400>38Met Asp Phe Pro Gln Gln Leu Glu Ala Cys Val Lys Gln Ala Asn Gln1 5 10 15Ala Leu Ser Arg Phe Ile Ala Pro Leu Pro Phe Gln Asn Thr Pro Val
20 25 30Val Glu Thr Met Gln Tyr Gly Ala Leu Leu Gly Gly Lys Arg Leu Arg
35 40 45Pro Phe Leu Val Tyr Ala Thr Gly His Met Phe Gly Val Ser Thr Asn
50 55 60Thr Leu Asp Ala Pro Ala Ala Ala Val Glu Cys Ile His Ala Met Ser65 70 75 80Leu Ile His Asp Asp Leu Pro Ala Met Asp Asp Asp Asp Leu Arg Arg
85 90 95Gly Leu Pro Thr Cys His Val Lys Phe Gly Glu Ala Asn Ala Ile Leu
100 105 110Ala Gly Asp Ala Leu Gln Thr Leu Ala Phe Ser Ile Leu Ser Asp Ala
115 120 125Asp Met Pro Glu Val Ser Asp Arg Asp Arg Ile Ser Met Ile Ser Glu
130 135 140Leu Ala Ser Ala Ser Gly Ile Ala Gly Met Cys Gly Gly Gln Ala Leu145 150 155 160Asp Leu Asp Ala Glu Gly Lys His Val Pro Leu Asp Ala Leu Glu Arg
165 170 175Ile His Arg His Lys Thr Gly Ala Leu Ile Arg Ala Ala Val Arg Leu
180 185 190Gly Ala Leu Ser Ala Gly Asp Lys Gly Arg Arg Ala Leu Pro Val Leu
195 200 205Asp Lys Tyr Ala Glu Ser Ile Gly Leu Ala Phe Gln Val Gln Asp Asp
210 215 220Ile Leu Asp Val Val Gly Asp Thr Ala Thr Leu Gly Lys Arg Gln Gly225 230 235 240Ala Asp Gln Gln Leu Gly Lys Ser Thr Tyr Pro Ala Leu Leu Gly Leu
245 250 255Glu Gln Ala Arg Lys Lys Ala Arg Asp Leu Ile Asp Asp Ala Arg Gln
260 265 270Ser Leu Lys Gln Leu Ala Glu Gln Ser Leu Asp Thr Ser Ala Leu Glu
275 280 285Ala Leu Ala Asp Tyr Ile Ile Gln Arg Asn Lys
290 295<210>39<211>894<212>DNA<213>嗜热脂肪芽孢杆菌(Bacillus stearothermophilus)<400>39gtggcgcagc tttcagttga acagtttctc aacgagcaaa aacaggcggt ggaaacagcg 60ctctcccgtt atatagagcg cttagaaggg ccggcgaagc tgaaaaaggc gatggcgtac 120tcattggagg ccggcggcaa acgaatccgt ccgttgctgc ttctgtccac cgttcgggcg 180ctcggcaaag acccggcggt cggattgccc gtcgcctgcg cgattgaaat gatccatacg 240atgtctttga ttcatgatga tttgccgagc atggacaacg atgatttgcg gcgcggcaag 300ccgacgaacc ataaagtgtt cggcgaggcg atggccatct tggcggggga cgggttgttg 360acgtacgcgt ttcaattgat caccgaaatc gacgatgagc gcatccctcc ttccgtccgg 420cttcggctca tcgaacggct ggcgaaagcg gccggtccgg aagggatggt cgccggtcag 480gcagccgata tggaaggaga ggggaaaacg ctgacgcttt cggagctcga atacattcat 540cggcataaaa ccgggaaaat gctgcaatac agcgtgcacg ccggcgcctt gatcggcggc 600gctgatgccc ggcaaacgcg ggagcttgac gaattcgccg cccatctagg ccttgccttt 660caaattcgcg atgatattct cgatattgaa ggggcagaag aaaaaatcgg caagccggtc 720ggcagcgacc aaagcaacaa caaagcgacg tatccagcgt tgctgtcgct tgccggcgcg 780aaggaaaagt tggcgttcca tatcgaggcg gcgcagcgcc atttacggaa cgccgacgtt 840gacggcgccg cgctcgccta tatttgcgaa ctggtcgccg cccgcgacca ttaa 894<210>40<211>30<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>40tgcatctcga gggccgcatc atgtaattag 30<210>41<211>32<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>41cattagggcc cggccgcaaa ttaaagcctt cg 32<210>42<211>33<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>42cacggagctc cagttcgagt ttatcattat caa 33<210>43<211>35<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>43ctctccgcgg tttgtttgtt tatgtgtgtt tattc 35<210>44<211>34<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>44ccgcgagctc ttacccataa ggttgtttgt gacg 34<210>45<211>36<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>45ctttccgcgg gtttagttaa ttatagttcg ttgacc 36<210>46<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>46atggcttcag aaaaagaaat tag 23<210>47<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>47ctatttgctt ctcttgtaaa ctt 23<210>48<211>28<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>48tgaggcatgc aatttccgca gcaactcg 28<210>49<211>28<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>49tcagaattca tcaggggcct attaatac 28<210>50<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>50atggaggcca agatagatga gct 23<210>51<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>51tcacaattcg gataagtggt cta 23<210>52<211>32<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>52tccccgcgga tgtctcagaa cgtttacatt gt 32<210>53<211>33<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>53tgctctagat catatctttt caatgacaat gga 33<210>54<211>25<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>54atgaaactct caactaaact ttgtt 25<210>55<211>25<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>55gttcagcaag atgcaatcga tgggg 25<210>56<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>56atgccgccgc tattcaaggg act 23<210>57<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>57ttaggattta atgcaggtga cgg 23<210>58<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>58ccaaataaag actccaacac tctattt 27<210>59<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>59gaattagaag cattattaag tagtgga 27<210>60<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>60ggatttaacg cacatgcagc taattta 27<210>61<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>61gtctgcttgg gttacatttt ctgaaaa 27<210>62<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>62cataccagtt atactgcaga ccaattg 27<210>63<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>63gaatactcat taaagcaaat ggtagaa 27<210>64<211>31<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>64aactgcagat gtcattaccg ttcttaactt c 31<210>65<211>31<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>65ccgagctctt atgaagtcca tggtaaattc g 31<210>66<211>31<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>66aactgcagat gtcattaccg ttcttaactt c 31<210>67<211>31<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>67ccgagctctt atgaagtcca tggtaaattc g 31<210>68<211>31<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>68aactgcagat gaccgtttac acagcatccg t 31<210>69<211>31<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>69cggaattctt attcctttgg tagaccagtc t 31<210>70<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>70atgactgccg acaacaatag tat 23<210>71<211>23<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>71ttatagcatt ctatgaattt gcc 23<210>72<211>32<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>72tccccgcgga tgcaaacgga acacgtcatt tt 32<210>73<211>33<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>73tgctctagat tatttaagct gggtaaatgc aga 33<210>74<211>45<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>74atcatgaatt aatgagtcag cgtggatgca ttcaacggcg gcagc 45<210>75<211>45<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>75atcatgaatt aatgattcag cgtggatgca ttcaacggcg gcagc 45<210>76<211>45<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>76atcatgaatt aatgacatag cgtggatgca ttcaacggcg gcagc 45<210>77<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>77tttcagtccc ttgaatagcg gcggcat 27<210>78<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>78cacaaaatca agattgccca gtatgcc 27<210>79<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>79agaagatacg gatttctttt ctgcttt 27<210>80<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>80aactttggtg caaattgggt caatgat 27<210>81<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>81ttgctcttta aagttttcag aggcatt 27<210>82<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>82gcattattaa gtagtggaaa tacaaaa 27<210>83<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>83cctttgtacg ctttggagaa aaaatta 27<210>84<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>84tctgatcgtt taccatataa aaattat 27<210>85<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>85aaggatggta tgacaagagg cccagta 27<210>86<211>16<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>86tttccgcgga aacatg 16<210>87<211>14<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>87aattgacggc cgtc 14<210>88<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>88ggccgcaaat taaagccttc gagcgtc 27<210>89<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>89acggattaga agccgccgag cgggtga 27<210>90<211>33<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>90gccgttgaca gagggtccga gctcggtacc aag 33<210>91<211>34<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>91catactgacc cattgtcaat gggtaataac tgat 34<210>92<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>92tgtccggtaa atggagac 18<210>93<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>93tgttctcgct gctcgttt 18<210>94<211>19<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>94atgggaaagc tattacaat 19<210>95<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>95caaggttgca atggccat 18<210>96<211>19<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>96caatgtaggg ctatatatg 19<210>97<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>97aacttgggga atggcaca 18<210>98<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>98tctcgaaaaa gggtttgcca t 21<210>99<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>99tcactaggtg taaagagggc t 21<210>100<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>100tgttgaagct tgcatgcctg c 21<210>101<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>101ttgtaaaacg acggccagtg a 21<210>102<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>102atggaggcca agatagatga g 21<210>103<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>103tcacaattcg gataagtggt c 21<210>104<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>104tcctaatgcc aagaaaacag ctgtcac 27<210>105<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>105atggcaaacc ctttttcgag a 21<210>106<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>106agccctcttt acacctagtg a 21<210>107<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>107tccccgcgga tggaggccaa gatagat 27<210>108<211>27<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>108caactcgagt cacaattcgg ataagtg 27<210>109<211>38<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>109gctctagagt tcgtcgtgtt tgcttctctt gtaaactt 38<210>110<211>32<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>110tatctcgagt cacaattcgt catgtaaatt gg 32<210>111<211>28<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>111gcagggaccc caattcggat aagtggt c 28<210>112<211>28<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>112gtagggtccc tggaggccaa gatagatg 28<210>113<211>29<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>113gcagggaccc tttgcttctc ttgtaaact 29<210>114<211>29<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>114gtagggtcct cagaaaaaga aattaggag 29<210>115<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>115tgtaaaacga cggccagt 18<210>116<211>20<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>116taatacgact cactataggg 20<210>117<211>38<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>117gctctagagt tcgtcgtgtt tgcttctctt gtaaactt 38<210>118<211>32<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>118tatctcgagt cacaattcgt catgtaaatt gg 32<210>119<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>119acggtaagaa atccaagc 18<210>120<211>18<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>120tatgagtcgg cacccact 18<210>121<211>10<212>PRT<213>人工序列<220><223>人工序列说明:合成肽<400>121Cys Tyr Ile Ile Asp His Leu Ser Glu Leu1 5 10<210>122<211>10<212>PRT<213>人工序列<220><223>人工序列说明:合成肽<400>122Cys Leu Asn Lys Val Tyr Lys Arg Ser Lys1 5 10<210>123<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>123ttttaagagc ttggtgagcg c 21<210>124<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>124tcgagttcaa gagaaaaaaa a 21<210>125<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>125ttcaattcat catttttttt t 21<210>126<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>126gggtaataac tgatataatt a 21<210>127<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>127atggattcta gaacagttgg t 21<210>128<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成NDA<400>128ttacttgttt tctagataag c 21<210>129<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>129atgactaacg aaaaggtctg g 21<210>130<211>21<212>DNA<213>人工序列<220><223>人工序列说明:合成DNA<400>130ttaagctgct gcggagcttc c 21
Claims (47)
1.一种制备异戊二烯醇的方法,包括:通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收异戊二烯醇。
2.一种制备异戊二烯醇的方法,包括:通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有羟甲基戊二酰辅酶A还原酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收异戊二烯醇。
3.根据权利要求1或2的方法,其中的异戊二烯醇为香叶基香叶醇。
4.一种制备香叶基香叶醇的方法,包括:通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有异戊烯基二磷酸Δ异构酶基因的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收香叶基香叶醇。
5.根据权利要求1-4中任意一项的方法,其中的异戊二烯基二磷酸合酶基因选自下列基因(a)和(b)以及融合基因(c)和(d):
(a)法呢基二磷酸合酶基因或其突变体
(b)香叶基香叶基二磷酸合酶基因或其突变体
(c)由法呢基二磷酸合酶基因或其突变体和香叶基香叶基二磷酸合酶基因或其突变体组成的融合基因
(d)添加一个编码His Asp Glu Leu氨基酸序列的核苷酸序列的上述基因(a)或(b)或融合基因(c)。
6.根据权利要求5的方法,其中法呢基二磷酸合酶基因编码如SEQID NO:2或4所示的氨基酸序列。
7.根据权利要求5的方法,其中香叶基香叶基二磷酸合酶基因编码如SEQ ID NO:6所示的氨基酸序列。
8.一种制备香叶基香叶醇的方法,包括:通过将均含有羟甲基戊二酰辅酶A还原酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体;培养得到的重组体;及从得到的培养物中回收香叶基香叶醇。
9.一种制备香叶基香叶醇的方法,包括:通过将均含有羟甲基戊二酰辅酶A还原酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有选自下列(e)到(j)的基因的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建一个重组体:
(e)异戊烯基二磷酸Δ-异构酶基因
(f)甲羟戊酸激酶基因
(g)乙酰辅酶A乙酰基转移酶基因
(h)羟甲基戊二酰辅酶A合酶基因
(i)磷酸甲羟戊酸激酶基因
(j)二磷酸甲羟戊酸脱羧酶基因;
培养得到的重组体;及从得到的培养物中回收香叶基香叶醇。
10.根据权利要求3-9中任意一项的方法,其中在得到得培养物中香叶基香叶醇的浓度至少为0.05mg/L。
11.根据权利要求1-10中任意一项的方法,其中的宿主为酵母或大肠杆菌。
12.根据权利要求11的方法,其中的酵母为啤酒糖酵母。
13.根据权利要求13的方法,其中的啤酒糖酵母为啤酒糖酵母A451菌株、YPH499菌株、YPH500菌株、W303-1A菌株或W303-1B菌株,或一种衍生自上述任一菌株的菌株。
14.一种用于表达的重组DNA,含有选自以下基因(a)和基因(b)以及融合基因(c)和(d)的一种基因,以及转录启动子和转录终止子:
(a)法呢基二磷酸合酶基因或其突变体
(b)香叶基香叶基二磷酸合酶基因或其突变体
(c)由法呢基二磷酸合酶基因或其突变体和香叶基香叶基二磷酸合酶基因或其突变体组成的融合基因
(d)添加一个编码His Asp Glu Leu氨基酸序列的核苷酸序列的上述基因(a)或(b)或融合基因(c)。
15.根据权利要求14的重组DNA,其中的转录启动子是选自ADH1启动子、TDH3(GAP)启动子、TEF2启动子、GAL1启动子和tac启动子的任意一种启动子。
16.根据权利要求14的重组DNA,其中的转录终止子为CYC1终止子。
17.通过将权利要求14-16中任意一项的重组DNA导入宿主而获得的重组体。
18.根据权利要求17的重组体,其中的宿主为酵母或大肠杆菌。
19.根据权利要求18的重组体,其中的酵母为啤酒糖酵母。
20.根据权利要求19的重组体,其中的啤酒糖酵母为啤酒糖酵母A451菌株、YPH499菌株、YPH500菌株、W303-1A菌株或W303-1B菌株,或一种衍生自上述任一菌株的菌株。
21.根据权利要求18-20中任意一项的重组体,其中的重组体是原养型。
22.根据权利要求18-20中任意一项的重组体,其中的重组体是二倍体细胞。
23.根据权利要求18-20中任意一项的重组体,其中的重组体是原养型,且是二倍体细胞。
24.一种制备异戊二烯醇的方法,包括:采用含下列组分(i)-(vi)的任意一种的培养基,培养具有产生异戊二烯醇能力的微生物:
(i)糖
(ii)醇
(iii)氨气,氨水和/或铵盐
(iv)氢氧化钠和硫酸的混合物
(v)KH2PO4、硫酸镁、硫酸铵、玉米浸渍液、氯化钙和表面活性剂的混合物
(vi)上述组分(i)-(v)中的两种或以上的混合物;
并且从得到的培养物中回收异戊二烯醇。
25.根据权利要求24的方法,其中采用补料液培养微生物,该补料液含有下列组分(i)、(ii)或(iii)或这些组分中的两种或两种以上的混合物:
(i)糖
(ii)醇
(iii)氨气、氨水和/或铵盐。
26.根据权利要求24的方法,其中补料液的碳源组分从培养开始到培养开始后12-24小时之间仅由葡萄糖组成,在此之后碳源组分更换为含乙醇的组分。
27.根据权利要求24的方法,其中在培养开始12-24小时后,补料液中的乙醇占总的碳源组分的比例为50%或更多。
28.根据权利要求24的方法,其中在培养开始12-24小时后,补料液中的碳源组分仅由乙醇组成。
29.根据权利要求24的方法,其中所述积累于培养物中的异戊二烯醇浓度至少为0.1g/L或更高。
30.根据权利要求24的方法,其中所述积累于培养物中的异戊二烯醇浓度至少为1g/L或更高。
31.根据权利要求24的方法,其中的异戊二烯醇为香叶基香叶醇。
32.根据权利要求24的方法,其中的微生物为酵母。
33.根据权利要求32的方法,其中的酵母为啤酒糖酵母。
34.根据权利要求33的方法,其中的啤酒糖酵母为啤酒糖酵母A451菌株、YPH499菌株、YPH500菌株、W303-1A菌株或W303-1B菌株,或一种衍生自上述任一菌株的菌株。
35.根据权利要求24的方法,其中的微生物为重组体。
36.根据权利要求35的方法,其中的重组体是通过将均含有甲羟戊酸途径相关基因或其突变体,或异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建的。
37.根据权利要求35的方法,其中的重组体是通过将均含有异戊二烯基二磷酸合酶基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA以及均含有甲羟戊酸途径相关基因或其突变体的一个用于表达的重组DNA或一个用于基因组整合的DNA导入宿主而构建的。
38.根据权利要求36或37的方法,其中的宿主为啤酒糖酵母。
39.根据权利要求38的方法,其中的啤酒糖酵母为啤酒糖酵母A451菌株、YPH499菌株、YPH500菌株、W303-1A菌株或W303-1B菌株,或一种衍生自上述任一菌株的菌株。
40.根据权利要求36或37的方法,其中的甲羟二羧酸途径相关基因为羟甲基戊二酰辅酶A还原酶基因。
41.根据权利要求40的方法,其中的羟甲基戊二酰辅酶A还原酶基因为HMG1基因。
42.根据权利要求36或37的方法,其中的异戊二烯基二磷酸合酶基因是选自下列基因(a)和(b)以及融合基因(c)和(d)的任意一种:
(a)法呢基二磷酸合酶基因或其突变体
(b)香叶基香叶基二磷酸合酶基因或其突变体
(c)由法呢基二磷酸合酶基因或其突变体和香叶基香叶基二磷酸合酶基因或其突变体组成的融合基因
(d)添加一个编码His Asp Glu Leu氨基酸序列的核苷酸序列的上述基因(a)或(b)或融合基因(c)。
43.根据权利要求24的方法,其中的微生物是原养型。
44.根据权利要求24的方法,其中的微生物是二倍体细胞。
45.根据权利要求24的方法,其中的微生物是原养型,且是二倍体细胞。
46.根据权利要求24的方法,其中培养基的pH是被控制的。
47.根据权利要求46的方法,其中pH的控制是采用氨气、铵盐溶液、氢氧化钠溶液或硫酸进行的。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2000403067 | 2000-12-28 | ||
JP403067/2000 | 2000-12-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1491282A true CN1491282A (zh) | 2004-04-21 |
Family
ID=18867247
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA018227147A Pending CN1491282A (zh) | 2000-12-28 | 2001-12-20 | 异戊二烯醇的制备方法 |
Country Status (6)
Country | Link |
---|---|
US (1) | US7501268B2 (zh) |
EP (1) | EP1354955A4 (zh) |
JP (1) | JP3894119B2 (zh) |
CN (1) | CN1491282A (zh) |
CA (1) | CA2433529A1 (zh) |
WO (1) | WO2002053746A1 (zh) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9102952B2 (en) | 2009-12-22 | 2015-08-11 | Global Bioenergies Sa | Process for the production of isoprenol from mevalonate employing a diphospho-mevolonate decarboxylase |
CN106987532A (zh) * | 2007-03-28 | 2017-07-28 | Reg生命科学有限责任公司 | 提高脂肪酸衍生物的产量 |
US10844406B2 (en) | 2006-05-19 | 2020-11-24 | Genomatica, Inc. | Production of fatty acids and derivatives thereof |
CN112251427A (zh) * | 2020-10-22 | 2021-01-22 | 中国农业科学院深圳农业基因组研究所 | 一种倍半萜合酶IlTPS1,编码核苷酸序列及其应用 |
US11046635B2 (en) | 2006-05-19 | 2021-06-29 | Genomatica, Inc. | Recombinant E. coli for enhanced production of fatty acid derivatives |
US11434512B2 (en) | 2006-05-19 | 2022-09-06 | Genomatica, Inc. | Production of fatty acid esters |
Families Citing this family (59)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1354956A4 (en) * | 2000-12-28 | 2004-10-27 | Toyota Motor Co Ltd | PROCESS FOR PRODUCING PRENYL ALCOHOL |
WO2006014837A1 (en) * | 2004-07-27 | 2006-02-09 | The Regents Of The University Of California | Genetically modified host cells and use of same for producing isoprenoid compounds |
EP2371967B1 (en) | 2005-03-18 | 2015-06-03 | DSM IP Assets B.V. | Production of carotenoids in oleaginous yeast and fungi |
US8163980B2 (en) * | 2005-07-05 | 2012-04-24 | The Regents Of The University Of California | Polynucleotides encoding isoprenoid modifying enzymes and methods of use thereof |
US20090135730A1 (en) * | 2005-10-24 | 2009-05-28 | Seeker Wireless Pty. Limited | Detection in Mobile Service Maintenance |
US7854774B2 (en) * | 2006-05-26 | 2010-12-21 | Amyris Biotechnologies, Inc. | Fuel components, fuel compositions and methods of making and using same |
US8691555B2 (en) | 2006-09-28 | 2014-04-08 | Dsm Ip Assests B.V. | Production of carotenoids in oleaginous yeast and fungi |
WO2008091627A2 (en) * | 2007-01-22 | 2008-07-31 | Genomatica, Inc. | Methods and organisms for growth-coupled production of 3-hydroxypropionic acid |
CN103540559A (zh) * | 2007-02-09 | 2014-01-29 | 加利福尼亚大学董事会 | 利用重组微生物生产生物燃料 |
TWI568847B (zh) | 2007-03-16 | 2017-02-01 | 奇諾麥提卡公司 | 用於1,4-丁二醇及其前驅物之生物合成的組合物及方法 |
WO2008124523A1 (en) * | 2007-04-04 | 2008-10-16 | The Regents Of The University Of California | Butanol production by recombinant microorganisms |
WO2009006429A1 (en) * | 2007-06-29 | 2009-01-08 | Regents Of The University Of California | Host cells and methods for producing 3-methyl-2-buten-1-ol, 3-methyl-3-buten-1-ol, and 3-methyl-butan-1-ol |
AP2741A (en) * | 2007-07-05 | 2013-09-30 | Univ California | Polynucleotides encoding isoprenoid modifying enzymes and methods of use thereof |
TWI453284B (zh) * | 2007-08-10 | 2014-09-21 | Genomatica Inc | 合成烯酸及其衍生物之方法 |
US7947483B2 (en) | 2007-08-10 | 2011-05-24 | Genomatica, Inc. | Methods and organisms for the growth-coupled production of 1,4-butanediol |
WO2009036497A1 (en) * | 2007-09-17 | 2009-03-26 | Seeker Wireless Pty Limited | Systems and methods for triggering location based voice and/or data communications to or from mobile radio terminals |
CA2700211C (en) * | 2007-09-20 | 2019-07-30 | Amyris Biotechnologies, Inc. | Production of isoprenoids |
JP2011500031A (ja) * | 2007-10-12 | 2011-01-06 | ザ レジェンツ オブ ザ ユニヴァースティ オブ カリフォルニア | イソプロパノールを生産するように操作された微生物 |
CN107090424A (zh) * | 2008-01-22 | 2017-08-25 | 基因组股份公司 | 利用合成气或其它气态碳源和甲醇的方法和有机体 |
BRPI0909690A2 (pt) | 2008-03-05 | 2019-02-26 | Genomatica Inc | organismos produtores de alcoóis primários |
CN113430119A (zh) | 2008-03-27 | 2021-09-24 | 基因组股份公司 | 用于产生己二酸和其他化合物的微生物 |
US8787171B2 (en) * | 2008-04-07 | 2014-07-22 | Wavemarket, Inc. | Efficient collection of wireless transmitter characteristics |
EP2285973A4 (en) * | 2008-05-01 | 2012-04-25 | Genomatica Inc | PROCESS FOR PREPARING METHACRYLIC ACID |
CA2725549A1 (en) * | 2008-06-17 | 2009-12-23 | Genomatica, Inc. | Microorganisms and methods for the biosynthesis of fumarate, malate, and acrylate |
US20100021978A1 (en) * | 2008-07-23 | 2010-01-28 | Genomatica, Inc. | Methods and organisms for production of 3-hydroxypropionic acid |
JP5912529B2 (ja) | 2008-09-10 | 2016-04-27 | ゲノマチカ, インク. | 1,4−ブタンジオールの生成のための微生物体 |
WO2010057022A1 (en) * | 2008-11-14 | 2010-05-20 | Genomatica, Inc. | Microorganisms for the production of methyl ethyl ketone and 2-butanol |
AU2009327490A1 (en) * | 2008-12-16 | 2011-07-28 | Genomatica, Inc. | Microorganisms and methods for conversion of syngas and other carbon sources to useful products |
EP3318626B1 (en) * | 2009-04-30 | 2020-01-15 | Genomatica, Inc. | Organisms for the production of 1,3-butanediol |
AU2010242849A1 (en) | 2009-04-30 | 2011-11-24 | Genomatica, Inc. | Organisms for the production of isopropanol, n-butanol, and isobutanol |
JP5773990B2 (ja) | 2009-05-07 | 2015-09-02 | ゲノマチカ, インク. | アジペート、ヘキサメチレンジアミン、及び6−アミノカプロン酸の生合成のための微生物及び方法 |
EP2429587A4 (en) * | 2009-05-15 | 2014-10-08 | Genomatica Inc | ORGANISMS FOR THE PREPARATION OF CYCLOHEXANONE |
KR20170080709A (ko) * | 2009-06-04 | 2017-07-10 | 게노마티카 인코포레이티드 | 발효액 성분들의 분리 방법 |
PL2438178T3 (pl) | 2009-06-04 | 2018-09-28 | Genomatica Inc | Mikroorganizmy do produkcji 1,4-butanodiolu i pokrewne sposoby |
JP4821886B2 (ja) * | 2009-06-04 | 2011-11-24 | トヨタ自動車株式会社 | 組換え酵母、当該組換え酵母を用いた分岐アルコールの製造方法 |
US8420375B2 (en) * | 2009-06-10 | 2013-04-16 | Genomatica, Inc. | Microorganisms and methods for carbon-efficient biosynthesis of MEK and 2-butanol |
EP3190174A1 (en) | 2009-08-05 | 2017-07-12 | Genomatica, Inc. | Semi-synthetic terephthalic acid via microorganisms that produce muconic acid |
BR112012005296A2 (pt) | 2009-09-09 | 2019-01-15 | Genomatica Inc | microorganismos e métodos para a coprodução de isopropanol com alcoóis primários, diós e ácidos. |
EP2488652A4 (en) | 2009-10-13 | 2014-04-09 | Genomatica Inc | MICRO-ORGANISMS FOR THE PREPARATION OF 1,4-BANDANDIOL, 4-HYDROXYBUTANAL, 4-HYDROXYBUTYRYL-COA, PUTRESCIN AND CORRESPONDING COMPOUNDS, AND CORRESPONDING METHODS |
KR20180014240A (ko) | 2009-10-23 | 2018-02-07 | 게노마티카 인코포레이티드 | 아닐린의 제조를 위한 미생물 |
WO2011066076A1 (en) | 2009-11-25 | 2011-06-03 | Genomatica, Inc. | Microorganisms and methods for the coproduction of 1,4-butanediol and gamma-butyrolactone |
BR112012014062A2 (pt) | 2009-12-10 | 2017-04-04 | Genomatica Inc | organismo microbiano que não ocorre naturalmente e método para produzir 1,3 bdo |
CA2787314A1 (en) | 2010-01-29 | 2011-08-04 | Genomatica, Inc. | Microorganisms and methods for the biosynthesis of p-toluate and terephthalate |
US8445244B2 (en) * | 2010-02-23 | 2013-05-21 | Genomatica, Inc. | Methods for increasing product yields |
US8048661B2 (en) * | 2010-02-23 | 2011-11-01 | Genomatica, Inc. | Microbial organisms comprising exogenous nucleic acids encoding reductive TCA pathway enzymes |
US8244236B2 (en) | 2010-04-29 | 2012-08-14 | Wavemarket, Inc. | System and method for aggregating and disseminating mobile device tag data |
US9023636B2 (en) | 2010-04-30 | 2015-05-05 | Genomatica, Inc. | Microorganisms and methods for the biosynthesis of propylene |
US8580543B2 (en) | 2010-05-05 | 2013-11-12 | Genomatica, Inc. | Microorganisms and methods for the biosynthesis of butadiene |
BR112013001635A2 (pt) | 2010-07-26 | 2016-05-24 | Genomatica Inc | micro-organismo e métodos para a biossíntese de aromáticos, 2, 4-pentadienoato e 1,3-butadieno |
EP2681313B1 (en) * | 2011-02-28 | 2016-04-20 | OrganoBalance GmbH | A yeast cell for the production of terpenes and uses thereof |
CN104685058B (zh) | 2012-06-04 | 2020-07-07 | 基因组股份公司 | 制造4-羟基丁酸酯、1,4-丁二醇和相关化合物的微生物和方法 |
BR112015003329A2 (pt) | 2012-08-17 | 2017-09-26 | Evolva Sa | produção aumentada de terpenos e terpenoides |
WO2016008885A1 (en) * | 2014-07-14 | 2016-01-21 | Photanol B.V. | Biosynthesis of sesquiterpenes in cyanobacteria |
US10767197B2 (en) * | 2015-10-20 | 2020-09-08 | Buckman Laboratories International, Inc. | Method to enhance yeast growth for fermentative bioproduct production, and nutrient composition for same |
KR102106038B1 (ko) * | 2017-05-11 | 2020-04-29 | 경상대학교 산학협력단 | 형질전환 생물체 선별용 마커 조성물, 형질전환 생물체 및 형질전환 방법 |
US20210002678A1 (en) * | 2018-03-20 | 2021-01-07 | Manus Bio, Inc. | Enzymes, cells and methods for production of 3-(4-farnesyloxyphenyl)propionic acid and derivatives thereof |
CN112080440B (zh) * | 2020-09-24 | 2023-11-03 | 江南大学 | 一种生产法尼烯的酿酒酵母工程菌及其应用 |
CN112094765B (zh) * | 2020-09-24 | 2022-02-01 | 江南大学 | 一种生产法尼烯的酿酒酵母工程菌的构建和应用 |
KR20240009457A (ko) * | 2021-05-19 | 2024-01-22 | 시데로 바이오사이언스 엘엘씨 | 페리틴 또는 이의 동족체를 포함하는 재조합 숙주 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5306862A (en) | 1990-10-12 | 1994-04-26 | Amoco Corporation | Method and composition for increasing sterol accumulation in higher plants |
US5460949A (en) * | 1990-11-15 | 1995-10-24 | Amoco Corporation | Method and composition for increasing the accumulation of squalene and specific sterols in yeast |
EP0509841A3 (en) * | 1991-04-18 | 1993-08-18 | Tonen Corporation | Co-expression system of protein disulfide isomerase gene and useful polypeptide gene and process for producing the polypeptide using its system |
JP3198842B2 (ja) * | 1994-03-24 | 2001-08-13 | トヨタ自動車株式会社 | ゲラニルゲラニル二リン酸合成酵素およびそれをコードするdna |
JP3657298B2 (ja) * | 1994-11-08 | 2005-06-08 | 株式会社クラレ | ゲラニルゲラニオ−ルの製造方法 |
JP3151371B2 (ja) | 1995-03-10 | 2001-04-03 | 麒麟麦酒株式会社 | カロチノイド生産量の増量に有用なdna鎖 |
JP3209103B2 (ja) * | 1996-07-03 | 2001-09-17 | トヨタ自動車株式会社 | 変異型プレニル二燐酸合成酵素 |
EP1095033A1 (en) * | 1998-07-06 | 2001-05-02 | Eastman Chemical Company | Method of producing vitamin e |
EP1095002A4 (en) * | 1998-07-06 | 2005-08-03 | Dcv Inc | PROCESS FOR PRODUCING VITAMINS |
JP4013535B2 (ja) * | 2000-12-28 | 2007-11-28 | トヨタ自動車株式会社 | 微生物によるプレニルアルコールの高生産方法 |
-
2001
- 2001-12-20 CN CNA018227147A patent/CN1491282A/zh active Pending
- 2001-12-20 JP JP2002555252A patent/JP3894119B2/ja not_active Expired - Fee Related
- 2001-12-20 WO PCT/JP2001/011214 patent/WO2002053746A1/ja not_active Application Discontinuation
- 2001-12-20 US US10/451,643 patent/US7501268B2/en not_active Expired - Fee Related
- 2001-12-20 CA CA002433529A patent/CA2433529A1/en not_active Abandoned
- 2001-12-20 EP EP01272514A patent/EP1354955A4/en not_active Withdrawn
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10844406B2 (en) | 2006-05-19 | 2020-11-24 | Genomatica, Inc. | Production of fatty acids and derivatives thereof |
US11046635B2 (en) | 2006-05-19 | 2021-06-29 | Genomatica, Inc. | Recombinant E. coli for enhanced production of fatty acid derivatives |
US11434512B2 (en) | 2006-05-19 | 2022-09-06 | Genomatica, Inc. | Production of fatty acid esters |
CN106987532A (zh) * | 2007-03-28 | 2017-07-28 | Reg生命科学有限责任公司 | 提高脂肪酸衍生物的产量 |
CN106987532B (zh) * | 2007-03-28 | 2021-06-08 | 基因组股份公司 | 提高脂肪酸衍生物的产量 |
US9102952B2 (en) | 2009-12-22 | 2015-08-11 | Global Bioenergies Sa | Process for the production of isoprenol from mevalonate employing a diphospho-mevolonate decarboxylase |
CN112251427A (zh) * | 2020-10-22 | 2021-01-22 | 中国农业科学院深圳农业基因组研究所 | 一种倍半萜合酶IlTPS1,编码核苷酸序列及其应用 |
Also Published As
Publication number | Publication date |
---|---|
JPWO2002053746A1 (ja) | 2004-05-13 |
EP1354955A1 (en) | 2003-10-22 |
CA2433529A1 (en) | 2002-07-11 |
US7501268B2 (en) | 2009-03-10 |
EP1354955A4 (en) | 2005-01-05 |
US20070087425A1 (en) | 2007-04-19 |
JP3894119B2 (ja) | 2007-03-14 |
WO2002053746A1 (en) | 2002-07-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1491282A (zh) | 异戊二烯醇的制备方法 | |
CN1556855A (zh) | 3-羟基丙酸及其它有机化合物 | |
US20220056485A1 (en) | Method for producing fragrant alcohols | |
CN1492931A (zh) | 异戊二烯醇的生产方法 | |
CN101080490A (zh) | 改进的甲羟戊酸激酶 | |
CN1910278A (zh) | 生产1,2-丙二醇的进化微生物 | |
CN1806042A (zh) | 具有反馈抗性的甲羟戊酸激酶 | |
CN1993377A (zh) | 丙氨酸2,3氨基变位酶 | |
CN110904018B (zh) | 5-氨基乙酰丙酸生产菌株及其构建方法和应用 | |
US20170314049A1 (en) | Methods and Materials for Biosynthesis of Manoyl Oxide | |
US20040063182A1 (en) | Process for producing prenyl alcohol | |
CN1795270A (zh) | 编码具有d-乳酸脱氢酶活性蛋白质的dna及其用途 | |
CN1165621C (zh) | 法呢基二磷酸合酶 | |
KR20120082141A (ko) | 바이오 에탄올 생산 향상용 피루브산 탈카르복실산 효소와 알코올 탈수소화 효소를 코딩하는 유전자가 도입된 형질전환체 및 그 균주를 이용한 에탄올 생성방법 | |
WO2019233853A1 (en) | Microorganisms and the production of fine chemicals | |
CN1517436A (zh) | 氧化还原酶 | |
CN1056189C (zh) | 编码阿凡曼链霉菌支链α-酮酸脱氢酶复合物的基因 | |
CN1639340A (zh) | 控制乙醇产生的方法 | |
CN1578831A (zh) | 具有实施生物技术工艺的增强能力的真菌微生物 | |
JP4379087B2 (ja) | ゲラニルゲラニル二リン酸合成酵素及びゲラニルゲラニオールの製造方法 | |
CN114174506B (zh) | 芳樟醇合酶 | |
CN1483080A (zh) | 发酵生产d-对羟基苯基甘氨酸和d-苯基甘氨酸 | |
CN111201321B (zh) | 基因修饰的异丙基苹果酸异构酶酶复合物及用其制备伸长的2-酮酸和c5-c10化合物的方法 | |
CN1571838A (zh) | 实现基因高表达的系统 | |
CN1798835A (zh) | 来自 C a n d i d a A l bi c a n s的甘油-3-磷酸磷酸酶和甘油-3-磷酸脱氢酶 ,编码它们的基因, 包含该基因的载体和宿主细胞, 和使用该宿主细胞生产甘油的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |