JP2001161370A - Gene of enzyme involved in actinomyces-derived mevalonate pathway - Google Patents
Gene of enzyme involved in actinomyces-derived mevalonate pathwayInfo
- Publication number
- JP2001161370A JP2001161370A JP34837599A JP34837599A JP2001161370A JP 2001161370 A JP2001161370 A JP 2001161370A JP 34837599 A JP34837599 A JP 34837599A JP 34837599 A JP34837599 A JP 34837599A JP 2001161370 A JP2001161370 A JP 2001161370A
- Authority
- JP
- Japan
- Prior art keywords
- ala
- leu
- gly
- val
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 95
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 title claims abstract description 70
- 230000037361 pathway Effects 0.000 title claims abstract description 60
- 102000004190 Enzymes Human genes 0.000 title claims abstract description 33
- 108090000790 Enzymes Proteins 0.000 title claims abstract description 33
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 title claims abstract 7
- 241000186046 Actinomyces Species 0.000 title description 4
- -1 isoprenoid compounds Chemical class 0.000 claims abstract description 33
- 238000004519 manufacturing process Methods 0.000 claims abstract description 22
- 238000012258 culturing Methods 0.000 claims abstract description 16
- 241000588724 Escherichia coli Species 0.000 claims description 68
- 238000000034 method Methods 0.000 claims description 59
- 239000002773 nucleotide Substances 0.000 claims description 55
- 125000003729 nucleotide group Chemical group 0.000 claims description 55
- 150000001413 amino acids Chemical group 0.000 claims description 53
- 102000004169 proteins and genes Human genes 0.000 claims description 31
- 230000000694 effects Effects 0.000 claims description 19
- ACTIUHUUMQJHFO-UHFFFAOYSA-N Coenzym Q10 Natural products COC1=C(OC)C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UHFFFAOYSA-N 0.000 claims description 18
- ACTIUHUUMQJHFO-UPTCCGCDSA-N coenzyme Q10 Chemical compound COC1=C(OC)C(=O)C(C\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UPTCCGCDSA-N 0.000 claims description 18
- 239000013598 vector Substances 0.000 claims description 18
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 17
- 235000017471 coenzyme Q10 Nutrition 0.000 claims description 17
- NPCOQXAVBJJZBQ-UHFFFAOYSA-N reduced coenzyme Q9 Natural products COC1=C(O)C(C)=C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)C(O)=C1OC NPCOQXAVBJJZBQ-UHFFFAOYSA-N 0.000 claims description 17
- 229940035936 ubiquinone Drugs 0.000 claims description 17
- 108010000775 Hydroxymethylglutaryl-CoA synthase Proteins 0.000 claims description 14
- 102100028888 Hydroxymethylglutaryl-CoA synthase, cytoplasmic Human genes 0.000 claims description 14
- 101710183613 Diphosphomevalonate decarboxylase Proteins 0.000 claims description 12
- 102000057412 Diphosphomevalonate decarboxylases Human genes 0.000 claims description 12
- 108700040132 Mevalonate kinases Proteins 0.000 claims description 11
- 102100024279 Phosphomevalonate kinase Human genes 0.000 claims description 11
- 102000002678 mevalonate kinase Human genes 0.000 claims description 11
- 108091000116 phosphomevalonate kinase Proteins 0.000 claims description 11
- 235000021466 carotenoid Nutrition 0.000 claims description 8
- 150000001747 carotenoids Chemical class 0.000 claims description 8
- 238000007792 addition Methods 0.000 claims description 6
- 238000012217 deletion Methods 0.000 claims description 6
- 230000037430 deletion Effects 0.000 claims description 6
- 108090000895 Hydroxymethylglutaryl CoA Reductases Proteins 0.000 claims description 5
- 102000004286 Hydroxymethylglutaryl CoA Reductases Human genes 0.000 claims description 5
- 150000001875 compounds Chemical class 0.000 claims description 5
- PFRQBZFETXBLTP-RCIYGOBDSA-N 2-[(2e,6e,10e,14e,18e)-3,7,11,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaen-1-yl]-3-methyl-1,4-dihydronaphthalene-1,4-dione Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CCC=C(C)C)=C(C)C(=O)C2=C1 PFRQBZFETXBLTP-RCIYGOBDSA-N 0.000 claims description 4
- 238000006467 substitution reaction Methods 0.000 claims description 3
- 238000003780 insertion Methods 0.000 claims description 2
- 230000037431 insertion Effects 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 239000003814 drug Substances 0.000 abstract description 5
- 235000015170 shellfish Nutrition 0.000 abstract description 4
- 206010028980 Neoplasm Diseases 0.000 abstract description 3
- 208000001132 Osteoporosis Diseases 0.000 abstract description 3
- 201000011510 cancer Diseases 0.000 abstract description 3
- 230000002265 prevention Effects 0.000 abstract description 3
- 230000006696 biosynthetic metabolic pathway Effects 0.000 abstract description 2
- 235000013402 health food Nutrition 0.000 abstract description 2
- 230000023597 hemostasis Effects 0.000 abstract description 2
- 238000000576 coating method Methods 0.000 abstract 1
- 108020004414 DNA Proteins 0.000 description 72
- KJTLQQUUPVSXIM-ZCFIWIBFSA-N (R)-mevalonic acid Chemical compound OCC[C@](O)(C)CC(O)=O KJTLQQUUPVSXIM-ZCFIWIBFSA-N 0.000 description 63
- 210000004027 cell Anatomy 0.000 description 49
- 239000002609 medium Substances 0.000 description 31
- 235000018102 proteins Nutrition 0.000 description 28
- 241000187747 Streptomyces Species 0.000 description 20
- 229940023064 escherichia coli Drugs 0.000 description 19
- 239000012634 fragment Substances 0.000 description 16
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 16
- 239000013604 expression vector Substances 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical compound CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 description 15
- 238000010369 molecular cloning Methods 0.000 description 15
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 14
- 239000013612 plasmid Substances 0.000 description 14
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 13
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 13
- 244000005700 microbiome Species 0.000 description 13
- 108010049041 glutamylalanine Proteins 0.000 description 12
- 241000238631 Hexapoda Species 0.000 description 11
- 241000187180 Streptomyces sp. Species 0.000 description 11
- 108010061238 threonyl-glycine Proteins 0.000 description 11
- 239000013611 chromosomal DNA Substances 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- CABVTRNMFUVUDM-VRHQGPGLSA-N (3S)-3-hydroxy-3-methylglutaryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@@](O)(CC(O)=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CABVTRNMFUVUDM-VRHQGPGLSA-N 0.000 description 9
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 9
- 108010047495 alanylglycine Proteins 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 210000004102 animal cell Anatomy 0.000 description 8
- 230000012010 growth Effects 0.000 description 8
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- 241001446247 uncultured actinomycete Species 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 238000007796 conventional method Methods 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- OKZYCXHTTZZYSK-ZCFIWIBFSA-N (R)-5-phosphomevalonic acid Chemical compound OC(=O)C[C@@](O)(C)CCOP(O)(O)=O OKZYCXHTTZZYSK-ZCFIWIBFSA-N 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 6
- 101100444336 Claviceps purpurea (strain 20.1) easH gene Proteins 0.000 description 6
- 101100409508 Escherichia coli prrC gene Proteins 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 101150084291 ORFC gene Proteins 0.000 description 6
- 101100222028 Salmonella enteritidis csgC gene Proteins 0.000 description 6
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 6
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 6
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 238000005119 centrifugation Methods 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 150000003505 terpenes Chemical class 0.000 description 6
- 241000701447 unidentified baculovirus Species 0.000 description 6
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 5
- 241000186063 Arthrobacter Species 0.000 description 5
- 241000588722 Escherichia Species 0.000 description 5
- 101100536194 Escherichia coli prrB gene Proteins 0.000 description 5
- 101100275984 Halothiobacillus neapolitanus (strain ATCC 23641 / c2) csoS4A gene Proteins 0.000 description 5
- 101100275987 Halothiobacillus neapolitanus (strain ATCC 23641 / c2) csoS4B gene Proteins 0.000 description 5
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 5
- 238000002105 Southern blotting Methods 0.000 description 5
- 101100406376 Streptomyces antibioticus orfB gene Proteins 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 101100169253 Walleye dermal sarcoma virus orfA gene Proteins 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 229960000723 ampicillin Drugs 0.000 description 5
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 5
- 108010068380 arginylarginine Proteins 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 101150089204 easF gene Proteins 0.000 description 5
- 101150017627 easG gene Proteins 0.000 description 5
- GJXWDTUCERCKIX-UHFFFAOYSA-N fosmidomycin Chemical compound O=CN(O)CCCP(O)(O)=O GJXWDTUCERCKIX-UHFFFAOYSA-N 0.000 description 5
- 229950006501 fosmidomycin Drugs 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 235000002639 sodium chloride Nutrition 0.000 description 5
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 4
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 4
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 4
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 4
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 4
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 241000193830 Bacillus <bacterium> Species 0.000 description 4
- 241000186146 Brevibacterium Species 0.000 description 4
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 4
- 241000186226 Corynebacterium glutamicum Species 0.000 description 4
- 241000282326 Felis catus Species 0.000 description 4
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 4
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 4
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 4
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 4
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 4
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 4
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 4
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 4
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 4
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- 101710173438 Late L2 mu core protein Proteins 0.000 description 4
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 4
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 4
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 4
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- 241000985284 Leuciscus idus Species 0.000 description 4
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 4
- 102000004316 Oxidoreductases Human genes 0.000 description 4
- 108090000854 Oxidoreductases Proteins 0.000 description 4
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 4
- 101710188315 Protein X Proteins 0.000 description 4
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 4
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 4
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 4
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 4
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 4
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 4
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 4
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 4
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 4
- 229930003448 Vitamin K Natural products 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- 238000010353 genetic engineering Methods 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 4
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 4
- SHUZOJHMOBOZST-UHFFFAOYSA-N phylloquinone Natural products CC(C)CCCCC(C)CCC(C)CCCC(=CCC1=C(C)C(=O)c2ccccc2C1=O)C SHUZOJHMOBOZST-UHFFFAOYSA-N 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 235000000346 sugar Nutrition 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 235000019168 vitamin K Nutrition 0.000 description 4
- 239000011712 vitamin K Substances 0.000 description 4
- 150000003721 vitamin K derivatives Chemical class 0.000 description 4
- 229940046010 vitamin k Drugs 0.000 description 4
- 239000012138 yeast extract Substances 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 241000186361 Actinobacteria <class> Species 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 3
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- 241000192542 Anabaena Species 0.000 description 3
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 3
- 241000190831 Chromatium Species 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 241000588698 Erwinia Species 0.000 description 3
- 101100312842 Escherichia coli prrD gene Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 3
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 3
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 3
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 3
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 3
- 102000001253 Protein Kinase Human genes 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 241000235070 Saccharomyces Species 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 3
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 3
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 3
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 150000007524 organic acids Chemical class 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108060006633 protein kinase Proteins 0.000 description 3
- 239000012521 purified sample Substances 0.000 description 3
- 239000011347 resin Substances 0.000 description 3
- 229920005989 resin Polymers 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 230000005030 transcription termination Effects 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- JLIDBLDQVAYHNE-YKALOCIXSA-N (+)-Abscisic acid Chemical compound OC(=O)/C=C(/C)\C=C\[C@@]1(O)C(C)=CC(=O)CC1(C)C JLIDBLDQVAYHNE-YKALOCIXSA-N 0.000 description 2
- SXOUIMVOMIGLHO-AATRIKPKSA-N (E)-3-(indol-2-yl)acrylic acid Chemical compound C1=CC=C2NC(/C=C/C(=O)O)=CC2=C1 SXOUIMVOMIGLHO-AATRIKPKSA-N 0.000 description 2
- ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylpentanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)CC)C(O)=O ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 0.000 description 2
- 108010036211 5-HT-moduline Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 2
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 2
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 2
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 2
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 2
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 2
- ZDBWKBCKYJGKGP-DCAQKATOSA-N Arg-Leu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O ZDBWKBCKYJGKGP-DCAQKATOSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 2
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 2
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 2
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 2
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 2
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 2
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 2
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 2
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 2
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 2
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- JEBFVOLFMLUKLF-IFPLVEIFSA-N Astaxanthin Natural products CC(=C/C=C/C(=C/C=C/C1=C(C)C(=O)C(O)CC1(C)C)/C)C=CC=C(/C)C=CC=C(/C)C=CC2=C(C)C(=O)C(O)CC2(C)C JEBFVOLFMLUKLF-IFPLVEIFSA-N 0.000 description 2
- 241001203868 Autographa californica Species 0.000 description 2
- 102000004031 Carboxy-Lyases Human genes 0.000 description 2
- 108090000489 Carboxy-Lyases Proteins 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 2
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 2
- ZLFRUAFDAIFNHN-LKXGYXEUSA-N Cys-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O ZLFRUAFDAIFNHN-LKXGYXEUSA-N 0.000 description 2
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 2
- OHCQJHSOBUTRHG-KGGHGJDLSA-N FORSKOLIN Chemical compound O=C([C@@]12O)C[C@](C)(C=C)O[C@]1(C)[C@@H](OC(=O)C)[C@@H](O)[C@@H]1[C@]2(C)[C@@H](O)CCC1(C)C OHCQJHSOBUTRHG-KGGHGJDLSA-N 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 2
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 2
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 2
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 2
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 2
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 2
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 2
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 2
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 2
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 2
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 2
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 2
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 2
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 2
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 2
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 2
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 2
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 2
- 101000578940 Homo sapiens PDZ domain-containing protein MAGIX Proteins 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 2
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 2
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 2
- XGPNARQELVNWIP-UHFFFAOYSA-N Ile-Phe-Thr-Pro Chemical compound C1CCC(C(O)=O)N1C(=O)C(C(C)O)NC(=O)C(NC(=O)C(N)C(C)CC)CC1=CC=CC=C1 XGPNARQELVNWIP-UHFFFAOYSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- RRHGJUQNOFWUDK-UHFFFAOYSA-N Isoprene Chemical group CC(=C)C=C RRHGJUQNOFWUDK-UHFFFAOYSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 2
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 2
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 2
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 2
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 2
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 2
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 2
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 2
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 2
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 2
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 241000589323 Methylobacterium Species 0.000 description 2
- 241001467578 Microbacterium Species 0.000 description 2
- 101100293261 Mus musculus Naa15 gene Proteins 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 101150082943 NAT1 gene Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 102100028326 PDZ domain-containing protein MAGIX Human genes 0.000 description 2
- 241000588696 Pantoea ananatis Species 0.000 description 2
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 2
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 2
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 2
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- 241000192608 Phormidium Species 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- 229920002565 Polyethylene Glycol 400 Polymers 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 2
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 2
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 2
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 241000191025 Rhodobacter Species 0.000 description 2
- 241000187561 Rhodococcus erythropolis Species 0.000 description 2
- 241000190967 Rhodospirillum Species 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 2
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 235000019764 Soybean Meal Nutrition 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 241000192707 Synechococcus Species 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 2
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 2
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 2
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 2
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 241000223230 Trichosporon Species 0.000 description 2
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 2
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 2
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 2
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 2
- XQMGDVVKFRLQKH-BBRMVZONSA-N Trp-Val-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O)=CNC2=C1 XQMGDVVKFRLQKH-BBRMVZONSA-N 0.000 description 2
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 2
- RIFVTNDKUMSSMN-ULQDDVLXSA-N Tyr-His-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O RIFVTNDKUMSSMN-ULQDDVLXSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 2
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 2
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- POFQRHFHYPSCOI-FHWLQOOXSA-N Val-Trp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N POFQRHFHYPSCOI-FHWLQOOXSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 241000588901 Zymomonas Species 0.000 description 2
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- OENHQHLEOONYIE-UKMVMLAPSA-N all-trans beta-carotene Natural products CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C OENHQHLEOONYIE-UKMVMLAPSA-N 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 229910021529 ammonia Inorganic materials 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 235000013793 astaxanthin Nutrition 0.000 description 2
- MQZIGYBFDRPAKN-ZWAPEEGVSA-N astaxanthin Chemical compound C([C@H](O)C(=O)C=1C)C(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)C(=O)[C@@H](O)CC1(C)C MQZIGYBFDRPAKN-ZWAPEEGVSA-N 0.000 description 2
- 239000001168 astaxanthin Substances 0.000 description 2
- 229940022405 astaxanthin Drugs 0.000 description 2
- 235000013734 beta-carotene Nutrition 0.000 description 2
- 239000011648 beta-carotene Substances 0.000 description 2
- TUPZEYHYWIEDIH-WAIFQNFQSA-N beta-carotene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCCC1(C)C)C=CC=C(/C)C=CC2=CCCCC2(C)C TUPZEYHYWIEDIH-WAIFQNFQSA-N 0.000 description 2
- 229960002747 betacarotene Drugs 0.000 description 2
- 229910000019 calcium carbonate Inorganic materials 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 101150089730 gly-10 gene Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 2
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 208000019622 heart disease Diseases 0.000 description 2
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 230000003308 immunostimulating effect Effects 0.000 description 2
- PLVPPLCLBIEYEA-UHFFFAOYSA-N indoleacrylic acid Natural products C1=CC=C2C(C=CC(=O)O)=CNC2=C1 PLVPPLCLBIEYEA-UHFFFAOYSA-N 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 150000007522 mineralic acids Chemical class 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 235000005985 organic acids Nutrition 0.000 description 2
- 239000003973 paint Substances 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 230000001766 physiological effect Effects 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 108010029895 rubimetide Proteins 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 150000004354 sesquiterpene derivatives Chemical class 0.000 description 2
- 150000002653 sesterterpene derivatives Chemical class 0.000 description 2
- 239000004455 soybean meal Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 235000007586 terpenes Nutrition 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- OENHQHLEOONYIE-JLTXGRSLSA-N β-Carotene Chemical compound CC=1CCCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C OENHQHLEOONYIE-JLTXGRSLSA-N 0.000 description 2
- QGVLYPPODPLXMB-UBTYZVCOSA-N (1aR,1bS,4aR,7aS,7bS,8R,9R,9aS)-4a,7b,9,9a-tetrahydroxy-3-(hydroxymethyl)-1,1,6,8-tetramethyl-1,1a,1b,4,4a,7a,7b,8,9,9a-decahydro-5H-cyclopropa[3,4]benzo[1,2-e]azulen-5-one Chemical compound C1=C(CO)C[C@]2(O)C(=O)C(C)=C[C@H]2[C@@]2(O)[C@H](C)[C@@H](O)[C@@]3(O)C(C)(C)[C@H]3[C@@H]21 QGVLYPPODPLXMB-UBTYZVCOSA-N 0.000 description 1
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- DMASLKHVQRHNES-UPOGUZCLSA-N (3R)-beta,beta-caroten-3-ol Chemical compound C([C@H](O)CC=1C)C(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C DMASLKHVQRHNES-UPOGUZCLSA-N 0.000 description 1
- JKQXZKUSFCKOGQ-JLGXGRJMSA-N (3R,3'R)-beta,beta-carotene-3,3'-diol Chemical compound C([C@H](O)CC=1C)C(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)C[C@@H](O)CC1(C)C JKQXZKUSFCKOGQ-JLGXGRJMSA-N 0.000 description 1
- ZNAIHAPCDVUWRX-DUCUPYJCSA-N (4s,4as,5as,6s,12ar)-7-chloro-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide;4-amino-n-(4,6-dimethylpyrimidin-2-yl)benzenesulfonamide;(2s,5r,6r)-3,3-dimethyl-7-oxo-6-[(2-phenylacetyl)amino]-4-t Chemical compound CC1=CC(C)=NC(NS(=O)(=O)C=2C=CC(N)=CC=2)=N1.N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1.C1=CC(Cl)=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O ZNAIHAPCDVUWRX-DUCUPYJCSA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 1
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- AJPADPZSRRUGHI-RFZPGFLSSA-N 1-deoxy-D-xylulose 5-phosphate Chemical compound CC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O AJPADPZSRRUGHI-RFZPGFLSSA-N 0.000 description 1
- ICFIZJQGJAJRSU-UHFFFAOYSA-N 2,3-Dimethoxy-5-methyl-6-<3,7,11,15,19,23,27,31-octamethyl-dotriacontaoctaen-(2,6,10,14,18,22,26,30)-yl>benzochinon Natural products COC1=C(OC)C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C1=O ICFIZJQGJAJRSU-UHFFFAOYSA-N 0.000 description 1
- HYPYXGZDOYTYDR-HAJWAVTHSA-N 2-methyl-3-[(2e,6e,10e,14e)-3,7,11,15,19-pentamethylicosa-2,6,10,14,18-pentaenyl]naphthalene-1,4-dione Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CCC=C(C)C)=C(C)C(=O)C2=C1 HYPYXGZDOYTYDR-HAJWAVTHSA-N 0.000 description 1
- OKZYCXHTTZZYSK-UHFFFAOYSA-N 3-hydroxy-3-methyl-5-phosphonooxypentanoic acid Chemical compound OC(=O)CC(O)(C)CCOP(O)(O)=O OKZYCXHTTZZYSK-UHFFFAOYSA-N 0.000 description 1
- NPOAOTPXWNWTSH-UHFFFAOYSA-N 3-hydroxy-3-methylglutaric acid Chemical compound OC(=O)CC(O)(C)CC(O)=O NPOAOTPXWNWTSH-UHFFFAOYSA-N 0.000 description 1
- 241000190969 Afifella marina Species 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- VGMNWQOPSFBBBG-XUXIUFHCSA-N Ala-Leu-Leu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VGMNWQOPSFBBBG-XUXIUFHCSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- 241001147780 Alicyclobacillus Species 0.000 description 1
- 241000880915 Allochromatium warmingii Species 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- 241000192537 Anabaena cylindrica Species 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- 241000185996 Arthrobacter citreus Species 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 239000012619 Butyl Sepharose® Substances 0.000 description 1
- 101100327917 Caenorhabditis elegans chup-1 gene Proteins 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 101100348617 Candida albicans (strain SC5314 / ATCC MYA-2876) NIK1 gene Proteins 0.000 description 1
- 102100033779 Collagen alpha-4(IV) chain Human genes 0.000 description 1
- 241001517047 Corynebacterium acetoacidophilum Species 0.000 description 1
- 241000186145 Corynebacterium ammoniagenes Species 0.000 description 1
- 241000807905 Corynebacterium glutamicum ATCC 14067 Species 0.000 description 1
- 239000004212 Cryptoxanthin Substances 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- 229920002271 DEAE-Sepharose Polymers 0.000 description 1
- 108010076804 DNA Restriction Enzymes Proteins 0.000 description 1
- SUZLHDUTVMZSEV-UHFFFAOYSA-N Deoxycoleonol Natural products C12C(=O)CC(C)(C=C)OC2(C)C(OC(=O)C)C(O)C2C1(C)C(O)CCC2(C)C SUZLHDUTVMZSEV-UHFFFAOYSA-N 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241001452028 Escherichia coli DH1 Species 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- 241000617590 Escherichia coli K1 Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 101100321669 Fagopyrum esculentum FA02 gene Proteins 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 102100021736 Galectin-1 Human genes 0.000 description 1
- 102100024637 Galectin-10 Human genes 0.000 description 1
- 101001011019 Gallus gallus Gallinacin-10 Proteins 0.000 description 1
- 101001011021 Gallus gallus Gallinacin-12 Proteins 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- 229930191978 Gibberellin Natural products 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- 241001401556 Glutamicibacter mysorens Species 0.000 description 1
- 241001524188 Glutamicibacter nicotianae Species 0.000 description 1
- 241001524175 Glutamicibacter protophormiae Species 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- PNMUAGGSDZXTHX-BYPYZUCNSA-N Gly-Gln Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(N)=O PNMUAGGSDZXTHX-BYPYZUCNSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- 102100036738 Guanine nucleotide-binding protein subunit alpha-11 Human genes 0.000 description 1
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- 101000710870 Homo sapiens Collagen alpha-4(IV) chain Proteins 0.000 description 1
- 101100283445 Homo sapiens GNA11 gene Proteins 0.000 description 1
- 101100293260 Homo sapiens NAA15 gene Proteins 0.000 description 1
- 101150102264 IE gene Proteins 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- 241000283160 Inia Species 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 241001531414 Isochromatium buderi Species 0.000 description 1
- 241000186984 Kitasatospora aureofaciens Species 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 239000004782 Lambda Substances 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 101001090725 Leuconostoc gelidum Bacteriocin leucocin-A Proteins 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241000589308 Methylobacterium extorquens Species 0.000 description 1
- 241001148217 Methylobacterium rhodesianum Species 0.000 description 1
- SIGQQUBJQXSAMW-LURJTMIESA-N Mevalonic acid 5-pyrophosphate Chemical compound OC(=O)C[C@](O)(C)CCOP(O)(=O)OP(O)(O)=O SIGQQUBJQXSAMW-LURJTMIESA-N 0.000 description 1
- 241000144155 Microbacterium ammoniaphilum Species 0.000 description 1
- 101100412856 Mus musculus Rhod gene Proteins 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 102100026781 N-alpha-acetyltransferase 15, NatA auxiliary subunit Human genes 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101150012394 PHO5 gene Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000157908 Paenarthrobacter aurescens Species 0.000 description 1
- 241001524178 Paenarthrobacter ureafaciens Species 0.000 description 1
- 241000157907 Paeniglutamicibacter sulfureus Species 0.000 description 1
- 241000588701 Pectobacterium carotovorum Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- 241000192605 Phormidium sp. Species 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000191023 Rhodobacter capsulatus Species 0.000 description 1
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 1
- 241000190932 Rhodopseudomonas Species 0.000 description 1
- 241000190950 Rhodopseudomonas palustris Species 0.000 description 1
- 241000190982 Rhodothalassium salexigens Species 0.000 description 1
- 241000190980 Rhodovibrio salinarum Species 0.000 description 1
- 101100007329 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) COS1 gene Proteins 0.000 description 1
- 101100221606 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) COS7 gene Proteins 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 241000311088 Schwanniomyces Species 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- 241000881765 Serratia ficaria Species 0.000 description 1
- 241000607717 Serratia liquefaciens Species 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000187758 Streptomyces ambofaciens Species 0.000 description 1
- 241001454747 Streptomyces aureus Species 0.000 description 1
- 241000971005 Streptomyces fungicidicus Species 0.000 description 1
- 241000970979 Streptomyces griseochromogenes Species 0.000 description 1
- 241000187398 Streptomyces lividans Species 0.000 description 1
- 241000813830 Streptomyces olivogriseus Species 0.000 description 1
- 241000970898 Streptomyces rameus Species 0.000 description 1
- 241000345373 Streptomyces sp. CL190 Species 0.000 description 1
- 241000946755 Streptomyces tanashiensis Species 0.000 description 1
- 241000187123 Streptomyces vinaceus Species 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 101100242191 Tetraodon nigroviridis rho gene Proteins 0.000 description 1
- 241000190988 Thermochromatium tepidum Species 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- 241000255993 Trichoplusia ni Species 0.000 description 1
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- JKQXZKUSFCKOGQ-LQFQNGICSA-N Z-zeaxanthin Natural products C([C@H](O)CC=1C)C(C)(C)C=1C=CC(C)=CC=CC(C)=CC=CC=C(C)C=CC=C(C)C=CC1=C(C)C[C@@H](O)CC1(C)C JKQXZKUSFCKOGQ-LQFQNGICSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- QOPRSMDTRDMBNK-RNUUUQFGSA-N Zeaxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCC(O)C1(C)C)C=CC=C(/C)C=CC2=C(C)CC(O)CC2(C)C QOPRSMDTRDMBNK-RNUUUQFGSA-N 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- AJPADPZSRRUGHI-MGVUJODPSA-N [(2R,3S)-5-deuterio-2,3-dihydroxy-4-oxopentyl] dihydrogen phosphate Chemical compound [2H]CC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O AJPADPZSRRUGHI-MGVUJODPSA-N 0.000 description 1
- XMWHRVNVKDKBRG-CRCLSJGQSA-N [(2s,3r)-2,3,4-trihydroxy-3-methylbutyl] dihydrogen phosphate Chemical compound OC[C@](O)(C)[C@@H](O)COP(O)(O)=O XMWHRVNVKDKBRG-CRCLSJGQSA-N 0.000 description 1
- MZVQCMJNVPIDEA-UHFFFAOYSA-N [CH2]CN(CC)CC Chemical group [CH2]CN(CC)CC MZVQCMJNVPIDEA-UHFFFAOYSA-N 0.000 description 1
- OJFDKHTZOUZBOS-CITAKDKDSA-N acetoacetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 OJFDKHTZOUZBOS-CITAKDKDSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 208000033325 adult 1 primary lateral sclerosis Diseases 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000012670 alkaline solution Substances 0.000 description 1
- JKQXZKUSFCKOGQ-LOFNIBRQSA-N all-trans-Zeaxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2=C(C)CC(O)CC2(C)C JKQXZKUSFCKOGQ-LOFNIBRQSA-N 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 229940030225 antihemorrhagics Drugs 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 235000002360 beta-cryptoxanthin Nutrition 0.000 description 1
- DMASLKHVQRHNES-ITUXNECMSA-N beta-cryptoxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2=C(C)CCCC2(C)C DMASLKHVQRHNES-ITUXNECMSA-N 0.000 description 1
- 230000023555 blood coagulation Effects 0.000 description 1
- 230000004097 bone metabolism Effects 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- WORJEOGGNQDSOE-UHFFFAOYSA-N chloroform;methanol Chemical compound OC.ClC(Cl)Cl WORJEOGGNQDSOE-UHFFFAOYSA-N 0.000 description 1
- 229930002875 chlorophyll Natural products 0.000 description 1
- 235000019804 chlorophyll Nutrition 0.000 description 1
- 229940106705 chlorophyll Drugs 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- OHCQJHSOBUTRHG-UHFFFAOYSA-N colforsin Natural products OC12C(=O)CC(C)(C=C)OC1(C)C(OC(=O)C)C(O)C1C2(C)C(O)CCC1(C)C OHCQJHSOBUTRHG-UHFFFAOYSA-N 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 229910000365 copper sulfate Inorganic materials 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 235000019244 cryptoxanthin Nutrition 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- FCRACOPGPMPSHN-UHFFFAOYSA-N desoxyabscisic acid Natural products OC(=O)C=C(C)C=CC1C(C)=CC(=O)CC1(C)C FCRACOPGPMPSHN-UHFFFAOYSA-N 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 229930004069 diterpene Natural products 0.000 description 1
- 125000000567 diterpene group Chemical group 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 108010030074 endodeoxyribonuclease MluI Proteins 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000011790 ferrous sulphate Substances 0.000 description 1
- 235000003891 ferrous sulphate Nutrition 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 125000005519 fluorenylmethyloxycarbonyl group Chemical group 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- IXORZMNAPKEEDV-UHFFFAOYSA-N gibberellic acid GA3 Natural products OC(=O)C1C2(C3)CC(=C)C3(O)CCC2C2(C=CC3O)C1C3(C)C(=O)O2 IXORZMNAPKEEDV-UHFFFAOYSA-N 0.000 description 1
- 239000003448 gibberellin Substances 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 239000006451 grace's insect medium Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 239000002874 hemostatic agent Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 1
- 230000001678 irradiating effect Effects 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 239000002949 juvenile hormone Substances 0.000 description 1
- 229930014550 juvenile hormone Natural products 0.000 description 1
- 150000003633 juvenile hormone derivatives Chemical class 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- GVALZJMUIHGIMD-UHFFFAOYSA-H magnesium phosphate Chemical compound [Mg+2].[Mg+2].[Mg+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O GVALZJMUIHGIMD-UHFFFAOYSA-H 0.000 description 1
- 239000004137 magnesium phosphate Substances 0.000 description 1
- 229910000157 magnesium phosphate Inorganic materials 0.000 description 1
- 229960002261 magnesium phosphate Drugs 0.000 description 1
- 235000010994 magnesium phosphates Nutrition 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229940099596 manganese sulfate Drugs 0.000 description 1
- 239000011702 manganese sulphate Substances 0.000 description 1
- 235000007079 manganese sulphate Nutrition 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 229960001961 meglutol Drugs 0.000 description 1
- LXKDFTDVRVLXFY-WQWYCSGDSA-N menaquinone-8 Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CCC=C(C)C)=C(C)C(=O)C2=C1 LXKDFTDVRVLXFY-WQWYCSGDSA-N 0.000 description 1
- 101150030900 mgr gene Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 235000013379 molasses Nutrition 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 229930003658 monoterpene Natural products 0.000 description 1
- 150000002773 monoterpene derivatives Chemical class 0.000 description 1
- 235000002577 monoterpenes Nutrition 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000002611 ovarian Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 101150019841 penP gene Proteins 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- QGVLYPPODPLXMB-QXYKVGAMSA-N phorbol Natural products C[C@@H]1[C@@H](O)[C@]2(O)[C@H]([C@H]3C=C(CO)C[C@@]4(O)[C@H](C=C(C)C4=O)[C@@]13O)C2(C)C QGVLYPPODPLXMB-QXYKVGAMSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 150000003097 polyterpenes Chemical class 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- BDERNNFJNOPAEC-UHFFFAOYSA-N propan-1-ol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- WQGWDDDVZFFDIG-UHFFFAOYSA-N pyrogallol Chemical compound OC1=CC=CC(O)=C1O WQGWDDDVZFFDIG-UHFFFAOYSA-N 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 102220201851 rs143406017 Human genes 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 229940098362 serratia marcescens Drugs 0.000 description 1
- 229930004725 sesquiterpene Natural products 0.000 description 1
- 229930002368 sesterterpene Natural products 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 150000003535 tetraterpenes Chemical class 0.000 description 1
- 235000009657 tetraterpenes Nutrition 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- KBPHJBAIARWVSC-XQIHNALSSA-N trans-lutein Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2C(=CC(O)CC2(C)C)C KBPHJBAIARWVSC-XQIHNALSSA-N 0.000 description 1
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 1
- 150000003648 triterpenes Chemical class 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- ICFIZJQGJAJRSU-SGHXUWJISA-N ubiquinone-8 Chemical compound COC1=C(OC)C(=O)C(C\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)=C(C)C1=O ICFIZJQGJAJRSU-SGHXUWJISA-N 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229930195735 unsaturated hydrocarbon Natural products 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000019143 vitamin K2 Nutrition 0.000 description 1
- 239000011728 vitamin K2 Substances 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 229940041603 vitamin k 3 Drugs 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 235000010930 zeaxanthin Nutrition 0.000 description 1
- 239000001775 zeaxanthin Substances 0.000 description 1
- 229940043269 zeaxanthin Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
【0001】[0001]
【発明の属する技術分野】本発明は、放線菌由来のメバ
ロン酸経路に関与する複数の酵素をコードするDNA、
該DNAに含まれる各酵素をコードするDNAとそれに
よってコードされるタンパク質、該DNAを含むベクタ
ー及び形質転換体、並びに該DNAを用いたイソプレノ
イド化合物の製造方法に関する。[0001] The present invention relates to a DNA encoding a plurality of enzymes involved in the mevalonate pathway derived from actinomycetes,
The present invention relates to a DNA encoding each enzyme contained in the DNA, a protein encoded thereby, a vector and a transformant containing the DNA, and a method for producing an isoprenoid compound using the DNA.
【0002】[0002]
【従来の技術】イソプレノイド(テルペンとも称され
る)は、炭素数5のイソプレン単位を基本骨格に持つ一
群の有機化合物の総称であり、イソペンテニルピロリン
酸(IPP)の重合によって生合成される。(C5H8)
nの不飽和炭化水素以外に、それらの酸化還元生成物
(アルコール、ケトン、酸など)、炭素の脱離した化合
物などが多くの植物および動物体内に見い出されてい
る。イソプレノイドは、炭素数によりヘミテルペン(C
5)、モノテルペン(C10)、セスキテルペン(C1
5)、ジテルペン(C20)、セスタテルペン(C2
5)、トリテルペン(C30)、テトラテルペン(C4
0、カロテノイド)、およびその他のポリテルペンに分
類することができる。アブシジン酸、幼若ホルモン、ジ
ベレリン、フォルスコリン、ホルボールなど生理活性を
示す化合物も多い。また、構造の一部にイソプレン構造
を有する複合テルペンとしてクロロフィル、ビタミン
K、ユビキノン、tRNAなどがあり、これらも有用な
生理活性を示す。2. Description of the Related Art Isoprenoids (also referred to as terpenes) are a general term for a group of organic compounds having an isoprene unit having 5 carbon atoms as a basic skeleton, and are biosynthesized by polymerization of isopentenyl pyrophosphate (IPP). (C 5 H 8)
In addition to n unsaturated hydrocarbons, their redox products (alcohols, ketones, acids, etc.), compounds from which carbon has been eliminated, and the like have been found in many plants and animals. Isoprenoids are classified into hemiterpenes (C
5), monoterpene (C10), sesquiterpene (C1)
5), diterpenes (C20), sesterterpenes (C2)
5), triterpene (C30), tetraterpene (C4
0, carotenoids), and other polyterpenes. There are many compounds that show physiological activity, such as abscisic acid, juvenile hormone, gibberellin, forskolin, and phorbol. In addition, chlorophyll, vitamin K, ubiquinone, tRNA and the like as complex terpenes having an isoprene structure in a part of the structure include useful physiological activities.
【0003】例えば、イソプレノイド化合物の1種であ
るユビキノンは電子伝達系の必須成分として、生体内で
重要な機能を果たしており、心疾患に効果のある医薬品
として使用されているほか、欧米では健康食品としての
需要が増大している。また、ビタミンKは血液凝固系に
関与する重要なビタミンであり、止血剤として利用され
ているほか、最近では骨代謝への関与が示唆され、骨粗
鬆症の治療薬への応用が期待されており、フィロキノン
とメナキノンは医薬品として認可されている。[0003] For example, ubiquinone, which is one of the isoprenoid compounds, plays an important role in the living body as an essential component of the electron transport system, and is used as a drug effective for heart disease. As demand increases. Vitamin K is an important vitamin involved in the blood coagulation system, and is used as a hemostatic agent. Recently, it is suggested that vitamin K is involved in bone metabolism, and is expected to be applied to a therapeutic drug for osteoporosis. Philoquinone and menaquinone have been approved as pharmaceuticals.
【0004】また、ユビキノンやビタミンKには貝類の
付着阻害作用があり、貝類付着防止塗料への応用が期待
される。さらに、カロチノイドには抗酸化作用があり、
β−カロチン、アスタキサンチン、クリプトキサンチン
など、がん予防や免疫賦活活性を有するものとして期待
されているものもある。このように、イソプレノイド化
合物の中には生体にとって有用な物質が含まれており、
これらの安価な製造方法が確立されれば、社会的にも医
学的にも多大な恩恵があると思われる。[0004] Ubiquinone and vitamin K have an inhibitory effect on the adhesion of shellfish, and are expected to be applied to paints for preventing adhesion of shellfish. In addition, carotenoids have an antioxidant effect,
Some of them, such as β-carotene, astaxanthin, and cryptoxanthin, are expected to have cancer prevention and immunostimulatory activities. As described above, isoprenoid compounds include substances useful for living organisms,
The establishment of these inexpensive manufacturing methods would have significant social and medical benefits.
【0005】発酵法によるイソプレノイド化合物の生産
は以前から検討されており、培養条件の検討や変異処理
による菌株育種、さらに遺伝子工学的手法による生産量
の向上への試みもなされている。しかし、その効果は個
々の化合物種に限定されており、イソプレノイド化合物
全般に効果のある方法は知られていない。イソプレノイ
ド化合物の基本骨格単位であるイソペンテニルピロリン
酸(IPP)は、動物や酵母などの真核生物ではアセチ
ルCoAからメバロン酸を経由して生合成される(メバ
ロン酸経路)ことが証明されている。[0005] Production of isoprenoid compounds by fermentation has been studied for some time, and attempts have been made to examine culture conditions, breed strains by mutagenesis, and to improve production by genetic engineering techniques. However, its effect is limited to individual compound species, and no effective method is known for isoprenoid compounds in general. Isopentenyl pyrophosphate (IPP), a basic skeleton unit of isoprenoid compounds, has been proven to be biosynthesized from acetyl-CoA via mevalonic acid in eukaryotes such as animals and yeasts (mevalonic acid pathway). .
【0006】メバロン酸経路では3−ヒドロキシ−3−
メチルグルタリルCoA(HMG-CoA)リダクターゼが律速酵素
であると考えられており(Mo1. Bio1. Cell, 5, 655(19
94))、酵母において、HMG-CoAリダクターゼを高発現さ
せ、カロテノイドの生産性を向上させる試みがなされて
いる(三沢ら、カロテノイド研究談話会講演要旨集(199
7))。In the mevalonate pathway, 3-hydroxy-3-
Methylglutaryl-CoA (HMG-CoA) reductase is considered to be the rate-limiting enzyme (Mo1. Bio1. Cell, 5, 655 (19
94)), attempts have been made to increase the expression of HMG-CoA reductase in yeast and to improve the productivity of carotenoids (Mizawa et al., Proceedings of the Carotenoid Research Symposium (199)
7)).
【0007】大腸菌などの原核生物では、別の経路、即
ち、ピルビン酸とグリセルアルデヒド3−リン酸が縮合
して生じる1−デオキシ−D−キシルロース5−リン酸
を経由してIPPが生合成される非メバロン酸経路が多
くの原核生物において発見されており(Biochem. J., 2
95, 517(1993))、13Cラベル化基質を使った実験から1
デオキシ−D−キシルロース5−リン酸は2−C−メチル
−D−エリスリトール4−リン酸を経由してIPPへと
転換されることが証明されている(Tetrahedron Lett.
38, 4769 (1997);Tetrahedron Lett. 39, 4509 (199
8))。特に、大腸菌ではIPPは非メバロン酸経路での
み合成されることが実証されている(Rohmer, M. In Co
mprehensive Natural Products Chemistry, Vol. 2: Is
oprenoidsincluding carotenoids and steroids; Barto
n, D. Nakanishi, K. Eds. Elsevier: Amsterdam, 199
9; pp. 45-67)。[0007] In prokaryotes such as Escherichia coli, IPP is biosynthesized via another pathway, namely, 1-deoxy-D-xylulose 5-phosphate generated by the condensation of pyruvate and glyceraldehyde 3-phosphate. Non-mevalonate pathways have been discovered in many prokaryotes (Biochem. J., 2
95, 517 (1993)), from experiments using 13 C-labeled substrates.
Deoxy-D-xylulose 5-phosphate has been shown to be converted to IPP via 2-C-methyl-D-erythritol 4-phosphate (Tetrahedron Lett.
38, 4769 (1997); Tetrahedron Lett. 39, 4509 (199
8)). In particular, it has been demonstrated that IPP is synthesized only in the non-mevalonate pathway in E. coli (Rohmer, M. In Co.).
mprehensive Natural Products Chemistry, Vol. 2: Is
oprenoidsincluding carotenoids and steroids; Barto
n, D. Nakanishi, K. Eds. Elsevier: Amsterdam, 199
9; pp. 45-67).
【0008】一方、放線菌 Streptomyces sp. CL190 株
(J. Antibiot. 43:444, 1990)はメバロン酸経由でI
PPを合成していることは分かっており(Tetrahedron
Lett. 31:6025, 1990. とTetrahedron Lett. 37:7979,
1996.)、本発明者らはこれまでに、放線菌 Streptomyc
es sp. CL190 株から、メバロン酸経路上の一つの反応
を触媒する酵素、3−ヒドロキシー3−メチルグルタリ
ル CoA (HMG−CoA)レダクターゼをコードする遺
伝子(hmgr)を既にクローニングしていた(J. Bac
teriol. 181:1256, 1999)。しかしながら、本放線菌株
に存在すると考えられる、メバロン酸経路に関与する他
の酵素をコードする遺伝子は未だクローニングされてい
ない。On the other hand, the actinomycete Streptomyces sp. Strain CL190 (J. Antibiot. 43: 444, 1990) is used for the transfer of I via mevalonic acid.
It is known that PP is synthesized (Tetrahedron
Lett. 31: 6025, 1990. and Tetrahedron Lett. 37: 7979,
1996.), the present inventors have previously reported that actinomycetes Streptomyc
The gene (hmgr) encoding 3-hydroxy-3-methylglutaryl CoA (HMG-CoA) reductase, an enzyme that catalyzes one reaction on the mevalonate pathway, has already been cloned from es sp. strain CL190 (J . Bac
teriol. 181: 1256, 1999). However, genes encoding other enzymes involved in the mevalonate pathway, which are considered to be present in this actinomycete strain, have not been cloned yet.
【0009】[0009]
【発明が解決しようとする課題】本発明の課題の一つ
は、心疾患、骨粗鬆症、止血、がん予防、免疫賦活等を
目的とした医薬品、健康食品および貝類付着防止塗料等
に有用なイソプレノイド化合物の生合成経路の一つであ
るメバロン酸経路に関与する遺伝子群を含むDNAを提
供することである。さらに本発明の別の課題は、上記D
NAを宿主細胞に導入して得た形質転換体を培養するこ
とによってイソプレノイド化合物を製造する方法を提供
することである。One of the objects of the present invention is to provide isoprenoids useful for medicines, health foods, and shellfish adhesion preventive paints for the purpose of heart disease, osteoporosis, hemostasis, cancer prevention, immunostimulation, etc. An object of the present invention is to provide a DNA containing a group of genes involved in the mevalonate pathway, which is one of the biosynthetic pathways of compounds. Still another object of the present invention is to provide the above-mentioned D
An object of the present invention is to provide a method for producing an isoprenoid compound by culturing a transformant obtained by introducing NA into a host cell.
【0010】[0010]
【課題を解決するための手段】本発明者らは、上記課題
を解決するために鋭意検討した結果、放線菌 Streptomy
ces sp. CL190 株のメバロン酸経路に関与する遺伝子を
取得し、それを大腸菌に形質転換して得た形質転換体を
培養したところ、イソプレノイド化合物の1種であるユ
ビキノンの生産量が向上していることを見出し本発明を
完成するに至った。Means for Solving the Problems The present inventors have conducted intensive studies to solve the above-mentioned problems, and as a result, have found that Streptomyces
A gene involved in the mevalonate pathway of ces sp. strain CL190 was obtained and transformed into Escherichia coli. The resulting transformant was cultured. As a result, the production of ubiquinone, one of the isoprenoid compounds, was improved. And completed the present invention.
【0011】即ち、本発明によれば、下記の何れかを有
するDNA: (1)配列番号1の塩基配列; (2)配列番号1において1から数個の塩基が欠失、置
換、付加及び/または挿入されている塩基配列であっ
て、メバロン酸経路を機能させるのに必要な酵素を全て
コードする塩基配列;または (3)配列番号1の塩基配列とストリンジェントな条件
下でハイブリダイズすることができる塩基配列であっ
て、メバロン酸経路を機能させるのに必要な酵素を全て
コードする塩基配列:が提供される。好ましくは、メバ
ロン酸経路を機能させるのに必要な酵素は、少なくとも
ホスホメバロン酸キナーゼ、ジホスホメバロン酸デカル
ボキシラーゼ、メバロン酸キナーゼ、HMG−CoAレ
ダクターゼ及びHMG−CoAシンターゼである。本発
明の別の側面によれば、上記DNAによりコードされる
タンパク質が提供される。That is, according to the present invention, a DNA having any one of the following: (1) a base sequence of SEQ ID NO: 1; (2) a deletion, substitution, addition, And / or an inserted nucleotide sequence encoding all of the enzymes necessary for the function of the mevalonate pathway; or (3) hybridizing with the nucleotide sequence of SEQ ID NO: 1 under stringent conditions. A base sequence that encodes all of the enzymes necessary for the functioning of the mevalonate pathway. Preferably, the enzymes necessary for functioning the mevalonate pathway are at least phosphomevalonate kinase, diphosphomevalonate decarboxylase, mevalonate kinase, HMG-CoA reductase and HMG-CoA synthase. According to another aspect of the present invention, there is provided a protein encoded by the DNA.
【0012】本発明のさらに別の側面によれば、下記の
何れかを有するDNA: (1)配列番号2の塩基配列、配列番号2において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、ホスホメバロン酸キナーゼを
コードする塩基配列、あるいは配列番号2の塩基配列と
ストリンジェントな条件下でハイブリダイズすることが
できる塩基配列であって、ホスホメバロン酸キナーゼを
コードする塩基配列; (2)配列番号3の塩基配列、配列番号3において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、ジホスホメバロン酸デカルボ
キシラーゼをコードする塩基配列、あるいは配列番号3
の塩基配列とストリンジェントな条件下でハイブリダイ
ズすることができる塩基配列であって、ジホスホメバロ
ン酸デカルボキシラーゼをコードする塩基配列; (3)配列番号4の塩基配列、配列番号4において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、メバロン酸キナーゼをコード
する塩基配列、あるいは配列番号4の塩基配列とストリ
ンジェントな条件下でハイブリダイズすることができる
塩基配列であって、メバロン酸キナーゼをコードする塩
基配列; (4)配列番号5の塩基配列;あるいは (5)配列番号7の塩基配列、配列番号7において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、HMG−CoAシンターゼを
コードする塩基配列、あるいは配列番号7の塩基配列と
ストリンジェントな条件下でハイブリダイズすることが
できる塩基配列であって、HMG−CoAシンターゼを
コードする塩基配列;が提供される。本発明のさらに別
の側面によれば、上記DNAによるコードされるタンパ
ク質が提供される。According to still another aspect of the present invention, a DNA having any one of the following: (1) a nucleotide sequence of SEQ ID NO: 2; And / or a base sequence that is inserted and is a base sequence that encodes phosphomevalonate kinase or a base sequence that can hybridize under stringent conditions to the base sequence of SEQ ID NO: 2; (2) the base sequence of SEQ ID NO: 3, which is a base sequence in which one to several bases have been deleted, substituted, added and / or inserted in SEQ ID NO: 3, wherein diphosphomevalonate decarboxylase is SEQ ID NO: 3, or the nucleotide sequence encoding
A base sequence capable of hybridizing under stringent conditions with the base sequence of SEQ ID NO: 4; a base sequence encoding diphosphomevalonate decarboxylase; (3) a base sequence of SEQ ID NO: 4; Is a nucleotide sequence having deletion, substitution, addition and / or insertion of a base, which is capable of hybridizing under stringent conditions to the nucleotide sequence encoding mevalonate kinase or the nucleotide sequence of SEQ ID NO: 4. A nucleotide sequence encoding mevalonate kinase; (4) a nucleotide sequence of SEQ ID NO: 5; or (5) a nucleotide sequence of SEQ ID NO: 7; , A base sequence that has been substituted, added and / or inserted and that codes for HMG-CoA synthase, A nucleotide sequence capable of hybridizing with the nucleotide sequence under stringent conditions of SEQ ID NO: 7, the nucleotide sequence encoding HMG-CoA synthase; is provided. According to still another aspect of the present invention, there is provided a protein encoded by the DNA.
【0013】本発明のさらに別の側面によれば、下記の
何れかを有するタンパク質: (1)配列番号8のアミノ酸配列、配列番号8において
1から数個のアミノ酸が欠失、置換、付加及び/または
挿入されているアミノ酸配列であって、ホスホメバロン
酸キナーゼ活性を有するアミノ酸配列、あるいは配列番
号8のアミノ酸配列と60%以上の相同性を有するアミ
ノ酸配列であって、ホスホメバロン酸キナーゼ活性を有
するアミノ酸配列; (2)配列番号9のアミノ酸配列、配列番号9において
1から数個のアミノ酸が欠失、置換、付加及び/または
挿入されているアミノ酸配列であって、ジホスホメバロ
ン酸デカルボキシラーゼ活性を有するアミノ酸配列、あ
るいは配列番号9のアミノ酸配列と60%以上の相同性
を有するアミノ酸配列であって、ジホスホメバロン酸デ
カルボキシラーゼ活性を有するアミノ酸配列; (3)配列番号10のアミノ酸配列、配列番号10にお
いて1から数個のアミノ酸が欠失、置換、付加及び/ま
たは挿入されているアミノ酸配列であって、メバロン酸
キナーゼ活性を有するアミノ酸配列、あるいは配列番号
10のアミノ酸配列と60%以上の相同性を有するアミ
ノ酸配列であって、メバロン酸キナーゼ活性を有するア
ミノ酸配列; (4)配列番号11のアミノ酸配列;あるいは (5)配列番号13のアミノ酸配列、配列番号13にお
いて1から数個のアミノ酸が欠失、置換、付加及び/ま
たは挿入されているアミノ酸配列であって、HMG−C
oAシンターゼ活性を有するアミノ酸配列、あるいは配
列番号13のアミノ酸配列と60%以上の相同性を有す
るアミノ酸配列であって、HMG−CoAシンターゼ活
性を有するアミノ酸配列;が提供される。According to yet another aspect of the invention, a protein having any of the following: (1) the amino acid sequence of SEQ ID NO: 8, wherein one to several amino acids are deleted, substituted, added and An amino acid sequence having phosphomevalonate kinase activity, which is an amino acid sequence having phosphomevalonate kinase activity or an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 8 Sequence; (2) an amino acid sequence of SEQ ID NO: 9, in which one to several amino acids are deleted, substituted, added and / or inserted in SEQ ID NO: 9, wherein the amino acid has diphosphomevalonate decarboxylase activity Sequence or an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 9 An amino acid sequence having diphosphomevalonate decarboxylase activity; (3) an amino acid sequence of SEQ ID NO: 10, in which one to several amino acids are deleted, substituted, added and / or inserted in SEQ ID NO: 10 An amino acid sequence having mevalonate kinase activity, or an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 10 and having an mevalonate kinase activity; (4) SEQ ID NO: 11 Or (5) the amino acid sequence of SEQ ID NO: 13, wherein the amino acid sequence of SEQ ID NO: 13 in which one to several amino acids are deleted, substituted, added, and / or inserted, wherein HMG-C
an amino acid sequence having oA synthase activity or an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 13 and having HMG-CoA synthase activity;
【0014】本発明のさらに別の側面によれば、本発明
の上記DNAを含むベクターが提供される。本発明のさ
らに別の側面によれば、上記ベクターを有する形質転換
体が提供される。好ましくは形質転換体は大腸菌であ
る。本発明のさらに別の側面によれば、本発明の上記D
NAを含むベクターを宿主に形質転換して作製した形質
転換体を培養して培養物中にイソプレノイド化合物を生
成させる工程、及び該培養物からイソプレノイド化合物
を採取する工程を含む、イソプレノイド化合物の製造方
法が提供される。好ましくは、イソプレノイド化合物
は、ユビキノン、ビタミンK2、またはカロテノイドか
ら選択されるイソプレノイド化合物である。According to still another aspect of the present invention, there is provided a vector comprising the above DNA of the present invention. According to still another aspect of the present invention, there is provided a transformant having the above vector. Preferably, the transformant is E. coli. According to yet another aspect of the present invention, the above D of the present invention
A method for producing an isoprenoid compound, comprising the steps of: culturing a transformant produced by transforming a vector containing NA into a host to produce an isoprenoid compound in a culture; and collecting the isoprenoid compound from the culture. Is provided. Preferably, the isoprenoid compound is a isoprenoid compound selected ubiquinone, vitamin K 2 or carotenoid.
【0015】[0015]
【発明の実施の形態】以下、本発明の実施方法および実
施態様について詳細に説明する。本明細書において「1
から数個の塩基が欠失、置換、付加及び/または挿入さ
れている」とは、例えば1〜20個、好ましくは1〜1
5個、より好ましくは1〜10個、さらに好ましくは1
〜5個の任意の数の塩基が欠失、置換、付加及び/また
は挿入されていることを意味する。本明細書において
「1から数個のアミノ酸が欠失、置換、付加及び/また
は挿入されている」とは、例えば1〜20個、好ましく
は1〜15個、より好ましくは1〜10個、さらに好ま
しくは1〜5個の任意の数のアミノ酸が欠失、置換、付
加及び/または挿入されていることを意味する。本明細
書において「ストリンジェントな条件下でハイブリダイ
ズすることができる」とは、DNAをプローブとして使用
し、コロニー・ハイブリダイゼーション法、プラークハ
イブリダイゼーション法、あるいはサザンブロットハイ
ブリダイゼーション法等を用いることにより得られるDN
Aを意味し、具体的には、コロニーあるいはプラーク由
来のDNAまたは該DNAの断片を固定化したフィルターを用
いて、0.7〜1.0MのNaCl存在下、65℃でハイブリ
ダイゼーションを行った後、0.1〜2倍程度のSSC溶
液(1倍濃度のSSC溶液の組成は、150mM塩化ナトリ
ウム、15mMクエン酸ナトリウム)を用い、65℃条件
下でフィルターを洗浄することにより同定できるDNAを
あげることができる。ハイブリダイゼーションは、Mole
cular Cloning: A laboratory Mannual, 2nd Ed., Cold
Spring HarborLaboratory, Cold Spring Harbor, NY.,
1989. 以後 "モレキュラークローニング第2版" と略
す)等に記載されている方法に準じて行うことができ
る。ストリンジェントな条件下でハイブリダイズするこ
とができるDNAとしては、プローブとして使用するD
NAの塩基配列と一定以上の相同性を有するDNAが挙
げられ、相同性は、例えば60%以上、好ましくは70
%以上、より好ましくは80%以上、さらに好ましくは
90%以上、特に好ましくは95%以上、最も好ましく
は98%以上である。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a method and an embodiment of the present invention will be described in detail. In this specification, “1”
"In which several bases are deleted, substituted, added and / or inserted", for example, from 1 to 20, preferably from 1 to 1
5, more preferably 1 to 10, even more preferably 1
It means that any number of bases from 5 to 5 have been deleted, substituted, added and / or inserted. In the present specification, "1 to several amino acids are deleted, substituted, added and / or inserted" means, for example, 1 to 20, preferably 1 to 15, more preferably 1 to 10, More preferably, 1 to 5 amino acids are deleted, substituted, added and / or inserted. As used herein, `` can hybridize under stringent conditions '' means that DNA is used as a probe and colony hybridization, plaque hybridization, or Southern blot hybridization is used. Obtained DN
A, specifically, hybridization was performed at 65 ° C. in the presence of 0.7 to 1.0 M NaCl using a filter on which colony or plaque-derived DNA or a fragment of the DNA was immobilized. Thereafter, DNA that can be identified by washing the filter at 65 ° C. using an SSC solution of about 0.1 to 2 times (the composition of the SSC solution having a 1 × concentration is 150 mM sodium chloride and 15 mM sodium citrate) is used. I can give it. Hybridization is performed by Mole
cular Cloning: A laboratory Mannual, 2 nd Ed., Cold
Spring Harbor Laboratory, Cold Spring Harbor, NY.,
1989. Hereinafter, abbreviated as "Molecular Cloning 2nd Edition") and the like. DNAs that can hybridize under stringent conditions include the D
DNA having a certain degree of homology with the base sequence of NA is mentioned, and the homology is, for example, 60% or more, preferably 70% or more.
% Or more, more preferably 80% or more, further preferably 90% or more, particularly preferably 95% or more, and most preferably 98% or more.
【0016】本発明はまた、配列番号8、9、10また
は13のアミノ酸配列と60%以上の相同性を有するア
ミノ酸配列であって所望の活性を有するアミノ酸を有す
るタンパク質に関する。配列番号8、9、10または1
3のアミノ酸配列との相同性は60%以上であれば特に
制限はなく、例えば、60%以上、好ましくは70%以
上、より好ましくは80%以上、さらに好ましくは90
%以上、特に好ましくは95%以上、最も好ましくは9
8%以上である。The present invention also relates to a protein having an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 8, 9, 10 or 13 and having an amino acid having a desired activity. SEQ ID NO: 8, 9, 10 or 1
The homology with the amino acid sequence of No. 3 is not particularly limited as long as it is 60% or more, and is, for example, 60% or more, preferably 70% or more, more preferably 80% or more, and further preferably 90% or more.
% Or more, particularly preferably 95% or more, most preferably 9% or more.
8% or more.
【0017】本明細書で言う「メバロン酸経路」とは、 (1)アセトアセチルCoAがHMG−CoAに変換す
る工程(HMG−CoAシンターゼが触媒する); (2)HMG−CoAがメバロン酸に変換する工程(H
MG−CoAレダクターゼが触媒する); (3)メバロン酸がピロホスホメバロン酸に変換する工
程(メバロン酸(MVA)キナーゼ、ホスホメバロン酸
(PMVA)キナーゼにより触媒及び調節される);及
び (4)ピロホスホメバロン酸がイソペンテニルピロリン
酸(IPP)に変換する工程(ジホスホメバロン酸(P
MVA)デカルボキシラーゼが触媒する):によってI
PPが生合成される経路である。メバロン酸経路を機能
させるのに必要な酵素としては、少なくともPMVAキ
ナーゼ、PMVAデカルボキシラーゼ、MVAキナー
ゼ、HMG−CoAレダクターゼ及びHMG−CoAシ
ンターゼが挙げられる。The term "mevalonate pathway" as used herein means: (1) a step of converting acetoacetyl-CoA to HMG-CoA (catalyzed by HMG-CoA synthase); (2) a step of converting HMG-CoA to mevalonate. Step of converting (H
(3) the step of converting mevalonate to pyrophosphomevalonate (catalyzed and regulated by mevalonate (MVA) kinase, phosphomevalonate (PMVA) kinase); and (4) pyro Step of converting phosphomevalonic acid to isopentenyl pyrophosphate (IPP) (diphosphomevalonic acid (P
MVA) decarboxylase catalyzed):
This is the pathway by which PP is biosynthesized. Enzymes required for the functioning of the mevalonate pathway include at least PMVA kinase, PMVA decarboxylase, MVA kinase, HMG-CoA reductase and HMG-CoA synthase.
【0018】次に、本発明のDNAの取得方法およびそ
の利用方法について説明する。 (A)放線菌のメバロン酸経路に関与する酵素をコード
するDNAの取得 前記した通り、本発明者らはこれまでに、放線菌 Strep
tomyces sp. CL190 株から、メバロン酸経路上の一つの
反応を触媒する酵素、3−ヒドロキシー3−メチルグル
タリル CoA (HMG−CoA)レダクターゼをコードす
る遺伝子(hmgr)をクローニングしている(J. Bac
teriol. 181:1256, 1999)。本発明の配列番号1の塩基
配列を有するDNAは、このhmgr遺伝子をプローブ
として使用することによって取得することができる。h
mgr遺伝子の塩基配列としては、配列番号6に記載の
塩基配列を挙げることができる。メバロン酸経路に関与
する酵素をコードするDNA領域の取得法としては、具体
的には以下の方法をあげることができる。Next, a method for obtaining the DNA of the present invention and a method for using the same will be described. (A) Acquisition of DNA Encoding Enzyme Involved in Actinomycete Mevalonate Pathway As described above, the present inventors have previously described Streptomyces strep
A gene (hmgr) encoding 3-hydroxy-3-methylglutaryl CoA (HMG-CoA) reductase, an enzyme that catalyzes one reaction on the mevalonate pathway, has been cloned from tomyces sp. strain CL190 (J. Bac
teriol. 181: 1256, 1999). The DNA having the nucleotide sequence of SEQ ID NO: 1 of the present invention can be obtained by using the hmgr gene as a probe. h
As the nucleotide sequence of the mgr gene, the nucleotide sequence of SEQ ID NO: 6 can be mentioned. Specific examples of a method for obtaining a DNA region encoding an enzyme involved in the mevalonate pathway include the following methods.
【0019】放線菌、例えばStreptomyces sp. CL190
株を適当な培地、例えばGPY培地(1%グルコース、
0.4%ポリペプトン、0.4%イーストエクストラク
ト、0.5%MgSO4・7H2O、0.1%K2HP
O4)で適当な温度(例えば、30℃)で数日間培養す
る。培養後、得られた培養液より遠心分離により菌体を
取得し、菌体より、定法(モレキュラークローニング第
2版)に従い染色体DNAを単離精製する。得られた染
色体DNAを適当な制限酵素(例えば、SnaBIな
ど)で切断した後,hmgr遺伝子をプローブとして用
いたサザンハイブリダイゼーション(モレキュラークロ
ーニング第2版)を行う。サザンハイブリダイゼーショ
ンの結果、特定の位置(例えば、染色体DNAの消化の
ための制限酵素としてSnaBIを使用した場合には、
6.7kbの位置)にプローブのシグナルが検出され
る。Actinomycetes, such as Streptomyces sp. CL190
The strain is placed in a suitable medium, such as GPY medium (1% glucose,
0.4% polypeptone, 0.4% yeast extract, 0.5% MgSO 4 · 7H 2 O, 0.1% K 2 HP
Incubate for several days at an appropriate temperature (eg, 30 ° C.) at O 4 ). After culturing, cells are obtained from the resulting culture by centrifugation, and chromosomal DNA is isolated and purified from the cells according to a standard method (Molecular Cloning, 2nd edition). After cutting the obtained chromosomal DNA with an appropriate restriction enzyme (for example, SnaBI), Southern hybridization (molecular cloning second edition) using the hmgr gene as a probe is performed. As a result of Southern hybridization, a specific position (for example, when SnaBI is used as a restriction enzyme for digestion of chromosomal DNA,
The probe signal is detected at 6.7 kb).
【0020】次に、Streptomyces sp. CL190 株の染色
体DNAを上記と同じ制限酵素(例えば、SnaBI)
で再度切断後、アガロースゲル電流泳動を行い、サザン
ハイブリダイゼーションの結果シグナルが検出された位
置(制限酵素としてSnaBIを使用した場合には、
6.7kbの位置)に対応するDNA断片をアガロース
ゲルから抽出して回収する。この回収したDNA断片を
T4DNAポリメラーゼ(宝酒造から購入)を用いて平
滑末端にし、適当なプラスミド(例えば、pUC118
など)に挿入し、放線菌Streptomyces sp. CL190 株の
染色体DNAライブラリーを作製する。この染色体DN
Aライブラリーを用いて好適な宿主(例えば、E. coli
JM109株など)を定法(モレキュラークローニング第2
版)に従って形質転換し、形質転換体をhmgr遺伝子
をプローブに用いたコロニーハイブリダイゼーション法
によりスクリーニングすることによりhmgr遺伝子を
含むプラスミドを持つ大腸菌の形質転換体を単離するこ
とができる。単離した形質転換体から、常法に従いプラ
スミドを抽出することにより、hmgr遺伝子を含むD
NA断片を単離することができる。Next, the chromosomal DNA of Streptomyces sp. Strain CL190 was ligated to the same restriction enzyme (for example, SnaBI) as described above.
After re-cutting, agarose gel electrophoresis was performed, and the position at which a signal was detected as a result of Southern hybridization (when SnaBI was used as a restriction enzyme,
DNA fragment corresponding to 6.7 kb) is extracted from the agarose gel and collected. The recovered DNA fragment is blunt-ended using T4 DNA polymerase (purchased from Takara Shuzo), and an appropriate plasmid (for example, pUC118) is used.
) To produce a chromosomal DNA library of Streptomyces sp. CL190 strain. This chromosome DN
Using a suitable host (eg, E. coli)
JM109 strain, etc.) (Molecular cloning second
Version), and by screening the transformant by a colony hybridization method using the hmgr gene as a probe, a transformant of Escherichia coli having a plasmid containing the hmgr gene can be isolated. By extracting a plasmid from the isolated transformant according to a conventional method, D
The NA fragment can be isolated.
【0021】上記方法により単離できるDNAの一例と
しては、配列番号1の塩基配列を有するDNAが挙げら
れる。また、配列番号1の塩基配列の部分配列である配
列番号2、配列番号3、配列番号4、配列番号5または
配列番号7の塩基配列を有するDNAも本発明の範囲内
である。配列番号2、配列番号3、配列番号4、配列番
号5または配列番号7の塩基配列を有するDNAは、配
列番号1の塩基配列の情報に基づいて制限酵素処理また
はPCRなどを適宜使用して当業者に公知の常法により
取得することができる。An example of a DNA that can be isolated by the above method is a DNA having the nucleotide sequence of SEQ ID NO: 1. Also, a DNA having the nucleotide sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, or SEQ ID NO: 7, which is a partial sequence of the nucleotide sequence of SEQ ID NO: 1, is within the scope of the present invention. The DNA having the nucleotide sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5 or SEQ ID NO: 7 is obtained by appropriately treating the DNA with the nucleotide sequence of SEQ ID NO: 1 with restriction enzyme treatment or PCR. It can be obtained by a conventional method known to a trader.
【0022】PCR法により配列番号2、配列番号3、配
列番号4、配列番号5または配列番号7の塩基配列を有
するDNAを取得するためには、放線菌Streptomyces s
p. CL190 株の染色体DNAまたは配列番号1の塩基配
列を有するDNAなどを鋳型として使用し、配列番号
2、配列番号3、配列番号4、配列番号5または配列番
号7の塩基配列を増幅できるように設計した1対のプラ
イマーを使用して、TaKaRa LA-PCRTM Kit Ver.2(宝酒造
社製)またはExpandTM High-Fidelity PCR System(べ一
リンガー・マンハイム社製)等を用い、DNAThermal Cycl
er(パーキンエルマージャパン社製)にてPCRを行う。な
お、後のクローニング操作を容易にするために、プライ
マーには適当な制限酵素部位を付加させておくことが好
ましい。In order to obtain a DNA having the nucleotide sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5 or SEQ ID NO: 7 by PCR, the actinomycetes Streptomyces ssp.
Using the chromosomal DNA of the p.CL190 strain or a DNA having the nucleotide sequence of SEQ ID NO: 1 as a template, the nucleotide sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5 or SEQ ID NO: 7 can be amplified. using a pair of primers designed using the TaKaRa LA-PCR TM Kit Ver.2 (Takara Shuzo Co., Ltd.) or Expand TM High-Fidelity PCR System (base made one Ringer Mannheim) or the like, DNAThermal Cycl
Perform PCR using er (PerkinElmer Japan). In order to facilitate the subsequent cloning operation, it is preferable to add an appropriate restriction enzyme site to the primer.
【0023】PCRの反応条件として、94℃で30秒間(変
性)、55℃で30秒〜1分間(アニーリング)、72℃で2分
間(伸長)からなる反応工程を1サイクルとして、例え
ば30サイクル行った後、72℃で7分間反応させる条件を
挙げることができる。次いで、増幅されたDNA断片を、
大腸菌で増幅可能な適切なベクター中にクローニングす
ることができる。クローニングは、常法、例えば、モレ
キュラークローニング第2版、Current Protocols in M
olecular Biology, Supplement 1〜38, John Wiley & S
ons (1987-1997)(以下、カレント・プロトコールズ・
イン・モレキュラー・バイオロジーと略す)、DNA Clon
ing 1: CoreTechniques, A Practical Approach, Secon
d Edition, Oxford University Press (1995)等に記載
された方法、あるいは市販のキット、例えばSuperScrip
t Plasmid System for cDNA Synthesis and Plasmid Cl
oning (ライフ・テクノロジーズ社製)やZAP-cDNASynthe
sis Kit〔ストラタジーン(Staratagene)社製〕を用いて
行なうことができる。As a reaction condition of PCR, a reaction process consisting of 94 ° C. for 30 seconds (denaturation), 55 ° C. for 30 seconds to 1 minute (annealing), and 72 ° C. for 2 minutes (extension) is defined as one cycle, for example, 30 cycles. After the reaction, conditions for reacting at 72 ° C. for 7 minutes can be exemplified. Next, the amplified DNA fragment is
It can be cloned into an appropriate vector that can be amplified in E. coli. Cloning is performed by a conventional method, for example, Molecular Cloning, 2nd edition, Current Protocols in M
olecular Biology, Supplement 1-38, John Wiley & S
ons (1987-1997) (hereafter, Current Protocols
In Molecular Biology), DNA Clon
ing 1: CoreTechniques, A Practical Approach, Secon
d Edition, Oxford University Press (1995), or a commercially available kit such as SuperScrip
t Plasmid System for cDNA Synthesis and Plasmid Cl
oning (manufactured by Life Technologies) and ZAP-cDNASynthe
It can be performed using sis Kit (manufactured by Stratagene).
【0024】クローニングベクターとしては、大腸菌K1
2株中で自律複製できるものであれば、ファージベクタ
ー、プラスミドベクター等いずれでも使用できる、大腸
菌の発現用ベクターをクローニングベクターとして用い
てもよい。具体的には、ZAPExpress〔ストラタジーン社
製、Strategies, 5, 58 (1992)〕、pBluescrlpt IISK
(+)〔Nuclelc Acids Research, 17, 9494(1989)〕、Lam
bda ZAP II(ストラタジーン社製)、λgt10、λgt11〔DN
A Cloning, A Practical Approach, 1, 49(1985)〕、λ
TriplEx(クローンテック社製)、λExCell(ファルマシア
社製)、pT7T318U(ファルマシア社製)、pcD2〔Mo1. Cen.
Bio1., 3, 280 (1983)〕、pMW218(和光純薬社製)、pUC
118(宝酒造社製)、pEG400〔J.Bac., 172, 2392 (199
0)〕、pQE-30 (QIAGEN社製)等をあげることができる。As a cloning vector, Escherichia coli K1
Escherichia coli expression vectors, which can be used as phage vectors or plasmid vectors, as long as they can autonomously replicate in the two strains, may be used as cloning vectors. Specifically, ZAPExpress (Stratagies, 5, 58 (1992)), pBluescrlpt IISK
(+) (Nuclelc Acids Research, 17, 9494 (1989)), Lam
bda ZAP II (Stratagene), λgt10, λgt11 (DN
A Cloning, A Practical Approach, 1, 49 (1985)), λ
TriplEx (Clontech), λExCell (Pharmacia), pT7T318U (Pharmacia), pcD2 (Mo1.Cen.
Bio1, 3., 280 (1983)), pMW218 (manufactured by Wako Pure Chemical Industries), pUC
118 (Takara Shuzo), pEG400 (J. Bac., 172, 2392 (199
0)], pQE-30 (manufactured by QIAGEN) and the like.
【0025】得られた形質転換株より、目的とするDNA
を含有したプラスミドを常法、例えば、モレキュラーク
ローニング第2版、カレント・プロトコールズ・イン・
モレキュラー・バイオロジー、DNA Cloning 1: Core Te
chniques, A Practical Approach, Second Edition, Ox
ford University Press (1995)等に記載された方法によ
り取得することができる。上記方法により、配列番号
2、配列番号3、配列番号4、配列番号5又は配列番号
7の塩基配列を有するDNAを取得することができる。From the obtained transformant, the target DNA
Can be prepared by a conventional method, for example, Molecular Cloning 2nd Edition, Current Protocols in
Molecular biology, DNA Cloning 1: Core Te
chniques, A Practical Approach, Second Edition, Ox
It can be obtained by the method described in ford University Press (1995) and the like. According to the above method, DNA having the nucleotide sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, or SEQ ID NO: 7 can be obtained.
【0026】また、配列番号1、配列番号2、配列番号
3、配列番号4または配列番号7の塩基配列において1
から数個の塩基が欠失、置換、付加及び/または挿入さ
れている塩基配列であって、特定の活性を有する酵素タ
ンパク質をコードする塩基配列、あるいは配列番号1、
配列番号2、配列番号3、配列番号4または配列番号7
の塩基配列とストリンジェントな条件下でハイブリダイ
ズすることができる塩基配列であって、特定の活性を有
する酵素タンパク質をコードする塩基配列も本発明の範
囲内である。In the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NO: 7,
A base sequence in which several bases have been deleted, substituted, added and / or inserted, wherein the base sequence encodes an enzyme protein having a specific activity, or SEQ ID NO: 1,
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NO: 7
A nucleotide sequence that can hybridize with a nucleotide sequence under stringent conditions, and that encodes an enzyme protein having a specific activity, is also within the scope of the present invention.
【0027】例えば、配列番号1、配列番号2、配列番
号3、配列番号4または配列番号7の塩基配列を有する
放線菌由来のDNA断片の塩基配列を利用し、他の微生物
等より、該DNAのホモログを適当な条件下でスクリーニ
ングすることにより単離することができる。あるいは、
上記したような変異DNAは、化学合成、遺伝子工学的
手法、突然変異誘発などの当業者に既知の任意の方法で
作製することもできる。具体的には、配列番号1、配列
番号2、配列番号3、配列番号4または配列番号7の塩
基配列を有するDNAを利用し、これらDNAに変異を
導入することにより変異DNAを取得することができ
る。For example, using the nucleotide sequence of a DNA fragment derived from an actinomycete having the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 7 Can be isolated by screening under appropriate conditions. Or,
The mutant DNA as described above can also be prepared by any method known to those skilled in the art, such as chemical synthesis, genetic engineering techniques, and mutagenesis. Specifically, it is possible to obtain a mutant DNA by using a DNA having the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 7 and introducing a mutation into these DNAs. it can.
【0028】例えば、配列番号1、配列番号2、配列番
号3、配列番号4または配列番号7の塩基配列を有する
DNAに対し、変異原となる薬剤と接触作用させる方
法、紫外線を照射する方法、遺伝子工学的手法等を用い
て行うことができる。遺伝子工学的手法の一つである部
位特異的変異誘発法は特定の位置に特定の変異を導入で
きる手法であることから有用であり、モレキュラークロ
ーニング第2版、カレント・プロトコールズ・イン・モ
レキュラー・バイオロジー、NucleicAcids Research, 1
0, 6487, 1982、Nucleic Acids Research, 12, 9441, 1
984、Nucleic Acids Research, 13, 4431, 1985、Nucle
ic Acids Research, 13, 8749, 1985、Proc. Natl. Aca
d. Sci. USA, 79, 6409, 1982、Proc. Natl. Acad. Sc
i. USA, 82, 488, 1985、Gene, 34, 315, 1985、Gene,
102, 67, 1991等に記載の方法に準じて行うことができ
る。For example, a method of bringing a DNA having the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NO: 7 into contact with a mutagenic agent, a method of irradiating ultraviolet rays, It can be performed using a genetic engineering technique or the like. Site-directed mutagenesis, which is one of the genetic engineering techniques, is useful because it is a technique that can introduce a specific mutation at a specific position. Molecular cloning, 2nd edition, Current Protocols in Molecular Co., Ltd. Biology, NucleicAcids Research, 1
0, 6487, 1982, Nucleic Acids Research, 12, 9441, 1
984, Nucleic Acids Research, 13, 4431, 1985, Nucle
ic Acids Research, 13, 8749, 1985, Proc. Natl. Aca
d. Sci. USA, 79, 6409, 1982, Proc. Natl. Acad. Sc
i. USA, 82, 488, 1985, Gene, 34, 315, 1985, Gene,
102, 67, 1991 and the like.
【0029】(B)放線菌のメバロン酸経路の酵素をコ
ードするDNAを有する形質転換体の作製と上記酵素タ
ンパク質の発現 上記のようにして得られたDNAを宿主細胞中で発現さ
せるためには、まず、目的とする該DNA断片を、制限
酵素あるいはDNA分解酵素で、該遺伝子を含む適当な
長さのDNA断片とした後に、発現ベクター中において
プロモーターの下流に挿入し、次いで上記DNAを挿入
した発現ベクターを、発現ベクターに適合した宿主細胞
中に導入する。(B) Preparation of a Transformant Having DNA Encoding the Actinomycete Mevalonate Pathway Enzyme and Expression of the Enzyme Protein In order to express the DNA obtained as described above in host cells, First, the DNA fragment of interest is converted into a DNA fragment of an appropriate length containing the gene with a restriction enzyme or a DNA degrading enzyme, and then inserted downstream of a promoter in an expression vector. The obtained expression vector is introduced into a host cell suitable for the expression vector.
【0030】宿主細胞としては、目的とする遺伝子を発
現できるものは全て用いることができる。例えば、エッ
シェリヒア属、セラチア属、コリネバクテリウム属、ブ
レビバクテリウム属、シュードモナス属、バチルス属、
ミクロバクテリウム属等に属する細菌、クルイベロミセ
ス属、サッカロマイセス属、シゾサッカロマイセス属、
トリコスポロン属、シワニオミセス属等に属する酵母や
動物細胞、昆虫細胞等をあげることができる。なお、本
発明のDNAはメバロン酸経路に関与する酵素をコード
するものであるが、宿主細胞としてはメバロン酸経路を
本来有さない細菌でも、メバロン酸経路を有する酵母や
動物細胞、昆虫細胞の何れでも構わない。As the host cell, any cell that can express the gene of interest can be used. For example, Escherichia, Serratia, Corynebacterium, Brevibacterium, Pseudomonas, Bacillus,
Bacteria belonging to the genus Microbacterium, Kluyveromyces, Saccharomyces, Schizosaccharomyces,
Examples include yeast, animal cells, insect cells, and the like belonging to the genus Trichosporon, Siwaniomyces and the like. Although the DNA of the present invention encodes an enzyme involved in the mevalonate pathway, even a bacterium which does not originally have a mevalonate pathway as a host cell can be used for yeast, animal cells, and insect cells having the mevalonate pathway. Either is acceptable.
【0031】発現ベクターとしては、上記宿主細胞にお
いて自立複製可能ないしは染色体中への組込みが可能
で、上記目的とするDNAを転写できる位置にプロモータ
ーを含有しているものが用いられる。細菌等を宿主細胞
として用いる場合は、上記DNAを発現させるための発現
ベクターは該細菌中で自立複製可能であると同時に、プ
ロモーター、リボソーム結合配列、上記DNAおよび転写
終結配列より構成された組換えベクターであることが好
ましい。プロモーターを制御する遺伝子が含まれていて
もよい。As the expression vector, those which can replicate autonomously in the above-mentioned host cells or can be integrated into a chromosome, and which contain a promoter at a position where the above-mentioned target DNA can be transcribed are used. When a bacterium or the like is used as a host cell, an expression vector for expressing the above DNA is capable of autonomous replication in the bacterium and, at the same time, has a recombination composed of a promoter, a ribosome binding sequence, the above DNA and a transcription termination sequence. It is preferably a vector. A gene that controls the promoter may be included.
【0032】発現ベクターとしては、例えば、pBTrP2、
pBTac1、pBTac2(いずれもべ一リンガーマンハイム社よ
り市販)、pKK233-2(Pharmacia社製)、pSE280(Invitroge
n社製)、pGEMEX-1(Promega社製)、pQE-8(QIAGEN社製)、
pQE-30(QIAGEN社製)、pKYP10(特開昭58-110600)、pKYP2
00〔Agrc.Biol.Chem., 48, 669(1984)〕、PLSA1〔Agrc.
Blo1. Chem., 53, 277(1989)〕、pGEL1〔Proc. Natl.
Acad. Sci. USA, 82,4306 (1985)〕、pBluescrlptII SK
+、pBluescriptII SK(-)(Stratagene社製)、pTrS30(FER
MBP-5407)、pTrS32(FERM BP-5408)、pGEX(Pharmacia社
製)、pET-3(Novagen社製)、pTerm2(US4686191、US49390
94、US5160735)、pSupex、pUB110、pTP5、pC194、pUC18
〔Gene, 33, 103(1985)〕、pUC19〔Gene, 33, 103(198
5)〕、pSTV28(宝酒造社製)、pSTV29(宝酒造社製)、pUC1
18(宝酒造社製)、pPA1(特開昭63-233798)、pEG400〔J.
Bacterio1., 172, 2392(1990)〕、pQE-30(QIAGEN社製)
等を例示することができる。As an expression vector, for example, pBTrP2,
pBTac1, pBTac2 (all commercially available from Behringer Ringermanheim), pKK233-2 (Pharmacia), pSE280 (Invitroge
n), pGEMEX-1 (Promega), pQE-8 (QIAGEN),
pQE-30 (manufactured by QIAGEN), pKYP10 (JP-A-58-110600), pKYP2
00 (Agrc. Biol. Chem., 48, 669 (1984)), PLSA1 (Agrc.
Blo1.Chem., 53, 277 (1989)], pGEL1 (Proc. Natl.
Acad. Sci. USA, 82,4306 (1985)], pBluescrlptII SK
+, PBluescriptII SK (-) (Stratagene), pTrS30 (FER
MBP-5407), pTrS32 (FERM BP-5408), pGEX (Pharmacia), pET-3 (Novagen), pTerm2 (US4686191, US49390)
94, US5160735), pSupex, pUB110, pTP5, pC194, pUC18
(Gene, 33, 103 (1985)), pUC19 (Gene, 33, 103 (198
5)), pSTV28 (Takara Shuzo), pSTV29 (Takara Shuzo), pUC1
18 (manufactured by Takara Shuzo Co., Ltd.), pPA1 (JP-A-63-233798), pEG400 (J.
Bacterio 1., 172, 2392 (1990)), pQE-30 (QIAGEN)
And the like.
【0033】プロモーターとしては、宿主細胞中で発現
できるものであればいかなるものでもよい。例えば、tr
pプロモーター(P trp)、lacプロモーター(P lac)、PLプ
ロモーター、PRプロモーター、PSEプロモーター等の、
大腸菌やファージ等に由来するプロモーター、SP01プロ
モーター、SP02プロモーター、penPプロモーター等をあ
げることができる。またP trpを2つ直列させたプロモー
ター(P trp×2)、tacプロモーター、letlプロモータ
ー、lacT7プロモーターのように人為的に設計改変され
たプロモーター等も用いることができる。As the promoter, any promoter can be used as long as it can be expressed in a host cell. For example, tr
p promoter (P trp), lac promoter (P lac), P L promoter, P R promoter and P SE promoter,
Promoters derived from Escherichia coli, phage and the like, SP01 promoter, SP02 promoter, penP promoter and the like can be mentioned. Further, artificially designed and modified promoters such as a promoter in which two P trps are connected in series (P trp × 2), a tac promoter, a letl promoter, and a lacT7 promoter can also be used.
【0034】リボソーム結合配列としては、宿主細胞中
で発現できるものであればいかなるものでもよいが、シ
ャインーダルガノ(Shine-Dalgamo)配列と開始コドンと
の間を適当な距離(例えば6〜18塩基)に調節したプラス
ミドを用いることが好ましい。目的とするDNAの発現に
は転写終結配列は必ずしも必要ではないが、好適には構
造遺伝子直下に転写終結配列を配置することが望まし
い。The ribosome binding sequence is not particularly limited as long as it can be expressed in a host cell, but an appropriate distance (for example, 6 to 18) is used between the Shine-Dalgamo sequence and the initiation codon. (Base) is preferably used. Although a transcription termination sequence is not always necessary for expression of the target DNA, it is desirable to arrange the transcription termination sequence immediately below the structural gene.
【0035】宿主細胞としては、Escherichia属、Coryn
ebacterium属、Brevibacterium属、Bacillus属、Microb
acterium属、Serratia属、Pseudomonas属、Agrobacteri
um属、Alicyclobacillus属、Anabaena属、Anacystis
属、Arthrobacter属、Azobacter属、Chromatium属、Erw
inia属、Methylobacterium属、Phormidium属、Rhodobac
ter属、Rhodopseudomonas属、Rhodospiri11um属、Scene
desmun属、Streptomyces属、Synnecoccus属、Zymomonas
属等に属する微生物をあげることができ、好ましくは、
Escherichia属、Corynebacterium属、Brevibacterium
属、Bacillus属、Pseudomonas属、Agrobacterium属、Al
icyclobacillus属、Anabaena属、Anacystis属、Arthrob
acter属、Azobacter属、Chromatium属、Erwinia属、Met
hylobacterium属、Phormidium属、Rhodobacter属、Rhod
opseudomonas属、Rhodospirillum属、Scenedesmun属、S
treptomyces属、Synnecoccus属、Zymomonas属に属する
微生物等をあげることができる。As the host cell, Escherichia, Coryn
genus ebacterium, genus Brevibacterium, genus Bacillus, Microb
acterium, Serratia, Pseudomonas, Agrobacteri
um, Alicyclobacillus, Anabaena, Anacystis
Genus, Arthrobacter, Azobacter, Chromatium, Erw
genus inia, Methylobacterium, Phormidium, Rhodobac
Genus ter, Rhodopseudomonas, Rhodospiri11um, Scene
genus desmun, genus Streptomyces, genus Synnecoccus, Zymomonas
Microorganisms belonging to the genus and the like can be given, preferably,
Escherichia, Corynebacterium, Brevibacterium
Genus, Bacillus, Pseudomonas, Agrobacterium, Al
genus icyclobacillus, genus Anabaena, genus Anacystis, Arthrob
acter, Azobacter, Chromatium, Erwinia, Met
hylobacterium, Phormidium, Rhodobacter, Rhod
genus opseudomonas, genus Rhodospirillum, genus Scenedesmun, S
Microorganisms belonging to the genus treptomyces, the genus Synnecoccus, the genus Zymomonas and the like can be mentioned.
【0036】該微生物の具体例として、例えば、Escher
ichia coli XL1-Blue、Escherichiacoli XL2-Blue、Esc
herichia coli DH1、Escherichia coli DH5α、Escheri
chia coli MC1000、Escherichia coli KY3276、Escheri
chia coli W1485、Escherichia coli JM109、Escherich
ia coli HB101、Escherichia coli No49、Escherichia
coli W3110、Escherichia coli NY49、Escherichia col
i MP347、Escherichia coli NM522、Bacillus subtili
s、Bacillus amyloliquefacines、Brevibacterium ammo
niagenes、Brevibacterium immariophilum ATCC14068、
Brevibacteriumsaccharolyticum ATCC14066、Brevibact
erium flavum ATCC14067、Brevibacterium lactofermen
tum ATCC13869、Corynebacterium glutamicum ATCC1303
2、Corynebacterium glutamicum ATCC14297、Corynebac
terium acetoacidophilum ATCC13870、Microbacterium
ammoniaphilum ATCC15354、Serratia ficaria、Serrati
afontico1a、Serratia liquefaciens、Serratia marces
cens、Pseudomonas sp.D-0110、Agrobacterium radioba
cter、Agrobacterium rhizogenes、Agrobacterium rub
i、Anabaena cylindrica、Anabaena do1iolum、Anabaen
a flosaquae、Arthrobacter aurescens、Arthrobacter
citreus、Arthrobacter globformis、Arthrobacter hyd
rocarboglutamicus、Arthrobacter mysorens、Arthroba
cter nicotianae、Arthrobacter paraffineus、Arthrob
acter protophormiae、Arthrobacter roseoparaffinu
s、Arthrobacter sulfureus、Arthrobacter ureafacien
s、Chromatium buderi、Chromatium tepidum、Chromati
um vinosum、Chromatium warmingii、Chromatium fluvi
atile、Erwinia uredovora、Erwinia carotovora、Erwi
nia ananas、Erwnia herbicola、Erwinia punctata、Er
winia terreus、Methylobacterium rhodesianum、Methy
lobacterium extorquens、Phormidium sp. ATCC29409、
Rhodobacter capsulatus、Rhodobacter sphaeroides、R
hodopseudomonasblastica、Rhodopseudomonas marina、
Rhodopseudomonas palustris、Rhodospirillum rubru
m、Rhodospirillum salexigens、Rhodospirillum salin
arum、Streptomyces ambofaciens、Streptomyces aureo
faciens、Streptomyces aureus、Streptomyces fungici
dicus、Streptomyces griseochromogenes、Streptomyce
s griseus、Streptomyces lividans、Streptomyces oli
vogriseus、Streptomyces rameus、Streptomyces tanas
hiensis、Streptomyces vinaceus、Zymomonas mobilis
等をあげることができる。Specific examples of the microorganism include, for example, Escher
ichia coli XL1-Blue, Escherichiacoli XL2-Blue, Esc
herichia coli DH1, Escherichia coli DH5α, Escheri
chia coli MC1000, Escherichia coli KY3276, Escheri
chia coli W1485, Escherichia coli JM109, Escherich
ia coli HB101, Escherichia coli No49, Escherichia
coli W3110, Escherichia coli NY49, Escherichia col
iMP347, Escherichia coli NM522, Bacillus subtili
s, Bacillus amyloliquefacines, Brevibacterium ammo
niagenes, Brevibacterium immariophilum ATCC14068,
Brevibacteriumsaccharolyticum ATCC14066, Brevibact
erium flavum ATCC14067, Brevibacterium lactofermen
tum ATCC13869, Corynebacterium glutamicum ATCC1303
2, Corynebacterium glutamicum ATCC14297, Corynebac
terium acetoacidophilum ATCC13870, Microbacterium
ammoniaphilum ATCC15354, Serratia ficaria, Serrati
afontico1a, Serratia liquefaciens, Serratia marces
cens, Pseudomonas sp.D-0110, Agrobacterium radioba
cter, Agrobacterium rhizogenes, Agrobacterium rub
i, Anabaena cylindrica, Anabaena do1iolum, Anabaen
a flosaquae, Arthrobacter aurescens, Arthrobacter
citreus, Arthrobacter globformis, Arthrobacter hyd
rocarboglutamicus, Arthrobacter mysorens, Arthroba
cter nicotianae, Arthrobacter paraffineus, Arthrob
acter protophormiae, Arthrobacter roseoparaffinu
s, Arthrobacter sulfureus, Arthrobacter ureafacien
s, Chromatium buderi, Chromatium tepidum, Chromati
um vinosum, Chromatium warmingii, Chromatium fluvi
atile, Erwinia uredovora, Erwinia carotovora, Erwi
nia ananas, Erwnia herbicola, Erwinia punctata, Er
winia terreus, Methylobacterium rhodesianum, Methy
lobacterium extorquens, Phormidium sp.ATCC29409,
Rhodobacter capsulatus, Rhodobacter sphaeroides, R
hodopseudomonasblastica, Rhodopseudomonas marina,
Rhodopseudomonas palustris, Rhodospirillum rubru
m, Rhodospirillum salexigens, Rhodospirillum salin
arum, Streptomyces ambofaciens, Streptomyces aureo
faciens, Streptomyces aureus, Streptomyces fungici
dicus, Streptomyces griseochromogenes, Streptomyce
s griseus, Streptomyces lividans, Streptomyces oli
vogriseus, Streptomyces rameus, Streptomyces tanas
hiensis, Streptomyces vinaceus, Zymomonas mobilis
Etc. can be given.
【0037】組換えベクターの導入方法としては、上記
宿主細胞へDNAを導入する方法であればいずれも用いる
ことができ、例えば、カルシウムイオンを用いる方法
〔Proc. Nat1. Acad. SCl. USA, 69, 2110(1972)〕、プ
ロトプラスト法(特開昭63-2483942)、またはGene, 17,
107(1982)やMolecular & General Genetics, 168, 111
(1979)に記載の方法等をあげることができる。As a method for introducing a recombinant vector, any method for introducing DNA into the above host cells can be used. For example, a method using calcium ions [Proc. Nat1. Acad. SCl. USA, 69 , 2110 (1972)), the protoplast method (JP-A-63-2483942), or Gene, 17,
107 (1982) and Molecular & General Genetics, 168, 111
(1979).
【0038】酵母を宿主細胞として用いる場合には、発
現ベクターとして、例えば、YEp13(ATCC37115)、YEp24
(ATCC37051)、Ycp5O(ATCC37419)、pHS19、pHS15等を例
示することができる。プロモーターとしては、酵母中で
発現できるものであればいかなるものでもよく、例え
ば、PHO5プロモーター、PGKプロモーター、GAPプロモー
ター、ADHプロモーター、gal1プロモーター、gal10プロ
モーター、ヒートショックタンパク質プロモーター、MF
α1プロモーター、CUP1プロモーター等のプロモーター
をあげることができる。When yeast is used as a host cell, examples of expression vectors include YEp13 (ATCC37115) and YEp24.
(ATCC37051), Ycp5O (ATCC37419), pHS19, pHS15 and the like. Any promoter can be used as long as it can be expressed in yeast.For example, PHO5 promoter, PGK promoter, GAP promoter, ADH promoter, gal1 promoter, gal10 promoter, heat shock protein promoter, MF
Promoters such as α1 promoter and CUP1 promoter can be mentioned.
【0039】宿主細胞としては、サッカロミセス・セレ
ビシェ(Saccharomyces cerevisae)、シゾサッカロミセ
ス・ボンベ(Schizosaccharomyces pombe)、クリュイベ
ロミセス・ラクチス(Kluyveromyces lactis)、トリコス
ポロン・プルランス(Trichosporon pu11ulans)、シュワ
ニオミセス・アルビウス(Schwanniomyces a11uvius)等
をあげることができる。As the host cells, Saccharomyces cerevisae, Schizosaccharomyces pombe, Kluyveromyces lactis, Trichosporon pullonias saccharomyces. Schwanniomyces a11uvius) and the like.
【0040】組換えベクターの導入方法としては、酵母
にDNAを導入する方法であればいずれも用いることがで
き、例えば、エレクトロポレーション法〔Methods. Enz
ymol, 194, 182(1990)〕、スフェロブラスト法〔Proc.
Natl. Acad. Sci. USA, 75,1929(1978)〕、酢酸リチウ
ム法〔J.Bacteriol., 153, 163(1983)〕、あるいはPro
c. Nat1 . Acad. Sci. USA, 75, 1929(1978)に記載の方
法等をあげることができる。As a method for introducing a recombinant vector, any method for introducing DNA into yeast can be used. For example, electroporation [Methods. Enz.
ymol, 194, 182 (1990)), spheroblast method (Proc.
Natl. Acad. Sci. USA, 75, 1929 (1978)], lithium acetate method [J. Bacteriol., 153, 163 (1983)], or Pro
c. Nat1. Acad. Sci. USA, 75, 1929 (1978).
【0041】動物細胞を宿主細胞として用いる場合に
は、発現ベクターとして、例えば、pcDNAI、pcDM8(フナ
コシ社より市販)、pAGE107〔特開平3-22979; Cytotechn
ology,3, 133,(1990)〕、pAS3-3(特開平2-227075)、pCD
M8〔Nature, 329, 840,(1987)〕、pcDNAI/AmP(Invitrog
en社製)、pREP4(Invitrogen社製)、pAGE103〔J.Bloche
m., 101, 1307(1987)〕、pAGE210等を例示することがで
きる。When an animal cell is used as a host cell, examples of the expression vector include pcDNAI, pcDM8 (commercially available from Funakoshi), pAGE107 [JP-A-3-22979;
ology, 3, 133, (1990)), pAS3-3 (Japanese Patent Application Laid-Open No. 2-227075), pCD
M8 (Nature, 329, 840, (1987)), pcDNAI / AmP (Invitrog
en), pREP4 (Invitrogen), pAGE103 (J. Bloche
m., 101, 1307 (1987)], pAGE210, and the like.
【0042】プロモーターとしては、動物細胞中で発現
できるものであればいずれも用いることができ、例え
ば、サイトメガロウイルス(ヒトCMV)のIE(immediate ea
rly)遺伝子のプロモーター、SV40の初期プロモーター、
レトロウイルスのプロモーター、メタロチオネインプロ
モーター、ヒートショックプロモーター、SRαプロモー
ター等をあげることができる。また、ヒトCMVのIE遺伝
子のエンハンサーをプロモーターと共に用いてもよい。
宿主細胞としては、ナマルバ細胞、HBT5637(特開昭63-2
99)、COS1細胞、COS7細胞、CHO細胞等をあげることがで
きる。Any promoter can be used as long as it can be expressed in animal cells. For example, cytomegalovirus (human CMV) IE (immediate ea) can be used.
rly) gene promoter, SV40 early promoter,
Retrovirus promoters, metallothionein promoters, heat shock promoters, SRα promoters and the like can be mentioned. Further, an enhancer of the IE gene of human CMV may be used together with the promoter.
As the host cell, Namalva cell, HBT5637 (Japanese Patent Laid-Open No. 63-2
99), COS1 cells, COS7 cells, CHO cells and the like.
【0043】動物細胞への組換えベクターの導入法とし
ては、動物細胞にDNAを導入できるいかなる方法も用い
ることができ、例えば、エレクトロポーレーション法
〔Cytotechnology, 3, 133(1990)〕、リン酸カルシウム
法(特開平2-227075)、リポフェクション法〔Proc. Nat
1. Acad. Sci., USA, 84, 7413(1987)〕、virology, 5
2,456(1973)に記載の方法等を用いることができる。形
質転換体の取得および培養は、特開平2-227075号公報あ
るいは特開平2-257891号公報に記載されている方法に準
じて行なうことができる。As a method for introducing a recombinant vector into animal cells, any method capable of introducing DNA into animal cells can be used, for example, an electroporation method [Cytotechnology, 3, 133 (1990)], a calcium phosphate method. (Japanese Patent Application Laid-Open No. 2-227075), lipofection method (Proc. Nat.
1. Acad. Sci., USA, 84, 7413 (1987)], virology, 5
2, 456 (1973). Acquisition and culture of the transformant can be performed according to the method described in JP-A-2-227075 or JP-A-2-25791.
【0044】昆虫細胞を宿主として用いる場合には、例
えばバキュロウイルス・イクスプレッション・ベクター
ズ・ア・ラボラトリー・マニュアル(Baculovirus Expre
ssion Vectors, A Laboratory Manua1)、カレント・プ
ロトコールズ・イン・モレキュラー・バイオロジー、Bi
o/Technology, 6, 47(1988)等に記載された方法によっ
て、タンパク質を発現することができる。即ち、組換え
遺伝子導入ベクターおよびバキュロウイルスを昆虫細胞
に共導入して昆虫細胞培養上清中に組換えウイルスを得
た後、さらに組換えウイルスを昆虫細胞に感染させ、タ
ンパク質を発現させることができる。該方法において用
いられる遺伝子導入ベクターとしては、例えば、pVL139
2、pVL1393、pBlueBacIII(ともにインビトロジェン社
製)等をあげることができる。When insect cells are used as hosts, for example, Baculovirus Expression Vectors A Laboratory Manual (Baculovirus Expre
ssion Vectors, A Laboratory Manua1), Current Protocols in Molecular Biology, Bi
The protein can be expressed by the method described in o / Technology, 6, 47 (1988). That is, after the recombinant gene transfer vector and the baculovirus are co-transfected into insect cells to obtain the recombinant virus in the insect cell culture supernatant, the recombinant virus is further infected into insect cells to express the protein. it can. The gene transfer vector used in the method includes, for example, pVL139
2, pVL1393, pBlueBacIII (both from Invitrogen) and the like.
【0045】バキュロウイルスとしては、例えば、夜盗
蛾科昆虫に感染するウイルスであるアウトグラファ・カ
リフォルニカ・ヌクレアー・ポリヘドロシス・ウイルス
(Autographa californica nuclear polyhedrosis viru
s)等を用いることができる。昆虫細胞としては、Spodop
tera frugiperdaの卵巣細胞であるSf9、Sf21〔バキュロ
ウイルス・エクスプレッション・ベクターズ、ア・ラボ
ラトリー・マニュアル、ダブリュー・エイチ・フリーマ
ン・アンド・カンパニー(W. H. Freeman andCompany)、
ニューヨーク(New York)、(1992)〕、Trichoplusia ni
の卵巣細胞であるHigh5(インビトロジェン社製)等を用
いることができる。Examples of the baculovirus include, for example, Autographa californica nuclea polyhedrosis virus which is a virus that infects night insects.
(Autographa californica nuclear polyhedrosis viru
s) and the like can be used. As insect cells, Spodop
Tera frugiperda ovary cells Sf9, Sf21 (Baculovirus Expression Vectors, A Laboratory Manual, WH Freeman and Company,
New York, (1992)], Trichoplusia ni
Ovarian cell High5 (Invitrogen) or the like can be used.
【0046】組換えウイルスを調製するための、昆虫細
胞への上記組換え遺伝子導入ベクターと上記バキュロウ
イルスの共導入方法としては、例えば、リン酸カルシウ
ム法(特開平2-227075)、リポフェクション法〔Proc. Na
t1. Acad. Sci. USA, 84, 7413(1987)〕等をあげること
ができる。遺伝子の発現方法としては、直接発現以外
に、モレキュラークローニング第2版に記載されている
方法等に準じて、分泌生産、融合タンパク質発現等を行
うことができる。酵母、動物細胞または昆虫細胞により
発現させた場合には、糖あるいは糖鎖が付加されたタン
パク質を得ることができる。As a method for preparing a recombinant virus, the above-described recombinant gene transfer vector and the above baculovirus are co-transfected into insect cells by, for example, a calcium phosphate method (Japanese Patent Laid-Open No. 2-227075), a lipofection method [Proc. Na
t1. Acad. Sci. USA, 84, 7413 (1987)]. As a method for expressing the gene, secretory production, fusion protein expression, and the like can be performed according to the method described in Molecular Cloning, 2nd edition, etc., in addition to direct expression. When expressed by yeast, animal cells, or insect cells, a sugar or sugar chain-added protein can be obtained.
【0047】上記DNAを組み込んだ組換え体DNAを保有す
る形質転換体を培地に培養し、培養物中にメバロン酸経
路に関与する酵素を生成蓄積させ、該培養物より該タン
パク質を採取することにより、メバロン酸経路に関与す
る酵素タンパク質を製造することができる。かくして製
造されるタンパク質(放線菌のメバロン酸経路に関与す
る酵素)も本発明の範囲内である。A transformant having the recombinant DNA into which the above DNA has been incorporated is cultured in a medium, an enzyme involved in the mevalonate pathway is produced and accumulated in the culture, and the protein is collected from the culture. As a result, an enzyme protein involved in the mevalonate pathway can be produced. Proteins thus produced (enzymes involved in the actinomycetes mevalonate pathway) are also within the scope of the present invention.
【0048】本発明のDNAを保持する形質転換体を培
地で培養する方法は、宿主の培養に用いられる通常の方
法に従って行うことができる。本発明の形質転換体が大
腸菌等の原核生物、酵母菌等の真核生物である場合、こ
れら微生物を培養する培地は、該微生物が資化し得る炭
素源、窒素源、無機塩類等を含有し、形質転換体の培養
を効率的に行える培地であれば天然培地、合成培地のい
ずれでもよい。炭素源としては、それぞれの微生物が資
化し得るものであればよく、グルコース、フラクトー
ス、スクロース、これらを含有する糖蜜、デンプンある
いはデンプン加水分解物等の炭水化物、酢酸、プロピオ
ン酸等の有機酸、エタノール、プロパノールなどのアル
コール類が用いられる。The method for culturing the transformant carrying the DNA of the present invention in a medium can be carried out according to a usual method used for culturing a host. When the transformant of the present invention is a prokaryote such as Escherichia coli or a eukaryote such as yeast, the culture medium for culturing these microorganisms contains a carbon source, a nitrogen source, inorganic salts, and the like that can be assimilated by the microorganism. Any of a natural medium and a synthetic medium may be used as long as the medium can efficiently culture the transformant. The carbon source may be any one that can be assimilated by each microorganism, such as glucose, fructose, sucrose, molasses containing these, carbohydrates such as starch or starch hydrolysate, acetic acid, organic acids such as propionic acid, and ethanol. And alcohols such as propanol.
【0049】窒素源としては、アンモニア、塩化アンモ
ニウム、硫酸アンモニウム、酢酸アンモニウム、リン酸
アンモニウム、等の各種無機酸や有機酸のアンモニウム
塩、その他含窒素化合物、並びに、ペプトン、肉エキ
ス、酵母エキス、コーンスチープリカー、カゼイン加水
分解物、大豆粕および大豆粕加水分解物、各種発酵菌体
およびその消化物等が用いられる。無機物としては、リ
ン酸第一カリウム、リン酸第二カリウム、リン酸マグネ
シウム、硫酸マグネシウム、塩化ナトリウム、硫酸第一
鉄、硫酸マンガン、硫酸銅、炭酸カルシウム等が用いら
れる。Examples of the nitrogen source include ammonium salts of various inorganic acids and organic acids such as ammonia, ammonium chloride, ammonium sulfate, ammonium acetate, and ammonium phosphate, and other nitrogen-containing compounds, peptone, meat extract, yeast extract, and corn corn. Chip liquor, casein hydrolyzate, soybean meal and soybean meal hydrolyzate, various fermented cells and digests thereof are used. As the inorganic substance, potassium (I) phosphate, potassium (II) phosphate, magnesium phosphate, magnesium sulfate, sodium chloride, ferrous sulfate, manganese sulfate, copper sulfate, calcium carbonate and the like are used.
【0050】培養は、振盪培養または深部通気撹拌培養
などの好気的条件下で行う。培養温度は15〜40℃がよ
く、培養時間は、通常16時間〜7日間である。培養中pH
は、3.0〜9.0に保持する。pHの調整は、無機あるいは有
機の酸、アルカリ溶液、尿素、炭酸カルシウム、アンモ
ニアなどを用いて行う。また培養中必要に応じて、アン
ピシリンやテトラサイクリン等の抗生物質を培地に添加
してもよい。プロモーターとして誘導性のプロモーター
を用いた発現ベクターで形質転換した微生物を培養する
ときには、必要に応じてインデューサーを培地に添加し
てもよい。例えば、lacプロモーターを用いた発現ベク
ターで形質転換した微生物を培養するときにはイソプロ
ピル−β−D−チオガラクトピラノシド(IPTG)等を、trp
プロモーターを用いた発現ベクターで形質転換した微生
物を培養するときにはインドールアクリル酸(IAA)等を
培地に添加してもよい。The culture is performed under aerobic conditions such as shaking culture or deep aeration stirring culture. The culturing temperature is preferably 15 to 40 ° C, and the culturing time is usually 16 hours to 7 days. PH during culture
Is kept at 3.0 to 9.0. The pH is adjusted using an inorganic or organic acid, an alkaline solution, urea, calcium carbonate, ammonia, or the like. If necessary, an antibiotic such as ampicillin or tetracycline may be added to the medium during the culture. When culturing a microorganism transformed with an expression vector using an inducible promoter as a promoter, an inducer may be added to the medium as necessary. For example, when culturing a microorganism transformed with an expression vector using a lac promoter, isopropyl-β-D-thiogalactopyranoside (IPTG) or the like is trp.
When culturing a microorganism transformed with an expression vector using a promoter, indoleacrylic acid (IAA) or the like may be added to the medium.
【0051】動物細胞を宿主細胞として得られた形質転
換体を培養する培地としては、一般に使用されているRP
M11640培地〔The Journal of the American Medical As
sociation,199,519(1967)〕、EagleのMEM培地〔Scienc
e, 122, 501(1952)〕、DMEM培地〔Virology, 8, 396(19
59)〕、199培地〔Proceeding of the Society for theB
iological Medicine, 73, 1(1950)〕またはこれら培地
に牛胎児血清等を添加した培地等が用いられる。培養
は、通常pH6〜8、30〜40℃、5%C02存在下等の条件下で1
〜7日間行う。また、培養中必要に応じて、カナマイシ
ン、ペニシリン等の抗生物質を培地に添加してもよい。
昆虫細胞を宿主細胞として得られた形質転換体を培養す
る培地としては、一般に使用されているTNM-FH培地〔Ph
armingen社製〕、Sf-900 II SFM培地(ギブコBRL社製)、
ExCell400、ExCell405〔いずれもJRH Biosciences社
製〕、Grace's Insect Medium〔Grace, T.C.C., Natur
e, 195,788(1962)〕等を用いることができる。培養は、
通常pH6〜7、25〜30℃等の条件下で、1〜5日間行う。ま
た、培養中必要に応じて、ゲンタマイシン等の抗生物質
を培地に添加してもよい。As a medium for culturing a transformant obtained by using an animal cell as a host cell, generally used RP
M11640 medium (The Journal of the American Medical As
sociation, 199, 519 (1967)), Eagle's MEM medium (Scienc
e, 122, 501 (1952)), DMEM medium (Virology, 8, 396 (19
59)), 199 medium (Proceeding of the Society for the B
iological Medicine, 73, 1 (1950)], or a medium obtained by adding fetal bovine serum or the like to such a medium. Culture is usually pH6~8,30~40 ℃, 1 under conditions such as 5% C0 2 presence
Do for ~ 7 days. If necessary, antibiotics such as kanamycin and penicillin may be added to the medium during the culture.
As a medium for culturing a transformant obtained using insect cells as host cells, a commonly used TNM-FH medium (Ph
armingen), Sf-900 II SFM medium (Gibco BRL),
ExCell400, ExCell405 (all manufactured by JRH Biosciences), Grace's Insect Medium (Grace, TCC, Natur
e, 195, 788 (1962)]. Culture is
Usually, it is carried out for 1-5 days under conditions of pH 6-7, 25-30 ° C. If necessary, an antibiotic such as gentamicin may be added to the medium during the culture.
【0052】本発明の形質転換体の培養物から、本発明
のタンパク質(メバロン酸経路に関与する酵素)を単離
精製するには、通常の酵素の単離、精製法を用いればよ
い。例えば、本発明のタンパク質が、細胞内に溶解状態
で発現した場合には、培養終了後、細胞を遠心分離によ
り回収し水系緩衝液に懸濁後、超音波破砕機、フレンチ
プレス、マントンガウリンホモゲナイザー、ダイノミル
等により細胞を破砕し、無細胞抽出液を得る。該無細胞
抽出液を遠心分離することにより得られた上清から、通
常の酵素の単離精製法、即ち、溶媒抽出法、硫安等によ
る塩析法、脱塩法、有機溶媒による沈殿法、ジエチルア
ミノエチル(DEAE)セファロース、DIAION HPA-75(三菱化
成社製)等レジンを用いた陰イオン交換クロマトグラフ
ィー法、S-Sepharose FF(ファルマシア社製)等のレジン
を用いた陽イオン交換クロマトグラフィー法、ブチルセ
ファロース、フェニルセファロース等のレジンを用いた
疎水性クロマトグラフィー法、分子篩を用いたゲルろ過
法、アフィニティークロマトグラフィ一法、クロマトフ
ォーカシング法、等電点電気泳動等の電気泳動法等の手
法を単独あるいは組み合わせて用い、精製標品を得るこ
とができる。In order to isolate and purify the protein of the present invention (enzyme involved in the mevalonate pathway) from the culture of the transformant of the present invention, a conventional enzyme isolation and purification method may be used. For example, when the protein of the present invention is expressed in a dissolved state in cells, after completion of the culture, the cells are collected by centrifugation, suspended in an aqueous buffer, and then sonicated with a sonicator, French press, and Mentongaulin homogenizer. The cells are crushed with a Dynomill or the like to obtain a cell-free extract. From the supernatant obtained by centrifuging the cell-free extract, a normal enzyme isolation and purification method, that is, a solvent extraction method, a salting out method using ammonium sulfate, a desalting method, a precipitation method using an organic solvent, Anion exchange chromatography using a resin such as diethylaminoethyl (DEAE) Sepharose, DIAION HPA-75 (manufactured by Mitsubishi Kasei), and cation exchange chromatography using a resin such as S-Sepharose FF (manufactured by Pharmacia) Chromatography using resin such as butyl sepharose, phenyl sepharose, etc., gel filtration using molecular sieve, affinity chromatography, chromatofocusing, electrophoresis such as isoelectric focusing, etc. Alternatively, it can be used in combination to obtain a purified sample.
【0053】また、該タンパク質が細胞内に不溶体を形
成して発現した場合は、同様に細胞を回収後破砕し、遠
心分離を行うことにより得られた沈殿画分より、通常の
方法により該タンパク質を回収後、該タンパク質の不溶
体をタンパク質変性剤で可溶化する。該可溶化液を、タ
ンパク質変性剤を含まないあるいはタンパク質変性剤の
濃度がタンパク質が変性しない程度に希薄な溶液に希
釈、あるいは透析し、該タンパク質を正常な立体構造に
構成させた後、上記と同様の単離精製法により精製標品
を得ることができる。When the protein is expressed in an insoluble form in the cells, the cells are similarly collected, crushed, and centrifuged to obtain a precipitate fraction obtained by a conventional method. After recovering the protein, the insoluble form of the protein is solubilized with a protein denaturant. After diluting the lysate into a solution containing no protein denaturing agent or a solution in which the concentration of the protein denaturing agent is so small that the protein is not denatured, or dialyzed to form the protein into a normal three-dimensional structure, A purified sample can be obtained by the same isolation and purification method.
【0054】本発明のタンパク質あるいはその糖修飾体
等の誘導体が細胞外に分泌された場合には、培養上清に
該タンパク質あるいはその糖鎖付加体等の誘導体を回収
することができる。即ち、該培養物を上記と同様の遠心
分離等の手法により処理することにより可溶性画分を取
得し、該可溶性画分から、上記と同様の単離精製法を用
いることにより、精製標品を得ることができる。このよ
うにして取得されるタンパク質として、例えば、配列番
号8〜13のアミノ酸配列を有するタンパク質を挙げる
ことができる。When the protein of the present invention or its derivative such as a modified sugar is secreted extracellularly, the protein or its derivative such as a sugar chain adduct can be recovered in the culture supernatant. That is, the culture is treated by a method such as centrifugation as described above to obtain a soluble fraction, and a purified sample is obtained from the soluble fraction by using the same isolation and purification method as described above. be able to. Examples of the protein thus obtained include proteins having the amino acid sequences of SEQ ID NOS: 8 to 13.
【0055】また、上記方法により発現させたタンパク
質を、Fmoc法(フルオレニルメチルオキシカルボニル
法)、tBoc法(t−ブチルオキシカルボニル法)等の化学合
成法によっても製造することができる。また、桑和貿易
(米国Advanced Chem Tech社製)、パーキンェルマージャ
バン(米国Perkin−Elmer社製)、ファルマシアバイオテ
ク(スウェーデンPharmacia Biotech社製)、アロカ(米国
Protein Technology Instrument社製)、クラボウ(米国S
ynthecell-Vega社製〉、日本パーセプティブ・リミテッ
ド(米国PerSeptive社製)、島津製作所等のペプチド合成
機を利用し合成することもできる。The protein expressed by the above method can also be produced by a chemical synthesis method such as the Fmoc method (fluorenylmethyloxycarbonyl method) and the tBoc method (t-butyloxycarbonyl method). Also, Kuwawa Trading
(US Advanced Chem Tech), Perkin-Elmer Javan (US Perkin-Elmer), Pharmacia Biotech (Pharmacia Biotech Sweden), Aloka (US
Protein Technology Instrument), Kurabo Industries (US S
ynthecell-Vega>, Nippon Perceptive Limited (US PerSeptive), Shimadzu Corp., etc.
【0056】(C)イソプレノイド化合物の製造 上記(B)で取得された形質転換体を、上記(B)の方
法に準じて培養し、培養物中にイソプレノイド化合物を
生成蓄積させ、該培養物からイソプレノイド化合物を採
取することによりイソプレノイド化合物を製造すること
ができる。該培養により、ユビキノン、ビタミンK2、カ
ロテノイド等のイソプレノイド化合物を製造することが
できる。具体的な例として、例えば、Escherichia属に
属する微生物を形質転換体としたユビキノン−8やメナ
キノン−8の製造、Rhodobacter属に属する微生物を形質
転換体としたユビキノン−10の製造、Arthrobacter属に
属する微生物を形質転換体としたビタミンK2の製造、A
grobacterium属に属する微生物を形質転換体としたアス
タキサンチンの製造、Erwinia属に属する微生物を形質
転換体としたりコペン、β一カロチン、ゼアキサンチン
の製造等をあげることができる。培養終了後、培養液に
適当な溶媒を加えてイソプレノイド化合物を抽出し、遠
心分離などで沈殿物を除去した後、各種クロマトグラフ
ィーを行うことによりイソプレノイド化合物を単離・精
製することができる。(C) Production of isoprenoid compound The transformant obtained in the above (B) is cultured according to the method of the above (B) to produce and accumulate the isoprenoid compound in the culture. The isoprenoid compound can be produced by collecting the isoprenoid compound. By this cultivation, isoprenoid compounds such as ubiquinone, vitamin K 2 and carotenoid can be produced. As specific examples, for example, production of ubiquinone-8 and menaquinone-8 using a microorganism belonging to the genus Escherichia as a transformant, production of ubiquinone-10 using a microorganism belonging to the genus Rhodobacter as a transformant, belonging to the genus Arthrobacter microbial production of vitamin K 2 in which the transformants, a
Production of astaxanthin using a microorganism belonging to the genus grobacterium as a transformant, production of a microorganism belonging to the genus Erwinia as a transformant, production of copen, β-carotene, and zeaxanthin can be exemplified. After completion of the culture, an isoprenoid compound is extracted by adding a suitable solvent to the culture solution, and the precipitate is removed by centrifugation or the like, and then the isoprenoid compound can be isolated and purified by performing various chromatography.
【0057】以下の実施例により本発明をより具体的に
示すが、本発明はこれらの実施例によって何ら限定され
るものではない。実施例で示した遺伝子組換え実験は、
特に言及しない限りモレキュラークローニング第2版に
記載の方法(以下、常法と呼ぶ)を用いて行った。The present invention will be more specifically illustrated by the following examples, but the present invention is not limited to these examples. Genetic recombination experiments shown in the examples,
Unless otherwise specified, the method was performed using the method described in Molecular Cloning, 2nd Edition (hereinafter, referred to as a conventional method).
【0058】[0058]
【実施例】実施例1:放線菌Streptomyces sp. CL190
株のメバロン酸経路に関わる遺伝子の取得 先ず、メバロン酸経路上の一つの反応を触媒する酵素で
ある3−ヒドロキシー3−メチルグルタリル CoA (H
MG−CoA)レダクターゼをコードする遺伝子(hmg
r)遺伝子を含むDNA断片を放線菌Streptomyces sp.
CL190 株から取得し、そのDNA断片の塩基配列を解
析し、メバロン酸経路に関わる遺伝子がhmgr遺伝子
の周辺にクラスターを形成して存在することを明らかに
した。その詳細を以下に示す。EXAMPLES Example 1: Actinomyces Streptomyces sp. CL190
Acquisition of Gene Related to Mevalonate Pathway of Strain First, 3-hydroxy-3-methylglutaryl CoA (H, an enzyme that catalyzes one reaction on the mevalonate pathway)
MG-CoA) gene encoding reductase (hmg
r) The DNA fragment containing the gene was transformed into Streptomyces sp.
The nucleotide sequence of the DNA fragment obtained from the CL190 strain was analyzed, and it was revealed that genes involved in the mevalonate pathway were present in a cluster around the hmgr gene. The details are shown below.
【0059】放線菌Streptomyces sp. CL190 株を15
mlのGPY培地(1%グルコース、0.4%ポリペプ
トン(和光純薬社製)、0.4%イーストエクストラク
ト(Difco社製)、0.5%MgSO4・7H2O、
0.1%K2HPO4)で30℃で2日間培養した。培養
後、得られた培養液より遠心分離により菌体を取得し
た。得られた菌体より、定法(モレキュラークローニン
グ第2版)に従い染色体DNAを単離精製した。得られ
たDNAの1μgを制限酵素SnaBI(宝酒造)で切
断した後,hmgr遺伝子をプローブとして用いたサザ
ンハイブリダイゼーション(モレキュラークローニング
第2版)を行った結果、6.7kbの位置にプローブの
シグナルが検出された。この結果から6.7kbの放線
菌染色体由来のDNA断片中にhmgr遺伝子が含まれ
ていることが判明した。The actinomycete Streptomyces sp.
ml GPY medium (1% glucose, 0.4% polypeptone (Wako Pure Chemical Industries, Ltd.), made 0.4% yeast extract (Difco Co.), 0.5% MgSO 4 · 7H 2 O,
(0.1% K 2 HPO 4 ) at 30 ° C. for 2 days. After the culture, cells were obtained from the resulting culture by centrifugation. Chromosomal DNA was isolated and purified from the obtained cells according to a standard method (molecular cloning, 2nd edition). After 1 μg of the obtained DNA was digested with the restriction enzyme SnaBI (Takara Shuzo), Southern hybridization (molecular cloning, 2nd edition) using the hmgr gene as a probe was performed. As a result, the probe signal was detected at a position of 6.7 kb. was detected. From this result, it was found that the hmgr gene was contained in the 6.7 kb DNA fragment derived from actinomycete chromosome.
【0060】次に、Streptomyces sp. CL190 株の染色
体DNAをSnaBIで再度切断後、アガロースゲル電
流泳動を行い、6.7kb付近の1μgのDNA断片を
アガロースゲルから抽出して回収した。この回収した
6.7kb付近のDNA断片をT4DNAポリメラーゼ
(宝酒造から購入)を用いて平滑末端にし、プラスミド
pUC118(宝酒造から購入)のHincIIサイト
に挿入し、Streptomycessp. CL190 株の染色体DNAラ
イブラリーを作製した。この染色体DNAライブラリー
を用いてE. Coli JM109株(宝酒造から購入)を定法
(モレキュラークローニング第2版)に従って形質転換
した。形質転換体をhmgr遺伝子(J. Bacteriol. 18
1:1256, 1999)をプローブに用いたコロニーハイブリダ
イゼーション法(モレキュラークローニング第2版)に
よりスクリーニングし、hmgr遺伝子を含むプラスミ
ドを持つ大腸菌の形質転換体を単離した。単離した形質
転換体から、プラスミド抽出キット QIAprep Spin Mini
prep Kit(キアゲン社製)を用いてプラスミドを抽出し
た。Next, the chromosomal DNA of Streptomyces sp. Strain CL190 was cut again with SnaBI, followed by agarose gel electrophoresis, and 1 μg of a DNA fragment of about 6.7 kb was extracted from the agarose gel and recovered. The recovered 6.7 kb DNA fragment was blunt-ended using T4 DNA polymerase (purchased from Takara Shuzo), inserted into the HincII site of plasmid pUC118 (purchased from Takara Shuzo), and a chromosomal DNA library of Streptomycessp. CL190 strain was prepared. did. Using this chromosomal DNA library, E. coli JM109 strain (purchased from Takara Shuzo) was transformed according to a conventional method (Molecular Cloning, 2nd edition). The transformant was transformed with the hmgr gene (J. Bacteriol. 18).
1: 1256, 1999) by a colony hybridization method (molecular cloning second edition) using a probe, and a transformant of Escherichia coli having a plasmid containing the hmgr gene was isolated. From the isolated transformant, use the plasmid extraction kit QIAprep Spin Mini
Plasmids were extracted using a prep Kit (Qiagen).
【0061】抽出したプラスミドの塩基配列の決定は定
法(モレキュラークローニング第2版)に従って行っ
た。その際、シーケンス反応用試薬としてThermo Seque
nase cycle sequencingキット(Amersham Pharmacia Bi
otech社製)を、シーケンス解析装置としてDNA sequenc
er model 4000L(Li-cor社製)を用いた。決定した塩基
配列を配列番号1に示す。The nucleotide sequence of the extracted plasmid was determined according to a standard method (Molecular Cloning, 2nd edition). At that time, Thermo Seque
nase cycle sequencing kit (Amersham Pharmacia Bi
otech) as a sequence analyzer
er model 4000L (manufactured by Li-cor) was used. The determined nucleotide sequence is shown in SEQ ID NO: 1.
【0062】実施例2:放線菌Streptomyces sp. CL190
株のメバロン酸経路に関わる遺伝子の構造の解析 配列番号1の塩基配列から、遺伝情報処理ソフトウェア
GENETYX−MAC(ソフトウェア開発社製)を用
いてオープンリーディングフレーム(以後、orfと略
す)を予測した。その結果、hmgr遺伝子以外に、5
つのorfの名前を便宜上、配列番号1の塩基配列番号
の少ない方から、orfA、orfB、orfC、or
fD、orfEと命名した。orfA〜E及びhmgr
遺伝子の配列番号1における位置は以下の通りである。 orfA(配列番号1の塩基番号132−1169) orfB(配列番号1の塩基番号1159−2211) orfC(配列番号1の塩基番号2208−3332) orfD(配列番号1の塩基番号3335−4426) hmgr遺伝子(配列番号1の塩基番号4423−54
84) orfE(配列番号1の塩基番号5488−6657) Example 2: Actinomyces Streptomyces sp. CL190
Analysis of the structure of the gene related to the mevalonate pathway of the strain An open reading frame (hereinafter abbreviated as orf) was predicted from the nucleotide sequence of SEQ ID NO: 1 using the genetic information processing software GENETYX-MAC (manufactured by Software Development Co., Ltd.). As a result, in addition to the hmgr gene, 5
For convenience, the names of two orfs are given in ascending order of the base sequence number of SEQ ID NO: 1, orfA, orfB, orfC, or
fD and orfE. orfA to E and hmgr
The positions of the genes in SEQ ID NO: 1 are as follows. orfA (base numbers 132-1169 of SEQ ID NO: 1) orfB (base numbers 1159-2211 of SEQ ID NO: 1) orfC (base numbers 2208-3332 of SEQ ID NO: 1) orfD (base numbers 3335-4426 of SEQ ID NO: 1) hmgr gene (Base numbers 4423-54 of SEQ ID NO: 1
84) orfE (base numbers 5488-6657 of SEQ ID NO: 1)
【0063】orfA、orfB、orfC、orf
D、hmgr及びorfEの塩基配列を各々配列番号
2、配列番号3、配列番号4、配列番号5、配列番号6
及び配列番号7に示し、アミノ酸配列を配列番号8、配
列番号9、配列番号10、配列番号11、配列番号12
及び配列番号13に示す。これらのorfの相同性検索
を国立遺伝学研究所のFASTAプログラム(Proc. Na
tl. Acad. Sci. USA, 85:2444, 1998)を用いて行っ
た。その結果、orfAはホスホメバロン酸キナーゼ
(accession number P24521)と、orfBはジホスホ
メバロン酸デカルボキシラーゼ(accession number P323
77)と、orfCはメバロン酸キナーゼ(accession num
ber P46086)と、orfEはHMG−CoAシンターゼ
(accession number P54873)と、それぞれ有意な相同
性があることが判明した。さらに、orfDは機能未知
のタンパク(以後、タンパクXと呼ぶ)と相同性が見ら
れた(accession number P46086)。これら5つのor
fをコードする新規遺伝子は、hmgr遺伝子とクラス
ターを形成し同一方向に転写され、その相同検索の結果
からhmgr遺伝子と同様にメバロン酸経路に関与する
遺伝子と推定された。これら遺伝子の位置関係を図1に
示す。OrfA, orfB, orfC, orf
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6
And the amino acid sequence is shown in SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12.
And SEQ ID NO: 13. The homology search of these orfs was performed using the FASTA program (Proc.
Acad. Sci. USA, 85: 2444, 1998). As a result, orfA was phosphomevalonate kinase (accession number P24521) and orfB was diphosphomevalonate decarboxylase (accession number P323).
77) and orfC are mevalonate kinase (accession num
ber P46086) and orfE were found to have significant homology to HMG-CoA synthase (accession number P54873). Furthermore, orfD was found to be homologous to a protein of unknown function (hereinafter referred to as protein X) (accession number P46086). These five or
The novel gene coding for f forms a cluster with the hmgr gene and is transcribed in the same direction. From the result of the homology search, it was presumed to be a gene involved in the mevalonate pathway similarly to the hmgr gene. FIG. 1 shows the positional relationship between these genes.
【0064】実施例3:放線菌Streptomyces sp. CL190
株のメバロン酸経路に関わる遺伝子の機能の解析 (1)orfA、orfB、orfC、orfD及びo
rfEの機能解析 上記6.7kbのDNA断片の両末端をT4DNAポリ
メラーゼ(宝酒造から購入)を用いて平滑末端にし、プ
ラスミドpUC118(宝酒造から購入)のHincl
lサイトに、ホスホメバロン酸(PMVA)キナーゼの
N末端がpUC118のlacプロモーターに近くなる
向きに挿入したプラスミドをpUMV19と命名した。
pUMV19には、ホスホメバロン酸キナーゼ(orf
A)、ジホスホメバロン酸デカルボキシラーゼ(orf
B)、メバロン酸キナーゼ(orfC)、タンパクX
(orfD)、HMG−CoAレダクターゼ(hmg
r)、HMG−CoAシンターゼ(orfE)をコード
する合計6つの遺伝子が含まれている(図1)。次に、
pUMV19をE. coli JM109株に導入し、E. coli JM1
09 (pUMV19)株と命名した。E. coli JM109(pU
MV19)株では、上記6つの遺伝子はIPTGの添加
によってlacプロモーターから転写され発現するよう
に設計されている。 Example 3 Actinomyces Streptomyces sp. CL190
Analysis of the functions of genes related to the mevalonate pathway in strains (1) orfA, orfB, orfC, orfD and o
Functional analysis of rfE Both ends of the 6.7 kb DNA fragment were made blunt using T4 DNA polymerase (purchased from Takara Shuzo), and Hincl of plasmid pUC118 (purchased from Takara Shuzo) was used.
A plasmid in which the N-terminus of phosphomevalonate (PMVA) kinase was inserted in the l-site so that it was close to the lac promoter of pUC118 was named pUMV19.
pUMV19 includes phosphomevalonate kinase (orf
A), diphosphomevalonate decarboxylase (orf)
B), mevalonate kinase (orfC), protein X
(OrfD), HMG-CoA reductase (hmg
r), a total of six genes encoding HMG-CoA synthase (orfE) are included (FIG. 1). next,
pUMV19 was introduced into E. coli JM109 strain, and E. coli JM1 was introduced.
09 (pUMV19) strain. E. coli JM109 (pU
In the MV19) strain, the above six genes are designed to be transcribed and expressed from the lac promoter by the addition of IPTG.
【0065】ホスミドマイシンは非メバロン酸経路を特
異的に阻害することが知られている(Tetrahedron Let
t. 39:7913, 1998.)。従って、非メバロン酸経路しか
持たないE. coli JM109株は、ホスミドマイシンを含む
培地中では生育に必須なIPPを生合成できないため生
育不可能である。そこで、上記で作製したE. coli JM10
9(pUMV19)株が、ホスミドマイシン存在下でもI
PTGを添加すれば生育可能であれば、このE.coli JM1
09(pUMV19)株はメバロン酸経路でIPPを生合成
していることを証明することができる。すなわち、上記
6.7kbのDNA断片にはメバロン酸経路に関わる全
ての遺伝子が含まれていることを証明することができ
る。Fosmidmycin is known to specifically inhibit the non-mevalonate pathway (Tetrahedron Let
t. 39: 7913, 1998.). Therefore, the E. coli JM109 strain having only the non-mevalonate pathway cannot grow in a medium containing fosmidycin because it cannot biosynthesize IPP essential for growth. Therefore, E. coli JM10 prepared above
9 (pUMV19) is not affected by fosmidycin.
If growth is possible by adding PTG, this E. coli JM1
It can be demonstrated that the 09 (pUMV19) strain biosynthesizes IPP through the mevalonate pathway. That is, it can be proved that the 6.7 kb DNA fragment contains all genes related to the mevalonate pathway.
【0066】また、pUMV19の欠失変異体として、
pUMV19△E、pUMV19△S、pUMV19△
Mを構築した。pUMV19△E、pUMV19△S、
pUMV19△Mは、pUMV19をそれぞれ制限酵素
EcoRI、Sse8387IまたはMluIで消化し
てセルフライゲーションを行って作製したプラスミドで
あり、pUMV19△Eではホスホメバロン酸キナーゼ
が、pUMV19△SではHMG−CoAシンターゼ
が、pUMV19△MではタンパクXがそれぞれ欠けて
いる(図1を参照)。これら欠失変異体についてもpU
MV19と同様にE. coli JM109に形質転換し、E. coli
JM109(pUMV19△E)、E. coli JM109(pUMV1
9△S)および E. coli JM109(pUMV19△M)を作
製した。なお、ホスミドマイシンは(Chem. Pharm. Bul
l. 30:111, 1982.)に従って合成した。Further, as a deletion mutant of pUMV19,
pUMV19 △ E, pUMV19 △ S, pUMV19 △
M was constructed. pUMV19 @ E, pUMV19 @ S,
pUMV19ΔM is a plasmid prepared by self-ligation by digesting pUMV19 with restriction enzymes EcoRI, Sse8387I or MluI, respectively. At ΔM, protein X is missing (see FIG. 1). These deletion mutants also have pU
E. coli JM109 was transformed in the same manner as MV19, and E. coli JM109 was transformed.
JM109 (pUMV19 △ E), E. coli JM109 (pUMV1
9 △ S) and E. coli JM109 (pUMV19 △ M). In addition, fosmidomycin (Chem. Pharm. Bul
l. 30: 111, 1982.).
【0067】E. coli JM109(pUMV19)、E. coli J
M109(pUMV19△E)、E. coliJM109(pUMV19
△S)、 E. coli JM109(pUMV19△M)及びE. coli
JM109(pUC118)を、プラスミドを保持させるため
50μg/mlの抗生物質アンピシリン(Sigma社
製)を含むLB培地(トリプトン(Difco社製)1
%;イーストエクストラクト(Difco社製)0.5
%;及びNaCl0.5%)5mlで37℃で1晩振盪
培養した(培養条件は以下の培養条件1〜3を用い
た)。次いでこれらの生育を観察した。E. coli JM109
(pUC118)はコントロール実験のために用いた。E. coli JM109 (pUMV19), E. coli J
M109 (pUMV19 △ E), E. coli JM109 (pUMV19 △ E)
ΔS), E. coli JM109 (pUMV19ΔM) and E. coli
JM109 (pUC118) was added to an LB medium (tryptone (Difco)) 1 containing 50 μg / ml antibiotic ampicillin (Sigma) to retain the plasmid.
%; Yeast extract (manufactured by Difco) 0.5
And 5 ml of NaCl) at 37 ° C. overnight with shaking culture (the following culture conditions 1 to 3 were used as culture conditions). Then their growth was observed. E. coli JM109
(PUC118) was used for control experiments.
【0068】培養条件1:0.1mMのIPTG(和光
純薬)、および50μg/mlのアンピシリンを含むL
B培地5mlに上記5種類の培養液を1/1000量添
加し37℃で12時間培養した。 培養条件2: 0.1mMのIPTG、20μg/ml
のホスミドマイシン、および50μg/mlのLB培地
5mlに上記5種類の培養液を1/1000量添加し3
7℃で12時間培養した。 培養条件3: 0.1mMのIPTG、0.02%のメ
バロン酸(和光純薬)、20μg/mlのホスミドマイ
シン、および50μg/mlのアンピシリンを含むLB
培地5mlに上記5種類の培養液を1/1000量添加
し37℃で12時間培養した。Culture condition 1: L containing 0.1 mM IPTG (Wako Pure Chemical Industries) and 50 μg / ml ampicillin
The above five types of culture solutions were added in an amount of 1/1000 to 5 ml of B medium and cultured at 37 ° C for 12 hours. Culture condition 2: 0.1 mM IPTG, 20 μg / ml
Of fosmidycin and 50 μg / ml of LB medium in an amount of 1/1000,
The cells were cultured at 7 ° C for 12 hours. Culture condition 3: LB containing 0.1 mM IPTG, 0.02% mevalonic acid (Wako Pure Chemical Industries), 20 μg / ml fosmidomycin, and 50 μg / ml ampicillin
The above five kinds of culture solutions were added in 1/1000 amount to 5 ml of the medium, and cultured at 37 ° C. for 12 hours.
【0069】培養後における生育の結果を図1に示す
(+は正常に生育したことを、−は正常には生育しなか
ったことを表す)。培養条件1では、E. coli JM109(p
UMV19)、E. coli JM109(pUMV19△E)、E.
coli JM109(pUMV19△S)、E. coli JM109(p
UMV19△M)、E. coli JM109(pUC118)の
いずれの菌株も生育可能であった。培養条件2では、予
想通り、E. coli JM109(pUMV19)のみが生育可
能であった。つまりホスミドマイシン添加により、非メ
バロン酸経路が遮断されている条件下においては、上記
6.7kbのDNA断片上の6つのすべての遺伝子が組
み込まれているpUMV19を持っている場合に限って
メバロン酸経路が正常に機能しE. coli JM109は生育可
能であった。従って、タンパクXを含む6つの遺伝子が
メバロン酸経路に関与することが明らかとなった。The results of growth after the culture are shown in FIG. 1 (+ indicates normal growth,-indicates non-normal growth). In culture condition 1, E. coli JM109 (p
UMV19), E. coli JM109 (pUMV19 △ E), E. coli
coli JM109 (pUMV19VS), E. coli JM109 (p
UMV19 △ M) and E. coli JM109 (pUC118). Under culture condition 2, only E. coli JM109 (pUMV19) was able to grow as expected. In other words, under the conditions in which the non-mevalonate pathway is blocked by the addition of fosmidomycin, mevalon is limited to those having pUMV19 in which all six genes on the 6.7 kb DNA fragment are incorporated. The acid pathway functioned normally and E. coli JM109 was viable. Therefore, it was revealed that six genes including protein X are involved in the mevalonate pathway.
【0070】培養条件3では、予想通り、E. coli JM10
9(pUMV19)とE. coli JM109(pUMV19△
S)が生育可能であった。E. coli JM109(pUMV1
9△S)はHMG−CoAシンターゼ以外は正常に持っ
ているため、培地に添加したメバロン酸をIPPに変換
することができる。そのために、E. coli JM109(pU
MV19(S)はホスミドマイシン存在下でもメバロン
酸とIPTG添加すれば生育可能であったと考えられ
る。一方、E. coli JM109(pUMV19△E)、E. co
li JM109(pUMV19△M)、E. coli JM109(pU
C118)は、培地に添加したメバロン酸をIPPまで
変換するための遺伝子がそれぞれ一つずつ欠けているた
め、ホスミドマイシン存在下ではメバロン酸とIPTG
添加をしても生育不能であったと考えられる。In culture condition 3, as expected, E. coli JM10
9 (pUMV19) and E. coli JM109 (pUMV19 △)
S) was viable. E. coli JM109 (pUMV1
9 △ S) has normality except for HMG-CoA synthase, so that mevalonic acid added to the medium can be converted to IPP. For this purpose, E. coli JM109 (pU
It is considered that MV19 (S) could grow even in the presence of fosmidomycin if mevalonic acid and IPTG were added. On the other hand, E. coli JM109 (pUMV19 △ E),
li JM109 (pUMV19 △ M), E. coli JM109 (pUMV19 △ M)
C118) lacks one gene each for converting mevalonic acid added to the medium to IPP, so that in the presence of fosmidomycin, mevalonic acid and IPTG
It is considered that growth was impossible even with the addition.
【0071】上記結果より、(1)E. coli JM109にお
いてメバロン酸経路によりIPPを合成させるために
は,上記6つの遺伝子がすべて必要不可欠であり;
(2)HMG−CoAシンターゼを欠いた場合には、メ
バロン酸を添加すればE. coli JM109においてメバロン
酸経路によりIPPを合成させることが可能であること
が示された。From the above results, (1) all of the above six genes are indispensable for synthesizing IPP in E. coli JM109 by the mevalonate pathway;
(2) When HMG-CoA synthase was lacking, it was shown that IPP can be synthesized by the mevalonate pathway in E. coli JM109 by adding mevalonic acid.
【0072】実施例4:E. coli JM109(pUMV1
9)株によるユビキノンの生産 E. coli JM109(pUMV19)株によるユビキノン
(イソプレノイド化合物の1種)の生産量を、メバロン
酸経路の遺伝子を含まないベクターのみを持つE.coli J
M109(pUC118)株と比較した。ユビキノンの定量
はを以下の通り行なった。E. coli JM109(pUMV1
9)株とE. coli JM109(pUC118)株をそれぞ
れ、50μg/mlのアンピシリンを含む50mlのL
B培地中で37℃で14時間培養し、菌体を遠心法によ
り回収した。回収した菌体を凍結乾燥し、乾燥菌体の重
さを計量した後,クロロホルム:メタノール=2:1の
溶液60mlを加え、1時間放置してユビキノンを抽出
した。次いで、エバポレーターでクロロホルム−メタノ
ール溶液を蒸発させ、残った固形物にアセトン1mlを
加えて溶解した。このアセトン溶液中のユビキノンの量
を、センシューパックPEGASIL ODSカラム
(センシュー科学社製)を用いた高速液体クロマトグラ
フィー(日本分光より購入)で、イソプロピルアルコー
ル:メタノール=1:1で溶出して定量した。なお、こ
の条件では、ユビキノンは9.8分に溶出される。ま
た、ユビキノンの量は、上記クロマトグラムにおけるユ
ビキノンに相当するピークの面積の相対値で表した。そ
の結果を以下の表1に示す。 Example 4 E. coli JM109 (pUMV1
9) Production of ubiquinone by strain The production of ubiquinone (one of the isoprenoid compounds) produced by E. coli JM109 (pUMV19) was measured using only E.coli J, which has no mevalonate pathway gene.
This was compared with the M109 (pUC118) strain. Ubiquinone was quantified as follows. E. coli JM109 (pUMV1
9) Each of the strain and E. coli JM109 (pUC118) was mixed with 50 ml of L containing 50 μg / ml of ampicillin.
The cells were cultured in a B medium at 37 ° C. for 14 hours, and the cells were collected by centrifugation. The collected cells were freeze-dried, the weight of the dried cells was measured, 60 ml of a chloroform: methanol = 2: 1 solution was added, and the mixture was allowed to stand for 1 hour to extract ubiquinone. Next, the chloroform-methanol solution was evaporated by an evaporator, and the remaining solid was dissolved by adding 1 ml of acetone. The amount of ubiquinone in the acetone solution was quantified by high-performance liquid chromatography (purchased from JASCO Corporation) using a Senshupak PEGASIL ODS column (manufactured by Senshu Kagaku), eluting with isopropyl alcohol: methanol = 1: 1. did. Under these conditions, ubiquinone is eluted at 9.8 minutes. The amount of ubiquinone was represented by the relative value of the peak area corresponding to ubiquinone in the above chromatogram. The results are shown in Table 1 below.
【0073】[0073]
【表1】 [Table 1]
【0074】E. coli JM109(pUMV19)における
培養量あたり(今回は50ml)のユビキノンの生産量
は、E. coli JM109(pUC118)のそれと比べて、
2.1倍であった(表中において、25684/121
35=2.1)。また、E. coli JM109(pUMV1
9)における菌体量あたりのユビキノンの生産量は、E.
coli JM109(pUC118)のそれと比べて、4.5倍
であった(表中において、450000/100000
=4.5)。これらの結果から、放線菌Streptomyces s
p. CL190株由来のメバロン酸経路の遺伝子はイソプレノ
イドの生産性向上に有効であったことが判明した。The production amount of ubiquinone per culture volume (in this case, 50 ml) in E. coli JM109 (pUMV19) was higher than that of E. coli JM109 (pUC118).
2.1 times (in the table, 25684/121
35 = 2.1). In addition, E. coli JM109 (pUMV1
The production amount of ubiquinone per cell mass in 9) is E. coli.
It was 4.5 times that of E. coli JM109 (pUC118) (450,000 / 100000 in the table).
= 4.5). From these results, Streptomyces s
It was found that the mevalonate pathway gene derived from p.CL190 strain was effective in improving isoprenoid productivity.
【0075】[0075]
【発明の効果】本発明の遺伝子は新規遺伝子である。本
発明の遺伝子を大腸菌などの宿主に導入した形質転換体
を培養することにより、イソプレノイド化合物を生産す
ることができる。The gene of the present invention is a novel gene. By culturing a transformant in which the gene of the present invention has been introduced into a host such as Escherichia coli, an isoprenoid compound can be produced.
【0076】メバロン酸経路に関わる遺伝子の全長が全
て明らかにされている生物としては、本発明で明らかに
された放線菌Streptomyces sp. CL190株以外としては、
酵母Saccharomyces cerevisiaeが挙げられる(Natu
re 387:5,1997)。しかしながら、酵母の
メバロン酸経路に関わる遺伝子はクラスターを作ってい
ない。そのため、大腸菌内でこの酵母のメバロン酸経路
の遺伝子を発現させて、その大腸菌がメバロン酸経路を
利用することが出来るようにするためには、メバロン酸
経路の遺伝子を一つ一つ酵母より取得し、大腸菌内での
発現に適したベクターに一つ一つ組み込んでいくという
煩雑な操作が必要である。また、酵母のような真核生物
のゲノムはイントロンと呼ばれるタンパクをコードして
いない領域が多く含まれることから、メバロン酸経路の
遺伝子を取得するためには、酵母のメッセンジャーRN
Aを取り出し、それを鋳型に逆転写酵素でDNAに変換
した後、メバロン酸経路の酵素をコードしている領域の
みを取得するという煩雑な方法を採用せざるを得ない。
一方、原核生物である放線菌Streptomyces sp. CL190株
のメバロン酸経路の遺伝子は、上記したようにゲノム上
でクラスターを作っており、なおかつ同一方向に転写さ
れるように並んでいるため、煩雑な操作を必要とせず、
大腸菌内で発現させることができる。Organisms in which the full length of the genes involved in the mevalonate pathway have been completely clarified include Streptomyces sp.
Yeast Saccharomyces cerevisiae (Natu
re 387: 5, 1997). However, genes involved in the mevalonate pathway of yeast have not formed a cluster. Therefore, in order to express the mevalonate pathway gene of Escherichia coli in Escherichia coli so that the Escherichia coli can use the mevalonate pathway, obtain the mevalonate pathway genes one by one from the yeast. However, it is necessary to perform a complicated operation of incorporating each of them into a vector suitable for expression in E. coli. In addition, since the genome of a eukaryote such as yeast contains many regions that do not encode a protein called an intron, the yeast messenger RN is required to obtain a mevalonate pathway gene.
After extracting A and converting it into DNA using reverse transcriptase as a template, a complicated method of obtaining only the region encoding the enzyme of the mevalonate pathway has to be adopted.
On the other hand, the genes of the mevalonate pathway of the actinomycete Streptomyces sp.CL190 strain, which is a prokaryote, are clustered on the genome as described above, and are arranged so that they are transcribed in the same direction, which is complicated. No action required,
It can be expressed in E. coli.
【0077】[0077]
【配列表】 SEQUENCE LISTING <110> SETO Haruo and KUZUYAMA Tomohisa <120> Genes of the enzymes involved in mevalonate pathway which are deri ved from inomycetes <130> 99422A <160> 13[Sequence List] SEQUENCE LISTING <110> SETO Haruo and KUZUYAMA Tomohisa <120> Genes of the enzymes involved in mevalonate pathway which are deri ved from inomycetes <130> 99422A <160> 13
【0078】 <210> 1 <211> 6798 <212> DNA <213> Streptomyces <400> 1 tacgtacttc cctggcgtgt gcagcggttg acgcgccgtg ccctcgctgc gagcggcgcg 60 cacatctgac gtcctgcttt attgctttct cagaactcgg gacgaagcga tcccatgatc 120 acgcgatctc catgcagaaa agacaaaggg agctgagtgc gttgacacta ccgacctcgg 180 ctgagggggt atcagaaagc caccgggccc gctcggtcgg catcggtcgc gcccacgcca 240 aggccatcct gctgggagag catgcggtcg tctacggagc gccggcactc gctctgccga 300 ttccgcagct cacggtcacg gccagcgtcg gctggtcgtc cgaggcctcc gacagtgcgg 360 gtggcctgtc ctacacgatg accggtacgc cgtcgcgggc actggtgacg caggcctccg 420 acggcctgca ccggctcacc gcggaattca tggcgcggat gggcgtgacg aacgcgccgc 480 acctcgacgt gatcctggac ggcgcgatcc cgcacggccg gggtctcggct ccagcgcgg 540 ccggctcacg cgcgatcgcc ttggccctcg ccgacctctt cggccacgaa ctggccgagc 600 acacggcgta cgaactggtg cagacggccg agaacatggc gcacggccgg gccagcggcg 660 tggacgcgat gacggtcggc gcgtcccggc cgctgctgtt ccagcagggc cgcaccgagc 720 gactggccat cggctgcgac agcctgttca tcgtcgccga cagcggcgtc ccgggcagca 780 ccaaggaagc ggtcgagatg ctgcgggagg gattcacccg cagcgccgga acacaggagc 840 ggttcgtcgg ccgggcgacg gaactgaccg aggccgcccg gcaggccctc gccgacggcc 900 ggcccgagga gctgggctcg cagctgacgt actaccacga gctgctccat gaggcccgcc 960 tgagcaccga cggcatcgat gcgctggtcg aggccgcgct gaaggcaggc agcctcggag 1020 ccaagatcac cggcggtggt ctgggcggct gcatgatcgc acaggcccgg cccgaacagg 1080 cccgggaggt cacccggcag ctccacgagg ccggtgccgt acagacctgg gtcgtaccgc 1140 tgaaagggct cgacaaccat gcgcagtgaa cacccgacca cgaccgtgct ccagtcgcgg 1200 gagcagggca gcgcggccgg cgccaccgcg gtcgcgcacc caaacatcgc gctgatcaag 1260 tactggggca agcgcgacga gcggctgatc ctgccctgca ccaccagcct gtcgatgacg 1320 ctggacgtct tccccacgac caccgaggtc cggctcgacc ccgccgccga gcacgacacg 1380 gccgccctca acggcgaggt ggccacgggc gagacgctgc gccgcatcag cgccttcctc 1440 tccctggtgc gggaggtggc gggcagcgac cagcgggccg tggtggacac ccgcaacacc 1500 gtgcccaccg gggcgggcct ggcgtcctcc gccagcgggt tcgccgccct cgccgtcgcg 1560 gccgcggccg cctacgggct cgaactcgac gaccgcgggc tgtcccggct ggcccgacgt 1620 ggatccggct ccgcctcgcg gtcgatcttc ggcggcttcg ccgtctggca cgccggcccc 1680 gacggcacgg ccacggaagc ggacctcggc tcctacgccg agccggtgcc cgcggccgac 1740 ctcgacccgg cgctggtcat cgccgtggtc aacgccggcc ccaagcccgt ctccagccgc 1800 gaggccatgc gccgcaccgt cgacacctcg ccgctgtacc ggccgtgggc cgactccagt 1860 aaggacgacc tggacgagat gcgctcggcg ctgctgcgcg gcgacctcga ggccgtgggc 1920 gagatcgcgg agcgcaacgc gctcggcatg cacgccacca tgctggccgc ccgccccgcg 1980 gtgcggtacc tgtcgccggc cacggtcacc gtgctcgaca gcgtgctcca gctccgcaag 2040 gacggtgtcc tggcctacgc gaccatggac gccggtccca acgtgaaggt gctgtgccgg 2100 cgggcggacg ccgagcgggt ggccgacgtc gtacgcgccg ccgcgtccgg cggtcaggtc 2160 ctcgtcgccg ggccgggaga cggtgcccgc ctgctgagcg agggcgcatg acgacaggtc 2220 agcgcacgat cgtccggcac gcgccgggca agctgttcgt cgcgggcgag tacgcggtcg 2280 tggatccggg caacccggcg atcctggtag cggtcgaccg gcacatcagc gtcaccgtgt 2340 ccgacgccga cgcggacacc ggggccgccg acgtcgtgat ctcctccgac ctcggtccgc 2400 aggcggtcgg ctggcgctgg cacgacggcc ggctcgtcgt ccgcgacccg gacgacgggc 2460 agcaggcgcg cagcgccctg gcccacgtgg tgtcggcgat cgagaccgtg ggccggctgc 2520 tgggcgaacg cggacagaag gtccccgctc tcaccctctc cgtcagcagc cgcctgcacg 2580 aggacggccg gaagttcggc ctgggctcca gcggcgcggt gaccgtggcg accgtagccg 2640 ccgtcgccgc gttctgcgga ctcgaactgt ccaccgacga acggttccgg ctggccatgc 2700 tcgccaccgc ggaactcgac cccaagggct ccggcgggga cctcgccgcc agcacctggg 2760 gcggctggat cgcctaccag gcgcccgacc gggcctttgt gctcgacctg gcccggcgcg 2820 tgggagtcga ccggacactg aaggcgccct ggccggggca ctcggtgcgc cgactgccgg 2880 cgcccaaggg cctcaccctg gaggtcggct ggaccggaga gcccgcctcc accgcgtccc 2940 tggtgtccga tctgcaccgc cgcacctggc ggggcagcgc ctcccaccag aggttcgtcg 3000 agaccacgac cgactgtgtc cgctccgcgg tcaccgccct ggagtccggc gacgacacga 3060 gcctgctgca cgagatccgc cgggcccgcc aggagctggc ccgcctggac gacgaggtcg 3120 gcctcggcat cttcacaccc aagctgacgg cgctgtgcga cgccgccgaa gccgtcggcg 3180 gcgcggccaa gccctccggg gcaggcggcg gcgactgcgg catcgccctg ctggacgccg 3240 aggcgtcgcg ggacatcaca catgtacggc aacggtggga gacagccggg gtgctgcccc 3300 tgcccctgac tcctgccctg gaagggatct aagaatgacc agcgcccaac gcaaggacga 3360 ccacgtacgg ctcgccatcg agcagcacaa cgcccacagc ggacgcaacc agttcgacga 3420 cgtgtcgttc gtccaccacg ccctggccgg catcgaccgg ccggacgtgt ccctggccac 3480 gtccttcgcc gggatctcct ggcaggtgcc gatctacatc aacgcgatga ccggcggcag 3540 cgagaagacc ggcctcatca accgggacct ggccaccgcc gcccgcgaga ccggcgtccc 3600 catcgcgtcc gggtccatga acgcgtacat caaggacccc tcctgcgccg acacgttccg 3660 tgtgctgcgc gacgagaacc ccaacgggtt cgtcatcgcg aacatcaacg ccaccacgac 3720 ggtcgacaac gcgcagcgcg cgatcgacct gatcgaggcg aacgccctgc agatccacat 3780 caacacggcg caggagacgc cgatgccgga gggcgaccgg tcgttcgcgt cctgggtccc 3840 gcagatcgag aagatcgcgg cggccgtcga catccccgtg atcgtcaagg aggtcggcaa 3900 cggcctgagc cggcagacca tcctgctgct cgccgacctc ggcgtgcagg cggcggacgt 3960 cagcggccgc ggcggcacgg acttcgcccg catcgagaac ggccgccggg agctcggcga 4020 ctacgcgttc ctgcacggct gggggcagtc caccgccgcc tgcctgctgg acgcccagga 4080 catctccctg cccgtcctcg cctccggcgg tgtgcgtcac ccgctcgacg tggtccgcgc 4140 cctcgcgctc ggcgcccgcg ccgtcggctc ctccgccggc ttcctgcgca ccctgatgga 4200 cgacggcgtc gacgcgctga tcacgaagct cacgacctgg ctggaccagc tggcggcgct 4260 gcagaccatg ctcggcgcgc gcaccccggc cgacctcacc cgctgcgacg tgctgctcca 4320 cggcgagctg cgtgacttct gcgccgaccg gggcatcgac acgcgccgcc tcgcccagcg 4380 ctccagctcc atcgaggccc tccagacgac gggaagcaca cgatgacgga aacgcacgcc 4440 atagccgggg tcccgatgag gtgggtggga ccccttcgta tttccgggaa cgtcgccgag 4500 accgagaccc aggtcccgct cgccacgtac gagtcgccgc tgtggccgtc ggtgggccgc 4560 ggggcgaagg tctcccggct gacggagaag ggcatcgtcg ccaccctcgt cgacgagcgg 4620 atgacccgct cggtgatcgt cgaggcgacg gacgcgcaga ccgcgtacat ggccgcgcag 4680 accatccacg cccgcatcga cgagctgcgc gaggtggtgc gcggctgcag ccggttcgcc 4740 cagctgatca acatcaagca cgagatcaac gcgaacctgc tgttcatccg gttcgagttc 4800 accaccggtg acgcctccgg ccacaacatg gccacgctcg cctccgatgt gctcctgggg 4860 cacctgctgg agacgatccc tggcatctcc tacggctcga tctccggcaa ctactgcacg 4920 gacaagaagg ccaccgcgat caacggcatc ctcggccgcg gcaagaacgt gatcaccgag 4980 ctgctggtgc cgcgggacgt cgtcgagaac aacctgcaca ccacggctgc caagatcgtc 5040 gagctgaaca tccgcaagaa cctgctcggc accctgctcg ccggcggcat ccgctcggcc 5100 aacgcccact tcgcgaacat gctgctcggc ttctacctgg ccaccggcca ggacgccgcc 5160 aacatcgtcg agggctcgca gggcgtcgtc atggccgagg accgcgacgg cgacctctac 5220 ttcgcctgca ccctgccgaa cctgatcgtc ggcacggtcg gcaacggcaa gggtctcggc 5280 ttcgtggaga cgaacctcgc ccggctcggc tgccgagccg accgcgaacc cggggagaac 5340 gcccgccgcc tcgccgtcat cgcggcagcg accgtgctgt gcggtgaact ctcgctgctc 5400 gcggcacaga cgaacccggg cgaactcatg cgcgcgcacg tccagctgga acgcgacaac 5460 aagaccgcaa aggttggtgc atagggcatg tccatctcca taggcattca cgacctgtcg 5520 ttcgccacaa ccgagttcgt cctgccgcac acggcgctcg ccgagtacaa cggcaccgag 5580 atcggcaagt accacgtcgg catcggccag cagtcgatga gcgtgccggc cgccgacgag 5640 gacatcgtga ccatggccgc gaccgcggcg cggcccatca tcgagcgcaa cggcaagagc 5700 cggatccgca cggtcgtgtt cgccacggag tcgtcgatcg accaggcgaa ggcgggcggc 5760 gtgtacgtgc actccctgct ggggctggag tcggcctgcc gggtcgtcga gctgaagcag 5820 gcctgctacg gggccaccgc cgcccttcag ttcgccatcg gcctggtgcg gcgcgacccc 5880 gcccagcagg tcctggtcat cgccagtgac gtctccaagt acgagctgga cagccccggc 5940 gaggcgaccc agggcgcggc cgcggtggcc atgctggtcg gcgccgaccc ggccctgctg 6000 cgtatcgagg agccgtcggg cctgttcacc gccgacgtca tggacttctg gcggcccaac 6060 tacctcacca ccgctctggt cgacggccag gagtccatca acgcctacct gcaggccgtc 6120 gagggcgcct ggaaggacta cgcggagcag gacggccggt cgctggagga gttcgcggcg 6180 ttcgtctacc accagccgtt cacgaagatg gcctacaagg cgcaccgcca cctgctgaac 6240 ttcaacggct acgacaccga caaggacgcc atcgagggcg ccctcggcca gacgacggcg 6300 tacaacaacg tcatcggcaa cagctacacc gcgtcggtgt acctgggcct ggccgccctg 6360 ctcgaccagg cggacgacct gacgggccgt tccatcggct tcctgagcta cggctcgggc 6420 agcgtcgccg agttcttctc gggcaccgtc gtcgccgggt accgcgagcg tctgcgcacc 6480 gaggcgaacc aggaggcgat cgcccggcgc aagagcgtcg actacgccac ctaccgcgag 6540 ctgcacgagt acacgctccc gtccgacggc ggcgaccacg ccaccccggt gcagaccacc 6600 ggccccttcc ggctggccgg gatcaacgac cacaagcgca tctacgaggc gcgctagcga 6660 cacccctcgg caacggggtg cgccactgtt cggcgcaccc cgtgccgggc tttcgcacag 6720 ctattcacga ccatttgagg ggcgggcagc cgcatgaccg acgtccgatt ccgcattatc 6780 ggtacgggtg cctacgta 6798[0078] <210> 1 <211> 6798 <212> DNA <213> Streptomyces <400> 1 tacgtacttc cctggcgtgt gcagcggttg acgcgccgtg ccctcgctgc gagcggcgcg 60 cacatctgac gtcctgcttt attgctttct cagaactcgg gacgaagcga tcccatgatc 120 acgcgatctc catgcagaaa agacaaaggg agctgagtgc gttgacacta ccgacctcgg 180 ctgagggggt atcagaaagc caccgggccc gctcggtcgg catcggtcgc gcccacgcca 240 aggccatcct gctgggagag catgcggtcg tctacggagc gccggcactc gctctgccga 300 ttccgcagct cacggtcacg gccagcgtcg gctggtcgtc cgaggcctcc gacagtgcgg 360 gtggcctgtc ctacacgatg accggtacgc cgtcgcgggc actggtgacg caggcctccg 420 acggcctgca ccggctcacc gcggaattca tggcgcggat gggcgtgacg aacgcgccgc 480 acctcgacgt gatcctggac ggcgcgatcc cgcacggccg gggtctcggct ccagcgcgg 540 ccggctcacg cgcgatcgcc ttggccctcg ccgacctctt cggccacgaa ctggccgagc 600 acacggcgta cgaactggtg cagacggccg agaacatggc gcacggccgg gccagcggcg 660 tggacgcgat gacggtcggc gcgtcccggc cgctgctgtt ccagcagggc cgcaccgagc 720 gactggccat cggctgcgac agcctgttca tcgtcgccga cagcggcgtc ccgggcagca 780 ccaaggaagc ggtcg agatg ctgcgggagg gattcacccg cagcgccgga acacaggagc 840 ggttcgtcgg ccgggcgacg gaactgaccg aggccgcccg gcaggccctc gccgacggcc 900 ggcccgagga gctgggctcg cagctgacgt actaccacga gctgctccat gaggcccgcc 960 tgagcaccga cggcatcgat gcgctggtcg aggccgcgct gaaggcaggc agcctcggag 1020 ccaagatcac cggcggtggt ctgggcggct gcatgatcgc acaggcccgg cccgaacagg 1080 cccgggaggt cacccggcag ctccacgagg ccggtgccgt acagacctgg gtcgtaccgc 1140 tgaaagggct cgacaaccat gcgcagtgaa cacccgacca cgaccgtgct ccagtcgcgg 1200 gagcagggca gcgcggccgg cgccaccgcg gtcgcgcacc caaacatcgc gctgatcaag 1260 tactggggca agcgcgacga gcggctgatc ctgccctgca ccaccagcct gtcgatgacg 1320 ctggacgtct tccccacgac caccgaggtc cggctcgacc ccgccgccga gcacgacacg 1380 gccgccctca acggcgaggt ggccacgggc gagacgctgc gccgcatcag cgccttcctc 1440 tccctggtgc gggaggtggc gggcagcgac cagcgggccg tggtggacac ccgcaacacc 1500 gtgcccaccg gggcgggcct ggcgtcctcc gccagcgggt tcgccgccct cgccgtcgcg 1560 gccgcggccg cctacgggct cgaactcgac gaccgcgggc tgtcccggct ggcccgacgt 1620 ggatccggct ccgcctcgcg gtc gatcttc ggcggcttcg ccgtctggca cgccggcccc 1680 gacggcacgg ccacggaagc ggacctcggc tcctacgccg agccggtgcc cgcggccgac 1740 ctcgacccgg cgctggtcat cgccgtggtc aacgccggcc ccaagcccgt ctccagccgc 1800 gaggccatgc gccgcaccgt cgacacctcg ccgctgtacc ggccgtgggc cgactccagt 1860 aaggacgacc tggacgagat gcgctcggcg ctgctgcgcg gcgacctcga ggccgtgggc 1920 gagatcgcgg agcgcaacgc gctcggcatg cacgccacca tgctggccgc ccgccccgcg 1980 gtgcggtacc tgtcgccggc cacggtcacc gtgctcgaca gcgtgctcca gctccgcaag 2040 gacggtgtcc tggcctacgc gaccatggac gccggtccca acgtgaaggt gctgtgccgg 2100 cgggcggacg ccgagcgggt ggccgacgtc gtacgcgccg ccgcgtccgg cggtcaggtc 2160 ctcgtcgccg ggccgggaga cggtgcccgc ctgctgagcg agggcgcatg acgacaggtc 2220 agcgcacgat cgtccggcac gcgccgggca agctgttcgt cgcgggcgag tacgcggtcg 2280 tggatccggg caacccggcg atcctggtag cggtcgaccg gcacatcagc gtcaccgtgt 2340 ccgacgccga cgcggacacc ggggccgccg acgtcgtgat ctcctccgac ctcggtccgc 2400 aggcggtcgg ctggcgctgg cacgacggcc ggctcgtcgt ccgcgacccg gacgacgggc 2460 agcaggcgcg cagcgccctg gcccacgtg g tgtcggcgat cgagaccgtg ggccggctgc 2520 tgggcgaacg cggacagaag gtccccgctc tcaccctctc cgtcagcagc cgcctgcacg 2580 aggacggccg gaagttcggc ctgggctcca gcggcgcggt gaccgtggcg accgtagccg 2640 ccgtcgccgc gttctgcgga ctcgaactgt ccaccgacga acggttccgg ctggccatgc 2700 tcgccaccgc ggaactcgac cccaagggct ccggcgggga cctcgccgcc agcacctggg 2760 gcggctggat cgcctaccag gcgcccgacc gggcctttgt gctcgacctg gcccggcgcg 2820 tgggagtcga ccggacactg aaggcgccct ggccggggca ctcggtgcgc cgactgccgg 2880 cgcccaaggg cctcaccctg gaggtcggct ggaccggaga gcccgcctcc accgcgtccc 2940 tggtgtccga tctgcaccgc cgcacctggc ggggcagcgc ctcccaccag aggttcgtcg 3000 agaccacgac cgactgtgtc cgctccgcgg tcaccgccct ggagtccggc gacgacacga 3060 gcctgctgca cgagatccgc cgggcccgcc aggagctggc ccgcctggac gacgaggtcg 3120 gcctcggcat cttcacaccc aagctgacgg cgctgtgcga cgccgccgaa gccgtcggcg 3180 gcgcggccaa gccctccggg gcaggcggcg gcgactgcgg catcgccctg ctggacgccg 3240 aggcgtcgcg ggacatcaca catgtacggc aacggtggga gacagccggg gtgctgcccc 3300 tgcccctgac tcctgccctg gaagggatct aaga atgacc agcgcccaac gcaaggacga 3360 ccacgtacgg ctcgccatcg agcagcacaa cgcccacagc ggacgcaacc agttcgacga 3420 cgtgtcgttc gtccaccacg ccctggccgg catcgaccgg ccggacgtgt ccctggccac 3480 gtccttcgcc gggatctcct ggcaggtgcc gatctacatc aacgcgatga ccggcggcag 3540 cgagaagacc ggcctcatca accgggacct ggccaccgcc gcccgcgaga ccggcgtccc 3600 catcgcgtcc gggtccatga acgcgtacat caaggacccc tcctgcgccg acacgttccg 3660 tgtgctgcgc gacgagaacc ccaacgggtt cgtcatcgcg aacatcaacg ccaccacgac 3720 ggtcgacaac gcgcagcgcg cgatcgacct gatcgaggcg aacgccctgc agatccacat 3780 caacacggcg caggagacgc cgatgccgga gggcgaccgg tcgttcgcgt cctgggtccc 3840 gcagatcgag aagatcgcgg cggccgtcga catccccgtg atcgtcaagg aggtcggcaa 3900 cggcctgagc cggcagacca tcctgctgct cgccgacctc ggcgtgcagg cggcggacgt 3960 cagcggccgc ggcggcacgg acttcgcccg catcgagaac ggccgccggg agctcggcga 4020 ctacgcgttc ctgcacggct gggggcagtc caccgccgcc tgcctgctgg acgcccagga 4080 catctccctg cccgtcctcg cctccggcgg tgtgcgtcac ccgctcgacg tggtccgcgc 4140 cctcgcgctc ggcgcccgcg ccgtcggctc ctccgccggc ttcctgcgca ccctgatgga 4200 cgacggcgtc gacgcgctga tcacgaagct cacgacctgg ctggaccagc tggcggcgct 4260 gcagaccatg ctcggcgcgc gcaccccggc cgacctcacc cgctgcgacg tgctgctcca 4320 cggcgagctg cgtgacttct gcgccgaccg gggcatcgac acgcgccgcc tcgcccagcg 4380 ctccagctcc atcgaggccc tccagacgac gggaagcaca cgatgacgga aacgcacgcc 4440 atagccgggg tcccgatgag gtgggtggga ccccttcgta tttccgggaa cgtcgccgag 4500 accgagaccc aggtcccgct cgccacgtac gagtcgccgc tgtggccgtc ggtgggccgc 4560 ggggcgaagg tctcccggct gacggagaag ggcatcgtcg ccaccctcgt cgacgagcgg 4620 atgacccgct cggtgatcgt cgaggcgacg gacgcgcaga ccgcgtacat ggccgcgcag 4680 accatccacg cccgcatcga cgagctgcgc gaggtggtgc gcggctgcag ccggttcgcc 4740 cagctgatca acatcaagca cgagatcaac gcgaacctgc tgttcatccg gttcgagttc 4800 accaccggtg acgcctccgg ccacaacatg gccacgctcg cctccgatgt gctcctgggg 4860 cacctgctgg agacgatccc tggcatctcc tacggctcga tctccggcaa ctactgcacg 4920 gacaagaagg ccaccgcgat caacggcatc ctcggccgcg gcaagaacgt gatcaccgag 4980 ctgctggtgc cgcgggacgt cgtcgagaac aacctgcaca ccacg gctgc caagatcgtc 5040 gagctgaaca tccgcaagaa cctgctcggc accctgctcg ccggcggcat ccgctcggcc 5100 aacgcccact tcgcgaacat gctgctcggc ttctacctgg ccaccggcca ggacgccgcc 5160 aacatcgtcg agggctcgca gggcgtcgtc atggccgagg accgcgacgg cgacctctac 5220 ttcgcctgca ccctgccgaa cctgatcgtc ggcacggtcg gcaacggcaa gggtctcggc 5280 ttcgtggaga cgaacctcgc ccggctcggc tgccgagccg accgcgaacc cggggagaac 5340 gcccgccgcc tcgccgtcat cgcggcagcg accgtgctgt gcggtgaact ctcgctgctc 5400 gcggcacaga cgaacccggg cgaactcatg cgcgcgcacg tccagctgga acgcgacaac 5460 aagaccgcaa aggttggtgc atagggcatg tccatctcca taggcattca cgacctgtcg 5520 ttcgccacaa ccgagttcgt cctgccgcac acggcgctcg ccgagtacaa cggcaccgag 5580 atcggcaagt accacgtcgg catcggccag cagtcgatga gcgtgccggc cgccgacgag 5640 gacatcgtga ccatggccgc gaccgcggcg cggcccatca tcgagcgcaa cggcaagagc 5700 cggatccgca cggtcgtgtt cgccacggag tcgtcgatcg accaggcgaa ggcgggcggc 5760 gtgtacgtgc actccctgct ggggctggag tcggcctgcc gggtcgtcga gctgaagcag 5820 gcctgctacg gggccaccgc cgcccttcag ttcgccatcg gcctggtgcg gcgcgacccc 5880 gcccagcagg tcctggtcat cgccagtgac gtctccaagt acgagctgga cagccccggc 5940 gaggcgaccc agggcgcggc cgcggtggcc atgctggtcg gcgccgaccc ggccctgctg 6000 cgtatcgagg agccgtcggg cctgttcacc gccgacgtca tggacttctg gcggcccaac 6060 tacctcacca ccgctctggt cgacggccag gagtccatca acgcctacct gcaggccgtc 6120 gagggcgcct ggaaggacta cgcggagcag gacggccggt cgctggagga gttcgcggcg 6180 ttcgtctacc accagccgtt cacgaagatg gcctacaagg cgcaccgcca cctgctgaac 6240 ttcaacggct acgacaccga caaggacgcc atcgagggcg ccctcggcca gacgacggcg 6300 tacaacaacg tcatcggcaa cagctacacc gcgtcggtgt acctgggcct ggccgccctg 6360 ctcgaccagg cggacgacct gacgggccgt tccatcggct tcctgagcta cggctcgggc 6420 agcgtcgccg agttcttctc gggcaccgtc gtcgccgggt accgcgagcg tctgcgcacc 6480 gaggcgaacc aggaggcgat cgcccggcgc aagagcgtcg actacgccac ctaccgcgag 6540 ctgcacgagt acacgctccc gtccgacggc ggcgaccacg ccaccccggt gcagaccacc 6600 ggccccttcc ggctggccgg gatcaacgac cacaagcgca tctacgaggc gcgctagcga 6660 cacccctcgg caacggggtg cgccactgtt cggcgcaccc cgtgccgggc tttcgc acag 6720 ctattcacga ccatttgagg ggcgggcagc cgcatgaccg acgtccgatt ccgcattatc 6780 ggtacgggtg cctacgta 6798
【0079】 <210> 2 <211> 1038 <212> DNA <213> Streptomyces <400> 2 atg cag aaa aga caa agg gag ctg agt gcg ttg aca cta 39 Met Gln Lys Arg Gln Arg Glu Leu Ser Ala Leu Thr Leu 1 5 10 ccg acc tcg gct gag ggg gta tca gaa agc cac cgg gcc cgc tcg gtc 87 Pro Thr Ser Ala Glu Gly Val Ser Glu Ser His Arg Ala Arg Ser Val 15 20 25 ggc atc ggt cgc gcc cac gcc aag gcc atc ctg ctg gga gag cat gcg 135 Gly Ile Gly Arg Ala His Ala Lys Ala Ile Leu Leu Gly Glu His Ala 30 35 40 45 gtc gtc tac gga gcg ccg gca ctc gct ctg ccg att ccg cag ctc acg 183 Val Val Tyr Gly Ala Pro Ala Leu Ala Leu Pro Ile Pro Gln Leu Thr 50 55 60 gtc acg gcc agc gtc ggc tgg tcg tcc gag gcc tcc gac agt gcg ggt 231 Val Thr Ala Ser Val Gly Trp Ser Ser Glu Ala Ser Asp Ser Ala Gly 65 70 75 ggc ctg tcc tac acg atg acc ggt acg ccg tcg cgg gca ctg gtg acg 279 Gly Leu Ser Tyr Thr Met Thr Gly Thr Pro Ser Arg Ala Leu Val Thr 80 85 90 cag gcc tcc gac ggc ctg cac cgg ctc acc gcg gaa ttc atg gcg cgg 327 Gln Ala Ser Asp Gly Leu His Arg Leu Thr Ala Glu Phe Met Ala Arg 95 100 105 atg ggc gtg acg aac gcg ccg cac ctc gac gtg atc ctg gac ggc gcg 375 Met Gly Val Thr Asn Ala Pro His Leu Asp Val Ile Leu Asp Gly Ala 110 115 120 125 atc ccg cac ggc cgg ggt ctc ggc tcc agc gcg gcc ggc tca cgc gcg 423 Ile Pro His Gly Arg Gly Leu Gly Ser Ser Ala Ala Gly Ser Arg Ala 130 135 140 atc gcc ttg gcc ctc gcc gac ctc ttc ggc cac gaa ctg gcc gag cac 471 Ile Ala Leu Ala Leu Ala Asp Leu Phe Gly His Glu Leu Ala Glu His 145 150 155 acg gcg tac gaa ctg gtg cag acg gcc gag aac atg gcg cac ggc cgg 519 Thr Ala Tyr Glu Leu Val Gln Thr Ala Glu Asn Met Ala His Gly Arg 160 165 170 gcc agc ggc gtg gac gcg atg acg gtc ggc gcg tcc cgg ccg ctg ctg 567 Ala Ser Gly Val Asp Ala Met Thr Val Gly Ala Ser Arg Pro Leu Leu 175 180 185 ttc cag cag ggc cgc acc gag cga ctg gcc atc ggc tgc gac agc ctg 615 Phe Gln Gln Gly Arg Thr Glu Arg Leu Ala Ile Gly Cys Asp Ser Leu 190 195 200 205 ttc atc gtc gcc gac agc ggc gtc ccg ggc agc acc aag gaa gcg gtc 663 Phe Ile Val Ala Asp Ser Gly Val Pro Gly Ser Thr Lys Glu Ala Val 210 215 220 gag atg ctg cgg gag gga ttc acc cgc agc gcc gga aca cag gag cgg 711 Glu Met Leu Arg Glu Gly Phe Thr Arg Ser Ala Gly Thr Gln Glu Arg 225 230 235 ttc gtc ggc cgg gcg acg gaa ctg acc gag gcc gcc cgg cag gcc ctc 759 Phe Val Gly Arg Ala Thr Glu Leu Thr Glu Ala Ala Arg Gln Ala Leu 240 245 250 gcc gac ggc cgg ccc gag gag ctg ggc tcg cag ctg acg tac tac cac 807 Ala Asp Gly Arg Pro Glu Glu Leu Gly Ser Gln Leu Thr Tyr Tyr His 255 260 265 gag ctg ctc cat gag gcc cgc ctg agc acc gac ggc atc gat gcg ctg 855 Glu Leu Leu His Glu Ala Arg Leu Ser Thr Asp Gly Ile Asp Ala Leu 270 275 280 285 gtc gag gcc gcg ctg aag gca ggc agc ctc gga gcc aag atc acc ggc 903 Val Glu Ala Ala Leu Lys Ala Gly Ser Leu Gly Ala Lys Ile Thr Gly 290 295 300 ggt ggt ctg ggc ggc tgc atg atc gca cag gcc cgg ccc gaa cag gcc 951 Gly Gly Leu Gly Gly Cys Met Ile Ala Gln Ala Arg Pro Glu Gln Ala 305 310 315 cgg gag gtc acc cgg cag ctc cac gag gcc ggt gcc gta cag acc tgg 999 Arg Glu Val Thr Arg Gln Leu His Glu Ala Gly Ala Val Gln Thr Trp 320 325 330 gtc gta ccg ctg aaa ggg ctc gac aac cat gcg cag tga 1038 Val Val Pro Leu Lys Gly Leu Asp Asn His Ala Gln 335 340<210> 2 <211> 1038 <212> DNA <213> Streptomyces <400> 2 atg cag aaa aga caa agg gag ctg agt gcg ttg aca cta 39 Met Gln Lys Arg Gln Arg Glu Leu Ser Ala Leu Thr Leu 1 5 10 ccg acc tcg gct gag ggg gta tca gaa agc cac cgg gcc cgc tcg gtc 87 Pro Thr Ser Ala Glu Gly Val Ser Glu Ser His Arg Ala Arg Ser Val 15 20 25 ggc atc ggt cgc gcc cac gcc aag gcc atc ctg ctg gga gag cat gcg 135 Gly Ile Gly Arg Ala His Ala Lys Ala Ile Leu Leu Gly Glu His Ala 30 35 40 45 gtc gtc tac gga gcg ccg gca ctc gct ctg ccg att ccg cag ctc acg 183 Val Val Tyr Gly Ala Leu Ala Leu Pro Ile Pro Gln Leu Thr 50 55 60 gtc acg gcc agc gtc ggc tgg tcg tcc gag gcc tcc gac agt gcg ggt 231 Val Thr Ala Ser Val Gly Trp Ser Ser Glu Ala Ser Asp Ser Ala Gly 65 70 75 ggc ctg tcc tac acg atg acc ggt acg ccg tcg cgg gca ctg gtg acg 279 Gly Leu Ser Tyr Thr Met Thr Gly Thr Pro Ser Arg Ala Leu Val Thr 80 85 90 cag gcc tcc gac ggc ctg cac cgg ctc acc gcg gaa ttc atg gcg 327 Gln Ala Ser Asp Gly Leu His Arg Leu Thr Ala Glu Phe Met Ala Arg 95 100 105 atg ggc gtg acg aac gcg ccg cac ctc gac gtg atc ctg gac ggc gcg 375 Met Gly Val Thr Asn Ala Pro His Leu Asp Val Ile Leu Asp Gly Ala 110 115 120 125 atc ccg cac ggc cgg ggt ctc ggc tcc agc gcg gcc ggc tca cgc gcg 423 Ile Pro His Gly Arg Gly Leu Gly Ser Ser Ala Ala Gly Ser Arg Ala 130 135 140 atc gcc ttg gcc ctc gcc gac ctc ttc ggc cac gag cc gac gag cc gac gag Ile Ala Leu Ala Leu Ala Asp Leu Phe Gly His Glu Leu Ala Glu His 145 150 155 acg gcg tac gaa ctg gtg cag acg gcc gag aac atg gcg cac ggc cgg 519 Thr Ala Tyr Glu Leu Val Gln Thr Ala Glu Asn Met Ala Gly Arg 160 165 170 gcc agc ggc gtg gac gcg atg acg gtc ggc gcg tcc cgg ccg ctg ctg 567 Ala Ser Gly Val Asp Ala Met Thr Val Gly Ala Ser Arg Pro Leu Leu 175 180 185 ttc cag cag ggc cgc acc gcc atc ggc tgc gac agc ctg 615 Phe Gln Gln Gly Arg Thr Glu Arg Leu Ala Ile Gly Cys Asp Ser Leu 190 195 200 205 ttc atc gtc gcc gac agc ggc gtc ccg ggc agc acc aag gaa Ascg gtc 66 S er Gly Val Pro Gly Ser Thr Lys Glu Ala Val 210 215 220 gag atg ctg cgg gag gga ttc acc cgc agc gcc gga aca cag gag cgg 711 Glu Met Leu Arg Glu Gly Phe Thr Arg Ser Ala Gly Thr Gln Glu Arg 225 230 235 ttc gtc ggc cgg gcg acg gaa ctg acc gag gcc gcc cgg cag gcc ctc 759 Phe Val Gly Arg Ala Thr Glu Leu Thr Glu Ala Ala Ala Arg Gln Ala Leu 240 245 250 gcc gac ggc cgg ccc gag gag ctg gc tg ca tac cac 807 Ala Asp Gly Arg Pro Glu Glu Leu Gly Ser Gln Leu Thr Tyr Tyr His 255 260 265 gag ctg ctc cat gag gcc cgc ctg agc acc gac ggc atc gat gcg ctg 855 Glu Leu Leu His Glu Ala Arg Leu Ser Thrp Gly Ile Asp Ala Leu 270 275 280 285 gtc gag gcc gcg ctg aag gca ggc agc ctc gga gcc aag atc acc ggc 903 Val Glu Ala Ala Leu Lys Ala Gly Ser Leu Gly Ala Lys Ile Thr Gly 290 295 ggg ggt tgc atg atc gca cag gcc cgg ccc gaa cag gcc 951 Gly Gly Leu Gly Gly Cys Met Ile Ala Gln Ala Arg Pro Glu Gln Ala 305 310 315 cgg gag gtc acc cgg cag ctc cac gag gcc ggt gcc gta cag acc tgg 999 Arg V al Thr Arg Gln Leu His Glu Ala Gly Ala Val Gln Thr Trp 320 325 330 gtc gta ccg ctg aaa ggg ctc gac aac cat gcg cag tga 1038 Val Val Pro Leu Lys Gly Leu Asp Asn His Ala Gln 335 340
【0080】 <210> 3 <211> 1053 <212> DNA <213> Streptomyces <400> 3 atg cgc agt gaa cac ccg acc acg acc gtg ctc 33 Met Arg Ser Glu His Pro Thr Thr Thr Val Leu 1 5 10 cag tcg cgg gag cag ggc agc gcg gcc ggc gcc acc gcg gtc gcg cac 81 Gln Ser Arg Glu Gln Gly Ser Ala Ala Gly Ala Thr Ala Val Ala His 15 20 25 cca aac atc gcg ctg atc aag tac tgg ggc aag cgc gac gag cgg ctg 129 Pro Asn Ile Ala Leu Ile Lys Tyr Trp Gly Lys Arg Asp Glu Arg Leu 30 35 40 atc ctg ccc tgc acc acc agc ctg tcg atg acg ctg gac gtc ttc ccc 177 Ile Leu Pro Cys Thr Thr Ser Leu Ser Met Thr Leu Asp Val Phe Pro 45 50 55 acg acc acc gag gtc cgg ctc gac ccc gcc gcc gag cac gac acg gcc 225 Thr Thr Thr Glu Val Arg Leu Asp Pro Ala Ala Glu His Asp Thr Ala 60 65 70 75 gcc ctc aac ggc gag gtg gcc acg ggc gag acg ctg cgc cgc atc agc 273 Ala Leu Asn Gly Glu Val Ala Thr Gly Glu Thr Leu Arg Arg Ile Ser 80 85 90 gcc ttc ctc tcc ctg gtg cgg gag gtg gcg ggc agc gac cag cgg gcc 321 Ala Phe Leu Ser Leu Val Arg Glu Val Ala Gly Ser Asp Gln Arg Ala 95 100 105 gtg gtg gac acc cgc aac acc gtg ccc acc ggg gcg ggc ctg gcg tcc 369 Val Val Asp Thr Arg Asn Thr Val Pro Thr Gly Ala Gly Leu Ala Ser 110 115 120 tcc gcc agc ggg ttc gcc gcc ctc gcc gtc gcg gcc gcg gcc gcc tac 417 Ser Ala Ser Gly Phe Ala Ala Leu Ala Val Ala Ala Ala Ala Ala Tyr 125 130 135 ggg ctc gaa ctc gac gac cgc ggg ctg tcc cgg ctg gcc cga cgt gga 465 Gly Leu Glu Leu Asp Asp Arg Gly Leu Ser Arg Leu Ala Arg Arg Gly 140 145 150 155 tcc ggc tcc gcc tcg cgg tcg atc ttc ggc ggc ttc gcc gtc tgg cac 513 Ser Gly Ser Ala Ser Arg Ser Ile Phe Gly Gly Phe Ala Val Trp His 160 165 170 gcc ggc ccc gac ggc acg gcc acg gaa gcg gac ctc ggc tcc tac gcc 561 Ala Gly Pro Asp Gly Thr Ala Thr Glu Ala Asp Leu Gly Ser Tyr Ala 175 180 185 gag ccg gtg ccc gcg gcc gac ctc gac ccg gcg ctg gtc atc gcc gtg 609 Glu Pro Val Pro Ala Ala Asp Leu Asp Pro Ala Leu Val Ile Ala Val 190 195 200 gtc aac gcc ggc ccc aag ccc gtc tcc agc cgc gag gcc atg cgc cgc 657 Val Asn Ala Gly Pro Lys Pro Val Ser Ser Arg Glu Ala Met Arg Arg 205 210 215 acc gtc gac acc tcg ccg ctg tac cgg ccg tgg gcc gac tcc agt aag 705 Thr Val Asp Thr Ser Pro Leu Tyr Arg Pro Trp Ala Asp Ser Ser Lys 220 225 230 235 gac gac ctg gac gag atg cgc tcg gcg ctg ctg cgc ggc gac ctc gag 753 Asp Asp Leu Asp Glu Met Arg Ser Ala Leu Leu Arg Gly Asp Leu Glu 240 245 250 gcc gtg ggc gag atc gcg gag cgc aac gcg ctc ggc atg cac gcc acc 801 Ala Val Gly Glu Ile Ala Glu Arg Asn Ala Leu Gly Met His Ala Thr 255 260 265 atg ctg gcc gcc cgc ccc gcg gtg cgg tac ctg tcg ccg gcc acg gtc 849 Met Leu Ala Ala Arg Pro Ala Val Arg Tyr Leu Ser Pro Ala Thr Val 270 275 280 acc gtg ctc gac agc gtg ctc cag ctc cgc aag gac ggt gtc ctg gcc 897 Thr Val Leu Asp Ser Val Leu Gln Leu Arg Lys Asp Gly Val Leu Ala 285 290 295 tac gcg acc atg gac gcc ggt ccc aac gtg aag gtg ctg tgc cgg cgg 945 Tyr Ala Thr Met Asp Ala Gly Pro Asn Val Lys Val Leu Cys Arg Arg 300 305 310 315 gcg gac gcc gag cgg gtg gcc gac gtc gta cgc gcc gcc gcg tcc ggc 993 Ala Asp Ala Glu Arg Val Ala Asp Val Val Arg Ala Ala Ala Ser Gly 320 325 330 ggt cag gtc ctc gtc gcc ggg ccg gga gac ggt gcc cgc ctg ctg agc 1041 Gly Gln Val Leu Val Ala Gly Pro Gly Asp Gly Ala Arg Leu Leu Ser 335 340 345 gag ggc gca tga 1053 Glu Gly Ala 350<210> 3 <211> 1053 <212> DNA <213> Streptomyces <400> 3 atg cgc agt gaa cac ccg acc acg acc gtg ctc 33 Met Arg Ser Glu His Pro Thr Thr Thr Val Leu 15 10 cag tcg cgg gag cag ggc agc gcg gcc ggc gcc acc gcg gtc gcg cac 81 Gln Ser Arg Glu Gln Gly Ser Ala Ala Gly Ala Thr Ala Val Ala His 15 20 25 cca aac atc gcg ctg atc aag tac tgg ggc ag cgg gg ctg 129 Pro Asn Ile Ala Leu Ile Lys Tyr Trp Gly Lys Arg Asp Glu Arg Leu 30 35 40 atc ctg ccc tgc acc acc agc ctg tcg atg acg ctg gac gtc ttc ccc 177 Ile Leu Pro Cys Thr Thr Ser Leu Ser Met Thr Leu Asp Val Phe Pro 45 50 55 acg acc acc gag gtc cgg ctc gac ccc gcc gcc gag cac gac acg gcc 225 Thr Thr Thr Glu Val Arg Leu Asp Pro Ala Ala Glu His Asp Thr Ala 60 65 70 75 gcc ctc aac ggc gag gtg gcc acg ggc gag acg ctg cgc cgc atc agc 273 Ala Leu Asn Gly Glu Val Ala Thr Gly Glu Thr Leu Arg Arg Ile Ser 80 85 90 gcc ttc ctc tcc ctg gtg cgg gag gtg gcg ggc agc Gac cag cgg Pg Ser Leu Val Arg Glu Val Ala Gly Ser Asp Gln Arg Ala 95 100 105 gtg gtg gac acc cgc aac acc gtg ccc acc ggg gcg ggc ctg gcg tcc 369 Val Val Asp Thr Arg Asn Thr Val Pro Thr Gly Ala Gly Leu Ala Ser 110 115 120 tcc gcc agc ggg ttc gcc gcc ctc gcc gtc gcg gcc gcg gcc gcc tac 417 Ser Ala Ser Gly Phe Ala Ala Leu Ala Val Ala Ala Ala Ala Ala Tyr 125 130 135 ggg ctc gaa ctc gac gac cgc ggg ctg tcc cgg ctg gcc cga cgt gga Le Gly Leu Asp Arg Gly Leu Ser Arg Leu Ala Arg Arg Gly 140 145 150 155 tcc ggc tcc gcc tcg cgg tcg atc ttc ggc ggc ttc gcc gtc tgg cac 513 Ser Gly Ser Ala Ser Arg Ser Ile Phe Gly Gly Phe Ala Val Trp His 160 165 170 gcc ggc ccc gac ggc acg gcc acg gaa gcg gac ctc ggc tcc tac gcc 561 Ala Gly Pro Asp Gly Thr Ala Thr Glu Ala Asp Leu Gly Ser Tyr Ala 175 180 185 gag ccg gtg ccc gcg gcc gac ctc gac cc atc gcc gtg 609 Glu Pro Val Pro Ala Ala Asp Leu Asp Pro Ala Leu Val Ile Ala Val 190 195 200 gtc aac gcc ggc ccc aag ccc gtc tcc agc cgc gag gcc atg cgc cgc 657 Val Asn Ala Gly Pro Lys Pro Val Ser Ser A rg Glu Ala Met Arg Arg 205 210 215 acc gtc gac acc tcg ccg ctg tac cgg ccg tgg gcc gac tcc agt aag 705 Thr Val Asp Thr Ser Pro Leu Tyr Arg Pro Trp Ala Asp Ser Ser Lys 220 225 230 235 gac gac ctg gac gag atg cgc tcg gcg ctg ctg cgc ggc gac ctc gag 753 Asp Asp Leu Asp Glu Met Arg Ser Ala Leu Leu Arg Gly Asp Leu Glu 240 245 250 gcc gtg ggc gag atc gcg gag cgc aac gcg cc gac gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gc gcc Val Gly Glu Ile Ala Glu Arg Asn Ala Leu Gly Met His Ala Thr 255 260 265 atg ctg gcc gcc cgc ccc gcg gtg cgg tac ctg tcg ccg gcc acg gtc 849 Met Leu Ala Ala Arg Pro Ala Val Arg Tyr Leu Ser Ala Thr Val 270 275 280 acc gtg ctc gac agc gtg ctc cag ctc cgc aag gac ggt gtc ctg gcc 897 Thr Val Leu Asp Ser Val Leu Gln Leu Arg Lys Asp Gly Val Leu Ala 285 290 295 tac gcg acc atg gac gcc ggt ccc aag gtg ctg tgc cgg cgg 945 Tyr Ala Thr Met Asp Ala Gly Pro Asn Val Lys Val Leu Cys Arg Arg 300 305 310 315 gcg gac gcc gag cgg gtg gcc gac gtc gta cgc gcc gcc gcg tcc ggc 993 Ala Asp Ala Gla A la Asp Val Val Arg Ala Ala Ala Ser Gly 320 325 330 ggt cag gtc ctc gtc gcc ggg ccg gga gac ggt gcc cgc ctg ctg agc 1041 Gly Gln Val Leu Val Ala Gly Pro Gly Asp Gly Ala Arg Leu Leu Ser 335 340 340 g ggc gca tga 1053 Glu Gly Ala 350
【0081】 <210> 4 <211> 1125 <212> DNA <213> Streptomyces <400> 4 atg acg aca 9 Met Thr Thr 1 ggt cag cgc acg atc gtc cgg cac gcg ccg ggc aag ctg ttc gtc gcg 57 Gly Gln Arg Thr Ile Val Arg His Ala Pro Gly Lys Leu Phe Val Ala 5 10 15 ggc gag tac gcg gtc gtg gat ccg ggc aac ccg gcg atc ctg gta gcg 105 Gly Glu Tyr Ala Val Val Asp Pro Gly Asn Pro Ala Ile Leu Val Ala 20 25 30 35 gtc gac cgg cac atc agc gtc acc gtg tcc gac gcc gac gcg gac acc 153 Val Asp Arg His Ile Ser Val Thr Val Ser Asp Ala Asp Ala Asp Thr 40 45 50 ggg gcc gcc gac gtc gtg atc tcc tcc gac ctc ggt ccg cag gcg gtc 201 Gly Ala Ala Asp Val Val Ile Ser Ser Asp Leu Gly Pro Gln Ala Val 55 60 65 ggc tgg cgc tgg cac gac ggc cgg ctc gtc gtc cgc gac ccg gac gac 249 Gly Trp Arg Trp His Asp Gly Arg Leu Val Val Arg Asp Pro Asp Asp 70 75 80 ggg cag cag gcg cgc agc gcc ctg gcc cac gtg gtg tcg gcg atc gag 297 Gly Gln Gln Ala Arg Ser Ala Leu Ala His Val Val Ser Ala Ile Glu 85 90 95 acc gtg ggc cgg ctg ctg ggc gaa cgc gga cag aag gtc ccc gct ctc 345 Thr Val Gly Arg Leu Leu Gly Glu Arg Gly Gln Lys Val Pro Ala Leu 100 105 110 115 acc ctc tcc gtc agc agc cgc ctg cac gag gac ggc cgg aag ttc ggc 393 Thr Leu Ser Val Ser Ser Arg Leu His Glu Asp Gly Arg Lys Phe Gly 120 125 130 ctg ggc tcc agc ggc gcg gtg acc gtg gcg acc gta gcc gcc gtc gcc 441 Leu Gly Ser Ser Gly Ala Val Thr Val Ala Thr Val Ala Ala Val Ala 135 140 145 gcg ttc tgc gga ctc gaa ctg tcc acc gac gaa cgg ttc cgg ctg gcc 489 Ala Phe Cys Gly Leu Glu Leu Ser Thr Asp Glu Arg Phe Arg Leu Ala 150 155 160 atg ctc gcc acc gcg gaa ctc gac ccc aag ggc tcc ggc ggg gac ctc 537 Met Leu Ala Thr Ala Glu Leu Asp Pro Lys Gly Ser Gly Gly Asp Leu 165 170 175 gcc gcc agc acc tgg ggc ggc tgg atc gcc tac cag gcg ccc gac cgg 585 Ala Ala Ser Thr Trp Gly Gly Trp Ile Ala Tyr Gln Ala Pro Asp Arg 180 185 190 195 gcc ttt gtg ctc gac ctg gcc cgg cgc gtg gga gtc gac cgg aca ctg 633 Ala Phe Val Leu Asp Leu Ala Arg Arg Val Gly Val Asp Arg Thr Leu 200 205 210 aag gcg ccc tgg ccg ggg cac tcg gtg cgc cga ctg ccg gcg ccc aag 681 Lys Ala Pro Trp Pro Gly His Ser Val Arg Arg Leu Pro Ala Pro Lys 215 220 225 ggc ctc acc ctg gag gtc ggc tgg acc gga gag ccc gcc tcc acc gcg 729 Gly Leu Thr Leu Glu Val Gly Trp Thr Gly Glu Pro Ala Ser Thr Ala 230 235 240 tcc ctg gtg tcc gat ctg cac cgc cgc acc tgg cgg ggc agc gcc tcc 777 Ser Leu Val Ser Asp Leu His Arg Arg Thr Trp Arg Gly Ser Ala Ser 245 250 255 cac cag agg ttc gtc gag acc acg acc gac tgt gtc cgc tcc gcg gtc 825 His Gln Arg Phe Val Glu Thr Thr Thr Asp Cys Val Arg Ser Ala Val 260 265 270 275 acc gcc ctg gag tcc ggc gac gac acg agc ctg ctg cac gag atc cgc 873 Thr Ala Leu Glu Ser Gly Asp Asp Thr Ser Leu Leu His Glu Ile Arg 280 285 290 cgg gcc cgc cag gag ctg gcc cgc ctg gac gac gag gtc ggc ctc ggc 921 Arg Ala Arg Gln Glu Leu Ala Arg Leu Asp Asp Glu Val Gly Leu Gly 295 300 305 atc ttc aca ccc aag ctg acg gcg ctg tgc gac gcc gcc gaa gcc gtc 969 Ile Phe Thr Pro Lys Leu Thr Ala Leu Cys Asp Ala Ala Glu Ala Val 310 315 320 ggc ggc gcg gcc aag ccc tcc ggg gca ggc ggc ggc gac tgc ggc atc 1017 Gly Gly Ala Ala Lys Pro Ser Gly Ala Gly Gly Gly Asp Cys Gly Ile 325 330 335 gcc ctg ctg gac gcc gag gcg tcg cgg gac atc aca cat gta cgg caa 1065 Ala Leu Leu Asp Ala Glu Ala Ser Arg Asp Ile Thr His Val Arg Gln 340 345 350 355 cgg tgg gag aca gcc ggg gtg ctg ccc ctg ccc ctg act cct gcc ctg 1113 Arg Trp Glu Thr Ala Gly Val Leu Pro Leu Pro Leu Thr Pro Ala Leu 360 365 370 gaa ggg atc taa 1125 Glu Gly Ile<210> 4 <211> 1125 <212> DNA <213> Streptomyces <400> 4 atg acg aca 9 Met Thr Thr 1 ggt cag cgc acg atc gtc cgg cac gcg ccg ggc aag ctg ttc gtc gcg 57 Gly Gln Arg Thr Ile Val Arg His Ala Pro Gly Lys Leu Phe Val Ala 5 10 15 ggc gag tac gcg gtc gtg gat ccg ggc aac ccg gcg atc ctg gta gcg 105 Gly Glu Tyr Ala Val Val Asp Pro Gly Asn Pro Ala Ile Leu Val Ala 20 25 30 35 gtc gac cgg cac atc agc gtc acc gtg tcc gac gcc gac gcg gac acc 153 Val Asp Arg His Ile Ser Val Thr Val Ser Asp Ala Asp Ala Asp Thr 40 45 50 ggg gcc gcc gac gtc gtg atc tcc tcc gac ctc ggt ccg cag gcg gtc 201 Gly Ala Ala Asp Val Val Ile Ser Ser Asp Leu Gly Pro Gln Ala Val 55 60 65 ggc tgg cgc tgg cac gac ggc cgg ctc gtc gtc cgc gac ccg gac gac 249 Gly Trp Arg Trp Hisp Arg Leu Val Val Arg Asp Pro Asp Asp 70 75 80 ggg cag cag gcg cgc agc gcc ctg gcc cac gtg gtg tcg gcg atc gag 297 Gly Gln Gln Ala Arg Ser Ala Leu Ala His Val Val Ser Ala Ile Glu 85 90 95 acc gtg ggc cgg ctg ctg ggc gaa cgc gga cag aag gt c ccc gct ctc 345 Thr Val Gly Arg Leu Leu Gly Glu Arg Gly Gln Lys Val Pro Ala Leu 100 105 110 115 acc ctc tcc gtc agc agc cgc ctg cac gag gac ggc cgg aag ttc ggc 393 Thr Leu Ser Val Ser Ser Arg Leu His Glu Asp Gly Arg Lys Phe Gly 120 125 130 ctg ggc tcc agc ggc gcg gtg acc gtg gcg acc gta gcc gcc gtc gcc 441 Leu Gly Ser Ser Gly Ala Val Thr Val Ala Thr Val Ala Ala Val Ala 135 140 145 gcg ttc tgc gga ctc gaa ctg tcc acc gac gaa cgg ttc cgg ctg gcc 489 Ala Phe Cys Gly Leu Glu Leu Ser Thr Asp Glu Arg Phe Arg Leu Ala 150 155 160 atg ctc gcc acc gcg gaa ctc gac ccc aag ggc gcc ccg Met Leu Ala Thr Ala Glu Leu Asp Pro Lys Gly Ser Gly Gly Asp Leu 165 170 175 gcc gcc agc acc tgg ggc ggc tgg atc gcc tac cag gcg ccc gac cgg 585 Ala Ala Ser Thr Trp Gly Gly Trp Ile Ala Tyr Gln Ala Pro Asp Arg 180 185 190 195 gcc ttt gtg ctc gac ctg gcc cgg cgc gtg gga gtc gac cgg aca ctg 633 Ala Phe Val Leu Asp Leu Ala Arg Arg Val Gly Val Asp Arg Thr Leu 200 205 210 aag gcg ccc tgg tcc ggc gt g cgc cga ctg ccg gcg ccc aag 681 Lys Ala Pro Trp Pro Gly His Ser Val Arg Arg Leu Pro Ala Pro Lys 215 220 225 ggc ctc acc ctg gag gtc ggc tgg acc gga gag ccc gcc tcc acc gcg 729 Gly Leu Thr Leu Glu Val Gly Trp Thr Gly Glu Pro Ala Ser Thr Ala 230 235 240 tcc ctg gtg tcc gat ctg cac cgc cgc acc tgg cgg ggc agc gcc tcc 777 Ser Leu Val Ser Asp Leu His Arg Arg Thr Trp Arg Gly Ser Ala Ser 245 250 255 cac cag agg ttc gtc gag acc acg acc gac tgt gtc cgc tcc gcg gtc 825 His Gln Arg Phe Val Glu Thr Thr Thr Asp Cys Val Arg Ser Ala Val 260 265 270 270 275 acc gcc ctg gag tcc ggc gac gac acg agc ctg ctg cg gag atc cgc 873 Thr Ala Leu Glu Ser Gly Asp Asp Thr Ser Leu Leu His Glu Ile Arg 280 285 290 cgg gcc cgc cag gag ctg gcc cgc ctg gac gac gag gtc ggc ctc ggc 921 Arg Ala Arg Gln Glu Leu Ala Asp Glu Val Gly Leu Gly 295 300 305 atc ttc aca ccc aag ctg acg gcg ctg tgc gac gcc gcc gaa gcc gtc 969 Ile Phe Thr Pro Lys Leu Thr Ala Leu Cys Asp Ala Ala Glu Ala Val 310 315 320 ggc ggcggc cc c tcc ggg gca ggc ggc ggc gac tgc ggc atc 1017 Gly Gly Ala Ala Lys Pro Ser Gly Ala Gly Gly Gly Asp Cys Gly Ile 325 330 335 gcc ctg ctg gac gcc gag gcg tcg cgg gac atc aca cat gta cag caca Leu Asp Ala Glu Ala Ser Arg Asp Ile Thr His Val Arg Gln 340 345 350 350 355 cgg tgg gag aca gcc ggg gtg ctg ccc ctg ccc ctg act cct gcc ctg 1113 Arg Trp Glu Thr Ala Gly Val Leu Pro Leu Pro Leu Thr Pro Ala Leu 360 365 370 gaa ggg atc taa 1125 Glu Gly Ile
【0082】 <210> 5 <211> 1092 <212> DNA <213> Streptomyces <400> 5 atg acc agc gcc caa cgc aag 21 Met Thr Ser Ala Gln Arg Lys 1 5 gac gac cac gta cgg ctc gcc atc gag cag cac aac gcc cac agc gga 69 Asp Asp His Val Arg Leu Ala Ile Glu Gln His Asn Ala His Ser Gly 10 15 20 cgc aac cag ttc gac gac gtg tcg ttc gtc cac cac gcc ctg gcc ggc 117 Arg Asn Gln Phe Asp Asp Val Ser Phe Val His His Ala Leu Ala Gly 25 30 35 atc gac cgg ccg gac gtg tcc ctg gcc acg tcc ttc gcc ggg atc tcc 165 Ile Asp Arg Pro Asp Val Ser Leu Ala Thr Ser Phe Ala Gly Ile Ser 40 45 50 55 tgg cag gtg ccg atc tac atc aac gcg atg acc ggc ggc agc gag aag 213 Trp Gln Val Pro Ile Tyr Ile Asn Ala Met Thr Gly Gly Ser Glu Lys 60 65 70 acc ggc ctc atc aac cgg gac ctg gcc acc gcc gcc cgc gag acc ggc 261 Thr Gly Leu Ile Asn Arg Asp Leu Ala Thr Ala Ala Arg Glu Thr Gly 75 80 85 gtc ccc atc gcg tcc ggg tcc atg aac gcg tac atc aag gac ccc tcc 309 Val Pro Ile Ala Ser Gly Ser Met Asn Ala Tyr Ile Lys Asp Pro Ser 90 95 100 tgc gcc gac acg ttc cgt gtg ctg cgc gac gag aac ccc aac ggg ttc 357 Cys Ala Asp Thr Phe Arg Val Leu Arg Asp Glu Asn Pro Asn Gly Phe 105 110 115 gtc atc gcg aac atc aac gcc acc acg acg gtc gac aac gcg cag cgc 405 Val Ile Ala Asn Ile Asn Ala Thr Thr Thr Val Asp Asn Ala Gln Arg 120 125 130 135 gcg atc gac ctg atc gag gcg aac gcc ctg cag atc cac atc aac acg 453 Ala Ile Asp Leu Ile Glu Ala Asn Ala Leu Gln Ile His Ile Asn Thr 140 145 150 gcg cag gag acg ccg atg ccg gag ggc gac cgg tcg ttc gcg tcc tgg 501 Ala Gln Glu Thr Pro Met Pro Glu Gly Asp Arg Ser Phe Ala Ser Trp 155 160 165 gtc ccg cag atc gag aag atc gcg gcg gcc gtc gac atc ccc gtg atc 549 Val Pro Gln Ile Glu Lys Ile Ala Ala Ala Val Asp Ile Pro Val Ile 170 175 180 gtc aag gag gtc ggc aac ggc ctg agc cgg cag acc atc ctg ctg ctc 597 Val Lys Glu Val Gly Asn Gly Leu Ser Arg Gln Thr Ile Leu Leu Leu 185 190 195 gcc gac ctc ggc gtg cag gcg gcg gac gtc agc ggc cgc ggc ggc acg 645 Ala Asp Leu Gly Val Gln Ala Ala Asp Val Ser Gly Arg Gly Gly Thr 200 205 210 215 gac ttc gcc cgc atc gag aac ggc cgc cgg gag ctc ggc gac tac gcg 693 Asp Phe Ala Arg Ile Glu Asn Gly Arg Arg Glu Leu Gly Asp Tyr Ala 220 225 230 ttc ctg cac ggc tgg ggg cag tcc acc gcc gcc tgc ctg ctg gac gcc 741 Phe Leu His Gly Trp Gly Gln Ser Thr Ala Ala Cys Leu Leu Asp Ala 235 240 245 cag gac atc tcc ctg ccc gtc ctc gcc tcc ggc ggt gtg cgt cac ccg 789 Gln Asp Ile Ser Leu Pro Val Leu Ala Ser Gly Gly Val Arg His Pro 250 255 260 ctc gac gtg gtc cgc gcc ctc gcg ctc ggc gcc cgc gcc gtc ggc tcc 837 Leu Asp Val Val Arg Ala Leu Ala Leu Gly Ala Arg Ala Val Gly Ser 265 270 275 tcc gcc ggc ttc ctg cgc acc ctg atg gac gac ggc gtc gac gcg ctg 885 Ser Ala Gly Phe Leu Arg Thr Leu Met Asp Asp Gly Val Asp Ala Leu 280 285 290 295 atc acg aag ctc acg acc tgg ctg gac cag ctg gcg gcg ctg cag acc 933 Ile Thr Lys Leu Thr Thr Trp Leu Asp Gln Leu Ala Ala Leu Gln Thr 300 305 310 atg ctc ggc gcg cgc acc ccg gcc gac ctc acc cgc tgc gac gtg ctg 981 Met Leu Gly Ala Arg Thr Pro Ala Asp Leu Thr Arg Cys Asp Val Leu 315 320 325 ctc cac ggc gag ctg cgt gac ttc tgc gcc gac cgg ggc atc gac acg 1029 Leu His Gly Glu Leu Arg Asp Phe Cys Ala Asp Arg Gly Ile Asp Thr 330 335 340 cgc cgc ctc gcc cag cgc tcc agc tcc atc gag gcc ctc cag acg acg 1077 Arg Arg Leu Ala Gln Arg Ser Ser Ser Ile Glu Ala Leu Gln Thr Thr 345 350 355 gga agc aca cga tga 1092 Gly Ser Thr Arg 360<210> 5 <211> 1092 <212> DNA <213> Streptomyces <400> 5 atg acc agc gcc caa cgc aag 21 Met Thr Ser Ala Gln Arg Lys 15 gac gac cac gta cgg ctc gcc atc gag cag cac aac gcc cac agc gga 69 Asp Asp His Val Arg Leu Ala Ile Glu Gln His Asn Ala His Ser Gly 10 15 20 cgc aac cag ttc gac gac gtg tcg ttc gtc cac cac gcc ctg gcc ggc 117 Arg Asn Gln Phe Asp Val Ser Phe Val His His Ala Leu Ala Gly 25 30 35 atc gac cgg ccg gac gtg tcc ctg gcc acg tcc ttc gcc ggg atc tcc 165 Ile Asp Arg Pro Asp Val Ser Leu Ala Thr Ser Phe Ala Gly Ile Ser 40 45 50 55 tgg cag gtg ccg atc tac atc aac gcg atg acc ggc ggc agc gag aag 213 Trp Gln Val Pro Ile Tyr Ile Asn Ala Met Thr Gly Gly Ser Glu Lys 60 65 70 acc ggc ctc atc aac cgg gac ctg gcc acc gcc ggc gag accg ggc 261 Thr Gly Leu Ile Asn Arg Asp Leu Ala Thr Ala Ala Arg Glu Thr Gly 75 80 85 gtc ccc atc gcg tcc ggg tcc atg aac gcg tac atc aag gac ccc tcc 309 Val Pro Ile Ala Ser Gly Ser Met Asn Ala Tyr Ile Lys Asp Pro Ser 90 95 100 tgc gcc gac a cg ttc cgt gtg ctg cgc gac gag aac ccc aac ggg ttc 357 Cys Ala Asp Thr Phe Arg Val Leu Arg Asp Glu Asn Pro Asn Gly Phe 105 110 115 gtc atc gcg aac atc aac gcc acc acg acg gtc gac aac gcg ca Val Ile Ala Asn Ile Asn Ala Thr Thr Thr Val Asp Asn Ala Gln Arg 120 125 130 135 gcg atc gac ctg atc gag gcg aac gcc ctg cag atc cac atc aac acg 453 Ala Ile Asp Leu Ile Glu Ala Asn Ala Leu Gln Ile His Ile Asn Thr 140 145 150 gcg cag gag acg ccg atg ccg gag ggc gac cgg tcg ttc gcg tcc tgg 501 Ala Gln Glu Thr Pro Met Pro Glu Gly Asp Arg Ser Phe Ala Ser Trp 155 160 165 gtc ccg cag atc gag aag atg gcg gcc gtc gac atc ccc gtg atc 549 Val Pro Gln Ile Glu Lys Ile Ala Ala Ala Val Asp Ile Pro Val Ile 170 175 180 gtc aag gag gtc ggc aac ggc ctg agc cgg cag acc atc ctg ctg ctc lug Lys Asn Gly Leu Ser Arg Gln Thr Ile Leu Leu Leu 185 190 195 gcc gac ctc ggc gtg cag gcg gcg gac gtc agc ggc cgc ggc ggc acg 645 Ala Asp Leu Gly Val Gln Ala Ala Asp Val Ser Gly Arg Gly Gthl Two 15 gac ttc gcc cgc atc gag aac ggc cgc cgg gag ctc ggc gac tac gcg 693 Asp Phe Ala Arg Ile Glu Asn Gly Arg Arg Glu Leu Gly Asp Tyr Ala 220 225 230 ttc ctg cac ggc tgg ggg gcc accg ctg gac gcc 741 Phe Leu His Gly Trp Gly Gln Ser Thr Ala Ala Cys Leu Leu Asp Ala 235 240 245 cag gac atc tcc ctg ccc gtc ctc gcc tcc ggc ggt gtg cgt cac ccg 789 Gln Asp Ile Ser Leu Pro Val Leu A Gly Gly Val Arg His Pro 250 255 260 ctc gac gtg gtc cgc gcc ctc gcg ctc ggc gcc cgc gcc gtc ggc tcc 837 Leu Asp Val Val Arg Ala Leu Ala Leu Gly Ala Arg Ala Val Gly Ser 265 270 270 275 tcc gcc gg cgc acc ctg atg gac gac ggc gtc gac gcg ctg 885 Ser Ala Gly Phe Leu Arg Thr Leu Met Asp Asp Gly Val Asp Ala Leu 280 285 290 295 atc acg aag ctc acg acc tgg ctg gac cag ctg gcg gcg ctg cag acc 933 Ile Thr Lys Leu Thr Thr Trp Leu Asp Gln Leu Ala Ala Leu Gln Thr 300 305 310 atg ctc ggc gcg cgc acc ccg gcc gac ctc acc cgc tgc gac gtg ctg 981 Met Leu Gly Ala Arg Thr Pro Ala Asp Leu Thr Arg Cys Val L eu 315 320 325 ctc cac ggc gag ctg cgt gac ttc tgc gcc gac cgg ggc atc gac acg 1029 Leu His Gly Glu Leu Arg Asp Phe Cys Ala Asp Arg Gly Ile Asp Thr 330 335 340 cgc cgc ctc gcc cagcgccgc gag gcc ctc cag acg acg 1077 Arg Arg Leu Ala Gln Arg Ser Ser Ser Ile Glu Ala Leu Gln Thr Thr 345 350 355 gga agc aca cga tga 1092 Gly Ser Thr Arg 360
【0083】 <210> 6 <211> 1062 <212> DNA <213> Streptomyces <400> 6 atg acg gaa acg 12 Met Thr Glu Thr 1 cac gcc ata gcc ggg gtc ccg atg agg tgg gtg gga ccc ctt cgt att 60 His Ala Ile Ala Gly Val Pro Met Arg Trp Val Gly Pro Leu Arg Ile 5 10 15 20 tcc ggg aac gtc gcc gag acc gag acc cag gtc ccg ctc gcc acg tac 108 Ser Gly Asn Val Ala Glu Thr Glu Thr Gln Val Pro Leu Ala Thr Tyr 25 30 35 gag tcg ccg ctg tgg ccg tcg gtg ggc cgc ggg gcg aag gtc tcc cgg 156 Glu Ser Pro Leu Trp Pro Ser Val Gly Arg Gly Ala Lys Val Ser Arg 40 45 50 ctg acg gag aag ggc atc gtc gcc acc ctc gtc gac gag cgg atg acc 204 Leu Thr Glu Lys Gly Ile Val Ala Thr Leu Val Asp Glu Arg Met Thr 55 60 65 cgc tcg gtg atc gtc gag gcg acg gac gcg cag acc gcg tac atg gcc 252 Arg Ser Val Ile Val Glu Ala Thr Asp Ala Gln Thr Ala Tyr Met Ala 70 75 80 gcg cag acc atc cac gcc cgc atc gac gag ctg cgc gag gtg gtg cgc 300 Ala Gln Thr Ile His Ala Arg Ile Asp Glu Leu Arg Glu Val Val Arg 85 90 95 100 ggc tgc agc cgg ttc gcc cag ctg atc aac atc aag cac gag atc aac 348 Gly Cys Ser Arg Phe Ala Gln Leu Ile Asn Ile Lys His Glu Ile Asn 105 110 115 gcg aac ctg ctg ttc atc cgg ttc gag ttc acc acc ggt gac gcc tcc 396 Ala Asn Leu Leu Phe Ile Arg Phe Glu Phe Thr Thr Gly Asp Ala Ser 120 125 130 ggc cac aac atg gcc acg ctc gcc tcc gat gtg ctc ctg ggg cac ctg 444 Gly His Asn Met Ala Thr Leu Ala Ser Asp Val Leu Leu Gly His Leu 135 140 145 ctg gag acg atc cct ggc atc tcc tac ggc tcg atc tcc ggc aac tac 492 Leu Glu Thr Ile Pro Gly Ile Ser Tyr Gly Ser Ile Ser Gly Asn Tyr 150 155 160 tgc acg gac aag aag gcc acc gcg atc aac ggc atc ctc ggc cgc ggc 540 Cys Thr Asp Lys Lys Ala Thr Ala Ile Asn Gly Ile Leu Gly Arg Gly 165 170 175 180 aag aac gtg atc acc gag ctg ctg gtg ccg cgg gac gtc gtc gag aac 588 Lys Asn Val Ile Thr Glu Leu Leu Val Pro Arg Asp Val Val Glu Asn 185 190 195 aac ctg cac acc acg gct gcc aag atc gtc gag ctg aac atc cgc aag 636 Asn Leu His Thr Thr Ala Ala Lys Ile Val Glu Leu Asn Ile Arg Lys 200 205 210 aac ctg ctc ggc acc ctg ctc gcc ggc ggc atc cgc tcg gcc aac gcc 684 Asn Leu Leu Gly Thr Leu Leu Ala Gly Gly Ile Arg Ser Ala Asn Ala 215 220 225 cac ttc gcg aac atg ctg ctc ggc ttc tac ctg gcc acc ggc cag gac 732 His Phe Ala Asn Met Leu Leu Gly Phe Tyr Leu Ala Thr Gly Gln Asp 230 235 240 gcc gcc aac atc gtc gag ggc tcg cag ggc gtc gtc atg gcc gag gac 780 Ala Ala Asn Ile Val Glu Gly Ser Gln Gly Val Val Met Ala Glu Asp 245 250 255 260 cgc gac ggc gac ctc tac ttc gcc tgc acc ctg ccg aac ctg atc gtc 828 Arg Asp Gly Asp Leu Tyr Phe Ala Cys Thr Leu Pro Asn Leu Ile Val 265 270 275 ggc acg gtc ggc aac ggc aag ggt ctc ggc ttc gtg gag acg aac ctc 876 Gly Thr Val Gly Asn Gly Lys Gly Leu Gly Phe Val Glu Thr Asn Leu 280 285 290 gcc cgg ctc ggc tgc cga gcc gac cgc gaa ccc ggg gag aac gcc cgc 924 Ala Arg Leu Gly Cys Arg Ala Asp Arg Glu Pro Gly Glu Asn Ala Arg 295 300 305 cgc ctc gcc gtc atc gcg gca gcg acc gtg ctg tgc ggt gaa ctc tcg 972 Arg Leu Ala Val Ile Ala Ala Ala Thr Val Leu Cys Gly Glu Leu Ser 310 315 320 ctg ctc gcg gca cag acg aac ccg ggc gaa ctc atg cgc gcg cac gtc 1020 Leu Leu Ala Ala Gln Thr Asn Pro Gly Glu Leu Met Arg Ala His Val 325 330 335 340 cag ctg gaa cgc gac aac aag acc gca aag gtt ggt gca tag 1062 Gln Leu Glu Arg Asp Asn Lys Thr Ala Lys Val Gly Ala 345 350<210> 6 <211> 1062 <212> DNA <213> Streptomyces <400> 6 atg acg gaa acg 12 Met Thr Glu Thr 1 cac gcc ata gcc ggg gtc ccg atg agg tgg gtg gga ccc ctt cgt att 60 His Ala Ile Ala Gly Val Pro Met Arg Trp Val Gly Pro Leu Arg Ile 5 10 15 20 tcc ggg aac gtc gcc gag acc gag acc cag gtc ccg ctc gcc acg tac 108 Ser Gly Asn Val Ala Glu Thr Glu Thr Gln Val Pro Leu Ala Thr Tyr 25 30 35 gag tcg ccg ctg tgg ccg tcg gtg ggc cgc ggg gcg aag gtc tcc cgg 156 Glu Ser Pro Leu Trp Pro Ser Val Gly Arg Gly Ala Lys Val Ser Arg 40 45 50 ctg acg gag aag ggc atc gc acc ctc gtc gac gag cgg atg acc 204 Leu Thr Glu Lys Gly Ile Val Ala Thr Leu Val Asp Glu Arg Met Thr 55 60 65 cgc tcg gtg atc gtc gag gcg acg gac gcg cag acc gcg tac atg gcc 252 Arg Ser Val Ile Val Glu Ala Thr Asp Ala Gln Thr Ala Tyr Met Ala 70 75 80 gcg cag acc atc cac gcc cgc atc gac gag ctg cgc gag gtg gtg cgc 300 Ala Gln Thr Ile His Ala Arg Ile Asp Glu Leu Arg Glu Val Val Arg 85 90 95 100 ggc tgc agc cgg ttc gcc cag ctg atc a ac atc aag cac gag atc aac 348 Gly Cys Ser Arg Phe Ala Gln Leu Ile Asn Ile Lys His Glu Ile Asn 105 110 115 gcg aac ctg ctg ttc atc cgg ttc gag ttc acc acc ggt gac gcc tcc 396 Ala Asn Leu Leu Phe Arg Phe Glu Phe Thr Thr Gly Asp Ala Ser 120 125 130 ggc cac aac atg gcc acg ctc gcc tcc gat gtg ctc ctg ggg cac ctg 444 Gly His Asn Met Ala Thr Leu Ala Ser Asp Val Leu Leu Gly His Leu 135 140 145 ctg gag acg atc cct ggc atc tcc tac ggc tcg atc tcc ggc aac tac 492 Leu Glu Thr Ile Pro Gly Ile Ser Tyr Gly Ser Ile Ser Gly Asn Tyr 150 155 160 tgc acg gac aag aag gcc acc gcg atc aac ggc atc ccc ggc 540 Cys Thr Asp Lys Lys Ala Thr Ala Ile Asn Gly Ile Leu Gly Arg Gly 165 170 175 180 aag aac gtg atc acc gag ctg ctg gtg ccg cgg gac gtc gtc gag aac 588 Lys Asn Val Ile Thr Glu Leu Leu Val Ar Asp Val Val Glu Asn 185 190 195 aac ctg cac acc acg gct gcc aag atc gtc gag ctg aac atc cgc aag 636 Asn Leu His Thr Thr Ala Ala Lys Ile Val Glu Leu Asn Ile Arg Lys 200 205 210 aac ctg ctc ggc acc ctg c tc gcc ggc ggc atc cgc tcg gcc aac gcc 684 Asn Leu Leu Gly Thr Leu Leu Ala Gly Gly Ile Arg Ser Ala Asn Ala 215 220 225 cac ttc gcg aac atg ctg ctc ggc ttc tac ctg gcc acc ggc ca ggg ca Asn Met Leu Leu Gly Phe Tyr Leu Ala Thr Gly Gln Asp 230 235 240 gcc gcc aac atc gtc gag ggc tcg cag ggc gtc gtc atg gcc gag gac 780 Ala Ala Asn Ile Val Glu Gly Ser Gln Gly Val Val Met Ala Glu Asp 250 255 260 cgc gac ggc gac ctc tac ttc gcc tgc acc ctg ccg aac ctg atc gtc 828 Arg Asp Gly Asp Leu Tyr Phe Ala Cys Thr Leu Pro Asn Leu Ile Val 265 270 275 ggc acg gtc ggc agg gc ggc ag gc cag c gtg gag acg aac ctc 876 Gly Thr Val Gly Asn Gly Lys Gly Leu Gly Phe Val Glu Thr Asn Leu 280 285 290 gcc cgg ctc ggc tgc cga gcc gac cgc gaa ccc ggg gag aac gcc cgc 924 Ala Arg Leu Acy Cyg Arg Glu Pro Gly Glu Asn Ala Arg 295 300 305 cgc ctc gcc gtc atc gcg gca gcg acc gtg ctg tgc ggt gaa ctc tcg 972 Arg Leu Ala Val Ile Ala Ala Ala Thr Val Leu Cys Gly Glu Leu Serg 310 ct 320 ct 320 g ca cag acg aac ccg ggc gaa ctc atg cgc gcg cac gtc 1020 Leu Leu Ala Ala Gln Thr Asn Pro Gly Glu Leu Met Arg Ala His Val 325 330 335 340 cag ctg gaa cgc gac aac aag acc gca aag gtt ggt tag Leu Glu Arg Asp Asn Lys Thr Ala Lys Val Gly Ala 345 350
【0084】 <210> 7 <211> 1170 <212> DNA <213> Streptomyces <400> 7 atg tcc atc tcc ata ggc att cac gac 27 Met Ser Ile Ser Ile Gly Ile His Asp 1 5 ctg tcg ttc gcc aca acc gag ttc gtc ctg ccg cac acg gcg ctc gcc 75 Leu Ser Phe Ala Thr Thr Glu Phe Val Leu Pro His Thr Ala Leu Ala 10 15 20 25 gag tac aac ggc acc gag atc ggc aag tac cac gtc ggc atc ggc cag 123 Glu Tyr Asn Gly Thr Glu Ile Gly Lys Tyr His Val Gly Ile Gly Gln 30 35 40 cag tcg atg agc gtg ccg gcc gcc gac gag gac atc gtg acc atg gcc 171 Gln Ser Met Ser Val Pro Ala Ala Asp Glu Asp Ile Val Thr Met Ala 45 50 55 gcg acc gcg gcg cgg ccc atc atc gag cgc aac ggc aag agc cgg atc 219 Ala Thr Ala Ala Arg Pro Ile Ile Glu Arg Asn Gly Lys Ser Arg Ile 60 65 70 cgc acg gtc gtg ttc gcc acg gag tcg tcg atc gac cag gcg aag gcg 267 Arg Thr Val Val Phe Ala Thr Glu Ser Ser Ile Asp Gln Ala Lys Ala 75 80 85 ggc ggc gtg tac gtg cac tcc ctg ctg ggg ctg gag tcg gcc tgc cgg 315 Gly Gly Val Tyr Val His Ser Leu Leu Gly Leu Glu Ser Ala Cys Arg 90 95 100 105 gtc gtc gag ctg aag cag gcc tgc tac ggg gcc acc gcc gcc ctt cag 363 Val Val Glu Leu Lys Gln Ala Cys Tyr Gly Ala Thr Ala Ala Leu Gln 110 115 120 ttc gcc atc ggc ctg gtg cgg cgc gac ccc gcc cag cag gtc ctg gtc 411 Phe Ala Ile Gly Leu Val Arg Arg Asp Pro Ala Gln Gln Val Leu Val 125 130 135 atc gcc agt gac gtc tcc aag tac gag ctg gac agc ccc ggc gag gcg 459 Ile Ala Ser Asp Val Ser Lys Tyr Glu Leu Asp Ser Pro Gly Glu Ala 140 145 150 acc cag ggc gcg gcc gcg gtg gcc atg ctg gtc ggc gcc gac ccg gcc 507 Thr Gln Gly Ala Ala Ala Val Ala Met Leu Val Gly Ala Asp Pro Ala 155 160 165 ctg ctg cgt atc gag gag ccg tcg ggc ctg ttc acc gcc gac gtc atg 555 Leu Leu Arg Ile Glu Glu Pro Ser Gly Leu Phe Thr Ala Asp Val Met 170 175 180 185 gac ttc tgg cgg ccc aac tac ctc acc acc gct ctg gtc gac ggc cag 603 Asp Phe Trp Arg Pro Asn Tyr Leu Thr Thr Ala Leu Val Asp Gly Gln 190 195 200 gag tcc atc aac gcc tac ctg cag gcc gtc gag ggc gcc tgg aag gac 651 Glu Ser Ile Asn Ala Tyr Leu Gln Ala Val Glu Gly Ala Trp Lys Asp 205 210 215 tac gcg gag cag gac ggc cgg tcg ctg gag gag ttc gcg gcg ttc gtc 699 Tyr Ala Glu Gln Asp Gly Arg Ser Leu Glu Glu Phe Ala Ala Phe Val 220 225 230 tac cac cag ccg ttc acg aag atg gcc tac aag gcg cac cgc cac ctg 747 Tyr His Gln Pro Phe Thr Lys Met Ala Tyr Lys Ala His Arg His Leu 235 240 245 ctg aac ttc aac ggc tac gac acc gac aag gac gcc atc gag ggc gcc 795 Leu Asn Phe Asn Gly Tyr Asp Thr Asp Lys Asp Ala Ile Glu Gly Ala 250 255 260 265 ctc ggc cag acg acg gcg tac aac aac gtc atc ggc aac agc tac acc 843 Leu Gly Gln Thr Thr Ala Tyr Asn Asn Val Ile Gly Asn Ser Tyr Thr 270 275 280 gcg tcg gtg tac ctg ggc ctg gcc gcc ctg ctc gac cag gcg gac gac 891 Ala Ser Val Tyr Leu Gly Leu Ala Ala Leu Leu Asp Gln Ala Asp Asp 285 290 295 ctg acg ggc cgt tcc atc ggc ttc ctg agc tac ggc tcg ggc agc gtc 939 Leu Thr Gly Arg Ser Ile Gly Phe Leu Ser Tyr Gly Ser Gly Ser Val 300 305 310 gcc gag ttc ttc tcg ggc acc gtc gtc gcc ggg tac cgc gag cgt ctg 987 Ala Glu Phe Phe Ser Gly Thr Val Val Ala Gly Tyr Arg Glu Arg Leu 315 320 325 cgc acc gag gcg aac cag gag gcg atc gcc cgg cgc aag agc gtc gac 1035 Arg Thr Glu Ala Asn Gln Glu Ala Ile Ala Arg Arg Lys Ser Val Asp 330 335 340 345 tac gcc acc tac cgc gag ctg cac gag tac acg ctc ccg tcc gac ggc 1083 Tyr Ala Thr Tyr Arg Glu Leu His Glu Tyr Thr Leu Pro Ser Asp Gly 350 355 360 ggc gac cac gcc acc ccg gtg cag acc acc ggc ccc ttc cgg ctg gcc 1131 Gly Asp His Ala Thr Pro Val Gln Thr Thr Gly Pro Phe Arg Leu Ala 365 370 375 ggg atc aac gac cac aag cgc atc tac gag gcg cgc tag 1170 Gly Ile Asn Asp His Lys Arg Ile Tyr Glu Ala Arg 380 385<210> 7 <211> 1170 <212> DNA <213> Streptomyces <400> 7 atg tcc atc tcc ata ggc att cac gac 27 Met Ser Ile Ser Ile Gly Ile His Asp 15 ctg tcg ttc gcc aca acc gag ttc gtc ctg ccg cac acg gcg ctc gcc 75 Leu Ser Phe Ala Thr Thr Glu Phe Val Leu Pro His Thr Ala Leu Ala 10 15 20 25 gag tac aac ggc acc gag atc ggc aag tac cac gtc ggc atc ggc cag 123 Glu Tyr Asn Gly Thr Glu Ile Gly Lys Tyr His Val Gly Ile Gly Gln 30 35 40 cag tcg atg agc gtg ccg gcc gcc gac gag gac atc gtg acc atg gcc 171 Gln Ser Met Ser Val Pro Ala Ala Asp Glu Asp Ile Val Thr Met Ala 45 50 55 gcg acc gcg gcg cgg ccc atc atc gag cgc aac ggc aag agc cgg atc 219 Ala Thr Ala Ala Arg Pro Ile Ile Glu Arg Asn Gly Lys Ser Arg Ile 60 65 70 cgc acg gtc gtg ttc gcc acg gag tc gac cag gcg aag gcg 267 Arg Thr Val Val Phe Ala Thr Glu Ser Ser Ile Asp Gln Ala Lys Ala 75 80 85 ggc ggc gtg tac gtg cac tcc ctg ctg ggg ctg gag tcg gcc tgc cgg 315 Gly Gly Val Tyr Val His Seru Leu Gly Leu Glu Ser Ala Cys Arg 90 95 1 00 105 gtc gtc gag ctg aag cag gcc tgc tac ggg gcc acc gcc gcc ctt cag 363 Val Val Glu Leu Lys Gln Ala Cys Tyr Gly Ala Thr Ala Ala Leu Gln 110 115 120 ttc gcc atc ggc ctg gtg cgg cgc gcc gcc gcc cag gtc ctg gtc 411 Phe Ala Ile Gly Leu Val Arg Arg Asp Pro Ala Gln Gln Val Leu Val 125 130 135 atc gcc agt gac gtc tcc aag tac gag ctg gac agc ccc ggc gag gcg 459 Ile Ala Ser Asp Val Ser Lys Tyr Glu Leu Asp Ser Pro Gly Glu Ala 140 145 150 acc cag ggc gcg gcc gcg gtg gcc atg ctg gtc ggc gcc gac ccg gcc 507 Thr Gln Gly Ala Ala Ala Val Ala Met Leu Val Gly Ala Asp Pro Ala 155 160 165 ctg ctg cg gag gag ccg tcg ggc ctg ttc acc gcc gac gtc atg 555 Leu Leu Arg Ile Glu Glu Pro Ser Gly Leu Phe Thr Ala Asp Val Met 170 175 180 185 gac ttc tgg cgg ccc aac tac ctc acc acc gct ctg gtc gacgg Asp Phe Trp Arg Pro Asn Tyr Leu Thr Thr Ala Leu Val Asp Gly Gln 190 195 200 gag tcc atc aac gcc tac ctg cag gcc gtc gag ggc gcc tgg aag gac 651 Glu Ser Ile Asn Ala Tyr Leu Gln Ala Val Glu Gly Ala Trp L ys Asp 205 210 215 tac gcg gag cag gac ggc cgg tcg ctg gag gag ttc gcg gcg ttc gtc 699 Tyr Ala Glu Gln Asp Gly Arg Ser Leu Glu Glu Phe Ala Ala Phe Val 220 225 230 tac cac cag ccg ttc acg tac aag gcg cac cgc cac ctg 747 Tyr His Gln Pro Phe Thr Lys Met Ala Tyr Lys Ala His Arg His Leu 235 240 245 ctg aac ttc aac ggc tac gac acc gac aag gac gcc atc gag ggc gcc 795 Leu Asn Phe Asn Gyr Asp Thr Asp Lys Asp Ala Ile Glu Gly Ala 250 255 260 265 ctc ggc cag acg acg gcg tac aac aac gtc atc ggc aac agc tac acc 843 Leu Gly Gln Thr Thr Ala Tyr Asn Asn Val Ile Gly Asn Ser Tyr Thr 270 275 280 gcg tcg gtg tac ctg ggc ctg gcc gcc ctg ctc gac cag gcg gac gac 891 Ala Ser Val Tyr Leu Gly Leu Ala Ala Leu Leu Asp Gln Ala Asp Asp 285 290 295 ctg acg ggc cgt tc atc ggcc cc gc cc gc cc gc cc gc cc gg cc gc cc gc cc gg cc gc cc gg cc gc cc gg cc gc cc gc cc gg cc gg cc gc gc agc gtc 939 Leu Thr Gly Arg Ser Ile Gly Phe Leu Ser Tyr Gly Ser Gly Ser Val 300 305 310 gcc gag ttc ttc tcg ggc acc gtc gtc gcc ggg tac cgc gag cgt ctg 987 Ala Glu Phe Phe Ser Gly Thr Val Val Ala Gly T yr Arg Glu Arg Leu 315 320 325 cgc acc gag gcg aac cag gag gcg atc gcc cgg cgc aag agc gtc gac 1035 Arg Thr Glu Ala Asn Gln Glu Ala Ile Ala Arg Arg Lys Ser Val Asp 330 335 340 345 345 tacgcc acc tac gag ctg cac gag tac acg ctc ccg tcc gac ggc 1083 Tyr Ala Thr Tyr Arg Glu Leu His Glu Tyr Thr Leu Pro Ser Asp Gly 350 355 360 ggc gac cac gcc acc ccg gtg cag acc acc ggc ccc ttc cgg ctg gcc 1131 G His Ala Thr Pro Val Gln Thr Thr Gly Pro Phe Arg Leu Ala 365 370 375 ggg atc aac gac cac aag cgc atc tac gag gcg cgc tag 1170 Gly Ile Asn Asp His Lys Arg Ile Tyr Glu Ala Arg 380 385
【0085】 <210> 8 <211> 345 <212> PRT <213> Streptomyces <400> 8 Met Gln Lys Arg Gln Arg Glu Leu Ser Ala Leu Thr Leu 1 5 10 Pro Thr Ser Ala Glu Gly Val Ser Glu Ser His Arg Ala Arg Ser Val 15 20 25 Gly Ile Gly Arg Ala His Ala Lys Ala Ile Leu Leu Gly Glu His Ala 30 35 40 45 Val Val Tyr Gly Ala Pro Ala Leu Ala Leu Pro Ile Pro Gln Leu Thr 50 55 60 Val Thr Ala Ser Val Gly Trp Ser Ser Glu Ala Ser Asp Ser Ala Gly 65 70 75 Gly Leu Ser Tyr Thr Met Thr Gly Thr Pro Ser Arg Ala Leu Val Thr 80 85 90 Gln Ala Ser Asp Gly Leu His Arg Leu Thr Ala Glu Phe Met Ala Arg 95 100 105 Met Gly Val Thr Asn Ala Pro His Leu Asp Val Ile Leu Asp Gly Ala 110 115 120 125 Ile Pro His Gly Arg Gly Leu Gly Ser Ser Ala Ala Gly Ser Arg Ala 130 135 140 Ile Ala Leu Ala Leu Ala Asp Leu Phe Gly His Glu Leu Ala Glu His 145 150 155 Thr Ala Tyr Glu Leu Val Gln Thr Ala Glu Asn Met Ala His Gly Arg 160 165 170 Ala Ser Gly Val Asp Ala Met Thr Val Gly Ala Ser Arg Pro Leu Leu 175 180 185 Phe Gln Gln Gly Arg Thr Glu Arg Leu Ala Ile Gly Cys Asp Ser Leu 190 195 200 205 Phe Ile Val Ala Asp Ser Gly Val Pro Gly Ser Thr Lys Glu Ala Val 210 215 220 Glu Met Leu Arg Glu Gly Phe Thr Arg Ser Ala Gly Thr Gln Glu Arg 225 230 235 Phe Val Gly Arg Ala Thr Glu Leu Thr Glu Ala Ala Arg Gln Ala Leu 240 245 250 Ala Asp Gly Arg Pro Glu Glu Leu Gly Ser Gln Leu Thr Tyr Tyr His 255 260 265 Glu Leu Leu His Glu Ala Arg Leu Ser Thr Asp Gly Ile Asp Ala Leu 270 275 280 285 Val Glu Ala Ala Leu Lys Ala Gly Ser Leu Gly Ala Lys Ile Thr Gly 290 295 300 Gly Gly Leu Gly Gly Cys Met Ile Ala Gln Ala Arg Pro Glu Gln Ala 305 310 315 Arg Glu Val Thr Arg Gln Leu His Glu Ala Gly Ala Val Gln Thr Trp 320 325 330 Val Val Pro Leu Lys Gly Leu Asp Asn His Ala Gln 335 340<210> 8 <211> 345 <212> PRT <213> Streptomyces <400> 8 Met Gln Lys Arg Gln Arg Glu Leu Ser Ala Leu Thr Leu 1 5 10 Pro Thr Ser Ala Glu Gly Val Ser Glu Ser His Arg Ala Arg Ser Val 15 20 25 Gly Ile Gly Arg Ala His Ala Lys Ala Ile Leu Leu Gly Glu His Ala 30 35 40 45 Val Val Tyr Gly Ala Pro Ala Leu Ala Leu Pro Ile Pro Gln Leu Thr 50 55 60 Val Thr Ala Ser Val Gly Trp Ser Ser Glu Ala Ser Asp Ser Ala Gly 65 70 75 Gly Leu Ser Tyr Thr Met Thr Gly Thr Pro Ser Arg Ala Leu Val Thr 80 85 90 Gln Ala Ser Asp Gly Leu His Arg Leu Thr Ala Glu Phe Met Ala Arg 95 100 105 Met Gly Val Thr Asn Ala Pro His Leu Asp Val Ile Leu Asp Gly Ala 110 115 120 125 Ile Pro His Gly Arg Gly Leu Gly Ser Ser Ala Ala Gly Ser Arg Ala 130 135 140 Ile Ala Leu Ala Leu Ala Asp Leu Phe Gly His Glu Leu Ala Glu His 145 150 155 Thr Ala Tyr Glu Leu Val Gln Thr Ala Glu Asn Met Ala His Gly Arg 160 165 170 Ala Ser Gly Val Asp Ala Met Thr Val Gly Ala Ser Arg Pro Leu Leu 175 180 185 Phe Gln Gln Gly Arg Thr Glu Arg Leu Ala Ile Gly Cys Asp Ser Leu 190 195 200 205 Phe Ile Val Ala Asp Ser Gly Val Pro Gly Ser Thr Lys Glu Ala Val 210 215 220 Glu Met Leu Arg Glu Gly Phe Thr Arg Ser Ala Gly Thr Gln Glu Arg 225 230 235 Phe Val Gly Arg Ala Thr Glu Leu Thr Glu Ala Ala Arg Gln Ala Leu 240 245 250 Ala Asp Gly Arg Pro Glu Glu Leu Gly Ser Gln Leu Thr Tyr Tyr His 255 260 265 Glu Leu Leu His Glu Ala Arg Leu Ser Thr Asp Gly Ile Asp Ala Leu 270 275 280 285 Val Glu Ala Ala Leu Lys Ala Gly Ser Leu Gly Ala Lys Ile Thr Gly 290 295 300 Gly Gly Leu Gly Gly Cys Met Ile Ala Gln Ala Arg Pro Glu Gln Ala 305 310 315 Arg Glu Val Thr Arg Gln Leu His Glu Ala Gly Ala Val Gln Thr Trp 320 325 330 Val Val Pro Leu Lys Gly Leu Asp Asn His Ala Gln 335 340
【0086】 <210> 9 <211> 350 <212> PRT <213> Streptomyces <400> 9 Met Arg Ser Glu His Pro Thr Thr Thr Val Leu 1 5 10 Gln Ser Arg Glu Gln Gly Ser Ala Ala Gly Ala Thr Ala Val Ala His 15 20 25 Pro Asn Ile Ala Leu Ile Lys Tyr Trp Gly Lys Arg Asp Glu Arg Leu 30 35 40 Ile Leu Pro Cys Thr Thr Ser Leu Ser Met Thr Leu Asp Val Phe Pro 45 50 55 Thr Thr Thr Glu Val Arg Leu Asp Pro Ala Ala Glu His Asp Thr Ala 60 65 70 75 Ala Leu Asn Gly Glu Val Ala Thr Gly Glu Thr Leu Arg Arg Ile Ser 80 85 90 Ala Phe Leu Ser Leu Val Arg Glu Val Ala Gly Ser Asp Gln Arg Ala 95 100 105 Val Val Asp Thr Arg Asn Thr Val Pro Thr Gly Ala Gly Leu Ala Ser 110 115 120 Ser Ala Ser Gly Phe Ala Ala Leu Ala Val Ala Ala Ala Ala Ala Tyr 125 130 135 Gly Leu Glu Leu Asp Asp Arg Gly Leu Ser Arg Leu Ala Arg Arg Gly 140 145 150 155 Ser Gly Ser Ala Ser Arg Ser Ile Phe Gly Gly Phe Ala Val Trp His 160 165 170 Ala Gly Pro Asp Gly Thr Ala Thr Glu Ala Asp Leu Gly Ser Tyr Ala 175 180 185 Glu Pro Val Pro Ala Ala Asp Leu Asp Pro Ala Leu Val Ile Ala Val 190 195 200 Val Asn Ala Gly Pro Lys Pro Val Ser Ser Arg Glu Ala Met Arg Arg 205 210 215 Thr Val Asp Thr Ser Pro Leu Tyr Arg Pro Trp Ala Asp Ser Ser Lys 220 225 230 235 Asp Asp Leu Asp Glu Met Arg Ser Ala Leu Leu Arg Gly Asp Leu Glu 240 245 250 Ala Val Gly Glu Ile Ala Glu Arg Asn Ala Leu Gly Met His Ala Thr 255 260 265 Met Leu Ala Ala Arg Pro Ala Val Arg Tyr Leu Ser Pro Ala Thr Val 270 275 280 Thr Val Leu Asp Ser Val Leu Gln Leu Arg Lys Asp Gly Val Leu Ala 285 290 295 Tyr Ala Thr Met Asp Ala Gly Pro Asn Val Lys Val Leu Cys Arg Arg 300 305 310 315 Ala Asp Ala Glu Arg Val Ala Asp Val Val Arg Ala Ala Ala Ser Gly 320 325 330 Gly Gln Val Leu Val Ala Gly Pro Gly Asp Gly Ala Arg Leu Leu Ser 335 340 345 Glu Gly Ala 350<210> 9 <211> 350 <212> PRT <213> Streptomyces <400> 9 Met Arg Ser Glu His Pro Thr Thr Thr Val Leu 1 5 10 Gln Ser Arg Glu Gln Gly Ser Ala Ala Gly Ala Thr Ala Val Ala His 15 20 25 Pro Asn Ile Ala Leu Ile Lys Tyr Trp Gly Lys Arg Asp Glu Arg Leu 30 35 40 Ile Leu Pro Cys Thr Thr Ser Leu Ser Met Thr Leu Asp Val Phe Pro 45 50 55 Thr Thr Thr Glu Val Arg Leu Asp Pro Ala Ala Glu His Asp Thr Ala 60 65 70 75 Ala Leu Asn Gly Glu Val Ala Thr Gly Glu Thr Leu Arg Arg Ile Ser 80 85 90 Ala Phe Leu Ser Leu Val Arg Glu Val Ala Gly Ser Asp Gln Arg Ala 95 100 105 Val Val Asp Thr Arg Asn Thr Val Pro Thr Gly Ala Gly Leu Ala Ser 110 115 120 Ser Ala Ser Gly Phe Ala Ala Leu Ala Val Ala Ala Ala Ala Ala Tyr 125 130 135 Gly Leu Glu Leu Asp Asp Arg Gly Leu Ser Arg Leu Ala Arg Arg Gly 140 145 150 155 Ser Gly Ser Ala Ser Arg Ser Ile Phe Gly Gly Phe Ala Val Trp His 160 165 170 Ala Gly Pro Asp Gly Thr Ala Thr Glu Ala Asp Leu Gly Ser Tyr Ala 175 180 185 Glu Pro Val Pro Ala Ala Asp Leu Asp Pro Ala Leu Val Ile Ala Val 190 195 200 Val Asn Ala Gly Pro Lys Pro Val Ser Ser Arg Glu Ala Met Arg Arg 205 210 215 Thr Val Asp Thr Ser Pro Leu Tyr Arg Pro Trp Ala Asp Ser Ser Lys 220 225 230 235 Asp Asp Leu Asp Glu Met Arg Ser Ala Leu Leu Arg Gly Asp Leu Glu 240 245 250 Ala Val Gly Glu Ile Ala Glu Arg Asn Ala Leu Gly Met His Ala Thr 255 260 265 Met Leu Ala Ala Arg Pro Ala Val Arg Tyr Leu Ser Pro Ala Thr Val 270 275 280 Thr Val Leu Asp Ser Val Leu Gln Leu Arg Lys Asp Gly Val Leu Ala 285 290 295 Tyr Ala Thr Met Asp Ala Gly Pro Asn Val Lys Val Leu Cys Arg Arg 300 305 310 315 Ala Asp Ala Glu Arg Val Ala Asp Val Val Arg Ala Ala Ala Ser Gly 320 325 330 Gly Gln Val Leu Val Ala Gly Pro Gly Asp Gly Ala Arg Leu Leu Ser 335 340 345 Glu Gly Ala 350
【0087】 <210> 10 <211> 374 <212> PRT <213> Streptomyces <400> 10 Met Thr Thr 1 Gly Gln Arg Thr Ile Val Arg His Ala Pro Gly Lys Leu Phe Val Ala 5 10 15 Gly Glu Tyr Ala Val Val Asp Pro Gly Asn Pro Ala Ile Leu Val Ala 20 25 30 35 Val Asp Arg His Ile Ser Val Thr Val Ser Asp Ala Asp Ala Asp Thr 40 45 50 Gly Ala Ala Asp Val Val Ile Ser Ser Asp Leu Gly Pro Gln Ala Val 55 60 65 Gly Trp Arg Trp His Asp Gly Arg Leu Val Val Arg Asp Pro Asp Asp 70 75 80 Gly Gln Gln Ala Arg Ser Ala Leu Ala His Val Val Ser Ala Ile Glu 85 90 95 Thr Val Gly Arg Leu Leu Gly Glu Arg Gly Gln Lys Val Pro Ala Leu 100 105 110 115 Thr Leu Ser Val Ser Ser Arg Leu His Glu Asp Gly Arg Lys Phe Gly 120 125 130 Leu Gly Ser Ser Gly Ala Val Thr Val Ala Thr Val Ala Ala Val Ala 135 140 145 Ala Phe Cys Gly Leu Glu Leu Ser Thr Asp Glu Arg Phe Arg Leu Ala 150 155 160 Met Leu Ala Thr Ala Glu Leu Asp Pro Lys Gly Ser Gly Gly Asp Leu 165 170 175 Ala Ala Ser Thr Trp Gly Gly Trp Ile Ala Tyr Gln Ala Pro Asp Arg 180 185 190 195 Ala Phe Val Leu Asp Leu Ala Arg Arg Val Gly Val Asp Arg Thr Leu 200 205 210 Lys Ala Pro Trp Pro Gly His Ser Val Arg Arg Leu Pro Ala Pro Lys 215 220 225 Gly Leu Thr Leu Glu Val Gly Trp Thr Gly Glu Pro Ala Ser Thr Ala 230 235 240 Ser Leu Val Ser Asp Leu His Arg Arg Thr Trp Arg Gly Ser Ala Ser 245 250 255 His Gln Arg Phe Val Glu Thr Thr Thr Asp Cys Val Arg Ser Ala Val 260 265 270 275 Thr Ala Leu Glu Ser Gly Asp Asp Thr Ser Leu Leu His Glu Ile Arg 280 285 290 Arg Ala Arg Gln Glu Leu Ala Arg Leu Asp Asp Glu Val Gly Leu Gly 295 300 305 Ile Phe Thr Pro Lys Leu Thr Ala Leu Cys Asp Ala Ala Glu Ala Val 310 315 320 Gly Gly Ala Ala Lys Pro Ser Gly Ala Gly Gly Gly Asp Cys Gly Ile 325 330 335 Ala Leu Leu Asp Ala Glu Ala Ser Arg Asp Ile Thr His Val Arg Gln 340 345 350 355 Arg Trp Glu Thr Ala Gly Val Leu Pro Leu Pro Leu Thr Pro Ala Leu 360 365 370 Glu Gly Ile<210> 10 <211> 374 <212> PRT <213> Streptomyces <400> 10 Met Thr Thr 1 Gly Gln Arg Thr Ile Val Arg His Ala Pro Gly Lys Leu Phe Val Ala 5 10 15 Gly Glu Tyr Ala Val Val Asp Pro Gly Asn Pro Ala Ile Leu Val Ala 20 25 30 35 Val Asp Arg His Ile Ser Val Thr Val Ser Asp Ala Asp Ala Asp Thr 40 45 50 Gly Ala Ala Asp Val Val Ile Ser Ser Asp Leu Gly Pro Gln Ala Val 55 60 65 Gly Trp Arg Trp His Asp Gly Arg Leu Val Val Arg Asp Pro Asp Asp 70 75 80 Gly Gln Gln Ala Arg Ser Ala Leu Ala His Val Val Ser Ala Ile Glu 85 90 95 Thr Val Gly Arg Leu Leu Gly Glu Arg Gly Gln Lys Val Pro Ala Leu 100 105 110 115 Thr Leu Ser Val Ser Ser Arg Leu His Glu Asp Gly Arg Lys Phe Gly 120 125 130 Leu Gly Ser Ser Gly Ala Val Thr Val Ala Thr Val Ala Ala Val Ala 135 140 145 Ala Phe Cys Gly Leu Glu Leu Ser Thr Asp Glu Arg Phe Arg Leu Ala 150 155 160 Met Leu Ala Thr Ala Glu Leu Asp Pro Lys Gly Ser Gly Gly Asp Leu 165 170 175 Ala Ala Ser Thr Trp Gly Gly Trp Ile Ala Tyr Gln Ala Pro Asp Arg 180 185 190 195 Ala Ph e Val Leu Asp Leu Ala Arg Arg Val Gly Val Asp Arg Thr Leu 200 205 210 Lys Ala Pro Trp Pro Gly His Ser Val Arg Arg Leu Pro Ala Pro Lys 215 220 225 Gly Leu Thr Leu Glu Val Gly Trp Thr Gly Glu Pro Ala Ser Thr Ala 230 235 240 Ser Leu Val Ser Asp Leu His Arg Arg Thr Trp Arg Gly Ser Ala Ser 245 250 255 His Gln Arg Phe Val Glu Thr Thr Thr Asp Cys Val Arg Ser Ala Val 260 265 270 275 275 Thr Ala Leu Glu Ser Gly Asp Asp Thr Ser Leu Leu His Glu Ile Arg 280 285 290 Arg Ala Arg Gln Glu Leu Ala Arg Leu Asp Asp Glu Val Gly Leu Gly 295 300 305 Ile Phe Thr Pro Lys Leu Thr Ala Leu Cys Asp Ala Ala Glu Ala Val 310 315 320 Gly Gly Ala Ala Lys Pro Ser Gly Ala Gly Gly Gly Asp Cys Gly Ile 325 330 335 Ala Leu Leu Asp Ala Glu Ala Ser Arg Asp Ile Thr His Val Arg Gln 340 345 350 355 Arg Trp Glu Thr Ala Gly Val Leu Pro Leu Pro Leu Thr Pro Ala Leu 360 365 370 Glu Gly Ile
【0088】 <210> 11 <211> 363 <212> PRT <213> Streptomyces <400> 11 Met Thr Ser Ala Gln Arg Lys 1 5 Asp Asp His Val Arg Leu Ala Ile Glu Gln His Asn Ala His Ser Gly 10 15 20 Arg Asn Gln Phe Asp Asp Val Ser Phe Val His His Ala Leu Ala Gly 25 30 35 Ile Asp Arg Pro Asp Val Ser Leu Ala Thr Ser Phe Ala Gly Ile Ser 40 45 50 55 Trp Gln Val Pro Ile Tyr Ile Asn Ala Met Thr Gly Gly Ser Glu Lys 60 65 70 Thr Gly Leu Ile Asn Arg Asp Leu Ala Thr Ala Ala Arg Glu Thr Gly 75 80 85 Val Pro Ile Ala Ser Gly Ser Met Asn Ala Tyr Ile Lys Asp Pro Ser 90 95 100 Cys Ala Asp Thr Phe Arg Val Leu Arg Asp Glu Asn Pro Asn Gly Phe 105 110 115 Val Ile Ala Asn Ile Asn Ala Thr Thr Thr Val Asp Asn Ala Gln Arg 120 125 130 135 Ala Ile Asp Leu Ile Glu Ala Asn Ala Leu Gln Ile His Ile Asn Thr 140 145 150 Ala Gln Glu Thr Pro Met Pro Glu Gly Asp Arg Ser Phe Ala Ser Trp 155 160 165 Val Pro Gln Ile Glu Lys Ile Ala Ala Ala Val Asp Ile Pro Val Ile 170 175 180 Val Lys Glu Val Gly Asn Gly Leu Ser Arg Gln Thr Ile Leu Leu Leu 185 190 195 Ala Asp Leu Gly Val Gln Ala Ala Asp Val Ser Gly Arg Gly Gly Thr 200 205 210 215 Asp Phe Ala Arg Ile Glu Asn Gly Arg Arg Glu Leu Gly Asp Tyr Ala 220 225 230 Phe Leu His Gly Trp Gly Gln Ser Thr Ala Ala Cys Leu Leu Asp Ala 235 240 245 Gln Asp Ile Ser Leu Pro Val Leu Ala Ser Gly Gly Val Arg His Pro 250 255 260 Leu Asp Val Val Arg Ala Leu Ala Leu Gly Ala Arg Ala Val Gly Ser 265 270 275 Ser Ala Gly Phe Leu Arg Thr Leu Met Asp Asp Gly Val Asp Ala Leu 280 285 290 295 Ile Thr Lys Leu Thr Thr Trp Leu Asp Gln Leu Ala Ala Leu Gln Thr 300 305 310 Met Leu Gly Ala Arg Thr Pro Ala Asp Leu Thr Arg Cys Asp Val Leu 315 320 325 Leu His Gly Glu Leu Arg Asp Phe Cys Ala Asp Arg Gly Ile Asp Thr 330 335 340 Arg Arg Leu Ala Gln Arg Ser Ser Ser Ile Glu Ala Leu Gln Thr Thr 345 350 355 Gly Ser Thr Arg 360<210> 11 <211> 363 <212> PRT <213> Streptomyces <400> 11 Met Thr Ser Ala Gln Arg Lys 15 Asp Asp His Val Arg Leu Ala Ile Glu Gln His Asn Ala His Ser Gly 10 15 20 Arg Asn Gln Phe Asp Asp Val Ser Phe Val His His Ala Leu Ala Gly 25 30 35 Ile Asp Arg Pro Asp Val Ser Leu Ala Thr Ser Phe Ala Gly Ile Ser 40 45 50 55 Trp Gln Val Pro Ile Tyr Ile Asn Ala Met Thr Gly Gly Ser Glu Lys 60 65 70 Thr Gly Leu Ile Asn Arg Asp Leu Ala Thr Ala Ala Arg Glu Thr Gly 75 80 85 Val Pro Ile Ala Ser Gly Ser Met Asn Ala Tyr Ile Lys Asp Pro Ser 90 95 100 Cys Ala Asp Thr Phe Arg Val Leu Arg Asp Glu Asn Pro Asn Gly Phe 105 110 115 Val Ile Ala Asn Ile Asn Ala Thr Thr Thr Val Asp Asn Ala Gln Arg 120 125 130 135 Ala Ile Asp Leu Ile Glu Ala Asn Ala Leu Gln Ile His Ile Asn Thr 140 145 150 Ala Gln Glu Thr Pro Met Pro Glu Gly Asp Arg Ser Phe Ala Ser Trp 155 160 165 Val Pro Gln Ile Glu Lys Ile Ala Ala Ala Val Asp Ile Pro Val Ile 170 175 180 Val Lys Glu Val Gly Asn Gly Leu Ser Arg Gln Thr Ile Leu Leu Leu 18 5 190 195 Ala Asp Leu Gly Val Gln Ala Ala Asp Val Ser Gly Arg Gly Gly Thr 200 205 210 215 Asp Phe Ala Arg Ile Glu Asn Gly Arg Arg Glu Leu Gly Asp Tyr Ala 220 225 230 Phe Leu His Gly Trp Gly Gln Ser Thr Ala Ala Cys Leu Leu Asp Ala 235 240 245 Gln Asp Ile Ser Leu Pro Val Leu Ala Ser Gly Gly Val Arg His Pro 250 255 260 Leu Asp Val Val Arg Ala Leu Ala Leu Gly Ala Arg Ala Val Gly Ser 265 270 275 Ser Ala Gly Phe Leu Arg Thr Leu Met Asp Asp Gly Val Asp Ala Leu 280 285 290 295 Ile Thr Lys Leu Thr Thr Trp Leu Asp Gln Leu Ala Ala Leu Gln Thr 300 305 310 Met Leu Gly Ala Arg Thr Pro Ala Asp Leu Thr Arg Cys Asp Val Leu 315 320 325 Leu His Gly Glu Leu Arg Asp Phe Cys Ala Asp Arg Gly Ile Asp Thr 330 335 340 Arg Arg Leu Ala Gln Arg Ser Ser Ser Ile Glu Ala Leu Gln Thr Thr 345 350 355 Gly Ser Thr Arg 360
【0089】 <210> 12 <211> 353 <212> PRT <213> Streptomyces <400> 12 Met Thr Glu Thr 1 His Ala Ile Ala Gly Val Pro Met Arg Trp Val Gly Pro Leu Arg Ile 5 10 15 20 Ser Gly Asn Val Ala Glu Thr Glu Thr Gln Val Pro Leu Ala Thr Tyr 25 30 35 Glu Ser Pro Leu Trp Pro Ser Val Gly Arg Gly Ala Lys Val Ser Arg 40 45 50 Leu Thr Glu Lys Gly Ile Val Ala Thr Leu Val Asp Glu Arg Met Thr 55 60 65 Arg Ser Val Ile Val Glu Ala Thr Asp Ala Gln Thr Ala Tyr Met Ala 70 75 80 Ala Gln Thr Ile His Ala Arg Ile Asp Glu Leu Arg Glu Val Val Arg 85 90 95 100 Gly Cys Ser Arg Phe Ala Gln Leu Ile Asn Ile Lys His Glu Ile Asn 105 110 115 Ala Asn Leu Leu Phe Ile Arg Phe Glu Phe Thr Thr Gly Asp Ala Ser 120 125 130 Gly His Asn Met Ala Thr Leu Ala Ser Asp Val Leu Leu Gly His Leu 135 140 145 Leu Glu Thr Ile Pro Gly Ile Ser Tyr Gly Ser Ile Ser Gly Asn Tyr 150 155 160 Cys Thr Asp Lys Lys Ala Thr Ala Ile Asn Gly Ile Leu Gly Arg Gly 165 170 175 180 Lys Asn Val Ile Thr Glu Leu Leu Val Pro Arg Asp Val Val Glu Asn 185 190 195 Asn Leu His Thr Thr Ala Ala Lys Ile Val Glu Leu Asn Ile Arg Lys 200 205 210 Asn Leu Leu Gly Thr Leu Leu Ala Gly Gly Ile Arg Ser Ala Asn Ala 215 220 225 His Phe Ala Asn Met Leu Leu Gly Phe Tyr Leu Ala Thr Gly Gln Asp 230 235 240 Ala Ala Asn Ile Val Glu Gly Ser Gln Gly Val Val Met Ala Glu Asp 245 250 255 260 Arg Asp Gly Asp Leu Tyr Phe Ala Cys Thr Leu Pro Asn Leu Ile Val 265 270 275 Gly Thr Val Gly Asn Gly Lys Gly Leu Gly Phe Val Glu Thr Asn Leu 280 285 290 Ala Arg Leu Gly Cys Arg Ala Asp Arg Glu Pro Gly Glu Asn Ala Arg 295 300 305 Arg Leu Ala Val Ile Ala Ala Ala Thr Val Leu Cys Gly Glu Leu Ser 310 315 320 Leu Leu Ala Ala Gln Thr Asn Pro Gly Glu Leu Met Arg Ala His Val 325 330 335 340 Gln Leu Glu Arg Asp Asn Lys Thr Ala Lys Val Gly Ala 345 350<210> 12 <211> 353 <212> PRT <213> Streptomyces <400> 12 Met Thr Glu Thr 1 His Ala Ile Ala Gly Val Pro Met Arg Trp Val Gly Pro Leu Arg Ile 5 10 15 20 Ser Gly Asn Val Ala Glu Thr Glu Thr Gln Val Pro Leu Ala Thr Tyr 25 30 35 Glu Ser Pro Leu Trp Pro Ser Val Gly Arg Gly Ala Lys Val Ser Arg 40 45 50 Leu Thr Glu Lys Gly Ile Val Ala Thr Leu Val Asp Glu Arg Met Thr 55 60 65 Arg Ser Val Ile Val Glu Ala Thr Asp Ala Gln Thr Ala Tyr Met Ala 70 75 80 Ala Gln Thr Ile His Ala Arg Ile Asp Glu Leu Arg Glu Val Val Arg 85 90 95 100 Gly Cys Ser Arg Phe Ala Gln Leu Ile Asn Ile Lys His Glu Ile Asn 105 110 115 Ala Asn Leu Leu Phe Ile Arg Phe Glu Phe Thr Thr Gly Asp Ala Ser 120 125 130 Gly His Asn Met Ala Thr Leu Ala Ser Asp Val Leu Leu Gly His Leu 135 140 145 Leu Glu Thr Ile Pro Gly Ile Ser Tyr Gly Ser Ile Ser Gly Asn Tyr 150 155 160 Cys Thr Asp Lys Lys Ala Thr Ala Ile Asn Gly Ile Leu Gly Arg Gly 165 170 175 180 Lys Asn Val Ile Thr Glu Leu Leu Val Pro Arg Asp Val Val Glu Asn 185 190 195 As n Leu His Thr Thr Ala Ala Lys Ile Val Glu Leu Asn Ile Arg Lys 200 205 210 Asn Leu Leu Gly Thr Leu Leu Ala Gly Gly Ile Arg Ser Ala Asn Ala 215 220 225 His Phe Ala Asn Met Leu Leu Gly Phe Tyr Leu Ala Thr Gly Gln Asp 230 235 240 Ala Ala Asn Ile Val Glu Gly Ser Gln Gly Val Val Met Ala Glu Asp 245 250 255 260 Arg Asp Gly Asp Leu Tyr Phe Ala Cys Thr Leu Pro Asn Leu Ile Val 265 270 275 Gly Thr Val Gly Asn Gly Lys Gly Leu Gly Phe Val Glu Thr Asn Leu 280 285 290 Ala Arg Leu Gly Cys Arg Ala Asp Arg Glu Pro Gly Glu Asn Ala Arg 295 300 305 Arg Leu Ala Val Ile Ala Ala Ala Thr Val Val Leu Cys Gly Glu Glu Leu Ser 310 315 320 Leu Leu Ala Ala Gln Thr Asn Pro Gly Glu Leu Met Arg Ala His Val 325 330 335 340 Gln Leu Glu Arg Asp Asn Lys Thr Ala Lys Val Gly Ala 345 350
【0090】 <210> 13 <211> 389 <212> PRT <213> Streptomyces <400> 13 Met Ser Ile Ser Ile Gly Ile His Asp 1 5 Leu Ser Phe Ala Thr Thr Glu Phe Val Leu Pro His Thr Ala Leu Ala 10 15 20 25 Glu Tyr Asn Gly Thr Glu Ile Gly Lys Tyr His Val Gly Ile Gly Gln 30 35 40 Gln Ser Met Ser Val Pro Ala Ala Asp Glu Asp Ile Val Thr Met Ala 45 50 55 Ala Thr Ala Ala Arg Pro Ile Ile Glu Arg Asn Gly Lys Ser Arg Ile 60 65 70 Arg Thr Val Val Phe Ala Thr Glu Ser Ser Ile Asp Gln Ala Lys Ala 75 80 85 Gly Gly Val Tyr Val His Ser Leu Leu Gly Leu Glu Ser Ala Cys Arg 90 95 100 105 Val Val Glu Leu Lys Gln Ala Cys Tyr Gly Ala Thr Ala Ala Leu Gln 110 115 120 Phe Ala Ile Gly Leu Val Arg Arg Asp Pro Ala Gln Gln Val Leu Val 125 130 135 Ile Ala Ser Asp Val Ser Lys Tyr Glu Leu Asp Ser Pro Gly Glu Ala 140 145 150 Thr Gln Gly Ala Ala Ala Val Ala Met Leu Val Gly Ala Asp Pro Ala 155 160 165 Leu Leu Arg Ile Glu Glu Pro Ser Gly Leu Phe Thr Ala Asp Val Met 170 175 180 185 Asp Phe Trp Arg Pro Asn Tyr Leu Thr Thr Ala Leu Val Asp Gly Gln 190 195 200 Glu Ser Ile Asn Ala Tyr Leu Gln Ala Val Glu Gly Ala Trp Lys Asp 205 210 215 Tyr Ala Glu Gln Asp Gly Arg Ser Leu Glu Glu Phe Ala Ala Phe Val 220 225 230 Tyr His Gln Pro Phe Thr Lys Met Ala Tyr Lys Ala His Arg His Leu 235 240 245 Leu Asn Phe Asn Gly Tyr Asp Thr Asp Lys Asp Ala Ile Glu Gly Ala 250 255 260 265 Leu Gly Gln Thr Thr Ala Tyr Asn Asn Val Ile Gly Asn Ser Tyr Thr 270 275 280 Ala Ser Val Tyr Leu Gly Leu Ala Ala Leu Leu Asp Gln Ala Asp Asp 285 290 295 Leu Thr Gly Arg Ser Ile Gly Phe Leu Ser Tyr Gly Ser Gly Ser Val 300 305 310 Ala Glu Phe Phe Ser Gly Thr Val Val Ala Gly Tyr Arg Glu Arg Leu 315 320 325 Arg Thr Glu Ala Asn Gln Glu Ala Ile Ala Arg Arg Lys Ser Val Asp 330 335 340 345 Tyr Ala Thr Tyr Arg Glu Leu His Glu Tyr Thr Leu Pro Ser Asp Gly 350 355 360 Gly Asp His Ala Thr Pro Val Gln Thr Thr Gly Pro Phe Arg Leu Ala 365 370 375 Gly Ile Asn Asp His Lys Arg Ile Tyr Glu Ala Arg 380 385<210> 13 <211> 389 <212> PRT <213> Streptomyces <400> 13 Met Ser Ile Ser Ile Gly Ile His Asp 15 Leu Ser Phe Ala Thr Thr Glu Phe Val Leu Pro His Thr Ala Leu Ala 10 15 20 25 Glu Tyr Asn Gly Thr Glu Ile Gly Lys Tyr His Val Gly Ile Gly Gln 30 35 40 Gln Ser Met Ser Val Pro Ala Ala Asp Glu Asp Ile Val Thr Met Ala 45 50 55 Ala Thr Ala Ala Arg Pro Ile Ile Glu Arg Asn Gly Lys Ser Arg Ile 60 65 70 Arg Thr Val Val Phe Ala Thr Glu Ser Ser Ile Asp Gln Ala Lys Ala 75 80 85 Gly Gly Val Tyr Val His Ser Leu Leu Gly Leu Glu Ser Ala Cys Arg 90 95 100 105 Val Val Glu Leu Lys Gln Ala Cys Tyr Gly Ala Thr Ala Ala Leu Gln 110 115 120 Phe Ala Ile Gly Leu Val Arg Arg Asp Pro Ala Gln Gln Val Leu Val 125 130 135 Ile Ala Ser Asp Val Ser Lys Tyr Glu Leu Asp Ser Pro Gly Glu Ala 140 145 150 Thr Gln Gly Ala Ala Ala Val Ala Met Leu Val Gly Ala Asp Pro Ala 155 160 165 Leu Leu Arg Ile Glu Glu Glu Pro Ser Gly Leu Phe Thr Ala Asp Val Met 170 175 180 185 185 Asp Phe Trp Arg Pro Asn Tyr Leu Thr Thr Ala Leu Val As p Gly Gln 190 195 200 Glu Ser Ile Asn Ala Tyr Leu Gln Ala Val Glu Gly Ala Trp Lys Asp 205 210 215 Tyr Ala Glu Gln Asp Gly Arg Ser Leu Glu Glu Phe Ala Ala Phe Val 220 225 230 Tyr His Gln Pro Phe Thr Lys Met Ala Tyr Lys Ala His Arg His Leu 235 240 245 Leu Asn Phe Asn Gly Tyr Asp Thr Asp Lys Asp Ala Ile Glu Gly Ala 250 255 260 265 Leu Gly Gln Thr Thr Ala Tyr Asn Asn Val Ile Gly Asn Ser Tyr Thr 270 275 280 Ala Ser Val Tyr Leu Gly Leu Ala Ala Leu Leu Asp Gln Ala Asp Asp 285 290 295 Leu Thr Gly Arg Ser Ile Gly Phe Leu Ser Tyr Gly Ser Gly Ser Val 300 305 310 Ala Glu Phe Phe Ser Gly Thr Val Val Ala Gly Tyr Arg Glu Arg Leu 315 320 325 Arg Thr Glu Ala Asn Gln Glu Ala Ile Ala Arg Arg Lys Ser Val Asp 330 335 340 345 Tyr Ala Thr Tyr Arg Glu Leu His Glu Tyr Thr Leu Pro Ser Asp Gly 350 355 360 Gly Asp His Ala Thr Pro Val Gln Thr Thr Gly Pro Phe Arg Leu Ala 365 370 375 Gly Ile Asn Asp His Lys Arg Ile Tyr Glu Ala Arg 380 385
【図1】図1は本発明の遺伝子の構造、欠失変異体の構
造、各形質転換体の各種培養条件下に生育の有無の結果
を示す図である。FIG. 1 shows the structure of the gene of the present invention, the structure of a deletion mutant, and the results of the presence or absence of growth of each transformant under various culture conditions.
───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.7 識別記号 FI テーマコート゛(参考) C12R 1:19) C12N 15/00 ZNAA (72)発明者 高橋 俊二 千葉県千葉市中央区矢作町641 グリーン ハイツB−102号 (72)発明者 高木 基樹 東京都文京区本郷5−18−12 フタキハイ ツ306号 Fターム(参考) 4B024 AA01 AA03 AA11 BA07 BA08 BA10 BA80 CA03 CA09 DA06 EA04 FA02 GA11 GA19 GA27 HA13 HA14 4B050 CC01 CC03 DD02 EE01 LL01 LL03 LL05 4B065 AA26X AA50Y AB01 AC14 AC15 AC16 BA02 BA25 BB01 BC03 BC26 BD15 BD16 BD28 CA27 CA28 CA29 CA44 CA46──────────────────────────────────────────────────続 き Continued on the front page (51) Int.Cl. 7 Identification FI theme coat ゛ (Reference) C12R 1:19) C12N 15/00 ZNAA (72) Inventor Shunji Takahashi 641 Yagicho, Chuo-ku, Chiba-shi, Chiba Green Heights B-102 (72) Inventor Motoki Takagi 5-18-12 Hongo, Bunkyo-ku, Tokyo Futaki Heights 306 F-term (reference) 4B024 AA01 AA03 AA11 BA07 BA08 BA10 BA80 CA03 CA09 DA06 EA04 FA02 GA11 GA19 GA27 HA13 HA14 4B050 CC01 CC03 DD02 EE01 LL01 LL03 LL05 4B065 AA26X AA50Y AB01 AC14 AC15 AC16 BA02 BA25 BB01 BC03 BC26 BD15 BD16 BD28 CA27 CA28 CA29 CA44 CA46
Claims (11)
換、付加及び/または挿入されている塩基配列であっ
て、メバロン酸経路を機能させるのに必要な酵素を全て
コードする塩基配列;または (3)配列番号1の塩基配列とストリンジェントな条件
下でハイブリダイズすることができる塩基配列であっ
て、メバロン酸経路を機能させるのに必要な酵素を全て
コードする塩基配列。1. DNA having any of the following: (1) a nucleotide sequence of SEQ ID NO: 1; (2) one to several nucleotides are deleted, substituted, added and / or inserted in SEQ ID NO: 1 A nucleotide sequence that encodes all enzymes necessary for the function of the mevalonate pathway; or (3) a nucleotide sequence that can hybridize with the nucleotide sequence of SEQ ID NO: 1 under stringent conditions. A nucleotide sequence encoding all enzymes necessary for the function of the mevalonate pathway.
酵素が、少なくともホスホメバロン酸キナーゼ、ジホス
ホメバロン酸デカルボキシラーゼ、メバロン酸キナー
ゼ、HMG−CoAレダクターゼ及びHMG−CoAシ
ンターゼである、請求項1に記載のDNA。2. The method according to claim 1, wherein the enzymes required for functioning the mevalonate pathway are at least phosphomevalonate kinase, diphosphomevalonate decarboxylase, mevalonate kinase, HMG-CoA reductase and HMG-CoA synthase. DNA.
コードされるタンパク質。3. A protein encoded by the DNA according to claim 1 or 2.
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、ホスホメバロン酸キナーゼを
コードする塩基配列、あるいは配列番号2の塩基配列と
ストリンジェントな条件下でハイブリダイズすることが
できる塩基配列であって、ホスホメバロン酸キナーゼを
コードする塩基配列; (2)配列番号3の塩基配列、配列番号3において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、ジホスホメバロン酸デカルボ
キシラーゼをコードする塩基配列、あるいは配列番号3
の塩基配列とストリンジェントな条件下でハイブリダイ
ズすることができる塩基配列であって、ジホスホメバロ
ン酸デカルボキシラーゼをコードする塩基配列; (3)配列番号4の塩基配列、配列番号4において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、メバロン酸キナーゼをコード
する塩基配列、あるいは配列番号4の塩基配列とストリ
ンジェントな条件下でハイブリダイズすることができる
塩基配列であって、メバロン酸キナーゼをコードする塩
基配列; (4)配列番号5の塩基配列;あるいは (5)配列番号7の塩基配列、配列番号7において1か
ら数個の塩基が欠失、置換、付加及び/または挿入され
ている塩基配列であって、HMG−CoAシンターゼを
コードする塩基配列、あるいは配列番号7の塩基配列と
ストリンジェントな条件下でハイブリダイズすることが
できる塩基配列であって、HMG−CoAシンターゼを
コードする塩基配列。4. A DNA having any of the following: (1) a base sequence of SEQ ID NO: 2, a base sequence in which one to several bases are deleted, substituted, added and / or inserted in SEQ ID NO: 2, and a base sequence encoding phosphomevalonate kinase; or A nucleotide sequence capable of hybridizing with the nucleotide sequence of SEQ ID NO: 2 under stringent conditions, wherein the nucleotide sequence encodes phosphomevalonate kinase; (2) the nucleotide sequence of SEQ ID NO: 3; A base sequence in which several bases are deleted, substituted, added and / or inserted, wherein the base sequence encodes diphosphomevalonate decarboxylase;
A base sequence capable of hybridizing under stringent conditions with the base sequence of SEQ ID NO: 4; a base sequence encoding diphosphomevalonate decarboxylase; (3) a base sequence of SEQ ID NO: 4; Is a nucleotide sequence having deletion, substitution, addition and / or insertion of a base, which is capable of hybridizing under stringent conditions to the nucleotide sequence encoding mevalonate kinase or the nucleotide sequence of SEQ ID NO: 4. A nucleotide sequence encoding mevalonate kinase; (4) a nucleotide sequence of SEQ ID NO: 5; or (5) a nucleotide sequence of SEQ ID NO: 7; , A base sequence that has been substituted, added and / or inserted and that codes for HMG-CoA synthase, A nucleotide sequence capable of hybridizing with the nucleotide sequence under stringent conditions of SEQ ID NO: 7, the nucleotide sequence encoding HMG-CoA synthase.
れるタンパク質。5. A protein encoded by the DNA according to claim 4.
1から数個のアミノ酸が欠失、置換、付加及び/または
挿入されているアミノ酸配列であって、ホスホメバロン
酸キナーゼ活性を有するアミノ酸配列、あるいは配列番
号8のアミノ酸配列と60%以上の相同性を有するアミ
ノ酸配列であって、ホスホメバロン酸キナーゼ活性を有
するアミノ酸配列; (2)配列番号9のアミノ酸配列、配列番号9において
1から数個のアミノ酸が欠失、置換、付加及び/または
挿入されているアミノ酸配列であって、ジホスホメバロ
ン酸デカルボキシラーゼ活性を有するアミノ酸配列、あ
るいは配列番号9のアミノ酸配列と60%以上の相同性
を有するアミノ酸配列であって、ジホスホメバロン酸デ
カルボキシラーゼ活性を有するアミノ酸配列; (3)配列番号10のアミノ酸配列、配列番号10にお
いて1から数個のアミノ酸が欠失、置換、付加及び/ま
たは挿入されているアミノ酸配列であって、メバロン酸
キナーゼ活性を有するアミノ酸配列、あるいは配列番号
10のアミノ酸配列と60%以上の相同性を有するアミ
ノ酸配列であって、メバロン酸キナーゼ活性を有するア
ミノ酸配列; (4)配列番号11のアミノ酸配列;あるいは (5)配列番号13のアミノ酸配列、配列番号13にお
いて1から数個のアミノ酸が欠失、置換、付加及び/ま
たは挿入されているアミノ酸配列であって、HMG−C
oAシンターゼ活性を有するアミノ酸配列、あるいは配
列番号13のアミノ酸配列と60%以上の相同性を有す
るアミノ酸配列であって、HMG−CoAシンターゼ活
性を有するアミノ酸配列。6. A protein having any one of the following. (1) an amino acid sequence of SEQ ID NO: 8, an amino acid sequence in which one to several amino acids have been deleted, substituted, added and / or inserted in SEQ ID NO: 8, which has phosphomevalonate kinase activity; or An amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 8, which has phosphomevalonate kinase activity; (2) the amino acid sequence of SEQ ID NO: 9, and one to several amino acids in SEQ ID NO: 9 Is an amino acid sequence having a diphosphomevalonate decarboxylase activity or an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 9 An amino acid sequence having diphosphomevalonate decarboxylase activity; (3 An amino acid sequence having SEQ ID NO: 10, an amino acid sequence in which one to several amino acids have been deleted, substituted, added and / or inserted in SEQ ID NO: 10, which has a mevalonate kinase activity; (4) an amino acid sequence of SEQ ID NO: 11; or (5) an amino acid sequence of SEQ ID NO: 13, which has 60% or more homology with the amino acid sequence of 13. An amino acid sequence in which one to several amino acids have been deleted, substituted, added and / or inserted in HMG-C
An amino acid sequence having oA synthase activity or an amino acid sequence having 60% or more homology with the amino acid sequence of SEQ ID NO: 13, which has HMG-CoA synthase activity.
含むベクター。7. A vector comprising the DNA according to claim 1, 2 or 4.
転換体。8. A transformant having the vector according to claim 7.
換体。9. The transformant according to claim 8, which is Escherichia coli.
むベクターを宿主に形質転換して作製した形質転換体を
培養して培養物中にイソプレノイド化合物を生成させる
工程、及び該培養物からイソプレノイド化合物を採取す
る工程を含む、イソプレノイド化合物の製造方法。10. A step of culturing a transformant prepared by transforming a vector containing the DNA according to claim 1 or 2 into a host to produce an isoprenoid compound in a culture, and an isoprenoid compound from the culture. A method for producing an isoprenoid compound, comprising a step of collecting a compound.
ン、ビタミンK2、またはカロテノイドから選択される
イソプレノイド化合物である、請求項10に記載のイソ
プレノイド化合物の製造方法。11. The method for producing an isoprenoid compound according to claim 10, wherein the isoprenoid compound is an isoprenoid compound selected from ubiquinone, vitamin K 2 , or carotenoid.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP34837599A JP4412779B2 (en) | 1999-12-08 | 1999-12-08 | Genes of enzymes involved in the mevalonate pathway from actinomycetes |
AU17310/01A AU1731001A (en) | 1999-12-08 | 2000-12-06 | Actinomycetes-origin genes of enzymes participating in mevalonate pathway |
PCT/JP2000/008620 WO2001042476A1 (en) | 1999-12-08 | 2000-12-06 | Actinomycetes-origin genes of enzymes participating in mevalonate pathway |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP34837599A JP4412779B2 (en) | 1999-12-08 | 1999-12-08 | Genes of enzymes involved in the mevalonate pathway from actinomycetes |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2001161370A true JP2001161370A (en) | 2001-06-19 |
JP4412779B2 JP4412779B2 (en) | 2010-02-10 |
Family
ID=18396611
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP34837599A Expired - Fee Related JP4412779B2 (en) | 1999-12-08 | 1999-12-08 | Genes of enzymes involved in the mevalonate pathway from actinomycetes |
Country Status (3)
Country | Link |
---|---|
JP (1) | JP4412779B2 (en) |
AU (1) | AU1731001A (en) |
WO (1) | WO2001042476A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005269982A (en) * | 2004-03-24 | 2005-10-06 | Asahi Denka Kogyo Kk | Method for producing labeled R-mevalonic acid |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2001269527A1 (en) * | 2000-07-18 | 2002-01-30 | Center For Advanced Science And Technology Incubation, Ltd. | Isopentenyl pyrophosphate isomerase |
JPWO2003095651A1 (en) * | 2002-05-10 | 2005-09-15 | 協和醗酵工業株式会社 | Method for producing mevalonic acid |
JP2010539902A (en) | 2007-09-20 | 2010-12-24 | アミリス バイオテクノロジーズ,インコーポレイティド | Production of isoprenoids |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0955363A3 (en) * | 1998-05-06 | 2004-01-28 | F. Hoffmann-La Roche Ag | Dna sequences encoding enzymes involved in production of isoprenoids |
-
1999
- 1999-12-08 JP JP34837599A patent/JP4412779B2/en not_active Expired - Fee Related
-
2000
- 2000-12-06 WO PCT/JP2000/008620 patent/WO2001042476A1/en active Application Filing
- 2000-12-06 AU AU17310/01A patent/AU1731001A/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005269982A (en) * | 2004-03-24 | 2005-10-06 | Asahi Denka Kogyo Kk | Method for producing labeled R-mevalonic acid |
Also Published As
Publication number | Publication date |
---|---|
WO2001042476A1 (en) | 2001-06-14 |
AU1731001A (en) | 2001-06-18 |
JP4412779B2 (en) | 2010-02-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1072683B1 (en) | Process for producing isoprenoid compounds by using microorganisms | |
JP4986547B2 (en) | Novel acetoacetyl CoA synthase, DNA sequence encoding the same, method for producing the enzyme, and method for producing mevalonic acid using the enzyme | |
von Heimendahl et al. | Pinoresinol–lariciresinol reductases with different stereospecificity from Linum album and Linum usitatissimum | |
KR19990088053A (en) | Improved production of isoprenoids | |
CA2385132C (en) | Process for producing ubiquinone-10 | |
JP2001161370A (en) | Gene of enzyme involved in actinomyces-derived mevalonate pathway | |
CN114555798B (en) | Biosynthesis of eriodictyol | |
JP4401471B2 (en) | Method for producing isoprenoid compound by microorganism | |
JP4668420B2 (en) | Method for producing HMG-CoA reductase inhibitor | |
WO2001057223A1 (en) | Enzyme in non-mevalonate pathway and gene encoding the same | |
JP2000300257A (en) | Search method for antimicrobially or herbicidally active compounds | |
CN1982449A (en) | Process for producing isoprenoid compounds by microorganisms and a method for screening compounds with antibiotic or weeding activity | |
JPWO2002006491A1 (en) | Isopentenyl diphosphate isomerase | |
JP2002512790A (en) | Recombinant secoisolariciresinol dehydrogenase and methods of use | |
KR20130121420A (en) | Recombinant vector comprising cytocrome p450 reductase genes, microorganism transformed thereof and method for producing p450 enzyme-derived compounds using the same | |
MXPA00010446A (en) | Recombinant secoisolariciresinol dehydrogenase, and methods of use |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A711 | Notification of change in applicant |
Free format text: JAPANESE INTERMEDIATE CODE: A711 Effective date: 20061120 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20061129 |
|
A521 | Written amendment |
Free format text: JAPANESE INTERMEDIATE CODE: A821 Effective date: 20061120 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20090728 |
|
A521 | Written amendment |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090925 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20091027 |
|
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20091117 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20121127 Year of fee payment: 3 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 4412779 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20131127 Year of fee payment: 4 |
|
LAPS | Cancellation because of no payment of annual fees |