CN111269906A - Process for the preparation of optically active (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide
- Publication number
- CN111269906A (application number CN202010115296.7A)
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- val
- glu
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000002360 preparation method Methods 0.000 title claims abstract description 12
- XJKUFNKIQNIROK-GFCCVEGCSA-N (2r)-n-(2-fluorophenyl)-n-methyl-2-phenoxypropanamide Chemical compound O([C@H](C)C(=O)N(C)C=1C(=CC=CC=1)F)C1=CC=CC=C1 XJKUFNKIQNIROK-GFCCVEGCSA-N 0.000 title claims abstract description 9
- 238000000034 method Methods 0.000 title claims description 14
- 229920001184 polypeptide Polymers 0.000 claims abstract description 12
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 12
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 12
- HURHHRCQTOKLFO-UHFFFAOYSA-N 2-[4-[(6-chloro-1,3-benzoxazol-2-yl)oxy]phenoxy]-N-(2,6-difluorophenyl)-N-methylpropanamide Chemical compound C=1C=C(OC=2OC3=CC(Cl)=CC=C3N=2)C=CC=1OC(C)C(=O)N(C)C1=C(F)C=CC=C1F HURHHRCQTOKLFO-UHFFFAOYSA-N 0.000 claims abstract description 10
- ADDQHLREJDZPMT-CQSZACIVSA-N metamifop Chemical compound O=C([C@H](OC=1C=CC(OC=2OC3=CC(Cl)=CC=C3N=2)=CC=1)C)N(C)C1=CC=CC=C1F ADDQHLREJDZPMT-CQSZACIVSA-N 0.000 claims abstract description 9
- 108020004414 DNA Proteins 0.000 claims description 11
- 239000013598 vector Substances 0.000 claims description 9
- 239000000203 mixture Substances 0.000 claims description 7
- 102000053602 DNA Human genes 0.000 claims description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 2
- 239000002773 nucleotide Substances 0.000 claims 1
- 125000003729 nucleotide group Chemical group 0.000 claims 1
- 230000003287 optical effect Effects 0.000 abstract description 12
- 230000000694 effects Effects 0.000 abstract description 6
- 238000003912 environmental pollution Methods 0.000 abstract description 3
- 239000003960 organic solvent Substances 0.000 abstract description 3
- 125000003275 alpha amino acid group Chemical group 0.000 abstract 1
- 241000588724 Escherichia coli Species 0.000 description 30
- 238000006243 chemical reaction Methods 0.000 description 24
- 108090000623 proteins and genes Proteins 0.000 description 24
- 102000004169 proteins and genes Human genes 0.000 description 16
- 108010079547 glutamylmethionine Proteins 0.000 description 12
- 239000000758 substrate Substances 0.000 description 9
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 239000012141 concentrate Substances 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 241000192043 Echinochloa Species 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 6
- 239000004009 herbicide Substances 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 5
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 5
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 5
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 5
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 5
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 5
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 5
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 5
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 5
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 5
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 5
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 239000000413 hydrolysate Substances 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- ABEVIHIQUUXDMS-UHFFFAOYSA-N (2-bromophenyl)-phenylmethanone Chemical compound BrC1=CC=CC=C1C(=O)C1=CC=CC=C1 ABEVIHIQUUXDMS-UHFFFAOYSA-N 0.000 description 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 4
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 4
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 4
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 4
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 4
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 4
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 4
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 4
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 4
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 4
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 4
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 4
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 4
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 4
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 4
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 4
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 4
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 4
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 4
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 4
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 4
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 4
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 4
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 4
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 4
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 4
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 4
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 4
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 4
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 4
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 4
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 4
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 4
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 4
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 4
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 4
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 4
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 4
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 4
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 4
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 4
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 4
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 4
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 4
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 4
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 4
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 4
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 4
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 4
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 4
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 4
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 4
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 4
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 4
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 4
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 4
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 4
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 4
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 4
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 4
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 4
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 4
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 4
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 4
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 4
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 4
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 4
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 4
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 4
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 4
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 4
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 238000012258 culturing Methods 0.000 description 4
- 238000001212 derivatisation Methods 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010043612 kentsin Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- -1 (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide compound Chemical class 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 3
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 3
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 3
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 3
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 3
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 3
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 3
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 3
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 3
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 3
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 3
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 3
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 3
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 3
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 3
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 240000007594 Oryza sativa Species 0.000 description 3
- 235000007164 Oryza sativa Nutrition 0.000 description 3
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 3
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 3
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 3
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 3
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 3
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 3
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 3
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 3
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 3
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000007062 hydrolysis Effects 0.000 description 3
- 238000006460 hydrolysis reaction Methods 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 235000009566 rice Nutrition 0.000 description 3
- 108010000998 wheylin-2 peptide Proteins 0.000 description 3
- SXERGJJQSKIUIC-UHFFFAOYSA-N 2-Phenoxypropionic acid Chemical compound OC(=O)C(C)OC1=CC=CC=C1 SXERGJJQSKIUIC-UHFFFAOYSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 2
- 101150030566 CCS1 gene Proteins 0.000 description 2
- 101100332461 Coffea arabica DXMT2 gene Proteins 0.000 description 2
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 2
- 102100036238 Dihydropyrimidinase Human genes 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 241001139251 Jannaschia Species 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 239000001888 Peptone Substances 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 2
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- 101100341123 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) IRA2 gene Proteins 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 101150104736 ccsB gene Proteins 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 108091022884 dihydropyrimidinase Proteins 0.000 description 2
- 239000012154 double-distilled water Substances 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 2
- 230000002363 herbicidal effect Effects 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 230000000707 stereoselective effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 239000002351 wastewater Substances 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- FNAQPQLVCOZGRH-UHFFFAOYSA-N 4-(2-methylpropyl)piperidine-2,6-dione Chemical compound CC(C)CC1CC(=O)NC(=O)C1 FNAQPQLVCOZGRH-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- 241001109946 Aquimarina Species 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 1
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- 241000287470 Jannaschia aquimarina Species 0.000 description 1
- 241000742345 Jannaschia sp. EhC01 Species 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241000419792 Litoreibacter ponti Species 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 241000157908 Paenarthrobacter aurescens Species 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- 241000589625 Ralstonia pickettii Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 description 1
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 1
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 1
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 1
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 1
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000007036 catalytic synthesis reaction Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 150000007523 nucleic acids Chemical group 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000006722 reduction reaction Methods 0.000 description 1
- 238000007670 refining Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 239000010888 waste organic solvent Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
- C12N9/86—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5) acting on amide bonds in cyclic amides, e.g. penicillinase (3.5.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/14—Nitrogen or oxygen as hetero atom and at least one other diverse hetero ring atom in the same ring
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/02—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amides (3.5.2)
- C12Y305/02002—Dihydropyrimidinase (3.5.2.2), i.e. hydantoinase
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention discloses a preparation method of optically active (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide. The method uses a polypeptide that can hydrolyze 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide to obtain (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide with high optical purity and high yield; it avoids the use of large amounts of organic solvent, reduces environmental pollution, and gives a highly active target product. The polypeptide has at least 80%, at least 89%, at least 91%, at least 97% or 100% sequence homology or identity with the amino acid sequence shown in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7.
Description
Technical field:
The invention relates to the technical field of biochemical raw material medicines, in particular to a preparation method of optically active (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide.
Background art:
Although various paddy field herbicides have been developed and used, barnyard grass remains the greatest problem among paddy field weeds.
The development of herbicides capable of controlling barnyard grass is an urgent need in the agricultural field. Currently available herbicides cannot effectively control barnyard grass growth after rice seedlings are transplanted, which causes serious yield losses. It has been reported that barnyard grass growing for 1 week per square meter reduces the yield by 2%, for 5 weeks by about 10%, for 10 weeks by about 19%, and for 20 weeks by about 35%.
To control barnyard grass to reduce yield loss in rice crops, various herbicides have been used. However, there is still a need for herbicides with a broader spectrum of activity, which are environmentally friendly and cost effective.
In order to provide herbicides capable of effectively controlling barnyard grass, researchers have conducted intensive studies and found, in particular, that phenoxypropionic acid N-alkyl-N-2-fluorophenylamides have selective herbicidal activity. It was further found that certain phenoxypropionic acid N-alkyl-N-2-fluorophenylamides exist as (R)- or (S)-stereoisomers, and that the (R)-stereoisomers are safer for rice crops and have better herbicidal activity than the (S)-stereoisomers or mixtures thereof. Accordingly, CN01823753.3 discloses a herbicidally active (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide compound; the excellent activity of the (R)-stereoisomer distinguishes it from the prior art. CN01823753.3 also discloses a method for producing the optically active (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide compound. In that publication, the final optically active compound is obtained entirely by chemical synthesis, which uses large amounts of chemical intermediates and organic solvents and involves complicated steps. In addition, the literature reports the preparation of chiral compounds by chemical reduction, in which the metal catalyst is a critical factor: the requirements on the catalyst are strict, the reaction must be carried out under high pressure, the demands on operating equipment are high, and a large amount of wastewater containing heavy metal ions is generated, which is difficult to treat and causes serious environmental pollution.
In addition, the content of the chiral compound with the required configuration obtained by metal-catalyzed synthesis is usually 50%, which cannot meet the requirement of a content of 99.5% or more. Subsequent resolution and refining are therefore required, so that half of the product is wasted, production efficiency is reduced, a large amount of waste organic solvent is generated, and the burden of environmental treatment is increased.
Summary of the invention:
In order to overcome the defects of the prior art, the invention provides a preparation method of optically active (R)-phenoxypropionic acid-N-methyl-N-2-fluorophenylamide. The method uses a polypeptide that can hydrolyze 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide to obtain (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide with high optical purity and high yield; it avoids the use of large amounts of organic solvents, reduces environmental pollution, and gives a highly active target product.
In order to achieve the above object, the present invention provides the following technical solutions:
A polypeptide having any one of the amino acid sequences shown in (ii), (iii) and (iv):
(ii) the sequence of SEQ ID NO: 3;
(iii) the sequence of SEQ ID NO: 5;
(iv) the sequence of SEQ ID NO: 7.
Preferably, the polypeptide has at least 80%, at least 89%, at least 91%, at least 97% or 100% sequence homology or identity with SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7.
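The identity thresholds above can be illustrated with a minimal sketch. The text does not say how homology or identity is computed; the function below assumes the simplest case, a position-by-position comparison of two equal-length sequences without gaps, which is not a substitute for a proper alignment tool. The name `percent_identity` is hypothetical, and the two fragments are the first 16 residues of SEQ ID NO: 1 and SEQ ID NO: 5 as given in the sequence listing.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions with the same residue in two ungapped sequences.

    Sequences are whitespace-separated three-letter residue codes, as in the
    sequence listing. Assumption: equal length, no alignment gaps.
    """
    residues_a = seq_a.split()
    residues_b = seq_b.split()
    if len(residues_a) != len(residues_b):
        raise ValueError("sequences must have the same length")
    if not residues_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(residues_a, residues_b))
    return 100.0 * matches / len(residues_a)


# First 16 residues of SEQ ID NO: 1 and SEQ ID NO: 5 (from the sequence listing).
SEQ1_HEAD = "Met Ser Lys Val Ile Lys Gly Gly Thr Ile Val Thr Ala Asp Arg Gln"
SEQ5_HEAD = "Met Ser Lys Val Ile Lys Asn Gly Thr Ile Val Thr Ala Asp Arg Gln"

if __name__ == "__main__":
    print(percent_identity(SEQ1_HEAD, SEQ5_HEAD))
```

The two fragments differ at one position out of 16 (Gly versus Asn at residue 7). Relating a sequence to the stated thresholds would require comparing the full 487-residue sequences.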
Use of a polypeptide having any one of the amino acid sequences shown in (i) and (v):
(i) the sequence of SEQ ID NO: 1;
(v) a sequence having at least 80%, at least 89%, at least 91%, at least 97% or 100% sequence homology or identity with SEQ ID NO: 1.
A DNA molecule encoding the polypeptide of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7.
A DNA molecule having the sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8.
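The relationship between the DNA and polypeptide sequences can be checked with a short sketch. It builds the standard genetic code, translates the first 48 nucleotides of SEQ ID NO: 2 from the sequence listing, and lets the result be compared with the first 16 residues of SEQ ID NO: 1. The helper names are hypothetical, and the use of the standard code table is an assumption consistent with expression in E. coli.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 48 nucleotides of SEQ ID NO: 2 (from the sequence listing).
SEQ2_HEAD = "atgagcaaag ttattaaggg tggcaccatt gttaccgcgg accgtcag"

if __name__ == "__main__":
    print(translate(SEQ2_HEAD))
```

The listed lengths are also consistent with this pairing: each DNA sequence has 1461 nucleotides, which is exactly 3 × 487, the length of each polypeptide.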
A recombinant vector comprising any one of the DNA molecules described above, or a DNA molecule capable of expressing any one of the polypeptides described above.
A transformant obtained by introducing the recombinant vector into a host cell.
Preferably, the host cell is E. coli.
A process for preparing optically active (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide, in which the above-mentioned polypeptide or transformant is reacted with an enantiomeric mixture of 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide to form (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide.
The ratio of the polypeptide or transformant to the enantiomeric mixture of 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide is 2-5:1.
An optically active (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide prepared by any one of the above preparation methods.
The preparation method of (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide has the following advantages and beneficial effects:
The polypeptide can hydrolyze 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide to obtain (R)-2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-(2-fluorophenyl)-N-methylamide with high optical purity, high yield and high activity; preparing the target active substance by this method can reduce environmental pollution.
Detailed description of the embodiments:
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to specific embodiments. It should be understood that the described embodiments are only some of the embodiments of the present invention and are intended to be illustrative, not to limit its scope. Further, it should be understood that those skilled in the art may make various changes or modifications to the present invention after reading its teaching, and such equivalents likewise fall within the scope defined by the appended claims.
In order to make the technical means, inventive features, objectives and effects of the invention easy to understand, the invention is further described below with reference to specific embodiments.
Example 1 Strain construction of recombinant transformants with hydantoinase
The hydantoinase sequences from the Jannaschia CCS1 strain, Jannaschia sp. EhC01 strain, Jannaschia aquimarina strain, Litoreibacter ponti strain, Ralstonia pickettii strain and Arthrobacter aurescens strain were retrieved from NCBI, each was codon-optimized for the Escherichia coli expression system (using the conventional online tool at http://www.jcat.de), and the optimized genes were fully synthesized. A BamHI restriction site was added at the N-terminal end of each gene and an XhoI restriction site at the C-terminal end. Each gene was digested and ligated into the BamHI and XhoI restriction sites of plasmid pET-28a, so that the N-terminus of each gene carries a His tag. The process is as follows:
The enzyme digestion system is as follows:
target gene/plasmid 1ug
BamHI 1ul
XhoI 1ul
10× buffer 3ul
ddH2O to 30ul
The reaction was carried out at 37 ℃ overnight, and the desired gene and plasmid were recovered by agarose gel electrophoresis.
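Because each gene is cloned through BamHI and XhoI sites, a synthesized sequence is usually screened for internal copies of those recognition sites before digestion. The following is a minimal sketch of such a screen, offered as an illustration and not as part of the described procedure. The function name is hypothetical, the recognition sequences GGATCC (BamHI) and CTCGAG (XhoI) are the well-known ones, and the example fragment is the first 60 nucleotides of SEQ ID NO: 2 from the sequence listing.

```python
# Recognition sequences of the two enzymes used for cloning.
RECOGNITION_SITES = {"BamHI": "GGATCC", "XhoI": "CTCGAG"}


def find_internal_sites(dna: str, sites: dict = RECOGNITION_SITES) -> dict:
    """Return 1-based start positions of each recognition site in `dna`.

    Both sites are palindromic, so scanning one strand is sufficient.
    """
    seq = "".join(dna.split()).upper()
    hits = {}
    for enzyme, site in sites.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)
            start = seq.find(site, start + 1)
        hits[enzyme] = positions
    return hits


# First 60 nucleotides of SEQ ID NO: 2 (from the sequence listing).
SEQ2_FRAGMENT = (
    "atgagcaaag ttattaaggg tggcaccatt gttaccgcgg accgtcagtg gcaggcggat"
)

if __name__ == "__main__":
    print(find_internal_sites(SEQ2_FRAGMENT))
```

An empty list for an enzyme means the scanned stretch contains no internal site for it; a full screen would pass the complete 1461-nucleotide sequence.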
The ligation system is as follows:
vector large fragment 100ng
target gene 83ng
10× buffer 2ul
T4 ligase 1ul
ddH2O to 20ul
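The amounts in the ligation system can be related to an insert-to-vector molar ratio with a small calculation. The sketch below makes two assumptions that are not stated in the text: the vector length is taken as 5369 bp, the commonly cited size of pET-28a(+), and the few base pairs gained or lost on digestion are ignored. The insert length of 1461 bp comes from the sequence listing. The function name is hypothetical.

```python
def insert_to_vector_molar_ratio(
    insert_ng: float, insert_bp: int, vector_ng: float, vector_bp: int
) -> float:
    """Molar ratio of insert to vector for double-stranded DNA.

    Moles are proportional to mass divided by length, so the average mass per
    base pair cancels out of the ratio.
    """
    return (insert_ng / insert_bp) / (vector_ng / vector_bp)


INSERT_BP = 1461  # length of each gene per the sequence listing (<211>)
VECTOR_BP = 5369  # assumed: commonly cited size of pET-28a(+), not from the text

if __name__ == "__main__":
    ratio = insert_to_vector_molar_ratio(83, INSERT_BP, 100, VECTOR_BP)
    print(round(ratio, 2))
```

Under these assumptions, 83 ng of insert against 100 ng of vector works out to roughly a 3:1 molar excess of insert, a ratio often used for sticky-end ligations.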
Ligation was carried out overnight at 16 ℃. The ligation product was then transformed into DH5α competent cells, which were spread on kanamycin LB plates (kanamycin LB plates: yeast extract 0.5%, peptone 1%, sodium chloride 1%, agar 1.5%, pH 7.0).
After 12 hours of incubation at 37 ℃ in an incubator, single colonies were picked and their plasmids extracted for verification. Verification PCR was performed using universal primers for the T7 promoter and terminator, and subsequent experiments were carried out with the positive plasmids. The PCR system and procedure were as follows:
template plasmid 0.2ul
Forward primer (10uM) 0.4ul
Reverse primer (10uM) 0.4ul
10× TransTaq-T Buffer 1ul
2.5mM dNTPs 0.8ul
TransTaq-T PCR enzyme 0.2ul
ddH2O 7ul
Total 10ul
The PCR procedure was as follows:
Pre-denaturation at 94 ℃ for 5min
Denaturation at 94 ℃ for 30s
Annealing at 60 ℃ for 60s
Extension at 72 ℃ for 1-2min
(denaturation, annealing and extension repeated for 30 cycles)
Final extension at 72 ℃ for 5min
Termination at 4 ℃ for 30min
The positive recombinant vectors that were verified to be correct were named RD01, RD02, RD03, RD04, RD05 and RD06, respectively.
E. coli BL21(DE3) competent cells (manufactured by Invitrogen) were transformed with these positive recombinant vectors to obtain the transformants E. coli RD01, E. coli RD02, E. coli RD03, E. coli RD04, E. coli RD05 and E. coli RD06, in that order.
TABLE 1 Correspondence between strain sources, recombinant vectors and transformants
Source | Recombinant vector name | Transformant name |
Jannaschia CCS1 strain | RD01 | E. coli RD01 |
Jannaschia sp. EhC01 strain | RD02 | E. coli RD02 |
Jannaschia aquimarina strain | RD03 | E. coli RD03 |
Litoreibacter ponti strain | RD04 | E. coli RD04 |
Ralstonia pickettii strain | RD05 | E. coli RD05 |
Arthrobacter aurescens strain | RD06 | E. coli RD06 |
Example 2
Preparation of catalytic cells and protein
The E. coli RD01 transformant was inoculated into 50 ml of LB medium (yeast extract 0.5%, peptone 1%, sodium chloride 1%, pH 7.0) containing 50 μg/ml kanamycin in a 250 ml flask and cultured with shaking at 37 ℃ and 220 rpm. When the culture reached an OD600 of 0.6-0.8, expression was induced by adding 0.5 mM IPTG at 20 ℃ and 200 rpm. The cells were collected by centrifugation and suspended in 50 mM phosphate buffer (pH 7.0).
The cells were disrupted by sonication and centrifuged, and the supernatant was passed through a pre-packed, equilibrated Ni-NTA column (Solarbio, P2010) to capture the protein in the supernatant. Impurities were washed off with 20 mM imidazole and the protein was eluted with 250 mM imidazole. The eluate was then concentrated to 2 mg/mL with a 30 kDa protein concentrator tube to obtain a protein concentrate.
Similarly, the E. coli RD02, E. coli RD03, E. coli RD04, E. coli RD05 and E. coli RD06 transformants were processed according to the method of Example 2 to obtain the corresponding protein concentrates.
The protein concentrates were sequenced; the amino acid sequences and the corresponding nucleic acid sequences are shown in Table 2 below.
TABLE 2
Example 3
Stereoselective hydrolysis of 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide
After culturing the E. coli RD01 transformant, 10 mL of the protein concentrate obtained by separation and purification was taken and 5 mg of the substrate 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide was added, giving a protein-to-substrate ratio of 4:1. The mixture was shaken at 30 ℃ for 48 hours to obtain a hydrolysate. After the reaction, the mixture was centrifuged and the supernatant collected; an appropriate amount of the supernatant was injected and analyzed by high performance liquid chromatography, and the conversion (%) was calculated.
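The 4:1 protein-to-substrate ratio follows from the volumes and concentrations given in Examples 2 and 3. The sketch below reproduces that arithmetic, assuming the ratio is a mass ratio (the text does not say so explicitly); the function name is hypothetical.

```python
def protein_to_substrate_ratio(
    concentrate_ml: float, protein_mg_per_ml: float, substrate_mg: float
) -> float:
    """Mass ratio of protein to substrate in the reaction mixture."""
    if substrate_mg <= 0:
        raise ValueError("substrate mass must be positive")
    protein_mg = concentrate_ml * protein_mg_per_ml
    return protein_mg / substrate_mg


if __name__ == "__main__":
    # 10 mL of concentrate at 2 mg/mL (Example 2) with 5 mg of substrate.
    ratio = protein_to_substrate_ratio(10, 2, 5)
    print(ratio)
    # The text gives a working range of 2-5:1 for this ratio.
    print(2 <= ratio <= 5)
```

With 10 mL of a 2 mg/mL concentrate there is 20 mg of protein for 5 mg of substrate, which falls inside the 2-5:1 range stated for the process.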
The hydrolysate was derivatized with 2-bromobenzophenone at a molar ratio of product to derivatizing agent of 1:1.5. The 2-bromobenzophenone was dissolved in acetonitrile, and the derivatization reaction was carried out at 30 ℃ for 30 minutes. After the reaction, an appropriate amount of the derivatization reaction solution was injected and analyzed for optical purity (% ee).
Similarly, after culturing the E. coli RD02, E. coli RD03, E. coli RD04, E. coli RD05 and E. coli RD06 transformants, the respective protein concentrates obtained by separation and purification were treated as described in Example 3, and the conversion (%) and optical purity (% ee) were calculated for each. The results are given in Table 3 below:
TABLE 3
Conversion (%) = (initial weight of substrate - weight of substrate remaining after reaction)/initial weight of substrate × 100
Taking E. coli RD01 as an example, the conversion is calculated as follows:
The E. coli RD01 reaction yielded 1.4mg of hydrolysate from an initial charge of 5mg of substrate, so the conversion was 1.4/5 × 100% = 28%
Optical purity (% ee) = (A - B)/(A + B) × 100 (A and B represent the amounts of the respective isomers, A > B)
Taking E. coli RD01 as an example, the optical purity (ee value) is calculated as follows:
Optical purity (% ee) = (99.919 - 0.081)/(99.919 + 0.081) × 100 = 99.838
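The two formulas above can be sketched as small functions, using the E. coli RD01 figures from the text as inputs. The function names are hypothetical. The remaining substrate weight of 3.6 mg is an assumption, inferred by subtracting the 1.4 mg of hydrolysate from the 5 mg initially charged.

```python
def conversion_percent(initial_mg: float, remaining_mg: float) -> float:
    """Conversion (%) = (initial - remaining) / initial x 100."""
    if initial_mg <= 0:
        raise ValueError("initial substrate weight must be positive")
    return (initial_mg - remaining_mg) / initial_mg * 100


def enantiomeric_excess(major: float, minor: float) -> float:
    """Optical purity (% ee) = (A - B) / (A + B) x 100, with A > B."""
    if major < minor:
        raise ValueError("major isomer amount must not be smaller than minor")
    if major + minor <= 0:
        raise ValueError("total isomer amount must be positive")
    return (major - minor) / (major + minor) * 100


if __name__ == "__main__":
    # 5 mg charged; 3.6 mg remaining is inferred from the 1.4 mg converted.
    print(round(conversion_percent(5, 3.6), 1))
    # Isomer amounts taken from the worked example in the text.
    print(round(enantiomeric_excess(99.919, 0.081), 3))
```

Results are rounded only for display; the functions themselves return unrounded floating-point values.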
Example 4
Stereoselective hydrolysis of 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide
After culturing the E. coli RD01 transformant, 10 mL of the protein concentrate obtained by separation and purification was taken and 5 mg of the substrate 2-[4-(6-chloro-2-benzoxazolyloxy)-phenoxy]propionic acid-N-methyl-N-(2,6-difluoro-phenyl)amide was added, giving a protein-to-substrate ratio of 4:1. The mixture was shaken at 30 ℃ for 48 hours to obtain a hydrolysate. After the reaction, the mixture was centrifuged and the supernatant collected; an appropriate amount of the supernatant was injected and analyzed by high performance liquid chromatography, and the conversion (%) was calculated.
The hydrolysate was derivatized with 2-bromobenzophenone at a molar ratio of product to derivatizing agent of 1:1.5. The 2-bromobenzophenone was dissolved in acetonitrile, and the derivatization reaction was carried out at 30 ℃ for 30 minutes. After the reaction, an appropriate amount of the derivatization reaction solution was injected and analyzed for optical purity (% ee).
Similarly, after culturing the E. coli RD02, E. coli RD03, E. coli RD04, E. coli RD05 and E. coli RD06 transformants, the respective protein concentrates obtained by separation and purification were treated as described in Example 4, and the conversion (%) and optical purity (% ee) were calculated for each. The results are given in Table 4 below:
TABLE 4
Microorganism | Conversion (%) | Optical purity (% ee) | Absolute configuration |
E. coli RD01 | 43 | 99.4 | R |
E. coli RD02 | 38 | 99.1 | R |
E. coli RD03 | 57 | 99.8 | R |
E. coli RD04 | 25 | 92.2 | R |
E. coli RD05 | 36 | 30.0 | S |
E. coli RD06 | 15 | 22.0 | S |
Conversion (%) = (initial weight of substrate - weight of substrate remaining after reaction)/initial weight of substrate × 100
Optical purity (% ee) = (A - B)/(A + B) × 100 (A and B represent the amounts of the respective isomers, A > B)
In this example, when the hydrolysis was catalyzed by E. coli RD05 and E. coli RD06, the S configuration predominated in the hydrolysate.
The foregoing shows and describes the general principles and broad features of the present invention and advantages thereof. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.
Sequence listing
<110> Anhui Sailapu pharmaceutical Co., Ltd
<120> process for producing optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide
<130>2020.02.25
<160>12
<170>SIPOSequenceListing 1.0
<210>1
<211>487
<212>PRT
<213> Artificial sequence (Artificial sequence)
<400>1
Met Ser Lys Val Ile Lys Gly Gly Thr Ile Val Thr Ala Asp Arg Gln
1 5 10 15
Trp Gln Ala Asp Val Leu Ile Glu Gly Glu Lys Ile Ala Glu Ile Gly
20 25 30
Glu Asn Leu Arg Gly Asp Glu Val Ile Asp Ala Glu Gly Ala Tyr Val
35 40 45
Ile Pro Gly Gly Ile Asp Pro His Thr His Leu Glu Met Pro Phe Met
50 55 60
Gly Thr Thr Ala Ala Glu Thr Phe Glu Thr Gly Thr Phe Ala Ala Ala
65 70 75 80
Ala Gly Gly Thr Thr Met Leu Val Asp Phe Cys Leu Pro Gly Glu Asp
85 90 95
Gly Ser Leu Leu Ser Ala Ile Asp Ala Trp Asp Ala Lys Ser Lys Asp
100 105 110
Gln Ile Cys Val Asp Ile Ser Tyr His Met Ala Ile Thr Gly Trp Ser
115 120 125
Glu Ser Ile Phe Asn Glu Met Ala Asp Val Val Asn Val Arg Gly Ile
130 135 140
Asn Thr Phe Lys His Phe Met Ala Tyr Lys Gly Ala Leu Met Ile Glu
145 150 155 160
Asp Asp Glu Met Phe Ser Ser Phe Lys Arg Cys Ala Glu Leu Gly Ala
165 170 175
Leu Pro Leu Val His Ala Glu Asn Gly Asp Ile Val Gln Glu Leu Gln
180 185 190
Gln Lys Tyr Met Ala Met Gly Val Thr Gly Pro Glu Gly His Ala Tyr
195 200 205
Ser Arg Pro Pro Glu Val Glu Gly Glu Ala Ala Asn Arg Ala Ile Met
210 215 220
Ile Ala Asp Ala Ala Gly Thr Pro Leu Tyr Ile Val His Val Ser Cys
225 230 235 240
Glu Gln Ala His Glu Ala Ile Arg Arg Ala Arg Gln Lys Gly Met Arg
245 250 255
Val Phe Gly Glu Pro Leu Ile Gln His Leu Thr Leu Asp Glu Ser Glu
260 265 270
Tyr Phe Asn Lys Asp Trp Gln Tyr Ala Ala Arg Arg Val Met Ser Pro
275 280 285
Pro Phe Arg Asn Lys Glu His Gln Asp Gly Leu Trp Ala Gly Leu Ala
290 295 300
Ala Gly Ser Leu Gln Val Val Ala Thr Asp His Ala Ala Phe Thr Asp
305 310 315 320
Glu Gln Lys Arg Met Gly Val Asp Asn Phe Gly Met Ile Pro Asn Gly
325 330 335
Thr Gly Gly Leu Glu Glu Arg Met Ala Met Leu Trp Thr Arg Gly Val
340 345 350
Glu Thr Gly Arg Leu Thr Pro Glu Glu Phe Val Ala Val Thr Ser Ser
355 360 365
Asn Ile Ala Lys Ile Leu Asn Ile Tyr Pro Met Lys Gly Gly Ile Asn
370 375 380
Val Gly Gly Asp Ala Asp Ile Val Val Trp Asp Pro Lys Leu Gly Arg
385 390 395 400
Thr Ile Thr Thr Ala Thr Ala Lys Ser Ile Leu Asp Tyr Asn Val Phe
405 410 415
Glu Gly Met Glu Val Ser Ala Ser Pro Arg Tyr Thr Leu Ser Arg Gly
420 425 430
Asp Val Val Trp Ala Ala Gly Gln Asn Ser Gln Pro Gln Pro Gly Arg
435 440 445
Gly Lys Phe Val Lys Arg Pro Pro Ala Ala Ser Ala Ser Gln Ala Leu
450 455 460
Ser Lys Trp Lys Ala Leu Asn Thr Pro Arg Lys Ile Glu Arg Asp Pro
465 470 475 480
Met Asn Ile Pro Ala Gly Val
485
<210>2
<211>1461
<212>DNA
<213> Artificial sequence (Artificial sequence)
<400>2
atgagcaaag ttattaaggg tggcaccatt gttaccgcgg accgtcagtg gcaggcggat 60
gttctgattg agggcgaaaa gattgcggaa attggcgaga acctgcgtgg cgacgaagtg 120
atcgatgcgg agggtgcgta cgttattccg ggtggcattg acccgcacac ccacctggaa 180
atgccgttta tgggcaccac cgcggcggaa acctttgaaa ccggtacctt tgcggcggcg 240
gcgggtggca ccaccatgct ggttgacttc tgcctgccgg gcgaggatgg cagcctgctg 300
agcgcgattg acgcgtggga tgcgaagagc aaagaccaaa tttgcgtgga tatcagctac 360
cacatggcga ttaccggttg gagcgaaagc atctttaacg agatggcgga tgtggttaac 420
gttcgtggca ttaacacctt caagcacttt atggcgtata aaggtgcgct gatgatcgaa 480
gacgatgaga tgttcagcag ctttaagcgt tgcgcggaac tgggtgcgct gccgctggtg 540
cacgcggaaa acggtgacat cgttcaggag ctgcagcaaa aatatatggc gatgggtgtg 600
accggtccgg agggtcatgc gtatagccgt ccgccggaag tggaaggtga agcggcgaac 660
cgtgcgatca tgattgcgga tgcggcgggt accccgctgt acattgtgca cgttagctgc 720
gagcaagcgc atgaggcgat ccgtcgtgcg cgtcaaaagg gtatgcgtgt gttcggcgaa 780
ccgctgattc aacacctgac cctggacgaa agcgagtact ttaacaagga ttggcagtat 840
gcggcgcgtc gtgttatgag cccgccgttc cgtaacaaag aacaccaaga tggcctgtgg 900
gcgggtctgg cggcgggcag cctgcaagtg gttgcgaccg accacgcggc gttcaccgat 960
gagcagaaac gtatgggtgt ggataacttt ggcatgatcc cgaacggtac cggtggcctg 1020
gaggaacgta tggcgatgct gtggacccgt ggtgttgaaa ccggccgtct gaccccggag 1080
gaattcgtgg cggttaccag cagcaacatc gcgaagattc tgaacatcta tccgatgaaa 1140
ggtggcatta acgtgggtgg cgacgcggat atcgtggttt gggacccgaa gctgggccgt 1200
accattacca ccgcgaccgc gaaaagcatc ctggattaca acgtgtttga gggtatggaa 1260
gtgagcgcga gcccgcgtta taccctgagc cgtggcgacg tggtttgggc ggcgggtcaa 1320
aacagccagc cgcaaccggg tcgtggcaag ttcgttaaac gtccgccggc ggcgagcgcg 1380
agccaggcgc tgagcaagtg gaaggcgctg aacaccccgc gtaagattga acgtgatccg 1440
atgaacattc cggcgggcgt g 1461
<210>3
<211>487
<212>PRT
<213> Artificial sequence (Artificial sequence)
<400>3
Met Ser Lys Val Ile Lys Gly Gly Thr Ile Val Thr Ala Asp Arg Gln
1 5 10 15
Trp Gln Ala Asp Val Leu Ile Glu Gly Glu Lys Ile Ala Glu Ile Gly
20 25 30
Glu Asn Leu Arg Gly Asp Glu Val Ile Asp Ala Glu Gly Ala Tyr Val
35 40 45
Ile Pro Gly Gly Ile Asp Pro His Thr His Leu Glu Met Pro Phe Met
50 55 60
Gly Thr Thr Ala Ala Glu Thr Phe Glu Thr Gly Thr Phe Ala Ala Ala
65 70 75 80
Ala Gly Gly Thr Thr Met Leu Val Asp Phe Cys Leu Pro Gly Glu Asp
85 90 95
Gly Ser Leu Leu Asn Ala Ile Asp Ala Trp Asp Lys Lys Ser Lys Asp
100 105 110
Gln Ile Cys Val Asp Ile Ser Tyr His Met Ala Ile Thr Gly Trp Asn
115 120 125
Glu Asp Ile Phe Asn Glu Met Asp Ala Val Val Asn Gln Arg Gly Ile
130 135 140
Asn Thr Phe Lys His Phe Met Ala Tyr Lys Gly Ala Leu Met Ile Glu
145 150 155 160
Asp Asp Glu Met Phe Ala Ser Phe Lys Arg Cys Ala Glu Leu Gly Ala
165 170 175
Leu Pro Leu Val His Ala Glu Asn Gly Asp Ile Val Gln Glu Leu Gln
180 185 190
Gln Lys Tyr Met Ala Met Gly Val Thr Gly Pro Glu Gly His Ala Tyr
195 200 205
Ser Arg Pro Pro Glu Val Glu Gly Glu Ala Ala Asn Arg Ala Ile Met
210 215 220
Ile Ala Asp Ala Ala Gly Thr Pro Leu Tyr Ile Val His Val Ser Cys
225 230 235 240
Glu Gln Ala His Glu Ala Ile Arg Arg Ala Arg Gln Lys Gly Met Arg
245 250 255
Val Phe Gly Glu Pro Leu Ile Gln His Leu Thr Leu Asp Glu Ser Glu
260 265 270
Tyr Phe Asn Lys Asp Trp Gln Tyr Ala Ala Arg Arg Val Met Ser Pro
275 280 285
Pro Phe Arg Ser Lys Asp His Gln Asp Gly Leu Trp Ala Gly Leu Ala
290 295 300
Ala Gly Ser Leu Gln Val Val Ala Thr Asp His Ala Ala Phe Thr Asp
305 310 315 320
Glu Gln Lys Arg Met Gly Val Gly Asn Phe Gly Met Ile Pro Asn Gly
325 330 335
Thr Gly Gly Leu Glu Glu Arg Met Ala Met Leu Trp Thr Asn Gly Val
340 345 350
Glu Thr Gly Arg Leu Thr Pro Glu Glu Phe Val Ala Val Thr Ser Ser
355 360 365
Asn Ile Ala Lys Ile Leu Asn Ile Tyr Pro Met Lys Gly Gly Ile Asn
370 375 380
Val Gly Gly Asp Ala Asp Val Val Val Trp Asp Pro Lys Leu Gly Arg
385 390 395 400
Thr Ile Thr Thr Ala Thr Ala Lys Ser Ile Leu Asp Tyr Asn Val Phe
405 410 415
Glu Gly Met Glu Val Ser Ala Ser Pro Arg Tyr Thr Leu Ser Arg Gly
420 425 430
Asp Val Val Trp Ala Ala Gly Gln Asn Ser Gln Pro Thr Pro Gly Arg
435 440 445
Gly Arg Phe Val Lys Arg Pro Pro Ala Ala Ser Ala Ser Gln Ala Leu
450 455 460
Ser Lys Trp Lys Ala Leu Asn Thr Pro Arg Lys Ile Glu Arg Asp Pro
465 470 475 480
Met Asn Ile Pro Ala Gly Val
485
<210>4
<211>1461
<212>DNA
<213> Artificial sequence (Artificial sequence)
<400>4
atgagcaagg ttattaaagg tggcaccatc gtgaccgcgg accgtcagtg gcaagcggat 60
gttctgattg agggtgaaaa gatcgcggaa attggcgaga acctgcgtgg cgacgaagtt 120
atcgatgcgg agggtgcgta cgtgattccg ggtggcattg acccgcacac ccacctggaa 180
atgccgttta tgggtaccac cgcggcggaa acctttgaga ccggtacctt tgcggcggcg 240
gcgggtggca ccaccatgct ggttgacttc tgcctgccgg gcgaggatgg cagcctgctg 300
aacgcgatcg acgcgtggga taagaaaagc aaagaccaaa tttgcgtgga tatcagctac 360
cacatggcga ttaccggttg gaacgaagac atctttaacg agatggatgc ggtggttaac 420
cagcgtggta ttaacacctt caagcacttt atggcgtata aaggcgcgct gatgatcgaa 480
gacgatgaga tgttcgcgag ctttaaacgt tgcgcggaac tgggtgcgct gccgctggtt 540
cacgcggaaa acggcgacat tgtgcaagag ctgcagcaaa aatacatggc gatgggtgtt 600
accggtccgg agggtcatgc gtacagccgt ccgccggaag tggaaggcga ggcggcgaac 660
cgtgcgatca tgattgcgga tgcggcgggc accccgctgt acattgttca cgtgagctgc 720
gagcaagcgc atgaggcgat ccgtcgtgcg cgtcaaaagg gtatgcgtgt tttcggcgaa 780
ccgctgatcc aacacctgac cctggacgaa agcgagtact ttaacaagga ttggcagtat 840
gcggcgcgtc gtgtgatgag cccgccgttc cgtagcaaag accaccagga tggtctgtgg 900
gcgggtctgg cggcgggtag cctgcaagtg gttgcgaccg accacgcggc gttcaccgat 960
gaacagaaac gtatgggtgt tggcaacttt ggtatgattc cgaacggcac cggtggcctg 1020
gaggaacgta tggcgatgct gtggaccaac ggtgtggaga ccggccgtct gaccccggag 1080
gaattcgttg cggtgaccag cagcaacatc gcgaagattc tgaacatcta tccgatgaaa 1140
ggtggcatca acgtgggtgg cgacgcggat gtggttgtgt gggacccgaa gctgggtcgt 1200
accattacca ccgcgaccgc gaaaagcatc ctggattaca acgtttttga aggcatggaa 1260
gtgagcgcga gcccgcgtta taccctgagc cgtggtgacg ttgtttgggc ggcgggtcaa 1320
aacagccagc cgaccccggg tcgtggccgt ttcgttaaac gtccgccggc ggcgagcgcg 1380
agccaggcgc tgagcaagtg gaaagcgctg aacaccccgc gtaaaattga gcgtgatccg 1440
atgaacatcc cggcgggtgt g 1461
<210>5
<211>487
<212>PRT
<213> Artificial sequence (Artificial sequence)
<400>5
Met Ser Lys Val Ile Lys Asn Gly Thr Ile Val Thr Ala Asp Arg Gln
1 5 10 15
Trp Lys Ala Asp Val Leu Ile Glu Gly Glu His Ile Ala Glu Ile Gly
20 25 30
Glu Asn Leu Lys Gly Asp Glu Thr Ile Asp Ala Ser Asp Ala Tyr Val
35 40 45
Ile Pro Gly Gly Ile Asp Pro His Thr His Leu Glu Met Pro Phe Met
50 55 60
Gly Thr Thr Ala Ala Glu Thr Phe Glu Ser Gly Thr Phe Ala Ala Val
65 70 75 80
Ala Gly Gly Thr Thr Met Leu Val Asp Phe Cys Leu Pro Gly Glu Asp
85 90 95
Gly Ser Leu Leu Asn Ala Ile Asp Glu Trp Asp Arg Lys Ser Arg Asp
100 105 110
Gln Ile Cys Cys Asp Ile Ser Tyr His Met Ala Ile Thr Gly Trp Ser
115 120 125
Glu Ser Ile Phe Asp Glu Met Glu Ala Val Val Lys Glu Arg Gly Ile
130 135 140
Asn Thr Phe Lys His Phe Met Ala Tyr Lys Gly Ala Leu Met Val Glu
145 150 155 160
Asp Asp Glu Met Phe Ala Ser Phe Lys Arg Cys Ala Glu Leu Gly Ala
165 170 175
Leu Pro Leu Val His Ala Glu Asn Gly Asp Ile Val Ala Glu Leu Gln
180 185 190
Gln Lys Tyr Leu Ala Glu Gly Ile Thr Gly Pro Glu Gly His Ala Tyr
195 200 205
Ser Arg Pro Pro Glu Val Glu Gly Glu Ala Ala Asn Arg Ala Ile Met
210 215 220
Ile Ala Asp Ala Ala Gly Thr Pro Leu Tyr Ile Val His Val Ser Cys
225 230 235 240
Glu Gln Ala His Glu Ala Ile Arg Arg Ala Arg Gln Lys Gly Met Arg
245 250 255
Val Tyr Gly Glu Pro Leu Ile Gln His Leu Thr Leu Asp Glu Ser Glu
260 265 270
Tyr Phe Asp Lys Asp Trp Gln Tyr Ala Ala Arg Arg Val Met Ser Pro
275 280 285
Pro Phe Arg Ser Lys Asp His Gln Asp Gly Leu Trp Asn Gly Leu Ala
290 295 300
Ala Gly Ser Leu Gln Val Val Ala Thr Asp His Ala Ala Phe Thr Asp
305 310 315 320
Glu Gln Lys Arg Met Gly Val Asp Asn Phe Ala Met Ile Pro Asn Gly
325 330 335
Thr Gly Gly Leu Glu Glu Arg Met Gly Met Leu Trp Thr Lys Gly Val
340 345 350
Glu Thr Gly Arg Leu Thr Pro Glu Glu Phe Val Ala Val Thr Ser Thr
355 360 365
Asn Ile Ala Lys Ile Leu Asn Ile Tyr Pro Met Lys Gly Gly Ile Ala
370 375 380
Val Gly Gly His Ala Asp Val Val Val Trp Asp Pro Thr Leu Gly Arg
385 390 395 400
Thr Ile Thr Thr Ala Thr Ala Lys Ser Ile Leu Asp Tyr Asn Val Phe
405 410 415
Glu Gly Ile Glu Val Ser Ala Ser Pro Arg Tyr Thr Leu Ser Arg Gly
420 425 430
Asp Val Val Trp Ala Ala Gly Gln Asn Ser Gln Pro Gln Pro Gly Arg
435 440 445
Gly Lys Phe Val Lys Arg Ser Pro Tyr Ala Ser Ala Ser Lys Ala Leu
450 455 460
Ser Lys Trp Lys Ala Leu Asn Thr Pro Arg Lys Ile Glu Arg Asp Pro
465 470 475 480
Met Asn Ile Pro Ala Gly Val
485
<210>6
<211>1461
<212>DNA
<213> Artificial sequence (Artificial sequence)
<400>6
atgagcaagg ttattaaaaa cggtaccatc gtgaccgcgg accgtcaatg gaaggcggat 60
gttctgattg agggcgaaca catcgcggaa attggcgaga acctgaaagg cgacgaaacc 120
atcgacgcga gcgatgcgta cgtgattccg ggtggcatcg atccgcacac ccacctggag 180
atgccgttca tgggtaccac cgcggcggaa acctttgaaa gcggtacctt tgcggcggtt 240
gcgggtggca ccaccatgct ggttgacttt tgcctgccgg gtgaagatgg cagcctgctg 300
aacgcgattg acgagtggga tcgtaagagc cgtgaccaga tttgctgcga tatcagctac 360
cacatggcga ttaccggttg gagcgaaagc atcttcgacg agatggaagc ggtggttaaa 420
gagcgtggta tcaacacctt caagcacttt atggcgtata aaggcgcgct gatggttgaa 480
gacgatgaga tgttcgcgag ctttaaacgt tgcgcggaac tgggtgcgct gccgctggtt 540
cacgcggaaa acggcgacat tgtggcggag ctgcagcaaa aatatctggc ggaaggtatt 600
accggtccgg agggtcatgc gtacagccgt ccgccggaag tggaaggcga ggcggcgaac 660
cgtgcgatca tgattgcgga tgcggcgggc accccgctgt acattgttca cgtgagctgc 720
gagcaagcgc atgaggcgat ccgtcgtgcg cgtcaaaagg gtatgcgtgt ttatggcgag 780
ccgctgatcc aacacctgac cctggacgaa agcgagtact tcgacaagga ttggcagtat 840
gcggcgcgtc gtgtgatgag cccgccgttt cgtagcaaag accaccaaga tggtctgtgg 900
aacggtctgg cggcgggtag cctgcaagtg gttgcgaccg accacgcggc gttcaccgat 960
gaacagaagc gtatgggtgt tgataacttt gcgatgatcc cgaacggtac cggtggcctg 1020
gaggaacgta tgggcatgct gtggaccaaa ggtgtggaga ccggccgtct gaccccggag 1080
gaatttgttg cggtgaccag caccaacatc gcgaagattc tgaacatcta cccgatgaaa 1140
ggtggcattg cggttggtgg ccacgcggac gtggttgtgt gggatccgac cctgggtcgt 1200
accattacca ccgcgaccgc gaaaagcatc ctggactaca acgtttttga aggtattgaa 1260
gtgagcgcga gcccgcgtta taccctgagc cgtggtgatg ttgtttgggc ggcgggtcaa 1320
aacagccagc cgcaaccggg tcgtggcaag tttgttaaac gtagcccgta tgcgagcgcg 1380
agcaaggcgc tgagcaagtg gaaagcgctg aacaccccgc gtaaaattga acgtgacccg 1440
atgaacatcc cggcgggtgt g 1461
<210>7
<211>487
<212>PRT
<213> Artificial sequence (Artificial sequence)
<400>7
Met Ser Lys Val Ile Lys Gly Gly Thr Ile Val Thr Ala Asp Arg Ser
1 5 10 15
Trp Thr Ala Asp Val Leu Ile Glu Gly Glu Lys Ile Ala Glu Ile Gly
20 25 30
Glu Asn Leu Lys Gly Asp Glu Val Ile Asp Ala Glu Gly Ala Tyr Val
35 40 45
Ile Pro Gly Gly Ile Asp Pro His Thr His Leu Glu Met Pro Phe Met
50 55 60
Gly Thr Thr Ala Ala Glu Thr Phe Glu Ser Gly Thr Phe Ala Ala Ala
65 70 75 80
Ala Gly Gly Thr Thr Met Leu Val Asp Phe Cys Leu Pro Gly Glu Asp
85 90 95
Gly Ser Leu Leu Ser Ala Ile Asp Asp Trp Asp Arg Lys Ser Lys Asp
100 105 110
Gln Ile Cys Cys Asp Ile Ser Tyr His Met Ala Ile Thr Gly Trp Asn
115 120 125
Glu Asn Ile Phe Asn Glu Met Glu Asp Val Val Asn Lys Arg Gly Ile
130 135 140
Asn Thr Phe Lys His Phe Met Ala Tyr Lys Gly Ala Leu Met Val Glu
145 150 155 160
Asp Asp Glu Met Phe Ala Ser Phe Lys Arg Cys Ala Glu Leu Gly Ala
165 170 175
Leu Pro Leu Val His Ala Glu Asn Gly Asp Ile Val Gln Glu Leu Gln
180 185 190
Gln Lys Tyr Met Ala Glu Gly Ile Thr Gly Pro Glu Gly His Ala Tyr
195 200 205
Ser Arg Pro Pro Glu Val Glu Gly Glu Ala Ala Asn Arg Ala Ile Met
210 215 220
Ile Ala Asp Ala Ala Gly Thr Pro Leu Tyr Ile Val His Val Ser Cys
225 230 235 240
Glu Gln Ala His Glu Ala Ile Arg Arg Ala Arg Gln Lys Gly Met Arg
245 250 255
Val Tyr Gly Glu Pro Leu Ile Gln His Leu Thr Leu Asp Glu Ser Glu
260 265 270
Tyr Phe Asn Lys Asp Trp Gln Tyr Ala Ala Arg Arg Val Met Ser Pro
275 280 285
Pro Phe Arg Ser Lys Asp His Gln Ala Ser Leu Trp Ala Gly Leu Ala
290 295 300
Ala Gly Ser Leu Gln Val Val Ala Thr Asp His Ala Ala Phe Thr Asp
305 310 315 320
Lys Gln Lys Gln Met Gly Leu Asp Asn Phe Thr Ser Ile Pro Asn Gly
325 330 335
Thr Gly Gly Leu Glu Glu Arg Met Ala Met Leu Trp Thr Thr Gly Val
340 345 350
Glu Thr Gly Arg Leu Thr Pro Glu Glu Phe Val Ala Ala Thr Ser Thr
355 360 365
Asn Ile Ala Lys Ile Leu Asn Ile Tyr Pro Leu Lys Gly Gly Ile Asn
370 375 380
Val Gly Gly Asp Ala Asp Val Val Val Trp Asp Pro Thr Ile Ser Arg
385 390 395 400
Glu Ile Ala Val Ser Thr Gln Lys Ser Ile Ile Asp Tyr Asn Val Phe
405 410 415
Glu Gly Met Thr Val Thr Ala Gln Pro Arg Tyr Thr Leu Ser Arg Gly
420 425 430
Glu Val Ile Trp Ala Tyr Gly Gln Asn Ser Gln Pro Gln Pro Gly Arg
435 440 445
Gly Lys Phe Val Arg Arg Pro Ala Phe Ala Ser Ala Ser Lys Ala Leu
450 455 460
Ser Arg Trp Lys Ser Leu Asn Thr Pro Arg Lys Ile Glu Arg Asp Pro
465 470 475 480
Leu Asn Ile Pro Ser Gly Val
485
<210>8
<211>1461
<212>DNA
<213> Artificial sequence
<400>8
atgagcaagg ttattaaagg tggcaccatc gtgaccgcgg atcgtagctg gaccgcggac 60
gttctgattg agggcgaaaa gatcgcggaa attggcgaga acctgaaagg tgacgaagtt 120
atcgatgcgg agggtgcgta cgtgattccg ggtggcatcg atccgcacac ccacctggag 180
atgccgttca tgggcaccac cgcggcggaa accttcgaga gcggtacctt tgcggcggcg 240
gcgggtggca ccaccatgct ggttgacttt tgcctgccgg gcgaggatgg tagcctgctg 300
agcgcgattg acgattggga ccgtaagagc aaagatcaga tttgctgcga catcagctac 360
cacatggcga ttaccggttg gaacgaaaac atcttcaacg agatggaaga cgtggttaac 420
aagcgtggca tcaacacctt caagcacttt atggcgtata aaggtgcgct gatggttgaa 480
gacgatgaga tgttcgcgag ctttaaacgt tgcgcggagc tgggtgcgct gccgctggtt 540
cacgcggaaa acggtgatat tgtgcaagag ctgcagcaaa aatacatggc ggaaggtatt 600
accggtccgg agggtcatgc gtacagccgt ccgccggaag tggaaggtga agcggcgaac 660
cgtgcgatca tgattgcgga cgcggcgggt accccgctgt acattgttca cgtgagctgc 720
gagcaagcgc atgaggcgat ccgtcgtgcg cgtcaaaaag gcatgcgtgt ttatggtgaa 780
ccgctgatcc agcacctgac cctggatgaa agcgagtact tcaacaagga ctggcaatat 840
gcggcgcgtc gtgtgatgag cccgccgttt cgtagcaaag atcaccaagc gagcctgtgg 900
gcgggtctgg cggcgggtag cctgcaagtg gttgcgaccg atcacgcggc gttcaccgac 960
aagcagaaac aaatgggcct ggacaacttt accagcattc cgaacggtac cggtggcctg 1020
gaggaacgta tggcgatgct gtggaccacc ggcgttgaga ccggtcgtct gaccccggag 1080
gaattcgtgg cggcgaccag caccaacatc gcgaagattc tgaacatcta cccgctgaaa 1140
ggtggcatca acgttggtgg cgacgcggat gtggttgtgt gggatccgac cattagccgt 1200
gaaatcgctg tgagcaccca gaagagcatc attgactata acgtttttga gggtatgacc 1260
gttaccgcgc agccgcgtta caccctgagc cgtggcgaag ttatctgggc gtatggtcaa 1320
aacagccagc cgcaaccggg tcgtggcaag tttgtgcgtc gtccggcgtt tgcgagcgcg 1380
agcaaggcgc tgagccgttg gaaaagcctg aacaccccgc gtaaaattga acgtgacccg 1440
ctgaacatcc cgagcggtgt t 1461
<210>9
<211>457
<212>PRT
<213> Artificial sequence
<400>9
Met Asp Ile Ile Ile Lys Asn Gly Thr Ile Val Thr Ala Asp Gly Ile
1 5 10 15
Ser Arg Ala Asp Leu Gly Ile Lys Asp Gly Lys Ile Thr Gln Ile Gly
20 25 30
Gly Ala Leu Gly Pro Ala Glu Arg Thr Ile Asp Ala Ala Gly Arg Tyr
35 40 45
Val Phe Pro Gly Gly Ile Asp Val His Thr His Val Glu Thr Val Ser
50 55 60
Phe Asn Thr Gln Ser Ala Asp Thr Phe Ala Thr Ala Thr Val Ala Ala
65 70 75 80
Ala Cys Gly Gly Thr Thr Thr Ile Val Asp Phe Cys Gln Gln Asp Arg
85 90 95
Gly His Ser Leu Ala Glu Ala Val Ala Lys Trp Asp Gly Met Ala Gly
100 105 110
Gly Lys Ser Ala Ile Asp Tyr Gly Tyr His Ile Ile Val Leu Asp Pro
115 120 125
Thr Asp Ser Val Ile Glu Glu Leu Glu Val Leu Pro Asp Leu Gly Ile
130 135 140
Thr Ser Phe Lys Val Phe Met Ala Tyr Arg Gly Met Asn Met Ile Asp
145 150 155 160
Asp Val Thr Leu Leu Lys Thr Leu Asp Lys Ala Val Lys Thr Gly Ser
165 170 175
Leu Val Met Val His Ala Glu Asn Gly Asp Ala Ala Asp Tyr Leu Arg
180 185 190
Asp Lys Phe Val Ala Glu Gly Lys Thr Ala Pro Ile Tyr His Ala Leu
195 200 205
Ser Arg Pro Pro Arg Val Glu Ala Glu Ala Thr Ala Arg Ala Leu Ala
210 215 220
Leu Ala Glu Ile Val Asn Ala Pro Ile Tyr Ile Val His Val Thr Cys
225 230 235 240
Glu Glu Ser Leu Glu Glu Val Met Arg Ala Lys Ser Arg Gly Val Arg
245 250 255
Ala Leu Ala Glu Thr Cys Thr His Tyr Leu Tyr Leu Thr Lys Glu Asp
260 265 270
Leu Glu Arg Pro Asp Phe Glu Gly Ala Lys Tyr Val Phe Thr Pro Pro
275 280 285
Ala Arg Ala Lys Lys Asp His Asp Val Leu Trp Asn Ala Leu Arg Asn
290 295 300
Gly Val Phe Glu Thr Val Ser Ser Asp His Cys Ser Trp Leu Phe Lys
305 310 315 320
Gly His Lys Asp Arg Gly Arg Asn Asp Phe Arg Ala Ile Pro Asn Gly
325 330 335
Ala Pro Gly Val Glu Glu Arg Leu Met Met Val Tyr Gln Gly Val Asn
340 345 350
Glu Gly Arg Ile Ser Leu Thr Gln Phe Val Glu Leu Val Ala Thr Arg
355 360 365
Pro Ala Lys Val Phe Gly Met Phe Pro Gln Lys Gly Thr Ile Ala Val
370 375 380
Gly Ser Asp Ala Asp Ile Val Leu Trp Asp Pro Glu Ala Glu Met Val
385 390 395 400
Ile Glu Gln Thr Ala Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu
405 410 415
Gly His Lys Val Lys Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys
420 425 430
Val Ile Val Asp Glu Gly Ser Tyr Val Gly Glu Pro Thr Asp Gly Lys
435 440 445
Phe Leu Lys Arg Arg Lys Tyr Lys Gln
450 455
<210>10
<211>1374
<212>DNA
<213> Artificial sequence
<400>10
atggacatca ttatcaagaa cggtaccatt gtgaccgcgg atggtatcag ccgtgcggac 60
ctgggtatca aggatggcaa aattacccaa atcggtggcg cgctgggtcc ggcggagcgt 120
accattgatg cggcgggccg ttacgtgttc ccgggtggca tcgatgttca cacccacgtg 180
gaaaccgtta gcttcaacac ccaaagcgcg gacacctttg cgaccgcgac cgtggcggcg 240
gcgtgcggtg gcaccaccac cattgttgac ttttgccagc aagatcgtgg tcatagcctg 300
gcggaagcgg tggcgaagtg ggatggtatg gcgggtggca aaagcgcgat cgattacggc 360
tatcacatta tcgttctgga cccgaccgat agcgtgattg aggaactgga agttctgccg 420
gatctgggta tcaccagctt caaggtgttt atggcgtatc gtggcatgaa catgatcgac 480
gatgtgaccc tgctgaaaac cctggacaag gcggttaaaa ccggtagcct ggtgatggtt 540
catgcggaga acggtgatgc ggcggattac ctgcgtgata agttcgttgc ggaaggtaaa 600
accgcgccga tttatcacgc gctgagccgt ccgccgcgtg tggaggcgga agcgaccgcg 660
cgtgcgctgg cgctggcgga gatcgttaac gcgccgattt acatcgtgca cgttacctgc 720
gaggaaagcc tggaggaagt gatgcgtgcg aaaagccgtg gcgttcgtgc gctggcggag 780
acctgcaccc actacctgta tctgaccaaa gaggacctgg aacgtccgga tttcgaaggt 840
gcgaaatatg tgtttacccc gccggcgcgt gcgaagaaag accacgatgt tctgtggaac 900
gcgctgcgta acggcgtgtt cgaaaccgtt agcagcgacc actgcagctg gctgttcaag 960
ggtcacaaag accgtggccg taacgatttt cgtgcgattc cgaacggtgc gccgggcgtt 1020
gaggaacgtc tgatgatggt gtaccagggt gttaacgagg gccgtatcag cctgacccag 1080
tttgtggaac tggttgcgac ccgtccggcg aaggtgttcg gcatgtttcc gcagaaaggt 1140
accattgcgg tgggcagcga cgcggatatc gttctgtggg acccggaggc ggaaatggtt 1200
attgagcaaa ccgcgatgca caacgcgatg gattacagca gctatgaagg tcacaaggtg 1260
aaaggcgttc cgaagaccgt gctgctgcgt ggtaaagtga tcgttgacga gggtagctac 1320
gttggcgaac cgaccgatgg caagtttctg aaacgtcgta agtataaaca gtga 1374
<210>11
<211>458
<212>PRT
<213> Artificial sequence
<400>11
Met Phe Asp Val Ile Val Lys Asn Cys Arg Leu Val Ser Ser Asp Gly
1 5 10 15
Ile Thr Glu Ala Asp Ile Leu Val Lys Asp Gly Lys Val Ala Ala Ile
20 25 30
Ser Ala Asp Thr Ser Asp Val Glu Ala Ser Arg Thr Ile Asp Ala Gly
35 40 45
Gly Lys Phe Val Met Pro Gly Val Val Asp Glu His Val His Ile Ile
50 55 60
Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu
65 70 75 80
Ser Ala Ala Val Gly Gly Ile Thr Thr Ile Ile Glu Met Pro Ile Thr
85 90 95
Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gln
100 105 110
Ala Gly Gln Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val
115 120 125
Pro Gly Asn Leu Pro Glu Ile Arg Lys Met His Asp Ala Gly Ala Val
130 135 140
Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala
145 150 155 160
Val Ser Asp Gly Glu Leu Phe Glu Ile Phe Gln Glu Ile Ala Ala Cys
165 170 175
Gly Ser Val Ile Val Val His Ala Glu Asn Glu Thr Ile Ile Gln Ala
180 185 190
Leu Gln Lys Gln Ile Lys Ala Ala Gly Gly Lys Asp Met Ala Ala Tyr
195 200 205
Glu Ala Ser Gln Pro Val Phe Gln Glu Asn Glu Ala Ile Gln Arg Ala
210 215 220
Leu Leu Leu Gln Lys Glu Ala Gly Cys Arg Leu Ile Val Leu His Val
225 230 235 240
Ser Asn Pro Asp Gly Val Glu Leu Ile His Gln Ala Gln Ser Glu Gly
245 250 255
Gln Asp Val His Cys Glu Ser Gly Pro Gln Tyr Leu Asn Ile Thr Thr
260 265 270
Asp Asp Ala Glu Arg Ile Gly Pro Tyr Met Lys Val Ala Pro Pro Val
275 280 285
Arg Ser Ala Glu Met Asn Ile Arg Leu Trp Glu Gln Leu Glu Asn Gly
290 295 300
Leu Ile Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp
305 310 315 320
Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu
325 330 335
Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys
340 345 350
Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro
355 360 365
Ala Lys Leu Phe Gly Ile Tyr Pro Gln Lys Gly Thr Leu Gln Val Gly
370 375 380
Ser Asp Ala Asp Leu Leu Ile Leu Asp Leu Asp Ile Asp Thr Lys Val
385 390 395 400
Asp Ala Ser Gln Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly
405 410 415
Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val
420 425 430
Val Ala Glu Lys Gly Glu Val Leu Val Glu Gln Gly Phe Gly Gln Phe
435 440 445
Val Thr Arg Arg Asn Tyr Glu Ala Ser Lys
450 455
<210>12
<211>1377
<212>DNA
<213> Artificial sequence
<400>12
atgttcgatg tgatcgttaa aaactgccgt ctggttagca gcgacggcat caccgaagcg 60
gatattctgg tgaaagacgg caaggttgcg gcgatcagcg cggataccag cgacgttgaa 120
gcgagccgta ccattgatgc gggtggcaaa ttcgtgatgc cgggcgtggt tgacgagcac 180
gttcacatca ttgacatgga tctgaagaac cgttacggtc gttttgagct ggatagcgaa 240
agcgcggcgg ttggtggcat caccaccatc attgaaatgc cgattacctt cccgccgacc 300
accaccctgg atgcgtttct ggagaagaaa aagcaggcgg gccaacgtct gaaggtggac 360
ttcgcgctgt atggtggcgg tgttccgggt aacctgccgg agatccgtaa aatgcacgac 420
gcgggcgcgg tgggtttcaa gagcatgatg gcggcgagcg tgccgggcat gtttgatgcg 480
gttagcgacg gcgagctgtt cgaaatcttt caggaaattg cggcgtgcgg tagcgttatc 540
gtggttcacg cggagaacga aaccatcatt caagcgctgc agaaacaaat taaggcggcg 600
ggcggtaaag atatggcggc gtatgaggcg agccagccgg tgtttcaaga gaacgaagcg 660
attcagcgtg cgctgctgct gcaaaaggaa gcgggctgcc gtctgatcgt gctgcacgtt 720
agcaacccgg atggtgtgga gctgattcac caggcgcaaa gcgaaggcca ggacgttcac 780
tgcgagagcg gtccgcaata cctgaacatc accaccgacg atgcggaacg tattggtccg 840
tatatgaaag tggctccgcc ggttcgtagc gcggagatga acatccgtct gtgggagcag 900
ctggaaaacg gcctgattga taccctgggt agcgaccatg gcggtcaccc ggtggaggat 960
aaggaaccgg gctggaaaga cgtttggaaa gcgggtaacg gtgcgctggg tctggaaacc 1020
agcctgccga tgatgctgac caacggcgtg aacaaaggtc gtctgagcct ggagcgtctg 1080
gtggaagtta tgtgcgagaa accggcgaag ctgtttggta tctacccgca gaagggcacc 1140
ctgcaagtgg gtagcgacgc ggatctgctg atcctggacc tggatattga caccaaagtt 1200
gatgcgagcc agttccgtag cctgcacaag tatagcccgt ttgatggtat gccggtgacc 1260
ggtgcgccgg tgctgaccat ggttcgtggc accgtggttg cggagaaagg tgaagtgctg 1320
gttgaacagg gcttcggtca atttgttacc cgtcgtaact atgaggcgag caagtga 1377
Claims (4)
1. A process for the preparation of optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide, characterized in that: a polypeptide having the sequence shown in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 is reacted with an enantiomeric mixture of 2- [4- (6-chloro-2-benzoxazolyloxy) -phenoxy ] propionic acid-N-methyl-N- (2, 6-difluoro-phenyl) amide to produce (R) -2- [4- (6-chloro-2-benzoxazolyloxy) -phenoxy ] propionic acid-N- (2-fluorophenyl) -N-methylamide.
2. The process for the preparation of optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide according to claim 1, characterized in that: the ratio of the above-mentioned polypeptide or transformant to the enantiomeric mixture of 2- [4- (6-chloro-2-benzoxazolyloxy) -phenoxy ] propionic acid-N-methyl-N- (2, 6-difluoro-phenyl) amide is 2-5:1.
3. The process for the preparation of optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide according to claim 1 or 2, characterized in that: the transformant is obtained by introducing a recombinant vector into a host cell, wherein the recombinant vector contains the nucleotide sequence shown in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8, or a DNA molecule encoding a polypeptide according to any one of the preceding claims.
4. An optically active (R) -2- [4- (6-chloro-2-benzoxazolyloxy) -phenoxy ] propionic acid-N- (2-fluorophenyl) -N-methylamide, characterized in that: it is prepared by any one of the above preparation processes.
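The claims pair each polypeptide (SEQ ID NO: 1, 3, 5, 7) with a nucleotide sequence (SEQ ID NO: 2, 4, 6, 8). As a hedged illustration, the sketch below checks that pairing for one case by translating the first 60 nucleotides of SEQ ID NO: 8 and comparing the result with the first 20 residues of SEQ ID NO: 7, both copied from the sequence listing. It assumes the standard genetic code; the function and variable names are hypothetical and are not part of the patent.

```python
from itertools import product

# Standard genetic code (assumption), codons ordered with bases t, c, a, g.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Three-letter to one-letter residue codes, as used in the <212>PRT entries.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

def translate(dna: str) -> str:
    """Translate DNA in reading frame 1; spaces and position numbers are ignored."""
    seq = "".join(ch for ch in dna.lower() if ch in BASES)
    usable = len(seq) - len(seq) % 3  # drop an incomplete trailing codon, if any
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def to_one_letter(protein: str) -> str:
    """Convert a space-separated three-letter residue string to one-letter codes."""
    return "".join(THREE_TO_ONE[res] for res in protein.split())

# First line of SEQ ID NO: 8, including its trailing position number.
SEQ8_START = "atgagcaagg ttattaaagg tggcaccatc gtgaccgcgg atcgtagctg gaccgcggac 60"
# First 20 residues of SEQ ID NO: 7.
SEQ7_START = ("Met Ser Lys Val Ile Lys Gly Gly Thr Ile Val Thr Ala Asp Arg Ser "
              "Trp Thr Ala Asp")

assert translate(SEQ8_START) == to_one_letter(SEQ7_START)

# Length consistency from the <211> fields: 3 nucleotides per residue,
# plus 3 for a terminal stop codon where the DNA listing ends in "tga".
assert 1461 == 3 * 487          # SEQ ID NO: 8 vs. SEQ ID NO: 7 (no stop codon listed)
assert 1374 == 3 * 457 + 3      # SEQ ID NO: 10 vs. SEQ ID NO: 9
assert 1377 == 3 * 458 + 3      # SEQ ID NO: 12 vs. SEQ ID NO: 11
```

The same check can be extended to full-length sequences by pasting the complete listings into the two strings. Only the opening segment is used here to keep the sketch short.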
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010115296.7A CN111269906A (en) | 2020-02-25 | 2020-02-25 | Process for the preparation of optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010115296.7A CN111269906A (en) | 2020-02-25 | 2020-02-25 | Process for the preparation of optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111269906A (en) | 2020-06-12 |
Family
ID=70995273
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010115296.7A Withdrawn CN111269906A (en) | 2020-02-25 | 2020-02-25 | Process for the preparation of optically active (R) -phenoxypropionic acid-N-methyl-N-2-fluorophenylamide |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111269906A (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | Application publication date: 20200612 |