CN111269312A - Heterologous fusion protein - Google Patents
Heterologous fusion protein Download PDFInfo
- Publication number
- CN111269312A CN111269312A CN201811471370.8A CN201811471370A CN111269312A CN 111269312 A CN111269312 A CN 111269312A CN 201811471370 A CN201811471370 A CN 201811471370A CN 111269312 A CN111269312 A CN 111269312A
- Authority
- CN
- China
- Prior art keywords
- gly
- ser
- lys
- glu
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 73
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 73
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 claims abstract description 74
- 208000008589 Obesity Diseases 0.000 claims abstract description 4
- 235000020824 obesity Nutrition 0.000 claims abstract description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 166
- 238000000034 method Methods 0.000 claims description 34
- 108010011459 Exenatide Proteins 0.000 claims description 25
- 229960001519 exenatide Drugs 0.000 claims description 20
- HTQBXNHDCUEHJF-XWLPCZSASA-N Exenatide Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)NCC(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 HTQBXNHDCUEHJF-XWLPCZSASA-N 0.000 claims description 18
- 239000003814 drug Substances 0.000 claims description 14
- 150000001413 amino acids Chemical group 0.000 claims description 13
- 108060003951 Immunoglobulin Proteins 0.000 claims description 12
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 claims description 12
- 102000018358 immunoglobulin Human genes 0.000 claims description 12
- 238000004519 manufacturing process Methods 0.000 claims description 12
- 210000004899 c-terminal region Anatomy 0.000 claims description 11
- 230000001225 therapeutic effect Effects 0.000 claims description 10
- YSDQQAXHVYUZIW-QCIJIYAXSA-N Liraglutide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCNC(=O)CC[C@H](NC(=O)CCCCCCCCCCCCCCC)C(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 YSDQQAXHVYUZIW-QCIJIYAXSA-N 0.000 claims description 6
- 108010019598 Liraglutide Proteins 0.000 claims description 5
- 108010005794 dulaglutide Proteins 0.000 claims description 5
- 229960005175 dulaglutide Drugs 0.000 claims description 5
- 229960002701 liraglutide Drugs 0.000 claims description 5
- 208000001072 type 2 diabetes mellitus Diseases 0.000 claims description 5
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 claims description 4
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 claims description 4
- 230000001939 inductive effect Effects 0.000 claims description 4
- 206010033307 Overweight Diseases 0.000 claims description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims description 2
- 230000004580 weight loss Effects 0.000 claims description 2
- LTXREWYXXSTFRX-QGZVFWFLSA-N Linagliptin Chemical compound N=1C=2N(C)C(=O)N(CC=3N=C4C=CC=CC4=C(C)N=3)C(=O)C=2N(CC#CC)C=1N1CCC[C@@H](N)C1 LTXREWYXXSTFRX-QGZVFWFLSA-N 0.000 claims 1
- 229960002397 linagliptin Drugs 0.000 claims 1
- 101710198884 GATA-type zinc finger protein 1 Proteins 0.000 abstract description 20
- 230000000694 effects Effects 0.000 abstract description 19
- 206010012601 diabetes mellitus Diseases 0.000 abstract description 14
- 238000001727 in vivo Methods 0.000 abstract description 12
- 230000004927 fusion Effects 0.000 abstract description 9
- 238000000338 in vitro Methods 0.000 abstract description 5
- 230000005847 immunogenicity Effects 0.000 abstract description 4
- 238000002474 experimental method Methods 0.000 abstract description 3
- 102100025101 GATA-type zinc finger protein 1 Human genes 0.000 abstract 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 106
- 229920001184 polypeptide Polymers 0.000 description 103
- 229920005989 resin Polymers 0.000 description 103
- 239000011347 resin Substances 0.000 description 103
- 239000000243 solution Substances 0.000 description 84
- 108090000623 proteins and genes Proteins 0.000 description 59
- 210000004027 cell Anatomy 0.000 description 45
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 43
- 238000002360 preparation method Methods 0.000 description 43
- 239000000047 product Substances 0.000 description 38
- 125000006239 protecting group Chemical group 0.000 description 38
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 36
- 102000004169 proteins and genes Human genes 0.000 description 36
- 238000006243 chemical reaction Methods 0.000 description 35
- 235000018102 proteins Nutrition 0.000 description 35
- DTQVDTLACAAQTR-UHFFFAOYSA-N trifluoroacetic acid Substances OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 33
- 210000004369 blood Anatomy 0.000 description 32
- 239000008280 blood Substances 0.000 description 32
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 30
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 29
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 28
- 229920003180 amino resin Polymers 0.000 description 28
- 239000012071 phase Substances 0.000 description 28
- 230000002829 reductive effect Effects 0.000 description 28
- 238000000746 purification Methods 0.000 description 27
- -1 Ideglira Proteins 0.000 description 26
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 25
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 24
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 24
- UMRUUWFGLGNQLI-QFIPXVFZSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-6-[(2-methylpropan-2-yl)oxycarbonylamino]hexanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CCCCNC(=O)OC(C)(C)C)C(O)=O)C3=CC=CC=C3C2=C1 UMRUUWFGLGNQLI-QFIPXVFZSA-N 0.000 description 23
- QEPWHIXHJNNGLU-KRWDZBQOSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)pentanedioic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CCC(=O)O)C(O)=O)C3=CC=CC=C3C2=C1 QEPWHIXHJNNGLU-KRWDZBQOSA-N 0.000 description 23
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 23
- 239000012528 membrane Substances 0.000 description 23
- 238000001914 filtration Methods 0.000 description 22
- 238000005406 washing Methods 0.000 description 22
- QWXZOFZKSQXPDC-NSHDSACASA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)propanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](C)C(O)=O)C3=CC=CC=C3C2=C1 QWXZOFZKSQXPDC-NSHDSACASA-N 0.000 description 21
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 21
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 21
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 21
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 21
- 108010047857 aspartylglycine Proteins 0.000 description 21
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 20
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 20
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 20
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 20
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 20
- 102100040918 Pro-glucagon Human genes 0.000 description 19
- NDKDFTQNXLHCGO-UHFFFAOYSA-N 2-(9h-fluoren-9-ylmethoxycarbonylamino)acetic acid Chemical compound C1=CC=C2C(COC(=O)NCC(=O)O)C3=CC=CC=C3C2=C1 NDKDFTQNXLHCGO-UHFFFAOYSA-N 0.000 description 18
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 18
- 239000012043 crude product Substances 0.000 description 18
- 108010077112 prolyl-proline Proteins 0.000 description 18
- 235000000346 sugar Nutrition 0.000 description 18
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 16
- 241000699670 Mus sp. Species 0.000 description 16
- 238000010828 elution Methods 0.000 description 16
- 239000000706 filtrate Substances 0.000 description 16
- YDJXDYKQMRNUSA-UHFFFAOYSA-N tri(propan-2-yl)silane Chemical compound CC(C)[SiH](C(C)C)C(C)C YDJXDYKQMRNUSA-UHFFFAOYSA-N 0.000 description 16
- 239000013598 vector Substances 0.000 description 16
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 15
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 15
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 15
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 15
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 15
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 15
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 15
- 150000007523 nucleic acids Chemical group 0.000 description 15
- 229920001223 polyethylene glycol Polymers 0.000 description 15
- 108010031719 prolyl-serine Proteins 0.000 description 15
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 15
- 108010052774 valyl-lysyl-glycyl-phenylalanyl-tyrosine Proteins 0.000 description 15
- CBPJQFCAFFNICX-IBGZPJMESA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-4-methylpentanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CC(C)C)C(O)=O)C3=CC=CC=C3C2=C1 CBPJQFCAFFNICX-IBGZPJMESA-N 0.000 description 14
- 238000004128 high performance liquid chromatography Methods 0.000 description 14
- CMWYAOXYQATXSI-UHFFFAOYSA-N n,n-dimethylformamide;piperidine Chemical compound CN(C)C=O.C1CCNCC1 CMWYAOXYQATXSI-UHFFFAOYSA-N 0.000 description 14
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 13
- IVOMOUWHDPKRLL-UHFFFAOYSA-N UNPD107823 Natural products O1C2COP(O)(=O)OC2C(O)C1N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-UHFFFAOYSA-N 0.000 description 13
- 239000000872 buffer Substances 0.000 description 13
- 229940095074 cyclic amp Drugs 0.000 description 13
- 239000008103 glucose Substances 0.000 description 13
- 239000011259 mixed solution Substances 0.000 description 13
- 239000000523 sample Substances 0.000 description 13
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 12
- IVOMOUWHDPKRLL-KQYNXXCUSA-N Cyclic adenosine monophosphate Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-KQYNXXCUSA-N 0.000 description 12
- 108020004414 DNA Proteins 0.000 description 12
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 12
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 12
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 12
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 12
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 12
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 12
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 12
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 12
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 12
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 12
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 12
- 235000001014 amino acid Nutrition 0.000 description 12
- 238000010367 cloning Methods 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 238000005086 pumping Methods 0.000 description 12
- OTKXCALUHMPIGM-FQEVSTJZSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-5-[(2-methylpropan-2-yl)oxy]-5-oxopentanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CCC(=O)OC(C)(C)C)C(O)=O)C3=CC=CC=C3C2=C1 OTKXCALUHMPIGM-FQEVSTJZSA-N 0.000 description 11
- 241000282414 Homo sapiens Species 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 11
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 11
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 239000011324 bead Substances 0.000 description 11
- 238000009833 condensation Methods 0.000 description 11
- 230000005494 condensation Effects 0.000 description 11
- 108010060199 cysteinylproline Proteins 0.000 description 11
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 10
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 10
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 10
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 10
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 10
- 238000003016 alphascreen Methods 0.000 description 10
- 238000006482 condensation reaction Methods 0.000 description 10
- 239000012065 filter cake Substances 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- 238000006467 substitution reaction Methods 0.000 description 10
- UGNIYGNGCNXHTR-SFHVURJKSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-methylbutanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](C(C)C)C(O)=O)C3=CC=CC=C3C2=C1 UGNIYGNGCNXHTR-SFHVURJKSA-N 0.000 description 9
- SJVFAHZPLIXNDH-QFIPXVFZSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-phenylpropanoic acid Chemical compound C([C@@H](C(=O)O)NC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21)C1=CC=CC=C1 SJVFAHZPLIXNDH-QFIPXVFZSA-N 0.000 description 9
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 9
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 9
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 9
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 9
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 9
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 9
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 9
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 9
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 9
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 9
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 9
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 9
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 9
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 9
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 9
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 9
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 9
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 9
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 9
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 9
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 9
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 9
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 9
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 9
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 9
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 9
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 9
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 9
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 9
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 9
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 9
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 9
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 9
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 9
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 9
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 9
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 9
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 9
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 9
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 9
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 9
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 9
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 9
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 9
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 9
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 9
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 9
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 9
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 9
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 9
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 9
- 238000010511 deprotection reaction Methods 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 108010049041 glutamylalanine Proteins 0.000 description 9
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 9
- 108010010147 glycylglutamine Proteins 0.000 description 9
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 9
- 108010091871 leucylmethionine Proteins 0.000 description 9
- 108010064235 lysylglycine Proteins 0.000 description 9
- 108010073101 phenylalanylleucine Proteins 0.000 description 9
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 9
- 108010070643 prolylglutamic acid Proteins 0.000 description 9
- WDGICUODAOGOMO-DHUJRADRSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-5-oxo-5-(tritylamino)pentanoic acid Chemical compound C([C@@H](C(=O)O)NC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21)CC(=O)NC(C=1C=CC=CC=1)(C=1C=CC=CC=1)C1=CC=CC=C1 WDGICUODAOGOMO-DHUJRADRSA-N 0.000 description 8
- JDDWRLPTKIOUOF-UHFFFAOYSA-N 9h-fluoren-9-ylmethyl n-[[4-[2-[bis(4-methylphenyl)methylamino]-2-oxoethoxy]phenyl]-(2,4-dimethoxyphenyl)methyl]carbamate Chemical compound COC1=CC(OC)=CC=C1C(C=1C=CC(OCC(=O)NC(C=2C=CC(C)=CC=2)C=2C=CC(C)=CC=2)=CC=1)NC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21 JDDWRLPTKIOUOF-UHFFFAOYSA-N 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 8
- 238000005859 coupling reaction Methods 0.000 description 8
- 238000011033 desalting Methods 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 229940079593 drug Drugs 0.000 description 8
- 239000003480 eluent Substances 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 239000007791 liquid phase Substances 0.000 description 8
- 238000002156 mixing Methods 0.000 description 8
- 239000013014 purified material Substances 0.000 description 8
- 238000010532 solid phase synthesis reaction Methods 0.000 description 8
- KLBPUVPNPAJWHZ-UMSFTDKQSA-N (2r)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-tritylsulfanylpropanoic acid Chemical compound C([C@@H](C(=O)O)NC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21)SC(C=1C=CC=CC=1)(C=1C=CC=CC=1)C1=CC=CC=C1 KLBPUVPNPAJWHZ-UMSFTDKQSA-N 0.000 description 7
- QXVFEIPAZSXRGM-DJJJIMSYSA-N (2s,3s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-methylpentanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C3=CC=CC=C3C2=C1 QXVFEIPAZSXRGM-DJJJIMSYSA-N 0.000 description 7
- 239000002202 Polyethylene glycol Substances 0.000 description 7
- 230000005587 bubbling Effects 0.000 description 7
- 239000012295 chemical reaction liquid Substances 0.000 description 7
- 230000008878 coupling Effects 0.000 description 7
- 238000010168 coupling process Methods 0.000 description 7
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 7
- 239000007924 injection Substances 0.000 description 7
- 238000002347 injection Methods 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 108020004999 messenger RNA Proteins 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 239000002244 precipitate Substances 0.000 description 7
- 238000002953 preparative HPLC Methods 0.000 description 7
- 239000007790 solid phase Substances 0.000 description 7
- 238000003756 stirring Methods 0.000 description 7
- 230000002194 synthesizing effect Effects 0.000 description 7
- 238000001291 vacuum drying Methods 0.000 description 7
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 6
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 6
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 6
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 6
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 6
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 6
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 6
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 6
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 6
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 6
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 6
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 6
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 6
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 6
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 6
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 6
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 6
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 6
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 6
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 6
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 6
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 6
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 6
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 6
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 6
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 6
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 6
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 6
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 6
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 6
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 6
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 6
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 6
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 6
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 6
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 6
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 6
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 6
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 6
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 6
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 6
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 6
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 6
- PBCJIPOGFJYBJE-UHFFFAOYSA-N acetonitrile;hydrate Chemical compound O.CC#N PBCJIPOGFJYBJE-UHFFFAOYSA-N 0.000 description 6
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 6
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 6
- 230000004071 biological effect Effects 0.000 description 6
- 238000001816 cooling Methods 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 238000000227 grinding Methods 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 102000039446 nucleic acids Human genes 0.000 description 6
- 230000001603 reducing effect Effects 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 108010044292 tryptophyltyrosine Proteins 0.000 description 6
- 108010051110 tyrosyl-lysine Proteins 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- SWZCTMTWRHEBIN-QFIPXVFZSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@@H](C(=O)O)NC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21)C1=CC=C(O)C=C1 SWZCTMTWRHEBIN-QFIPXVFZSA-N 0.000 description 5
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 101800004266 Glucagon-like peptide 1(7-37) Proteins 0.000 description 5
- 102100032742 Histone-lysine N-methyltransferase SETD2 Human genes 0.000 description 5
- 108050003304 Huntingtin-interacting protein 1 Proteins 0.000 description 5
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 5
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 5
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 5
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 5
- 239000013599 cloning vector Substances 0.000 description 5
- 239000000539 dimer Substances 0.000 description 5
- 108010053037 kyotorphin Proteins 0.000 description 5
- 238000011068 loading method Methods 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 238000005070 sampling Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 102100022974 AP-2 complex subunit alpha-2 Human genes 0.000 description 4
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 4
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 4
- 108010088406 Glucagon-Like Peptides Proteins 0.000 description 4
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 4
- 102000017011 Glycated Hemoglobin A Human genes 0.000 description 4
- 101000757394 Homo sapiens AP-2 complex subunit alpha-2 Proteins 0.000 description 4
- 101001015516 Homo sapiens Glucagon-like peptide 1 receptor Proteins 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- 241000235648 Pichia Species 0.000 description 4
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- 125000003275 alpha amino acid group Chemical group 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- JUFFVKRROAPVBI-PVOYSMBESA-N chembl1210015 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N[C@H]1[C@@H]([C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO[C@]3(O[C@@H](C[C@H](O)[C@H](O)CO)[C@H](NC(C)=O)[C@@H](O)C3)C(O)=O)O2)O)[C@@H](CO)O1)NC(C)=O)C(=O)NCC(=O)NCC(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 JUFFVKRROAPVBI-PVOYSMBESA-N 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000003937 drug carrier Substances 0.000 description 4
- 238000001035 drying Methods 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 230000002473 insulinotropic effect Effects 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- ZPGDWQNBZYOZTI-SFHVURJKSA-N (2s)-1-(9h-fluoren-9-ylmethoxycarbonyl)pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21 ZPGDWQNBZYOZTI-SFHVURJKSA-N 0.000 description 3
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 3
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 3
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 3
- 108010088751 Albumins Proteins 0.000 description 3
- 102000009027 Albumins Human genes 0.000 description 3
- 101100337060 Caenorhabditis elegans glp-1 gene Proteins 0.000 description 3
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 3
- 108010067722 Dipeptidyl Peptidase 4 Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 3
- 102400000324 Glucagon-like peptide 1(7-37) Human genes 0.000 description 3
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 3
- 239000007995 HEPES buffer Substances 0.000 description 3
- 101000788682 Homo sapiens GATA-type zinc finger protein 1 Proteins 0.000 description 3
- 101000872458 Homo sapiens Huntingtin-interacting protein 1-related protein Proteins 0.000 description 3
- 101000992283 Homo sapiens Optineurin Proteins 0.000 description 3
- 101000818376 Homo sapiens Palmitoyltransferase ZDHHC17 Proteins 0.000 description 3
- 101000574013 Homo sapiens Pre-mRNA-processing factor 40 homolog A Proteins 0.000 description 3
- 102100034773 Huntingtin-interacting protein 1-related protein Human genes 0.000 description 3
- 241000235649 Kluyveromyces Species 0.000 description 3
- 244000285963 Kluyveromyces fragilis Species 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 3
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 3
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 102100031822 Optineurin Human genes 0.000 description 3
- 102100021061 Palmitoyltransferase ZDHHC17 Human genes 0.000 description 3
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 3
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 3
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 3
- 102100025822 Pre-mRNA-processing factor 40 homolog A Human genes 0.000 description 3
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 3
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 210000001744 T-lymphocyte Anatomy 0.000 description 3
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 3
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 3
- 102100020696 Ubiquitin-conjugating enzyme E2 K Human genes 0.000 description 3
- 101710192920 Ubiquitin-conjugating enzyme E2 K Proteins 0.000 description 3
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 3
- PQLVXDKIJBQVDF-UHFFFAOYSA-N acetic acid;hydrate Chemical compound O.CC(O)=O PQLVXDKIJBQVDF-UHFFFAOYSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 125000003277 amino group Chemical group 0.000 description 3
- 210000000612 antigen-presenting cell Anatomy 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 150000001875 compounds Chemical group 0.000 description 3
- 238000005336 cracking Methods 0.000 description 3
- 108020001096 dihydrofolate reductase Proteins 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 108091005995 glycated hemoglobin Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 239000008194 pharmaceutical composition Substances 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000004007 reversed phase HPLC Methods 0.000 description 3
- 229920006395 saturated elastomer Polymers 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 2
- BDNKZNFMNDZQMI-UHFFFAOYSA-N 1,3-diisopropylcarbodiimide Chemical compound CC(C)N=C=NC(C)C BDNKZNFMNDZQMI-UHFFFAOYSA-N 0.000 description 2
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical group C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- 238000003276 AlphaScreen cAMP assay kit Methods 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241000351920 Aspergillus nidulans Species 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000699802 Cricetulus griseus Species 0.000 description 2
- 229920002785 Croscarmellose sodium Polymers 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 108090000194 Dipeptidyl-peptidases and tripeptidyl-peptidases Proteins 0.000 description 2
- 102000003779 Dipeptidyl-peptidases and tripeptidyl-peptidases Human genes 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- 108010086246 Glucagon-Like Peptide-1 Receptor Proteins 0.000 description 2
- 102100032882 Glucagon-like peptide 1 receptor Human genes 0.000 description 2
- 102000005731 Glucose-6-phosphate isomerase Human genes 0.000 description 2
- 108010070600 Glucose-6-phosphate isomerase Proteins 0.000 description 2
- 241001149669 Hanseniaspora Species 0.000 description 2
- 102000001554 Hemoglobins Human genes 0.000 description 2
- 108010054147 Hemoglobins Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101500028772 Homo sapiens Glucagon-like peptide 1(7-36) Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 2
- 241001138401 Kluyveromyces lactis Species 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 239000007832 Na2SO4 Substances 0.000 description 2
- 241000221960 Neurospora Species 0.000 description 2
- 241000221961 Neurospora crassa Species 0.000 description 2
- 241000228143 Penicillium Species 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 241000223252 Rhodotorula Species 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- 241000311088 Schwanniomyces Species 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- 241000255993 Trichoplusia ni Species 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 229960004733 albiglutide Drugs 0.000 description 2
- OGWAVGNOAMXIIM-UHFFFAOYSA-N albiglutide Chemical compound O=C(O)C(NC(=O)CNC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)CNC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)CNC(=O)C(NC(=O)CNC(=O)C(N)CC=1(N=CNC=1))CCC(=O)O)C(O)C)CC2(=CC=CC=C2))C(O)C)CO)CC(=O)O)C(C)C)CO)CO)CC3(=CC=C(O)C=C3))CC(C)C)CCC(=O)O)CCC(=O)N)C)C)CCCCN)CCC(=O)O)CC4(=CC=CC=C4))C(CC)C)C)CC=6(C5(=C(C=CC=C5)NC=6)))CC(C)C)C(C)C)CCCCN)CCCNC(=N)N OGWAVGNOAMXIIM-UHFFFAOYSA-N 0.000 description 2
- 125000003368 amide group Chemical group 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000010241 blood sampling Methods 0.000 description 2
- 230000037396 body weight Effects 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 125000000837 carbohydrate group Chemical group 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000004087 circulation Effects 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000004090 dissolution Methods 0.000 description 2
- AMRJKAQTDDKMCE-UHFFFAOYSA-N dolastatin Chemical compound CC(C)C(N(C)C)C(=O)NC(C(C)C)C(=O)N(C)C(C(C)C)C(OC)CC(=O)N1CCCC1C(OC)C(C)C(=O)NC(C=1SC=CN=1)CC1=CC=CC=C1 AMRJKAQTDDKMCE-UHFFFAOYSA-N 0.000 description 2
- 229930188854 dolastatin Natural products 0.000 description 2
- 238000009509 drug development Methods 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 150000004665 fatty acids Chemical group 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 230000002218 hypoglycaemic effect Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 230000003914 insulin secretion Effects 0.000 description 2
- 239000003509 long acting drug Substances 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 208000030159 metabolic disease Diseases 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 239000000546 pharmaceutical excipient Substances 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 230000001376 precipitating effect Effects 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000002035 prolonged effect Effects 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 108700027806 rGLP-1 Proteins 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 229910052938 sodium sulfate Inorganic materials 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 210000002784 stomach Anatomy 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000013268 sustained release Methods 0.000 description 2
- 239000012730 sustained-release form Substances 0.000 description 2
- 238000012353 t test Methods 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- 125000003396 thiol group Chemical group [H]S* 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 230000003442 weekly effect Effects 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 1
- LDDMACCNBZAMSG-BDVNFPICSA-N (2r,3r,4s,5r)-3,4,5,6-tetrahydroxy-2-(methylamino)hexanal Chemical compound CN[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO LDDMACCNBZAMSG-BDVNFPICSA-N 0.000 description 1
- REITVGIIZHFVGU-IBGZPJMESA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-[(2-methylpropan-2-yl)oxy]propanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](COC(C)(C)C)C(O)=O)C3=CC=CC=C3C2=C1 REITVGIIZHFVGU-IBGZPJMESA-N 0.000 description 1
- DIBLBAURNYJYBF-XLXZRNDBSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]hexanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 DIBLBAURNYJYBF-XLXZRNDBSA-N 0.000 description 1
- DQUHYEDEGRNAFO-QMMMGPOBSA-N (2s)-6-amino-2-[(2-methylpropan-2-yl)oxycarbonylamino]hexanoic acid Chemical compound CC(C)(C)OC(=O)N[C@H](C(O)=O)CCCCN DQUHYEDEGRNAFO-QMMMGPOBSA-N 0.000 description 1
- OYULCCKKLJPNPU-DIFFPNOSSA-N (2s,3r)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-hydroxybutanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H]([C@H](O)C)C(O)=O)C3=CC=CC=C3C2=C1 OYULCCKKLJPNPU-DIFFPNOSSA-N 0.000 description 1
- DDYAPMZTJAYBOF-ZMYDTDHYSA-N (3S)-4-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S)-4-amino-1-[[(2S,3S)-1-[[(1S)-1-carboxyethyl]amino]-3-methyl-1-oxopentan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-4-methylsulfanyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-6-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S,3R)-2-[[2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-imidazol-4-yl)propanoyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]acetyl]amino]-3-hydroxybutanoyl]amino]-3-phenylpropanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxypropanoyl]amino]-3-carboxypropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxypropanoyl]amino]hexanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-methylpentanoyl]amino]-3-carboxypropanoyl]amino]-3-hydroxypropanoyl]amino]-5-carbamimidamidopentanoyl]amino]-5-carbamimidamidopentanoyl]amino]propanoyl]amino]-5-oxopentanoyl]amino]-4-oxobutanoic acid Chemical class [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DDYAPMZTJAYBOF-ZMYDTDHYSA-N 0.000 description 1
- NGJOFQZEYQGZMB-KTKZVXAJSA-N (4S)-5-[[2-[[(2S,3R)-1-[[(2S)-1-[[(2S,3R)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[2-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S,3S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[2-[[(1S)-4-carbamimidamido-1-carboxybutyl]amino]-2-oxoethyl]amino]-1-oxohexan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-4-carboxy-1-oxobutan-2-yl]amino]-1-oxohexan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-2-oxoethyl]amino]-4-carboxy-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(4-hydroxyphenyl)-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-3-carboxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-2-oxoethyl]amino]-4-[[(2S)-2-[[(2S)-2-amino-3-(1H-imidazol-4-yl)propanoyl]amino]propanoyl]amino]-5-oxopentanoic acid Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 NGJOFQZEYQGZMB-KTKZVXAJSA-N 0.000 description 1
- GOPWHXPXSPIIQZ-FQEVSTJZSA-N (4s)-4-(9h-fluoren-9-ylmethoxycarbonylamino)-5-[(2-methylpropan-2-yl)oxy]-5-oxopentanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CCC(O)=O)C(=O)OC(C)(C)C)C3=CC=CC=C3C2=C1 GOPWHXPXSPIIQZ-FQEVSTJZSA-N 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- ASOKPJOREAFHNY-UHFFFAOYSA-N 1-Hydroxybenzotriazole Chemical class C1=CC=C2N(O)N=NC2=C1 ASOKPJOREAFHNY-UHFFFAOYSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- IIVWHGMLFGNMOW-UHFFFAOYSA-N 2-methylpropane Chemical compound C[C](C)C IIVWHGMLFGNMOW-UHFFFAOYSA-N 0.000 description 1
- XXSCONYSQQLHTH-UHFFFAOYSA-N 9h-fluoren-9-ylmethanol Chemical compound C1=CC=C2C(CO)C3=CC=CC=C3C2=C1 XXSCONYSQQLHTH-UHFFFAOYSA-N 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- 101710187573 Alcohol dehydrogenase 2 Proteins 0.000 description 1
- 101710133776 Alcohol dehydrogenase class-3 Proteins 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- 241000713842 Avian sarcoma virus Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 241000701822 Bovine papillomavirus Species 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 108010055448 CJC 1131 Proteins 0.000 description 1
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 229920000858 Cyclodextrin Polymers 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 102100025012 Dipeptidyl peptidase 4 Human genes 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 238000010156 Dunnett's T3 test Methods 0.000 description 1
- 241000700662 Fowlpox virus Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- 102000051325 Glucagon Human genes 0.000 description 1
- 108060003199 Glucagon Proteins 0.000 description 1
- 102400000326 Glucagon-like peptide 2 Human genes 0.000 description 1
- 101800000221 Glucagon-like peptide 2 Proteins 0.000 description 1
- 102000030595 Glucokinase Human genes 0.000 description 1
- 108010021582 Glucokinase Proteins 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- 108010014663 Glycated Hemoglobin A Proteins 0.000 description 1
- 102000002068 Glycopeptides Human genes 0.000 description 1
- 108010015899 Glycopeptides Proteins 0.000 description 1
- 208000032843 Hemorrhage Diseases 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 102000005548 Hexokinase Human genes 0.000 description 1
- 108700040460 Hexokinases Proteins 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 1
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 1
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- 102000036770 Islet Amyloid Polypeptide Human genes 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-dimethylformamide Substances CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 102000016387 Pancreatic elastase Human genes 0.000 description 1
- 108010067372 Pancreatic elastase Proteins 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- AOKZOUGUMLBPSS-PMVMPFDFSA-N Phe-Trp-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O AOKZOUGUMLBPSS-PMVMPFDFSA-N 0.000 description 1
- 102000001105 Phosphofructokinases Human genes 0.000 description 1
- 108010069341 Phosphofructokinases Proteins 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 229920000361 Poly(styrene)-block-poly(ethylene glycol) Polymers 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 102000004879 Racemases and epimerases Human genes 0.000 description 1
- 108090001066 Racemases and epimerases Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 235000018368 Saccharomyces fragilis Nutrition 0.000 description 1
- 241001123650 Schwanniomyces occidentalis Species 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- FFOKMZOAVHEWET-IMJSIDKUSA-N Ser-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(O)=O FFOKMZOAVHEWET-IMJSIDKUSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 101150006914 TRP1 gene Proteins 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 241001149964 Tolypocladium Species 0.000 description 1
- 101710120037 Toxin CcdB Proteins 0.000 description 1
- 241000223259 Trichoderma Species 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- 244000000188 Vaccinium ovalifolium Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 239000003875 Wang resin Substances 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- NERFNHBZJXXFGY-UHFFFAOYSA-N [4-[(4-methylphenyl)methoxy]phenyl]methanol Chemical compound C1=CC(C)=CC=C1COC1=CC=C(CO)C=C1 NERFNHBZJXXFGY-UHFFFAOYSA-N 0.000 description 1
- VJHCJDRQFCCTHL-UHFFFAOYSA-N acetic acid 2,3,4,5,6-pentahydroxyhexanal Chemical compound CC(O)=O.OCC(O)C(O)C(O)C(O)C=O VJHCJDRQFCCTHL-UHFFFAOYSA-N 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- CEGOLXSVJUTHNZ-UHFFFAOYSA-K aluminium tristearate Chemical compound [Al+3].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O CEGOLXSVJUTHNZ-UHFFFAOYSA-K 0.000 description 1
- 229940063655 aluminum stearate Drugs 0.000 description 1
- 150000001408 amides Chemical group 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960004977 anhydrous lactose Drugs 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000003957 anion exchange resin Substances 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 230000001651 autotrophic effect Effects 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000000740 bleeding effect Effects 0.000 description 1
- 230000017531 blood circulation Effects 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 229940014641 bydureon Drugs 0.000 description 1
- 229940084891 byetta Drugs 0.000 description 1
- 238000013262 cAMP assay Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 1
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 1
- 238000005341 cation exchange Methods 0.000 description 1
- 238000000423 cell based assay Methods 0.000 description 1
- 210000005056 cell body Anatomy 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000003889 chemical engineering Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 229960005168 croscarmellose Drugs 0.000 description 1
- 229960001681 croscarmellose sodium Drugs 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000001767 crosslinked sodium carboxy methyl cellulose Substances 0.000 description 1
- 235000010947 crosslinked sodium carboxy methyl cellulose Nutrition 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 229940097362 cyclodextrins Drugs 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000003413 degradative effect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000012351 deprotecting agent Substances 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 125000004177 diethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- UXGNZZKBCMGWAZ-UHFFFAOYSA-N dimethylformamide dmf Chemical compound CN(C)C=O.CN(C)C=O UXGNZZKBCMGWAZ-UHFFFAOYSA-N 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- 239000007884 disintegrant Substances 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 230000000857 drug effect Effects 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000002662 enteric coated tablet Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000556 factor analysis Methods 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- SAVROPQJUYSBDD-UHFFFAOYSA-N formyl(dimethyl)azanium;chloride Chemical compound [Cl-].C[N+](C)=CO SAVROPQJUYSBDD-UHFFFAOYSA-N 0.000 description 1
- AIWAEWBZDJARBJ-PXUUZXDZSA-N fz7co35x2s Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCNC(=O)COCCOCCNC(=O)CCN1C(C=CC1=O)=O)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 AIWAEWBZDJARBJ-PXUUZXDZSA-N 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000003629 gastrointestinal hormone Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 102000018146 globin Human genes 0.000 description 1
- 108060003196 globin Proteins 0.000 description 1
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 1
- 229960004666 glucagon Drugs 0.000 description 1
- 239000003877 glucagon like peptide 1 receptor agonist Substances 0.000 description 1
- TWSALRJGPBVBQU-PKQQPRCHSA-N glucagon-like peptide 2 Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O)[C@@H](C)CC)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)CC)C1=CC=CC=C1 TWSALRJGPBVBQU-PKQQPRCHSA-N 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- MGXWVYUBJRZYPE-YUGYIWNOSA-N incretin Chemical class C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)[C@@H](C)O)[C@@H](C)CC)C1=CC=C(O)C=C1 MGXWVYUBJRZYPE-YUGYIWNOSA-N 0.000 description 1
- 239000000859 incretin Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 229940031154 kluyveromyces marxianus Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 229960001375 lactose Drugs 0.000 description 1
- 150000002605 large molecules Chemical class 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 229940031703 low substituted hydroxypropyl cellulose Drugs 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- VZCYOOQTPOCHFL-UPHRSURJSA-N maleic acid Chemical compound OC(=O)\C=C/C(O)=O VZCYOOQTPOCHFL-UPHRSURJSA-N 0.000 description 1
- 150000002689 maleic acids Chemical class 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- COTNUBDHGSIOTA-UHFFFAOYSA-N meoh methanol Chemical compound OC.OC COTNUBDHGSIOTA-UHFFFAOYSA-N 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- FEMOMIGRRWSMCU-UHFFFAOYSA-N ninhydrin Chemical compound C1=CC=C2C(=O)C(O)(O)C(=O)C2=C1 FEMOMIGRRWSMCU-UHFFFAOYSA-N 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 239000002674 ointment Substances 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 239000006174 pH buffer Substances 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000004237 preparative chromatography Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 239000000018 receptor agonist Substances 0.000 description 1
- 229940044601 receptor agonist Drugs 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000001953 recrystallisation Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 239000011342 resin composition Substances 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000000580 secretagogue effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- CLDWGXZGFUNWKB-UHFFFAOYSA-M silver;benzoate Chemical compound [Ag+].[O-]C(=O)C1=CC=CC=C1 CLDWGXZGFUNWKB-UHFFFAOYSA-M 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- CDBYLPFSWZWCQE-UHFFFAOYSA-L sodium carbonate Substances [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 229940032147 starch Drugs 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- UEUXEKPTXMALOB-UHFFFAOYSA-J tetrasodium;2-[2-[bis(carboxylatomethyl)amino]ethyl-(carboxylatomethyl)amino]acetate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]C(=O)CN(CC([O-])=O)CCN(CC([O-])=O)CC([O-])=O UEUXEKPTXMALOB-UHFFFAOYSA-J 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- ZGYICYBLPGRURT-UHFFFAOYSA-N tri(propan-2-yl)silicon Chemical compound CC(C)[Si](C(C)C)C(C)C ZGYICYBLPGRURT-UHFFFAOYSA-N 0.000 description 1
- OHSJPLSEQNCRLW-UHFFFAOYSA-N triphenylmethyl radical Chemical compound C1=CC=CC=C1[C](C=1C=CC=CC=1)C1=CC=CC=C1 OHSJPLSEQNCRLW-UHFFFAOYSA-N 0.000 description 1
- 101150108727 trpl gene Proteins 0.000 description 1
- 238000000825 ultraviolet detection Methods 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/04—Anorexiants; Antiobesity agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/08—Drugs for disorders of the metabolism for glucose homeostasis
- A61P3/10—Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/605—Glucagons
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
- C07K7/08—Linear peptides containing only normal peptide links having 12 to 20 amino acids
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/52—Constant or Fc region; Isotype
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/30—Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Diabetes (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Obesity (AREA)
- Hematology (AREA)
- Pharmacology & Pharmacy (AREA)
- General Chemical & Material Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Gastroenterology & Hepatology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Engineering & Computer Science (AREA)
- Endocrinology (AREA)
- Emergency Medicine (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Child & Adolescent Psychology (AREA)
- Immunology (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present invention provides for the replacement of a particular fusion protein at multiple positions in the GLP-1 portion and Fc portion of the molecule. Wherein, the Ser at the 11 th site, the Val at the 85 th site, the Val at the 86 th site, the Ser at the 87 th site and the Val at the 91 st site of the Fc are replaced, and the terminal Lys is deleted, thereby overcoming the problem of potential immunogenicity related to the application of GLP-1-Fc fusion to a great extent, and the results of in vitro and in vivo activity experiments also prove that the activity of the GLP-1 analogue disclosed by the invention is obviously higher than that of the existing GLP-1 analogue. Therefore, the analogs can be used to better prevent, prevent or reduce diabetes and obesity and/or complications.
Description
Technical Field
The invention belongs to the technical field of genetic engineering, and particularly relates to a heterologous fusion protein.
Background
Diabetes Mellitus (DM) is a complex metabolic disease with elevated blood sugar caused by endocrine metabolism disorder due to insufficient insulin secretion or synthesis inhibition in vivo, and is mainly divided into type 1 diabetes mellitus (T1 DM), type2diabetes mellitus (T2 diabetes mellitus, T2DM) and other types of diabetes mellitus, according to statistics, the type2diabetes mellitus accounts for a large proportion, which accounts for more than 90% of the total number of patients with diabetes mellitus, and the disease period is more than 35-40 years old, and more serious complications appear in the later period of onset.
Glucagon-like peptide (GLP) is an intestinal hormone secreted by human body, and is formed by cracking Glucagon molecules through intestinal proteolytic enzyme to generate two Glucagon-like peptides GLP-1 and GLP-2. Among them, GLP-1 has two active forms, i.e., GLP-1(7-37) and GLP-1(7-36) amide forms, and has the same effect of promoting insulin secretion in vivo, so it is also called insulinotropic peptide (Incretin or insulinotropic peptide) (Negar Sarrazadeh et al, Pharmaceutical Sciences, Vol.96, 1925-1954 (2007)).
GLP-1(7-37):
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly GlnAla Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly。
GLP-1 agonist is taken as a kind of polypeptide drug, and can only be injected for administration at present, and in order to improve the compliance of patients, sustained-release injection and non-injection administration become main directions for the research and development of the kind of drugs. Structural modification is adopted to cover DPP-IV enzyme sites and prolong the biological half-life, which is the most mainstream method at present. The structural modification comprises two types of chained high-molecular protein and polyethylene glycol modification, wherein Abiglutide and Dulaglutide which are used once a week belong to the typical former type, and although a plurality of polyethylene glycol modified protein or polypeptide medicaments are available on the market, the polyethylene glycol modified GLP-1 analogue starts later, and only the somaglutide enters the third clinical stage. Certainly, some companies can prepare the sustained-release microspheres by means of preparations to realize once a week, and a typical product is Bydureon. There are also companies that directly avoid the disadvantages of injection administration, develop products such as implants, oral administration, transdermal administration and inhalation, wherein oral administration is the most ideal administration route, but need to solve the problems of stomach acid and enzyme damage and absorption of GLP-1 analogue in intestines and stomach, so as to improve bioavailability and reduce individual difference, the current companies that orally take GLP-1 analogue are the Oramed company and the emisphene company, wherein NN9924 of Nound is an oral soma peptide, and the Iseland has reported the exenatide enteric-coated tablet in China.
Amidated GLP-1 derivative Liraglutide (T, DiabetesCare, 2007, 30: 1608-1610) developed by Novo Nordisk of Denmark has Lys at position 34 replaced by Arg as compared with native GLP-1, and a fatty acid side chain having 16 carbons is linked to Lys at position 26 of GLP-1 via glutamyl. Liraglutide can form a non-covalent linked derivative with albumin in plasma, retains the function of GLP-1, antagonizes the degradation of DPP IV, has a half-life prolonged to 11-15 hours, and is suitable for once-a-day administration (Elbrond B, Diabetes Care, 2002, 25: 1398-1404). The C-terminal lysine at position 37 in another GLP-1 analog CJC-1131 under development can form covalent linkage with cysteine residue 34 in albumin in plasma after in vitro modification, and can also achieve the goal of once-a-day administration (Kim, JG, Diabetes, 2003, 52: 751-759). Therefore, GLP-1 and exendin-4 can better retain the biological activity through the GLP-1 analogue obtained by mutating (1 or a plurality of amino acids), deleting or adding amino acids to the C terminal by a chemical or genetic engineering means.
Exendin-4:
His Gly Glu Gly Thr Phe Thr Ser Asp Leu Ser Lys Gln Met Glu Glu GluAla Val Arg Leu Phe Ile Glu Trp Leu Lys Asn Gly Gly Pro Ser Ser Gly Ala ProPro Pro Ser。
Exendin-4 is a GLP-1 analogue isolated from lizard saliva and consisting of 39 amino acids, the amino acid sequence of which has sequence similarity with several members of the GLP family and 53% homology with human GLP-1. Because the second position of the N terminal is Ala (GLP-1 is Gly), the N terminal is not easy to be degraded by DPP-IV enzyme, and has longer half life and stronger biological activity. The chemical synthesized product of Exendin-4 is named Exenatide (trade name Byetta), and is jointly developed by Amylin and Lilly in 1995, and approved by FDA to be marketed in 4 months in 2005.
The expression and activity of human GLP-1(7-36) in vivo are strictly regulated, and when the second bit Ala at the N terminal of the human GLP-1(7-36) is hydrolyzed by dipeptidyl peptidase (DPP), inactive GLP-1 (9-36) is formed, and the metabolite is also an in vivo natural antagonist of GLP-1R. Therefore, the natural human active GLP-1 has short half-life in vivo and the metabolism rate of 2 min; in addition, under physiological conditions, GLP-1 is mainly excreted through the kidney, and the clinical application of the human GLP-1 is limited. The second amino acid Gly of non-human-derived artificially synthesized Exenatide is different from Ala of human GLP-1, and can effectively resist the degradation of dipeptide acyl peptidase; the C-terminal rigid (PSSGAPPPS) amino acid sequence of Exenatide can increase the stability of polypeptide, and the blood sugar reducing capability of Exenatide in vivo is about 1000 times stronger than that of GLP-1.
Exenatide peptide secondary structure: the N end of Exenatide is irregularly curled, amino acid residues with opposite charge side chains are alternately arranged on the same side surface of the middle part of Exenatide, a helix is formed through a salt bridge or a polar hydrogen bond, and the C end of Exenatide is hydrophilic Trp-Cage. The interaction mechanism of Exenatide with the GLP-1 receptor has been studied clearly.
A series of different structural designs have been developed to extend the structural half-life of GLP-1 analogs and enhance biological activity. CN1384755A discloses a novel Exendin agonist preparation and an administration method thereof, and discloses a compound structure and a preparation method of Exenatide. CN102532303A discloses the modification of amino group of lysine or amino group of N-terminal histidine residue in Exenatide with methoxypolyethylene glycol. CN101980725B discloses fatty acid-PEG-Exenatide. CN105753963A discloses Exenatide analogs with single-point or multi-point amino acid mutations. CN102397558A discloses the substitution of certain amino acids in Exenatide with cysteine, and the modification with PEG or PEG with methyl substituted end.
Although many efforts are made in the development of GLP-1 analogue long-acting drugs, the GLP-1 analogue on the market at present has poor stability and low drug effect, and innovative modification and research of GLP-1 analogue are still a very important topic as the structure and activity basis of long-acting drug development.
Various approaches have been taken to prolong the clearance half-life of the GLP-1 peptide or to reduce clearance of the peptide from the body while maintaining biological activity. One approach involves fusing a GLP-1 peptide to an immunoglobulin Fc moiety. Immunoglobulins generally have a long circulating half-life in vivo. For example, IgG molecules have a half-life in humans of up to 23 days. The immunoglobulin Fc portion is part of the reason for this in vivo stability. While retaining the biological activity of the GLP-1 molecule, GLP-1-Fc fusion proteins have the advantage of retaining the biological activity of the GLP-1 molecule due to the stability provided by the Fc portion of the immunoglobulin.
However, the problem of immunogenicity of many fusion proteins when repeatedly administered over long periods of time is still prevalent.
Proteins, including therapeutic proteins, are generally immunogenic, in part because the product of protein endocytosis and proteolysis by antigen presenting cells binds the peptide to molecules called the Major Histocompatibility Complex (MHC), which then present the peptide to T cells. Antigen peptide-MHC complexes on the surface of Antigen Presenting Cells (APC) stimulate T cells to proliferate, differentiate and release cytokines. At the same time, B-cell differentiation and antibody production are induced, which may further limit the utility of the therapeutic protein by clearance of the therapeutic protein. Thus, antigenic peptides derived from therapeutic proteins are capable of eliciting a range of unwanted immune responses. The utility of therapeutic proteins is limited by the neutralization of antibodies, and because T-cell and B-cell responses can cause inflammatory and allergic reactions in patients, their induction is often detrimental.
Disclosure of Invention
The present invention provides a heterologous fusion protein that provides multiple positional substitutions in the Fc portion, thereby largely overcoming potential immunogenicity problems associated with administration of Fc fusions and improving stability. The therapeutic peptide bound by the heterologous fusion protein may be some therapeutic peptide capable of being fused to Fc, such as exenatide, liraglutide, lissamide, somaglutide, ldeglira, albiglutide, dulaglutide, ITCA650, LAEx4, Lixilan, and the like.
The invention also provides an improved GLP-1 analogue, and the results of in vivo and in vitro activity experiments prove that the GLP-1 analogue has obviously higher activity than the existing GLP-1 analogue. Therefore, the analogues of the present invention can be used for better prevention, prevention or reduction of diabetes and obesity and/or complications.
The GLP-1 analogue and the immunoglobulin Fc form a fusion protein through a joint, the immunoglobulin Fc part is obtained by modifying the Fc fragment of human IgG4, and the sequence number of the Fc fragment is SEQ ID NO: 1, the specific sequence is as follows:
Ala-Glu-Ser-Lys-Tyr-Gly-Pro-Pro-Cys-Pro-Xaall-Cys-Pro-Ala-Pro-
Glu-Phe-Leu-Gly-Gly-Pro-Ser-Val-Phe-Leu-Phe-Pro-Pro-Lys-Pro-
Lys-Asp-Thr-Leu-Met-Ile-Ser-Arg-Thr-Pro-Glu-Val-Thr-Cys-Val-
Val-Val-Asp-Val-Ser-Gln-Glu-Asp-Pro-Glu-Val-Gln-Phe-Asn-Trp-
Tyr-Val-Asp-Gly-Val-Glu-Val-His-Asn-Ala-Lys-Thr-Lys-Pro-Arg-
Glu-Glu-Gln-Phe-Asn-Ser-Thr-Tyr-Arg-Xaa85-Xaa86-Xaa87-Val-Leu-Thr-
Xaa91-Leu-His-Gln-Asp-Trp-Leu-Asn-Gly-Lys-Glu-Tyr-Lys-Cys-Lys-
Val-Ser-Asn-Lys-Gly-Leu-Pro-Ser-Ser-Ile-Glu-Lys-Thr-Ile-Ser-
Lys-Ala-Lys-Gly-Gln-Pro-Arg-Glu-Pro-Gln-Val-Tyr-Thr-Leu-Pro-
Pro-Ser-Gln-Glu-Glu-Met-Thr-Lys-Asn-Gln-Val-Ser-Leu-Thr-Cys-
Leu-Val-Lys-Gly-Phe-Tyr-Pro-Ser-Asp-Ile-Ala-Val-Glu-Trp-Glu-
Ser-Asn-Gly-Gln-Pro-Glu-Asn-Asn-Tyr-Lys-Thr-Thr-Pro-Pro-Val-
Leu-Asp--Ser-Asp--Gly-Ser-Phe-Phe-Leu-Tyr-Ser-Arg-Leu-Thr-Val--
Asp-Lys-Ser-Arg-Trp-Gln-Glu-Gly-Asn-Val-Phe-Ser-Cys-Ser-Val-
Met-His-Glu-Ala-Leu-His-Asn-His-Tyr-Thr-Gln-Lys-Ser-Leu-Ser-
Leu-Ser-Leu-Gly,
wherein Ser at position 11, Val at position 85, Val at position 86, Ser at position 87 and Val at position 91 are replaced, and terminal Lys is deleted, and the method specifically comprises the following steps:
xaa at position 11 is Pro;
xaa at position 85 is Ala or Phe;
xaa at position 86 is Phe or Glu;
xaa at position 87 is Glu, Phe or Leu;
xaa at position 91 is Gly.
Sequence 1 is abbreviated as:
in a preferred embodiment, it is preferably SEQ ID NO: 2, sequence 2:
substitution at position 11 with Pro;
xaa at position 85 is Ala;
xaa at position 86 is Phe;
xaa at position 87 is Glu;
xaa at position 91 is Gly.
Sequence 2 abbreviation:
in a preferred embodiment, it is preferably SEQ ID NO: 3, sequence 3:
substitution at position 11 with Pro;
xaa at position 85 is Phe;
xaa at position 86 is Glu;
xaa at position 87 is Phe;
xaa at position 91 is Gly.
Sequence 3 abbreviation:
the fusion protein, the therapeutic peptide of which is a GLP-1 analog, wherein the C-terminal glycine residue of the GLP-1 analog is fused to the N-terminal alanine residue of the Fc portion via a peptide linker, wherein the peptide linker is SEQ ID NO: 4, in particular to Gly-Ala-Gly-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser.
The peptide linker may also be selected from the following sequences:
a) Gly-Gly-Gly-Gly-Gly-Ser-Lys-Lys-Lys-Lys-Ser-Gly-Gly-Gly-Gly-Ser, such as SEQ ID NO: 5 is shown in the specification;
b) Gly-Ala-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Gly-Ser, as shown in SEQ ID NO: 6 is shown in the specification;
c) Gly-Gly-Gly-Gly-Gly-Ser-Lys-Lys-Lys-Lys-Ser-Gly-Gly-Gly-Gly-Gly-Ser, as shown in SEQ ID NO: 7 is shown in the specification;
d)Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser(SEQ IDNO:8)。
in the fusion protein, the GLP-1 analogue is obtained by modifying GLP-1(7-37), and in order to conveniently renumber the GLP-1 analogue, the original 7-bit His is numbered as 1 bit, and the sequence is SEQ ID NO: 9 is sequence 9:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Xaa-Leu-Val-Lys-Gly-Xaa-Xaa;
xaa at the 25 th position is β Trp or β Phe or β His, and β th position in a sequence table is subject to the specification;
xaa at position 30 is Pro or Val or Glu or Ala;
xaa at the 31 th position is Lys-Lys-Lys-Gln-Gln-NH2;Lys-Lys-Pro-Phe-Phe-Pro-Cys-NH2;Lys-Lys-Ser-Cys-NH2;Gly-Gly-Lys-Lys-Cys-NH2。
In a preferred embodiment, the nucleic acid sequence of SEQ id no: 10, sequence 10:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-β Phe-Leu-Val-Lys-Gly-Ala-Gly-Gly-Lys-Lys-Cys-NH2。
in a preferred embodiment, the nucleic acid sequence of SEQ id no: 11, sequence 11:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-β Phe-Leu-Val-Lys-Gly-Pro-Lys-Lys-Pro-Phe-Phe-Pro-Cys-NH2。
in a preferred embodiment, the nucleic acid sequence of SEQ id no: 12, sequence 12:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-β Phe-Leu-Val-Lys-Gly-Glu-Gly-Gly-Lys-Lys-Cys-NH2。
in a preferred embodiment, the nucleic acid sequence of SEQ id no: 13, sequence 13:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-β Phe-Leu-Val-Lys-Gly-Val-Gly-Gly-Lys-Lys-Cys-NH2。
in a preferred embodiment, the nucleic acid sequence of SEQ id no: 14, sequence 14:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βPhe-Leu-Val-Lys-Gly-Val-Lys-Lys-Lys-Gln-Gln-NH2。
in a preferred embodiment, the nucleic acid sequence of SEQ id no: 15, sequence 15:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-β His-Leu-Val-Lys-Gly-Ala-Gly-Gly-Lys-Lys-Cys-NH2。
in a preferred embodiment, the nucleic acid sequence of SEQ id no: 16, sequence 16:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-β Trp-Leu-Val-Lys-Gly-Ala-Lys-Lys-Ser-Cys-NH2。
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 10.
the fusion protein is a sequence 10-linker sequence 4-sequence 2, is named as HIP-1, and has a sequence number of SEQ ID NO: the sequence of the 17 specific protein is as follows:
the fusion protein is a sequence 10-linker sequence 4-sequence 3, named as HIP-2, and has a sequence number of SEQ ID NO: the sequence of 18 specific proteins is:
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 11.
the fusion protein is a sequence 11-linker sequence 4-sequence 2, named as HIP-3, and has a sequence number of SEQ ID NO: the 19 specific protein sequence is:
the fusion protein is a sequence 11-linker sequence 4-sequence 3, named as HIP-4, and has a sequence number of SEQ ID NO: 20 the specific protein sequence is:
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 12.
the fusion protein is a sequence 12-linker sequence 4-sequence 2, named as HIP-5, and has a sequence number of SEQ ID NO: the sequence of the 21 specific protein is as follows:
the fusion protein is a sequence 12-linker sequence 4-sequence 3, named as HIP-6, and has a sequence number of SEQ ID NO: the sequence of the 22 specific protein is as follows:
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 13.
the fusion protein is a sequence 13-linker sequence 4-sequence 2, is named as HIP-7, and has a sequence number of SEQ ID NO: the 23 specific protein sequence is:
the fusion protein is a sequence 13-linker sequence 4-sequence 2, is named as HIP-8, and has a sequence number of SEQ ID NO: the sequence of the 24 specific proteins is:
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 14.
the fusion protein is a sequence 14-linker sequence 4-sequence 2, is named as HIP-9, and has a sequence number of SEQ ID NO: 25 the specific protein sequence is:
the fusion protein is a sequence 14-linker sequence 4-sequence 3, is named as HIP-10, and has a sequence number of SEQID NO: 26 the specific protein sequence is:
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 15.
the fusion protein is a sequence 15-linker sequence 4-sequence 2, is named as HIP-11, and has a sequence number of SEQID NO: 27 the specific protein sequence is:
in a preferred embodiment, the fusion protein, wherein the sequence of the GLP-1 analog is SEQ ID NO: 16.
the fusion protein is a sequence 16-linker sequence 4-sequence 2, is named as HIP-12, and has a sequence number of SEQID NO: 28 specific protein sequences are:
among the above sequences, the Glp-1 analogue sequences 10-16 before Fc fusion show better stability than other Glp-1 analogues, wherein the sequences 10 and 11 are better; the half-life can be further prolonged after the fusion with Fc, wherein Fc sequence 2 is better than sequence 3.
The fusion protein, wherein the GLP-1 analogue can also be exenatide, liraglutide, linatide, soxhlet peptide, Ideglira, albiglutide, dolaglutide, ITCA650, LAEx4, Lixilan and the like.
The Fc part of the fusion protein can be obtained by a solid-phase synthesis method and can also be obtained by eukaryotic cell expression; the GLP-1 analog moiety can be obtained by solid phase synthesis methods.
Solid phase synthesis method:
when the C terminal is carboxyl, the Wang resin is selected for chemical solid phase synthesis of two polypeptides. After the synthesis is finished, the obtained polypeptide resin with the side chain protecting group is cracked, and the C terminal is broken from the resin to form carboxyl. When the C-terminal of the polypeptide is amide, Fmoc-PAL-PEG-PS resin is selected for chemical solid-phase synthesis of the polypeptide. After the synthesis is finished, the obtained polypeptide resin with the side chain protecting group is cracked, and the C terminal is broken from the resin to form amide. The above solid phase synthesis was carried out on a polypeptide synthesizer.
The C-terminal cysteine was first attached to the resin, and then one amino acid was carried out from the C-terminus to the N-terminus. After the synthesis is complete, the Glp-1 analogue polypeptide is purified by reverse phase HPLC chromatography (Waters, C18 preparative column) after deprotection and cleavage from the resin. The purification conditions are that phase A contains 0.5% (V/V) acetic acid water solution, phase B contains 80% acetonitrile and 0.5% (V/V) acetic acid water solution, and gradient elution is carried out by 0-100% B solution. Collecting target polypeptide, and lyophilizing to obtain dry powder. The molecular weight of GLP-1 of sequence 10 is 4288.69 by mass spectrometry, which is consistent with the theoretical value.
In one embodiment, the modified GLP-1 analogs of the invention are prepared as follows:
when Xaa at position 31 is Lys-Lys-Lys-Gln-Gln-NH 2: solid phase synthesis was chosen and, further, Fmoc method was used. As the C end of the polypeptide is an amide group, the Fmoc method can use Fmoc-Rink resin which is characterized in that stable amide bonds are formed by reacting carboxyl of Fmoc-Gln (Trt) -OH with carrier amino, and a crude product can be obtained by cracking after the peptide grafting.
Gly-Gly-Lys-Lys-Cys-NH at position 312In this case, a solid-phase synthesis method is selected, and further, Fmoc method is used. As the C end of the polypeptide is an amido group, the Fmoc method can use Fmoc-Rink resin, the principle is that the carboxyl group of Fmoc-Cys (Trt) -OH reacts with the amino group of the carrier to form a stable amide bond, and a crude product can be obtained by cracking after the peptide grafting.
Fmoc-Rink resin (100-200 meshes, the crosslinking degree is 1-2%, and the substitution value is 0.3-0.6 mmol NH/g) is selected as a starting material, and the molar ratio of the protected amino acid to the Fmoc-Rink resin is 2: 1 (or 3: 1). The starting Fmoc-Rink resin (substitution 0.436mmol/g) was reacted with 20% piperidine (PIP/DMF) as a deprotecting agent at room temperature for 25 min, filtered with suction and the resin was washed 6 times with DMF (or DCM). Dissolving equimolar amounts of protected amino acids Fmoc-AAn and HOBt in DMF, cooling at-5 deg.C for more than 30min, slowly adding N, N-diisopropylcarbodiimide solution in dichloromethane to the protected amino acid DMF solution, reacting at-5-5 deg.C for 30min under stirring, and adding to the deprotected resin.
After 3 hours of reaction of the deprotected resin with the activated protected amino acid, the condensation reaction was monitored with ninhydrin test: the resin is blue or dark blue when the condensation reaction is incomplete; the resin is colorless or yellowish when the condensation reaction is complete. After monitoring the complete condensation of the condensation, the resin was washed 6 times with DMF. The condensation of the activated amino acids is carried out in sequence according to the polypeptide sequence. After completion of the peptide-joining reaction, the resin was washed with DMF and then shrunk with methanol, and the resin was dried under reduced pressure.
The dried peptide resin was added to TFA cleavage reagent (in mL: g, TFA: H) at 13mL/g peptide resin dosage2O, TIS and DTT are equal to 90: 2.5: 5), and after reaction at 0-5 ℃ for 30min, the mixture is stirred and reacted for 2-3h at room temperature. The reaction mixture was filtered, the filtrate was collected and concentrated to 25% of the initial volume at 35 ℃ under reduced pressure. Adding the concentrated solution into anhydrous ether frozen below-10 deg.C, shaking, precipitating at below-10 deg.C for 60 min, filtering, collecting filter cake, washing the filter cake with anhydrous ether for 3-5 times, and drying the filter cake under reduced pressure to constant weight to obtain polypeptide crude product.
Taking the crude polypeptide, dissolving the crude polypeptide in 10% acetic acid water solution for purification. Purification was performed using high performance liquid preparative chromatography with a mobile phase system of 0.1% TFA/water to 0.1% TFA/acetonitrile. The purification method comprises the following steps of using a chromatographic column with 10-micron reversed phase C18 as a chromatographic packing, ultraviolet detection wavelength of 215nm and 50mm x 300mm, enabling the flow rate to be 60mL/min, feeding 2.0-3.0g of sample each time, adopting a gradient system for elution, circularly feeding and purifying, collecting a main peak, and detecting the purity by using an analytical liquid phase, wherein the purity of the purified main peak is more than 98%. The prepared eluent is decompressed and concentrated under the condition of water bath at the temperature of 30-37 ℃ to obtain a concentrated solution of the primary purified material.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm and a flow rate of 30-60 ml min < -1 >. Collecting the target fraction, concentrating the collected fraction at a temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain pure GLP-1 analogue.
The cell expression method can refer to CN 104277112A, firstly, the coding gene of the heterologous fusion protein is constructed; then cloning the gene to a eukaryotic expression vector to obtain a eukaryotic expression vector capable of expressing the fusion protein; then the expression vector is used for transfecting host cells to enable the host cells to express the recombinant fusion protein, and then the recombinant fusion protein is obtained through separation and purification. The eukaryotic expression vector can be PET32 and the like. The host cell can be FreeStyle 293F, yeast cell, CHO cell, NSO cell, etc.
Among them, wild-type human IgG4 protein can be obtained from various sources. For example, a cDNA library can be prepared from cells expressing the mRNA of interest at detectable levels to obtain these proteins. Libraries can be screened using probes designed using published DNA or protein sequences for a particular protein of interest. For example, in Adams et al, (1980) Biochemistry 19: 2711-2719; goughet et al, (1980) Biochemistry 19: 2702-2710; dolby et al, (1980) proc.natl.acad.sci.usa 77: 6027-6031; rice et al, (1982) proc.natl.acad.sci.usa 79: 7862-7862; falkner et al, (1982) Nature 298: 286-; and Morrison et al, (1984) ann. rev. immunol.2: 239-, 256, describe immunoglobulin light or heavy chain constant regions.
The cDNA or genomic library can be screened with selected probes using standard procedures, for example, as described in Sambrook et al, Molecular Cloning: a Laboratory Manual, Cold spring harbor Laboratory Press, NY (1989). An alternative method for isolating genes encoding immunoglobulin proteins is to use the PCR method [ Sambrook et al, supra; dieffenbach et al, PCR Primer: a Laboratory Manual, Cold spring harbor Laboratory Press, NY (1995) ]. PCR primers can be designed based on published sequences.
The full-length wild-type sequence cloned from a particular library can generally be used as a template for generating an IgG4Fc analog fragment of the invention, wherein the IgG4Fc analog fragment retains the ability to confer a longer plasma half-life to a GLP-1 analog that is part of the fusion protein. PCR techniques can be used to generate fragments of IgG4Fc analogs using primers designed to hybridize to sequences corresponding to the desired ends of the fragments. PCR primers can also be designed to generate restriction enzyme sites to facilitate cloning into an expression vector.
The DNA encoding the GLP-1 analogs of the present invention can be produced by a variety of different methods, including those cloning methods and chemically synthesized DNA as described above.
The sequence number is SEQ ID NO: 17 is a fusion protein consisting of the sequence of SEQ ID NO: 10 by peptide linker SEQ ID NO: 4 and SEQ ID NO: 2, preferably the fusion protein obtained by ligation of the polynucleotide sequence numbered SEQ id no: 29, the specific sequence is:
preferably, the amino acid sequence of the signal peptide used in the present invention is as shown in SEQ ID NO: 30, the specific sequence is as follows:
the polynucleotide sequence of the signal peptide is shown as SEQ ID NO: 31, the specific sequence is as follows:
host cells are transfected or transformed with the expression or cloning vectors described herein for fusion protein production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying genes encoding the desired sequences. The skilled person can select culture conditions such as medium, temperature, pH, etc. without undue experimentation. In general, the methods can be described in Mammalian cell Biotechnology: principles, methods and Practical techniques for maximizing cell culture productivity were found in the Practical Approach, m.butler eds (IRL Press, 1991) and Sambrook et al, supra. Transfection methods are known to the skilled worker, for example CaPO4 and electroporation. General aspects of mammalian cell host system transformation are described in U.S. Pat. No.4,399,216. Generally according to van Solingen et al, J Bact.130 (2): 946-7(1977) and Hsiao et al, proc.natl.acad.sci.usa 76 (8): 3829-33 (1979). However, other methods of introducing DNA into cells may be used, such as nuclear microinjection, electroporation, bacterial fusion with protoplasts of intact cells, or polycations, such as 1, 5-dimethyl-1, 5-diaza-undecamethylene polymethylenebromide (polybrene) or polymithine. For various techniques for transforming mammalian cells, see Keown et al, methods Enzymology 185: 527-37(1990) and Mansour et al, Nature 336 (6197): 348-52(1988).
Suitable host cells for cloning or expressing the nucleic acids (e.g., DNA) in the vectors herein include FreeStyle 293F, yeast cells, CHO cells, NSO cells, and the like.
Eukaryotic microorganisms such as filamentous fungi or yeast are suitable cloning or expression hosts for the fusion protein vector. Saccharomyces cerevisiae (Saccharomyces cerevisiae) is a commonly used lower eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Schizosaccharomyces pombe) [ beacon and number, Nature 290: 140-3 (1981); EP 139,383 published on 2.5.1995 ]; muyveromyces hosts [ U.S. patent No.4,943,529; fleer et al, Bio/Technology9 (10): 968-75(1991) such as Kluyveromyces lactis (K.lactis) (MW98-8C, CBS683, CBS4574) [ de Louvhouse et al, J.Bacteriol.154 (2): 737-42 (1983); kluyveromyces fragilis (k.fiagalis) (ATCC 12, 424), kluyveromyces bulgaricus (k.bulgaricus) (ATCC 16, 045), kluyveromyces vachelli (K wickeramii) (ATCC24, 178), K walltii (ATCC 56, 500), kluyveromyces drosophilus (k.drosophilum) (ATCC 36.906) [ Van den Berg et al, Bio/Technology 8 (2): 135-9(1990) ]; thermomoierans and kluyveromyces marxianus (k. marxianus); yarrowia (EP402,226); pichia pastoris (Pichia pastoris) (EP 183,070) [ Sreekrishna et al, j.basicpicrobiol.28 (4): 265-78 (1988); candida genus (Candida); trichoderma reesia (EP 244,234); neurospora crassa (Neurospora crassa) [ Case et al, Proc. Natl. Acad Sci. USA 76 (10): 5259-63 (1979); schwanniomyces (Schwanniomyces), such as Schwanniomyces occidentalis (EP394,538), published in 1990 at 10 months and 31 days; and filamentous fungi such as Neurospora (Neurospora), Penicillium (Penicillium), Tolypocladium (WO 91/00357, published 1/10 1991) and Aspergillus (Aspergillus) hosts, such as Aspergillus nidulans (a. nidulans) [ Balance et al, biochem. biophysis. res. comm.112 (1): 284-9 (1983); tilburn et al, Gene 26 (2-3): 205-21 (1983); yelton et al, proc.natl.acad.sci.usa 81 (5): 1470-4(1984) and aspergillus niger (a. niger) [ Kelly and Hynes, EMBO j.4 (2): 475-9(1985)]. The methylotrophic yeast is selected from Hansenula (Hansenula), Candida (Candida), Kloeckera (Kloeckera), Pichia (Pichia), Saccharomyces, Torulopsis (Torulopsis), and Rhodotorula (Rhodotorula). A list of exemplary specific species of such yeasts can be found in C.Antony, The Biochemistry of Methylotrophs, 269 (1982).
Suitable host cells for expressing the fusion proteins of the invention are derived from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and Trichoplusia ni (Spodoptera) Sp, Trichoplusia ni high5, and plant cells. Examples of useful mammalian host cell lines include NSO myeloma cells, Chinese Hamster Ovary (CHO) cells, SP2, and COS cells. More specific examples include monkey kidney CVl line transformed with SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line [293 or subcloned 293 cells grown in suspension culture, Graham et al, j.gen virol, 36 (1): 59-74 (1977); chinese hamster ovary cells/-DHFR [ CHO, Urlaub and Chasin, proc.natl.acad.sci.usa, 77 (7): 4216-20 (1980); mouse support cells [ TM4, Mather, biol. reprod.23 (1): 243-52 (1980); human lung cells (w138.atccccl 75); human hepatocytes (Hep G2, HB 8065); and mouse mammary tumor (MMT060562, ATCC CCL 51).
Both expression and cloning vectors contain nucleic acid sequences that enable the vector to replicate in one or more selected host cells. Expression and cloning vectors will generally contain a selection gene, also referred to as a selectable marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins (e.g., neomycin, methotrexate, or tetracycline), (b) complement autotrophic deficiencies, or (c) supplement key nutrients not available from complex media (e.g., the gene encoding bacillus D-alanine racemase).
Examples of suitable selectable markers for mammalian cells are those which are capable of identifying cells competent to take up the nucleic acid encoding the fusion protein, such as DHFR or thymidine kinase. When wild-type DHFR is used, suitable host cells are e.g. [ Urlaub and Chasin, proc.natl.acad.sci.usa, 77 (7): 4216-20(1980) describe CHO cell lines deficient in DHFR activity that were prepared and propagated. Suitable selection genes for use in yeast are the trpl gene present in the yeast plasmid YRp7 [ Stinchcomb et al, Nature 282 (5734): 39-43 (1979); kingsman et al, Gene 7 (2): 14l-52 (1979); tschumper et al, Gene 10 (2): 157-66(1980)]. The Trp1 gene provides a selectable marker for yeast mutants lacking the ability to grow in tryptophan, such as atccno.44076 or PEPC1 [ Jones, Genetics 85: 23-33(1977)].
Expression and cloning vectors typically contain a promoter operably linked to the fusion protein-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well known. Examples of suitable promoter sequences for use with yeast hosts include 3-phosphoglycerate kinase [ Hitzeman et al, j.biol.chem.255 (24): 12073-80(1980) or other glycolytic enzymes [ Hess et al, j.adv.enzyme reg.7: 149 (1968); holland, Biochemistry 17 (23): 4900-7(1978), for example enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Other yeast promoters are the promoter regions of alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization, which are inducible promoters with the additional advantage of transcription controlled by growth conditions. Suitable vectors and promoters for yeast expression are further described in EP 73,657. For example, transcription of fusion protein-encoding mRNA from vectors in mammalian host cells can be controlled by promoters from viral genomes, such as polyoma virus, fowlpox virus, adenovirus (e.g., adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, retroviruses, hepatitis b virus, and simian virus 40(SV 40); heterologous mammalian promoters, such as the actin promoter or immunoglobulin promoter, and heat shock promoters, provided that these promoters are compatible with the host cell system.
Transcription of a polynucleotide encoding a fusion protein by higher eukaryotic cells can be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA that act on a promoter to increase its transcription, usually about 10 to 300 bp. Many enhancer sequences are known from mammalian genes (globin, elastase, albumin, a-keto protein (ketoprotein) and insulin). However, one will typically use eukaryotic viral promoters. Examples include the SV40 enhancer (bp 100-270) posterior to the origin of replication, the cytomegalovirus early promoter enhancer, the polyoma enhancer posterior to the origin of replication, and the adenovirus enhancer. The enhancer may be spliced into the vector at the 5 ' or 3 ' position of the fusion protein coding sequence, but is preferably located at the 5 ' position of the promoter.
Expression vectors for eukaryotic host cells (nucleated cells of yeast, fungi, insect, plant, animal, human, or other multicellular organisms) will also contain sequences necessary for transcription termination and stabilization of the mRNA. These sequences can be obtained generally from the 5 'and occasionally the 3' untranslated regions of eukaryotic or viral DNA or cDNA. These regions contain nucleotide segments that are transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding the fusion protein.
Various forms of fusion proteins can be recovered from the culture medium or host cell lysate. If membrane bound, it may be released from the membrane using a suitable detergent solution (e.g.Triton-X100) or enzymatic cleavage. The cells used in the expression of the fusion protein can be disrupted by a variety of physical or chemical methods, such as cyclic freeze-thawing, sonication, mechanical disruption, or cell lysis reagents.
Once the fusion protein of the invention is expressed in a suitable host cell, the analog can be isolated and purified. The following steps are representative of suitable purification steps: separating carboxymethyl cellulose by classification; gel filtration such as Sephadex G-75; anion exchange resins such as DEAE or Mono-Q; cation exchange such as CM or Mono-S; a metal chelating column to bind an epitope tag form of the polypeptide; reversed phase HPLC; carrying out chromatographic focusing; silica gel; ethanol precipitation; and ammonium sulfate precipitation.
A variety of protein purification Methods can be used, such Methods are well known in the art and are described, for example, in Deutscher, Methods in Enzymology 182: 83-9(1990) and Scopes, protein purification: principlsand Practice, Springer-Verlag, NY (1982). The purification step chosen depends on the production method used and on the nature of the particular fusion protein produced. For example, fusion proteins comprising an Fc fragment can be efficiently purified using protein A or protein G affinity matrices. The fusion protein may be eluted from the affinity matrix using low or high pH buffers. Mild elution conditions will help to prevent irreversible denaturation of the fusion protein.
The Fc portion contains substitutions at positions 11, 85, 86, 87, 91 with a deletion of the terminal lys. The substituted analogs reduce the immunogenicity of the fusion protein.
The GLP-1 analogue can be prepared into a dimer by a method for example, GLP-1 analogue polypeptide with the purity of more than 98% is dissolved in 50mmol/LTris-HCl (pH8.5) buffer, and the final concentration of the polypeptide is 1-2.0 mg/ml. The reaction was carried out overnight at 4 ℃ or at room temperature for 4-6 hours, 90% of the GLP-1 analog was analyzed by RP-HPLC to form a dimer, the reaction was terminated by adding 1% TFA, and the dimer was separated by purification using a C18 column.
The invention uses double activated polyethylene glycol molecule as conjugate, and reduces the conjugate and free sulfhydryl at C end of analog polypeptide into covalent polyethylene glycol coupled homodimer under certain reaction condition: GLP-1-Cys-PEG-Cys- -GLP-1.
The dual activated polyethylene glycol molecules (PEG) include dual activated maleic acid PEG, characterized as MAL-PEG-MAL; or comprise a di-activated sulfhydryl PEG characterized as SH-PEG-SH. Or comprise di-activated ortho-dipyridyl disulfide (PEG) characterized as OPSS-PEG-OPSS.
The polyethylene glycol molecules used in the homodimers of insulinotropic peptide analogs of the present invention can be small or large molecules with molecular weights ranging from 500 to 40,000 daltons. The significant prolongation of the in vivo half-life of the insulinotropic hormone secretagogue peptide analogue dimer can be achieved by using a PEG molecule with a molecular weight of more than 5000 as a dimer conjugate. The preferred PEG molecule size is between 10KD-40 KD.
The fusion protein of the invention contains an Fc portion derived from human IgG4 but includes multiple substitutions compared to the wild-type human sequence. The Fc portion consists of the two heavy chain constant regions of the antibody bound by non-covalent interactions and disulfide bonds. The Fc portion may comprise a hinge region and may be via CH2And CH3The domain extends to the C-terminus of the antibody. The Fc portion may also comprise one or more glycosylation sites.
Native Glp-1 (7-37):
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Gly
the GLP-1 analog portion of the fusion protein of the invention comprises substitutions at positions 31, 36, and 37 (i.e., substitutions at positions 25, 30, and 31 of the new numbering of the invention) as compared to native Glp-1 (7-37).
The GLP-1 analogue can be artificially and chemically synthesized by adopting an Fmoc method, and the C terminal of the GLP-1 analogue is amidated.
The GLP-1 analogs of the present invention may be salified, including a variety of inorganic or organic salts, such as hydrochloride, phosphate, sulfate, maleate, oxalate, citrate, and lactate; salt formation with certain inorganic and organic bases, such as sodium hydroxide and N-methyl-glucosamine.
The GLP-1 analogs of the invention can be administered as a single drug or can be administered in combination with other drugs, or as a pharmaceutically acceptable carrier, or to provide a modifiable active precursor for long-acting polypeptide drug development. The application of the GLP-1 analogue or the pharmaceutically acceptable salt thereof and a pharmaceutically acceptable carrier can be used. The "pharmaceutically acceptable carrier" does not destroy the pharmaceutical activity of the compounds of the present invention and their useful salts, while the effective amount thereof is non-toxic to humans. The "pharmaceutically acceptable carrier" can be used without limitation: ion exchange materials, aluminum stearate, lecithin, serum proteins for pharmaceutical preparations, saturated vegetable fatty acids, cellulosic substances, ethylene-polyoxyethylene-block polymers, cyclodextrins or chemically modified derivatives or other soluble derivatives thereof, and the like.
Other pharmaceutically acceptable excipients such as fillers such as anhydrous lactose, starch, lactose beads and glucose, binders such as microcrystalline cellulose, disintegrants such as croscarmellose sodium, croscarmellose starch, low-substituted hydroxypropylcellulose, lubricants such as magnesium stearate, absorption enhancers, excipients, solubilizers and colorants may also be added to the pharmaceutical composition of the present invention.
The above GLP-1 analogues or pharmaceutically acceptable salts thereof and pharmaceutical compositions of the present invention can be administered by enteral or parenteral routes. Parenteral routes of administration include subcutaneous, intradermal, intramuscular, nasal, mucosal administration or inhalation. The preparation can be developed into injection, cream, ointment, patch, aerosol, etc.
The GLP-1 analogue or the fusion protein thereof or the pharmaceutically acceptable salt thereof and the pharmaceutical composition can be used for single-drug or combined-drug treatment of related diseases, and are within the scope understood by a person skilled in the art.
The fusion protein can be used for producing a medicament for treating non-insulin-dependent diabetes mellitus and also can be used for producing a medicament for treating obesity or inducing weight loss in an overweight subject.
The present invention provides explanations of some terms to facilitate better understanding of the technical aspects of the present invention.
A "polypeptide" is a polymer of amino acid residues joined by peptide bonds, either naturally occurring or synthetic. Polypeptides of less than about 10 amino acid residues are commonly referred to as "peptides".
The "Fc" region contains the CH comprising antibody2And CH3Two heavy chain fragments of a domain. Two heavy chain fragments consisting of two or more disulfide bonds and passing through CH3The hydrophobic interaction of the domains remains together.
A "protein" is a macromolecule comprising one or more polypeptide chains. Proteins may also contain non-peptide components, such as carbohydrate groups. Carbohydrates and other non-peptide substituent groups may be added to the protein in the cell in which the protein is produced, and the groups will vary from cell type to cell type. Proteins are defined herein below in terms of their amino acid backbone structure; substituents such as the subject carbohydrate groups are generally not specified, but may still be present.
"heterologous peptide" a peptide or polypeptide encoded by a non-host DNA molecule is a "heterologous" peptide or polypeptide.
A "cloning vector" is a nucleic acid molecule, such as a plasmid, cosmid, or phage, that is capable of autonomous replication in a host cell. Cloning vectors typically contain one or a small number of restriction endonuclease recognition sites that permit insertion of nucleic acid molecules in a specific manner without loss of the essential biological function of the vector, as well as nucleotide sequences encoding marker genes suitable for use in the identification and selection of cells transformed with the cloning vector. Marker genes generally include genes that provide tetracycline resistance or ampicillin resistance. Cloning vectors, which have a relaxed replicon and can carry a foreign gene and replicate and amplify in host cells, are vectors used for cloning and amplifying DNA fragments (genes).
"expression vector" is a vector having the basic elements of a cloning vector (ori, Ampr, Mcs, etc.) and further having DNA sequences necessary for transcription/translation; is a bacterium used for engineering production, the introduced target gene can be expressed in the bacterium to produce the required product, the introduced gene is produced by cloning vector, the expression vector has higher protein expression efficiency, generally because of having strong promoter.
A "recombinant host" is a cell that contains a heterologous nucleic acid molecule, such as a cloning vector or an expression vector.
An "integrated transformant" is a recombinant host cell in which heterologous DNA is integrated into the genomic DNA of the cell.
A "fusion protein" is a hybrid protein expressed from a nucleic acid molecule comprising the nucleotide sequences of at least two genes.
"immunoglobulin moiety" refers to a polypeptide comprising an immunoglobulin constant region. For example: the immunoglobulin moiety may comprise a heavy chain constant region.
"expression" refers to the biosynthesis of a gene product. For example, in the case of a structural gene, expression includes transcription of the structural gene into mRNA and translation of the mRNA into one or more polypeptides.
Detailed Description
The present invention will be described in more detail with reference to examples. It will be apparent to those skilled in the art from this disclosure that these examples are provided only for illustrating the present invention more specifically and are not intended to limit the scope of the present invention. Examples are exemplified by HIP-1 to 12. All the materials used, not specifically mentioned, are commercially available.
Some common abbreviations in the present invention have the following meanings:
abbreviations | Means of |
Resin | Resin composition |
Fmoc | Fmoc group |
HOBt | 1-hydroxybenzotriazoles |
DCM | Methylene dichloride |
DMF | N, N-dimethylformamide |
DIC | N, N' -diisopropylcarbodiimide |
Rink Amide-MBHA Resin | Rink amide-MBHA |
MeOH | Methanol |
TFA | Trifluoroacetic acid |
TIS | Tri-isopropyl silane |
DTT | Dithiothreitol |
-Trt | Trityl radical |
-tBu | Tert-butyl radical |
-Pbf | 2,2, 4, 6, 7-pentamethylbenzofuran-5-sulfonyl |
-Boc | Tert-butyloxycarbonyl radical |
-OtBu | Tert-butyl ester radical |
Reference example 1 preparation of Fmoc-L- β -Phe-OH
Fmoc-L- β -Phe-OH, Fmoc-L- β -His (Trt) -OH and Fmoc-L- β -Trp-OH are prepared by Synthetic Communications, vol.32, nb.4, (2002) and p.651-657.
The synthetic route of Fmoc-L- β -Phe-OH is as follows:
preparation of Fmoc-L- β -Phe-OH:
TsC1(0.21g, 1.1mmol) and pyridine (0.08ml, 1mmol) were added to a solution of Fmoc-Phe-OH (0.39g, 1mmol) in THF (5ml), stirred at 0 ℃ for 15min, saturated CH was added to the reaction mixture2N2Dry CH of2Cl2(20mL) the solution was stirred at 0 ℃ for lh. The progress of the reaction was checked by TLC and IR. After the reaction is completed, acetic acid is added dropwise to decompose excess CH2N2. Sequentially with NaHCO3(25 ml. times.3), 5% HCl (25 ml. times.3) and NaCl wash mixture, anhydrous Na2SO4Drying, distilling under reduced pressure, and collecting the residue with CH2Cl2Recrystallization from hexane gave crystals of intermediate II.
Intermediate II (1mmol) was dissolved in a solution of 1, 4-dioxane (10ml) and water (5ml), silver benzoate (5.7mg, 0.025mmol) was added and refluxed at 70 ℃ for 6 hours. Filtering, and distilling the filtrate under reduced pressure. Dissolving the residue in saturated Na2CO3The mixture was stirred for 20 minutes in an aqueous solution (20 ml). The mixture was washed with diethyl ether (20 ml. times.3). The pH of the aqueous phase was adjusted to 2 and extracted with ethyl acetate (20 ml. times.3). The extracts were combined, washed with water (20 ml. times.2) and the organic phase with anhydrous Na2SO4Drying and distilling under reduced pressure. The residue is substituted by CH2Cl2Recrystallizing in hexane to obtain Fmoc-L- β -Phe-OH.
Reference example 2 preparation of sequence 10
Sequence 10: molecular weight 3660.1;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βPhe-Leu-Val-Lys-Gly-Ala-Gly-Gly-Lys-Lys-Cys-NH2。
this example provides a method for producing a polypeptide of sequence 10.
The preparation method of the sequence 10 polypeptide comprises the following steps: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 10 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 10 polypeptide crude product, and preparing and purifying the sequence 10 polypeptide crude product to obtain the sequence 10 polypeptide refined product.
(1) Preparation of sequence 10 polypeptide peptide resin:
rink Amide-MBHA Resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH were added to a 100ml solid phase reactor2Cl2Swelling the resin for 30min, removing CH2Cl2Removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution, after 10min, pumping out the reaction solution, removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution again, after 10min, pumping out the reaction solution, and washing the resin by using DMF (6 x 40ml) for later use; simultaneously placing 3.7g Fmoc-Cys (Trt) -OH (6.3mmol) and 0.9g HOBt (6.6mmol) in a 100ml flask, adding 40ml DMF for dissolving, cooling to 0-5 ℃ in ice bath, dropwise adding 1.0ml DIC (6.6mmol), reacting for 5min after the addition, transferring the reaction liquid to the obtained Fmoc-removed amino resin, and adding N2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40 ml).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Lys (Boc) -OH, Fmoc-Gly-OH, Fmoc-Ala-OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-Phe-OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (Trt) -OH, Fmoc-Gly-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Leu-OH, Fmoc-Tyr (Tyr) -OH, Fmoc-Glu (Thr) and Fmoc-Glu-OH, sequentially, performing a coupling reaction, wherein after removing the protective groups of Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-.
(2) Preparation of crude polypeptide of sequence 10:
adding the amino resin loaded peptide resin of sequence 10 polypeptide prepared above into 150ml TFA/TIS/H2Removing resin and side chain protecting group from O (v/v/v ═ 95: 2.5) mixed solution, suction filtering after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with O (volume ratio of 95: 2.5) mixed solution (10ml), filtering, combining filtrates, concentrating under reduced pressure at 35 deg.C until no fraction flows out, slowly pouring the concentrated solution into 3L cold diethyl ether, stirring, precipitating white precipitate, filtering, grinding filter cake, washing with cold diethyl ether for 3 times, vacuum drying at 35 deg.C for 5 hr to obtain crude product of sequence 10 polypeptide 7.0g, yield: 91.0%, HPLC purity: 56.2 percent.
(3) Preparation of refined product of polypeptide of sequence 10:
7.0g of crude sequence 10 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: loading a C18 column, wherein the loading is performed for 1.8-2.2 g each time, the wavelength is 214nm, gradient elution is performed for 5min on (10% acetonitrile-H2O (containing 1% TFA) and the like, gradient elution is performed for 35min on 10% to 80% acetonitrile-H2O (containing 1% TFA), the flow rate is 50ml/min, a gradient system is adopted for elution, circulation sample injection purification is performed, a main peak is collected and the purity is detected by an analytical liquid phase, the purity of the purified main peak is more than 98%, and the prepared eluent is subjected to reduced pressure concentration under the water bath condition of 30-35 ℃ to obtain a primary purified material concentrated solution.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting target fraction, concentrating the collected fraction at water temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain refined product of sequence 10 polypeptide 3.1g, with total yield of 40.5% and HPLC purity of 99.1%.
Reference example 3 preparation of sequence 11
Sequence 11: molecular weight 4059.6;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βPhe-Leu-Val-Lys-Gly-Pro-Lys-Lys-Pro-Phe-Phe-Pro-Cys-NH2。
this example provides a method for producing a polypeptide of sequence 11.
The preparation method of the sequence 11 polypeptide comprises the following steps: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 11 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 11 polypeptide crude product, and preparing and purifying the sequence 11 polypeptide crude product to obtain the sequence 11 polypeptide refined product.
(1) Preparation of sequence 11 polypeptide peptide resin:
rink Amide-MBHA Resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH were added to a 100ml solid phase reactor2Cl2Swelling the resin for 30min, removing CH2Cl2Removing Fmoc protecting group on the resin with 30ml of 20% piperidine/DMF solution, after 10min, removing the reaction solution, removing Fmoc protecting group on the resin with 30ml of 20% piperidine/DMF solution again, after 10min, removing the reaction solution, washing the resin with DMF (6X 40ml)And is ready for use; simultaneously placing 3.7g Fmoc-Cys (Trt) -OH (6.3mmol) and 0.9g HOBt (6.6mmol) in a 100ml flask, adding 40ml DMF for dissolving, cooling to 0-5 ℃ in ice bath, dropwise adding 1.0ml DIC (6.6mmol), reacting for 5min after the addition, transferring the reaction liquid to the obtained Fmoc-removed amino resin, and adding N2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40 ml).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Pro-OH, Fmoc-Phe-OH, Fmoc-Pro-OH, Fmoc-Lys (Boc) -OH, Fmoc-Pro-OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-Phe-OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (t) -OH, Fmoc-Gly-OH, Fmoc-Glu (Glu-Glu) (OH, Fmoc-Leu-OH, Fmoc-Tyr-Ala-OH, Fmoc-Thr-OH, Fmoc-Thr-Thru-NH-OH, Fmoc-Ser-Lys (Boc) -OH, Fmoc-Boc-L-OH, Fmoc-L-OH, Fmoc-L-OH, Fmoc-L-OH, Fmoc-T-OH, Fmoc-N (T-OH, Fmoc-T-OH, Fmoc-T-OH, F.
(2) Preparation of crude polypeptide of sequence 11:
adding the amino resin loaded peptide resin of the sequence 11 polypeptide prepared in the above into 150ml TFA/TIS/H2Removing resin and side chain protecting group from O (v/v/v ═ 95: 2.5) mixed solution, suction filtering after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with mixed solution (10ml) of O (volume ratio of 95: 2.5), vacuum filtering, mixing filtrates, concentrating under reduced pressure at 35 deg.C until no distillate flows out, slowly pouring the concentrated solution into 3L cold diethyl etherStirring to separate out white precipitate, filtering, crushing the filter cake, washing with cold ether for 3 times, and vacuum drying at 35 deg.c for 5 hr to obtain crude polypeptide of sequence 11 in 7.3g and yield: 85.3%, HPLC purity: 51.7 percent.
(3) Preparation of refined product of polypeptide of sequence 11:
7.3g of crude sequence 11 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: loading a C18 column, wherein the loading is performed for 1.8-2.2 g each time, the wavelength is 214nm, gradient elution is performed for 5min on (10% acetonitrile-H2O (containing 1% TFA) and the like, gradient elution is performed for 35min on 10% to 80% acetonitrile-H2O (containing 1% TFA), the flow rate is 50ml/min, a gradient system is adopted for elution, circulation sample injection purification is performed, a main peak is collected and the purity is detected by an analytical liquid phase, the purity of the purified main peak is more than 98%, and the prepared eluent is subjected to reduced pressure concentration under the water bath condition of 30-35 ℃ to obtain a primary purified material concentrated solution.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting target fraction, concentrating the collected fraction at water temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain refined product of sequence 11 polypeptide 3.0g, with total yield of 35.6% and HPLC purity of 99.3%.
Reference example 4 preparation of sequence 12
Sequence 12: molecular weight 3717;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βPhe-Leu-Val-Lys-Gly-Glu-Gly-Gly-Lys-Lys-Cys-NH2。
this example provides a method for producing a polypeptide of sequence 12.
Preparation method of the sequence 12 polypeptide: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 12 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 12 polypeptide crude product, and preparing and purifying the sequence 12 polypeptide crude product to obtain a sequence 12 polypeptide refined product.
(1) Preparation of sequence 12 polypeptide peptide resin:
rink Amide-MBHA Resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH were added to a 100ml solid phase reactor2Cl2Swelling the resin for 30min, removing CH2Cl2Removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution, after 10min, pumping out the reaction solution, removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution again, after 10min, pumping out the reaction solution, and washing the resin by using DMF (6 x 40ml) for later use; simultaneously placing 3.7g Fmoc-Cys (Trt) -OH (6.3mmol) and 0.9g HOBt (6.6mmol) in a 100ml flask, adding 40ml DMF for dissolving, cooling to 0-5 ℃ in ice bath, dropwise adding 1.0ml DIC (6.6mmol), reacting for 5min after the addition, transferring the reaction liquid to the obtained Fmoc-removed amino resin, and adding N2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40 ml).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Lys (Boc) -OH, Fmoc-Gly-OH, Fmoc-Glu (OtBu) -OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-Phe-OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu Otbu-OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (Trt) -OH, Fmoc-Gly-OH, Fmoc-Glu-Otbu) -OH, Fmoc-Leu-OH, Fmoc-Ty-Tyr-OH, Fmoc-Glu-OH, Fmoc-Thr-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu (Boc-Lys-OH, Fmoc-Gly-Ala-OH, Fmoc-Ala-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu, Fmoc-Glu-OH, Fmoc-Glu.
(2) Preparation of crude polypeptide of sequence 12:
adding the amino resin-loaded peptide resin of sequence 12 prepared above into 150ml TFA/TIS/H2Removing resin and side chain protecting group from O (v/v/v ═ 95: 2.5) mixed solution, suction filtering after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with O (volume ratio of 95: 2.5) mixed solution (10ml), filtering, combining filtrates, concentrating under reduced pressure at 35 ℃ until the fraction does not flow out, slowly pouring the concentrated solution into 3L cold diethyl ether, stirring, separating out white precipitate, filtering, grinding the filter cake, washing with cold diethyl ether for 3 times, and vacuum drying at 35 ℃ for 5 hours to obtain 7.0g crude polypeptide of sequence 12, yield: 89.6%, HPLC purity: 55.5 percent.
(3) Preparation of refined product of polypeptide of sequence 12:
7.0g of crude sequence 12 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: loading a C18 column, wherein 2.0-2.5 g of the C18 column is loaded each time, the wavelength is 214nm, gradient elution is performed for 5min on (10% acetonitrile-H2O (containing 1% TFA) and the like, gradient elution is performed for 35min on 10% to 80% acetonitrile-H2O (containing 1% TFA), the flow rate is 50ml/min, gradient system elution is adopted, circulating sample injection purification is performed, a main peak is collected and the purity is detected by an analytical liquid phase, the purity of the purified main peak is more than 98%, and the prepared eluent is subjected to reduced pressure concentration under the water bath condition of 30-35 ℃ to obtain a primary purified material concentrated solution.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting target fraction, concentrating the collected fraction at water temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain refined product of sequence 12 polypeptide 3.1g, with total yield of 39.8% and HPLC purity of 98.8%.
Reference example 5 preparation of sequence 13
Sequence 13: molecular weight 3687;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βPhe-Leu-Val-Lys-Gly-Val-Gly-Gly-Lys-Lys-Cys-NH2。
this example provides a method for producing a polypeptide of sequence 13.
The preparation method of the sequence 13 polypeptide comprises the following steps: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 13 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 13 polypeptide crude product, and preparing and purifying the sequence 13 polypeptide crude product to obtain the sequence 13 polypeptide refined product.
(1) Preparation of sequence 13 polypeptide peptide resin:
a100 ml solid phase reactor was charged with Rinkamide-MBHAResin resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH2Cl2Swelling the resin for 30min, removing CH2Cl2Removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution, after 10min, pumping out the reaction solution, removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution again, after 10min, pumping out the reaction solution, and washing the resin by using DMF (6 x 40ml) for later use; simultaneously placing 3.7g Fmoc-Cys (Trt) -OH (6.3mmol) and 0.9g HOBt (6.6mmol) in a 100ml flask, adding 40ml DMF for dissolving, cooling to 0-5 ℃ in ice bath, dropwise adding 1.0ml DIC (6.6mmol), reacting for 5min after the addition, transferring the reaction liquid to the obtained Fmoc-removed amino resin, and adding N2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40 ml).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Lys (Boc) -OH, Fmoc-Gly-OH, Fmoc-Val-OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-Phe-OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (Trt) -OH, Fmoc-Gly-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Leu-OH, Fmoc-Tyr (Tyr) -OH, Fmoc-Glu (Thru) -OH, Fmoc-Glu-OH, Fmoc-Glu (Boc-Glu) and Fmoc-Glu (20ml) to obtain a solution, drying the peptide, removing peptide (DMF) and the peptide, after the peptide-Glu-HCl/DMF-HCl is removed by a reaction (DMF) and the peptide-HCl, the peptide-HCl, the peptide is removed by a reaction step (DMF) and the amino-HCl is performed by a reaction step (DMF) and the steps are performed by a reaction (20 ml-HCl, wherein the peptide-HCl is performed by a reaction step (20-HCl, the amino-HCl-.
(2) Preparation of crude sequence 13 polypeptide:
adding the amino resin loaded polypeptide peptide resin of sequence 13 prepared in the above into 150ml TFA/TIS/H2Removing resin and side chain protecting group from O (v/v/v ═ 95: 2.5) mixed solution, suction filtering after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with O (volume ratio of 95: 2.5) mixed solution (10ml), filtering, combining filtrates, concentrating under reduced pressure at 35 ℃ until the fraction does not flow out, slowly pouring the concentrated solution into 3L cold diethyl ether, stirring, separating out white precipitate, filtering, grinding the filter cake, washing with cold diethyl ether for 3 times, and vacuum-drying at 35 ℃ for 5 hours to obtain crude polypeptide of sequence 13 (6.9 g, yield): 88.9%, HPLC purity: 55.8 percent.
(3) Preparation of refined product of polypeptide of sequence 13:
6.9g of crude sequence 13 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: c18 column is selected for sampling, each time the sample is 2.0-2.5 g, the wavelength is 214nm, (10% acetonitrile-H)2Gradient elution of O (1% TFA) for 5min, 10% to 80% acetonitrile-H2Gradient eluting O (containing 1% TFA) for 35min at flow rate of 50ml/min, eluting with gradient system, circularly injecting sample, purifying, collecting main peak, and detecting purity with analytical liquid phase, wherein the purity of purified main peak should be greater than 98%. The prepared eluent is decompressed and concentrated under the condition of water bath at the temperature of 30-35 ℃ to obtain a concentrated solution of a primary purified material.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting the target fraction, and concentrating the collected fraction at a temperature below 35 deg.C under reduced pressureAnd (3) the distillate does not flow out any more, and the rest distillate is freeze-dried to obtain a refined product, and the refined product is freeze-dried in vacuum to obtain 3.0g of the refined product of the sequence 13 polypeptide, wherein the total yield is 38.5 percent, and the HPLC purity is 98.6 percent.
Reference example 6 preparation of sequence 14
Sequence 14: molecular weight 3982;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βPhe-Leu-Val-Lys-Gly-Val-Lys-Lys-Lys-Gln-Gln-NH2。
this example provides a method for producing a polypeptide of sequence 14.
The preparation method of the sequence 14 polypeptide comprises the following steps: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 14 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 14 polypeptide crude product, and preparing and purifying the sequence 14 polypeptide crude product to obtain a sequence 14 polypeptide refined product.
(1) Preparation of sequence 14 polypeptide peptide resin:
a100 ml solid phase reactor was charged with Rinkamide-MBHAResin resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH2Cl2Swelling the resin for 30min, removing CH2Cl2Removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution, after 10min, pumping out the reaction solution, removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution again, after 10min, pumping out the reaction solution, and washing the resin by using DMF (6 x 40ml) for later use; simultaneously, 3.8g of Fmoc-Gln (Trt) -OH (6.3mmol) and 0.9g of HOBt (6.6mmol) are placed in a 100ml flask, 40ml of DMF is added for dissolution, the temperature is reduced to 0-5 ℃ in an ice bath, 1.0ml of DIC (6.6mmol) is added dropwise, reaction is carried out for 5min after the addition is finished, the reaction liquid is transferred to the obtained amino resin with Fmoc removed, and N is2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40 ml).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Gln (Trt) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-Phe-OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (Trt) -OH, Fmoc-Gly-Glu-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Glu-OH, Fmoc-Gln (Tyr) -OH, Fmoc-Thr-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-.
(2) Preparation of crude polypeptide of sequence 14:
adding the amino resin-loaded peptide resin of the sequence 14 polypeptide prepared in the above into 150ml TFA/TIS/H2Removing resin and side chain protecting group from O (v/v/v ═ 95: 2.5) mixed solution, suction filtering after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with O (volume ratio of 95: 2.5) mixed solution (10ml), filtering, combining filtrates, concentrating under reduced pressure at 35 ℃ until the fraction does not flow out, slowly pouring the concentrated solution into 3L cold diethyl ether, stirring, separating out white precipitate, filtering, grinding the filter cake, washing with cold diethyl ether for 3 times, and vacuum-drying at 35 ℃ for 5 hours to obtain 7.1g crude polypeptide of sequence 14, yield: 85.3%, HPLC purity: 52.1 percent.
(3) Preparation of refined product of polypeptide of sequence 14:
7.1g of crude sequence 14 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: c18 column is selected for sampling, each time the sample is 2.0-2.5 g, the wavelength is 214nm, (10% acetonitrile-H)2Gradient elution of O (1% TFA) for 5min, 10% to 80% acetonitrile-H2Gradient eluting with O (containing 1% TFA) for 35min at 50ml/min, and circulatingAnd (3) sampling and purifying, collecting a main peak, and detecting the purity by using an analysis liquid phase, wherein the purity of the purified main peak is more than 98%. The prepared eluent is decompressed and concentrated under the condition of water bath at the temperature of 30-35 ℃ to obtain a concentrated solution of a primary purified material.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting target fraction, concentrating the collected fraction at water temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain refined product of sequence 14 polypeptide 3.0g, with total yield of 36.2% and HPLC purity of 98.2%.
Reference example 7 preparation of sequence 15
Sequence 15: molecular weight 3650.1;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βHis-Leu-Val-Lys-Gly-Ala-Gly-Gly-Lys-Lys-Cys-NH2。
this example provides a method for producing a polypeptide of sequence 15.
The preparation method of the sequence 15 polypeptide comprises the following steps: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 15 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 15 polypeptide crude product, and preparing and purifying the sequence 15 polypeptide crude product to obtain the sequence 15 polypeptide refined product.
(1) Preparation of sequence 15 polypeptide peptide resin:
rink Amide-MBHA Resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH were added to a 100ml solid phase reactor2Cl2Swelling the resin for 30min, removing CH2Cl2Removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution, after 10min, pumping out the reaction solution, removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution again, after 10min, pumping out the reaction solution, and washing the resin by using DMF (6 x 40ml) for later use; 3.7g Fmoc-Cys (Trt) -OH (6.3mmol) and 0.9g HOBt (6.6mmol) are placed in a 100ml flask, 40ml DMF is added for dissolution, the temperature is reduced to 0-5 ℃ in ice bath, and 1.0ml DIC (6.6 ml DIC) is added dropwisemmol), reacting for 5min after the addition is finished, transferring the reaction liquid to the Fmoc-removed amino resin obtained above, and N2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40m 1).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Lys (Boc) -OH, Fmoc-Gly-OH, Fmoc-Ala-OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-His (Trt) -OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu-Ou tbOH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (Trt) -OH, Fmoc-Gly-OH, Fmoc-Glu-Ou, Fmoc-Leu-OH, Fmoc-Tyr-OH, Fmoc-Glu-Ou, Fmoc-OH, Fmoc-Throc-Thr-OH, Fmoc-Glu-NH-OH, Fmoc-Glu-OH, Fmoc-Glu-DMF, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu-OH, Fmoc-Glu.
(2) Preparation of crude polypeptide of sequence 15:
adding the amino resin loaded peptide resin of the sequence 15 polypeptide prepared in the above into 150ml TFA/TIS/H2Removing resin and side chain protecting group from O (v/v/v ═ 95: 2.5) mixed solution, suction filtering after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with O (volume ratio of 95: 2.5) mixed solution (10ml), filtering, combining filtrates, concentrating under reduced pressure at 35 ℃ until the fraction does not flow out, slowly pouring the concentrated solution into 3L cold diethyl ether, stirring, separating out white precipitate, filtering, grinding the filter cake, washing with cold diethyl ether for 3 times, and vacuum-drying at 35 ℃ for 5 hours to obtain crude polypeptide of sequence 15 (6.9 g, yield): 90.5%, HPLC purity: 57.6 percent.
(3) Preparation of refined product of polypeptide of sequence 15:
6.9g of crude sequence 15 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: c18 column is selected for sampling, each time the sample is 2.0-2.5 g, the wavelength is 214nm, (10% acetonitrile-H)2Gradient elution of O (1% TFA) for 5min, 10% to 80% acetonitrile-H2Gradient eluting O (containing 1% TFA) for 35min at flow rate of 50ml/min, eluting with gradient system, circularly injecting sample, purifying, collecting main peak, and detecting purity with analytical liquid phase, wherein the purity of purified main peak should be greater than 98%. The prepared eluent is decompressed and concentrated under the condition of water bath at the temperature of 30-35 ℃ to obtain a concentrated solution of a primary purified material.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting target fraction, concentrating the collected fraction at water temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain refined product of sequence 15 polypeptide 3.2g, with total yield of 41.6% and HPLC purity of 99.0%.
Reference example 8 preparation of sequence 16
Sequence 16: molecular weight 3671;
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-βTrp-Leu-Val-Lys-Gly-Ala-Lys-Lys-Ser-Cys-NH2。
this example provides a method for producing a polypeptide of sequence 16.
The preparation method of the sequence 16 polypeptide comprises the following steps: adopting amino resin with Fmoc protection, after removing Fmoc protecting group on the amino resin, synthesizing sequence 16 polypeptide peptide resin, removing resin and side chain protecting group to obtain sequence 16 polypeptide crude product, and preparing and purifying the sequence 16 polypeptide crude product to obtain the sequence 16 polypeptide refined product.
(1) Preparation of sequence 16 polypeptide peptide resin:
a100 ml solid phase reactor was charged with Rinkamide-MBHAResin resin (5.0g, 0.42mmol/g, 2.1mmol) and 40ml CH2Cl2Swelling the resin for 30min, removing CH2Cl2Removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution, after 10min, pumping out the reaction solution, removing the Fmoc protecting group on the resin by using 30ml of 20% piperidine/DMF solution again, after 10min, pumping out the reaction solution, and washing the resin by using DMF (6 x 40ml) for later use; simultaneously placing 3.7g Fmoc-Cys (Trt) -OH (6.3mmol) and 0.9g HOBt (6.6mmol) in a 100ml flask, adding 40ml DMF for dissolving, cooling to 0-5 ℃ in ice bath, dropwise adding 1.0ml DIC (6.6mmol), reacting for 5min after the addition, transferring the reaction liquid to the obtained Fmoc-removed amino resin, and adding N2After bubbling and mixing, condensation reaction was carried out for 3 hours, and then the reaction solution was taken out and the resin was washed with DMF (6X 40 ml).
Repeating the deprotection and condensation steps, sequentially coupling Fmoc-Ser (tBu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gly-OH, Fmoc-Lys (Boc) -OH, Fmoc-Val-OH, Fmoc-Leu-OH, Fmoc- β -L-Trp-OH, Fmoc-Ala-OH, Fmoc-Ile-OH, Fmoc-Phe-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Lys (Boc) -OH, Fmoc-Ala-OH, Fmoc-Gln (t) -OH, Fmoc-Gly-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Leu-OH, Fmoc-Tyr (Tyr) OH, Fmoc-Ala-OH, Fmoc-Gln (t) -OH, Fmoc-Gly-OH, Fmoc-Glu (Otbu) -OH, Fmoc-Leu-OH, Fmoc-Tyr (Tyr) and Thru-OH, after the deprotection and condensation steps, the peptide-Glu-peptide-Glu-peptide-Glu-peptide-Glu-peptide-Ala-OH, the peptide-Ala-OH, the peptide-Ala-OH.
(2) Preparation of crude polypeptide of sequence 16:
the amino resin-loaded peptide resin of sequence 16 prepared above was added to 150ml TFA/TIS/H2Removal of O (v/v/v ═ 95: 2.5) mixed solutionFiltering the resin and the side chain protecting group after 2.5H, collecting filtrate, and using TFA/TIS/H2Washing resin with O (volume ratio of 95: 2.5) mixed solution (10ml), filtering, combining filtrates, concentrating under reduced pressure at 35 ℃ until the fraction does not flow out, slowly pouring the concentrated solution into 3L cold diethyl ether, stirring, separating out white precipitate, filtering, grinding the filter cake, washing with cold diethyl ether for 3 times, and vacuum-drying at 35 ℃ for 5 hours to obtain crude polypeptide of sequence 16 (6.8 g, yield): 88.7%, HPLC purity: 55.6 percent.
(3) Preparation of refined product of polypeptide of sequence 16:
6.8g of crude sequence 16 polypeptide are dissolved in 20ml of 10% aqueous acetic acid, filtered through a 0.45 μm membrane and purified by preparative HPLC: c18 column is selected for sampling, each time the sample is 2.0-2.5 g, the wavelength is 214nm, (10% acetonitrile-H)2Gradient elution of O (1% TFA) for 5min, 10% to 80% acetonitrile-H2Gradient eluting O (containing 1% TFA) for 35min at flow rate of 50ml/min, eluting with gradient system, circularly injecting sample, purifying, collecting main peak, and detecting purity with analytical liquid phase, wherein the purity of purified main peak should be greater than 98%. The prepared eluent is decompressed and concentrated under the condition of water bath at the temperature of 30-35 ℃ to obtain a concentrated solution of a primary purified material.
And (3) carrying out secondary desalting purification on the concentrated solution of the material collected by the primary purification under the conditions of: mobile phase a phase: purifying the water; phase B: acetonitrile solution, detection wavelength: 215nm at a flow rate of 30-60 ml/min-1. Collecting target fraction, concentrating the collected fraction at water temperature below 35 deg.C under reduced pressure until the fraction does not flow out, lyophilizing the rest fraction to obtain refined product, and vacuum lyophilizing to obtain refined product of sequence 16 polypeptide 3.0g, with total yield 38.6% and HPLC purity 99.0%.
Validation example 1. in vitro activity or potency of a GLP-l compound fusion protein of the invention was determined.
The potency of analogues of GLP-1 was determined by stimulating the formation of cyclic AMP (cAMP) in media containing membranes expressing the human GLP-1 receptor.
Purified plasma membranes from the stably transfected cell line BHK467-12A (tk-ts 13) expressing the human GLP-1 receptor were stimulated with the GLP-1 analogue or derivative of interest, and cAMP-producing titers were measured with the AlphaScreen cAMP assay kit from Perkinelmer Life Sciences. The rationale for the AlphaScreen assay is the competition between endogenous cAMP and exogenously added biotin-cAMP. Capture of cAMP was achieved by specific antibody conjugated to acceptor beads.
Cell culture and membrane preparation: stably transfected cell lines and high expressing clones were selected for screening. Cells were plated in DMEM, 5% FCS, 1% Pen/Strep (penicillin/streptomycin) and 0.5mg/ml selection marker G418 at 5% CO2And (5) growing.
Cells at 2X approximately 80% confluence were washed with PBS, harvested with Versene (an aqueous solution of ethylenediaminetetraacetic acid tetrasodium salt), centrifuged at 1000rpm for 5 minutes, and the supernatant removed. The additional steps were all performed on ice. The cell pellet was homogenized in 10ml of buffer 1(20mM Na-HEPES, 10mM EDTA, pH 7.4) by Ultrathurax for 20-30 seconds, centrifuged at 20,000rpm for 15 minutes, and the pellet was resuspended in 10ml of buffer 2(20mM Na-HEPES, 0.1mM EDTA, pH 7.4). The suspension was homogenized for 20-30 seconds and centrifuged at 20,000rpm for 15 minutes. The homogenization and centrifugation were repeated again for the suspension in buffer 2, and the membrane was resuspended in buffer 2. Protein concentration was determined and the membranes were stored at-80 ℃ until use.
The assay was performed in a flat bottom 1/2-area 96-well plate. The final volume was 50. mu.l per well.
The solutions and reagents were as follows:
AlphaScreen cAMP assay kit from Perkin Elmer Life Sciences; contains anti-cAMP acceptor beads (10U/. mu.l), streptavidin donor beads (10U/. mu.l) and biotinylated-cAMP (133U/. mu.l).
AlphaScreen buffer, pH 7.4: 50mM TRIS-HCl; 5mM HEPES; 10mM MgCl2, 6H 2O; 150mM NaCl; 0.01% tween. The following were added to AlphaScreen buffer (to final concentrations) before use: BSA: 0.1 percent; IBMX: 0.5 mM; ATP: 1 mM; GTP: luM are provided.
cAMP standard (dilution factor in assay 5): cAMP solution: mu.L of 5 mMcAMP-stock + 495. mu. LAlphaScreen buffer.
Preparation of cAMP Standard and GLP-1 analogs or derivatives to be tested in AlphaScreen bufferSuitable dilution series of organisms, for example the following 8 concentrations of GLP-1 compound: 10-7、10-8、10-9、10-10、10-11、10-12、10-13And 10-14M and series 10 of e.g. cAMP-6To 3X 10-11。
Membrane/acceptor beads: hGLP-1/BHK 467-12A membrane was used; 6 μ g/well corresponds to 0.6mg/ml (the amount of membrane used per well can vary).
"without film": acceptor beads in AlphaScreen buffer (Final 15. mu.g/ml)
"6 μ g/pore membrane": membrane + acceptor beads in AlphaScreen buffer (final 15 μ g/ml).
Add 10. mu.l "no membrane" to cAMP standards (duplicate per well), positive and negative controls.
Add 10. mu.l "6. mu.g/well membrane" to GLP-1 and analogues (duplicate/triplicate per well)
Positive control: 10. mu.l "No Membrane" + 10. mu.l AlphaScreen buffer
Negative control: 10 μ l "No Membrane" +10 μ l cAMP stock solution (50 μ M)
Because the beads are sensitive to direct light, any treatment is in darkness (as dark as possible) or in green. All dilutions were made on ice.
The operation method comprises the following steps:
preparing an AlphaScreen buffer solution;
dissolving and diluting the GLP-1/analog/cAMP standard in AlphaScreen buffer;
a donor bead solution was prepared and incubated at R.T for 30 minutes;
cAMP/GLP-l/analog was added to the plate: 10. mu.l per well;
membrane/acceptor bead solutions were prepared and added to the plates: 10. mu.l per well;
addition of donor beads: 30. mu.l per well;
plates were wrapped in aluminum foil and incubated in a shaker at RT for 3 hours (very slow);
counting on AlphaScreen-each plate was preincubated for 3 minutes before counting in AlphaScreen;
from GrCalculation of EC by aph-Pad Prism software (version 5)50[pM]The values, results are shown in Table 1.
Dula glycopeptide EC50EC value of 13.712nM, HIP-150The value was 0.102 nM.
TABLE 1 in vitro Activity of the respective fusion proteins EC50Value comparison
Fusion proteins | EC50(nM) |
HIP-1 | 0.102 |
HIP-2 | 0.334 |
HIP-3 | 0.517 |
HIP-4 | 1.358 |
HIP-5 | 1.158 |
HIP-6 | 1.155 |
HIP-7 | 1.289 |
HIP-8 | 1.988 |
HIP-9 | 1.772 |
HIP-10 | 2.142 |
HIP-11 | 2.228 |
HIP-12 | 2.145 |
Dolaglutide | 13.712 |
Demonstration of the Effect of the linker on the Activity of the fused GLP-1 analog peptide of example 2
Comparison of peptide linker SEQ ID NO: 4-8 on the activity. The fusion peptides of the different linkers were tested for activity at the human GLP-1 receptor according to the cAMP assay method described in the above example. The fusion molecules tested were able to activate the human GLP-1 receptor in a cAMP cell-based assay. Linker sequences 4, 6 and 7 were slightly better than 5 and 8 in this assay.
Verification of blood glucose Change in db/db diabetic mice injected with a Single dose in example 3
Male diabetic db/db mice, 8 weeks old, with a body weight of 42 + -2 g, were randomly divided into 6 groups of 6 mice per group by body weight. The test group is injected subcutaneously according to the dosage of 1.5mg/kg, the positive group is injected with 1.5mg/kg of dolauda, the experiment 1 group is injected with 1.5mg/kg of HIP-1-12, and the model group is injected with PBS buffer solution with the same volume dosage (10 mE/kg). Each group of animals was subjected to intraperitoneal injection of 200. mu.l of a 40% glucose solution for each of pre-administration (0h), post-administration (1 h, 2h, 4h, 6h, 24h, 48h, 72h, 96h, 120h, 144h, 168h, 180h, 192h and 204h, respectively, and then blood glucose level RBG (blood glucose meter measurement) was measured by tail vein bleeding 30 and 60 minutes after administration of glucose. Blood glucose data are expressed as mean ± standard deviation (means ± SD) and analyzed using SPSS18.0 statistical software. Normal distribution, wherein the mean difference among multiple groups is subjected to one-factor variance analysis, the homogeneity of variance is detected by LSD, and the irregularity of variance is detected by Dunnett-T3; the non-normality distribution adopts a non-parameter test, and P < 0.05 represents that the statistical difference is significant.
Under the same dosage, the experimental group and the positive control drug, the dolacilin, have the function of reducing blood sugar. Through t-test, the random blood sugar values of animals in HIP-1-12 groups and the dolalupeptide group are remarkably reduced in comparison with a model group within 0 h-6 h, and P is less than 0.01, which shows that the onset time of the animals in vivo is similar, and the blood sugar reducing effect in a short time is similar; in addition, it is known from the change of mouse RBG values within 9 days after administration that the blood sugar-reducing effect of the dolastatin can be maintained to 3.5 days, the blood sugar value of the dolastatin has no statistical difference compared with that of the model group at 100 hours after administration, while the blood sugar level of the mice of HIP-1-12 groups can be maintained to 240 hours after administration, namely the blood sugar level of the mice at 11 days still has statistical difference (P < 0.05) compared with that of the model group, wherein the HIP-l group can be maintained to 276 hours after administration, and the HIP-9 group can be maintained to 240 hours after administration. The stability of other fusion proteins of the invention is also higher than that of dulaglutide. Therefore, the fusion protein has high stability and is expected to be developed into a long-acting GLP-l receptor agonist which is administrated once per week or longer.
TABLE 2 comparison of duration of time for which each fusion protein maintains hypoglycemic effects
Fusion proteins | Duration (h) to maintain hypoglycemic Effect |
HIP-1 | 276 |
HIP-2 | 270 |
HIP-3 | 272 |
HIP-4 | 266 |
HIP-5 | 269 |
HIP-6 | 263 |
HIP-7 | 263 |
HIP-8 | 260 |
HIP-9 | 240 |
HIP-10 | 248 |
HIP-11 | 242 |
HIP-12 | 242 |
Dolaglutide | 100 |
Verification example 4 db/db diabetic mice were given HIP-1-12 with random blood glucose and HbA1c content changes for 10 weeks
Male db/db SPF-grade mice (purchased from shanghai schleck laboratory animals ltd), 8 weeks old, after 1 week of adaptive breeding, 30 db/db mice were randomly assigned to 5 groups (n-6) according to Random Blood Glucose (RBG): the model group, the dolaglutide group and the HIP-1-12 are administrated according to low (0.75mg/kg), medium (1.5mg/kg) and high (3mg/kg) doses. Each administration group was given the corresponding dose of the drug solution by subcutaneous injection, and the model group was given the PBS buffer by subcutaneous injection in a volume of 10 ml/kg. Animals of each group were dosed once a week for 10 weeks, and RBG values of mice of each group at different blood sampling time points after dosing were measured with a glucometer (an accurate glucometer, product of changsanno biosensing gmbh) respectively, and data was recorded. Setting blood sampling time points: pre-first dose (0d), post-dose 7d, 14d, 2ld, 28d, 35d, 42d, 49d, 56d, 63d and 70 d. At 70d, each group of mice was fasted for 14 hours and then bled from the orbit, and then immediately the glycated hemoglobin assay kit (immunoturbidimetry) and its kit were used to measure the glycated hemoglobin (HbA1c) content of whole blood using a H700 specific protein analyzer, and the results were expressed as the percentage (%) of HbAlc to total hemoglobin.
Data are expressed as means ± standard deviation (means ± SD) and analyzed using SPSS18.0 statistical software. Normal distribution, single-factor analysis of variance for mean difference among multiple groups, Dunnett-t test for homogeneity of variance, and Dunnett-t3 test for heterogeneity of variance; the non-normality distribution adopts a non-parameter test, and P < 0.05 represents that the statistical difference is significant.
The change trend research of random blood sugar values of mice of each group after continuous administration for 10 weeks shows that the blood sugar values of the mice of the HIP-1-12 high, medium and low dose groups are reduced to a certain degree relative to the blood sugar value of the model group, and the blood sugar reducing activity of the mice shows dose dependence. The HIP-1-12 can be prompted to effectively and continuously control the blood sugar level of db/db diabetic mice. Moreover, the blood sugar reducing effects of the first administration and the last administration of the HIP-1-12 are similar, which indicates that the effect of reducing the receptor sensitivity caused by the long-term administration of the HIP-1-12 does not occur.
Glycated hemoglobin (HbA1c) is a product of the combination of blood glucose and hemoglobin of red blood cells, and is proportional to the level of blood glucose. Because the life of the red blood cells in the blood circulation is about 120 days, the glycosylated hemoglobin can reflect the total level of the blood sugar 4-12 weeks before blood taking, and the defect that the fasting blood sugar only reflects the instantaneous blood sugar is overcome. Therefore, HbA1c is the most important evaluation index for long-term blood glucose control and is also an important basis for clinical decision on whether to change the treatment scheme. The HbA1c test result in the embodiment can stably and reliably reflect the blood sugar control condition of the mouse 2-3 months before blood drawing. The results of measurement of HbAlc content of each group of mice after 10 weeks of continuous administration are shown in Table 3 below.
TABLE 3 effect of HIP-1 on random blood glucose in db/db mice (means + -SD, n ═ 6)
Note: p < 0.05 for each dose group compared to model group; p < 0.01. Units mmol/L.
Moreover, HlP-1 the proportion of HbAlc that decreased to normal levels (HbAlc < 7%) in the treated group of subjects at 0.75mg once weekly was 14% higher than that of dulaglutide at 0.75mg once weekly. The HIP-1-12 effect is approximate.
Sequence listing
<110> Lunan pharmaceutical group, Inc
<120> a heterologous fusion protein
<160>31
<170>SIPOSequenceListing 1.0
<210>1
<211>229
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(11)..(11)
<223>11 th X is Pro
<220>
<221>UNSURE
<222>(85)..(85)
<223>85 th X is Ala or Phe
<220>
<221>UNSURE
<222>(86)..(86)
<223> X at position 86 is Phe or Glu
<220>
<221>UNSURE
<222>(87)..(87)
<223> X at position 87 is Glu, Phe or Leu
<220>
<221>UNSURE
<222>(91)..(91)
<223> X at position 91 is Gly
<220>
<221>UNSURE
<222>(230)..(230)
<223> 230X deletions
<400>1
Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Xaa Cys Pro Ala Pro Glu
1 5 10 15
Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp
20 25 30
Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp
35 40 45
Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly
50 55 60
Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn
65 70 75 80
Ser Thr Tyr Arg Xaa Xaa Xaa Val Leu Thr Xaa Leu His Gln Asp Trp
85 90 95
Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro
100 105 110
Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu
115 120 125
Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn
130 135 140
Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile
145 150 155 160
Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr
165 170 175
Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg
180 185 190
Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys
195 200 205
Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu
210 215 220
Ser Leu Ser Leu Gly
225
<210>2
<211>229
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>2
Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Glu
1 5 10 15
Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp
20 25 30
Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp
35 40 45
Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly
50 55 60
Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn
65 70 75 80
Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu His Gln Asp Trp
85 90 95
Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro
100 105 110
Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu
115 120 125
Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn
130 135 140
Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile
145 150 155 160
Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr
165 170 175
Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg
180 185 190
Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys
195 200 205
Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu
210 215 220
Ser Leu Ser Leu Gly
225
<210>3
<211>229
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>3
Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Glu
1 5 10 15
Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp
20 25 30
Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp
35 40 45
Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly
50 55 60
Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn
65 70 75 80
Ser Thr Tyr Arg Phe Glu Phe Val Leu Thr Gly Leu His Gln Asp Trp
85 90 95
Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro
100 105 110
Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu
115 120 125
Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn
130 135 140
Gln Val Ser Leu Thr Cys Leu Val LysGly Phe Tyr Pro Ser Asp Ile
145 150 155 160
Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr
165 170 175
Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg
180 185 190
Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys
195 200 205
Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu
210 215 220
Ser Leu Ser Leu Gly
225
<210>4
<211>17
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>4
Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
1 5 10 15
Ser
<210>5
<211>15
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>5
Gly Gly Gly Gly Ser Lys Lys Lys Lys Ser Gly Gly Gly Gly Ser
1 5 10 15
<210>6
<211>22
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>6
Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
1 5 10 15
Ser Gly Gly Gly Gly Ser
20
<210>7
<211>25
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>7
Gly Gly Gly Gly Ser Lys Lys Lys Lys Ser Gly Gly Gly Gly Ser Lys
1 5 10 15
Lys Lys Lys Ser Gly Gly Gly Gly Ser
20 25
<210>8
<211>15
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>8
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15
<210>9
<211>31
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> Xaa at position 25 is Trp or Phe or His
<220>
<221>UNSURE
<222>(30)..(30)
<223> Xaa at position 30 is Pro or Val or Glu or Ala
<220>
<221>UNSURE
<222>(31)..(31)
<223> Xaa at position 31 is Lys-Lys-Lys-Gln-Gln-NH2 or Lys-Lys-Pro-Phe-Phe-Pro-Cys-NH2 or Lys-Lys-Ser-Cys-NH2 or Gly-Gly-Lys-Lys-Cys-NH2
<400>9
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Xaa Xaa
20 25 30
<210>10
<211>35
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> Xaa at position 25 is β Phe
<400>10
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Ala Gly Gly
20 25 30
Lys Lys Cys
35
<210>11
<211>35
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> Xaa at position 25 is β Phe
<400>11
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Pro Lys Lys Pro Phe
20 25 30
Phe Pro Cys
35
<210>12
<211>35
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> Xaa at position 25 is β Phe
<400>12
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Glu Gly Gly
20 25 30
Lys Lys Cys
35
<210>13
<211>35
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> Xaa at position 25 is β Phe
<400>13
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Val Gly Gly
20 25 30
Lys Lys Cys
35
<210>14
<211>35
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> Xaa at position 25 is β Phe
<400>14
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Val Lys Lys
20 25 30
Lys Gln Gln
35
<210>15
<211>35
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> position 25 Xaa is β His
<400>15
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Ala Gly Gly
20 25 30
Leu Leu Cys
35
<210>16
<211>34
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<221>UNSURE
<222>(25)..(25)
<223> position 25 Xaa is β Trp
<400>16
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Xaa Leu Val Lys Gly Ala Lys Lys
20 25 30
Ser Cys
<210>17
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>17
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Ala Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 5560
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>18
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>18
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Ala Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Phe Glu Phe Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>19
<211>280
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>19
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Ala Lys Lys
20 25 30
Ser Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
35 40 45
Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro
50 55 60
Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
65 70 75 80
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
85 90 95
Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr
100 105 110
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
115 120 125
Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu His
130 135 140
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
145 150 155 160
Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
165 170 175
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met
180 185 190
Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro
195 200 205
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
210 215 220
Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
225 230 235 240
Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu GlyAsn Val
245 250 255
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
260 265 270
Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>20
<211>280
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>20
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Ala Lys Lys
20 25 30
Ser Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
35 40 45
Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro
50 55 60
Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
65 70 75 80
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
85 90 95
Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr
100 105 110
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
115 120 125
Gln Phe Asn Ser Thr Tyr Arg Phe Glu Phe Val Leu Thr Gly Leu His
130 135 140
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
145 150 155 160
Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
165 170 175
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met
180 185 190
Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro
195 200 205
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
210 215 220
Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
225 230 235 240
Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val
245 250 255
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
260 265 270
Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>21
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>21
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Glu Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>22
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>22
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Glu Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Phe Glu Phe Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>23
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>23
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Val Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>24
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>24
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Val Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Phe Glu Phe Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>25
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>25
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Val Lys Lys
20 25 30
Lys Gln Gln Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>26
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>26
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Phe Leu Val Lys Gly Val Lys Lys
20 25 30
Lys Gln Gln Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Phe Glu Phe Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>27
<211>281
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>27
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala His Leu Val Lys Gly Ala Gly Gly
20 25 30
Lys Lys Cys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45
Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys
50 55 60
Pro Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro
65 70 75 80
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
85 90 95
Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp
100 105 110
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
115 120 125
Glu Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu
130 135 140
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
145 150 155 160
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
165 170 175
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu
180 185 190
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
195 200 205
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
210 215 220
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
225 230 235 240
Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn
245 250 255
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
260 265 270
Gln Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>28
<211>280
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>28
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Ala Lys Lys
20 25 30
SerCys Gly Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
35 40 45
Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro
50 55 60
Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
65 70 75 80
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
85 90 95
Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr
100 105 110
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
115 120 125
Gln Phe Asn Ser Thr Tyr Arg Ala Phe Glu Val Leu Thr Gly Leu His
130 135 140
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
145 150 155 160
Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
165 170 175
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met
180 185 190
Thr Lys Asn GlnVal Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro
195 200 205
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
210 215 220
Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
225 230 235 240
Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val
245 250 255
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
260 265 270
Lys Ser Leu Ser Leu Ser Leu Gly
275 280
<210>29
<211>843
<212>DNA
<213> Artificial Sequence (Artificial Sequence)
<400>29
cacgccgaag gaaccttcac ctcagatgtg tcatcatacc tcgaaggcga agccgccaag 60
gaattcatcg cattcctcgt gaaaggagca ggaggaaaga agtgcggcgc gggcggcggc 120
ggctcgggag gaggaggatc gggaggagga ggatcggccg aatcgaaata tggtcccccc 180
tgtccaccat gtcccgctcc cgaatttctt ggtggtcctt ctgtttttct ttttccccca 240
aaaccgaaag atactcttat gatttccggc actccagagg ttacttgtgt tgttgtggac 300
gcctcacagg aggatccaga ggcccaattc aattggtatg tcgatggcgt cgaagtccac 360
aacgccaaaa ccaaaccccg cgaagagcag ttcaattcta cttatcgtgc cttcgaagtc 420
cttactggtc tccatcaaga ttggcttaat ggcaaggaat acaaatgcaa ggtctccaac 480
aaaggcctcc cctcatcaat cgaaaaaacc atctcgaaag ccaaaggcca accccgcgag 540
ccccaggtct atactctccc cccctctcaa gaagagatga ccaaaaacca ggcctcactc 600
acgtgcctcg tcaaaggatt ctacccgtcc gatattgccg tcgagtggga atcaaacgga 660
gagccagaga acaactacaa aaccacccca ccggtgctcg attcagatgg ctcatttttc 720
ctctactcac gtctcactgt cgacaaatcc cgctggcagg aaggaaacgt cttttcatgt 780
tcagtcatgc acgaagccct ccacaatcat tatacccaaa aatcgctgtc actcatgttg 840
gga 843
<210>30
<211>20
<212>PRT
<213> Artificial Sequence (Artificial Sequence)
<400>30
Met Gly Lys Leu Ile Phe Trp Leu Val Phe Trp Leu Thr Ile Phe Trp
1 5 10 15
Leu Gly Ser Ala
20
<210>31
<211>60
<212>DNA
<213> Artificial Sequence (Artificial Sequence)
<400>31
atgggcaaac tcatattctg gcttgtgttt tggctgacta ttttttggct tggatcagct 60
Claims (11)
1. An immunoglobulin Fc fragment comprising SEQ ID NO: 1, the specific sequence is as follows:
Ala-Glu-Ser-Lys-Tyr-Gly-Pro-Pro-Cys-Pro-Xaal1-Cys-Pro-Ala-Pro-
Glu-Phe-Leu-Gly-Gly-Pro-Ser-Val-Phe-Leu-Phe-Pro-Pro-Lys-Pro-
Lys-Asp-Thr-Leu-Met-Ile-Ser-Arg-Thr-Pro-Glu-Val-Thr-Cys-Val-
Val-Val-Asp-Val-Ser-Gln-Glu-Asp-Pro-Glu-Val-Gln-Phe-Asn-Trp-
Tyr-Val-Asp-Gly-Val-Glu-Val-His-Asn-Ala-Lys-Thr-Lys-Pro-Arg-
Glu-Glu-Gln-Phe-Asn-Ser-Thr-Tyr-Arg-Xaa85-Xaa86-Xaa87-Val-Leu-Thr-
Xaa91-Leu-His-Gln-Asp-Trp-Leu-Asn-Gly-Lys-Glu-Tyr-Lys-Cys-Lys-
Val-Ser-Asn-Lys-Gly-Leu-Pro-Ser-Ser-Ile-Glu-Lys-Thr-Ile-Ser-
Lys-Ala-Lys-Gly-Gln-Pro-Arg-Glu-Pro-Gln-Val-Tyr-Thr-Leu-Pro-
Pro-Ser-Gln-Glu-Glu-Met-Thr-Lys-Asn-Gln-Val-Ser-Leu-Thr-Cys-
Leu-Val-Lys-Gly-Phe-Tyr-Pro-Ser-Asp-Ile-Ala-Val-Glu-Trp-Glu-
Ser-Asn-Gly-Gln-Pro-Glu-Asn-Asn-Tyr-Lys-Thr-Thr-Pro-Pro-Val-
Leu-Asp-Ser-Asp-Gly-Ser-Phe-Phe-Leu-Tyr-Ser-Arg-Leu-Thr-Val-
Asp-Lys-Ser-Arg-Trp-Gln-Glu-Gly-Asn-Val-Phe-Ser-Cys-Ser-Val-
Met-His-Glu-Ala-Leu-His-Asn-His-Tyr-Thr-Gln-Lys-Ser-Leu-Ser-
Leu-Ser-Leu-Gly,
wherein Ser at position 11, Val at position 85, Val at position 86, Ser at position 87 and Val at position 91 are replaced, and terminal Lys is deleted, and the method specifically comprises the following steps:
xaa at position 11 is Pro;
xaa at position 85 is Ala or Phe;
xaa at position 86 is Phe or Glu;
xaa at position 87 is Glu, Phe or Leu;
xaa at position 91 is GlV.
2. A heterologous fusion protein comprising a therapeutic peptide and an immunoglobulin Fc portion as set forth in claim 1.
3. The fusion protein of claim 2, wherein the therapeutic peptide is a GLP-1 analog having a C-terminal glycine residue fused to an N-terminal alanine residue of an Fc portion by a peptide linker, wherein the peptide linker is SEQ ID NO: 4, in particular to Gly-Ala-Gly-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser.
4. The fusion protein of claim 3, wherein the peptide linker is further selected from the group consisting of:
a) Gly-Gly-Gly-Gly-Gly-Ser-Lys-Lys-Lys-Lys-Ser-Gly-Gly-Gly-Gly-Ser, as shown in SEQ ID NO: 5 is shown in the specification;
b) Gly-Ala-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Gly-Ser, as shown in SEQ ID NO: 6 is shown in the specification;
c) Gly-Gly-Gly-Gly-Gly-Ser-Lys-Lys-Lys-Lys-Ser-Gly-Gly-Gly-Gly-Gly-Ser, as shown in SEQ ID NO: 7 is shown in the specification;
d) Gly-Gly-Gly-Gly-Gly-Ser, and is shown in SEQ ID NO: shown in fig. 8.
5. The fusion protein of claim 3 or 4, wherein the GLP-1 analog consists of the amino acid sequence of SEQ ID NO: sequence 9 consists of seq id no:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Xaa-Leu-Val-Lys-Gly-Xaa-Xaa;
xaa at position 25 is β Trp or β Phe or β His;
xaa at position 30 is Pro or Val or Glu or Ala;
xaa at the 31 th position is Lys-Lys-Lys-Gln-Gln-NH2;Lys-Lys-Pro-Phe-Phe-Pro-Cys-NH2;Lys-Lys-Ser-Cys-NH2;Gly-Gly-Lys-Lys-Cys-NH2。
6. A GLP-1 analogue, the sequence of which is shown in SEQ ID NO: 9, the method is as follows.
7. A peptide linker having the sequence set forth in SEQ ID NO: 4 or 5 or 6 or 7 or 8.
8. Use of a fusion protein according to any one of claims 1 to 4 for the manufacture of a medicament.
9. Use of a fusion protein according to any one of claims 1 to 4 for the manufacture of a medicament for the treatment of non-insulin dependent diabetes mellitus.
10. Use of a fusion protein according to any one of claims 1 to 4 for the manufacture of a medicament for treating obesity or inducing weight loss in an overweight subject.
11. The fusion protein of claim 4, wherein the GLP-1 analog is also exenatide, liraglutide, linagliptin, somagluteptide, Ideglira, albigluteptide, dulaglutide, ITCA650, LAEx4, Lixilan.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811471370.8A CN111269312B (en) | 2018-12-04 | 2018-12-04 | Heterologous fusion protein |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811471370.8A CN111269312B (en) | 2018-12-04 | 2018-12-04 | Heterologous fusion protein |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111269312A true CN111269312A (en) | 2020-06-12 |
CN111269312B CN111269312B (en) | 2023-05-09 |
Family
ID=71001297
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811471370.8A Active CN111269312B (en) | 2018-12-04 | 2018-12-04 | Heterologous fusion protein |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111269312B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114410676A (en) * | 2020-10-28 | 2022-04-29 | 中国科学院植物研究所 | Creation method and application of functional rice material for reducing blood sugar |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005000892A2 (en) * | 2003-06-12 | 2005-01-06 | Eli Lilly And Company | Glp-1 analog fusion plroteins |
KR20110039175A (en) * | 2009-10-09 | 2011-04-15 | (주)알테오젠 | Fusion of BLP-1 analogue, and composition for preventing or treating diabetes containing the same as an active ingredient |
CN105367664A (en) * | 2015-11-04 | 2016-03-02 | 成都贝爱特生物科技有限公司 | Preparation method for dual-functional fusion protein capable of activating GLP-1 receptor and Amylin receptor and application of fusion protein |
CN106397569A (en) * | 2015-08-01 | 2017-02-15 | 深圳红石科创生物科技发展有限公司 | Mutant cytokine fusion protein for treating metabolic diseases |
-
2018
- 2018-12-04 CN CN201811471370.8A patent/CN111269312B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005000892A2 (en) * | 2003-06-12 | 2005-01-06 | Eli Lilly And Company | Glp-1 analog fusion plroteins |
CN1802386A (en) * | 2003-06-12 | 2006-07-12 | 伊莱利利公司 | GLP-1 analog fusion plroteins |
KR20110039175A (en) * | 2009-10-09 | 2011-04-15 | (주)알테오젠 | Fusion of BLP-1 analogue, and composition for preventing or treating diabetes containing the same as an active ingredient |
CN106397569A (en) * | 2015-08-01 | 2017-02-15 | 深圳红石科创生物科技发展有限公司 | Mutant cytokine fusion protein for treating metabolic diseases |
CN105367664A (en) * | 2015-11-04 | 2016-03-02 | 成都贝爱特生物科技有限公司 | Preparation method for dual-functional fusion protein capable of activating GLP-1 receptor and Amylin receptor and application of fusion protein |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114410676A (en) * | 2020-10-28 | 2022-04-29 | 中国科学院植物研究所 | Creation method and application of functional rice material for reducing blood sugar |
Also Published As
Publication number | Publication date |
---|---|
CN111269312B (en) | 2023-05-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2004251145C1 (en) | GLP-1 analog fusion proteins | |
JP6991196B2 (en) | Methods for Producing Glucagon-Like Peptides | |
CN103732628B (en) | Multivalence heteropolymer frame design and construct | |
US20100317600A1 (en) | Process for Preparing Cardiodilatin Fragments; Highly Purified Cardiodilatin Fragments and Intermediate Products for the Preparation of Same | |
CN106928341B (en) | Fixed-point mono-substituted pegylated Exendin analogue and preparation method thereof | |
CN101389648A (en) | Peptide oxyntomodulin derivative | |
AU2005211776A1 (en) | Pancreatic polypeptide family motifs and polypeptides comprising the same | |
JPS61251700A (en) | Insulin receptor | |
KR20110043687A (en) | Truncated Analogs of Glucose-dependent Insulin Secretory Stimulating Polypeptides | |
CN113292646B (en) | GLP-1/glucagon dual agonist fusion proteins | |
CN111269312B (en) | Heterologous fusion protein | |
WO2021147869A1 (en) | Liraglutide derivative and preparation method therefor | |
CN111269321B (en) | GLP-1 analogue fusion protein | |
EP0315118A2 (en) | DNA coding for endothelin and use thereof | |
CN106008717B (en) | Long-acting recombinant GLP-1 fusion protein and preparation method and application thereof | |
WO2017214543A1 (en) | Glucagon analogs and methods of use thereof | |
CN116333041A (en) | A kind of polypeptide compound for activating GRP receptor and its use | |
JPH11514858A (en) | Bone stimulating factor | |
WO2017113445A1 (en) | Erythropoietin peptide, derivatives and polymers, preparation method and application | |
JP2002335972A (en) | Insulin receptor-associated receptor-binding protein and its application | |
US6872703B2 (en) | Insulin receptor-related receptor binding protein | |
TW202216747A (en) | Fusion protein comprising glucagon-like peptide-1 and interleukin-1 receptor antagonist, and use thereof | |
HK1149566B (en) | Glp-1 analog fusion plroteins | |
WO2000078804A1 (en) | Immune response regulatory proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |