CN110092835A - 一种glp-1类似物-col3a1融合蛋白 - Google Patents
一种glp-1类似物-col3a1融合蛋白 Download PDFInfo
- Publication number
- CN110092835A CN110092835A CN201810089639.XA CN201810089639A CN110092835A CN 110092835 A CN110092835 A CN 110092835A CN 201810089639 A CN201810089639 A CN 201810089639A CN 110092835 A CN110092835 A CN 110092835A
- Authority
- CN
- China
- Prior art keywords
- gly
- pro
- ala
- glu
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 108020001507 fusion proteins Proteins 0.000 description 78
- 102000037865 fusion proteins Human genes 0.000 description 76
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical class C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 description 68
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 56
- 108090000765 processed proteins & peptides Proteins 0.000 description 56
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 50
- 210000004027 cell Anatomy 0.000 description 49
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 39
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 37
- 108010029020 prolylglycine Proteins 0.000 description 37
- 229920001184 polypeptide Polymers 0.000 description 33
- 102000004196 processed proteins & peptides Human genes 0.000 description 33
- 101800000224 Glucagon-like peptide 1 Proteins 0.000 description 31
- 102100040918 Pro-glucagon Human genes 0.000 description 31
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 30
- 102100031611 Collagen alpha-1(III) chain Human genes 0.000 description 29
- 101000993285 Homo sapiens Collagen alpha-1(III) chain Proteins 0.000 description 29
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 29
- 238000000034 method Methods 0.000 description 26
- 108090000623 proteins and genes Proteins 0.000 description 26
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 24
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 23
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 22
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 21
- 108010047495 alanylglycine Proteins 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 19
- 235000018102 proteins Nutrition 0.000 description 19
- 102000004169 proteins and genes Human genes 0.000 description 19
- 108010077515 glycylproline Proteins 0.000 description 18
- 125000003275 alpha amino acid group Chemical group 0.000 description 17
- 241000282414 Homo sapiens Species 0.000 description 16
- 108091033319 polynucleotide Proteins 0.000 description 16
- 102000040430 polynucleotide Human genes 0.000 description 16
- 239000002157 polynucleotide Substances 0.000 description 16
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 16
- 108010079364 N-glycylalanine Proteins 0.000 description 15
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 15
- 235000001014 amino acid Nutrition 0.000 description 15
- 206010012601 diabetes mellitus Diseases 0.000 description 15
- 239000003814 drug Substances 0.000 description 15
- 108010026333 seryl-proline Proteins 0.000 description 15
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 14
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 14
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 13
- 150000001413 amino acids Chemical class 0.000 description 13
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 13
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 12
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 12
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 description 12
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 11
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 11
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 11
- 108010092854 aspartyllysine Proteins 0.000 description 11
- 201000010099 disease Diseases 0.000 description 11
- 229940079593 drug Drugs 0.000 description 11
- 238000001727 in vivo Methods 0.000 description 11
- 102000008186 Collagen Human genes 0.000 description 10
- 108010035532 Collagen Proteins 0.000 description 10
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 10
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 10
- 229920001436 collagen Polymers 0.000 description 10
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 10
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 9
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 9
- 102000016622 Dipeptidyl Peptidase 4 Human genes 0.000 description 9
- 108010067722 Dipeptidyl Peptidase 4 Proteins 0.000 description 9
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 9
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 9
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 9
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 8
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 8
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 8
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 8
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 8
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 8
- 210000004369 blood Anatomy 0.000 description 8
- 239000008280 blood Substances 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 8
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 8
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 7
- 102000002734 Collagen Type VI Human genes 0.000 description 7
- 108010043741 Collagen Type VI Proteins 0.000 description 7
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 7
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 7
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 7
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 230000005847 immunogenicity Effects 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 6
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 6
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 6
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 6
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 6
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 6
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 6
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 6
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 6
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 6
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 6
- 241000235058 Komagataella pastoris Species 0.000 description 6
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 6
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 6
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 6
- 239000011717 all-trans-retinol Substances 0.000 description 6
- 235000019169 all-trans-retinol Nutrition 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 108010004073 cysteinylcysteine Proteins 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- GCYXWQUSHADNBF-AAEALURTSA-N preproglucagon 78-108 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 GCYXWQUSHADNBF-AAEALURTSA-N 0.000 description 6
- 238000003259 recombinant expression Methods 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 5
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 5
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 5
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 5
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 5
- 108010086246 Glucagon-Like Peptide-1 Receptor Proteins 0.000 description 5
- 101800004266 Glucagon-like peptide 1(7-37) Proteins 0.000 description 5
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 5
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 5
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 5
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 5
- 239000004471 Glycine Substances 0.000 description 5
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 5
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 5
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 5
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 5
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 5
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 5
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 5
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 5
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 5
- 238000007796 conventional method Methods 0.000 description 5
- 108010020688 glycylhistidine Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 239000008194 pharmaceutical composition Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 4
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 4
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 4
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 4
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 4
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 4
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 4
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 4
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 4
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 4
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 4
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 4
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 4
- 230000003914 insulin secretion Effects 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 3
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 3
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 3
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 3
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 3
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 3
- 108010072062 GEKG peptide Proteins 0.000 description 3
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 3
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 3
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 3
- 102100032882 Glucagon-like peptide 1 receptor Human genes 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 3
- CAVKXZMMDNOZJU-UHFFFAOYSA-N Gly-Pro-Ala-Gly-Pro Natural products C1CCC(C(O)=O)N1C(=O)CNC(=O)C(C)NC(=O)C1CCCN1C(=O)CN CAVKXZMMDNOZJU-UHFFFAOYSA-N 0.000 description 3
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 3
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- 102000002068 Glycopeptides Human genes 0.000 description 3
- 108010015899 Glycopeptides Proteins 0.000 description 3
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- 108010019598 Liraglutide Proteins 0.000 description 3
- YSDQQAXHVYUZIW-QCIJIYAXSA-N Liraglutide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCNC(=O)CC[C@H](NC(=O)CCCCCCCCCCCCCCC)C(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 YSDQQAXHVYUZIW-QCIJIYAXSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 229960002701 liraglutide Drugs 0.000 description 3
- 150000007523 nucleic acids Chemical group 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 229940124597 therapeutic agent Drugs 0.000 description 3
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 2
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 208000017667 Chronic Disease Diseases 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 102000002322 Egg Proteins Human genes 0.000 description 2
- 108010000912 Egg Proteins Proteins 0.000 description 2
- 108010011459 Exenatide Proteins 0.000 description 2
- HTQBXNHDCUEHJF-XWLPCZSASA-N Exenatide Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)NCC(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CO)C(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 HTQBXNHDCUEHJF-XWLPCZSASA-N 0.000 description 2
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 2
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 2
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- 102000007446 Glucagon-Like Peptide-1 Receptor Human genes 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- 206010018473 Glycosuria Diseases 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- 241001506991 Komagataella phaffii GS115 Species 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 2
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 2
- 208000008589 Obesity Diseases 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 231100000433 cytotoxic Toxicity 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 235000014103 egg white Nutrition 0.000 description 2
- 210000000969 egg white Anatomy 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 229960001519 exenatide Drugs 0.000 description 2
- 210000002744 extracellular matrix Anatomy 0.000 description 2
- 210000001035 gastrointestinal tract Anatomy 0.000 description 2
- 239000003292 glue Substances 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000002218 hypoglycaemic effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 235000020824 obesity Nutrition 0.000 description 2
- 101710135378 pH 6 antigen Proteins 0.000 description 2
- 210000000496 pancreas Anatomy 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 238000004064 recycling Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 238000007086 side reaction Methods 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000000108 ultra-filtration Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- BOVGTQGAOIONJV-BETUJISGSA-N 1-[(3ar,6as)-3,3a,4,5,6,6a-hexahydro-1h-cyclopenta[c]pyrrol-2-yl]-3-(4-methylphenyl)sulfonylurea Chemical compound C1=CC(C)=CC=C1S(=O)(=O)NC(=O)NN1C[C@H]2CCC[C@H]2C1 BOVGTQGAOIONJV-BETUJISGSA-N 0.000 description 1
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- 235000010894 Artemisia argyi Nutrition 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- DBLPNHGKMDHWNZ-UHFFFAOYSA-N Asp Gly Arg Asn Chemical compound OC(=O)CC(N)C(=O)NCC(=O)NC(CCCN=C(N)N)C(=O)NC(CC(N)=O)C(O)=O DBLPNHGKMDHWNZ-UHFFFAOYSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- 210000002237 B-cell of pancreatic islet Anatomy 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 102000001189 Cyclic Peptides Human genes 0.000 description 1
- 108010069514 Cyclic Peptides Proteins 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- RFHGRMMADHHQSA-KBIXCLLPSA-N Cys-Gln-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RFHGRMMADHHQSA-KBIXCLLPSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010042546 GCGGCCGC-specific type II deoxyribonucleases Proteins 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- GTFYQOVVVJASOA-ACZMJKKPSA-N Glu-Ser-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N GTFYQOVVVJASOA-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- 102000051325 Glucagon Human genes 0.000 description 1
- 108060003199 Glucagon Proteins 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical group NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 208000013016 Hypoglycemia Diseases 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- CTBMEDOQJFGNMI-IHPCNDPISA-N Lys-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCCCN)N CTBMEDOQJFGNMI-IHPCNDPISA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- 101710129616 Protein glp-1 Proteins 0.000 description 1
- 101100221606 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) COS7 gene Proteins 0.000 description 1
- 102100037505 Secretin Human genes 0.000 description 1
- 108010086019 Secretin Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- 241000218636 Thuja Species 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- GQYPNFIFJRNDPY-ONUFPDRFSA-N Trp-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 GQYPNFIFJRNDPY-ONUFPDRFSA-N 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000002269 analeptic agent Substances 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 230000000636 anti-proteolytic effect Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 244000030166 artemisia Species 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000007640 basal medium Substances 0.000 description 1
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 230000007211 cardiovascular event Effects 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 208000026106 cerebrovascular disease Diseases 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 210000000695 crystalline len Anatomy 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000002086 displacement chromatography Methods 0.000 description 1
- 238000001647 drug administration Methods 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 235000012631 food intake Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000030136 gastric emptying Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 208000004104 gestational diabetes Diseases 0.000 description 1
- 229960000346 gliclazide Drugs 0.000 description 1
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 1
- 229960004666 glucagon Drugs 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 210000003016 hypothalamus Anatomy 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 239000013028 medium composition Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 208000030159 metabolic disease Diseases 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229940057059 monascus purpureus Drugs 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 210000004877 mucosa Anatomy 0.000 description 1
- 230000012666 negative regulation of transcription by glucose Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000003538 oral antidiabetic agent Substances 0.000 description 1
- 229940127209 oral hypoglycaemic agent Drugs 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 210000000578 peripheral nerve Anatomy 0.000 description 1
- 239000012466 permeate Substances 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical group OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- OTYBMLCTZGSZBG-UHFFFAOYSA-L potassium sulfate Chemical compound [K+].[K+].[O-]S([O-])(=O)=O OTYBMLCTZGSZBG-UHFFFAOYSA-L 0.000 description 1
- 229910052939 potassium sulfate Inorganic materials 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000013558 reference substance Substances 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 102220117237 rs142486394 Human genes 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 229960002101 secretin Drugs 0.000 description 1
- OWMZNFCDEHGFEP-NFBCVYDUSA-N secretin human Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(N)=O)[C@@H](C)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)C1=CC=CC=C1 OWMZNFCDEHGFEP-NFBCVYDUSA-N 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 210000004885 white matter Anatomy 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/605—Glucagons
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Toxicology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Endocrinology (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明涉及一种GLP‑1类似物‑COL3A1融合蛋白。具体地,所述融合蛋白包含胰高血糖素样肽1类似物和的人Ⅲ型胶原α1链的融合蛋白,所述融合蛋白具有胰高血糖素样肽1的降低血糖作用,并具有延长的体内半衰期。
Description
技术领域
本发明涉及医药领域,具体地涉及一种GLP-1类似物-COL3A1融合蛋白。
背景技术
糖尿病是一种严重的慢性疾病,主要是由高血糖和内源性胰岛素的分泌不足和功能丧失导致。糖尿病会引起多种并发症,比如血管系统、肾脏、视网膜、晶状体、周围神经和皮肤等,进而影响寿命和生活质量。随着世界各国社会经济的发展和居民生活水平的提高,糖尿病的发病率及患病率逐年上升,糖尿病已经成为继肿瘤、心脑血管疾病之后第三位严重危害人类健康的慢性疾病。糖尿病基本分为四类,包括:Ⅰ型(胰岛素依赖型)、Ⅱ型(非胰岛素依赖型)、其他型和妊娠糖尿病。其中,Ⅱ型糖尿病是目前最常见的糖尿病,约占糖尿病患者的90%。Ⅱ型糖尿病是一种病因复杂、以血糖升高为特征的代谢性疾病,中国Ⅱ型糖尿病患者在过去二十多年中呈爆炸式增长。
目前Ⅱ型糖尿病的治疗主要以口服降糖药为主,有磺脲类物如格列本、格列齐特、二甲双胍和胰岛素等。但是长期使用这些药物治疗会产生耐受,无法长期控制血糖和细胞功能的紊乱。因此,研发一种更加安全、有效针对糖尿病发病机制的新型药物尤为重要。
胰高血糖素样肽1(GLP-1)是由位于胃肠道黏膜L细胞分泌的一种肠促胰素,GLP-1的分泌是血糖依赖性的,即当血糖浓度高于正常时,GLP-1呈现促胰岛素分泌作用,而当血糖浓度正常时,GLP-1的促胰岛素分泌作用减弱,因此外源性GLP-1治疗不会增加低血糖风险。此外,GLP-1通过与胰岛α细胞的受体结合,增加胰岛素分泌和生物合成,有效的降低血糖;促进胰岛β细胞增殖、抑制其调凋亡,增加葡萄糖依赖性的胰岛素分泌;能够减弱胃肠道蠕动,延缓胃排空,减少食物摄入;作用于下丘脑,降低食欲,从而减轻体重。基于以上特点,GLP-1已经成为新型Ⅱ型糖尿病治疗药物的开发热点。
天然的GLP-1在体内极不稳定,释放后很容易二肽基肽酶4(DPP-4)快速降解,其在体内的半衰期仅为1~2min,不具备成药性。2005年国际上第一个GLP-1受体激动剂药物艾塞那肽上市,它是一种来源于蜥蜴唾液的GLP-1类似物,与人GLP-1具有50%的同源性。稍后上市的利拉鲁肽,是人工合成的GLP-1类似物,与人GLP-1具有97%的同源性,因此其有效性提升的同时其副作用相比艾塞那肽大大降低。但利拉鲁肽与艾塞那肽一样,都需要每日皮下注射给药,在药物的易用性方面仍存在一定劣势,因此对GLP-1进行结构修饰,在保留其生物学效应的同时延长其体内半衰期已经成为GLP-1药物的研发方向。而与人免疫球蛋白IgG Fc部分结合的人源GLP-1改构体度拉糖肽,利用了IgG长循环半衰期的特点,在效果不劣于利拉鲁肽的同时,可以达到一周一次给药,是目前同类产品中最理想的产品。但是度拉糖肽中的IgG的Fc段可能具有ADCC(Antibody-Dependent Cell Cytotoxicity,抗体依赖细胞介导的细胞毒性作用)作用,具有潜在的免疫原性及副反应。
因此,本领域亟需开发一种新型、安全、并能长期有效治疗糖尿病的药物。
发明内容
本发明的目的是提供一种新型、安全、并能长期有效治疗糖尿病的药物。
本发明的第一方面,提供了一种融合蛋白,所述融合蛋白的结构如下式I所示:
A-L-B (I)
式中,A为GLP-1类似物,B为人Ⅲ型胶原α1链,L为无或连接肽,各“-”独立地为连接肽或肽键;且
所述GLP-1类似物具有SEQ ID NO.:1所示氨基酸序列的多肽,
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
其中Xaa8是Gly或Ala,Xaa22是Glu或Gly,Xaa37是Gly或无。
在另一优选例中,所述GLP-1类似物在对应于SEQ ID NO.:6所示的序列中存在选自下组的氨基酸突变:第2位甘氨酸(Ala)、第16位甘氨酸(Gly)、第31位甘氨酸(Gly)、或其组合。
在另一优选例中,所述GLP-1类似物在对应于SEQ ID NO.:6所示的序列中存在选自的突变:A2G、G16E、缺失G31、或其组合。
在另一优选例中,所述GLP-1类似物除所述突变(如第2、16、31位氨基酸)外,其余氨基酸与SEQ ID NO.:6所示的序列相同或基本相同。
在另一优选例中,所述GLP-1类似物具有如SEQ ID NO.:6或12所示的氨基酸序列。
在另一优选例中,所述人Ⅲ型胶原α1链(即COL3A1)为COL3A1蛋白全长或片段,其中所述片段选自下组:COL3A1598-896片段、COL3A1733-896片段、或其组合。
在另一优选例中,所述COL3A1蛋白全长具有SEQ ID NO.:2所示的氨基酸序列。
在另一优选例中,所述COL3A1598-896片段为COL3A1蛋白中第598-896位氨基酸序列。
在另一优选例中,所述COL3A1598-896片段具有SEQ ID NO.:3所示的氨基酸序列。
在另一优选例中,所述COL3A1733-896片段为COL3A1蛋白中第733-896位氨基酸序列。
在另一优选例中,所述COL3A1733-896片段具有SEQ ID NO.:4所示的氨基酸序列。
在另一优选例中,所述人Ⅲ型胶原α1链具有SEQ ID NO.:2、3或4所示的氨基酸序列。
在另一优选例中,所述连接肽为具有n个重复的如SEQ ID NO.:5(Gly-Gly-Gly-Gly-Ser)所示的序列,并且其C端还连接有一个丙氨酸(Ala)的多肽,其中n为2-6,优选地n为2。
在另一优选例中,所述连接肽具有SEQ ID NO.:7所示的氨基酸序列。
在另一优选例中,所述融合蛋白具有SEQ ID NO.:8、9或13所示的氨基酸序列。
本发明的第二方面,提供了一种寡聚体,所述寡聚体包含本发明第一方面所述的融合蛋白。
在另一优选例中,所述寡聚体为二聚体、三聚体、四聚体或五聚体。
在另一优选例中,所述寡聚体为本发明第一方面所述的融合蛋白的二聚体。
本发明的第三方面,提供了一种分离的多核苷酸,所述多核苷酸编码本发明第一方面所述的融合蛋白。
在另一优选例中,所述GLP-1类似物的编码序列如SEQ ID NO.:10中第1-90位所示。
在另一优选例中,所述COL3A1598-896片段的编码序列如SEQ ID NO.:10中第139-1035位所示。
在另一优选例中,所述COL3A1733-896片段的编码序列如SEQ ID NO.:11中第139-630位所示。
在另一优选例中,所述多核苷酸具有如SEQ ID NO.:10或11所示的序列。
本发明的第四方面,提供了一种载体,所述载体包括本发明第三方面所述的多核苷酸。
在另一优选例中,所述的载体选自下组:DNA、RNA、质粒、慢病毒载体、腺病毒载体、逆转录病毒载体、转座子、或其组合。
在另一优选例中,所述载体为质粒,优选地pUC57质粒。
本发明的第五方面,提供了一种宿主细胞,所述的宿主细胞含有本发明第四方面所述的载体、或染色体中整合有外源的本发明第三方面所述的多核苷酸、或表达本发明第一方面所述的融合蛋白、或表达本发明第二方面所述的寡聚体。
在另一优选例中,所述宿主细胞为酵母,优选地毕赤酵母细胞,更优选地毕赤酵母细胞GS115。
本发明的第六方面,提供了一种药物组合物,所述药物组合物包括本发明第一方面所述的融合蛋白或本发明第二方面所述的寡聚体,以及药学上可接受的载体或赋形剂。
在另一优选例中,所述药物组合物用于治疗非胰岛素依赖型糖尿病或其相关疾病。
本发明的第七方面,提供了如本发明第一方面所述的融合蛋白、本发明第二方面所述的寡聚体、本发明第三方面所述的多核苷酸、本发明第四方面所述的载体、本发明第五方面所述的宿主细胞,用于制备预防和/或治疗糖尿病的药物或制剂。
在另一优选例中,所述糖尿病为非胰岛素依赖型糖尿病或其相关疾病。
在本发明的第八方面,提供了一种本发明第一方面所述的融合蛋白、本发明第二方面所述的寡聚体或本发明第六方面所述的药物组合物的用途,用于预防和/或治疗糖尿病,优选地非胰岛素依赖型糖尿病或其相关疾病。
本发明的第九方面,提供了一种治疗疾病的方法,包括给需要治疗的对象施用适量的本发明第一方面所述的融合蛋白、本发明第二方面所述的寡聚体或本发明第六方面所述的药物组合物。
在另一优选例中,所述疾病为非胰岛素依赖型糖尿病或其相关疾病。
应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。
附图说明
图1显示了pPic9m-GLP-COL-1表达质粒的构建图。
图2显示了pPic9m-GLP-COL-2质粒结构图。
图3显示了纯化后GLP-1-2L-COL598-896及GLP-1-2L-COL733-896融合蛋白的电泳纯度分析结果。
图4显示了GLP-1-2L-COL598-896及GLP-1-2L-COL733-896融合蛋白的GLP-1R受体激活活性分析。
图5显示了GLP-1-2L-COL598-896及GLP-1-2L-COL733-896融合蛋白的体内半衰期分析结果。
具体实施方式
本发明人经过广泛而深入的研究,意外地获得了一种安全、长期有效治疗非胰岛素依赖型糖尿病的药物。所述药物是一种GLP-1类似物-COL3A1融合蛋白,包含胰高血糖素样肽1类似物和的人Ⅲ型胶原α1链的融合蛋白,所述融合蛋白具有胰高血糖素样肽1的降低血糖作用,并具有延长的体内半衰期。本发明还对胰高血糖素样肽1类似物进行了突变,降低GLP-1类似物对DPP-4水解的敏感性,活性提高,免疫原性降低。本发明还筛选出两种人COL3A1片段(SEQ ID NO.:3和SEQ ID NO.:4所示),保留了人COL3A1形成同源三聚体的能力同时有利于本发明中异源融合蛋白的重组表达,避免过长的氨基酸序列导致重组表达的困难。本发明融合蛋白质可用于治疗Ⅱ型糖尿病和各种相关状况。在此基础上,发明人完成了本发明。
融合蛋白
如本文所用,“本发明融合蛋白”、“重组融合蛋白”或“多肽”均指本发明第一方面所述的融合蛋白。本发明融合蛋白的结构如下式I所示:
A-L-B (I)
式中,A为GLP-1类似物,B为人Ⅲ型胶原α1链,L为无或连接肽,各“-”独立地为连接肽或肽键;且
所述GLP-1类似物具有SEQ ID NO.:1所示氨基酸序列的多肽,
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
其中Xaa8是Gly或Ala,Xaa22是Glu或Gly,Xaa37是Gly或无。
如本文所用,术语“融合蛋白”还包括具有上述活性的、SEQ ID NO.:8、9或13序列的变异形式。这些变异形式包括(但并不限于):1-3个(通常为1-2个,更佳地1个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加或缺失一个或数个(通常为3个以内,较佳地为2个以内,更佳地为1个以内)氨基酸。例如,在本领域中,用性能相近或相似的氨基酸进行取代时,通常不会改变蛋白质的功能。又比如,在C末端和/或N末端添加或缺失一个或数个氨基酸通常也不会改变蛋白质的结构和功能。此外,所述术语还包括单体和多聚体形式的本发明多肽。该术语还包括线性以及非线性的多肽(如环肽)。
本发明还包括上述融合蛋白的活性片段、衍生物和类似物。如本文所用,术语“片段”、“衍生物”和“类似物”是指基本上保持本发明融合蛋白的功能或活性的多肽。本发明的多肽片段、衍生物或类似物可以是(i)有一个或几个保守或非保守性氨基酸残基(优选保守性氨基酸残基)被取代的多肽,或(ii)在一个或多个氨基酸残基中具有取代基团的多肽,或(iii)抗原肽与另一个化合物(比如延长多肽半衰期的化合物,例如聚乙二醇)融合所形成的多肽,或(iv)附加的氨基酸序列融合于此多肽序列而形成的多肽(与前导序列、分泌序列或6His等标签序列融合而形成的融合蛋白)。根据本文的教导,这些片段、衍生物和类似物属于本领域熟练技术人员公知的范围。
一类优选的活性衍生物指与式I的氨基酸序列相比,有至多3个,较佳地至多2个,更佳地至多1个氨基酸被性质相似或相近的氨基酸所替换而形成多肽。这些保守性变异多肽最好根据表A进行氨基酸替换而产生。
表A
最初的残基 | 代表性的取代 | 优选的取代 |
Ala(A) | Val;Leu;Ile | Val |
Arg(R) | Lys;Gln;Asn | Lys |
Asn(N) | Gln;His;Lys;Arg | Gln |
Asp(D) | Glu | Glu |
Cys(C) | Ser | Ser |
Gln(Q) | Asn | Asn |
Glu(E) | Asp | Asp |
Gly(G) | Pro;Ala | Ala |
His(H) | Asn;Gln;Lys;Arg | Arg |
Ile(I) | Leu;Val;Met;Ala;Phe | Leu |
Leu(L) | Ile;Val;Met;Ala;Phe | Ile |
Lys(K) | Arg;Gln;Asn | Arg |
Met(M) | Leu;Phe;Ile | Leu |
Phe(F) | Leu;Val;Ile;Ala;Tyr | Leu |
Pro(P) | Ala | Ala |
Ser(S) | Thr | Thr |
Thr(T) | Ser | Ser |
Trp(W) | Tyr;Phe | Tyr |
Tyr(Y) | Trp;Phe;Thr;Ser | Phe |
Val(V) | Ile;Leu;Met;Phe;Ala | Leu |
本发明还提供本发明融合蛋白的类似物。这些类似物与SEQ ID NO.:8、9或13所示的多肽的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的多肽并不限于上述例举的代表性的多肽。
修饰(通常不改变一级结构)形式包括:体内或体外的多肽的化学衍生形式如乙酰化或羧基化。修饰还包括糖基化,如那些在多肽的合成和加工中或进一步加工步骤中进行糖基化修饰而产生的多肽。这种修饰可以通过将多肽暴露于进行糖基化的酶(如哺乳动物的糖基化酶或去糖基化酶)而完成。修饰形式还包括具有磷酸化氨基酸残基(如磷酸酪氨酸,磷酸丝氨酸,磷酸苏氨酸)的序列。还包括被修饰从而提高了其抗蛋白水解性能或优化了溶解性能的多肽。
本发明的化合物包括一种异源融合蛋白质,其中第一个多肽是GLP-1类似物,其序列选自SEQIDNO.:1,
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
其中Xaa8是Gly或Ala;
其中Xaa22是Glu或Gly;
其中Xaa37是Gly或被去除。
第二个多肽是人Ⅲ型胶原α1链(即COL3A1)全长或片段,其序列选自
(a)全长COL3A1(SEQ ID NO.:2)
Met-Met-Ser-Phe-Val-Gln-Lys-Gly-Ser--Trp-Leu-Leu-Leu-Ala-Leu-Leu-His-Pro--Thr-Ile-Ile-Leu-Ala-Gln-Gln-Glu-Ala-Val-Glu-Gly-Gly-Cys-Ser-His-Leu-Gly-Gln-Ser--Tyr-Ala-Asp-Arg-Asp-Val--Trp-Lys-Pro-Glu-Pro-Cys-Gln-Ile-Cys-Val-Cys-Asp-Ser-Gly-Ser-Val-Leu-Cys-Asp-Asp-Ile-Ile-Cys-Asp-Asp-Gln-Glu-Leu-Asp-Cys-Pro-Asn-Pro-Glu-Ile-Pro-Phe-Gly-Glu-Cys-Cys-Ala-Val-Cys-Pro-Gln-Pro-Pro--Thr-Ala-Pro--Thr-Arg-Pro-Pro-Asn-Gly-Gln-Gly-Pro-Gln-Gly-Pro-Lys-Gly-Asp-Pro-Gly-Pro-Pro-Gly-Ile-Pro-Gly-Arg-Asn-Gly-Asp-Pro-Gly-Ile-Pro-Gly-Gln-Pro-Gly-Ser-Pro-Gly-Ser-Pro-Gly-Pro-Pro-Gly-Ile-Cys-Glu-Ser-Cys-Pro--Thr-Gly-Pro-Gln-Asn--Tyr-Ser-Pro-Gln--Tyr-Asp-Ser--Tyr-Asp-Val-Lys-Ser-Gly-Val-Ala-Val-Gly-Gly-Leu-Ala-Gly--Tyr-Pro-Gly-Pro-Ala-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Pro-Pro-Gly--Thr-Ser-Gly-His-Pro-Gly-Ser-Pro-Gly-Ser-Pro-Gly--Tyr-Gln-Gly-Pro-Pro-Gly-Glu-Pro-Gly-Gln-Ala-Gly-Pro-Ser-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Ile-Gly-Pro-Ser-Gly-Pro-Ala-Gly-Lys-Asp-Gly-Glu-Ser-Gly-Arg-Pro-Gly-Arg-Pro-Gly-Glu-Arg-Gly-Leu-Pro-Gly-Pro-Pro-Gly-Ile-Lys-Gly-Pro-Ala-Gly-Ile-Pro-Gly-Phe-Pro-Gly-Met-Lys-Gly-His-Arg-Gly-Phe-Asp-Gly-Arg-Asn-Gly-Glu-Lys-Gly-Glu--Thr-Gly-Ala-Pro-Gly-Leu-Lys-Gly-Glu-Asn-Gly-Leu-Pro-Gly-Glu-Asn-Gly-Ala-Pro-Gly-Pro-Met-Gly-Pro-Arg-Gly-Ala-Pro-Gly-Glu-Arg-Gly-Arg-Pro-Gly-Leu-Pro-Gly-Ala-Ala-Gly-Ala-Arg-Gly-Asn-Asp-Gly-Ala-Arg-Gly-Ser-Asp-Gly-Gln-Pro-Gly-Pro-Pro-Gly-Pro-Pro-Gly--Thr-Ala-Gly-Phe-Pro-Gly-Ser-Pro-Gly-Ala-Lys-Gly-Glu-Val-Gly-Pro-Ala-Gly-Ser-Pro-Gly-Ser-Asn-Gly-Ala-Pro-Gly-Gln-Arg-Gly-Glu-Pro-Gly-Pro-Gln-Gly-His-Ala-Gly-Ala-Gln-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ile-Asn-Gly-Ser-Pro-Gly-Gly-Lys-Gly-Glu-Met-Gly-Pro-Ala-Gly-Ile-Pro-Gly-Ala-Pro-Gly-Leu-Met-Gly-Ala-Arg-Gly-Pro-Pro-Gly-Pro-Ala-Gly-Ala-Asn-Gly-Ala-Pro-Gly-Leu-Arg-Gly-Gly-Ala-Gly-Glu-Pro-Gly-Lys-Asn-Gly-Ala-Lys-Gly-Glu-Pro-Gly-Pro-Arg-Gly-Glu-Arg-Gly-Glu-Ala-Gly-Ile-Pro-Gly-Val-Pro-Gly-Ala-Lys-Gly-Glu-Asp-Gly-Lys-Asp-Gly-Ser-Pro-Gly-Glu-Pro-Gly-Ala-Asn-Gly-Leu-Pro-Gly-Ala-Ala-Gly-Glu-Arg-Gly-Ala-Pro-Gly-Phe-Arg-Gly-Pro-Ala-Gly-Pro-Asn-Gly-Ile-Pro-Gly-Glu-Lys-Gly-Pro-Ala-Gly-Glu-Arg-Gly-Ala-Pro-Gly-Pro-Ala-Gly-Pro-Arg-Gly-Ala-Ala-Gly-Glu-Pro-Gly-Arg-Asp-Gly-Val-Pro-Gly-Gly-Pro-Gly-Met-Arg-Gly-Met-Pro-Gly-Ser-Pro-Gly-Gly-Pro-Gly-Ser-Asp-Gly-Lys-Pro-Gly-Pro-Pro-Gly-Ser-Gln-Gly-Glu-Ser-Gly-Arg-Pro-Gly-Pro-Pro-Gly-Pro-Ser-Gly-Pro-Arg-Gly-Gln-Pro-Gly-Val-Met-Gly-Phe-Pro-Gly-Pro-Lys-Gly-Asn-Asp-Gly-Ala-Pro-Gly-Lys-Asn-Gly-Glu-Arg-Gly-Gly-Pro-Gly-Gly-Pro-Gly-Pro-Gln-Gly-Pro-Pro-Gly-Lys-Asn-Gly-Glu--Thr-Gly-Pro-Gln-Gly-Pro-Pro-Gly-Pro--Thr-Gly-Pro-Gly-Gly-Asp-Lys-Gly-Asp--Thr-Gly-Pro-Pro-Gly-Pro-Gln-Gly-Leu-Gln-Gly-Leu-Pro-Gly--Thr-Gly-Gly-Pro-Pro-Gly-Glu-Asn-Gly-Lys-Pro-Gly-Glu-Pro-Gly-Pro-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Ala-Pro-Gly-Gly-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Glu-Arg-Gly-Pro-Pro-Gly-Leu-Ala-Gly-Ala-Pro-Gly-Leu-Arg-Gly-Gly-Ala-Gly-Pro-Pro-Gly-Pro-Glu-Gly-Gly-Lys-Gly-Ala-Ala-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Ala-Gly--Thr-Pro-Gly-Leu-Gln-Gly-Met-Pro-Gly-Glu-Arg-Gly-Gly-Leu-Gly-Ser-Pro-Gly-Pro-Lys-Gly-Asp-Lys-Gly-Glu-Pro-Gly-Gly-Pro-Gly-Ala-Asp-Gly-Val-Pro-Gly-Lys-Asp-Gly-Pro-Arg-Gly-Pro--Thr-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Ala-Gly-Gln-Pro-Gly-Asp-Lys-Gly-Glu-Gly-Gly-Ala-Pro-Gly-Leu-Pro-Gly-Ile-Ala-Gly-Pro-Arg-Gly-Ser-Pro-Gly-Glu-Arg-Gly-Glu--Thr-Gly-Pro-Pro-Gly-Pro-Ala-Gly-Phe-Pro-Gly-Ala-Pro-Gly-Gln-Asn-Gly-Glu-Pro-Gly-Gly-Lys-Gly-Glu-Arg-Gly-Ala-Pro-Gly-Glu-Lys-Gly-Glu-Gly-Gly-Pro-Pro-Gly-Val-Ala-Gly-Pro-Pro-Gly-Lys-Asp-Gly--Thr-Ser-Gly-His-Pro-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Arg-Gly-Asn-Arg-Gly-Glu-Arg-Gly-Ser-Glu-Gly-Ser-Pro-Gly-His-Pro-Gly-Gln-Pro-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Pro-Gly-Pro-Cys-Cys-Gly-Gly-Val-Gly-Ala-Ala-Ala-Ile-Ala-Gly-Ile-Gly-Gly-Glu-Lys-Ala-Gly-Gly-Phe-Ala-Pro--Tyr--Tyr-Gly-Asp-Glu-Pro-Met-Asp-Phe-Lys-Ile-Asn--Thr-Asp-Glu-Ile-Met--Thr-Ser-Leu-Lys-Ser-Val-Asn-Gly-Gln-Ile-Glu-Ser-Leu-Ile-Ser-Pro-Asp-Gly-Ser-Arg-Lys-Asn-Pro-Ala-Arg-Asn-Cys-Arg-Asp-Leu-Lys-Phe-Cys-His-Pro-Glu-Leu-Lys-Ser-Gly-Glu--Tyr--Trp-Val-Asp-Pro-Asn-Gln-Gly-Cys-Lys-Leu-Asp-Ala-Ile-Lys-Val-Phe-Cys-Asn-Met-Glu--Thr-Gly-Glu--Thr-Cys-Ile-Ser-Ala-Asn-Pro-Leu-Asn-Val-Pro-Arg-Lys-His--Trp--Trp--Thr-Asp-Ser-Ser-Ala-Glu-Lys-Lys-His-Val--Trp-Phe-Gly-Glu-Ser-Met-Asp-Gly-Gly-Phe-Gln-Phe-Ser--Tyr-Gly-Asn-Pro-Glu-Leu-Pro-Glu-Asp-Val-Leu-Asp-Val-Gln-Leu-Ala-Phe-Leu-Arg-Leu-Leu-Ser-Ser-Arg-Ala-Ser-Gln-Asn-Ile--Thr--Tyr-His-Cys-Lys-Asn-Ser-Ile-Ala--Tyr-Met-Asp-Gln-Ala-Ser-Gly-Asn-Val-Lys-Lys-Ala-Leu-Lys-Leu-Met-Gly-Ser-Asn-Glu-Gly-Glu-Phe-Lys-Ala-Glu-Gly-Asn-Ser-Lys-Phe--Thr--Tyr--Thr-Val-Leu-Glu-Asp-Gly-Cys--Thr-Lys-His--Thr-Gly-Glu--Trp-Ser-Lys--Thr-Val-Phe-Glu--Tyr-Arg--Thr-Arg-Lys-Ala-Val-Arg-Leu-Pro-Ile-Val-Asp-Ile-Ala-Pro--Tyr-Asp-Ile-Gly-Gly-Pro-Asp-Gln-Glu-Phe-Gly-Val-Asp-Val-Gly-Pro-Val-Cys-Phe-Leu
(b)COL3A1598-896(SEQ ID NO.:3)
Gly-Pro-Gly-Gly-Pro-Gly-Pro-Gln-Gly-Pro-Pro-Gly-Lys-Asn-Gly-Glu-Thr-Gly-Pro-Gln-Gly-Pro-Pro-Gly-Pro-Thr-Gly-Pro-Gly-Gly-Asp-Lys-Gly-Asp-Thr-Gly-Pro-Pro-Gly-Pro-Gln-Gly-Leu-Gln-Gly-Leu-Pro-Gly-Thr-Gly-Gly-Pro-Pro-Gly-Glu-Asn-Gly-Lys-Pro-Gly-Glu-Pro-Gly-Pro-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Ala-Pro-Gly-Gly-Lys-Gly-Asp-Ala-Gly-Ala-Pro-Gly-Glu-Arg-Gly-Pro-Pro-Gly-Leu-Ala-Gly-Ala-Pro-Gly-Leu-Arg-Gly-Gly-Ala-Gly-Pro-Pro-Gly-Pro-Glu-Gly-Gly-Lys-Gly-Ala-Ala-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Ala-Gly-Thr-Pro-Gly-Leu-Gln-Gly-Met-Pro-Gly-Glu-Arg-Gly-Gly-Leu-Gly-Ser-Pro-Lys-Gly-Asp-Lys-Gly-Glu-Pro-Gly-Gly-Pro-Gly-Ala-Asp-Gly-Val-Pro-Gly-Lys-Asp-Gly-Pro-Arg-Gly-Pro-Thr-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Ala-GLy-Gln-Pro-Gly-Asp-Lys-Gly-Glu-Gly-Gly-Ala-Pro-Gly-Leu-Pro-Gly-Ile-Ala-Gly-Pro-Arg-Gly-Ser-Pro-Gly-Glu-Arg-Gly-Glu-Thr-Gly-Pro-Pro-Gly-Pro-Ala-Gly-Phe-Pro-Gly-Ala-Pro-Gly-Gln-Asn-Gly-Glu-Pro-Gly-Gly-Lys-Gly-Glu-Arg-Gly-Ala-Pro-Gly-Glu-Lys-Gly-Glu-Gly-Gly-Pro-Pro-Gly-Val-Ala-Gly-Pro-Pro-Gly-Lys-Asp-Gly-Thr-Ser-Gly-His-Pro-Gly-Pro-I le-Gly-Pro-Pro-Gly-Pro-Arg-Gly-Asn-Arg-Gly-Glu-Arg-Gly-Ser-Glu-Gly-Ser-Pro-Gly-His-Pro-Gly-Gln-Pro-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Pro-Gly-Pro-Cys-Cys-Gly-Gly
(c)COL3A1733-896(SEQ ID NO.:4)
Gly-Leu-Gly-Ser-Pro-Lys-Gly-Asp-Lys-Gly-Glu-Pro-Gly-Gly-Pro-Gly-Ala-Asp-Gly-Val-Pro-Gly-Lys-Asp-Gly-Pro-Arg-Gly-Pro-Thr-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Ala-GLy-Gln-Pro-Gly-Asp-Lys-Gly-Glu-Gly-Gly-Ala-Pro-Gly-Leu-Pro-Gly-Ile-Ala-Gly-Pro-Arg-Gly-Ser-Pro-Gly-Glu-Arg-Gly-Glu-Thr-Gly-Pro-Pro-Gly-Pro-Ala-Gly-Phe-Pro-Gly-Ala-Pro-Gly-Gln-Asn-Gly-Glu-Pro-Gly-Gly-Lys-Gly-Glu-Arg-Gly-Ala-Pro-Gly-Glu-Lys-Gly-Glu-Gly-Gly-Pro-Pro-Gly-Val-Ala-Gly-Pro-Pro-Gly-Lys-Asp-Gly-Thr-Ser-Gly-His-Pro-Gly-Pro-Ile-Gly-Pro-Pro-Gly-Pro-Arg-Gly-Asn-Arg-Gly-Glu-Arg-Gly-Ser-Glu-Gly-Ser-Pro-Gly-His-Pro-Gly-Gln-Pro-Gly-Pro-Pro-Gly-Pro-Pro-Gly-Ala-Pro-Gly-Pro-Cys-Cys-Gly-Gly
本发明的异源融合蛋白质GLP-1类似物多肽的C末端和人COL3A1片段的N末端优选通过富含G的肽接头(即连接肽)融合在一起,其中肽接头具有序列[Gly-Gly-Gly-Gly-Ser(SEQ ID NO.:5)]n-Ala的序列,其中n为2-6,优选地为2。
本发明的异源融合蛋白质包括GLP-1类似物部分和人COL3A1片段部分。通过对天然GLP-1序列的部分替换与融合人COL3A1片段,在保留天然GLP-1活性的同时,融合蛋白通过人COL3A1区域形成稳定的寡聚体,增加融合蛋白的体内稳定性。
天然GLP-1在体内经过加工,切割成AA7-AA37具有活性的片段,因此,按照本领域习惯将GLP-1的氨基端指定为7号,羧基端为37号。如SEQ ID NO.:6中所示对该多肽中的其他氨基酸连续编号。
7His-Ala-Glu-10Gly-Thr-Phe-Thr-Ser-15Asp-Val-Ser-Ser-Tyr-20Leu-Glu-Gly-Gln-Ala-25Ala-Lys-Glu-Phe-Ile-30Ala-Trp-Leu-Val-Lys-35Gly-Arg-37Gly
(SEQ ID NO:6)
相对于天然GLP-1(7-37),异源融合蛋白质的GLP-1类似物部分包括第8、22和36位的三个初步替换。内源性二肽基肽酶4(DPP-4)在第8位的Ala及第9位的Glu之间切割天然GLP-1,产生的无活性的GLP-1(9-37)片段,第8位替换为Gly后可以降低GLP-1类似物对DPP-4水解的敏感性。第22位的替换可以提高GLP-1类似物的活性。第37位的去除可降低融合蛋白质得免疫原性。突变后的序列如SEQ ID NO.:12所示。
7His-Gly-Glu-10Gly-Thr-Phe-Thr-Ser-15Asp-Val-Ser-Ser-Tyr-20Leu-Glu-Glu-Gln-Ala-25Ala-Lys-Glu-Phe-Ile-30Ala-Trp-Leu-Val-Lys-35Gly-Arg(SEQ ID NO.:12)
本发明的异源融合蛋白含有人COL3A1及其片段。在分子结构上,Ⅲ型胶原蛋白是由平行线型链组成,每一线型链由三条扭曲左旋的α1链通过链间相互作用紧密结合而形成的一极强的右旋三重螺旋结构。每条Ⅲ型胶原α1链由多达300以上的Gly-X-Y三联体重复构成,该三联体结构是Ⅲ型胶原α1链形成同源三聚体的关键。因此,本发明在采用全长人COL3A1(SEQ ID NO.:2)的基础上,为了避免过长的氨基酸序列导致重组表达的困难,优选了两种人COL3A1片段,序列分别如SEQ ID NO.:3及SEQ ID NO.:4所示。两种片段含有不同长度的Gly-X-Y三联体结构域,保留了人COL3A1形成同源三聚体的能力同时有利于本发明中异源融合蛋白的重组表达。
本发明中的GLP-1类似物部分的C末端氨基酸优选通过富含甘氨酸的接头肽[Gly-Gly-Gly-Gly-Ser(SEQ ID NO.:5)]n-Ala与人COL3A1片段的N末端融合。增加肽接头可以防止潜在的两个结构域之间的相互干扰,提高异源融合蛋白的稳定性。另外,富含甘氨酸的接头提供了一定的结构柔性,使GLP-1类似物部分可以与靶细胞例如胰腺β细胞上的GLP-1受体分子有效地相互作用,有利于发挥其生物活性。本发明中接头肽[Gly-Gly-Gly-Gly-Ser(SEQ ID NO.:5)]n-Ala的重复数n≥2,但过长的接头肽不利于融合蛋白的稳定性并可能增加潜在的免疫原性。因此优选的接头肽包括序列:
Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Ala(SEQ ID NO.:7)
因此本发明优选的GLP-1-COL3A1异源融合蛋白质包括下列蛋白质:
(a)GLP-1-2L-COL598-896(SEQ ID NO.:8)
HGEGTFTSDVSSYLEEQAAKEFIAWLVKGRGGGGSGGGGSGGGGSAGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGG
(b)GLP-1-2L-COL733-896(SEQ ID NO.:9)
HGEGTFTSDVSSYLEEQAAKEFIAWLVKGRGGGGSGGGGSGGGGSAGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGG
(c)GLP-1-2L-COL(SEQ ID NO.:13)
HGEGTFTSDVSSYLEEQAAKEFIAWLVKGRGGGGSGGGGSGGGGSAAMMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDVWKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKKHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRASQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL
此处使用的提及特定异源融合蛋白质的命名法定义如下:融合蛋白质GLP-1部分特指成熟GLP-1(7-37)的类似物,其中第8位的Ala突变为Gly,第22位的Gly突变为Glu,第37位的Gly去除。L指具有序列[Gly-Gly-Gly-Gly-Ser(SEQ ID NO.:5)]n-Ala的接头。直接在L前面的数字指接头肽中n的重复数目。指定为2L的接头肽指序列Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Ala(SEQ ID NO.:7)。融合蛋白质COL3A1片段的缩写为COL,其氨基酸序列起始位置以残基编号来表示。COL598-896表示成熟融合蛋白质的COL3A1部分以第598位的Gly开始,以第896位Gly结束;COL733-896表示成熟融合蛋白质的COL3A1部分以第733位的Gly开始,以第896位Gly结束。
本发明涉及人源GLP-1类似物与人Ⅲ型胶原α1链片段相结合的融合蛋白。该融合蛋白与度拉糖肽不同之处在于,IgG的Fc段可能具有ADCC(Antibody-Dependent CellCytotoxicity,抗体依赖细胞介导的细胞毒性作用)作用,具有潜在的免疫原性及副反应。相对的,胶原蛋白是体内含量最多的一种蛋白质,约占蛋白质总量的25%-33%,广泛地存在于人体的骨、腱、软骨和皮肤及其它结缔组织种,是细胞外基质(ECM)的主要成分,具有良好的生物兼容性、生物吸收性。其中,Ⅲ型胶原只占胶原蛋白总量的10%,主要存在于血管中。在分子结构上,Ⅲ型胶原蛋白是由平行线型链组成,每一线型链由三条扭曲左旋的α1链通过链间相互作用紧密结合而形成的一极强的右旋三重螺旋结构。每条Ⅲ型胶原α1链由多达300以上的Gly-X-Y三联体重复构成,该三联体结构是Ⅲ型胶原α1链形成同源三聚体的关键。
核酸编码序列
本发明还包括编码本发明异源融合蛋白质的多核苷酸以及包含这些多核苷酸的载体和宿主细胞。本发明也包括治疗患有非胰岛素依赖型、肥胖及多种其他疾病和病症的患者的方法,其包括应用在此讨论的异源融合蛋白质。
本发明还涉及编码根据本发明的融合蛋白的多核苷酸。
在本发明的一个优选例中,所述核苷酸序列如SEQ ID NO.:10或11所示。
本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码成熟多肽的编码区序列可以与编码SEQ ID NO.:8、9或13所示的多肽的序列相同或者是简并的变异体。如本文所用,“简并的变异体”在本发明中是指编码具有SEQ ID NO.:8、9或13所示的多肽,但相应编码区序列有差别的核酸序列。
本发明的多肽的核苷酸全长序列或其片段通常可以用PCR扩增法、重组法或人工合成的方法获得。对于PCR扩增法,可根据已公开的有关核苷酸序列,尤其是开放阅读框序列来设计引物,并用市售的cDNA库或按本领域技术人员已知的常规方法所制备的cDNA库作为模板,扩增而得有关序列。当序列较长时,常常需要进行两次或多次PCR扩增,然后再将各次扩增出的片段按正确次序拼接在一起。目前,已经可以完全通过化学合成来得到编码本发明多肽(或其片段,或其衍生物)的DNA序列。然后可将该DNA序列引入本领域中已知的各种现有的DNA分子(或如载体)和细胞中。
本发明也涉及包含本发明的多核苷酸的载体,以及用本发明的载体或多肽编码序列经基因工程产生的宿主细胞。上述多核苷酸、载体或宿主细胞可以是分离的。
如本文所用,“分离的”是指物质从其原始环境中分离出来(如果是天然的物质,原始环境即是天然环境)。如活体细胞内的天然状态下的多核苷酸和多肽是没有分离纯化的,但同样的多核苷酸或多肽如从天然状态中同存在的其他物质中分开,则为分离纯化的。
一旦获得了有关的序列,就可以用重组法来大批量地获得有关序列。这通常是将其克隆入载体,再转入细胞,然后通过常规方法从增殖后的宿主细胞中分离得到有关序列。
此外,还可用人工合成的方法来合成有关序列,尤其是片段长度较短时。通常,通过先合成多个小片段,然后再进行连接可获得序列很长的片段。
应用PCR技术扩增DNA/RNA的方法被优选用于获得本发明的基因。用于PCR的引物可根据本文所公开的本发明的序列信息适当地选择,并可用常规方法合成。可用常规方法如通过凝胶电泳分离和纯化扩增的DNA/RNA片段。
本发明也涉及包含本发明的多核苷酸的载体,以及用本发明的载体或蛋白编码序列经基因工程产生的宿主细胞,以及经重组技术利用所述宿主细胞表达本发明融合蛋白的方法。
通过常规的重组DNA技术,可利用本发明的多核苷酸序列获得表达本发明融合蛋白的宿主细胞。一般来说包括步骤:将本发明第三方面所述的多核苷酸或本发明第四方面所述的载体转导入宿主细胞内。
本领域的技术人员熟知的方法能用于构建含本发明酶的编码DNA序列和合适的转录/翻译控制信号的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等。所述的DNA序列可有效连接到表达载体中的适当启动子上,以指导mRNA合成。表达载体还包括翻译起始用的核糖体结合位点和转录终止子。
此外,表达载体优选地包含一个或多个选择性标记基因,以提供用于选择转化的宿主细胞的表型性状,如真核细胞培养用的二氢叶酸还原酶、新霉素抗性以及绿色荧光蛋白(GFP),或用于大肠杆菌的四环素或氨苄青霉素抗性。
包含上述的适当DNA序列以及适当启动子或者控制序列的载体,可以用于转化适当的宿主细胞,以使其能够表达蛋白质。
宿主细胞可以是原核细胞,如细菌细胞;或是低等真核细胞,如酵母细胞;或是高等真核细胞,如哺乳动物细胞。代表性例子有:大肠杆菌,枯草芽胞杆菌,链霉菌属的细菌细胞;真菌细胞如毕赤酵母、酿酒酵母细胞;植物细胞;果蝇S2或Sf9的昆虫细胞;CHO、NS0、COS7、或293细胞的动物细胞等。在另一优选例中,所述宿主细胞为毕赤酵母。
用重组DNA转化宿主细胞可用本领域技术人员熟知的常规技术进行。当宿主为原核生物如大肠杆菌时,能吸收DNA的感受态细胞可在指数生长期后收获,用CaCl2法处理,所用的步骤在本领域众所周知。另一种方法是使用MgCl2。如果需要,转化也可用电穿孔的方法进行。当宿主是真核生物,可选用如下的DNA转染方法:磷酸钙共沉淀法,常规机械方法如显微注射、电穿孔、脂质体包装等。
获得的转化子可以用常规方法培养,表达本发明的基因所编码的蛋白质。根据所用的宿主细胞,培养中所用的培养基可选自各种常规培养基。在适于宿主细胞生长的条件下进行培养。当宿主细胞生长到适当的细胞密度后,用合适的方法(如温度转换或化学诱导)诱导选择的启动子,将细胞再培养一段时间。
在上面的方法中的蛋白质可在细胞内、或在细胞膜上表达、或分泌到细胞外。如果需要,可利用其物理的、化学的和其它特性通过各种分离方法分离和纯化蛋白。这些方法是本领域技术人员所熟知的。这些方法的例子包括但并不限于:常规的复性处理、用蛋白沉淀剂处理(盐析方法)、离心、渗透破菌、超处理、超离心、分子筛层析(凝胶过滤)、吸附层析、离子交换层析、高效液相层析(HPLC)和其它各种液相层析技术及这些方法的结合。
可以通过多种不同的方法产生编码本发明的GLP-1类似物的DNA,可以基于天然序列设计引物以产生此处所述的编码GLP-1类似物的DNA,可以在连接前或在编码整个融合蛋白质的cDNA中突变编码野生型GLP-1DNA。从特定文库克隆的全长野生型序列通常可以作为产生本发明COL3A1片段的模板,可以通过设计引物以产生此处所述的编码COL3A1片段的DNA。通过PCR技术及引物设计,可以将编码GLP-1类似物的基因和编码COL3A1类似物蛋白质的基因也通过编码富含G的接头肽的DNA在框内连接。进行全序列的化学合成也是可行的技术。可以使用PCR技术,用经设计与对应于COL3A1片段所需末端的序列杂交的引物产生该片段。也可以设计PCR引物产生限制性酶位点以便于克隆到表达载体中。
在SEQ ID NO.:10中提供编码本发明优选的异源融合蛋白质之一GLP-1-2L-COL598-896的优选DNA序列:
CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAACAAGCTGCTAAGGAATTCATTGCCTGGCTGGTCAAAGGCAGAGGAGGTGGCGGATCCGGTGGCGGTGGGTCCGGAGGAGGTGGTTCAGCTGGTCCAGGTGGTCCAGGTCCTCAAGGTCCTCCAGGTAAGAATGGTGAAACTGGTCCTCAGGGACCTCCAGGCCCAACCGGTCCTGGAGGTGATAAGGGTGATACCGGACCACCTGGCCCACAAGGCTTGCAGGGTCTGCCAGGTACAGGGGGTCCACCCGGTGAAAACGGCAAGCCTGGTGAACCAGGCCCAAAAGGTGACGCTGGAGCTCCAGGAGCCCCAGGAGGTAAGGGTGATGCTGGTGCCCCCGGTGAGAGAGGCCCACCAGGTTTGGCCGGTGCTCCCGGTCTGAGAGGGGGAGCTGGTCCACCAGGACCTGAAGGCGGAAAAGGTGCTGCTGGTCCACCTGGACCACCTGGTGCTGCCGGAACTCCAGGACTGCAGGGAATGCCTGGTGAAAGAGGCGGATTGGGATCTCCTGGCCCAAAAGGAGACAAGGGAGAGCCTGGTGGACCAGGGGCAGATGGAGTTCCTGGAAAAGATGGTCCTCGTGGTCCAACAGGACCTATCGGTCCCCCAGGACCTGCTGGTCAACCTGGAGATAAAGGTGAAGGCGGGGCTCCAGGATTGCCTGGTATTGCCGGCCCTAGAGGTTCTCCCGGTGAAAGAGGTGAGACCGGCCCACCTGGTCCAGCTGGCTTCCCTGGAGCACCAGGTCAGAATGGTGAGCCAGGTGGTAAGGGTGAGAGAGGAGCTCCAGGTGAGAAGGGGGAAGGTGGTCCACCTGGTGTTGCTGGTCCACCAGGTAAGGATGGTACATCCGGTCATCCTGGACCAATTGGACCTCCAGGGCCTAGAGGTAACAGGGGTGAAAGGGGATCTGAAGGATCTCCTGGACATCCAGGTCAGCCCGGTCCTCCTGGTCCACCCGGAGCTCCTGGGCCATGCTGTGGTGGC(SEQ ID NO.:10)
在SEQ ID NO.:11中提供编码本发明优选的异源融合蛋白质之一GLP-1-2L-COL733-896的优选DNA序列:
CACGGTGAGGGTACTTTTACCTCTGATGTTTCCTCATACTTGGAAGAACAAGCTGCTAAGGAATTCATTGCCTGGCTGGTCAAAGGCAGAGGAGGTGGCGGATCCGGTGGCGGTGGGTCCGGAGGAGGTGGTTCAGCTGGTTTGGGATCTCCTGGCCCAAAAGGAGACAAGGGAGAGCCTGGTGGACCAGGGGCAGATGGAGTTCCTGGAAAAGATGGTCCTCGTGGTCCAACAGGACCTATCGGTCCCCCAGGACCTGCTGGTCAACCTGGAGATAAAGGTGAAGGCGGGGCTCCAGGATTGCCTGGTATTGCCGGCCCTAGAGGTTCTCCCGGTGAAAGAGGTGAGACCGGCCCACCTGGTCCAGCTGGCTTCCCTGGAGCACCAGGTCAGAATGGTGAGCCAGGTGGTAAGGGTGAGAGAGGAGCTCCAGGTGAGAAGGGGGAAGGTGGTCCACCTGGTGTTGCTGGTCCACCAGGTAAGGATGGTACATCCGGTCATCCTGGACCAATTGGACCTCCAGGGCCTAGAGGTAACAGGGGTGAAAGGGGATCTGAAGGATCTCCTGGACATCCAGGTCAGCCCGGTCCTCCTGGTCCACCCGGAGCTCCTGGGCCATGCTGTGGTGGC(SEQ ID NO.:11)
表达载体和宿主细胞
本发明还提供了一种用于本发明融合蛋白的表达载体。
克隆或表达本发明核酸的宿主细胞可以是原核细胞,更优的宿主细胞包括酵母或高等真核细胞。融合蛋白基因经宿主细胞表达后,从表达产物中进行分离纯化,可用于制备糖尿病及相关疾病的治疗药物。所述相关疾病包括:Ⅱ型糖尿病、Ⅰ型糖尿病、肥胖、Ⅱ型糖尿病患者严重心血管事件和其他严重并发症等。
与现有技术相比,本发明主要具有以下优点:
本发明中涉及的GLP-1-COL3A1融合蛋白,秉着保留GLP-1降糖活性并显著延长其体内半衰期的目标,设计了一种不同于其他GLP-1类似物药物的新型分子,其具有GLP-1R激活活性、形成稳定多聚体及延长的体内半衰期。
(a)相对于天然GLP-1(7-37),本发明融合蛋白质的GLP-1类似物部分包括第8、22和36位的三个替换。内源性二肽基肽酶4(DPP-4)在第8位的Ala及第9位的Glu之间切割天然GLP-1,产生的无活性的GLP-1(9-37)片段,第8位替换为Gly后降低了GLP-1类似物对DPP-4水解的敏感性。第22位的替换提高了GLP-1类似物的活性。第37位的去除降低了融合蛋白质得免疫原性。本发明融合蛋白对DPP-4水解的敏感性低,活性非常高,免疫原性低。
(b)本发明融合蛋白含有人Ⅲ型胶原α1链,首次采用COL3A1片段与GLP-1类似物进行长效GLP-1类似物的构建,生物相容性好、可形成寡聚体、并易于表达。优选的COL3A1有利于表达的优点,构建的融合蛋白适用于毕赤酵母表达系统,生产成本低于其他GLP-1类药物采用的高级真核细胞系统,通过制备工艺的进一步优化及放大,有望得到一种价格较低廉的长效2型糖尿病治疗药物。并且,在保留GLP-1生物学活性的同时,利用融合蛋白通过α1链形成三聚体的特性,获得显著延长的体内半衰期。
(c)为了避免过长的氨基酸序列导致重组表达的困难,本发明优选了两种人COL3A1片段(COL3A1598-896及COL3A1733-8962个片段),序列分别如SEQ ID NO.:3及SEQ IDNO.:4所示。两种片段含有不同长度的Gly-X-Y三联体结构域,保留了人COL3A1形成同源三聚体的能力同时有利于本发明中异源融合蛋白的重组表达,本发明融合蛋白稳定性高。
下面结合具体实施,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件,例如Sambrook等人,分子克隆:实验室手册(New York:Cold SpringHarbor LaboratoryPress,1989)中所述的条件,或按照制造厂商所建议的条件。除非另外说明,否则百分比和份数按重量计算。
实施例1:构建编码GLP-1-COL3A1融合蛋白的DNA
编码本发明所述融合蛋白GLP-1-2L-COL598-896的基因(SEQ ID NO:8)由南京金斯瑞生物科技有限公司合成并克隆至pUC57质粒,融合基因的5’端含有XhoI酶切位点,3’端含有TAA终止密码子及NotI酶切位点,所述pUC57质粒命名为pUC57-GLP-COL-1。
用限制内核酸内切酶XhoI及NotI(购自Fermentas)按照说明书对pUC57-GLP-COLA-1进行双酶切消化,对消化后产生长度为1050bp左右的编码GLP-1-2L-COL598-896融合蛋白的基因片段进行胶回收(胶回收试剂盒购自Axygen)。同时将pPic9m用XhoI及NotI进行双酶切消化,并对消化后长度为9000bp的质粒条带进行胶回收。
上述酶切得到的融合蛋白基因片段及pPic9m质粒片段用T4DNA连接酶(购自Fermentas)进行连接,连接产物用热休克法转化大肠杆菌感受态细胞DH5α,转化产物涂布在具有卡那霉素霉素抗性的LB固体培养基上,挑取单克隆进行基因测序,确定插入基因的序列正确,获得的质粒命名为pPic9m-GLP-COL-1(如图1所示)。
相似的,构建了含有编码GLP-1-2L-COL733-896融合蛋白编码基因的融合蛋白表达载体,命名为pPic9m-GLP-COL-2(图2)。
实施例2:异源融合蛋白的表达
将实施例1中的插入有融合蛋白基因的载体进行抽提,使用电转化的方法转入毕赤酵母GS115感受态细胞。通过营养限制性培养基筛选重组子后,利用G418抗性进行高拷贝重组子的筛选。最终获得适宜进行重组表达的含有异源融合蛋白基因的重组毕赤酵母细胞。
重组毕赤酵母细胞在经过种子扩增后,接种入5L发酵罐内进行小规模制备,发酵持续5天,其中利用甲醇进行异源融合蛋白的诱导表达,诱导持续36h,发酵结束后进行发酵液的收集用于蛋白的纯化。
表1:毕赤酵母GS115种子扩增培养基组成
配方 | 含量 |
Yeast Extract | 10.0g/L |
Peptone | 20.0g/L |
KH<sub>2</sub>PO<sub>4</sub> | 11.8g/L |
K<sub>2</sub>HPO<sub>4</sub> | 3.0g/L |
甘油 | 10.0ml/L |
表2:毕赤酵母GS115发酵培养基组成
配方 | 含量 |
YNB | 0.67g/L |
CaCl<sub>2</sub> | 0.4g/L |
K<sub>2</sub>SO<sub>4</sub> | 10.0g/L |
MgSO4.7H<sub>2</sub>O | 8.0g/L |
(NH<sub>4</sub>)<sub>2</sub>SO<sub>4</sub> | 8.0g/L |
柠檬酸 | 5.0g/L |
K<sub>2</sub>HPO<sub>4</sub>.3H<sub>2</sub>O | 18.0g/L |
甘油 | 40.0ml/L |
实施例3:异源融合蛋白的纯化
两种优选的异源融合蛋白GLP-1-2L-COL598-896、GLP-1-2L-COL733-896采用相似的纯化步骤。
4L发酵培养基使用PALL 0.2μm的中空纤维过滤系统进行上清液的收集,得到约4L上清液。上清液再采用PALL 50kDa滤膜的超滤系统进行样品超滤,除去部分杂蛋白。超滤液体最后用Source30Q阴离子交换色谱进行纯化,利用0-500mM NaCl线型梯度进行样品的洗脱,最后得到纯化的异源融合蛋白。使用SDS-PAGE确定了异源融合蛋白的纯度及分子量,纯度>90%,分子量与预期相符(图3)。
实施例4:异源融合蛋白的生物活性及药代动力学研究
实施例4a、融合蛋白的生物活性研究
利用稳定转染有GLP-1R的人骨肉瘤细胞U2OS进行异源融合蛋白的活性测定,异源融合蛋白的GLP-1类似物片段与U2OS细胞上的GLP-1R受体结合会刺激细胞分泌cAMP,通过cAMP酶联测定法来检测cAMP的活性以表征融合蛋白的GLP-1活性。U2OS细胞以1.2×105/孔接种于96孔板中,用含10%FBS的DMEM培养基,在37℃、5%CO2中培养24h。撤去培养基,然后加入基础培养基培养过夜。撤去基础培养基,加入不同浓度的待测样品,包括2种融合蛋白GLP-1-2L-COL598-896、GLP-1-2L-COL733-896及人工合成的GLP-1(7-37)对照品,37℃、5%CO2中培养0.5h。使用cAMP试剂盒(购自R&D)进行细胞cAMP含量的测定,结果如图4所示。
图4的结果显示,经GLP-1-2L-COL598-896、GLP-1-2L-COL733-896刺激U2OS细胞产生的cAMP含量存在明显的剂量效应,且与GLP-1相当,确定GLP-1-2L-COL598-896、GLP-1-2L-COL733-896融合蛋白具有GLP-1相似的GLP-1R激活活性。
实施例4b、融合蛋白的药代动力学研究
在药代动力学试验中使用SD雄性大鼠,设置GLP-1-2L-COL598-896、GLP-1-2L-COL733-896及化学合成GLP-1(7-37)对照组,每组8只。按照1mg/kg剂量静脉注射,分别采集注射前及注射后不同时间点的血样:0h、0.5h、1h、2h、4h、6h、10h、24h、2d、4d、6d、8d、10d、14d、21d。采集的血清放置于-80℃保存。血清中融合蛋白的量使用GLP-1试剂盒进行检测(图5)。由图5结果可知,本发明的融合蛋白可以显著延长GLP-1类似物的体内循环半衰期。
以上所述,仅是本发明的较优实施例,并非对本发明任何形式上的限制。应指出的是,对于本技术领域内的普通技术人员对本发明做出的改进和补充,也应视为本发明的保护范围。
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。
序列表
<110> 上海惠盾生物技术有限公司
<120> 一种GLP-1类似物-COL3A1融合蛋白
<130> P2018-0197
<160> 13
<170> PatentIn version 3.5
<210> 1
<211> 31
<212> PRT
<213> 人工序列(artificial sequence)
<400> 1
His Xaa Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Xaa
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Xaa
20 25 30
<210> 2
<211> 1163
<212> PRT
<213> 智人(Homo sapiens)
<400> 2
Met Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu
1 5 10 15
His Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys
20 25 30
Ser His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu
35 40 45
Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp
50 55 60
Ile Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro
65 70 75 80
Phe Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr
85 90 95
Arg Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly
100 105 110
Pro Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln
115 120 125
Pro Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys
130 135 140
Pro Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val
145 150 155 160
Lys Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala
165 170 175
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly
180 185 190
Ser Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln
195 200 205
Ala Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser
210 215 220
Gly Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly
225 230 235 240
Glu Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile
245 250 255
Pro Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn
260 265 270
Gly Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly
275 280 285
Leu Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala
290 295 300
Pro Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg
305 310 315 320
Gly Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly
325 330 335
Pro Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu
340 345 350
Val Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg
355 360 365
Gly Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly
370 375 380
Pro Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro
385 390 395 400
Ala Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro
405 410 415
Gly Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly
420 425 430
Glu Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu
435 440 445
Arg Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp
450 455 460
Gly Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly
465 470 475 480
Ala Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro
485 490 495
Asn Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro
500 505 510
Gly Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly
515 520 525
Val Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly
530 535 540
Pro Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser
545 550 555 560
Gly Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly
565 570 575
Val Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys
580 585 590
Asn Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro
595 600 605
Gly Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly
610 615 620
Pro Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu
625 630 635 640
Gln Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro
645 650 655
Gly Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly
660 665 670
Gly Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu
675 680 685
Ala Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu
690 695 700
Gly Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly
705 710 715 720
Thr Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser
725 730 735
Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp
740 745 750
Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly
755 760 765
Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala
770 775 780
Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg
785 790 795 800
Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly
805 810 815
Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu
820 825 830
Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Lys Asp
835 840 845
Gly Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly
850 855 860
Asn Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro Gly Gln
865 870 875 880
Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys Gly Gly
885 890 895
Val Gly Ala Ala Ala Ile Ala Gly Ile Gly Gly Glu Lys Ala Gly Gly
900 905 910
Phe Ala Pro Tyr Tyr Gly Asp Glu Pro Met Asp Phe Lys Ile Asn Thr
915 920 925
Asp Glu Ile Met Thr Ser Leu Lys Ser Val Asn Gly Gln Ile Glu Ser
930 935 940
Leu Ile Ser Pro Asp Gly Ser Arg Lys Asn Pro Ala Arg Asn Cys Arg
945 950 955 960
Asp Leu Lys Phe Cys His Pro Glu Leu Lys Ser Gly Glu Tyr Trp Val
965 970 975
Asp Pro Asn Gln Gly Cys Lys Leu Asp Ala Ile Lys Val Phe Cys Asn
980 985 990
Met Glu Thr Gly Glu Thr Cys Ile Ser Ala Asn Pro Leu Asn Val Pro
995 1000 1005
Arg Lys His Trp Trp Thr Asp Ser Ser Ala Glu Lys Lys His Val
1010 1015 1020
Trp Phe Gly Glu Ser Met Asp Gly Gly Phe Gln Phe Ser Tyr Gly
1025 1030 1035
Asn Pro Glu Leu Pro Glu Asp Val Leu Asp Val Gln Leu Ala Phe
1040 1045 1050
Leu Arg Leu Leu Ser Ser Arg Ala Ser Gln Asn Ile Thr Tyr His
1055 1060 1065
Cys Lys Asn Ser Ile Ala Tyr Met Asp Gln Ala Ser Gly Asn Val
1070 1075 1080
Lys Lys Ala Leu Lys Leu Met Gly Ser Asn Glu Gly Glu Phe Lys
1085 1090 1095
Ala Glu Gly Asn Ser Lys Phe Thr Tyr Thr Val Leu Glu Asp Gly
1100 1105 1110
Cys Thr Lys His Thr Gly Glu Trp Ser Lys Thr Val Phe Glu Tyr
1115 1120 1125
Arg Thr Arg Lys Ala Val Arg Leu Pro Ile Val Asp Ile Ala Pro
1130 1135 1140
Tyr Asp Ile Gly Gly Pro Asp Gln Glu Phe Gly Val Asp Val Gly
1145 1150 1155
Pro Val Cys Phe Leu
1160
<210> 3
<211> 297
<212> PRT
<213> 智人(Homo sapiens)
<400> 3
Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro Gly Lys Asn Gly Glu
1 5 10 15
Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly Pro Gly Gly Asp Lys
20 25 30
Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu Gln Gly Leu Pro Gly
35 40 45
Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro Gly Glu Pro Gly Pro
50 55 60
Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Gly Lys Gly Asp Ala
65 70 75 80
Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu Ala Gly Ala Pro Gly
85 90 95
Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu Gly Gly Lys Gly Ala
100 105 110
Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly Thr Pro Gly Leu Gln
115 120 125
Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser Pro Lys Gly Asp Lys
130 135 140
Gly Glu Pro Gly Gly Pro Gly Ala Asp Gly Val Pro Gly Lys Asp Gly
145 150 155 160
Pro Arg Gly Pro Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Gln
165 170 175
Pro Gly Asp Lys Gly Glu Gly Gly Ala Pro Gly Leu Pro Gly Ile Ala
180 185 190
Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly Glu Thr Gly Pro Pro Gly
195 200 205
Pro Ala Gly Phe Pro Gly Ala Pro Gly Gln Asn Gly Glu Pro Gly Gly
210 215 220
Lys Gly Glu Arg Gly Ala Pro Gly Glu Lys Gly Glu Gly Gly Pro Pro
225 230 235 240
Gly Val Ala Gly Pro Pro Gly Lys Asp Gly Thr Ser Gly His Pro Gly
245 250 255
Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn Arg Gly Glu Arg Gly Ser
260 265 270
Glu Gly Ser Pro Gly His Pro Gly Gln Pro Gly Pro Pro Gly Pro Pro
275 280 285
Gly Ala Pro Gly Pro Cys Cys Gly Gly
290 295
<210> 4
<211> 162
<212> PRT
<213> 智人(Homo sapiens)
<400> 4
Gly Leu Gly Ser Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly
1 5 10 15
Ala Asp Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro
20 25 30
Ile Gly Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly
35 40 45
Gly Ala Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly
50 55 60
Glu Arg Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala
65 70 75 80
Pro Gly Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro
85 90 95
Gly Glu Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly
100 105 110
Lys Asp Gly Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro
115 120 125
Arg Gly Asn Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro
130 135 140
Gly Gln Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys
145 150 155 160
Gly Gly
<210> 5
<211> 5
<212> PRT
<213> 人工序列(artificial sequence)
<400> 5
Gly Gly Gly Gly Ser
1 5
<210> 6
<211> 31
<212> PRT
<213> 人工序列(artificial sequence)
<400> 6
His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly
20 25 30
<210> 7
<211> 11
<212> PRT
<213> 人工序列(artificial sequence)
<400> 7
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala
1 5 10
<210> 8
<211> 345
<212> PRT
<213> 人工序列(artificial sequence)
<400> 8
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Gly
20 25 30
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Pro
35 40 45
Gly Gly Pro Gly Pro Gln Gly Pro Pro Gly Lys Asn Gly Glu Thr Gly
50 55 60
Pro Gln Gly Pro Pro Gly Pro Thr Gly Pro Gly Gly Asp Lys Gly Asp
65 70 75 80
Thr Gly Pro Pro Gly Pro Gln Gly Leu Gln Gly Leu Pro Gly Thr Gly
85 90 95
Gly Pro Pro Gly Glu Asn Gly Lys Pro Gly Glu Pro Gly Pro Lys Gly
100 105 110
Asp Ala Gly Ala Pro Gly Ala Pro Gly Gly Lys Gly Asp Ala Gly Ala
115 120 125
Pro Gly Glu Arg Gly Pro Pro Gly Leu Ala Gly Ala Pro Gly Leu Arg
130 135 140
Gly Gly Ala Gly Pro Pro Gly Pro Glu Gly Gly Lys Gly Ala Ala Gly
145 150 155 160
Pro Pro Gly Pro Pro Gly Ala Ala Gly Thr Pro Gly Leu Gln Gly Met
165 170 175
Pro Gly Glu Arg Gly Gly Leu Gly Ser Pro Gly Pro Lys Gly Asp Lys
180 185 190
Gly Glu Pro Gly Gly Pro Gly Ala Asp Gly Val Pro Gly Lys Asp Gly
195 200 205
Pro Arg Gly Pro Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Gln
210 215 220
Pro Gly Asp Lys Gly Glu Gly Gly Ala Pro Gly Leu Pro Gly Ile Ala
225 230 235 240
Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly Glu Thr Gly Pro Pro Gly
245 250 255
Pro Ala Gly Phe Pro Gly Ala Pro Gly Gln Asn Gly Glu Pro Gly Gly
260 265 270
Lys Gly Glu Arg Gly Ala Pro Gly Glu Lys Gly Glu Gly Gly Pro Pro
275 280 285
Gly Val Ala Gly Pro Pro Gly Lys Asp Gly Thr Ser Gly His Pro Gly
290 295 300
Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn Arg Gly Glu Arg Gly Ser
305 310 315 320
Glu Gly Ser Pro Gly His Pro Gly Gln Pro Gly Pro Pro Gly Pro Pro
325 330 335
Gly Ala Pro Gly Pro Cys Cys Gly Gly
340 345
<210> 9
<211> 210
<212> PRT
<213> 人工序列(artificial sequence)
<400> 9
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Gly
20 25 30
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Leu
35 40 45
Gly Ser Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly
50 55 60
Ala Asp Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro
65 70 75 80
Ile Gly Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly
85 90 95
Gly Ala Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly
100 105 110
Glu Arg Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala
115 120 125
Pro Gly Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro
130 135 140
Gly Glu Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly
145 150 155 160
Lys Asp Gly Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro
165 170 175
Arg Gly Asn Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro
180 185 190
Gly Gln Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys
195 200 205
Gly Gly
210
<210> 10
<211> 1035
<212> DNA
<213> 人工序列(artificial sequence)
<400> 10
cacggtgagg gtacttttac ctctgatgtt tcctcatact tggaagaaca agctgctaag 60
gaattcattg cctggctggt caaaggcaga ggaggtggcg gatccggtgg cggtgggtcc 120
ggaggaggtg gttcagctgg tccaggtggt ccaggtcctc aaggtcctcc aggtaagaat 180
ggtgaaactg gtcctcaggg acctccaggc ccaaccggtc ctggaggtga taagggtgat 240
accggaccac ctggcccaca aggcttgcag ggtctgccag gtacaggggg tccacccggt 300
gaaaacggca agcctggtga accaggccca aaaggtgacg ctggagctcc aggagcccca 360
ggaggtaagg gtgatgctgg tgcccccggt gagagaggcc caccaggttt ggccggtgct 420
cccggtctga gagggggagc tggtccacca ggacctgaag gcggaaaagg tgctgctggt 480
ccacctggac cacctggtgc tgccggaact ccaggactgc agggaatgcc tggtgaaaga 540
ggcggattgg gatctcctgg cccaaaagga gacaagggag agcctggtgg accaggggca 600
gatggagttc ctggaaaaga tggtcctcgt ggtccaacag gacctatcgg tcccccagga 660
cctgctggtc aacctggaga taaaggtgaa ggcggggctc caggattgcc tggtattgcc 720
ggccctagag gttctcccgg tgaaagaggt gagaccggcc cacctggtcc agctggcttc 780
cctggagcac caggtcagaa tggtgagcca ggtggtaagg gtgagagagg agctccaggt 840
gagaaggggg aaggtggtcc acctggtgtt gctggtccac caggtaagga tggtacatcc 900
ggtcatcctg gaccaattgg acctccaggg cctagaggta acaggggtga aaggggatct 960
gaaggatctc ctggacatcc aggtcagccc ggtcctcctg gtccacccgg agctcctggg 1020
ccatgctgtg gtggc 1035
<210> 11
<211> 630
<212> DNA
<213> 人工序列(artificial sequence)
<400> 11
cacggtgagg gtacttttac ctctgatgtt tcctcatact tggaagaaca agctgctaag 60
gaattcattg cctggctggt caaaggcaga ggaggtggcg gatccggtgg cggtgggtcc 120
ggaggaggtg gttcagctgg tttgggatct cctggcccaa aaggagacaa gggagagcct 180
ggtggaccag gggcagatgg agttcctgga aaagatggtc ctcgtggtcc aacaggacct 240
atcggtcccc caggacctgc tggtcaacct ggagataaag gtgaaggcgg ggctccagga 300
ttgcctggta ttgccggccc tagaggttct cccggtgaaa gaggtgagac cggcccacct 360
ggtccagctg gcttccctgg agcaccaggt cagaatggtg agccaggtgg taagggtgag 420
agaggagctc caggtgagaa gggggaaggt ggtccacctg gtgttgctgg tccaccaggt 480
aaggatggta catccggtca tcctggacca attggacctc cagggcctag aggtaacagg 540
ggtgaaaggg gatctgaagg atctcctgga catccaggtc agcccggtcc tcctggtcca 600
cccggagctc ctgggccatg ctgtggtggc 630
<210> 12
<211> 30
<212> PRT
<213> 人工序列(artificial sequence)
<400> 12
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg
20 25 30
<210> 13
<211> 1210
<212> PRT
<213> 人工序列(artificial sequence)
<400> 13
His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu
1 5 10 15
Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly Gly
20 25 30
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Ala Met
35 40 45
Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu His
50 55 60
Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys Ser
65 70 75 80
His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu Pro
85 90 95
Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp Ile
100 105 110
Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro Phe
115 120 125
Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr Arg
130 135 140
Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly Pro
145 150 155 160
Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln Pro
165 170 175
Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys Pro
180 185 190
Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val Lys
195 200 205
Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala Gly
210 215 220
Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly Ser
225 230 235 240
Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln Ala
245 250 255
Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser Gly
260 265 270
Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly Glu
275 280 285
Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile Pro
290 295 300
Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn Gly
305 310 315 320
Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly Leu
325 330 335
Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala Pro
340 345 350
Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg Gly
355 360 365
Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly Pro
370 375 380
Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu Val
385 390 395 400
Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg Gly
405 410 415
Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly Pro
420 425 430
Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro Ala
435 440 445
Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro Gly
450 455 460
Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Glu
465 470 475 480
Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu Arg
485 490 495
Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp Gly
500 505 510
Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly Ala
515 520 525
Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro Asn
530 535 540
Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly
545 550 555 560
Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly Val
565 570 575
Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly Pro
580 585 590
Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser Gly
595 600 605
Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly Val
610 615 620
Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys Asn
625 630 635 640
Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro Gly
645 650 655
Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly Pro
660 665 670
Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu Gln
675 680 685
Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro Gly
690 695 700
Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Gly
705 710 715 720
Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu Ala
725 730 735
Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu Gly
740 745 750
Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly Thr
755 760 765
Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser Pro
770 775 780
Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp Gly
785 790 795 800
Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly Pro
805 810 815
Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala Pro
820 825 830
Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly
835 840 845
Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly Gln
850 855 860
Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu Lys
865 870 875 880
Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Lys Asp Gly
885 890 895
Thr Ser Gly His Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn
900 905 910
Arg Gly Glu Arg Gly Ser Glu Gly Ser Pro Gly His Pro Gly Gln Pro
915 920 925
Gly Pro Pro Gly Pro Pro Gly Ala Pro Gly Pro Cys Cys Gly Gly Val
930 935 940
Gly Ala Ala Ala Ile Ala Gly Ile Gly Gly Glu Lys Ala Gly Gly Phe
945 950 955 960
Ala Pro Tyr Tyr Gly Asp Glu Pro Met Asp Phe Lys Ile Asn Thr Asp
965 970 975
Glu Ile Met Thr Ser Leu Lys Ser Val Asn Gly Gln Ile Glu Ser Leu
980 985 990
Ile Ser Pro Asp Gly Ser Arg Lys Asn Pro Ala Arg Asn Cys Arg Asp
995 1000 1005
Leu Lys Phe Cys His Pro Glu Leu Lys Ser Gly Glu Tyr Trp Val
1010 1015 1020
Asp Pro Asn Gln Gly Cys Lys Leu Asp Ala Ile Lys Val Phe Cys
1025 1030 1035
Asn Met Glu Thr Gly Glu Thr Cys Ile Ser Ala Asn Pro Leu Asn
1040 1045 1050
Val Pro Arg Lys His Trp Trp Thr Asp Ser Ser Ala Glu Lys Lys
1055 1060 1065
His Val Trp Phe Gly Glu Ser Met Asp Gly Gly Phe Gln Phe Ser
1070 1075 1080
Tyr Gly Asn Pro Glu Leu Pro Glu Asp Val Leu Asp Val Gln Leu
1085 1090 1095
Ala Phe Leu Arg Leu Leu Ser Ser Arg Ala Ser Gln Asn Ile Thr
1100 1105 1110
Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Gln Ala Ser Gly
1115 1120 1125
Asn Val Lys Lys Ala Leu Lys Leu Met Gly Ser Asn Glu Gly Glu
1130 1135 1140
Phe Lys Ala Glu Gly Asn Ser Lys Phe Thr Tyr Thr Val Leu Glu
1145 1150 1155
Asp Gly Cys Thr Lys His Thr Gly Glu Trp Ser Lys Thr Val Phe
1160 1165 1170
Glu Tyr Arg Thr Arg Lys Ala Val Arg Leu Pro Ile Val Asp Ile
1175 1180 1185
Ala Pro Tyr Asp Ile Gly Gly Pro Asp Gln Glu Phe Gly Val Asp
1190 1195 1200
Val Gly Pro Val Cys Phe Leu
1205 1210
Claims (10)
1.一种融合蛋白,其特征在于,所述融合蛋白的结构如下式I所示:
A-L-B (I)
式中,A为GLP-1类似物,B为人Ⅲ型胶原α1链,L为无或连接肽,各“-”独立地为连接肽或肽键;且
所述GLP-1类似物具有SEQ ID NO.:1所示氨基酸序列的多肽,
His-Xaa8-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Xaa22-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Arg-Xaa37
其中Xaa8是Gly或Ala,Xaa22是Glu或Gly,Xaa37是Gly或无。
2.如权利要求1所述的融合蛋白,其特征在于,所述GLP-1类似物具有如SEQ ID NO.:6或12所示的氨基酸序列。
3.如权利要求1所述的融合蛋白,其特征在于,所述人Ⅲ型胶原α1链具有SEQ ID NO.:2、3或4所示的氨基酸序列。
4.如权利要求1所述的融合蛋白,其特征在于,所述融合蛋白具有SEQ ID NO.:8、9或13所示的氨基酸序列。
5.一种寡聚体,其特征在于,所述寡聚体包含权利要求1所述的融合蛋白。
6.一种分离的多核苷酸,其特征在于,所述多核苷酸编码权利要求1所述的融合蛋白。
7.一种载体,其特征在于,所述载体包括权利要求6所述的多核苷酸。
8.一种宿主细胞,其特征在于,所述的宿主细胞含有权利要求7所述的载体、或染色体中整合有外源的权利要求6所述的多核苷酸、或表达权利要求1所述的融合蛋白、或表达权利要求5所述的寡聚体。
9.一种药物组合物,其特征在于,所述药物组合物包括权利要求1所述的融合蛋白或权利要求5所述的寡聚体,以及药学上可接受的载体或赋形剂。
10.如权利要求1所述的融合蛋白、权利要求5所述的寡聚体、权利要求6所述的多核苷酸、权利要求7所述的载体、权利要求8所述的宿主细胞,其特征在于,用于制备预防和/或治疗糖尿病的药物或制剂。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810089639.XA CN110092835A (zh) | 2018-01-30 | 2018-01-30 | 一种glp-1类似物-col3a1融合蛋白 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810089639.XA CN110092835A (zh) | 2018-01-30 | 2018-01-30 | 一种glp-1类似物-col3a1融合蛋白 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110092835A true CN110092835A (zh) | 2019-08-06 |
Family
ID=67442488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810089639.XA Withdrawn CN110092835A (zh) | 2018-01-30 | 2018-01-30 | 一种glp-1类似物-col3a1融合蛋白 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110092835A (zh) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114539390A (zh) * | 2022-03-02 | 2022-05-27 | 广州美神生物科技有限公司 | 一种重组ⅲ型人源化胶原蛋白c3及其表达载体、表达菌株、表达方法和应用 |
CN114805551A (zh) * | 2022-06-28 | 2022-07-29 | 华熙生物科技股份有限公司 | 一种重组iii型胶原蛋白及其制备方法 |
CN116284340A (zh) * | 2023-02-01 | 2023-06-23 | 美尔健(深圳)生物科技有限公司 | 基于伴侣肽的透皮增强型重组人源三型胶原蛋白及其应用 |
CN116813751A (zh) * | 2023-08-04 | 2023-09-29 | 山西锦波生物医药股份有限公司 | 自交联重组人源化胶原蛋白高分子生物材料及其制备方法 |
CN116874590A (zh) * | 2023-08-16 | 2023-10-13 | 医械妆(广州)技术服务有限公司 | 一种重组iii型胶原蛋白及其制备方法 |
CN117384276A (zh) * | 2023-12-11 | 2024-01-12 | 上海昱菘医药科技有限公司 | 一种重组胶原蛋白及其制备方法和用途 |
CN118359703A (zh) * | 2024-06-19 | 2024-07-19 | 山东福瑞达生物股份有限公司 | 一种重组透皮类胶原蛋白及其制备方法和应用 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030064436A1 (en) * | 1996-10-29 | 2003-04-03 | Vaughan Paul Richard | Method for producing, in yeast, a hydroxylated triple helical protein, and yeast host cells useful in said method |
CN1483041A (zh) * | 2000-12-07 | 2004-03-17 | Glp-1融合蛋白 | |
CN102164949A (zh) * | 2009-11-19 | 2011-08-24 | 浙江大学 | 新型重组融合蛋白 |
CN103641896A (zh) * | 2009-11-19 | 2014-03-19 | 浙江大学 | 明胶样单元的用途 |
CN104870478A (zh) * | 2012-12-24 | 2015-08-26 | 北京安信怀德生物技术有限公司 | 具有改善的药代动力学性质的治疗性多肽融合蛋白及其应用 |
US20160194370A1 (en) * | 2012-12-24 | 2016-07-07 | Beijing Anxinhuaide Biotech. Co., Ltd. | Fusion protein of therapeutic polypeptide with improved pharmacokinetic profile and use therof |
-
2018
- 2018-01-30 CN CN201810089639.XA patent/CN110092835A/zh not_active Withdrawn
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030064436A1 (en) * | 1996-10-29 | 2003-04-03 | Vaughan Paul Richard | Method for producing, in yeast, a hydroxylated triple helical protein, and yeast host cells useful in said method |
CN1483041A (zh) * | 2000-12-07 | 2004-03-17 | Glp-1融合蛋白 | |
CN102164949A (zh) * | 2009-11-19 | 2011-08-24 | 浙江大学 | 新型重组融合蛋白 |
US20130203959A1 (en) * | 2009-11-19 | 2013-08-08 | Zhejiang University | Nonnatural collagen-like protein and use thereof |
CN103641896A (zh) * | 2009-11-19 | 2014-03-19 | 浙江大学 | 明胶样单元的用途 |
CN104870478A (zh) * | 2012-12-24 | 2015-08-26 | 北京安信怀德生物技术有限公司 | 具有改善的药代动力学性质的治疗性多肽融合蛋白及其应用 |
US20160194370A1 (en) * | 2012-12-24 | 2016-07-07 | Beijing Anxinhuaide Biotech. Co., Ltd. | Fusion protein of therapeutic polypeptide with improved pharmacokinetic profile and use therof |
Non-Patent Citations (2)
Title |
---|
"COL3A1 protein [Homo sapiens]" * |
LEENA ALA-KOKKO等: "Structure of cDNA clones coding for the entire preproxl(III) chain of human type III procollagen. Differences in protein structure from type I procollagen and conservation of codon preferences" * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114539390A (zh) * | 2022-03-02 | 2022-05-27 | 广州美神生物科技有限公司 | 一种重组ⅲ型人源化胶原蛋白c3及其表达载体、表达菌株、表达方法和应用 |
CN114805551A (zh) * | 2022-06-28 | 2022-07-29 | 华熙生物科技股份有限公司 | 一种重组iii型胶原蛋白及其制备方法 |
CN116284340A (zh) * | 2023-02-01 | 2023-06-23 | 美尔健(深圳)生物科技有限公司 | 基于伴侣肽的透皮增强型重组人源三型胶原蛋白及其应用 |
CN116284340B (zh) * | 2023-02-01 | 2024-06-25 | 美尔健(深圳)生物科技有限公司 | 基于伴侣肽的透皮增强型重组人源三型胶原蛋白及其应用 |
CN116813751A (zh) * | 2023-08-04 | 2023-09-29 | 山西锦波生物医药股份有限公司 | 自交联重组人源化胶原蛋白高分子生物材料及其制备方法 |
CN116874590A (zh) * | 2023-08-16 | 2023-10-13 | 医械妆(广州)技术服务有限公司 | 一种重组iii型胶原蛋白及其制备方法 |
CN116874590B (zh) * | 2023-08-16 | 2024-06-07 | 医械妆(广州)技术服务有限公司 | 一种重组iii型胶原蛋白及其制备方法 |
CN117384276A (zh) * | 2023-12-11 | 2024-01-12 | 上海昱菘医药科技有限公司 | 一种重组胶原蛋白及其制备方法和用途 |
CN118359703A (zh) * | 2024-06-19 | 2024-07-19 | 山东福瑞达生物股份有限公司 | 一种重组透皮类胶原蛋白及其制备方法和应用 |
CN118359703B (zh) * | 2024-06-19 | 2024-08-16 | 山东福瑞达生物股份有限公司 | 一种重组透皮类胶原蛋白及其制备方法和应用 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110092835A (zh) | 一种glp-1类似物-col3a1融合蛋白 | |
CN114805551B (zh) | 一种重组iii型胶原蛋白及其制备方法 | |
CN113502296B (zh) | 一种表达司美鲁肽前体的重组工程菌及其构建方法 | |
WO2012062078A1 (zh) | 人成纤维细胞生长因子21的n末端缺失型变异体及其偶联物 | |
CA2565300A1 (en) | Fgf-21 fusion proteins | |
CN106432509B (zh) | 一种治疗代谢疾病的重组人成纤维细胞生长因子21融合蛋白及其制备方法和应用 | |
US20210253662A1 (en) | Long-Acting Recombinant GLP1-Fc-CD47 Protein and Preparation and Use Thereof | |
CN101875700B (zh) | 一种增加促胰岛素分泌肽融合蛋白生物活性的方法 | |
CN106397607A (zh) | 重组人成纤维细胞生长因子21融合蛋白及其在制备治疗代谢疾病药物中的应用 | |
CN113683680A (zh) | 一种重组ⅰ型人源化胶原蛋白c1l1t及其制备方法和用途 | |
CN113683679A (zh) | 一种重组i型人源化胶原蛋白c1l6t及其制备方法和用途 | |
CN113278060B (zh) | Glp-1/胰高血糖素双重激动剂融合蛋白 | |
CN113265007A (zh) | 一种治疗代谢疾病的融合蛋白及其制备方法和应用 | |
CN113583142A (zh) | 双靶点融合蛋白、编码基因、载体或宿主细胞及其应用与表达和纯化方法 | |
CN115991793A (zh) | 具有多重活性的融合蛋白及其应用 | |
CN110172103B (zh) | GLP-1类似物-Fc融合蛋白及其制备方法和用途 | |
CN113105561B (zh) | 一种双靶点融合蛋白的制备方法和应用 | |
CN112851791B (zh) | 一种新型抗代谢紊乱的fgf类似物及其应用 | |
JP6612360B2 (ja) | 薬用作用を有する融合タンパク質複合体および融合タンパク質 | |
CN105884901B (zh) | 具持续控制血糖含量功能的重组人血清白蛋白/胰高糖素类肽融合蛋白 | |
CN108794634A (zh) | 重组的长效人生长激素融合蛋白及其制备和用途 | |
CN101062948B (zh) | 单体速效胰岛素及其制法和用途 | |
CN1692121A (zh) | 一种编码鸡ⅱ型胶原蛋白的全长多核苷酸序列及其用途 | |
CN116284441B (zh) | 具有三重活性的融合蛋白及其应用 | |
JPH0380096A (ja) | モチリン様ポリペプチドの製法並びにそのための組換えdna及び発現用プラスミド |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210618 Address after: Room 205, West District, 2nd floor, no.707 Zhangyang Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 200120 Applicant after: Shanghai Huidun Yintai Biotechnology Co.,Ltd. Address before: 200433 rooms 309, 311, 313, 316, No. 135, Guowei Road, Yangpu District, Shanghai Applicant before: SHANGHAI HUIDUN BIOTECHNOLOGY Co.,Ltd. |
|
WW01 | Invention patent application withdrawn after publication | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20190806 |