CA2274799A1 - Anti-obesity proteins - Google Patents
Anti-obesity proteins Download PDFInfo
- Publication number
- CA2274799A1 CA2274799A1 CA002274799A CA2274799A CA2274799A1 CA 2274799 A1 CA2274799 A1 CA 2274799A1 CA 002274799 A CA002274799 A CA 002274799A CA 2274799 A CA2274799 A CA 2274799A CA 2274799 A1 CA2274799 A1 CA 2274799A1
- Authority
- CA
- Canada
- Prior art keywords
- asp
- leu
- ser
- lys
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 248
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 222
- 230000003579 anti-obesity Effects 0.000 title abstract description 11
- 208000008589 Obesity Diseases 0.000 claims abstract description 30
- 235000020824 obesity Nutrition 0.000 claims abstract description 30
- 238000000034 method Methods 0.000 claims description 35
- 150000001413 amino acids Chemical class 0.000 claims description 34
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 claims description 24
- 108010086434 alanyl-seryl-glycine Proteins 0.000 claims description 21
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 claims description 19
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 claims description 18
- 150000003839 salts Chemical class 0.000 claims description 18
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 claims description 17
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 claims description 17
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 claims description 17
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 claims description 16
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 claims description 15
- 238000006467 substitution reaction Methods 0.000 claims description 15
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 claims description 14
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 claims description 14
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 claims description 14
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 claims description 14
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 claims description 14
- 108010017391 lysylvaline Proteins 0.000 claims description 14
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 claims description 13
- SOIAHPSKKUYREP-CIUDSAMLSA-N Gln-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SOIAHPSKKUYREP-CIUDSAMLSA-N 0.000 claims description 13
- 108010078144 glutaminyl-glycine Proteins 0.000 claims description 13
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 claims description 12
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 claims description 12
- 108010049041 glutamylalanine Proteins 0.000 claims description 12
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 claims description 11
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 claims description 11
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 claims description 11
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 claims description 11
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 claims description 11
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 claims description 11
- 108010087823 glycyltyrosine Proteins 0.000 claims description 11
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 claims description 10
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 claims description 10
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 claims description 10
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 claims description 9
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 claims description 9
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 claims description 8
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 claims description 8
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 claims description 8
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 claims description 8
- 108010061238 threonyl-glycine Proteins 0.000 claims description 8
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 claims description 7
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 claims description 7
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 claims description 7
- 241000124008 Mammalia Species 0.000 claims description 7
- UASDAHIAHBRZQV-YUMQZZPRSA-N Met-Arg Chemical group CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N UASDAHIAHBRZQV-YUMQZZPRSA-N 0.000 claims description 7
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 claims description 7
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 claims description 6
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 claims description 6
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 claims description 6
- 108010040443 aspartyl-aspartic acid Proteins 0.000 claims description 6
- 108010085325 histidylproline Proteins 0.000 claims description 6
- 230000001131 transforming effect Effects 0.000 claims description 6
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 claims description 5
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 claims description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 claims description 5
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 claims description 5
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 claims description 5
- 108010029020 prolylglycine Proteins 0.000 claims description 5
- 108010048818 seryl-histidine Proteins 0.000 claims description 5
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 claims description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 3
- 239000008194 pharmaceutical composition Substances 0.000 claims description 3
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 3
- 239000000969 carrier Substances 0.000 claims description 2
- 239000003085 diluting agent Substances 0.000 claims description 2
- 125000002228 disulfide group Chemical group 0.000 claims description 2
- 150000001875 compounds Chemical class 0.000 claims 2
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 claims 1
- PESQCPHRXOFIPX-RYUDHWBXSA-N Met-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-RYUDHWBXSA-N 0.000 claims 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 claims 1
- 239000013543 active substance Substances 0.000 claims 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 abstract description 4
- 208000024172 Cardiovascular disease Diseases 0.000 abstract description 3
- 201000011529 cardiovascular cancer Diseases 0.000 abstract description 3
- 235000018102 proteins Nutrition 0.000 description 189
- 210000004027 cell Anatomy 0.000 description 55
- 239000013612 plasmid Substances 0.000 description 38
- 235000001014 amino acid Nutrition 0.000 description 34
- 229940024606 amino acid Drugs 0.000 description 34
- 108020004414 DNA Proteins 0.000 description 30
- 239000013598 vector Substances 0.000 description 28
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 22
- 239000012634 fragment Substances 0.000 description 21
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 19
- MFBYPDKTAJXHNI-VKHMYHEASA-N Gly-Cys Chemical compound [NH3+]CC(=O)N[C@@H](CS)C([O-])=O MFBYPDKTAJXHNI-VKHMYHEASA-N 0.000 description 18
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 16
- 239000000872 buffer Substances 0.000 description 16
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 15
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 15
- 239000002953 phosphate buffered saline Substances 0.000 description 15
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 14
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 14
- 239000000203 mixture Substances 0.000 description 14
- 108090000765 processed proteins & peptides Proteins 0.000 description 14
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 13
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 13
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 13
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 12
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 12
- 108091034117 Oligonucleotide Proteins 0.000 description 12
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 12
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 12
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 12
- 239000013604 expression vector Substances 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 11
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 11
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 11
- 239000002299 complementary DNA Substances 0.000 description 11
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 10
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 10
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 10
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 239000013613 expression plasmid Substances 0.000 description 10
- 108010025306 histidylleucine Proteins 0.000 description 10
- 108020004707 nucleic acids Proteins 0.000 description 10
- 102000039446 nucleic acids Human genes 0.000 description 10
- 150000007523 nucleic acids Chemical class 0.000 description 10
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 10
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 9
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 9
- 241000282414 Homo sapiens Species 0.000 description 9
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 9
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 9
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 9
- 241000699670 Mus sp. Species 0.000 description 9
- 108091000080 Phosphotransferase Proteins 0.000 description 9
- 108020004511 Recombinant DNA Proteins 0.000 description 9
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 9
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 9
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 9
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 9
- 102000020233 phosphotransferase Human genes 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 238000011282 treatment Methods 0.000 description 9
- -1 4-hydroxymethyl-phenylacetamidomethyl Chemical group 0.000 description 8
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 8
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 8
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 8
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 8
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 8
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 8
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 8
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 8
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 8
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 8
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 8
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 8
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 8
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 8
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 8
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 8
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 8
- 239000003623 enhancer Substances 0.000 description 8
- RLSSMJSEOOYNOY-UHFFFAOYSA-N m-cresol Chemical compound CC1=CC=CC(O)=C1 RLSSMJSEOOYNOY-UHFFFAOYSA-N 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 7
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 7
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 7
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 7
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 7
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 7
- 239000004098 Tetracycline Substances 0.000 description 7
- 238000003776 cleavage reaction Methods 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 235000012631 food intake Nutrition 0.000 description 7
- 238000009472 formulation Methods 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- 239000011347 resin Substances 0.000 description 7
- 229920005989 resin Polymers 0.000 description 7
- 230000007017 scission Effects 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 229960002180 tetracycline Drugs 0.000 description 7
- 229930101283 tetracycline Natural products 0.000 description 7
- 235000019364 tetracycline Nutrition 0.000 description 7
- 150000003522 tetracyclines Chemical class 0.000 description 7
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 6
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 6
- QMMFVYPAHWMCMS-UHFFFAOYSA-N Dimethyl sulfide Chemical compound CSC QMMFVYPAHWMCMS-UHFFFAOYSA-N 0.000 description 6
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 6
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 6
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 6
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 6
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 6
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 6
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 6
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 6
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 6
- 230000002776 aggregation Effects 0.000 description 6
- 238000004220 aggregation Methods 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- 239000013600 plasmid vector Substances 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 5
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 5
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 5
- 102000012410 DNA Ligases Human genes 0.000 description 5
- 108010061982 DNA Ligases Proteins 0.000 description 5
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 5
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 5
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 5
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 5
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 5
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 5
- 239000004202 carbamide Substances 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 210000003292 kidney cell Anatomy 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- 239000003755 preservative agent Substances 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 238000001542 size-exclusion chromatography Methods 0.000 description 5
- 239000007790 solid phase Substances 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 4
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 4
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241001646716 Escherichia coli K-12 Species 0.000 description 4
- XITLYYAIPBBHPX-ZKWXMUAHSA-N Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O XITLYYAIPBBHPX-ZKWXMUAHSA-N 0.000 description 4
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 4
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 4
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 4
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 4
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 4
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 4
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 4
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 4
- UXBZYLSMYOATLH-DCAQKATOSA-N Val-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C UXBZYLSMYOATLH-DCAQKATOSA-N 0.000 description 4
- FMPUTJLSQLIZJJ-DCAQKATOSA-N Val-Leu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FMPUTJLSQLIZJJ-DCAQKATOSA-N 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000037396 body weight Effects 0.000 description 4
- 208000035475 disorder Diseases 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 229910001629 magnesium chloride Inorganic materials 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 230000004060 metabolic process Effects 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol group Chemical group C1(=CC=CC=C1)O ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000010188 recombinant method Methods 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 3
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 3
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 3
- 208000031648 Body Weight Changes Diseases 0.000 description 3
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 3
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- PFOUFRJYHWZJKW-NKIYYHGXSA-N His-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O PFOUFRJYHWZJKW-NKIYYHGXSA-N 0.000 description 3
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 3
- JYOAXOMPIXKMKK-YUMQZZPRSA-N Leu-Gln Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CCC(N)=O JYOAXOMPIXKMKK-YUMQZZPRSA-N 0.000 description 3
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 3
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 3
- 229930193140 Neomycin Natural products 0.000 description 3
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 3
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 3
- IWADHXDXSQONEL-GUBZILKMSA-N Val-Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O IWADHXDXSQONEL-GUBZILKMSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 210000000577 adipose tissue Anatomy 0.000 description 3
- 125000003275 alpha amino acid group Chemical group 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000004579 body weight change Effects 0.000 description 3
- 239000013599 cloning vector Substances 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 239000002537 cosmetic Substances 0.000 description 3
- 230000001186 cumulative effect Effects 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- 206010012601 diabetes mellitus Diseases 0.000 description 3
- 238000000502 dialysis Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 3
- 239000008187 granular material Substances 0.000 description 3
- 229940088597 hormone Drugs 0.000 description 3
- 239000005556 hormone Substances 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 229960004927 neomycin Drugs 0.000 description 3
- 239000002831 pharmacologic agent Substances 0.000 description 3
- 230000002335 preservative effect Effects 0.000 description 3
- 239000012460 protein solution Substances 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 235000019786 weight gain Nutrition 0.000 description 3
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 2
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 2
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 2
- WLHCBQAPPJAULW-UHFFFAOYSA-N 4-methylbenzenethiol Chemical compound CC1=CC=C(S)C=C1 WLHCBQAPPJAULW-UHFFFAOYSA-N 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 2
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 2
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- QFOHBWFCKVYLES-UHFFFAOYSA-N Butylparaben Chemical compound CCCCOC(=O)C1=CC=C(O)C=C1 QFOHBWFCKVYLES-UHFFFAOYSA-N 0.000 description 2
- 101100392772 Caenorhabditis elegans gln-2 gene Proteins 0.000 description 2
- 241000282552 Chlorocebus aethiops Species 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 101150074155 DHFR gene Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- 241001522878 Escherichia coli B Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- 102000005731 Glucose-6-phosphate isomerase Human genes 0.000 description 2
- 108010070600 Glucose-6-phosphate isomerase Proteins 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 2
- 102000003839 Human Proteins Human genes 0.000 description 2
- 108090000144 Human Proteins Proteins 0.000 description 2
- HZYHBDVRCBDJJV-HAFWLYHUSA-N Ile-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O HZYHBDVRCBDJJV-HAFWLYHUSA-N 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- ZTUWCZQOKOJGEX-DCAQKATOSA-N Leu-Ala-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O ZTUWCZQOKOJGEX-DCAQKATOSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 2
- XBZOQGHZGQLEQO-IUCAKERBSA-N Lys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN XBZOQGHZGQLEQO-IUCAKERBSA-N 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- OCYROESYHWUPBP-CIUDSAMLSA-N Pro-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 OCYROESYHWUPBP-CIUDSAMLSA-N 0.000 description 2
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- 108010076181 Proinsulin Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 2
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 2
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 2
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 2
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- 101150003725 TK gene Proteins 0.000 description 2
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 102000006601 Thymidine Kinase Human genes 0.000 description 2
- 108020004440 Thymidine kinase Proteins 0.000 description 2
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 2
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 239000000883 anti-obesity agent Substances 0.000 description 2
- 229940125710 antiobesity agent Drugs 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 239000000337 buffer salt Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 230000037406 food intake Effects 0.000 description 2
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 230000002267 hypothalamic effect Effects 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 229940125396 insulin Drugs 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 230000003520 lipogenic effect Effects 0.000 description 2
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 2
- HPNSFSBZBAHARI-UHFFFAOYSA-N micophenolic acid Natural products OC1=C(CC=C(C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-UHFFFAOYSA-N 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- HPNSFSBZBAHARI-RUDMXATFSA-N mycophenolic acid Chemical compound OC1=C(C\C=C(/C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-RUDMXATFSA-N 0.000 description 2
- 229960000951 mycophenolic acid Drugs 0.000 description 2
- 239000002858 neurotransmitter agent Substances 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 238000013116 obese mouse model Methods 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 125000001151 peptidyl group Chemical group 0.000 description 2
- 230000000704 physical effect Effects 0.000 description 2
- 239000002504 physiological saline solution Substances 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- QELSKZZBTMNZEB-UHFFFAOYSA-N propylparaben Chemical compound CCCOC(=O)C1=CC=C(O)C=C1 QELSKZZBTMNZEB-UHFFFAOYSA-N 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 1
- SGWFGVQCRDTUQN-UHFFFAOYSA-N (2-prop-2-ynoyloxy-3-prop-2-ynoylsulfanylpropyl) prop-2-ynoate Chemical compound C#CC(=O)OCC(OC(=O)C#C)CSC(=O)C#C SGWFGVQCRDTUQN-UHFFFAOYSA-N 0.000 description 1
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- OFHXPCLWHLXQHT-JKQORVJESA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN OFHXPCLWHLXQHT-JKQORVJESA-N 0.000 description 1
- UHPQFNXOFFPHJW-UHFFFAOYSA-N (4-methylphenyl)-phenylmethanamine Chemical compound C1=CC(C)=CC=C1C(N)C1=CC=CC=C1 UHPQFNXOFFPHJW-UHFFFAOYSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- JMTMSDXUXJISAY-UHFFFAOYSA-N 2H-benzotriazol-4-ol Chemical class OC1=CC=CC2=C1N=NN2 JMTMSDXUXJISAY-UHFFFAOYSA-N 0.000 description 1
- BHQCQFFYRZLCQQ-UHFFFAOYSA-N 4-(3,7,12-trihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl)pentanoic acid Chemical compound OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 BHQCQFFYRZLCQQ-UHFFFAOYSA-N 0.000 description 1
- QCQCHGYLTSGIGX-GHXANHINSA-N 4-[[(3ar,5ar,5br,7ar,9s,11ar,11br,13as)-5a,5b,8,8,11a-pentamethyl-3a-[(5-methylpyridine-3-carbonyl)amino]-2-oxo-1-propan-2-yl-4,5,6,7,7a,9,10,11,11b,12,13,13a-dodecahydro-3h-cyclopenta[a]chrysen-9-yl]oxy]-2,2-dimethyl-4-oxobutanoic acid Chemical compound N([C@@]12CC[C@@]3(C)[C@]4(C)CC[C@H]5C(C)(C)[C@@H](OC(=O)CC(C)(C)C(O)=O)CC[C@]5(C)[C@H]4CC[C@@H]3C1=C(C(C2)=O)C(C)C)C(=O)C1=CN=CC(C)=C1 QCQCHGYLTSGIGX-GHXANHINSA-N 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 101710187573 Alcohol dehydrogenase 2 Proteins 0.000 description 1
- 101710133776 Alcohol dehydrogenase class-3 Proteins 0.000 description 1
- 102000004400 Aminopeptidases Human genes 0.000 description 1
- 108090000915 Aminopeptidases Proteins 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 1
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 1
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108700003860 Bacterial Genes Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 102100021257 Beta-secretase 1 Human genes 0.000 description 1
- 101710150192 Beta-secretase 1 Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 101000583086 Bunodosoma granuliferum Delta-actitoxin-Bgr2b Proteins 0.000 description 1
- 101100163949 Caenorhabditis elegans asp-3 gene Proteins 0.000 description 1
- 101100455752 Caenorhabditis elegans lys-3 gene Proteins 0.000 description 1
- 101100289888 Caenorhabditis elegans lys-5 gene Proteins 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 102000003902 Cathepsin C Human genes 0.000 description 1
- 108090000267 Cathepsin C Proteins 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- OMFXVFTZEKFJBZ-UHFFFAOYSA-N Corticosterone Natural products O=C1CCC2(C)C3C(O)CC(C)(C(CC4)C(=O)CO)C4C3CCC2=C1 OMFXVFTZEKFJBZ-UHFFFAOYSA-N 0.000 description 1
- 239000000055 Corticotropin-Releasing Hormone Substances 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 101100172226 Drosophila melanogaster emc gene Proteins 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 208000032928 Dyslipidaemia Diseases 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 241001302160 Escherichia coli str. K-12 substr. DH10B Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- KRHYYFGTRYWZRS-UHFFFAOYSA-N Fluorane Chemical compound F KRHYYFGTRYWZRS-UHFFFAOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102400001370 Galanin Human genes 0.000 description 1
- 101800002068 Galanin Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 101000930822 Giardia intestinalis Dipeptidyl-peptidase 4 Proteins 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- UKKNTTCNGZLJEX-WHFBIAKZSA-N Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UKKNTTCNGZLJEX-WHFBIAKZSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- TUTIHHSZKFBMHM-WHFBIAKZSA-N Glu-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O TUTIHHSZKFBMHM-WHFBIAKZSA-N 0.000 description 1
- 102000030595 Glucokinase Human genes 0.000 description 1
- 108010021582 Glucokinase Proteins 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 241001622557 Hesperia Species 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102000005548 Hexokinase Human genes 0.000 description 1
- 108700040460 Hexokinases Proteins 0.000 description 1
- LNCFUHAPNTYMJB-IUCAKERBSA-N His-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNCFUHAPNTYMJB-IUCAKERBSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001063991 Homo sapiens Leptin Proteins 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 1
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 1
- 206010022489 Insulin Resistance Diseases 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-YGVKFDHGSA-N L-methionine S-oxide Chemical compound CS(=O)CC[C@H](N)C(O)=O QEFRNWWLZKMPFJ-YGVKFDHGSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- 108010092277 Leptin Proteins 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 208000017170 Lipid metabolism disease Diseases 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 101150008132 NDE1 gene Proteins 0.000 description 1
- 101100205189 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-5 gene Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 206010033307 Overweight Diseases 0.000 description 1
- 102000016387 Pancreatic elastase Human genes 0.000 description 1
- 108010067372 Pancreatic elastase Proteins 0.000 description 1
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 1
- 102000001105 Phosphofructokinases Human genes 0.000 description 1
- 108010069341 Phosphofructokinases Proteins 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 240000004350 Prunus spinosa Species 0.000 description 1
- 235000010829 Prunus spinosa Nutrition 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 101100408135 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) phnA gene Proteins 0.000 description 1
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 241000700157 Rattus norvegicus Species 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- FFOKMZOAVHEWET-IMJSIDKUSA-N Ser-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(O)=O FFOKMZOAVHEWET-IMJSIDKUSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 102100035254 Sodium- and chloride-dependent GABA transporter 3 Human genes 0.000 description 1
- 101710104417 Sodium- and chloride-dependent GABA transporter 3 Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- HSRXSKHRSXRCFC-WDSKDSINSA-N Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(O)=O HSRXSKHRSXRCFC-WDSKDSINSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- VEYJKJORLPYVLO-RYUDHWBXSA-N Val-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VEYJKJORLPYVLO-RYUDHWBXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 229960000583 acetic acid Drugs 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical group 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000003674 animal food additive Substances 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 238000005349 anion exchange Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 239000003124 biologic agent Substances 0.000 description 1
- 229940067596 butylparaben Drugs 0.000 description 1
- 150000003857 carboxamides Chemical group 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- OMFXVFTZEKFJBZ-HJTSIMOOSA-N corticosterone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@H](CC4)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 OMFXVFTZEKFJBZ-HJTSIMOOSA-N 0.000 description 1
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 description 1
- 229960000258 corticotropin Drugs 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000003413 degradative effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000011026 diafiltration Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 108020001096 dihydrofolate reductase Proteins 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 229960001617 ethyl hydroxybenzoate Drugs 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 239000004403 ethyl p-hydroxybenzoate Substances 0.000 description 1
- 235000010228 ethyl p-hydroxybenzoate Nutrition 0.000 description 1
- NUVBSKCKDOMJSU-UHFFFAOYSA-N ethylparaben Chemical compound CCOC(=O)C1=CC=C(O)C=C1 NUVBSKCKDOMJSU-UHFFFAOYSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000012362 glacial acetic acid Substances 0.000 description 1
- 102000018146 globin Human genes 0.000 description 1
- 108060003196 globin Proteins 0.000 description 1
- 229960001031 glucose Drugs 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010033706 glycylserine Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 102000049953 human LEP Human genes 0.000 description 1
- 229910000040 hydrogen fluoride Inorganic materials 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 210000003016 hypothalamus Anatomy 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 206010021654 increased appetite Diseases 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000007794 irritation Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 210000003140 lateral ventricle Anatomy 0.000 description 1
- DVCSNHXRZUVYAM-BQBZGAKWSA-N leu-asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 239000004292 methyl p-hydroxybenzoate Substances 0.000 description 1
- 235000010270 methyl p-hydroxybenzoate Nutrition 0.000 description 1
- 229960002216 methylparaben Drugs 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- SLZIZIJTGAYEKK-CIJSCKBQSA-N molport-023-220-247 Chemical compound C([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CN)[C@@H](C)O)C1=CNC=N1 SLZIZIJTGAYEKK-CIJSCKBQSA-N 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- SFLSHLFXELFNJZ-UHFFFAOYSA-N norepinephrine Natural products NCC(O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-UHFFFAOYSA-N 0.000 description 1
- 229960002748 norepinephrine Drugs 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- URPYMXQQVHTUDU-OFGSCBOVSA-N nucleopeptide y Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 URPYMXQQVHTUDU-OFGSCBOVSA-N 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000008024 pharmaceutical diluent Substances 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 230000037081 physical activity Effects 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 239000004405 propyl p-hydroxybenzoate Substances 0.000 description 1
- 235000010232 propyl p-hydroxybenzoate Nutrition 0.000 description 1
- 229960003415 propylparaben Drugs 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 239000002516 radical scavenger Substances 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000000717 sertoli cell Anatomy 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 210000000211 third ventricle Anatomy 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 231100000027 toxicology Toxicity 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- 101150035767 trp gene Proteins 0.000 description 1
- 101150044170 trpE gene Proteins 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 239000007160 ty medium Substances 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 230000004584 weight gain Effects 0.000 description 1
- 208000016261 weight loss Diseases 0.000 description 1
- 230000004580 weight loss Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/5759—Products of obesity genes, e.g. leptin, obese (OB), tub, fat
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/04—Anorexiants; Antiobesity agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Child & Adolescent Psychology (AREA)
- Obesity (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Diabetes (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Endocrinology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Hematology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The present invention provides anti-obesity proteins, which when administered to a patient regulate fat tissue. Accordingly, such agents allow patients to overcome their obesity handicap and live normal lives with much reduced risk for type 2 diabetes, cardiovascular disease, and cancer.
Description
WO 98/28414 . PCT/US97I23549 ANTI-OBESITY PROTEINS
Field of the Invention The present invention is in the field of human medicine, particularly in the treatment of obesity and disorders associated with obesity. Most specifically the invention relates to anti-obesity proteins that when administered to a patient regulate fat tissue.
Hackgrouad of the Invention Obesity, and especially upper body obesity, is a common and very serious public health problem in the United States and throughout the world. According to recent statistics, more than 25% of the United States population and 27% of the Canadian population are overweight [Kuczmarski, R. J., Amer. J. of Clin. Nutr. 55:4955-502S
(1992) ; Reeder, B. A. , et. a1. , Can. Med. Assoc. J. , 146:2009-2019 (1992)]. Upper body obesity is the strongest risk factor known for people with type 2 diabetes, and is a strong risk factor for cardiovascular disease and cancer as well. Recent estimates for the medical cost of obesity are $150,000,000,000 world wide. The problem has become serious enough that the surgeon general has begun an initiative to combat the ever increasing adiposity rampant in American society.
Much of this obesity-induced pathology can be attributed to the strong association with dyslipidemia, hypertension, and insulin resistance. Many studies have demonstrated that reduction in obesity by diet and exercise reduces these risk factors dramatically. Unfortunately, these treatments are largely unsuccessful, with a failure rate reaching 95%. This failure may be due to the fact that the condition is strongly associated with genetically inherited factors that contribute to increased appetite, preference for highly caloric foods, reduced physical activity, and increased lipogenic metabolism. This indicates that people inheriting these genetic traits are SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 prone to becoming obese regardless of their efforts to combat the condition. Therefore, a pharmacological agent that can correct this adiposity handicap and allow the physician to successfully treat obese patients in spite of their genetic inheritance is needed.
Physiologists have postulated for years that, when a mammal overeats, the resulting excess fat signals to the brain that the body is obese, which in turn causes the body to eat less and burn more fuel [Hervey, G. R., Nature 222:629-631 (1969)]. This "feedback" model is supported by parabiotic experiments, which implicate a circulating hormone controlling adiposity.
The ob/ob mouse is a model of obesity and diabetes that is known to carry an autosomal recessive trait linked to a mutation in the sixth chromosome. Recently, Zhang, Y.
and co-workers published the positional cloning of the mouse gene linked with this condition [Zhang, Y., et al. Nature 372:425-432 (1994)]. This report disclosed a gene coding for a 167 amino acid protein with a 21 amino acid signal peptide that is exclusively expressed in adipose tissue.
The rat obese gene was subsequently cloned and expressed [Murakami, T. et al., Biochem. Biophys. Res. Comm. 209:944-952 (1995)]. The protein, which is apparently encoded by the ob gene, is now speculated to be the adiposity regulating hormone.
The protein encoded by the ob gene is pharmacologically active, but the physical properties of the protein are less than desirable. The human protein, for example, is prone to precipitation and aggregation in both a formulation and under physiological conditions.
Formulations of a protein containing a precipitate, or that permit precipitation at the site of administration, increase the risk of producing an immunological response in the patient and may result in irritation at the site of injection. Additives to the formulation, such as detergents or surfactants, are undesirable because of increased risk of SUBSTITUTE SHEET ( rule 26 ) WO 98128414 . PCT/US97/23549 toxicology. Accordingly, there remains a need to develop pharmacological agents that demonstrate pharmacological activity and provide improved physical and chemical stability.
Applicants have discovered that the aggregation observed in the native human protein is due, in part, to hydrophobic interactions at the surface of the protein, particularly at residues 1, 2, 3, 30, 36, 41) 42, 43, 45, 46, 47, 48, 49, 50, 74, 89, 92, 99, 100, 138, and 142.
Applicants have further discovered that, by substituting these positions or regions of the protein with charged or hydrophilic amino acids, the propensity of the ob protein to aggregate is dramatically reduced. This reduction in aggregation provides a much improved pharmacological agent.
Accordingly, the present invention provides biologically active obesity proteins. Such agents allow patients to overcome their obesity handicap and live normal lives with a more normalized risk for type 2 diabetes, cardiovascular disease, and cancer.
Summary of Invention The present invention is directed to a protein of the Formula ( I ) 5 10 i5 2 5 Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu 4 0 Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly SUBSTITUTE SHEET ( ruie 26 ) WO 98/28414 . PCT/US97/23549 Val Leu Glu Ala Ser Gly Tyr Ser.Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO :1 ) ( I ) wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn or Ser;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Gln, Glu or Asp;
Xaa at position 73 is Val or Met;
Xaa at position 100 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
Xaa at position 1 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Ile at position 3 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 30 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;
Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 42 is replaced with Glu, Asp, Arg, Lys, or His;
Pro at position 43 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/(TS97/23549 His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 49 is replaced with Glu, Asp, Arg, Lys, or His;
Thr at position 50 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 74 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Val at position 89 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Phe at position 92 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Pro at position 99 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser; or Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
The invention further provides a method of treating obesity or those condit ions associated with obesity, which comprises adminis tering to a mammal in need thereof a protein of the Formula (I).
The invention further provides a pharmaceutical formulation, which comprises a rotein of the Formula (I), p optionally having a leader seque nce, together with one or more pharmaceutically acceptable diluents, carriers, or excipients therefor.
The invention further provides proteins of the Formula (I) having additionally a leader sequence bonded to the N-terminus of the protein of Formula I. Such proteins are useful for their anti-obesity activity and as intermediates in the preparation of proteins of the Formula (I) .
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/ITS97/23549 The invention further provides DNA encoding a protein of Formula {I), or a protein of Formula (I) having a leader sequence.
An additional embodiment of the present invention is a process for producing a protein of Formula (I), which comprises:
(a) transforming a host cell with DNA that encodes the protein of Formula (I) or a protein of Formula (I) having a leader sequence;
(b) culturing the host cell under conditions permitting expression of the protein;
(c) recovering the expressed protein; and, optionally, (d) cleaving enzymatically the leader sequence.
The invention further provides a protein for use in treating obesity and conditions associated with obesity, as well as, for use in preparing a medicament used to treat obesity and conditions associated with obesity.
Detailed Description For purposes of the present invention, as disclosed and claimed herein, the following terms and abbreviations are defined as follows:
Base pair (bp) -- refers to DNA or RNA. The abbreviations A,C,G, and T correspond to the 5~-monophosphate forms of the nucleotides (deoxy)adenine, (deoxy)cytidine, (deoxy)guanine, and {deoxy)thymine, respectively, when they occur in DNA molecules. The abbreviations U,C,G, and T correspond to the 5~-monophosphate forms of the nucleosides uracil, cytidine, guanine, and thymine, respectively, when they occur in RNA
molecules. In double stranded DNA, base pair may refer to a partnership of A with T or C with G. In a DNA/RNA
heteroduplex, base pair may refer to a partnership of T or U
with A or C with G.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97I23549 - 7 _ FMOC - an abbreviation for 9-fluorenymethoxycarbony.
Immunoreactive Protein(s) -- a term used to collectively describe antibodies, fragments of antibodies capable of binding antigens of a similar nature as the parent antibody molecule from which they are derived, and single chain polypeptide binding molecules [Bird, E. R., et al., PCT Application No. PCT/US 87/02208, International Publication No. WO 88/01649, published March 10, 1988].
Plasmid -- an extrachromosomal self-replicating genetic element.
PAM - an abbreviation for 4-hydroxymethyl-phenylacetamidomethyl.
PMSF -- an abbreviation for phenylmethylsulfonyl fluoride.
Reading frame -- the nucleotide sequence from which translation occurs "read" in triplets by the translational apparatus of tRNA, ribosomes and associated factors, each triplet corresponding to a particular amino acid. Because each triplet is distinct and of the same length, the coding sequence must be a multiple of three. A
base pair insertion or deletion (termed a frameshift mutation) may result in two different proteins being coded for by the same DNA segment. To insure against this, the triplet codons corresponding to the desired polypeptide must be aligned in multiples of three from the initiation codon, i.e., the correct "reading frame" must be maintained.
Recombinant DNA Cloning Vector -- any autonomously replicating agent including, but not limited to, plasmids and phages, comprising a DNA molecule to which one or more additional DNA segments can or have been added.
Recombinant DNA Expression Vector -- any recombinant DNA cloning vector in which a promoter has been incorporated.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 _ g _ Replicon -- A DNA sequence that controls and allows for autonomous replication of a plasmid or other vector.
TFA - an abbreviation for trifluoroacetic acid.
Transcription -- the process whereby information contained in a nucleotide sequence o~f DNA is transferred to a complementary RNA sequence.
Translation -- the process whereby the genetic information of messenger RNA is used to specify and direct the synthesis of a polypeptide chain.
Tris -- an abbreviation for tris(hydroxymethyl)-aminomethane.
Treating -- describes the management and care of a patient for the purpose of combating the disease, condition, or disorder and includes the administration of a protein of present invention to prevent the onset of the symptoms or complications, alleviating the symptoms or complications, or eliminating the disease, condition) or disorder. Treating obesity, therefore, includes the inhibition of food intake, the inhibition of weight gain, and inducing weight loss in patients in need thereof.
Vector -- a replicon used for the transformation of cells in gene manipulation bearing polynucleotide sequences corresponding to appropriate protein molecules which, when combined with appropriate control sequences, confer specific properties on the host cell to be transformed. Plasmids, viruses, and bacteriophage are suitable vectors, since they are replicons in their own right. Artificial vectors are constructed by cutting and joining DNA molecules from different sources using restriction enzymes and ligases. Vectors include Recombinant DNA cloning vectors and Recombinant DNA
expression vectors.
X-gal -- an abbreviation for 5-bromo-4-chloro-3-indolyl beta-D-galactoside.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/ITS97/23549 g _ The amino acid abbreviations are accepted by the United States Patent and Trademark Office as set forth in 37 C.F.R. ~ 1.822 (b)(2) (1993). Unless otherwise indicated the amino acids are in the L configuration.
As noted above the present invention provides a protein of the Formula (I):
Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO : 1 ) ( I ) wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn or Ser;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Gln, Glu or Asp;
Xaa at position 73 is Val or Met;
Xaa at position 100 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Xaa at position l is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Ile at position 3 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 30 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;
Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 42 is replaced with Glu, Asp) Arg, Lys, or His;
Pro at position 43 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 49 is replaced with Glu, Asp, Arg, Lys, or His;
Thr at position 50 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 74 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Val at position 89 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Phe at position 92 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Pro at position 99 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser; or SUBSTITUTE SHEET ( rule 26 ) Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
Preferred proteins are those having one to three substitutions; most preferably one or two substitutions defined in Formula (I).
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Val at position 36 is replaced with Arg, Lys, or His;
Ile at position 74 is replaced with Glu or Asp, or Phe at position 92 is replaced with Arg, Lys, or His.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Phe at position 92 is replaced with Asp, Glu, Lys, Arg, or His, or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID N0:1, wherein:
Phe at position 92 is replaced with Asp or Glu, or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 73 is Val;
Xaa at position 100 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/tTS97/23549 Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His; or Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp, Glu, or Asp;
Xaa at position 138 is Trp, Glu or Asp;
said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO: I, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp, Glu, or Asp;
Xaa at position 138 is Trp, Glu or Asp;
SUBSTITUTE SHEET ( rule 26 ) WO 98128414 . PCT/US97/23549 said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys; or Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Still more preferred are proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp or Asp;
Xaa at position 138 is Trp;
said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Ile at position 48 is replaced with Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Asp, Glu, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Still more preferred are proteins of the present invention are those of SEQ ID NO: l, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at pos~i.tion 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp or Asp;
Xaa at position 138 is Trp;
said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys; or SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Ile at position 48 is replaced with Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Glu or Asp;
Xaa at position 100 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
Xaa at position 1 is replaced with Glu, Asp, Ser, Thr, Lys, His, Arg, or absent;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, Arg, or absent;
Val at position 89 is replaced with Glu, Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
Other preferred proteins are those of Formula I
wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Glu or Asp;
Xaa at position 100 is Glu, Asp, Lys, His, or Arg;
Xaa at position 138 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . . PCT/US97/23549 Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His;
Pro position 99 is replaced Glu, Asp, Arg, at with Lys, r is;
o H
or a cally acceptable salt pharmaceuti thereof.
Preferred s ithin ormula I nclude specie w F i species of SEQID N0: 2 SEQID N0:7:
through Val ProIleGln LysVal AspAspThr ThrLeu IleLysThr Gln Lys Ile ValThrArg IleAsn IleSerHis GlnSer ValSerSer Asp Thr Lys GlnLysVal ThrGly AspPheIle GlyLeu AspProIle Leu Pro Leu ThrLeuSer LysMet GlnThrLeu ValTyr GlnGlnIle Asp Ala Leu ThrSerMet ProSer AsnValIle IleSer AsnAspLeu Arg Gln Glu AsnLeuArg AspLeu HisValLeu PheSer LysSerCys Leu Ala His LeuProTrp AlaSer LeuGluThr AspSer LeuGlyGly Gly Leu 3 Val LeuGluAla SerGly SerThrGlu ValAla LeuSerArg 0 Tyr Val Leu GlnGlySer LeuGln MetLeuTrp LeuAsp LeuSerPro Asp Gln Gly Cys ( ID NO :
SEQ 2 ) Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser 4 5 Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Asp Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCTlUS97/Z3549 115 120 , 125 Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO : 3 ) Val Pro IleGln LysValGln AspAspThr LysThrLeuIle LysThr Ile Val ThrArg IleAsnAsp IleSerHis ThrGlnSerVal SerSer Lys Gln LysVal ThrGlyLeu AspPheIle ProGlyLeuAsp ProIle 2 Leu Thr LeuSer LysMetAsp GlnThrLeu AlaValTyrGln GlnIle Leu Thr SerMet ProSerArg AsnValIle GlnIleSerAsn AspLeu 85 90 g5 Glu Asn LeuArg AspLeuLeu HisValLeu AlaPheSerLys SerCys His Leu ProAsp AlaSerGly LeuGluThr LeuAspSerLeu GlyGly Val Leu GluAla SerGlyTyr SerThrGlu ValValAlaLeu SerArg 3 Leu Gln GlySer LeuGlnAsp MetLeuTrp GlnLeuAspLeu SerPro Gly Cys ( Q
SE ID
NO
:
) Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Asp 5 0 Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile SUBSTITUTE SHEET ( rule 26 ) WO 98128414 . PCT/US97123549 Leu Thr SerMetPro SerArgAsn ValIleGln IleSerAsn AspLeu Glu Asn LeuArgAsp LeuLeuHis ValLeuAla PheSerLys SerCys His Leu ProAspAla SerGlyLeu GluThrLeu AspSerLeu GlyGly Val Leu GluAlaSer GlyTyrSer ThrGluVal ValAlaLeu SerArg Leu Gln GlySerLeu GlnAspMet LeuTrpGln LeuAspLeu SerPro Gly Cys ( Q
SE ID
NO
:
) Val Pro IleGlnLys ValGlnAsp AspThrLys ThrLeuIle LysThr Ile Val ThrArgIle AsnAspIle SerHisThr GlnSerVal SerSer 2 5 Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro IIe Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Glri Gln Ile Leu ThrSerMet ProSerArg AsnValIle GlnIleSerAsn AspLeu Glu AsnLeuArg AspLeuLeu HisValLeu AlaAspSerLys SerCys His LeuProTrp AlaSerGly LeuGluThr LeuAspSerLeu GlyGly 4 Val LeuGluAla SerGlyTyr SerThrGlu ValValAlaLeu SerArg Leu GlnGlySer LeuGlnAsp MetLeuTrp GlnLeuAspLeu SerPro Gly Cys ( ID NO :
) Val ProIleGln LysValGln AspAspThr LysThrLeuIle LysThr Ile ValThrArg IleAsnAsp IleSerHis ThrGlnSerVal SerSer Lys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeuHis ProIle SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97123549 Leu Thr Leu Ser Lys Met Asp Gln~Thr Leu Ala Val Tyr Gln Gln Ile 65 70 75 g0 Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro 2 0 Gly cys ( SEQ ID NO : 7 ) As mentioned above, the invention also provides proteins of the Formula (I) having additionally a leader sequence bonded to the N-terminus of the protein of Formula I. Such proteins are useful for their anti-obesity activity and as intermediates in the preparation of proteins of the Formula (I). The species having SEQ ID N0:8-13 are particularly preferred proteins of Formula (I) having the leader sequence Met-Arg bonded to the N-terminus of the proteins of SEQ ID N0:2-7, respectively.
Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile 3 Lys ThrIle ValThrArg IleAsnAsp IleSerHis ThrGlnSer Val Ser SerLys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeu Asp Pro IleLeu ThrLeuSer LysMetAsp GlnThrLeu AlaValTyr Gln Gln IleLeu ThrSerMet ProSerArg AsnValIle GlnIleSer Asn Asp LeuGlu AsnLeuArg AspLeuLeu HisValLeu AlaPheSer Lys 5 Ser CysHis LeuProTrp AlaSerGly LeuGluThr LeuAspSer Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/I1S97/23549 Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO : 8 ) Met Arg ValProIleGln LysVal GlnAspAsp ThrLysThrLeu Ile Lys Thr IleValThrArg IleAsn AspIleSer HisThrGlnSer Val Ser Ser LysGlnLysVal ThrGly LeuAspPhe IleProGlyLeu His Pro Asp LeuThrLeuSer LysMet AspGlnThr LeuAlaValTyr Gln 2 Gln Ile LeuThrSerMet ProSer ArgAsnVal IleGlnIleSer Asn 85 90 . 95 Asp Leu GluAsnLeuArg AspLeu LeuHisVal LeuAlaPheSer Lys loo 105 110 Ser Cys HisLeuProTrp AlaSer GlyLeuGlu ThrLeuAspSer Leu Gly Gly ValLeuGluAIa SerGly TyrSerThr GluValValAla Leu Ser Arg LeuGlnGlySer LeuGln AspMetLeu TrpGlnLeuAsp Leu 3 Ser Pro GlyCys ( ID NO
5 SEQ :
) Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile 4 0 Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu Asp Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys SUBSTITUTE SHEET ( rule 26 ) Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro GIy Cys ( SEQ ID NO :10 ) Met ArgVal ProIleGln LysValGln AspAspThr LysThrLeuIle Lys ThrIle ValThrArg IleAsnAsp IleSerHis ThrGlnSerVal 2 Ser SerLys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeuHis Pro AspLeu ThrLeuSer LysMetAsp GlnThrLeu AlaValTyrGln Gln IleLeu ThrSerMet ProSerArg AsnValIle GlnIleSerAsn Asp LeuGlu AsnLeuArg AspLeuLeu HisValLeu AlaPheSerLys loo l05 llo Ser CysHis LeuProAsp AlaSerGly LeuGluThr LeuAspSerLeu 3 Gly GlyVal LeuGluAla SerGlyTyr SerThrGlu ValValAlaLeu Ser ArgLeu GlnGlySer LeuGlnAsp MetLeuTrp GlnLeuAspLeu Ser ProGly Cys ( ID NO
SEQ :11 ) Met Arg Val Pro Ile Gln Lys Val GIn Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val 5 0 Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/Z3549 Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu ser Pro Gly Cys ( SEQ ID NO :12 ) Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile 2 0 Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys 3 5 Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO :13 ) The present invention provides biologically active proteins that provide effective treatment for obesity.
Applicants have discovered that specific substitutions to the hydrophobic residues on the surface of the protein result in improved properties. These residues could not be predicted from the primary sequence. The proteins having these substitutions are pharmacologically active and have a reduced propensity to aggregate when compared to both the SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 mouse and human forms of the protein. The present invention permits obesity protein to be more readily formulated at higher concentrations, and increases their pharmaceutical elegance, because they are compatible with commonly-used preservatives.
The claimed proteins ordinarily are prepared by recombinant techniques. Techniques for making substitutional mutations at predetermined sites in DNA
having a known sequence are well known, for example M13 primer mutagenesis. The mutations that might be made in the DNA encoding the present anti-obesity proteins must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure [DeBoer, H.A., et al., European Patent Publication No. 75,444 A2, published March 30, 1983].
The proteins of the present invention may be produced either by recombinant DNA technology or well known chemical procedures, such as solution or solid-phase peptide synthesis, or semi-synthesis in solution beginning with protein fragments coupled through conventional solution methods.
A. Solid Phase The synthesis of the claimed proteins may proceed by solid phase peptide synthesis or by recombinant methods.
The principles of solid phase chemical synthesis of polypeptides are well known in the art and may be found in general texts in the area such as Dugas, H. and Penney, C., Bioorganic Chemistry Springer-Verlag, New York, (1981) 54-59. For example, peptides may be synthesized by solid-phase methodology utilizing an PE-Applied Biosystems 433A peptide synthesizer (Perkin Elmer-Applied Biosystems Division, Foster City, California) and synthesis cycles supplied by Applied Biosystems. Boc amino acids and other reagents are commercially available from PE-Applied Biosystems and other chemical supply houses. Sequential Boc chemistry using SUBSTITUTE SHEET { rule 26 ) WO 98/28414 . PCT/US97/23549 double couple protocols are applied to the starting p-methyl benzhydryl amine resins for the production of C-terminal carboxamides. For the production of C-terminal acids, the corresponding PAM resin is used. Arginine, Asparagine, Glutamine, Histidine, and Methionine are coupled using preformed hydroxy benzotriazole esters. The following side chain protection may be used:
Arg, Tosyl Asp, cyclohexyl or benzyl Cys, 4-methylbenzyl Glu, cyclohexyl His, benzyloxymethyl Lys, 2-chlorobenzyloxycarbonyl Met, sulfoxide I5 Ser, Benzyl Thr, Benzyl Trp, formyl Tyr, 4-bromo carbobenzoxy Boc deprotection may be accomplished with trifluoroacetic acid (TFA) in methylene chloride. Formyl removal from Trp is accomplished by treatment of the peptidyl resin with 20%
piperidine in dimethylformamide for 60 minutes at 4°C.
Met(O) can be reduced by treatment of the peptidyl resin with TFA/dimethylsulfide/conc. HC1 (95/5/1) at 25°C for 60 minutes. Following the above pre-treatments, the peptides may be further deprotected and cleaved from the resin with anhydrous hydrogen fluoride containing a mixture of 10% m-cresol or m-cresol/10% p-thiocresol or m-cresol/p-thiocresol/dimethyl-sulfide. Cleavage of the side chain protecting groups) and of the peptide from the resin is carried out at 0°C or below, preferably -20°C for thirty minutes followed by thirty minutes at 0°C. After removal of the HF, the peptide/resin is washed with ether. The peptide is extracted with glacial acetic acid and lyophilized.
Purification is accomplished by reversed-phase C18 SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 chromatography in 0.1% TFA with a gradient of increasing acetonitrile concentration, e.g., a 2.2 cm X 25 cm VydacT""
column (The Separations Group, Inc., Hesperia, CA).
One skilled in the art recognizes that the solid phase synthesis could also be accomplished using the FMOC
strategy and a TFA/scavenger cleavage mixture.
B. Recombinant Synthesis The claimed proteins may also be produced by recombinant methods. Recombinant methods are preferred if a high yield or if specific post-translational modifications are desired. The basic steps in the recombinant production of protein include:
a) construction of a synthetic or semi-synthetic (or isolation from natural sources) DNA
encoding the claimed protein, b) integrating the coding sequence into an expression vector in a manner suitable for the expression of the protein either alone or as a fusion protein, c) transforming an appropriate eukaryotic or prokaryotic host cell with the expression vector, and d) recovering and purifying the recombinantly produced protein.
a. Gene Construction Synthetic genes, the in vitro or in vivo transcription and translation of which will result in the production of the protein may be constructed by techniques well known in the art. Owing to the natural degeneracy of the genetic code, the skilled artisan will recognize that a sizable yet definite number of DNA sequences may be constructed which encode the claimed proteins. In the preferred practice of the invention, synthesis is achieved by recombinant DNA technology.
Methodology of synthetic gene construction is well known in the art (Brown, E. L., et a.~. Methods in SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Enzymology, Academic Press,. New York, NY, 68:109-151 (1979)]. The DNA sequence corresponding to the synthetic claimed protein gene may be generated using conventional DNA
synthesizing apparatus such as the Applied Biosystems Model 380A or 380B DNA synthesizers (Perkin Elmer, Applied Biosystems Division, Foster City, Cahifornia).
It may desirable in some applications to modify the coding sequence of the claimed protein so as to incorporate a convenient protease sensitive cleavage site, e.g., between the signal peptide and the structural protein facilitating the controlled excision of the signal peptide from the fusion protein construct.
The gene encoding the claimed protein may also be created by using polymerase chain reaction (PCR). The template can be a cDNA library (commercially available from CLONETECH or STRATAGENE) or mRNA isolated from human adipose tissue. Such methodologies are well known in the art. See, e.g., Maniatis, T., et a1. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989).
b. Direct Expression or Fusion Protein The claimed protein may be made either by direct expression or as fusion protein comprising the claimed protein followed by enzymatic or chemical cleavage. A
variety of peptidases (e.g., trypsin) which cleave a polypeptide at specific sites or digest the peptides from the amino or carboxy termini (e.g., diaminopeptidase) of the peptide chain are known. Furthermore, particular chemicals (e.g., cyanogen bromide) will cleave a polypeptide chain at specific sites. The skilled artisan will appreciate the modifications necessary to the amino acid sequence (and synthetic or semi-synthetic coding sequence if recombinant means are employed) to incorporate site-specific internal cleavage sites. See, e.g., Carter, P., Chapter 13 in Protein Purification: From Molecular Mechanisms to Large-SUBSTITUTE SHEET ( rule 2b ) Scale Processes, Ladisch, M., et a1. (Eds.) American Chemical Soc., Washington, D.C. (1990).
c. Vector Construction Construction of suitable vectors containing the desired coding and control sequences employ standard ligation techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the form desired to form the plasmids required.
To effect the translation of the desired protein, one inserts the engineered synthetic DNA sequence in any of a plethora of appropriate recombinant DNA expression vectors through the use of appropriate restriction endonucleases. A
synthetic coding sequence. is designed to possess restriction endonuclease cleavage sites at either end of the transcript to facilitate isolation from and integration into these expression and amplification and expression plasmids. The isolated cDNA coding sequence may be readily modified by the use of synthetic linkers to facilitate the incorporation of this sequence into the desired cloning vectors by techniques well known in the art. The particular endonucleases employed will be dictated by the restriction endonuclease cleavage pattern of the parent expression vector to be employed. Restriction sites are chosen so as to properly orient the coding sequence with control sequences to achieve proper in-frame reading and expression of the claimed protein .
In general, plasmid vectors containing promoters and control sequences which are derived from species compatible with the host cell are used with these hosts.
The vector ordinarily carries a replication site as well as marker sequences which are capable of providing phenotypic selection in transformed cells. For example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species [Bolivar, F., et al., Gene 2:95-113 (1977)]. Plasmid pBR322 contains genes for ampicillin and tetracycline resistance and thus provides easy means for SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/Z3549 identifying transformed cells. The pHR322 plasmid, or other microbial plasmid must also contain or be modified to contain promoters and other control elements commonly used in recombinant DNA technology.
The desired coding sequence is inserted into an expression vector in the proper orientation to be transcribed from a promoter and ribosome binding site, both of which should be functional in the host cell in which the protein is to be expressed. An example of such an expression vector is a plasmid described in Belagaje, R. M., et al., U.S. patent No. 5,304,473, issued, April 19, 1994, the teachings of which are herein incorporated by reference.
The gene encoding A-C-B proinsulin described in U.S. Patent No. 5,304,473 can be removed from the plasmid pRBl82 with restriction enzymes Ndel and BamHI. The genes encoding the protein of the present invention can be inserted into the plasmid backbone on a Ndel/BamHI restriction fragment cassette.
d. Prokaryotic Ex ression In general, prokaryotes are used for cloning of DNA sequences in constructing the vectors useful in the invention. For example, E. coli K12 strain 294 (ATCC No.
31446) is particularly useful. Other microbial strains which may be used include E. coli B and E. coli X1776 (ATCC No.
31537). These examples are illustrative rather than limiting.
Prokaryotes also are used for expression. The aforementioned strains, as well as E. coli W3110 (prototrophic, ATCC No. 27325), bacilli such as Bacillus subtilis, and other enterobacteriaceae such as Salmonella typhimurium or Serratia marcescans, and various pseudomonas species may be used. Promoters suitable for use with prokaryotic hosts include the [i-lactamase (vector pGX2907 [ATCC 39344] contains the replicon and [3-lactamase gene) and lactose promoter systems [Chang, A. C. Y., et al., Nature, 275:617-624 (1978); and Goeddel, D. V., et al., Nature SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 281:544-548 (1979)], alkaline phosphatase, the tryptophan (trp) promoter system (vector pATHl [ATCC 37695] is designed to facilitate expression of an open reading frame as a trpE
fusion protein under control of the trp promoter) and hybrid promoters such as the tac promoter (isolatable from plasmid pDR540 ATCC-37282). However, other functional bacterial promoters, whose nucleotide sequences are generally known, enable one skilled in the art to ligate them to DNA encoding the protein using linkers or adaptors to supply any required restriction sites. Promoters for use in bacterial systems also will contain a Shine-Dalgarno sequence operably linked to the DNA encoding protein.
e. Eukaryotic Ex ression The protein may be recombinantly produced in eukaryotic expression systems. Preferred promoters controlling transcription in mammalian host cells may be obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, or from heterologous mammalian promoters, e.g., p-actin promoter. The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral origin of replication [Fiers, W., et al., Nature, 273:113-120 (1978)]. The entire SV40 genome may be obtained from plasmid pBRSV, ATCC 45019. The immediate early promoter of the human cytomegalovirus may be obtained from plasmid pCMBQ
(ATCC 77177). Of course, promoters from the host cell or related species also are useful herein.
Transcription of a DNA encoding the claimed protein by higher eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about 10-300 bp, that act on a promoter to increase its transcription. Enhancers are relatively orientation and position independent having been found 5~ [Laimins, L. A., et al., Proc. Nat~1 Acad. Sci.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/I1S97/23549 (USA) 78:464-468 (1981)] and 3~ [Lusky, M., et al., Mol.
Cell. Bio. 3:1108-1122 (1983)] to the transcription unit, within an intron [Banerji, J., et al., Cell 33:729-740 (1983)] as well as within the coding sequence itself [Osborne, T. F., et al., Mol. Cell. Bio. 4:1293-1305 (1984)). Many enhancer sequences are now known from mammalian genes (globin, RSV, SV40, EMC, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 late enhancer, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription which may affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding protein. The 3~ untranslated regions also include transcription termination sites.
Expression vectors may contain a selection gene, also termed a selectable marker. Examples of suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR, which may be derived from the BgIII/HindIII
restriction fragment of pJOD-10 [ATCC 68815)), thymidine kinase (herpes simplex virus thymidine kinase is contained on the BamHI fragment of vP-5 clone (ATCC 2028), or neomycin (G418) resistance genes, which are obtainable from pNN414 yeast artificial chromosome vector (ATCC 37682). When such selectable markers are successfully transferred into a mammalian host cell, the transfected mammalian host cell can survive if placed under selective pressure. There are two widely used distinct categories of selective regimes. The first category is based on a cel l s metabolism and the use of a mutant cell line which lacks the ability to grow without a supplemented media. Two examples are: CHO DHFR-SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 cells (ATCC CRL-9096) and mouse LTK- cells [L-M(TK-) ATCC
CCL-2.3]. These cells lack the ability to grow without the addition of such nutrients as thymidine or hypoxanthine.
Because these cells lack certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are provided in a supplemented media. An alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements. Individual cells which were not transformed with the DHFR or TK gene will not be capable of survival in non-supplemented media.
The second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a protein conveying drug resistance and would survive the selection.
Examples of such dominant selection use the drugs neomycin, [Southern P J., et al., J. Molec. Appl. Genet. 1:327-341 (1982)], mycophenolic acid [Mulligan, R. C. et al., Science 209:1422-1427 (1980)], or hygromycin [Sugden, B. et al., Mol Cell. Biol. 5:410-413 (1985)]. The three examples given above employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug, such as, 6418, neomycin (geneticin), xgpt (mycophenolic acid), or hygromycin, respectively.
A preferred vector for eukaryotic expression is pRc/CMV. pRc/CMV is commercially available from Invitrogen Corporation, San Diego, CA. To confirm correct sequences in constructed plasmids, the ligation mixtures are used to transform E. coli K12 strain DHSa (ATCC 31446) and successful transformants are selected by antibiotic resistance where appropriate. Plasmids from the transformants are prepared, analyzed by restriction and/or SUBSTITUTE SHEET ( rule 26 ) WO 98J284I4 . PCT/LTS97/23549 sequenced by the method of. Messing, J., et al., Nucleic Acids Res. 9:309-321 (1981).
Host cells may be transformed with the expression vectors of this invention and cultured in conventional nutrient media modified as is appropriate for inducing promoters, selecting transformants or amplifying genes. The culture conditions, such as temperature, pH and the like, are those previously used with the host cell selected for expression, and will be apparent to the ordinarily skilled artisan. The techniques of transforming cells with the aforementioned vectors are well known in the art and may be found in such general references as Maniatis, T., et a1.
Molecular Cloning: A Laboratory Manual, 2"d Ed., Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989), or Current Protocols in Molecular Biology (1989) and supplements.
Preferred suitable host cells for expressing the vectors encoding the claimed proteins in higher eukaryotes include: African green monkey kidney cell line transformed by SV40 (COS-7, ATCC CRL-1651); transformed human primary embryonal kidney cell line 293 [Graham, F. L. et al., J. Gen Virol. 36:59-72 (1977); Harrison, T., et al., Virology 77:319-329 (1977); Graham, F. L. et al., Virology 86:10-21 (1978)]; baby hamster kidney cells [BHK-21(C-13), ATCC CCL-10; MacPherson, I., et a1, Virology 16:147-151 (1962)];
Chinese hamster ovary cells [CHO-DHFR- (ATCC CRL-9096)];
mouse Sertoli cells [TM4, ATCC CRL-1715; Mather, J. P., Biol. Reprod. 23:243-252 (1980)]; African green monkey kidney cells (VERO 76, ATCC CRL-1587); human cervical epitheloid carcinoma cells (HeLa, ATCC CCL-2); canine kidney cells (MDCK, ATCC CCL-34); buffalo rat liver cells (BRL 3A, ATCC CRL-1442); human diploid lung cells (WI-38, ATCC CCL-75); human hepatocellular carcinoma cells (Hep G2, ATCC HB-8065);and mouse mammary tumor cells (MMT 060562, ATCC
CCL51) .
f. Yeast Ex ression SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 In addition to prokaryotic and mammalian host cells, eukaryotic microorganisms such as yeast may also be used as host cells. Saccharomyces cerevisiae, common baker s yeast, is the most commonly-used eukaryotic microorganism for expressing heterologous proteins, although a number of other strains are commonly available. For expression in Saccharomyces, the plasmid YRp7, for example, is commonly used [ATCC-40053, Stinchcomb, D. T., et al., Nature 282:39-43 (1979); Kingsman, A. J., et al., Gene 7:141-152 (1979); Tschumper, G., et al., Gene 10:157-166 (1980)]. This plasmid already contains the trp gene which provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan [e. g., ATCC 44076 or PEP4-l; Jones, E. W., Genetics 85:23-33 (1977)].
Suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase, which is found on plasmid pAPI2BD ATCC 53231 [Patel, A. C., et al., U.S. Patent No. 4,935,350, issued June 19, 1990] or other glycolytic enzymes such as enolase, which is found on plasmid pACl (ATCC 39532), glyceraldehyde-3-phosphate dehydrogenase, which is derived from plasmid pHcGAPCl (ATCC
57090, 57091), zymomonas mobilis [Ingrain, L.O., et al., U.S.
Patent No. 5,000,000 issued March 19, 19911, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase.
Other yeast promoters, which are inducible promoters having the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, which is contained on plasmid vector pCL28XhoLHBPV [ATCC 39475; Reddy, V. B., et al., U.S.
Patent No. 4,840,896, issued June 20, 1989], glyceraldehyde 3-phosphate dehydrogenase, and enzymes responsible for SUBSTITUTE SHEET ( ruie 26 ) WO 98128414 . PCT/US97/23549 maltose and galactose utilization, such as, the GALL
promoter, which may be found on plasmid pRY121 (ATCC 37658).
Suitable vectors and promoters for use in yeast expression are further described in Hitzeman, R. A., et al., European Patent Publication No. 73,657A1, published March 9, 1983.
Yeast enhancers such as the UAS Gal from Saccharomyces cerevisiae, which is found in conjunction with the CYC1 promoter on plasmid YEpsec--hIlbeta (ATCC 67024), also are advantageously used with yeast promoters.
The following examples are presented to further illustrate the preparation of the claimed proteins. The scope of the present invention is not to be construed as merely consisting of the following examples.
8xample 1 Vector Construction A gene of SEQ ID N0:14 is assembled from a 220 base pair and a 240 base pair segment which are derived from chemically synthesized oligonucleotides:
451 TAGGATCC (SEQ ID N0:14) The base pair segment extends from the Ndel site to the Xbal site at position 220 within the coding region and is assembled from overlapping oligonucleotides which range in length from between and bases.
The base pair segment which extends from the Xbal to the BamHI
site is also assembled from 7 overlapping oligonucleotides which range in length from between 57 and 92 bases.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/U597/23549 To assemble these fragments, the respective 7 oligonucleotides are mixed in equimolar amounts, usually at concentrations of about 1-2 picomoles per microliter. Prior to assembly, all but the oligonucleotides at the 5'-ends of the segment are phosphorylated in standard kinase buffer with T4 DNA kinase using the conditions specified by the supplier of the reagents. The mixtures are heated to 95°C
and allowed to cool slowly to room temperature over a period of 1-2 hours to ensure proper annealing of the oligonucleotides. The oligonucleotides are then ligated to each other and into an appropriated cloning vector such as pUCl8 or pUC 19 using T4 DNA ligase. The buffers and conditions are those recommended by the supplier of the enzyme. The vector for the 220 base pair fragment is digested with Ndel and Xbal, whereas the vector for the 240 base pair fragment is digested with Xbal and BamHI prior to use. The ligation mixes are used to transform E. coli DH10B
cells (commercially available from Life Technologies, producer of GIBCO/BRL products, Grand Island, NY) and the transformed cells are plated on tryptone-yeast plates (TY, Difco, Detroit, MI) containing 100 ~g/ml of ampicillin, X-gal and IPTG. Colonies which grow up overnight are grown in liquid TY medium with 100 ~.g/ml of ampicillin and are used for plasmid isolation and DNA sequence analysis. Plasmids with the correct sequence are kept for the assembly of the complete gene. This is accomplished by gel-purification of the 220 base-pair and the 240 base-pair fragments and ligation of these two fragments into an expression vector such as pRBl82 from which the coding sequence for A-C-B
proinsulin is deleted and is digested with Ndel and BamHI
prior to use.
Alternatively, plasmid pET30 (Novagen, Madison, wI) can be digested with Ndel and BamHI, and the desired DNA
sequence encoding the proteins of the present invention can be inserted by procedures recognized in the art and described herein. The source of the DNA is synthetic SUBSTITUTE SHEET ( rule 26 ) WO 98/Z8414 . PCT/I1S97/23549 oligonucleotides that are assembled by art recognized methodology and described herein.
8xample 2 Expression plasmid for the Proteia of S8Q ID N0:3 (pH8767) Plasmid vector pOJ722 (40 fig, NRRL No. B-21,654) was prepared in a manner analogous to Example 1 and was digested with 20 units of EcoRV (Boehringer Mannheim, Indianapolis, IN) and 20 units of Bglll (Boehringer Mannheim) in "Boehringer Mannheim buffer B" at 37oC for two hours. The digest was purified on a 1% agarose gel, and the 2674 by and 1422 by fragments were isolated using the freeze-squeeze method.
The oligonucleotides 14255 [5'-ATC TCA CAC ACA CAG TCA
GTC TCG AGT AAA CAG AAA GTC ACA GGC TTG GAC TTC ATA CCT GGG
CTG CAC CCC GAC CTG ACA TTG TCT AAA ATG GAC CAG ACA CTG GCA
GTC TAT CAA CA-3' (SEQ ID N0:15)] and 14256 [5'-GAT CTG TTG
ATA GAC TGC CAG TGT CTG GTC CAT TTT AGA CAA TGT CAG GTC GGG
GTG CAG CCC AGG TAT GAA GTC CAA GCC TGT GAC TTT CTG TTT ACT
CGA GAC TGA CTG TGT GTG TGA GAT-3' (SEQ ID N0:16)] were kinased in the presence of 1x kinase buffer, 50 mM Tris-HCl pH 8.0, 10 mM MgCl2, 0.5 mM ATP, 1 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO BRL) at 37°C for 30 minutes.
The EcoRV/Bglll 2674 by and 1422 by fragments, prepared as described above, and the 14255 and 14256 linkers were ligated in the presence of 1x kinase buffer, 0.5 mM ATP
and 1 unit T4 DNA ligase (GIBCO BRL) at 16°C overnight. The ligation products are transformed into E. coli BL21(DE3) (Novagen) and colonies growing on TY plates supplemented with 10 ~,g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA sequencing on a PE-Applied Biosystems 370 DNA sequencer. Plasmids containing the expected -.400 by Nde1 to BamHI fragment are kept, and are designated pHS767.
Example 3 SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Expression plasmid for the Protein of SEQ ID N0:5 (pH8786) Plasmid vector pHS767 (40 ~tg) was digested with 20 units of Pmll (New England Biolab, Beverly, MA) in "New England Biolab buffer 1" at 37°C for two hours. Buffer salts were added to adjust to the mixture to the conditions of "New England Biolab buffer 3." BstXI was added(20 units, New England Biolab), and the plasmid was digested for 2 hours at 55°C. After that, the digest was treated with 20 units of alkaline phosphatase (Boehringer Mannheim) at 37°C
for 30 minutes. The digest was purified on a 1% agarose gel, and the 4170 by fragment was isolated using the freeze-squeeze method.
The oligonucleotides 13824 [5'-TGA GGC TTC CAG GAC
TCC CCC CAG ACT GTC CAA TGT CTC CAG GCC ACT GGC GTC TGG CAA
GTG GCA ACT TTT AGA GAA GGC CAG CAC-3'(SEQ ID N0:17)] and 13825 [5'-GTG CTG GCC TTC TCT AAA AGT TGC CAC TTG CCA GAC
GCC AGT GGC CTG GAG ACA TTG GAC AGT CTG GGG GGA GTC CTG GAA
GCC-3'(SEQ ID N0:18)] were kinased in the presence of lx kinase buffer, 50 mM Tris-HC1 pH 8.0, 10 mM MgCl2, 0.5 mM
ATP, 1 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO
BRL) at 37°C for 30 minutes.
The Pmll/BstXI linearized pHS767 vector, and the 13824 and 13825 linkers, were ligated in the presence of 1x kinase buffer, 0.5 mM ATP and 1 unit T4 DNA ligase (GIBCO
BRL) at 16°C overnight. The ligation products are transformed into E, coli BL21(DE3) (Novagen) and colonies growing on TY plates supplemented with 10 ~g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA sequencing on a PE-Applied Biosystems 370 DNA sequencer. Plasmids containing the expected 400 by Ndel to BamHI fragment are kept, and are designated pHS786.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/L1S97/23549 Example 4 Expression plasmid for the Protein of SEQ ID N0:2 (pHS775) Plasmid vector pHS767 (40 ug) was digested with 20 units of Xhol (Boehringer Mannheim) and 20 units of Ncol (Boehringer Mannheim) in "Boehringer Mannheim buffer H" at 37°C for two hours. The digest was treated with 20 units of alkaline phosphatase (Boehringer Mannheim) at 37°C for 30 minutes. The digest was purified on a 1% agarose gel, and the 4215 by fragment was isolated using the freeze-squeeze method.
The oligonucleotides 14405 [5'-TCG AGT AAA CAG AAG
GTC ACC GGC TTG GAC TTC ATA CCT GGG CTG GAC CCC ATA CTG ACA
TTG TCT AAA ATG GAC CAG ACA CTG GCA GTC TAT CAA CA-3' (SEQ
ID N0:19)] and 14406 [5'-GAT CTG TTG ATA GAC TGC CAG.TGT CTG
GTC CAT TTT AGA CAA TGT CAG TAT GGG GTC CAG CCC AGG TAT GAA
GTC CAA GCC GGT GAC CTT CTG TTT AC-3' (SEQ ID N0:20)] were kinased in the presence of lx kinase buffer, 50 mM Tris-HC1 pH 5.0, 10 mM MgCl2, 0.5 mM ATP, 1 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO BRL) at 37°C for 30 minutes.
The Xhol/Ncol linearized pHS767 vector and 14405 and 13406 linkers were ligated in the presence of lx Kinase buffer, 0.5 mM ATP and 1 unit T4 DNA ligase (GIBCO BRL) at 16°C overnight. The ligation products are transformed into E. coli BL21(DE3) (Novagen) and colonies growing on TY
plates supplemented with 10 ~g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA
sequencing on a PE-Applied Biosystems 370 DNA sequencer.
Plasmids containing the expected 400 by Ndel to BamHI
fragment are kept, and are designated pHS775.
Example 5 Expression plasmid for the Protein of SEQ ID N0:4 (pHS785) Plasmid vector pHS775 (40 ug) was digested with 20 units of Pmll (New England Biolab) in "New England Biolab buffer 1" at 37°C for two hours. The buffer salts were adjusted to "New England Biolab buffer 3," 20 units of BstXI
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 (New England Biolab) were added, and the plasmid was digested for 2 hours at 55°C. After that, the digest was treated with 20 units of alkaline phosphatase (Boehringer Mannheim) at 37°C for 30 minutes. The digest was purified on a 1% agarose gel, and the 4170 by fragment was isolated using the freeze-squeeze method.
The oligonucleotides 13824 [5'-TGA GGC TTC CAG GAC
TCC CCC CAG ACT GTC CAA TGT CTC CAG GCC ACT GGC GTC TGG CAA
GTG GCA ACT TTT AGA GAA GGC CAG CAC-3'(SEQ ID N0:21)] and 13825 [5'-GTG CTG GCC TTC TCT AAA AGT TGC CAC TTG CCA GAC
GCC AGT GGC CTG GAG ACA TTG GAC AGT CTG GGG GGA GTC CTG GAA
GCC-3'(SEQ ID N0:22)] were kinased in the presence of lx kinase buffer, 50 mM Tris-HC1 pH 8.0, 10 mM MgCl2, 0.5 mM
ATP, 10 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO
BRL ) .
The Pmll/BstXI linearized pHS775 vector, and the 13824 and 13825 linkers, were ligated in the presence of lx kinase buffer, 0.5 mM ATP and 1 unit T4 DNA ligase (GIBCO
BRL) at 16°C overnight. The ligation products are transformed into E. coli BL21(DE3) (Novagen) and colonies growing on TY plates supplemented with 10 ~.g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA sequencing on a PE-Applied Biosystems 370 DNA sequencer. Plasmids containing the expected 400 by Ndel to BamHI fragment are kept, and are designated pHS785.
Example 6 Transformation of E. coli with expression plasmids The expression plasmids were transformed into E.
coli cells. Competent E. coli cells (100 ~,L) were transformed with 10 ng of the expression plasmids described earlier. The transformation mixtures were plated in TY-agar plates containing tetracycline (10 ~g/mL)) Transformants were picked and used to grow 10 L cultures.
Cells harboring the expression plasmid for the desired protein were grown overnight at 37°C in 1 L of 2xTY
SUBSTITUTE SHEET { rule 26 ) medium containing tetracycline (10 ~g/mL). After overnight growth, 100 mL of a culture were inoculated into 10 L of medium. When the culture optical density reached 0.6 O.D., expression of the particular protein was induced by adding TPTG to a concentration of 10 ~M. After 3 hours, the E.
coli cells were harvested, and granules were isolated for protein purification.
The techniques of transforming cells with the aforementioned vectors are well-known in the art [Maniatis, T., et al. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1988); Id., Current Protocols in Molecular Biology (1989), and supplements thereof]. The techniques involved in the transformation of E. coli cells used in the preferred practice of the invention as exemplified herein are well known in the art. The precise conditions under which the transformed E. coli cells are cultured is dependent on the nature of the E. coli host cell line and the expression or cloning vectors employed. For example, vectors which incorporate thermoinducible promoter-operator regions) such as the c1857 thermoinducible lambda-phage promoter-operator region, require a temperature shift from about 30°C to about 40°C. in the culture conditions so as to induce protein synthesis.
In the preferred embodiment of the invention, E.
coli K12 RV308 cells are employed as host cells, but numerous other cell lines are available, such as, but not limited to, E. coli K12 L201, L687, L693, L507, L640) L641, L695, L814 {E. coli B). The transformed host cells are then plated on appropriate media under the selective pressure of the antibiotic corresponding to the resistance gene present on the expression plasmid. The cultures are then incubated for a time and temperature appropriate to the host cell line employed.
Proteins that are expressed in high-level bacterial expression systems characteristically aggregate in SUBSTITUTE SHEET ( rule 26 ) granules or inclusion bodies which contain high levels of the overexpressed protein [Kreuger, J. K., et al., in Protein Folding, Gierasch, L. M. and King, J., eds., American Association for the Advancement of Science Publication No. 89-185, Washington, D.C., 136-142 (1990)].
Such protein aggregates must be dissolved to provide further purification and isolation of the desired protein product.
[Kreuger, J. K., et al., supra.]. A variety of techniques using strongly denaturing solutions such as guanidinium-HC1 and/or weakly denaturing solutions such as urea are used to solubilize the proteins. Removal of the denaturing agents allows the denatured protein to assume its native conformation. The particular conditions for denaturation and folding are determined by the particular~protein expression system and/or the protein in question.
Preferably, the present proteins are expressed with a leader sequence. One of ordinary skill in the art would recognize that numerous leader sequences are operable.
However, the leader sequence is preferably Met-R1-, wherein R1 is any amino acid except Pro or is absent, so that the expressed proteins may be readily converted to the claimed protein with Cathepsin C, or other suitable aminopeptidases.
Preferably, R1 is Arg, Asp, or Tyr; and most preferably, the proteins are expressed with a Met-Arg leader sequence.
Interestingly, the leader sequence does not significantly affect stability or activity of the protein. Nevertheless, the leader sequence is preferably cleaved from the protein.
Thus, the proteins of the Formula: Met-R1-SEQ ID NO:1 are useful as biological agents and, preferably, as intermediates.
SUBSTITUTE SHEET ( rule 26 ) Example 7 The Protein of the Formula: Met-Arg-SEQ ID N0:4 The protein of SEQ ID N0:4 with a Met-Arg leader sequence was expressed in E. coli as previously described.
Granules were solubilized in 8 M urea containing 5 mM
cysteine. The protein was purified by anion exchange in 8 M
urea containing 5 mM cysteine and folded by dilution into 8 M urea (containing 5 mM cysteine) and exhaustive dialysis against phosphate-buffered saline (PBS). Little to no aggregation of the protein was seen. Following final purification of the protein by size exclusion chromatography, the protein was concentrated to 3-3.5 mg/mL
in PBS.
Example 8 The Protein of the Formula SEQ ID N0:3 The protein of SEQ ID N0:3 with a Met-Arg leader sequence was expressed in E. coli, isolated and folded by techniques analogous to the previous Examples. The pH of the protein solution was reduced to pH 2.8. The Met-Arg leader sequence was cleaved by the addition of 6-10 milliunits dDAP per mg of protein (dDAP is the abbreviation for a dipeptidylaminopeptidase isolated from the slime mold, Dicteostelium descoidium, described by Atkinson, P. R., et al., U.S. Patent No. 5,565,330, issued October 15, 1996).
The conversion reaction was allowed to proceed for 2-8 hours at room temperature. The progress of the reaction was monitored by high performance reversed phase chromatography.
The reaction was terminated by adjusting the pH to 8 with NaOH. The des(Met-Arg) protein was further purified by cation exchange chromatography in the presence of 7-8 M urea and size exclusion chromatography in PBS. Following final purification of the proteins by size exclusion chromatography the proteins were concentrated to 3-3.5 mg/mL
in PBS. Virtually no aggregation of either protein was noted.
SUBSTITUTE SHEET ( rule 26 ) The purification.of the claimed proteins is by techniques known in the art and includes reversed phase chromatography, affinity chromatography, ion exchange and size exclusion chromatography.
The claimed proteins contain two cysteine residues. Thus, a disulfide bond may be formed to stabilize the protein. The present invention includes proteins of the Formula (I) wherein the Cys at position 96 is cross-linked to Cys at position 146 as well as those proteins without such disulfide bonds. Preferably, the Cys at position 96 is disulfide bonded to the Cys at position 146. In addition the proteins of the present invention may exist, particularly when formulated, as dimers, trimers, tetramers, and other multimers. Such multimers are included within the scope of the present invention.
The present invention provides a method for treating obesity. The method comprises administering to the organism an effective amount of anti-obesity protein in a dose between about 1 and 1000 ~.g/kg. A preferred dose is from about 10 to 100 ~g/kg of active protein. A typical daily dose for an adult human is from about 0.5 to 100 mg.
In practicing this method, proteins of the Formula (I) can be administered in a single daily dose or in multiple doses per day. The treatment regime may require administration over extended periods of time. The amount per administered dose or the total amount administered will be determined by the physician and depend on such factors as the nature and severity of the disease, the age and general health of the patient and the tolerance of the patient for the protein.
The instant invention further provides pharmaceutical formulations comprising proteins of the present invention. The proteins, preferably in the form of a pharmaceutically acceptable salt, can be formulated for parenteral administration for the therapeutic or prophylactic treatment of obesity. For example, proteins of the Formula (I) can be admixed with conventional SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/LTS97/23549 pharmaceutical carriers and excipients. The compositions comprising claimed proteins contain from about 0.1 to 95% by weight of the active protein, preferably in a soluble form, and more generally from about 10 to 30%. Furthermore, the present proteins may be administered alone or in combination with other anti-obesity agents or agents useful in treating diabetes. For intravenous (i.v.) use, the protein is administered in commonly-used intravenous fluids) and administered by infusion. Such fluids, for example, physiological saline, Ringer s solution or 5% dextrose solution can be used. For intramuscular preparations, a sterile formulation, preferably a suitable soluble salt form of a protein of the Formula (I), for example the hydrochloride salt, can be dissolved and administered in a pharmaceutical diluent such as pyrogen-free, distilled water, physiological saline, or 5% glucose solution. A
suitable insoluble form of the protein may be prepared and administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base, e.g., an ester of a long chain fatty acid such as ethyl oleate.
Pharmaceutically acceptable preservatives such as an alkylparaben, particularly methylparaben, ethylparaben, propylparaben, or butylparaben or chlorobutanol are preferably added to the formulation to allow multi-dose use.
Significantly, the claimed proteins having Asp or Glu at position 100 are also stable in the presence of a phenolic preservative, such as, m-cresol or phenol. Proteins with one of these replacements are highly preferred for that reason. The stability of the proteins in the presence of a phenolic preservative offers advantages in pharmaceutical delivery, including, enhanced preservative effectiveness.
The formulation is preferably prepared in the absence of salt to minimize the ionic strength of the formulation.
The, ability of the present proteins to treat obesity is demonstrated in vivo, as follows.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/US97/23549 Biological Testing Parabiotic experiments suggest that a protein is released by peripheral adipose tissue and that the protein is able to control body weight gain in normal, as well as obese mice. Therefore, the most closely related biological test is to inject the test protein by any of several routes of administration (e.g., i.v., s.c., i.p., or by minipump or cannula) and then to monitor food and water consumption, body weight gain, and also plasma chemicals or hormones, such as glucose, insulin, ACTH, corticosterone, GH, or T4 over various time periods. Suitable test animals include normal mice (ICR, etc.) and obese mice (ob/ob, Avy/a, KK-Ay, tubby, fat). The ob/ob mouse model of obesity and diabetes is generally accepted in the art as being indicative.of the obesity condition. Control for non-specific effects for these injections is accomplished by administering the vehicle without the protein in the same animal model, and monitoring the same parameters. Alternatively, the protein itself is administered to animals that are thought to lack the protein receptor, such as, db/db mice or fa/fa or cp/cp rats. Proteins demonstrating activity in these models will demonstrate similar activity in other mammals, particularly humans.
Since the target tissue is expected to be the hypothalamus where food intake and lipogenic state are regulated, an alternate study involves injecting a protein directly into the brain of test animals, for example, by i.c.v. injection via the lateral or third ventricles, or by injection directly into specific hypothalamic nuclei, such as, the arcuate, paraventricular, or perifornical nuclei.
The same parameters as above could be measured, or the release of neurotransmitters that are known to regulate feeding or metabolism could be monitored (e. g., NPY, galanin, norepinephrine, dopamine, ~3-endorphin release).
Similar studies are accomplished in vitro using isolated hypothalamic tissue in a perifusion or tissue bath SUBSTITUTE SHEET ( rule 26 ) system. In this situation, the release of neurotransmitters or electrophysiological changes is monitored.
The proteins used to carry out the experiments that are reported in Examples 9 and 10 were prepared in accordance with the teachings and examples provided herein.
Amino acid sequences of the proteins were confirmed by mass spectroscopy and/or amino acid analysis. The proteins were folded with the Cys residues cross-linked by a disulfide bond when tested.
Example 9 Testing of Proteins of SEQ ID N0:2 and 3 in Male ob/ob Mice Male ob/ob mice [Harlan, Ltd., Blackthorn, England] were housed in groups of 5 animals each and provided with Purina 5008 chow and water ad Iibitum. The mice were maintained on a reverse lighting schedule (lights off at 9:00 A.M., and on at 9:00 P.M.). The mice were weighed daily at 8:30 A.M. Their food and water consumption were determined at the same time. Treatment, as indicated below, were made following the morning weighing, just prior to lights out. The mice were treated once daily for 4 days.
Group Treatment (n=5) 1 PBS 0.2 mL/day, S.C.
2 Protein 30 ~,g/0.2 mL/day, S.C.
3 Protein 300 ~,g/0.2 mL/day, S.C.
The effects of these treatments on food consumption and cumulative body weight change are illustrated for representative proteins of the present invention in Tables 1 and 2.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/LTS97/23549 Table 1. Effect of SEQ ID.N0:2 on food consumption and cumulative body weight change in ob/ob mice Food Consumption Body Weight (g/day) Change (g) Day Group Group Group Group Group Group 1 4.4 4.6 3.4 +0.5 -0.4 -0.4 2 5.3 4.7 2.8 ~+0.4 -0.8 -1.1 3 4.8 4.2 2.2 +0.6 -0.6 -1.5 4 5.1 3.6 1.7 +0.9 -1.2 -2.9 Table 2. Effect of SEQ ID N0:3 on food consumption and cumulative body weight change in ob/ob mice Food Consumption Body Weight (g/day) Change (g) Day Group Group Group Group Group Group 1 5.0 4.5 3.1 -0.3 -0.2 -1.0 2 5.4 3.5 2.2 0.0 -0.8 -2.0 3 5.2 3.3 1.8 +0.1 -1.0 -2.8 Group 1 = Control (PBS); Group 2 - 30 ~g/day of protein;
Group 3 - 300 ~g/day of protein Example 10 Analysis of Aggregation by Dynamic Light Scatteriag (DLS) The physical properties of the present proteins are demonstrated as follows. Based on availability of material, the solution analyzed by DLS was prepared in one of two ways. Material from the last size exclusion chromatography purification step, at approximately 1.5 mg/mL
in PBS, was either concentrated to about 3 or 5 mg/mL by diafiltration, or was dialyzed against water, lyophilized, and then reconstituted to about 3 or 5 mg/mL.
In the first method the protein solution at approximately 1.5 mg/mL was concentrated to greater than about 3 or 5 mg/mL in a small volume stir cell (10 mL) using an Amicon YM10, 25-mm membrane. This was conducted under cold room conditions (about 5°C). Protein concentration was determined by W absorption, and the solution was diluted to SUBSTITUTE SHEET { rule 26 ) about 3.0 mg/mL or 5 mg/mL.with PBS (10-fold water dilution of lOx PBS without Ca/Mg, GIBCO BRL).
A second alternative method, used when lyophilization was feasible, consisted of dialyzing the solution of protein against water, using three to five exchanges of water at about 4°C. A typical dialysis membrane was the Spectra/Por-7 dialysis membrane, 2000 molecular weight cutoff membrane (Spectrum Medical Industries, Los Angeles, CA). The material was concentrated as above to 3 to 4 mg/mL and typically a 2 mg quantity was lyophilized in a 5 mL vial. For DLS analysis this plug was reconstituted with water to greater than 3.3 mg/mL or greater than 5 mg/mL. Several vials were typically pooled.
Protein concentration was determined by W at peak maximum (typically 280-nm). The protein concentration was diluted to about 3 or 5 mg/mL with a combination of water and 10x PBS (without Ca/Mg, GIBCO BRL) to yield a final PBS
concentration of lx.
The protein solution at 3.0 mg/mL or at 5 mg/mL in lx PBS was adjusted to 7.4 with HC1/NaOH and passed through a 0.1 ~m Anotopl"'-10 filter (Whatman International, Ltd.) Maidstone, England) into a DLS cell. The average cumulant particle size was measured on a Brookhaven BI-9000 DLS
instrument (Brookhaven Instruments, Holtsville, NY) with a Lexel argon-ion laser every 15 minutes using a 30 second duration at a 90° angle. A 488-nm filter with a 400 ~m pinhole was assumed. At the 37°C incubation temperature a viscosity of 0.6915 centipoise (cP) and refractive index of 1.333 was used. The light-weighted average particle size was calculated by a cumulant method using the measured auto-correlation baseline.
The estimated time required for various anti-obesity proteins to reach an averaged light-weighted particle size of 50 nm in a PBS solution at pH 7.4 and 37°C is shown in Table 3. The averaged light-weighted particle size was determined from a cumulant analysis of SUBSTITUTE SHEET ( rule 26 ) a binodal distribution composed of monomeric and higher-order aggregate populations. The time necessary to achieve an average aggregate size of 50 nm was estimated by plotting size as a function of time. The proteins were folded with the Cys residues cross-linked by a disulfide bond when tested.
Table 3. Estimated time (minutes) required for various anti-obesity proteins to reach an averaged light-weighted particle size of 50 nm in a PBS solution at pH 7.4 and 37°C.
Estimated Time to Reach nm (min) Protein 3 mg/mL 5 mg/mL
Human Ob Protein 2 u.d.
SEQ ID N0:2 871 n.d.
SEQ ID N0:3 543 n.d.
SEQ ID N0:5 835 342 SEQ ID N0:6 2,399 75g The data in Table 3 demonstrate the dramatic reduction in the propensity of the claimed proteins to aggregate.
The claimed proteins are active in at least one of the above biological tests and are anti-obesity agents. As such, they are useful in treating obesity and those disorders implicated by obesity. However, the proteins are not only useful as therapeutic agents. One skilled in the art recognizes that the proteins are useful in the production of antibodies for diagnostic use and, as proteins, are useful as feed additives for animals.
Furthermore, the proteins are useful for controlling weight for cosmetic purposes in mammals. A cosmetic purpose seeks to control the weight of a mammal to improve bodily appearance. The mammal is not necessarily obese. Such cosmetic use forms part of the present invention.
The principles, preferred embodiments and modes of operation of the present invention have been described in SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ - PCT/US97/23549 the foregoing specification. The invention which is intended to be protected is not limited to the particular forms disclosed, since they are to be regarded as illustrative rather than restrictive. Variations and changes may be made by those skilled in the art without departing from the spirit of the invention.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/US97I23549 SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Beals, John M
Hoffmann, James A
Kline, Allen D
Schevitz, Richard W
Zhang, Forming (ii) TITLE OF INVENTION: Anti-Obesity Proteins (iii) NUMBER OF SEQUENCES: 22 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Eli Lilly and Company (B) STREET: Lilly Corporate Center (C) CITY: Indianapolis 2 0 (D) STATE: Indiana (E) COUNTRY: U.S.A.
(F) ZIP: 46285 (v) COMPUTER READABLE FORM:
2 (A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 3 (vi) CURRENT APPLICATION DATA:
O
(A) APPLICATION NUMBER: WO N/A (w/ Filing) (B) FILING DATE:
(C) CLASSIFICATION:
3 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Caltrider, Steven P
(B) REGISTRATION NUMBER: 36,467 (C) REFERENCE/DOCKET NUMBER: X-11092 4 (ix) TELECOMMUNICATION INFORMATION:
O
(A) TELEPHONE: (317) 276-0757 (B) TELEFAX: (317) 277-1917 4 5 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid 5 0 (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaa at position 1 is Val or 6 0 absent ; ~~
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 2 (D) OTHER INFORMATION: /note= "Xaa at position 2 is Pro or SUBSTITUTE SHEET ( rule 26 ) absent;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 22 (D) OTHER INFORMATION: /note= "Xaaat position22 is Asn or Ser;"
(ix) FEATURE:
1 {A) NAME/KEY: Modified-site (B) LOCATION: 28 (D) OTHER INFORMATION: /note= "Xaaat position28 is Gln or absent;"
( ix) FEATURE
(A) NAME/KEY: Modified-site (B) LOCATION: 72 (D) OTHER INFORMATION: /note= "Xaaat position72 is Asn, Gln, Glu or Asp;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 73 (D) OTHER INFORMATION: /note= "Xaaat position73 is Val 2 or Met ; "
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 100 3 {D) OTHER INFORMATION: /note= "Xaaat position100 is 0 Trp, Gln, Glu, Asp, Ser, Thr, Lys, His,..."
or (ix) FEATURE:
(A) NAME/KEY: Modified-site 3 (B) LOCATION: 138 (D) OTHER INFORMATION: /note= "Xaaat position138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His,Arg;said protein or having at least one of the following:"
4 (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaaat position1 is replaced with Glu, Asp, Ser, Thr, His,or Arg;"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 2 {D) OTHER INFORMATION: /note= "Xaaat position2 is 50 replaced with Glu, Asp, Ser, Thr, His,or Arg;"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 3 55 (D) OTHER INFORMATION: /note= "Ileat position3 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site 6 (B) LOCATION: 30 (D) OTHER INFORMATION: /note= "Valat position30 is replaced with Glu, Asp, Arg, Lys, is;"
or H
(ix} FEATURE:
65 (A) NAME/KEY: Modified-site SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCTIUS97/23549 (B) LOCATION: 36 (D) OTHER INFORMATION: /note= "Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 41 (D) OTHER INFORMATION: /note= "Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 42 (D) OTHER INFORMATION: /note= "Ile at position 42 is replaced with Glu, Asp, Arg, Lys, or His;"
( i.x) FEATURE
(A) NAME/KEY: Modified-site (B) LOCATION: 43 2 (D) OTHER INFORMATION: /note= "Pro at position 0 43 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site 2 (B) LOCATION: 45 (D) OTHER INFORMATION: /note= "Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
3 (A) NAME/KEY: Modified-site (B) LOCATION: 46 (D) OTHER INFORMATION: /note= "His at position 46 is replaced with Glu, Asp) Arg) or Lys;"
3 {ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 47 (D) OTHER INFORMATION: /note= "Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 48 (D) OTHER INFORMATION: /note= "Ile at position 48 is 4 replaced with Glu, Asp, Arg, Lys, 5 or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 49 5 (D) OTHER INFORMATION: /note= "Leu at position 0 49 is replaced with Glu, Asp, Arg) Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site 5 (B) LOCATION: 50 (D) OTHER INFORMATION: /note= "Thr at position 50 is replaced with Glu) Asp, Arg, Lys, or His;"
(ix) FEATURE:
6 (A) NAME/KEY: Modified-site (B) LOCATION: 74 (D) OTHER INFORMATION: /note= "Ile at position 74 is replaced with Gln, Gly, Asp, Arg, Thr or Ser;"
Lys, His, 65 (ix) FEATURE:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 (A) NAME/KEY: Modified-site (B) LOCATION: 89 (D) OTHER INFORMATION: /note= "Valat position 89 is replaced with Gln, Glu, Asp, Arg, His,Thr or Ser;"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 92 (D) OTHER INFORMATION: /note= "Pheat position 92 is replaced with Gln, Glu, Asp, Arg, His,Thr or Ser;~~
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 99 (D) OTHER INFORMATION: /note= "Proat position 99 is replaced with Gln, Glu, Asp, Arg, His,Thr or Ser; or"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site 2 (B) LOCATION: 142 (D) OTHER INFORMATION: /note= ~~Leuat position 142 is replaced with Glu, Asp, Arg, Lys, r a pharmaceutically or His; o acceptable salt thereof."
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: l:
Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile 4 0 Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro 5 5 Gly Cys (2) INFORMATION FOR SEQ ID N0:2:
60 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/LTS97/23549 (ii) MOLECULE TYPE: protein (xi) SEQUENCE :2:
DESCRIPTION:
SEQ
ID
Val ProIle Gln Lys Val AspAspThr LysThrLeu IleLysThr Gln Ile ValThr Arg Ile Asn IleSerHis ThrGlnSer ValSerSer Asp Lys GlnLys Val Thr Gly AspPheIle ProGlyLeu AspProile Leu Leu ThrLeu Ser Lys Met GlnThrLeu AlaValTyr GlnGlnIle Asp 2 Leu ThrSer Met Pro Ser AsnValIle GlnIleSer AsnAspLeu 0 Arg Glu AsnLeu Arg Asp Leu HisValLeu AlaPheSer LysSerCys Leu His LeuPro Trp Ala Ser LeuGluThr LeuAspSer LeuGlyGly Gly Val LeuGlu Ala Ser Gly SerThrGlu ValValAla LeuSerArg Tyr Leu GlnGly Ser Leu Gln MetLeuTrp GlnLeuAsp LeuSerPro Asp 3 Gly Cys (2) INFORMATI ON FOR SEQ ID
N0:3:
4 (i) SEQUENCE
O CHARACTERISTICS:
(A)LENGTH: 146 amino acids (B)TYPE: amino acid (C)STRANDEDNESS: e singl (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:3:
ial Pro Ile Gln 5ys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Asp Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 65 70 . 75 80 Glu Asn Leu Arg Asp Leu Leu ValLeuAla PheSerLys SerCys His His Leu Pro Trp Ala Ser Gly GluThrLeu AspSerLeu GlyGly Leu Val Leu Glu Ala Ser Gly Tyr ThrGluVal ValAlaLeu SerArg Ser Leu Gln Gly Ser Leu Gln Asp LeuTrpGln LeuAspLeu SerPro Met Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:4:
2 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:4:
ial Pro IleGln5ys ValGlnAsp AspThrLys ThrLeuIle LysThr Ile Val ThrArgIle AsnAspIle SerHisThr GlnSerVal SerSer Lys Gln LysValThr GlyLeuAsp PheIlePro GlyLeuAsp ProIle Leu Thr LeuSerLys MetAspGln ThrLeuAla ValTyrGln GlnIle 4 Leu Thr SerMetPro SerArgAsn ValIleGln IleSerAsn AspLeu Glu Asn LeuArgAsp LeuLeuHis ValLeuAla PheSerLys SerCys His Leu ProAspAla SerGlyLeu GluThrLeu AspSerLeu GlyGly Val Leu GluAlaSer GlyTyrSer ThrGluVal ValAlaLeu SerArg Leu Gln GlySerLeu GinAspMet LeuTrpGln LeuAspLeu SerPro 6 Gly Cys (2) INFORMATION OR
F SEQ
ID
N0:5:
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET ( rule 26 ) WO 98/Z8414 . . PCT/US97/23549 (A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: :5:
Val Pro Ile Gln Lys Val Gln AspThrLys ThrLeuIle LysThr Asp Ile Val Thr Arg Ile Asn Asp SerHisThr GlnSerVal SerSer Ile Lys Gln Lys Val Thr Gly Leu PheIlePro GlyLeuHis ProAsp Asp Leu Thr Leu Ser Lys Met Asp ThrLeuAla ValTyrGln GlnIle Gln 2 Leu Thr Ser Met Pro Ser Arg ValIleGln IleSerAsn AspLeu 5 Asn Glu Asn Leu Arg Asp Leu Leu ValLeuAla PheSerLys SerCys His His Leu Pro Asp Ala Ser Gly GluThrLeu AspSerLeu GlyGly Leu Val Leu Glu Ala Ser Gly Tyr ThrGluVal ValAlaLeu SerArg Ser Leu Gln Gly Ser Leu Gln Asp LeuTrpGln LeuAspLeu SerPro Met 4 Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:6:
4 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:6:
Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro 2 0 Gly Cys (2) INFORMATION FOR SEQ ID N0:7:
2 5 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:7:
ial Pro Ile Gln 5ys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile 50 Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro 6 5 Gly Cys SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 (2) INFORMATION
FOR
SEQ
ID N0:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: :8:
Met Arg Val Pro Ile Gln GlnAsp Asp Thr Lys Thr Leu Lys Val Ile Lys Thr Ile Val Thr Arg AspIle Ser His Thr Gln Ser Ile Asn Val Ser Ser Lys Gln Lys Val LeuAsp Phe Ile Pro Gly Leu Thr Gly Asp Pro Ile Leu Thr Leu Ser AspGln Thr Leu Ala Val Tyr Lys Met Gln 3 Gln Ile Leu Thr Ser Met ArgAsn Val Ile Gln Ile Ser 0 Pro Ser Asn Asp Leu Glu Asn Leu Arg LeuHis Val Leu Ala Phe Ser Asp Leu Lys Ser Cys His Leu Pro Trp GlyLeu Glu Thr Leu Asp Ser Ala Ser Leu Gly Gly Val Leu Glu Ala TyrSer Thr Glu Val Val Ala Ser Gly Leu Ser Arg Leu Gln Gly Ser AspMet Leu Trp Gln Leu Asp Leu Gln Leu 4 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:9:
5 (i) SEQUENCE CHARACTERISTICS:
O
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Lys Thr Ile Val Thr Arg AsnAspIleSer HisThrGln SerVal Il.e Ser Ser 35s Gln Lys Val GlyLeuAspPhe IleProGly LeuHis Thr Pro Asp Leu Thr Leu Ser MetAspGlnThr LeuAlaVal TyrGln Lys Gln Ile Leu Thr Ser Met SerArgAsnVal IleGlnIle SerAsn Pro Asp Leu Glu Asn Leu Arg LeuLeuHisVal LeuAlaPhe SerLys Asp Ser Cys His Leu Pro Trp SerGlyLeuGlu ThrLeuAsp SerLeu Ala Gly Gly Val Leu Glu Ala GlyTyrSerThr GluValVal AlaLeu Ser Ser Arg Leu Gln Gly Ser GlnAspMetLeu TrpGlnLeu AspLeu Leu 2 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID NO:10:
30 (i) SEQUENCE :
CHARACTERISTICS
(A) LENGTH: 148 aminocids a (B) TYPE: amino acid (C) STRANDEDNESS: e singl (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
NO:10:
Met Arg ValProIle GlnLysVal GlnAspAspThr LysThrLeu Ile Lys Thr IleValThr ArgIleAsn AspIleSerHis ThrGlnSer Val Ser Ser LysGlnLys ValThrGly LeuAspPheIle ProGlyLeu Asp Pro Ile LeuThrLeu SerLysMet AspGlnThrLeu AlaValTyr Gln 55 Gln Ile LeuThrSer MetProSer ArgAsnValIle GlnIleSer Asn Asp Leu GluAsnLeu ArgAspLeu LeuHisValLeu AlaPheSer Lys Ser Cys HisLeuPro AspAIaSer GlyLeuGluThr LeuAspSer Leu Gly Gly ValLeuGlu AlaSerGly TyrSerThrGlu ValValAla Leu SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Ser Arg Leu Gln Gly Ser Asp MetLeuTrp GlnLeuAsp Leu Gln Leu Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:
SEQ ID N0:11:
Met Arg Val Pro Ile Gln Gln AspAspThr LysThrLeu Lys Val Ile Lys Thr Ile Val Thr Arg Asp IleSerHis ThrGlnSer Ile Asn Val Ser Ser Lys Gln Lys Val Leu AspPheIle ProGlyLeu Thr Gly His Pro Asp Leu Thr Leu Ser Asp GlnThrLeu AlaValTyr Lys Met Gln 3 Gln Ile Leu Thr Ser Met Arg AsnValIle GlnIleSer 5 Pro Ser Asn Asp Leu Glu Asn Leu Arg Leu HisValLeu AlaPheSer Asp Leu Lys Ser Cys His Leu Pro Asp Gly LeuGluThr LeuAspSer Ala Ser Leu Gly Gly Val Leu Glu Ala Tyr SerThrGlu ValValAla Ser Gly Leu Ser Arg Leu Gln Gly Ser Asp MetLeuTrp GlnLeuAsp Leu Gln Leu 5 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:12:
5 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid {C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . . PCT/C1S97/23549 (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:12:
Met Arg Val Pro Ile Gln Val GlnAspAsp ThrLysThr LeuIle Lys Lys Thr Ile Val Thr Arg Asn AspIleSer HisThrGln SerVal Ile Ser Ser Lys Gln Lys Val Gly LeuAspPhe IleProGly LeuHis Thr Pro Ile Leu Thr Leu Ser Met AspGlnThr LeuAlaVal TyrGln Lys Gln Ile Leu Thr Ser Met Ser ArgAsnVal IleGlnIle SerAsn Pro Asp Leu Glu Asn Leu Arg Leu LeuHisVal LeuAlaAsp SerLys Asp Ser Cys His Leu Pro Trp Ser GlyLeuGlu ThrLeuAsp SerLeu Ala Gly Gly Val Leu Glu Ala Gly TyrSerThr GluValVal AlaLeu Ser Ser Arg Leu Gln Gly Ser Gln AspMetLeu TrpGlnLeu AspLeu Leu 3 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:13:
3 (i) SEQUENCE
5 CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: e singl (D) TOPOLOGY: linear (ii) MOLECULE
TYPE:
protein (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:13:
Met ArgVal ProIleGln LysValGln AspAspThr LysThrLeu Ile Lys ThrIle ValThrArg IleAsnAsp IleSerHis ThrGlnSer Val Ser SerLys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeu His Pro IleLeu ThrLeuSer LysMetAsp GlnThrLeu AlaValTyr Gln 60 Gln IleLeu ThrSerMet ProSerArg AsnValIle GlnIleSer Asn Asp LeuGlu AsnLeuArg AspLeuLeu HisValLeu AlaAspSer Lys SUBSTITUTE SHEET ( rule 26 ) Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 458 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE
DESCRIPTION:
SEQ ID
N0:14:
CATATGAGGGTACCTATCCA AAAAGTACAA GATGACACCAAAACACTGAT AAAGACAATA
GTCACAAGGATAAATGATAT CTCACACACA CAGTCAGTCTCATCTAAACA GAAAGTCACA
GGCTTGGACTTCATACCTGG GCTGCACCCC ATACTGACATTGTCTAAAAT GGACCAGACA
1ao CTGGCAGTCTATCAACAGAT CTTAACAAGT ATGCCTTCTAGAAACGTGAT ACAAATATCT
TTGCCATGGGCCAGTGGCCT GGAGACATTG GACAGTCTGGGGGGAGTCCT GGAAGCCTCA
GGCTATTCTACAGAGGTGGT GGCCCTGAGC AGGCTGCAGGGGTCTCTGCA AGACATGCTG
TGGCAGCTGGACCTGAGCCC CGGGTGCTAA TAGGATCC
(2) INFORMATION
FOR SEQ
ID N0:15:
(i) SEQUENCE
CHARACTERISTICS:
5 (A) LENGTH: 119 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear 6 (ii) MOLECULE
O TYPE:
cDNA
65 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:15:
SUBSTITUTE SHEET ( rule 26 ) ATCTCACACA CACAGTCAGT CTCGAGTAAA CAGAAAGTCA CAGGCTTGGA CTTCATACCT
(2) INFORMATION FOR SEQ ID N0:16:
1 0 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16:
GATCTGTTGA TAGACTGCCA GTGTCTGGTC CATTTTAGAC AATGTCAGGT CGGGGTGCAG
60 .
CCCAGGTATG AAGTCCAAGC CTGTGACTTT CTGTTTACTC GAGACTGACT GTGTGTGTGA
GAT
(2) INFORMATION FOR SEQ ID N0:17:
(i) 5EQUENCE CHARACTERISTICS:
3 5 {A) LENGTH: 87 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear 4 O (ii) MOLECULE TYPE: cDNA
4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:17:
TGAGGCTTCC AGGACTCCCC CCAGACTGTC CAATGTCTCC AGGCCACTGG CGTCTGGCAA
(2) INFORMATION FOR SEQ ID N0:18:
5 5 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 84 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97J23549 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:18:
GTGCTGGCCT TCTCTAAAAG TTGCCACTTG CCAGACGCCA GTGGCCTGGA GACATTGGAC
AGTCTGGGGG GAGTCCTGGA AGCC
(2} INFORMATION FOR SEQ ID N0:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19:
TCGAGTAAAC AGAAGGTCAC CGGCTTGGAC TTCATACCTG GGCTGGACCC CATACTGACA
TTGTCTAAAA TGGACCAGAC ACTGGCAGTC TATCAACA
3 0 (2) INFORMATION FOR SEQ ID N0:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 base pairs (B) TYPE: nucleic acid 3 5 (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20:
CCCAGGTATG AAGTCCAAGC CGGTGACCTT CTGTTTAC
(2) INFORMATION FOR SEQ ID N0:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21:
SUBSTITUTE SHEET ( ruie 26 ) WO 98/28414 . PCT/US97/23549 TGAGGCTTCC AGGACTCCCC CCAGACTGTC CAATGTCTCC AGGCCACTGG CGTCTGGCAA
GTGGCAACTT TTAGAGAAGG CCAGCAC
5 8~
(2) INFORMATION FOR SEQ ID N0:22:
(i) SEQUENCE CHARACTERISTICS:
10 (A) LENGTH: 84 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear 15 {ii) MOLECULE TYPE: cDNA
2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:22:
GTGCTGGCCT TCTCTAAAAG TTGCCACTTG CCAGACGCCA GTGGCCTGGA GACATTGGAC
SUBSTITUTE SHEET ( rule 26 )
Field of the Invention The present invention is in the field of human medicine, particularly in the treatment of obesity and disorders associated with obesity. Most specifically the invention relates to anti-obesity proteins that when administered to a patient regulate fat tissue.
Hackgrouad of the Invention Obesity, and especially upper body obesity, is a common and very serious public health problem in the United States and throughout the world. According to recent statistics, more than 25% of the United States population and 27% of the Canadian population are overweight [Kuczmarski, R. J., Amer. J. of Clin. Nutr. 55:4955-502S
(1992) ; Reeder, B. A. , et. a1. , Can. Med. Assoc. J. , 146:2009-2019 (1992)]. Upper body obesity is the strongest risk factor known for people with type 2 diabetes, and is a strong risk factor for cardiovascular disease and cancer as well. Recent estimates for the medical cost of obesity are $150,000,000,000 world wide. The problem has become serious enough that the surgeon general has begun an initiative to combat the ever increasing adiposity rampant in American society.
Much of this obesity-induced pathology can be attributed to the strong association with dyslipidemia, hypertension, and insulin resistance. Many studies have demonstrated that reduction in obesity by diet and exercise reduces these risk factors dramatically. Unfortunately, these treatments are largely unsuccessful, with a failure rate reaching 95%. This failure may be due to the fact that the condition is strongly associated with genetically inherited factors that contribute to increased appetite, preference for highly caloric foods, reduced physical activity, and increased lipogenic metabolism. This indicates that people inheriting these genetic traits are SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 prone to becoming obese regardless of their efforts to combat the condition. Therefore, a pharmacological agent that can correct this adiposity handicap and allow the physician to successfully treat obese patients in spite of their genetic inheritance is needed.
Physiologists have postulated for years that, when a mammal overeats, the resulting excess fat signals to the brain that the body is obese, which in turn causes the body to eat less and burn more fuel [Hervey, G. R., Nature 222:629-631 (1969)]. This "feedback" model is supported by parabiotic experiments, which implicate a circulating hormone controlling adiposity.
The ob/ob mouse is a model of obesity and diabetes that is known to carry an autosomal recessive trait linked to a mutation in the sixth chromosome. Recently, Zhang, Y.
and co-workers published the positional cloning of the mouse gene linked with this condition [Zhang, Y., et al. Nature 372:425-432 (1994)]. This report disclosed a gene coding for a 167 amino acid protein with a 21 amino acid signal peptide that is exclusively expressed in adipose tissue.
The rat obese gene was subsequently cloned and expressed [Murakami, T. et al., Biochem. Biophys. Res. Comm. 209:944-952 (1995)]. The protein, which is apparently encoded by the ob gene, is now speculated to be the adiposity regulating hormone.
The protein encoded by the ob gene is pharmacologically active, but the physical properties of the protein are less than desirable. The human protein, for example, is prone to precipitation and aggregation in both a formulation and under physiological conditions.
Formulations of a protein containing a precipitate, or that permit precipitation at the site of administration, increase the risk of producing an immunological response in the patient and may result in irritation at the site of injection. Additives to the formulation, such as detergents or surfactants, are undesirable because of increased risk of SUBSTITUTE SHEET ( rule 26 ) WO 98128414 . PCT/US97/23549 toxicology. Accordingly, there remains a need to develop pharmacological agents that demonstrate pharmacological activity and provide improved physical and chemical stability.
Applicants have discovered that the aggregation observed in the native human protein is due, in part, to hydrophobic interactions at the surface of the protein, particularly at residues 1, 2, 3, 30, 36, 41) 42, 43, 45, 46, 47, 48, 49, 50, 74, 89, 92, 99, 100, 138, and 142.
Applicants have further discovered that, by substituting these positions or regions of the protein with charged or hydrophilic amino acids, the propensity of the ob protein to aggregate is dramatically reduced. This reduction in aggregation provides a much improved pharmacological agent.
Accordingly, the present invention provides biologically active obesity proteins. Such agents allow patients to overcome their obesity handicap and live normal lives with a more normalized risk for type 2 diabetes, cardiovascular disease, and cancer.
Summary of Invention The present invention is directed to a protein of the Formula ( I ) 5 10 i5 2 5 Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu 4 0 Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly SUBSTITUTE SHEET ( ruie 26 ) WO 98/28414 . PCT/US97/23549 Val Leu Glu Ala Ser Gly Tyr Ser.Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO :1 ) ( I ) wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn or Ser;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Gln, Glu or Asp;
Xaa at position 73 is Val or Met;
Xaa at position 100 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
Xaa at position 1 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Ile at position 3 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 30 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;
Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 42 is replaced with Glu, Asp, Arg, Lys, or His;
Pro at position 43 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/(TS97/23549 His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 49 is replaced with Glu, Asp, Arg, Lys, or His;
Thr at position 50 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 74 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Val at position 89 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Phe at position 92 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Pro at position 99 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser; or Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
The invention further provides a method of treating obesity or those condit ions associated with obesity, which comprises adminis tering to a mammal in need thereof a protein of the Formula (I).
The invention further provides a pharmaceutical formulation, which comprises a rotein of the Formula (I), p optionally having a leader seque nce, together with one or more pharmaceutically acceptable diluents, carriers, or excipients therefor.
The invention further provides proteins of the Formula (I) having additionally a leader sequence bonded to the N-terminus of the protein of Formula I. Such proteins are useful for their anti-obesity activity and as intermediates in the preparation of proteins of the Formula (I) .
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/ITS97/23549 The invention further provides DNA encoding a protein of Formula {I), or a protein of Formula (I) having a leader sequence.
An additional embodiment of the present invention is a process for producing a protein of Formula (I), which comprises:
(a) transforming a host cell with DNA that encodes the protein of Formula (I) or a protein of Formula (I) having a leader sequence;
(b) culturing the host cell under conditions permitting expression of the protein;
(c) recovering the expressed protein; and, optionally, (d) cleaving enzymatically the leader sequence.
The invention further provides a protein for use in treating obesity and conditions associated with obesity, as well as, for use in preparing a medicament used to treat obesity and conditions associated with obesity.
Detailed Description For purposes of the present invention, as disclosed and claimed herein, the following terms and abbreviations are defined as follows:
Base pair (bp) -- refers to DNA or RNA. The abbreviations A,C,G, and T correspond to the 5~-monophosphate forms of the nucleotides (deoxy)adenine, (deoxy)cytidine, (deoxy)guanine, and {deoxy)thymine, respectively, when they occur in DNA molecules. The abbreviations U,C,G, and T correspond to the 5~-monophosphate forms of the nucleosides uracil, cytidine, guanine, and thymine, respectively, when they occur in RNA
molecules. In double stranded DNA, base pair may refer to a partnership of A with T or C with G. In a DNA/RNA
heteroduplex, base pair may refer to a partnership of T or U
with A or C with G.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97I23549 - 7 _ FMOC - an abbreviation for 9-fluorenymethoxycarbony.
Immunoreactive Protein(s) -- a term used to collectively describe antibodies, fragments of antibodies capable of binding antigens of a similar nature as the parent antibody molecule from which they are derived, and single chain polypeptide binding molecules [Bird, E. R., et al., PCT Application No. PCT/US 87/02208, International Publication No. WO 88/01649, published March 10, 1988].
Plasmid -- an extrachromosomal self-replicating genetic element.
PAM - an abbreviation for 4-hydroxymethyl-phenylacetamidomethyl.
PMSF -- an abbreviation for phenylmethylsulfonyl fluoride.
Reading frame -- the nucleotide sequence from which translation occurs "read" in triplets by the translational apparatus of tRNA, ribosomes and associated factors, each triplet corresponding to a particular amino acid. Because each triplet is distinct and of the same length, the coding sequence must be a multiple of three. A
base pair insertion or deletion (termed a frameshift mutation) may result in two different proteins being coded for by the same DNA segment. To insure against this, the triplet codons corresponding to the desired polypeptide must be aligned in multiples of three from the initiation codon, i.e., the correct "reading frame" must be maintained.
Recombinant DNA Cloning Vector -- any autonomously replicating agent including, but not limited to, plasmids and phages, comprising a DNA molecule to which one or more additional DNA segments can or have been added.
Recombinant DNA Expression Vector -- any recombinant DNA cloning vector in which a promoter has been incorporated.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 _ g _ Replicon -- A DNA sequence that controls and allows for autonomous replication of a plasmid or other vector.
TFA - an abbreviation for trifluoroacetic acid.
Transcription -- the process whereby information contained in a nucleotide sequence o~f DNA is transferred to a complementary RNA sequence.
Translation -- the process whereby the genetic information of messenger RNA is used to specify and direct the synthesis of a polypeptide chain.
Tris -- an abbreviation for tris(hydroxymethyl)-aminomethane.
Treating -- describes the management and care of a patient for the purpose of combating the disease, condition, or disorder and includes the administration of a protein of present invention to prevent the onset of the symptoms or complications, alleviating the symptoms or complications, or eliminating the disease, condition) or disorder. Treating obesity, therefore, includes the inhibition of food intake, the inhibition of weight gain, and inducing weight loss in patients in need thereof.
Vector -- a replicon used for the transformation of cells in gene manipulation bearing polynucleotide sequences corresponding to appropriate protein molecules which, when combined with appropriate control sequences, confer specific properties on the host cell to be transformed. Plasmids, viruses, and bacteriophage are suitable vectors, since they are replicons in their own right. Artificial vectors are constructed by cutting and joining DNA molecules from different sources using restriction enzymes and ligases. Vectors include Recombinant DNA cloning vectors and Recombinant DNA
expression vectors.
X-gal -- an abbreviation for 5-bromo-4-chloro-3-indolyl beta-D-galactoside.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/ITS97/23549 g _ The amino acid abbreviations are accepted by the United States Patent and Trademark Office as set forth in 37 C.F.R. ~ 1.822 (b)(2) (1993). Unless otherwise indicated the amino acids are in the L configuration.
As noted above the present invention provides a protein of the Formula (I):
Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO : 1 ) ( I ) wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn or Ser;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Gln, Glu or Asp;
Xaa at position 73 is Val or Met;
Xaa at position 100 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Xaa at position l is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Ile at position 3 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 30 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;
Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 42 is replaced with Glu, Asp) Arg, Lys, or His;
Pro at position 43 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 49 is replaced with Glu, Asp, Arg, Lys, or His;
Thr at position 50 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 74 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Val at position 89 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Phe at position 92 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Pro at position 99 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser; or SUBSTITUTE SHEET ( rule 26 ) Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
Preferred proteins are those having one to three substitutions; most preferably one or two substitutions defined in Formula (I).
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Val at position 36 is replaced with Arg, Lys, or His;
Ile at position 74 is replaced with Glu or Asp, or Phe at position 92 is replaced with Arg, Lys, or His.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Phe at position 92 is replaced with Asp, Glu, Lys, Arg, or His, or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID N0:1, wherein:
Phe at position 92 is replaced with Asp or Glu, or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 73 is Val;
Xaa at position 100 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/tTS97/23549 Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His; or Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp, Glu, or Asp;
Xaa at position 138 is Trp, Glu or Asp;
said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO: I, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp, Glu, or Asp;
Xaa at position 138 is Trp, Glu or Asp;
SUBSTITUTE SHEET ( rule 26 ) WO 98128414 . PCT/US97/23549 said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys; or Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Still more preferred are proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp or Asp;
Xaa at position 138 is Trp;
said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Ile at position 48 is replaced with Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Asp, Glu, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Still more preferred are proteins of the present invention are those of SEQ ID NO: l, wherein:
Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at pos~i.tion 28 is Gln or absent;
Xaa at position 72 is Asn;
Xaa at position 100 is Trp or Asp;
Xaa at position 138 is Trp;
said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys; or SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Ile at position 48 is replaced with Asp, Arg, Lys, or His;
or pharmaceutically acceptable salts thereof.
Other preferred proteins of the present invention are those of SEQ ID NO:1, wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Glu or Asp;
Xaa at position 100 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
Xaa at position 1 is replaced with Glu, Asp, Ser, Thr, Lys, His, Arg, or absent;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, Arg, or absent;
Val at position 89 is replaced with Glu, Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
Other preferred proteins are those of Formula I
wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Glu or Asp;
Xaa at position 100 is Glu, Asp, Lys, His, or Arg;
Xaa at position 138 is Trp, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . . PCT/US97/23549 Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His;
Pro position 99 is replaced Glu, Asp, Arg, at with Lys, r is;
o H
or a cally acceptable salt pharmaceuti thereof.
Preferred s ithin ormula I nclude specie w F i species of SEQID N0: 2 SEQID N0:7:
through Val ProIleGln LysVal AspAspThr ThrLeu IleLysThr Gln Lys Ile ValThrArg IleAsn IleSerHis GlnSer ValSerSer Asp Thr Lys GlnLysVal ThrGly AspPheIle GlyLeu AspProIle Leu Pro Leu ThrLeuSer LysMet GlnThrLeu ValTyr GlnGlnIle Asp Ala Leu ThrSerMet ProSer AsnValIle IleSer AsnAspLeu Arg Gln Glu AsnLeuArg AspLeu HisValLeu PheSer LysSerCys Leu Ala His LeuProTrp AlaSer LeuGluThr AspSer LeuGlyGly Gly Leu 3 Val LeuGluAla SerGly SerThrGlu ValAla LeuSerArg 0 Tyr Val Leu GlnGlySer LeuGln MetLeuTrp LeuAsp LeuSerPro Asp Gln Gly Cys ( ID NO :
SEQ 2 ) Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser 4 5 Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Asp Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCTlUS97/Z3549 115 120 , 125 Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO : 3 ) Val Pro IleGln LysValGln AspAspThr LysThrLeuIle LysThr Ile Val ThrArg IleAsnAsp IleSerHis ThrGlnSerVal SerSer Lys Gln LysVal ThrGlyLeu AspPheIle ProGlyLeuAsp ProIle 2 Leu Thr LeuSer LysMetAsp GlnThrLeu AlaValTyrGln GlnIle Leu Thr SerMet ProSerArg AsnValIle GlnIleSerAsn AspLeu 85 90 g5 Glu Asn LeuArg AspLeuLeu HisValLeu AlaPheSerLys SerCys His Leu ProAsp AlaSerGly LeuGluThr LeuAspSerLeu GlyGly Val Leu GluAla SerGlyTyr SerThrGlu ValValAlaLeu SerArg 3 Leu Gln GlySer LeuGlnAsp MetLeuTrp GlnLeuAspLeu SerPro Gly Cys ( Q
SE ID
NO
:
) Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Asp 5 0 Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile SUBSTITUTE SHEET ( rule 26 ) WO 98128414 . PCT/US97123549 Leu Thr SerMetPro SerArgAsn ValIleGln IleSerAsn AspLeu Glu Asn LeuArgAsp LeuLeuHis ValLeuAla PheSerLys SerCys His Leu ProAspAla SerGlyLeu GluThrLeu AspSerLeu GlyGly Val Leu GluAlaSer GlyTyrSer ThrGluVal ValAlaLeu SerArg Leu Gln GlySerLeu GlnAspMet LeuTrpGln LeuAspLeu SerPro Gly Cys ( Q
SE ID
NO
:
) Val Pro IleGlnLys ValGlnAsp AspThrLys ThrLeuIle LysThr Ile Val ThrArgIle AsnAspIle SerHisThr GlnSerVal SerSer 2 5 Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro IIe Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Glri Gln Ile Leu ThrSerMet ProSerArg AsnValIle GlnIleSerAsn AspLeu Glu AsnLeuArg AspLeuLeu HisValLeu AlaAspSerLys SerCys His LeuProTrp AlaSerGly LeuGluThr LeuAspSerLeu GlyGly 4 Val LeuGluAla SerGlyTyr SerThrGlu ValValAlaLeu SerArg Leu GlnGlySer LeuGlnAsp MetLeuTrp GlnLeuAspLeu SerPro Gly Cys ( ID NO :
) Val ProIleGln LysValGln AspAspThr LysThrLeuIle LysThr Ile ValThrArg IleAsnAsp IleSerHis ThrGlnSerVal SerSer Lys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeuHis ProIle SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97123549 Leu Thr Leu Ser Lys Met Asp Gln~Thr Leu Ala Val Tyr Gln Gln Ile 65 70 75 g0 Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro 2 0 Gly cys ( SEQ ID NO : 7 ) As mentioned above, the invention also provides proteins of the Formula (I) having additionally a leader sequence bonded to the N-terminus of the protein of Formula I. Such proteins are useful for their anti-obesity activity and as intermediates in the preparation of proteins of the Formula (I). The species having SEQ ID N0:8-13 are particularly preferred proteins of Formula (I) having the leader sequence Met-Arg bonded to the N-terminus of the proteins of SEQ ID N0:2-7, respectively.
Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile 3 Lys ThrIle ValThrArg IleAsnAsp IleSerHis ThrGlnSer Val Ser SerLys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeu Asp Pro IleLeu ThrLeuSer LysMetAsp GlnThrLeu AlaValTyr Gln Gln IleLeu ThrSerMet ProSerArg AsnValIle GlnIleSer Asn Asp LeuGlu AsnLeuArg AspLeuLeu HisValLeu AlaPheSer Lys 5 Ser CysHis LeuProTrp AlaSerGly LeuGluThr LeuAspSer Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/I1S97/23549 Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO : 8 ) Met Arg ValProIleGln LysVal GlnAspAsp ThrLysThrLeu Ile Lys Thr IleValThrArg IleAsn AspIleSer HisThrGlnSer Val Ser Ser LysGlnLysVal ThrGly LeuAspPhe IleProGlyLeu His Pro Asp LeuThrLeuSer LysMet AspGlnThr LeuAlaValTyr Gln 2 Gln Ile LeuThrSerMet ProSer ArgAsnVal IleGlnIleSer Asn 85 90 . 95 Asp Leu GluAsnLeuArg AspLeu LeuHisVal LeuAlaPheSer Lys loo 105 110 Ser Cys HisLeuProTrp AlaSer GlyLeuGlu ThrLeuAspSer Leu Gly Gly ValLeuGluAIa SerGly TyrSerThr GluValValAla Leu Ser Arg LeuGlnGlySer LeuGln AspMetLeu TrpGlnLeuAsp Leu 3 Ser Pro GlyCys ( ID NO
5 SEQ :
) Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile 4 0 Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu Asp Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys SUBSTITUTE SHEET ( rule 26 ) Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro GIy Cys ( SEQ ID NO :10 ) Met ArgVal ProIleGln LysValGln AspAspThr LysThrLeuIle Lys ThrIle ValThrArg IleAsnAsp IleSerHis ThrGlnSerVal 2 Ser SerLys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeuHis Pro AspLeu ThrLeuSer LysMetAsp GlnThrLeu AlaValTyrGln Gln IleLeu ThrSerMet ProSerArg AsnValIle GlnIleSerAsn Asp LeuGlu AsnLeuArg AspLeuLeu HisValLeu AlaPheSerLys loo l05 llo Ser CysHis LeuProAsp AlaSerGly LeuGluThr LeuAspSerLeu 3 Gly GlyVal LeuGluAla SerGlyTyr SerThrGlu ValValAlaLeu Ser ArgLeu GlnGlySer LeuGlnAsp MetLeuTrp GlnLeuAspLeu Ser ProGly Cys ( ID NO
SEQ :11 ) Met Arg Val Pro Ile Gln Lys Val GIn Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val 5 0 Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/Z3549 Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu ser Pro Gly Cys ( SEQ ID NO :12 ) Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile 2 0 Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys 3 5 Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys ( SEQ ID NO :13 ) The present invention provides biologically active proteins that provide effective treatment for obesity.
Applicants have discovered that specific substitutions to the hydrophobic residues on the surface of the protein result in improved properties. These residues could not be predicted from the primary sequence. The proteins having these substitutions are pharmacologically active and have a reduced propensity to aggregate when compared to both the SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 mouse and human forms of the protein. The present invention permits obesity protein to be more readily formulated at higher concentrations, and increases their pharmaceutical elegance, because they are compatible with commonly-used preservatives.
The claimed proteins ordinarily are prepared by recombinant techniques. Techniques for making substitutional mutations at predetermined sites in DNA
having a known sequence are well known, for example M13 primer mutagenesis. The mutations that might be made in the DNA encoding the present anti-obesity proteins must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure [DeBoer, H.A., et al., European Patent Publication No. 75,444 A2, published March 30, 1983].
The proteins of the present invention may be produced either by recombinant DNA technology or well known chemical procedures, such as solution or solid-phase peptide synthesis, or semi-synthesis in solution beginning with protein fragments coupled through conventional solution methods.
A. Solid Phase The synthesis of the claimed proteins may proceed by solid phase peptide synthesis or by recombinant methods.
The principles of solid phase chemical synthesis of polypeptides are well known in the art and may be found in general texts in the area such as Dugas, H. and Penney, C., Bioorganic Chemistry Springer-Verlag, New York, (1981) 54-59. For example, peptides may be synthesized by solid-phase methodology utilizing an PE-Applied Biosystems 433A peptide synthesizer (Perkin Elmer-Applied Biosystems Division, Foster City, California) and synthesis cycles supplied by Applied Biosystems. Boc amino acids and other reagents are commercially available from PE-Applied Biosystems and other chemical supply houses. Sequential Boc chemistry using SUBSTITUTE SHEET { rule 26 ) WO 98/28414 . PCT/US97/23549 double couple protocols are applied to the starting p-methyl benzhydryl amine resins for the production of C-terminal carboxamides. For the production of C-terminal acids, the corresponding PAM resin is used. Arginine, Asparagine, Glutamine, Histidine, and Methionine are coupled using preformed hydroxy benzotriazole esters. The following side chain protection may be used:
Arg, Tosyl Asp, cyclohexyl or benzyl Cys, 4-methylbenzyl Glu, cyclohexyl His, benzyloxymethyl Lys, 2-chlorobenzyloxycarbonyl Met, sulfoxide I5 Ser, Benzyl Thr, Benzyl Trp, formyl Tyr, 4-bromo carbobenzoxy Boc deprotection may be accomplished with trifluoroacetic acid (TFA) in methylene chloride. Formyl removal from Trp is accomplished by treatment of the peptidyl resin with 20%
piperidine in dimethylformamide for 60 minutes at 4°C.
Met(O) can be reduced by treatment of the peptidyl resin with TFA/dimethylsulfide/conc. HC1 (95/5/1) at 25°C for 60 minutes. Following the above pre-treatments, the peptides may be further deprotected and cleaved from the resin with anhydrous hydrogen fluoride containing a mixture of 10% m-cresol or m-cresol/10% p-thiocresol or m-cresol/p-thiocresol/dimethyl-sulfide. Cleavage of the side chain protecting groups) and of the peptide from the resin is carried out at 0°C or below, preferably -20°C for thirty minutes followed by thirty minutes at 0°C. After removal of the HF, the peptide/resin is washed with ether. The peptide is extracted with glacial acetic acid and lyophilized.
Purification is accomplished by reversed-phase C18 SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 chromatography in 0.1% TFA with a gradient of increasing acetonitrile concentration, e.g., a 2.2 cm X 25 cm VydacT""
column (The Separations Group, Inc., Hesperia, CA).
One skilled in the art recognizes that the solid phase synthesis could also be accomplished using the FMOC
strategy and a TFA/scavenger cleavage mixture.
B. Recombinant Synthesis The claimed proteins may also be produced by recombinant methods. Recombinant methods are preferred if a high yield or if specific post-translational modifications are desired. The basic steps in the recombinant production of protein include:
a) construction of a synthetic or semi-synthetic (or isolation from natural sources) DNA
encoding the claimed protein, b) integrating the coding sequence into an expression vector in a manner suitable for the expression of the protein either alone or as a fusion protein, c) transforming an appropriate eukaryotic or prokaryotic host cell with the expression vector, and d) recovering and purifying the recombinantly produced protein.
a. Gene Construction Synthetic genes, the in vitro or in vivo transcription and translation of which will result in the production of the protein may be constructed by techniques well known in the art. Owing to the natural degeneracy of the genetic code, the skilled artisan will recognize that a sizable yet definite number of DNA sequences may be constructed which encode the claimed proteins. In the preferred practice of the invention, synthesis is achieved by recombinant DNA technology.
Methodology of synthetic gene construction is well known in the art (Brown, E. L., et a.~. Methods in SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Enzymology, Academic Press,. New York, NY, 68:109-151 (1979)]. The DNA sequence corresponding to the synthetic claimed protein gene may be generated using conventional DNA
synthesizing apparatus such as the Applied Biosystems Model 380A or 380B DNA synthesizers (Perkin Elmer, Applied Biosystems Division, Foster City, Cahifornia).
It may desirable in some applications to modify the coding sequence of the claimed protein so as to incorporate a convenient protease sensitive cleavage site, e.g., between the signal peptide and the structural protein facilitating the controlled excision of the signal peptide from the fusion protein construct.
The gene encoding the claimed protein may also be created by using polymerase chain reaction (PCR). The template can be a cDNA library (commercially available from CLONETECH or STRATAGENE) or mRNA isolated from human adipose tissue. Such methodologies are well known in the art. See, e.g., Maniatis, T., et a1. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989).
b. Direct Expression or Fusion Protein The claimed protein may be made either by direct expression or as fusion protein comprising the claimed protein followed by enzymatic or chemical cleavage. A
variety of peptidases (e.g., trypsin) which cleave a polypeptide at specific sites or digest the peptides from the amino or carboxy termini (e.g., diaminopeptidase) of the peptide chain are known. Furthermore, particular chemicals (e.g., cyanogen bromide) will cleave a polypeptide chain at specific sites. The skilled artisan will appreciate the modifications necessary to the amino acid sequence (and synthetic or semi-synthetic coding sequence if recombinant means are employed) to incorporate site-specific internal cleavage sites. See, e.g., Carter, P., Chapter 13 in Protein Purification: From Molecular Mechanisms to Large-SUBSTITUTE SHEET ( rule 2b ) Scale Processes, Ladisch, M., et a1. (Eds.) American Chemical Soc., Washington, D.C. (1990).
c. Vector Construction Construction of suitable vectors containing the desired coding and control sequences employ standard ligation techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the form desired to form the plasmids required.
To effect the translation of the desired protein, one inserts the engineered synthetic DNA sequence in any of a plethora of appropriate recombinant DNA expression vectors through the use of appropriate restriction endonucleases. A
synthetic coding sequence. is designed to possess restriction endonuclease cleavage sites at either end of the transcript to facilitate isolation from and integration into these expression and amplification and expression plasmids. The isolated cDNA coding sequence may be readily modified by the use of synthetic linkers to facilitate the incorporation of this sequence into the desired cloning vectors by techniques well known in the art. The particular endonucleases employed will be dictated by the restriction endonuclease cleavage pattern of the parent expression vector to be employed. Restriction sites are chosen so as to properly orient the coding sequence with control sequences to achieve proper in-frame reading and expression of the claimed protein .
In general, plasmid vectors containing promoters and control sequences which are derived from species compatible with the host cell are used with these hosts.
The vector ordinarily carries a replication site as well as marker sequences which are capable of providing phenotypic selection in transformed cells. For example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species [Bolivar, F., et al., Gene 2:95-113 (1977)]. Plasmid pBR322 contains genes for ampicillin and tetracycline resistance and thus provides easy means for SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/Z3549 identifying transformed cells. The pHR322 plasmid, or other microbial plasmid must also contain or be modified to contain promoters and other control elements commonly used in recombinant DNA technology.
The desired coding sequence is inserted into an expression vector in the proper orientation to be transcribed from a promoter and ribosome binding site, both of which should be functional in the host cell in which the protein is to be expressed. An example of such an expression vector is a plasmid described in Belagaje, R. M., et al., U.S. patent No. 5,304,473, issued, April 19, 1994, the teachings of which are herein incorporated by reference.
The gene encoding A-C-B proinsulin described in U.S. Patent No. 5,304,473 can be removed from the plasmid pRBl82 with restriction enzymes Ndel and BamHI. The genes encoding the protein of the present invention can be inserted into the plasmid backbone on a Ndel/BamHI restriction fragment cassette.
d. Prokaryotic Ex ression In general, prokaryotes are used for cloning of DNA sequences in constructing the vectors useful in the invention. For example, E. coli K12 strain 294 (ATCC No.
31446) is particularly useful. Other microbial strains which may be used include E. coli B and E. coli X1776 (ATCC No.
31537). These examples are illustrative rather than limiting.
Prokaryotes also are used for expression. The aforementioned strains, as well as E. coli W3110 (prototrophic, ATCC No. 27325), bacilli such as Bacillus subtilis, and other enterobacteriaceae such as Salmonella typhimurium or Serratia marcescans, and various pseudomonas species may be used. Promoters suitable for use with prokaryotic hosts include the [i-lactamase (vector pGX2907 [ATCC 39344] contains the replicon and [3-lactamase gene) and lactose promoter systems [Chang, A. C. Y., et al., Nature, 275:617-624 (1978); and Goeddel, D. V., et al., Nature SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 281:544-548 (1979)], alkaline phosphatase, the tryptophan (trp) promoter system (vector pATHl [ATCC 37695] is designed to facilitate expression of an open reading frame as a trpE
fusion protein under control of the trp promoter) and hybrid promoters such as the tac promoter (isolatable from plasmid pDR540 ATCC-37282). However, other functional bacterial promoters, whose nucleotide sequences are generally known, enable one skilled in the art to ligate them to DNA encoding the protein using linkers or adaptors to supply any required restriction sites. Promoters for use in bacterial systems also will contain a Shine-Dalgarno sequence operably linked to the DNA encoding protein.
e. Eukaryotic Ex ression The protein may be recombinantly produced in eukaryotic expression systems. Preferred promoters controlling transcription in mammalian host cells may be obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, or from heterologous mammalian promoters, e.g., p-actin promoter. The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral origin of replication [Fiers, W., et al., Nature, 273:113-120 (1978)]. The entire SV40 genome may be obtained from plasmid pBRSV, ATCC 45019. The immediate early promoter of the human cytomegalovirus may be obtained from plasmid pCMBQ
(ATCC 77177). Of course, promoters from the host cell or related species also are useful herein.
Transcription of a DNA encoding the claimed protein by higher eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about 10-300 bp, that act on a promoter to increase its transcription. Enhancers are relatively orientation and position independent having been found 5~ [Laimins, L. A., et al., Proc. Nat~1 Acad. Sci.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/I1S97/23549 (USA) 78:464-468 (1981)] and 3~ [Lusky, M., et al., Mol.
Cell. Bio. 3:1108-1122 (1983)] to the transcription unit, within an intron [Banerji, J., et al., Cell 33:729-740 (1983)] as well as within the coding sequence itself [Osborne, T. F., et al., Mol. Cell. Bio. 4:1293-1305 (1984)). Many enhancer sequences are now known from mammalian genes (globin, RSV, SV40, EMC, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 late enhancer, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription which may affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding protein. The 3~ untranslated regions also include transcription termination sites.
Expression vectors may contain a selection gene, also termed a selectable marker. Examples of suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR, which may be derived from the BgIII/HindIII
restriction fragment of pJOD-10 [ATCC 68815)), thymidine kinase (herpes simplex virus thymidine kinase is contained on the BamHI fragment of vP-5 clone (ATCC 2028), or neomycin (G418) resistance genes, which are obtainable from pNN414 yeast artificial chromosome vector (ATCC 37682). When such selectable markers are successfully transferred into a mammalian host cell, the transfected mammalian host cell can survive if placed under selective pressure. There are two widely used distinct categories of selective regimes. The first category is based on a cel l s metabolism and the use of a mutant cell line which lacks the ability to grow without a supplemented media. Two examples are: CHO DHFR-SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 cells (ATCC CRL-9096) and mouse LTK- cells [L-M(TK-) ATCC
CCL-2.3]. These cells lack the ability to grow without the addition of such nutrients as thymidine or hypoxanthine.
Because these cells lack certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are provided in a supplemented media. An alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements. Individual cells which were not transformed with the DHFR or TK gene will not be capable of survival in non-supplemented media.
The second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a protein conveying drug resistance and would survive the selection.
Examples of such dominant selection use the drugs neomycin, [Southern P J., et al., J. Molec. Appl. Genet. 1:327-341 (1982)], mycophenolic acid [Mulligan, R. C. et al., Science 209:1422-1427 (1980)], or hygromycin [Sugden, B. et al., Mol Cell. Biol. 5:410-413 (1985)]. The three examples given above employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug, such as, 6418, neomycin (geneticin), xgpt (mycophenolic acid), or hygromycin, respectively.
A preferred vector for eukaryotic expression is pRc/CMV. pRc/CMV is commercially available from Invitrogen Corporation, San Diego, CA. To confirm correct sequences in constructed plasmids, the ligation mixtures are used to transform E. coli K12 strain DHSa (ATCC 31446) and successful transformants are selected by antibiotic resistance where appropriate. Plasmids from the transformants are prepared, analyzed by restriction and/or SUBSTITUTE SHEET ( rule 26 ) WO 98J284I4 . PCT/LTS97/23549 sequenced by the method of. Messing, J., et al., Nucleic Acids Res. 9:309-321 (1981).
Host cells may be transformed with the expression vectors of this invention and cultured in conventional nutrient media modified as is appropriate for inducing promoters, selecting transformants or amplifying genes. The culture conditions, such as temperature, pH and the like, are those previously used with the host cell selected for expression, and will be apparent to the ordinarily skilled artisan. The techniques of transforming cells with the aforementioned vectors are well known in the art and may be found in such general references as Maniatis, T., et a1.
Molecular Cloning: A Laboratory Manual, 2"d Ed., Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989), or Current Protocols in Molecular Biology (1989) and supplements.
Preferred suitable host cells for expressing the vectors encoding the claimed proteins in higher eukaryotes include: African green monkey kidney cell line transformed by SV40 (COS-7, ATCC CRL-1651); transformed human primary embryonal kidney cell line 293 [Graham, F. L. et al., J. Gen Virol. 36:59-72 (1977); Harrison, T., et al., Virology 77:319-329 (1977); Graham, F. L. et al., Virology 86:10-21 (1978)]; baby hamster kidney cells [BHK-21(C-13), ATCC CCL-10; MacPherson, I., et a1, Virology 16:147-151 (1962)];
Chinese hamster ovary cells [CHO-DHFR- (ATCC CRL-9096)];
mouse Sertoli cells [TM4, ATCC CRL-1715; Mather, J. P., Biol. Reprod. 23:243-252 (1980)]; African green monkey kidney cells (VERO 76, ATCC CRL-1587); human cervical epitheloid carcinoma cells (HeLa, ATCC CCL-2); canine kidney cells (MDCK, ATCC CCL-34); buffalo rat liver cells (BRL 3A, ATCC CRL-1442); human diploid lung cells (WI-38, ATCC CCL-75); human hepatocellular carcinoma cells (Hep G2, ATCC HB-8065);and mouse mammary tumor cells (MMT 060562, ATCC
CCL51) .
f. Yeast Ex ression SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 In addition to prokaryotic and mammalian host cells, eukaryotic microorganisms such as yeast may also be used as host cells. Saccharomyces cerevisiae, common baker s yeast, is the most commonly-used eukaryotic microorganism for expressing heterologous proteins, although a number of other strains are commonly available. For expression in Saccharomyces, the plasmid YRp7, for example, is commonly used [ATCC-40053, Stinchcomb, D. T., et al., Nature 282:39-43 (1979); Kingsman, A. J., et al., Gene 7:141-152 (1979); Tschumper, G., et al., Gene 10:157-166 (1980)]. This plasmid already contains the trp gene which provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan [e. g., ATCC 44076 or PEP4-l; Jones, E. W., Genetics 85:23-33 (1977)].
Suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase, which is found on plasmid pAPI2BD ATCC 53231 [Patel, A. C., et al., U.S. Patent No. 4,935,350, issued June 19, 1990] or other glycolytic enzymes such as enolase, which is found on plasmid pACl (ATCC 39532), glyceraldehyde-3-phosphate dehydrogenase, which is derived from plasmid pHcGAPCl (ATCC
57090, 57091), zymomonas mobilis [Ingrain, L.O., et al., U.S.
Patent No. 5,000,000 issued March 19, 19911, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase.
Other yeast promoters, which are inducible promoters having the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, which is contained on plasmid vector pCL28XhoLHBPV [ATCC 39475; Reddy, V. B., et al., U.S.
Patent No. 4,840,896, issued June 20, 1989], glyceraldehyde 3-phosphate dehydrogenase, and enzymes responsible for SUBSTITUTE SHEET ( ruie 26 ) WO 98128414 . PCT/US97/23549 maltose and galactose utilization, such as, the GALL
promoter, which may be found on plasmid pRY121 (ATCC 37658).
Suitable vectors and promoters for use in yeast expression are further described in Hitzeman, R. A., et al., European Patent Publication No. 73,657A1, published March 9, 1983.
Yeast enhancers such as the UAS Gal from Saccharomyces cerevisiae, which is found in conjunction with the CYC1 promoter on plasmid YEpsec--hIlbeta (ATCC 67024), also are advantageously used with yeast promoters.
The following examples are presented to further illustrate the preparation of the claimed proteins. The scope of the present invention is not to be construed as merely consisting of the following examples.
8xample 1 Vector Construction A gene of SEQ ID N0:14 is assembled from a 220 base pair and a 240 base pair segment which are derived from chemically synthesized oligonucleotides:
451 TAGGATCC (SEQ ID N0:14) The base pair segment extends from the Ndel site to the Xbal site at position 220 within the coding region and is assembled from overlapping oligonucleotides which range in length from between and bases.
The base pair segment which extends from the Xbal to the BamHI
site is also assembled from 7 overlapping oligonucleotides which range in length from between 57 and 92 bases.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/U597/23549 To assemble these fragments, the respective 7 oligonucleotides are mixed in equimolar amounts, usually at concentrations of about 1-2 picomoles per microliter. Prior to assembly, all but the oligonucleotides at the 5'-ends of the segment are phosphorylated in standard kinase buffer with T4 DNA kinase using the conditions specified by the supplier of the reagents. The mixtures are heated to 95°C
and allowed to cool slowly to room temperature over a period of 1-2 hours to ensure proper annealing of the oligonucleotides. The oligonucleotides are then ligated to each other and into an appropriated cloning vector such as pUCl8 or pUC 19 using T4 DNA ligase. The buffers and conditions are those recommended by the supplier of the enzyme. The vector for the 220 base pair fragment is digested with Ndel and Xbal, whereas the vector for the 240 base pair fragment is digested with Xbal and BamHI prior to use. The ligation mixes are used to transform E. coli DH10B
cells (commercially available from Life Technologies, producer of GIBCO/BRL products, Grand Island, NY) and the transformed cells are plated on tryptone-yeast plates (TY, Difco, Detroit, MI) containing 100 ~g/ml of ampicillin, X-gal and IPTG. Colonies which grow up overnight are grown in liquid TY medium with 100 ~.g/ml of ampicillin and are used for plasmid isolation and DNA sequence analysis. Plasmids with the correct sequence are kept for the assembly of the complete gene. This is accomplished by gel-purification of the 220 base-pair and the 240 base-pair fragments and ligation of these two fragments into an expression vector such as pRBl82 from which the coding sequence for A-C-B
proinsulin is deleted and is digested with Ndel and BamHI
prior to use.
Alternatively, plasmid pET30 (Novagen, Madison, wI) can be digested with Ndel and BamHI, and the desired DNA
sequence encoding the proteins of the present invention can be inserted by procedures recognized in the art and described herein. The source of the DNA is synthetic SUBSTITUTE SHEET ( rule 26 ) WO 98/Z8414 . PCT/I1S97/23549 oligonucleotides that are assembled by art recognized methodology and described herein.
8xample 2 Expression plasmid for the Proteia of S8Q ID N0:3 (pH8767) Plasmid vector pOJ722 (40 fig, NRRL No. B-21,654) was prepared in a manner analogous to Example 1 and was digested with 20 units of EcoRV (Boehringer Mannheim, Indianapolis, IN) and 20 units of Bglll (Boehringer Mannheim) in "Boehringer Mannheim buffer B" at 37oC for two hours. The digest was purified on a 1% agarose gel, and the 2674 by and 1422 by fragments were isolated using the freeze-squeeze method.
The oligonucleotides 14255 [5'-ATC TCA CAC ACA CAG TCA
GTC TCG AGT AAA CAG AAA GTC ACA GGC TTG GAC TTC ATA CCT GGG
CTG CAC CCC GAC CTG ACA TTG TCT AAA ATG GAC CAG ACA CTG GCA
GTC TAT CAA CA-3' (SEQ ID N0:15)] and 14256 [5'-GAT CTG TTG
ATA GAC TGC CAG TGT CTG GTC CAT TTT AGA CAA TGT CAG GTC GGG
GTG CAG CCC AGG TAT GAA GTC CAA GCC TGT GAC TTT CTG TTT ACT
CGA GAC TGA CTG TGT GTG TGA GAT-3' (SEQ ID N0:16)] were kinased in the presence of 1x kinase buffer, 50 mM Tris-HCl pH 8.0, 10 mM MgCl2, 0.5 mM ATP, 1 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO BRL) at 37°C for 30 minutes.
The EcoRV/Bglll 2674 by and 1422 by fragments, prepared as described above, and the 14255 and 14256 linkers were ligated in the presence of 1x kinase buffer, 0.5 mM ATP
and 1 unit T4 DNA ligase (GIBCO BRL) at 16°C overnight. The ligation products are transformed into E. coli BL21(DE3) (Novagen) and colonies growing on TY plates supplemented with 10 ~,g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA sequencing on a PE-Applied Biosystems 370 DNA sequencer. Plasmids containing the expected -.400 by Nde1 to BamHI fragment are kept, and are designated pHS767.
Example 3 SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Expression plasmid for the Protein of SEQ ID N0:5 (pH8786) Plasmid vector pHS767 (40 ~tg) was digested with 20 units of Pmll (New England Biolab, Beverly, MA) in "New England Biolab buffer 1" at 37°C for two hours. Buffer salts were added to adjust to the mixture to the conditions of "New England Biolab buffer 3." BstXI was added(20 units, New England Biolab), and the plasmid was digested for 2 hours at 55°C. After that, the digest was treated with 20 units of alkaline phosphatase (Boehringer Mannheim) at 37°C
for 30 minutes. The digest was purified on a 1% agarose gel, and the 4170 by fragment was isolated using the freeze-squeeze method.
The oligonucleotides 13824 [5'-TGA GGC TTC CAG GAC
TCC CCC CAG ACT GTC CAA TGT CTC CAG GCC ACT GGC GTC TGG CAA
GTG GCA ACT TTT AGA GAA GGC CAG CAC-3'(SEQ ID N0:17)] and 13825 [5'-GTG CTG GCC TTC TCT AAA AGT TGC CAC TTG CCA GAC
GCC AGT GGC CTG GAG ACA TTG GAC AGT CTG GGG GGA GTC CTG GAA
GCC-3'(SEQ ID N0:18)] were kinased in the presence of lx kinase buffer, 50 mM Tris-HC1 pH 8.0, 10 mM MgCl2, 0.5 mM
ATP, 1 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO
BRL) at 37°C for 30 minutes.
The Pmll/BstXI linearized pHS767 vector, and the 13824 and 13825 linkers, were ligated in the presence of 1x kinase buffer, 0.5 mM ATP and 1 unit T4 DNA ligase (GIBCO
BRL) at 16°C overnight. The ligation products are transformed into E, coli BL21(DE3) (Novagen) and colonies growing on TY plates supplemented with 10 ~g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA sequencing on a PE-Applied Biosystems 370 DNA sequencer. Plasmids containing the expected 400 by Ndel to BamHI fragment are kept, and are designated pHS786.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/L1S97/23549 Example 4 Expression plasmid for the Protein of SEQ ID N0:2 (pHS775) Plasmid vector pHS767 (40 ug) was digested with 20 units of Xhol (Boehringer Mannheim) and 20 units of Ncol (Boehringer Mannheim) in "Boehringer Mannheim buffer H" at 37°C for two hours. The digest was treated with 20 units of alkaline phosphatase (Boehringer Mannheim) at 37°C for 30 minutes. The digest was purified on a 1% agarose gel, and the 4215 by fragment was isolated using the freeze-squeeze method.
The oligonucleotides 14405 [5'-TCG AGT AAA CAG AAG
GTC ACC GGC TTG GAC TTC ATA CCT GGG CTG GAC CCC ATA CTG ACA
TTG TCT AAA ATG GAC CAG ACA CTG GCA GTC TAT CAA CA-3' (SEQ
ID N0:19)] and 14406 [5'-GAT CTG TTG ATA GAC TGC CAG.TGT CTG
GTC CAT TTT AGA CAA TGT CAG TAT GGG GTC CAG CCC AGG TAT GAA
GTC CAA GCC GGT GAC CTT CTG TTT AC-3' (SEQ ID N0:20)] were kinased in the presence of lx kinase buffer, 50 mM Tris-HC1 pH 5.0, 10 mM MgCl2, 0.5 mM ATP, 1 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO BRL) at 37°C for 30 minutes.
The Xhol/Ncol linearized pHS767 vector and 14405 and 13406 linkers were ligated in the presence of lx Kinase buffer, 0.5 mM ATP and 1 unit T4 DNA ligase (GIBCO BRL) at 16°C overnight. The ligation products are transformed into E. coli BL21(DE3) (Novagen) and colonies growing on TY
plates supplemented with 10 ~g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA
sequencing on a PE-Applied Biosystems 370 DNA sequencer.
Plasmids containing the expected 400 by Ndel to BamHI
fragment are kept, and are designated pHS775.
Example 5 Expression plasmid for the Protein of SEQ ID N0:4 (pHS785) Plasmid vector pHS775 (40 ug) was digested with 20 units of Pmll (New England Biolab) in "New England Biolab buffer 1" at 37°C for two hours. The buffer salts were adjusted to "New England Biolab buffer 3," 20 units of BstXI
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 (New England Biolab) were added, and the plasmid was digested for 2 hours at 55°C. After that, the digest was treated with 20 units of alkaline phosphatase (Boehringer Mannheim) at 37°C for 30 minutes. The digest was purified on a 1% agarose gel, and the 4170 by fragment was isolated using the freeze-squeeze method.
The oligonucleotides 13824 [5'-TGA GGC TTC CAG GAC
TCC CCC CAG ACT GTC CAA TGT CTC CAG GCC ACT GGC GTC TGG CAA
GTG GCA ACT TTT AGA GAA GGC CAG CAC-3'(SEQ ID N0:21)] and 13825 [5'-GTG CTG GCC TTC TCT AAA AGT TGC CAC TTG CCA GAC
GCC AGT GGC CTG GAG ACA TTG GAC AGT CTG GGG GGA GTC CTG GAA
GCC-3'(SEQ ID N0:22)] were kinased in the presence of lx kinase buffer, 50 mM Tris-HC1 pH 8.0, 10 mM MgCl2, 0.5 mM
ATP, 10 mM DTT, and 5 units T4 polynucleotide kinase (GIBCO
BRL ) .
The Pmll/BstXI linearized pHS775 vector, and the 13824 and 13825 linkers, were ligated in the presence of lx kinase buffer, 0.5 mM ATP and 1 unit T4 DNA ligase (GIBCO
BRL) at 16°C overnight. The ligation products are transformed into E. coli BL21(DE3) (Novagen) and colonies growing on TY plates supplemented with 10 ~.g/mL of tetracycline are analyzed. Plasmid DNA is isolated, then subjected to DNA sequencing on a PE-Applied Biosystems 370 DNA sequencer. Plasmids containing the expected 400 by Ndel to BamHI fragment are kept, and are designated pHS785.
Example 6 Transformation of E. coli with expression plasmids The expression plasmids were transformed into E.
coli cells. Competent E. coli cells (100 ~,L) were transformed with 10 ng of the expression plasmids described earlier. The transformation mixtures were plated in TY-agar plates containing tetracycline (10 ~g/mL)) Transformants were picked and used to grow 10 L cultures.
Cells harboring the expression plasmid for the desired protein were grown overnight at 37°C in 1 L of 2xTY
SUBSTITUTE SHEET { rule 26 ) medium containing tetracycline (10 ~g/mL). After overnight growth, 100 mL of a culture were inoculated into 10 L of medium. When the culture optical density reached 0.6 O.D., expression of the particular protein was induced by adding TPTG to a concentration of 10 ~M. After 3 hours, the E.
coli cells were harvested, and granules were isolated for protein purification.
The techniques of transforming cells with the aforementioned vectors are well-known in the art [Maniatis, T., et al. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1988); Id., Current Protocols in Molecular Biology (1989), and supplements thereof]. The techniques involved in the transformation of E. coli cells used in the preferred practice of the invention as exemplified herein are well known in the art. The precise conditions under which the transformed E. coli cells are cultured is dependent on the nature of the E. coli host cell line and the expression or cloning vectors employed. For example, vectors which incorporate thermoinducible promoter-operator regions) such as the c1857 thermoinducible lambda-phage promoter-operator region, require a temperature shift from about 30°C to about 40°C. in the culture conditions so as to induce protein synthesis.
In the preferred embodiment of the invention, E.
coli K12 RV308 cells are employed as host cells, but numerous other cell lines are available, such as, but not limited to, E. coli K12 L201, L687, L693, L507, L640) L641, L695, L814 {E. coli B). The transformed host cells are then plated on appropriate media under the selective pressure of the antibiotic corresponding to the resistance gene present on the expression plasmid. The cultures are then incubated for a time and temperature appropriate to the host cell line employed.
Proteins that are expressed in high-level bacterial expression systems characteristically aggregate in SUBSTITUTE SHEET ( rule 26 ) granules or inclusion bodies which contain high levels of the overexpressed protein [Kreuger, J. K., et al., in Protein Folding, Gierasch, L. M. and King, J., eds., American Association for the Advancement of Science Publication No. 89-185, Washington, D.C., 136-142 (1990)].
Such protein aggregates must be dissolved to provide further purification and isolation of the desired protein product.
[Kreuger, J. K., et al., supra.]. A variety of techniques using strongly denaturing solutions such as guanidinium-HC1 and/or weakly denaturing solutions such as urea are used to solubilize the proteins. Removal of the denaturing agents allows the denatured protein to assume its native conformation. The particular conditions for denaturation and folding are determined by the particular~protein expression system and/or the protein in question.
Preferably, the present proteins are expressed with a leader sequence. One of ordinary skill in the art would recognize that numerous leader sequences are operable.
However, the leader sequence is preferably Met-R1-, wherein R1 is any amino acid except Pro or is absent, so that the expressed proteins may be readily converted to the claimed protein with Cathepsin C, or other suitable aminopeptidases.
Preferably, R1 is Arg, Asp, or Tyr; and most preferably, the proteins are expressed with a Met-Arg leader sequence.
Interestingly, the leader sequence does not significantly affect stability or activity of the protein. Nevertheless, the leader sequence is preferably cleaved from the protein.
Thus, the proteins of the Formula: Met-R1-SEQ ID NO:1 are useful as biological agents and, preferably, as intermediates.
SUBSTITUTE SHEET ( rule 26 ) Example 7 The Protein of the Formula: Met-Arg-SEQ ID N0:4 The protein of SEQ ID N0:4 with a Met-Arg leader sequence was expressed in E. coli as previously described.
Granules were solubilized in 8 M urea containing 5 mM
cysteine. The protein was purified by anion exchange in 8 M
urea containing 5 mM cysteine and folded by dilution into 8 M urea (containing 5 mM cysteine) and exhaustive dialysis against phosphate-buffered saline (PBS). Little to no aggregation of the protein was seen. Following final purification of the protein by size exclusion chromatography, the protein was concentrated to 3-3.5 mg/mL
in PBS.
Example 8 The Protein of the Formula SEQ ID N0:3 The protein of SEQ ID N0:3 with a Met-Arg leader sequence was expressed in E. coli, isolated and folded by techniques analogous to the previous Examples. The pH of the protein solution was reduced to pH 2.8. The Met-Arg leader sequence was cleaved by the addition of 6-10 milliunits dDAP per mg of protein (dDAP is the abbreviation for a dipeptidylaminopeptidase isolated from the slime mold, Dicteostelium descoidium, described by Atkinson, P. R., et al., U.S. Patent No. 5,565,330, issued October 15, 1996).
The conversion reaction was allowed to proceed for 2-8 hours at room temperature. The progress of the reaction was monitored by high performance reversed phase chromatography.
The reaction was terminated by adjusting the pH to 8 with NaOH. The des(Met-Arg) protein was further purified by cation exchange chromatography in the presence of 7-8 M urea and size exclusion chromatography in PBS. Following final purification of the proteins by size exclusion chromatography the proteins were concentrated to 3-3.5 mg/mL
in PBS. Virtually no aggregation of either protein was noted.
SUBSTITUTE SHEET ( rule 26 ) The purification.of the claimed proteins is by techniques known in the art and includes reversed phase chromatography, affinity chromatography, ion exchange and size exclusion chromatography.
The claimed proteins contain two cysteine residues. Thus, a disulfide bond may be formed to stabilize the protein. The present invention includes proteins of the Formula (I) wherein the Cys at position 96 is cross-linked to Cys at position 146 as well as those proteins without such disulfide bonds. Preferably, the Cys at position 96 is disulfide bonded to the Cys at position 146. In addition the proteins of the present invention may exist, particularly when formulated, as dimers, trimers, tetramers, and other multimers. Such multimers are included within the scope of the present invention.
The present invention provides a method for treating obesity. The method comprises administering to the organism an effective amount of anti-obesity protein in a dose between about 1 and 1000 ~.g/kg. A preferred dose is from about 10 to 100 ~g/kg of active protein. A typical daily dose for an adult human is from about 0.5 to 100 mg.
In practicing this method, proteins of the Formula (I) can be administered in a single daily dose or in multiple doses per day. The treatment regime may require administration over extended periods of time. The amount per administered dose or the total amount administered will be determined by the physician and depend on such factors as the nature and severity of the disease, the age and general health of the patient and the tolerance of the patient for the protein.
The instant invention further provides pharmaceutical formulations comprising proteins of the present invention. The proteins, preferably in the form of a pharmaceutically acceptable salt, can be formulated for parenteral administration for the therapeutic or prophylactic treatment of obesity. For example, proteins of the Formula (I) can be admixed with conventional SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/LTS97/23549 pharmaceutical carriers and excipients. The compositions comprising claimed proteins contain from about 0.1 to 95% by weight of the active protein, preferably in a soluble form, and more generally from about 10 to 30%. Furthermore, the present proteins may be administered alone or in combination with other anti-obesity agents or agents useful in treating diabetes. For intravenous (i.v.) use, the protein is administered in commonly-used intravenous fluids) and administered by infusion. Such fluids, for example, physiological saline, Ringer s solution or 5% dextrose solution can be used. For intramuscular preparations, a sterile formulation, preferably a suitable soluble salt form of a protein of the Formula (I), for example the hydrochloride salt, can be dissolved and administered in a pharmaceutical diluent such as pyrogen-free, distilled water, physiological saline, or 5% glucose solution. A
suitable insoluble form of the protein may be prepared and administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base, e.g., an ester of a long chain fatty acid such as ethyl oleate.
Pharmaceutically acceptable preservatives such as an alkylparaben, particularly methylparaben, ethylparaben, propylparaben, or butylparaben or chlorobutanol are preferably added to the formulation to allow multi-dose use.
Significantly, the claimed proteins having Asp or Glu at position 100 are also stable in the presence of a phenolic preservative, such as, m-cresol or phenol. Proteins with one of these replacements are highly preferred for that reason. The stability of the proteins in the presence of a phenolic preservative offers advantages in pharmaceutical delivery, including, enhanced preservative effectiveness.
The formulation is preferably prepared in the absence of salt to minimize the ionic strength of the formulation.
The, ability of the present proteins to treat obesity is demonstrated in vivo, as follows.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/US97/23549 Biological Testing Parabiotic experiments suggest that a protein is released by peripheral adipose tissue and that the protein is able to control body weight gain in normal, as well as obese mice. Therefore, the most closely related biological test is to inject the test protein by any of several routes of administration (e.g., i.v., s.c., i.p., or by minipump or cannula) and then to monitor food and water consumption, body weight gain, and also plasma chemicals or hormones, such as glucose, insulin, ACTH, corticosterone, GH, or T4 over various time periods. Suitable test animals include normal mice (ICR, etc.) and obese mice (ob/ob, Avy/a, KK-Ay, tubby, fat). The ob/ob mouse model of obesity and diabetes is generally accepted in the art as being indicative.of the obesity condition. Control for non-specific effects for these injections is accomplished by administering the vehicle without the protein in the same animal model, and monitoring the same parameters. Alternatively, the protein itself is administered to animals that are thought to lack the protein receptor, such as, db/db mice or fa/fa or cp/cp rats. Proteins demonstrating activity in these models will demonstrate similar activity in other mammals, particularly humans.
Since the target tissue is expected to be the hypothalamus where food intake and lipogenic state are regulated, an alternate study involves injecting a protein directly into the brain of test animals, for example, by i.c.v. injection via the lateral or third ventricles, or by injection directly into specific hypothalamic nuclei, such as, the arcuate, paraventricular, or perifornical nuclei.
The same parameters as above could be measured, or the release of neurotransmitters that are known to regulate feeding or metabolism could be monitored (e. g., NPY, galanin, norepinephrine, dopamine, ~3-endorphin release).
Similar studies are accomplished in vitro using isolated hypothalamic tissue in a perifusion or tissue bath SUBSTITUTE SHEET ( rule 26 ) system. In this situation, the release of neurotransmitters or electrophysiological changes is monitored.
The proteins used to carry out the experiments that are reported in Examples 9 and 10 were prepared in accordance with the teachings and examples provided herein.
Amino acid sequences of the proteins were confirmed by mass spectroscopy and/or amino acid analysis. The proteins were folded with the Cys residues cross-linked by a disulfide bond when tested.
Example 9 Testing of Proteins of SEQ ID N0:2 and 3 in Male ob/ob Mice Male ob/ob mice [Harlan, Ltd., Blackthorn, England] were housed in groups of 5 animals each and provided with Purina 5008 chow and water ad Iibitum. The mice were maintained on a reverse lighting schedule (lights off at 9:00 A.M., and on at 9:00 P.M.). The mice were weighed daily at 8:30 A.M. Their food and water consumption were determined at the same time. Treatment, as indicated below, were made following the morning weighing, just prior to lights out. The mice were treated once daily for 4 days.
Group Treatment (n=5) 1 PBS 0.2 mL/day, S.C.
2 Protein 30 ~,g/0.2 mL/day, S.C.
3 Protein 300 ~,g/0.2 mL/day, S.C.
The effects of these treatments on food consumption and cumulative body weight change are illustrated for representative proteins of the present invention in Tables 1 and 2.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/LTS97/23549 Table 1. Effect of SEQ ID.N0:2 on food consumption and cumulative body weight change in ob/ob mice Food Consumption Body Weight (g/day) Change (g) Day Group Group Group Group Group Group 1 4.4 4.6 3.4 +0.5 -0.4 -0.4 2 5.3 4.7 2.8 ~+0.4 -0.8 -1.1 3 4.8 4.2 2.2 +0.6 -0.6 -1.5 4 5.1 3.6 1.7 +0.9 -1.2 -2.9 Table 2. Effect of SEQ ID N0:3 on food consumption and cumulative body weight change in ob/ob mice Food Consumption Body Weight (g/day) Change (g) Day Group Group Group Group Group Group 1 5.0 4.5 3.1 -0.3 -0.2 -1.0 2 5.4 3.5 2.2 0.0 -0.8 -2.0 3 5.2 3.3 1.8 +0.1 -1.0 -2.8 Group 1 = Control (PBS); Group 2 - 30 ~g/day of protein;
Group 3 - 300 ~g/day of protein Example 10 Analysis of Aggregation by Dynamic Light Scatteriag (DLS) The physical properties of the present proteins are demonstrated as follows. Based on availability of material, the solution analyzed by DLS was prepared in one of two ways. Material from the last size exclusion chromatography purification step, at approximately 1.5 mg/mL
in PBS, was either concentrated to about 3 or 5 mg/mL by diafiltration, or was dialyzed against water, lyophilized, and then reconstituted to about 3 or 5 mg/mL.
In the first method the protein solution at approximately 1.5 mg/mL was concentrated to greater than about 3 or 5 mg/mL in a small volume stir cell (10 mL) using an Amicon YM10, 25-mm membrane. This was conducted under cold room conditions (about 5°C). Protein concentration was determined by W absorption, and the solution was diluted to SUBSTITUTE SHEET { rule 26 ) about 3.0 mg/mL or 5 mg/mL.with PBS (10-fold water dilution of lOx PBS without Ca/Mg, GIBCO BRL).
A second alternative method, used when lyophilization was feasible, consisted of dialyzing the solution of protein against water, using three to five exchanges of water at about 4°C. A typical dialysis membrane was the Spectra/Por-7 dialysis membrane, 2000 molecular weight cutoff membrane (Spectrum Medical Industries, Los Angeles, CA). The material was concentrated as above to 3 to 4 mg/mL and typically a 2 mg quantity was lyophilized in a 5 mL vial. For DLS analysis this plug was reconstituted with water to greater than 3.3 mg/mL or greater than 5 mg/mL. Several vials were typically pooled.
Protein concentration was determined by W at peak maximum (typically 280-nm). The protein concentration was diluted to about 3 or 5 mg/mL with a combination of water and 10x PBS (without Ca/Mg, GIBCO BRL) to yield a final PBS
concentration of lx.
The protein solution at 3.0 mg/mL or at 5 mg/mL in lx PBS was adjusted to 7.4 with HC1/NaOH and passed through a 0.1 ~m Anotopl"'-10 filter (Whatman International, Ltd.) Maidstone, England) into a DLS cell. The average cumulant particle size was measured on a Brookhaven BI-9000 DLS
instrument (Brookhaven Instruments, Holtsville, NY) with a Lexel argon-ion laser every 15 minutes using a 30 second duration at a 90° angle. A 488-nm filter with a 400 ~m pinhole was assumed. At the 37°C incubation temperature a viscosity of 0.6915 centipoise (cP) and refractive index of 1.333 was used. The light-weighted average particle size was calculated by a cumulant method using the measured auto-correlation baseline.
The estimated time required for various anti-obesity proteins to reach an averaged light-weighted particle size of 50 nm in a PBS solution at pH 7.4 and 37°C is shown in Table 3. The averaged light-weighted particle size was determined from a cumulant analysis of SUBSTITUTE SHEET ( rule 26 ) a binodal distribution composed of monomeric and higher-order aggregate populations. The time necessary to achieve an average aggregate size of 50 nm was estimated by plotting size as a function of time. The proteins were folded with the Cys residues cross-linked by a disulfide bond when tested.
Table 3. Estimated time (minutes) required for various anti-obesity proteins to reach an averaged light-weighted particle size of 50 nm in a PBS solution at pH 7.4 and 37°C.
Estimated Time to Reach nm (min) Protein 3 mg/mL 5 mg/mL
Human Ob Protein 2 u.d.
SEQ ID N0:2 871 n.d.
SEQ ID N0:3 543 n.d.
SEQ ID N0:5 835 342 SEQ ID N0:6 2,399 75g The data in Table 3 demonstrate the dramatic reduction in the propensity of the claimed proteins to aggregate.
The claimed proteins are active in at least one of the above biological tests and are anti-obesity agents. As such, they are useful in treating obesity and those disorders implicated by obesity. However, the proteins are not only useful as therapeutic agents. One skilled in the art recognizes that the proteins are useful in the production of antibodies for diagnostic use and, as proteins, are useful as feed additives for animals.
Furthermore, the proteins are useful for controlling weight for cosmetic purposes in mammals. A cosmetic purpose seeks to control the weight of a mammal to improve bodily appearance. The mammal is not necessarily obese. Such cosmetic use forms part of the present invention.
The principles, preferred embodiments and modes of operation of the present invention have been described in SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ - PCT/US97/23549 the foregoing specification. The invention which is intended to be protected is not limited to the particular forms disclosed, since they are to be regarded as illustrative rather than restrictive. Variations and changes may be made by those skilled in the art without departing from the spirit of the invention.
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 ~ PCT/US97I23549 SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Beals, John M
Hoffmann, James A
Kline, Allen D
Schevitz, Richard W
Zhang, Forming (ii) TITLE OF INVENTION: Anti-Obesity Proteins (iii) NUMBER OF SEQUENCES: 22 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Eli Lilly and Company (B) STREET: Lilly Corporate Center (C) CITY: Indianapolis 2 0 (D) STATE: Indiana (E) COUNTRY: U.S.A.
(F) ZIP: 46285 (v) COMPUTER READABLE FORM:
2 (A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 3 (vi) CURRENT APPLICATION DATA:
O
(A) APPLICATION NUMBER: WO N/A (w/ Filing) (B) FILING DATE:
(C) CLASSIFICATION:
3 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Caltrider, Steven P
(B) REGISTRATION NUMBER: 36,467 (C) REFERENCE/DOCKET NUMBER: X-11092 4 (ix) TELECOMMUNICATION INFORMATION:
O
(A) TELEPHONE: (317) 276-0757 (B) TELEFAX: (317) 277-1917 4 5 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid 5 0 (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaa at position 1 is Val or 6 0 absent ; ~~
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 2 (D) OTHER INFORMATION: /note= "Xaa at position 2 is Pro or SUBSTITUTE SHEET ( rule 26 ) absent;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 22 (D) OTHER INFORMATION: /note= "Xaaat position22 is Asn or Ser;"
(ix) FEATURE:
1 {A) NAME/KEY: Modified-site (B) LOCATION: 28 (D) OTHER INFORMATION: /note= "Xaaat position28 is Gln or absent;"
( ix) FEATURE
(A) NAME/KEY: Modified-site (B) LOCATION: 72 (D) OTHER INFORMATION: /note= "Xaaat position72 is Asn, Gln, Glu or Asp;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 73 (D) OTHER INFORMATION: /note= "Xaaat position73 is Val 2 or Met ; "
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 100 3 {D) OTHER INFORMATION: /note= "Xaaat position100 is 0 Trp, Gln, Glu, Asp, Ser, Thr, Lys, His,..."
or (ix) FEATURE:
(A) NAME/KEY: Modified-site 3 (B) LOCATION: 138 (D) OTHER INFORMATION: /note= "Xaaat position138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His,Arg;said protein or having at least one of the following:"
4 (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaaat position1 is replaced with Glu, Asp, Ser, Thr, His,or Arg;"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 2 {D) OTHER INFORMATION: /note= "Xaaat position2 is 50 replaced with Glu, Asp, Ser, Thr, His,or Arg;"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 3 55 (D) OTHER INFORMATION: /note= "Ileat position3 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site 6 (B) LOCATION: 30 (D) OTHER INFORMATION: /note= "Valat position30 is replaced with Glu, Asp, Arg, Lys, is;"
or H
(ix} FEATURE:
65 (A) NAME/KEY: Modified-site SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCTIUS97/23549 (B) LOCATION: 36 (D) OTHER INFORMATION: /note= "Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 41 (D) OTHER INFORMATION: /note= "Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 42 (D) OTHER INFORMATION: /note= "Ile at position 42 is replaced with Glu, Asp, Arg, Lys, or His;"
( i.x) FEATURE
(A) NAME/KEY: Modified-site (B) LOCATION: 43 2 (D) OTHER INFORMATION: /note= "Pro at position 0 43 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site 2 (B) LOCATION: 45 (D) OTHER INFORMATION: /note= "Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
3 (A) NAME/KEY: Modified-site (B) LOCATION: 46 (D) OTHER INFORMATION: /note= "His at position 46 is replaced with Glu, Asp) Arg) or Lys;"
3 {ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 47 (D) OTHER INFORMATION: /note= "Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 48 (D) OTHER INFORMATION: /note= "Ile at position 48 is 4 replaced with Glu, Asp, Arg, Lys, 5 or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 49 5 (D) OTHER INFORMATION: /note= "Leu at position 0 49 is replaced with Glu, Asp, Arg) Lys, or His;"
(ix) FEATURE:
(A) NAME/KEY: Modified-site 5 (B) LOCATION: 50 (D) OTHER INFORMATION: /note= "Thr at position 50 is replaced with Glu) Asp, Arg, Lys, or His;"
(ix) FEATURE:
6 (A) NAME/KEY: Modified-site (B) LOCATION: 74 (D) OTHER INFORMATION: /note= "Ile at position 74 is replaced with Gln, Gly, Asp, Arg, Thr or Ser;"
Lys, His, 65 (ix) FEATURE:
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 (A) NAME/KEY: Modified-site (B) LOCATION: 89 (D) OTHER INFORMATION: /note= "Valat position 89 is replaced with Gln, Glu, Asp, Arg, His,Thr or Ser;"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 92 (D) OTHER INFORMATION: /note= "Pheat position 92 is replaced with Gln, Glu, Asp, Arg, His,Thr or Ser;~~
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 99 (D) OTHER INFORMATION: /note= "Proat position 99 is replaced with Gln, Glu, Asp, Arg, His,Thr or Ser; or"
Lys, (ix) FEATURE:
(A) NAME/KEY: Modified-site 2 (B) LOCATION: 142 (D) OTHER INFORMATION: /note= ~~Leuat position 142 is replaced with Glu, Asp, Arg, Lys, r a pharmaceutically or His; o acceptable salt thereof."
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: l:
Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile 4 0 Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro 5 5 Gly Cys (2) INFORMATION FOR SEQ ID N0:2:
60 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/LTS97/23549 (ii) MOLECULE TYPE: protein (xi) SEQUENCE :2:
DESCRIPTION:
SEQ
ID
Val ProIle Gln Lys Val AspAspThr LysThrLeu IleLysThr Gln Ile ValThr Arg Ile Asn IleSerHis ThrGlnSer ValSerSer Asp Lys GlnLys Val Thr Gly AspPheIle ProGlyLeu AspProile Leu Leu ThrLeu Ser Lys Met GlnThrLeu AlaValTyr GlnGlnIle Asp 2 Leu ThrSer Met Pro Ser AsnValIle GlnIleSer AsnAspLeu 0 Arg Glu AsnLeu Arg Asp Leu HisValLeu AlaPheSer LysSerCys Leu His LeuPro Trp Ala Ser LeuGluThr LeuAspSer LeuGlyGly Gly Val LeuGlu Ala Ser Gly SerThrGlu ValValAla LeuSerArg Tyr Leu GlnGly Ser Leu Gln MetLeuTrp GlnLeuAsp LeuSerPro Asp 3 Gly Cys (2) INFORMATI ON FOR SEQ ID
N0:3:
4 (i) SEQUENCE
O CHARACTERISTICS:
(A)LENGTH: 146 amino acids (B)TYPE: amino acid (C)STRANDEDNESS: e singl (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:3:
ial Pro Ile Gln 5ys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Asp Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 65 70 . 75 80 Glu Asn Leu Arg Asp Leu Leu ValLeuAla PheSerLys SerCys His His Leu Pro Trp Ala Ser Gly GluThrLeu AspSerLeu GlyGly Leu Val Leu Glu Ala Ser Gly Tyr ThrGluVal ValAlaLeu SerArg Ser Leu Gln Gly Ser Leu Gln Asp LeuTrpGln LeuAspLeu SerPro Met Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:4:
2 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:4:
ial Pro IleGln5ys ValGlnAsp AspThrLys ThrLeuIle LysThr Ile Val ThrArgIle AsnAspIle SerHisThr GlnSerVal SerSer Lys Gln LysValThr GlyLeuAsp PheIlePro GlyLeuAsp ProIle Leu Thr LeuSerLys MetAspGln ThrLeuAla ValTyrGln GlnIle 4 Leu Thr SerMetPro SerArgAsn ValIleGln IleSerAsn AspLeu Glu Asn LeuArgAsp LeuLeuHis ValLeuAla PheSerLys SerCys His Leu ProAspAla SerGlyLeu GluThrLeu AspSerLeu GlyGly Val Leu GluAlaSer GlyTyrSer ThrGluVal ValAlaLeu SerArg Leu Gln GlySerLeu GinAspMet LeuTrpGln LeuAspLeu SerPro 6 Gly Cys (2) INFORMATION OR
F SEQ
ID
N0:5:
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET ( rule 26 ) WO 98/Z8414 . . PCT/US97/23549 (A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: :5:
Val Pro Ile Gln Lys Val Gln AspThrLys ThrLeuIle LysThr Asp Ile Val Thr Arg Ile Asn Asp SerHisThr GlnSerVal SerSer Ile Lys Gln Lys Val Thr Gly Leu PheIlePro GlyLeuHis ProAsp Asp Leu Thr Leu Ser Lys Met Asp ThrLeuAla ValTyrGln GlnIle Gln 2 Leu Thr Ser Met Pro Ser Arg ValIleGln IleSerAsn AspLeu 5 Asn Glu Asn Leu Arg Asp Leu Leu ValLeuAla PheSerLys SerCys His His Leu Pro Asp Ala Ser Gly GluThrLeu AspSerLeu GlyGly Leu Val Leu Glu Ala Ser Gly Tyr ThrGluVal ValAlaLeu SerArg Ser Leu Gln Gly Ser Leu Gln Asp LeuTrpGln LeuAspLeu SerPro Met 4 Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:6:
4 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:6:
Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro 2 0 Gly Cys (2) INFORMATION FOR SEQ ID N0:7:
2 5 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:7:
ial Pro Ile Gln 5ys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile 50 Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Asp Ser Lys Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro 6 5 Gly Cys SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 (2) INFORMATION
FOR
SEQ
ID N0:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: :8:
Met Arg Val Pro Ile Gln GlnAsp Asp Thr Lys Thr Leu Lys Val Ile Lys Thr Ile Val Thr Arg AspIle Ser His Thr Gln Ser Ile Asn Val Ser Ser Lys Gln Lys Val LeuAsp Phe Ile Pro Gly Leu Thr Gly Asp Pro Ile Leu Thr Leu Ser AspGln Thr Leu Ala Val Tyr Lys Met Gln 3 Gln Ile Leu Thr Ser Met ArgAsn Val Ile Gln Ile Ser 0 Pro Ser Asn Asp Leu Glu Asn Leu Arg LeuHis Val Leu Ala Phe Ser Asp Leu Lys Ser Cys His Leu Pro Trp GlyLeu Glu Thr Leu Asp Ser Ala Ser Leu Gly Gly Val Leu Glu Ala TyrSer Thr Glu Val Val Ala Ser Gly Leu Ser Arg Leu Gln Gly Ser AspMet Leu Trp Gln Leu Asp Leu Gln Leu 4 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:9:
5 (i) SEQUENCE CHARACTERISTICS:
O
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
Met Arg Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Lys Thr Ile Val Thr Arg AsnAspIleSer HisThrGln SerVal Il.e Ser Ser 35s Gln Lys Val GlyLeuAspPhe IleProGly LeuHis Thr Pro Asp Leu Thr Leu Ser MetAspGlnThr LeuAlaVal TyrGln Lys Gln Ile Leu Thr Ser Met SerArgAsnVal IleGlnIle SerAsn Pro Asp Leu Glu Asn Leu Arg LeuLeuHisVal LeuAlaPhe SerLys Asp Ser Cys His Leu Pro Trp SerGlyLeuGlu ThrLeuAsp SerLeu Ala Gly Gly Val Leu Glu Ala GlyTyrSerThr GluValVal AlaLeu Ser Ser Arg Leu Gln Gly Ser GlnAspMetLeu TrpGlnLeu AspLeu Leu 2 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID NO:10:
30 (i) SEQUENCE :
CHARACTERISTICS
(A) LENGTH: 148 aminocids a (B) TYPE: amino acid (C) STRANDEDNESS: e singl (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
NO:10:
Met Arg ValProIle GlnLysVal GlnAspAspThr LysThrLeu Ile Lys Thr IleValThr ArgIleAsn AspIleSerHis ThrGlnSer Val Ser Ser LysGlnLys ValThrGly LeuAspPheIle ProGlyLeu Asp Pro Ile LeuThrLeu SerLysMet AspGlnThrLeu AlaValTyr Gln 55 Gln Ile LeuThrSer MetProSer ArgAsnValIle GlnIleSer Asn Asp Leu GluAsnLeu ArgAspLeu LeuHisValLeu AlaPheSer Lys Ser Cys HisLeuPro AspAIaSer GlyLeuGluThr LeuAspSer Leu Gly Gly ValLeuGlu AlaSerGly TyrSerThrGlu ValValAla Leu SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97/23549 Ser Arg Leu Gln Gly Ser Asp MetLeuTrp GlnLeuAsp Leu Gln Leu Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:
SEQ ID N0:11:
Met Arg Val Pro Ile Gln Gln AspAspThr LysThrLeu Lys Val Ile Lys Thr Ile Val Thr Arg Asp IleSerHis ThrGlnSer Ile Asn Val Ser Ser Lys Gln Lys Val Leu AspPheIle ProGlyLeu Thr Gly His Pro Asp Leu Thr Leu Ser Asp GlnThrLeu AlaValTyr Lys Met Gln 3 Gln Ile Leu Thr Ser Met Arg AsnValIle GlnIleSer 5 Pro Ser Asn Asp Leu Glu Asn Leu Arg Leu HisValLeu AlaPheSer Asp Leu Lys Ser Cys His Leu Pro Asp Gly LeuGluThr LeuAspSer Ala Ser Leu Gly Gly Val Leu Glu Ala Tyr SerThrGlu ValValAla Ser Gly Leu Ser Arg Leu Gln Gly Ser Asp MetLeuTrp GlnLeuAsp Leu Gln Leu 5 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:12:
5 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid {C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . . PCT/C1S97/23549 (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:12:
Met Arg Val Pro Ile Gln Val GlnAspAsp ThrLysThr LeuIle Lys Lys Thr Ile Val Thr Arg Asn AspIleSer HisThrGln SerVal Ile Ser Ser Lys Gln Lys Val Gly LeuAspPhe IleProGly LeuHis Thr Pro Ile Leu Thr Leu Ser Met AspGlnThr LeuAlaVal TyrGln Lys Gln Ile Leu Thr Ser Met Ser ArgAsnVal IleGlnIle SerAsn Pro Asp Leu Glu Asn Leu Arg Leu LeuHisVal LeuAlaAsp SerLys Asp Ser Cys His Leu Pro Trp Ser GlyLeuGlu ThrLeuAsp SerLeu Ala Gly Gly Val Leu Glu Ala Gly TyrSerThr GluValVal AlaLeu Ser Ser Arg Leu Gln Gly Ser Gln AspMetLeu TrpGlnLeu AspLeu Leu 3 Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:13:
3 (i) SEQUENCE
5 CHARACTERISTICS:
(A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: e singl (D) TOPOLOGY: linear (ii) MOLECULE
TYPE:
protein (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:13:
Met ArgVal ProIleGln LysValGln AspAspThr LysThrLeu Ile Lys ThrIle ValThrArg IleAsnAsp IleSerHis ThrGlnSer Val Ser SerLys GlnLysVal ThrGlyLeu AspPheIle ProGlyLeu His Pro IleLeu ThrLeuSer LysMetAsp GlnThrLeu AlaValTyr Gln 60 Gln IleLeu ThrSerMet ProSerArg AsnValIle GlnIleSer Asn Asp LeuGlu AsnLeuArg AspLeuLeu HisValLeu AlaAspSer Lys SUBSTITUTE SHEET ( rule 26 ) Ser Cys His Leu Pro Asp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro Gly Cys (2) INFORMATION
FOR
SEQ
ID N0:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 458 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE
DESCRIPTION:
SEQ ID
N0:14:
CATATGAGGGTACCTATCCA AAAAGTACAA GATGACACCAAAACACTGAT AAAGACAATA
GTCACAAGGATAAATGATAT CTCACACACA CAGTCAGTCTCATCTAAACA GAAAGTCACA
GGCTTGGACTTCATACCTGG GCTGCACCCC ATACTGACATTGTCTAAAAT GGACCAGACA
1ao CTGGCAGTCTATCAACAGAT CTTAACAAGT ATGCCTTCTAGAAACGTGAT ACAAATATCT
TTGCCATGGGCCAGTGGCCT GGAGACATTG GACAGTCTGGGGGGAGTCCT GGAAGCCTCA
GGCTATTCTACAGAGGTGGT GGCCCTGAGC AGGCTGCAGGGGTCTCTGCA AGACATGCTG
TGGCAGCTGGACCTGAGCCC CGGGTGCTAA TAGGATCC
(2) INFORMATION
FOR SEQ
ID N0:15:
(i) SEQUENCE
CHARACTERISTICS:
5 (A) LENGTH: 119 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear 6 (ii) MOLECULE
O TYPE:
cDNA
65 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:15:
SUBSTITUTE SHEET ( rule 26 ) ATCTCACACA CACAGTCAGT CTCGAGTAAA CAGAAAGTCA CAGGCTTGGA CTTCATACCT
(2) INFORMATION FOR SEQ ID N0:16:
1 0 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16:
GATCTGTTGA TAGACTGCCA GTGTCTGGTC CATTTTAGAC AATGTCAGGT CGGGGTGCAG
60 .
CCCAGGTATG AAGTCCAAGC CTGTGACTTT CTGTTTACTC GAGACTGACT GTGTGTGTGA
GAT
(2) INFORMATION FOR SEQ ID N0:17:
(i) 5EQUENCE CHARACTERISTICS:
3 5 {A) LENGTH: 87 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear 4 O (ii) MOLECULE TYPE: cDNA
4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:17:
TGAGGCTTCC AGGACTCCCC CCAGACTGTC CAATGTCTCC AGGCCACTGG CGTCTGGCAA
(2) INFORMATION FOR SEQ ID N0:18:
5 5 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 84 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
SUBSTITUTE SHEET ( rule 26 ) WO 98/28414 . PCT/US97J23549 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:18:
GTGCTGGCCT TCTCTAAAAG TTGCCACTTG CCAGACGCCA GTGGCCTGGA GACATTGGAC
AGTCTGGGGG GAGTCCTGGA AGCC
(2} INFORMATION FOR SEQ ID N0:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19:
TCGAGTAAAC AGAAGGTCAC CGGCTTGGAC TTCATACCTG GGCTGGACCC CATACTGACA
TTGTCTAAAA TGGACCAGAC ACTGGCAGTC TATCAACA
3 0 (2) INFORMATION FOR SEQ ID N0:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 base pairs (B) TYPE: nucleic acid 3 5 (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20:
CCCAGGTATG AAGTCCAAGC CGGTGACCTT CTGTTTAC
(2) INFORMATION FOR SEQ ID N0:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21:
SUBSTITUTE SHEET ( ruie 26 ) WO 98/28414 . PCT/US97/23549 TGAGGCTTCC AGGACTCCCC CCAGACTGTC CAATGTCTCC AGGCCACTGG CGTCTGGCAA
GTGGCAACTT TTAGAGAAGG CCAGCAC
5 8~
(2) INFORMATION FOR SEQ ID N0:22:
(i) SEQUENCE CHARACTERISTICS:
10 (A) LENGTH: 84 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear 15 {ii) MOLECULE TYPE: cDNA
2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:22:
GTGCTGGCCT TCTCTAAAAG TTGCCACTTG CCAGACGCCA GTGGCCTGGA GACATTGGAC
SUBSTITUTE SHEET ( rule 26 )
Claims (15)
1. A protein of the Formula (I):
Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro Gly Cys (SEQ ID NO:1) (I) wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn or Ser;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Gln, Glu or Asp;
Xaa at position 73 is Val or Met;
Xaa at position 100 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
Xaa at position 1 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Ile at position 3 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 30 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;
Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 42 is replaced with Glu, Asp, Arg, Lys, or His;
Pro at position 43 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 49 is replaced with Glu, Asp, Arg, Lys, or His;
Thr at position 50 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 74 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Val at position 89 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Phe at position 92 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Pro at position 99 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser; or Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
Xaa Xaa Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr Ile Val Thr Arg Ile Xaa Asp Ile Ser His Thr Xaa Ser Val Ser Ser Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile Leu Thr Ser Met Pro Ser Arg Xaa Xaa Ile Gln Ile Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Xaa Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gln Gly Ser Leu Gln Asp Met Leu Xaa Gln Leu Asp Leu Ser Pro Gly Cys (SEQ ID NO:1) (I) wherein:
Xaa at position 1 is Val or absent;
Xaa at position 2 is Pro or absent;
Xaa at position 22 is Asn or Ser;
Xaa at position 28 is Gln or absent;
Xaa at position 72 is Asn, Gln, Glu or Asp;
Xaa at position 73 is Val or Met;
Xaa at position 100 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 138 is Trp, Gln, Glu, Asp, Ser, Thr, Lys, His, or Arg;
said protein having at least one of the following substitutions:
Xaa at position 1 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Xaa at position 2 is replaced with Glu, Asp, Ser, Thr, Lys, His, or Arg;
Ile at position 3 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 30 is replaced with Glu, Asp, Arg, Lys, or His;
Val at position 36 is replaced with Glu, Asp, Arg, Lys, or His;
Phe at position 41 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 42 is replaced with Glu, Asp, Arg, Lys, or His;
Pro at position 43 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 45 is replaced with Glu, Asp, Arg, Lys, or His;
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Pro at position 47 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His;
Leu at position 49 is replaced with Glu, Asp, Arg, Lys, or His;
Thr at position 50 is replaced with Glu, Asp, Arg, Lys, or His;
Ile at position 74 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Val at position 89 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Phe at position 92 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser;
Pro at position 99 is replaced with Gln, Glu, Asp, Arg, Lys, His, Thr or Ser; or Leu at position 142 is replaced with Glu, Asp, Arg, Lys, or His;
or a pharmaceutically acceptable salt thereof.
2. The protein of Claim 1, wherein the Cys at position 96 is disulfide bonded to Cys at position 146.
3. The protein of Claim 2, wherein Xaa at position 1 is Val;
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 in Asn or Asp;
Xaa at position 100 is Trp, Glu, or Asp;
Xaa at position 138 is Trp, Glu, or Asp; said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His.
Xaa at position 2 is Pro;
Xaa at position 28 is Gln or absent;
Xaa at position 72 in Asn or Asp;
Xaa at position 100 is Trp, Glu, or Asp;
Xaa at position 138 is Trp, Glu, or Asp; said protein having at least one of the following substitutions:
His at position 46 is replaced with Glu, Asp, Arg, or Lys;
Ile at position 48 is replaced with Glu, Asp, Arg, Lys, or His; or Phe at position 92 is replaced with Glu, Asp, Arg, Lys, or His.
4. The protein of Claim 3, wherein Xaa at position 100 is Trp or Asp, and Xaa at position 138 is Trp.
5. The protein of Claim 2 which has a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID
NO:7.
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID
NO:7.
6. A protein consisting of the protein as claimed in Claim 1, and a leader sequence, wherein, the leader sequence is bonded to the N-terminal of the protein, or a pharmaceutically acceptable salt thereof.
7. The protein of Claim 6, wherein the leader sequence is Met-R1, wherein R1 is absent or is any amino acid except Pro.
8. The protein. of Claim 7, wherein the leader sequence is Met-Arg, Met-Asp, or Met-Tyr.
9. The protein of Claim 8 which has a sequence selected from the group consisting of SEQ ID NO:8, SEQ ID
NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, and SEQ ID
NO:13.
NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, and SEQ ID
NO:13.
10. A pharmaceutical formulation, comprising as an active agent, a protein of Claim 1, or a pharmaceutically acceptable salt thereof, together with one or more pharmaceutically acceptable diluents, carriers or excipients therefor.
11. A DNA polynucleotide compound comprising DNA
that encodes a protein of Claim 1.
that encodes a protein of Claim 1.
12. A DNA polynucleotide compound comprising DNA
that encodes a protein of Claim 6.
that encodes a protein of Claim 6.
13. A process for preparing a protein as claimed in Claim 6, comprising:
(a) transforming a host cell with DNA that encodes the protein;
(b) culturing the transformed host cell such that the protein is expressed; and (c) recovering the protein.
(a) transforming a host cell with DNA that encodes the protein;
(b) culturing the transformed host cell such that the protein is expressed; and (c) recovering the protein.
14. A process for preparing a protein of Claim 1, comprising:
(a) transforming a host cell with DNA that encodes a protein comprised of a protein of Claim 1 and a leader sequence;
(b) culturing the transformed host cell such that the protein encoded for in step (a) is expressed;
(c) enzymatically removing the leader sequence of the expressed protein to produce a protein of Claim 1; and (d) recovering the protein of Claim 1.
(a) transforming a host cell with DNA that encodes a protein comprised of a protein of Claim 1 and a leader sequence;
(b) culturing the transformed host cell such that the protein encoded for in step (a) is expressed;
(c) enzymatically removing the leader sequence of the expressed protein to produce a protein of Claim 1; and (d) recovering the protein of Claim 1.
15. A method of treating obesity or a condition associated with obesity, which comprises administering to a mammal in need thereof a protein of the Formula (I).
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US3354196P | 1996-12-20 | 1996-12-20 | |
US60/033,541 | 1996-12-20 | ||
PCT/US1997/023549 WO1998028414A1 (en) | 1996-12-20 | 1997-12-19 | Anti-obesity proteins |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2274799A1 true CA2274799A1 (en) | 1998-07-02 |
Family
ID=21871008
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002274799A Abandoned CA2274799A1 (en) | 1996-12-20 | 1997-12-19 | Anti-obesity proteins |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1007648A4 (en) |
JP (1) | JP2001507228A (en) |
AU (1) | AU5802898A (en) |
CA (1) | CA2274799A1 (en) |
WO (1) | WO1998028414A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6927214B1 (en) | 1999-01-15 | 2005-08-09 | Novo Nordisk A/S | Non-peptide GLP-1 agonists |
DE19957904A1 (en) * | 1999-12-01 | 2001-06-07 | Merck Patent Gmbh | Insect venom allergens with reduced IgE reactivity and process for their preparation |
MXPA03006989A (en) * | 2001-02-06 | 2003-11-18 | Merck Patent Gmbh | Modified leptin with reduced immunogenicity. |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6309853B1 (en) * | 1994-08-17 | 2001-10-30 | The Rockfeller University | Modulators of body weight, corresponding nucleic acids and proteins, and diagnostic and therapeutic uses thereof |
US5827734A (en) * | 1995-01-20 | 1998-10-27 | University Of Washington | Materials and methods for determining ob protein in a biological sample |
EP0725079A1 (en) * | 1995-01-31 | 1996-08-07 | Eli Lilly And Company | Anti-obesity proteins |
US5594101A (en) * | 1995-03-03 | 1997-01-14 | Eli Lilly And Company | Anti-obesity proteins |
WO1996031526A1 (en) * | 1995-04-06 | 1996-10-10 | Amylin Pharmaceuticals, Inc. | Anti-obesity agents |
US5614379A (en) * | 1995-04-26 | 1997-03-25 | Eli Lilly And Company | Process for preparing anti-obesity protein |
GB9509164D0 (en) * | 1995-05-05 | 1995-06-28 | Smithkline Beecham Plc | Novel compounds |
ES2339846T3 (en) * | 1995-11-22 | 2010-05-26 | Amgen Inc. | METHOD FOR INCREASING THE MAGRO FABRIC MASS USING PROTEIN COMPOSITIONS OB. |
AU5901698A (en) * | 1996-12-20 | 1998-07-17 | Eli Lilly And Company | Protein crystals, their preparation and use |
-
1997
- 1997-12-19 EP EP97954183A patent/EP1007648A4/en not_active Withdrawn
- 1997-12-19 JP JP52899598A patent/JP2001507228A/en active Pending
- 1997-12-19 CA CA002274799A patent/CA2274799A1/en not_active Abandoned
- 1997-12-19 WO PCT/US1997/023549 patent/WO1998028414A1/en not_active Application Discontinuation
- 1997-12-19 AU AU58028/98A patent/AU5802898A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
JP2001507228A (en) | 2001-06-05 |
WO1998028414A1 (en) | 1998-07-02 |
EP1007648A1 (en) | 2000-06-14 |
AU5802898A (en) | 1998-07-17 |
EP1007648A4 (en) | 2003-01-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5554727A (en) | Anti-obesity proteins | |
US5552522A (en) | Anti-obesity proteins | |
US5559208A (en) | Anti-obesity proteins | |
US5605886A (en) | Anti-obesity proteins | |
US5552524A (en) | Anti-obesity proteins | |
US5552523A (en) | Anti-obesity proteins | |
US5532336A (en) | Anti-obesity proteins | |
US5691309A (en) | Anti-obesity proteins | |
US5719266A (en) | Anti-obesity proteins | |
EP0725078A1 (en) | Anti-obesity proteins | |
US5594104A (en) | Anti-obesity proteins | |
US5580954A (en) | Anti-obesity proteins | |
US5567678A (en) | Anti-obesity proteins | |
US5569744A (en) | Anti-obesity proteins | |
US5563243A (en) | Anti-obesity proteins | |
US5525705A (en) | Anti-obesity proteins | |
US5569743A (en) | Anti-obesity proteins | |
US5574133A (en) | Anti-obesity proteins | |
US5563245A (en) | Anti-obesity proteins | |
US5563244A (en) | Anti-obesity proteins | |
US5567803A (en) | Anti-obesity proteins | |
EP0743321A2 (en) | Obesity gene product | |
EP0849276A1 (en) | Anti-obesity proteins | |
US5922678A (en) | Methods for treating diabetes | |
CA2274799A1 (en) | Anti-obesity proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |