EP1756740A1 - Crystal structure of dipeptidyl peptidase iv (dpp-iv) and uses thereof - Google Patents
Crystal structure of dipeptidyl peptidase iv (dpp-iv) and uses thereofInfo
- Publication number
- EP1756740A1 EP1756740A1 EP05744309A EP05744309A EP1756740A1 EP 1756740 A1 EP1756740 A1 EP 1756740A1 EP 05744309 A EP05744309 A EP 05744309A EP 05744309 A EP05744309 A EP 05744309A EP 1756740 A1 EP1756740 A1 EP 1756740A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- dpp
- computer
- amino acid
- coordinates
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 101000930822 Giardia intestinalis Dipeptidyl-peptidase 4 Proteins 0.000 title claims abstract description 135
- 102000016622 Dipeptidyl Peptidase 4 Human genes 0.000 title claims abstract description 133
- 239000013078 crystal Substances 0.000 title claims abstract description 58
- 239000003446 ligand Substances 0.000 claims abstract description 72
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 55
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 37
- 238000012545 processing Methods 0.000 claims abstract description 21
- 238000000034 method Methods 0.000 claims description 67
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 63
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 62
- 150000001413 amino acids Chemical class 0.000 claims description 59
- 229920001184 polypeptide Polymers 0.000 claims description 57
- 230000027455 binding Effects 0.000 claims description 40
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 36
- 239000000203 mixture Substances 0.000 claims description 26
- 238000013500 data storage Methods 0.000 claims description 21
- 230000000694 effects Effects 0.000 claims description 17
- 150000001875 compounds Chemical class 0.000 claims description 15
- 125000000539 amino acid group Chemical group 0.000 claims description 12
- 238000002425 crystallisation Methods 0.000 claims description 12
- 230000008025 crystallization Effects 0.000 claims description 12
- 239000000243 solution Substances 0.000 claims description 10
- 230000003936 working memory Effects 0.000 claims description 10
- 238000013461 design Methods 0.000 claims description 9
- 238000009510 drug design Methods 0.000 claims description 8
- 238000005094 computer simulation Methods 0.000 claims description 6
- 239000011232 storage material Substances 0.000 claims description 6
- 239000000758 substrate Substances 0.000 claims description 6
- 239000007788 liquid Substances 0.000 claims description 5
- 238000000502 dialysis Methods 0.000 claims description 4
- 238000009792 diffusion process Methods 0.000 claims description 4
- 239000007864 aqueous solution Substances 0.000 claims 2
- 238000002156 mixing Methods 0.000 claims 1
- 239000003112 inhibitor Substances 0.000 abstract description 18
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 14
- 201000010099 disease Diseases 0.000 abstract description 13
- 235000001014 amino acid Nutrition 0.000 description 52
- 229940024606 amino acid Drugs 0.000 description 49
- 210000004027 cell Anatomy 0.000 description 40
- 235000018102 proteins Nutrition 0.000 description 33
- 239000013598 vector Substances 0.000 description 33
- 102000004190 Enzymes Human genes 0.000 description 22
- 108090000790 Enzymes Proteins 0.000 description 22
- 229940088598 enzyme Drugs 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 19
- 229940079593 drug Drugs 0.000 description 18
- 239000003814 drug Substances 0.000 description 18
- 239000000126 substance Substances 0.000 description 18
- 239000013604 expression vector Substances 0.000 description 16
- 102000039446 nucleic acids Human genes 0.000 description 15
- 108020004707 nucleic acids Proteins 0.000 description 15
- 150000007523 nucleic acids Chemical class 0.000 description 15
- 230000014509 gene expression Effects 0.000 description 14
- 241000700605 Viruses Species 0.000 description 12
- 230000003197 catalytic effect Effects 0.000 description 11
- 238000003860 storage Methods 0.000 description 11
- 208000008589 Obesity Diseases 0.000 description 10
- 235000020824 obesity Nutrition 0.000 description 10
- 239000008194 pharmaceutical composition Substances 0.000 description 10
- 239000002609 medium Substances 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 108020003175 receptors Proteins 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 8
- 238000013459 approach Methods 0.000 description 8
- 230000004071 biological effect Effects 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000000126 in silico method Methods 0.000 description 8
- 108091033319 polynucleotide Proteins 0.000 description 8
- 102000040430 polynucleotide Human genes 0.000 description 8
- 239000002157 polynucleotide Substances 0.000 description 8
- 208000011580 syndromic disease Diseases 0.000 description 8
- 238000001890 transfection Methods 0.000 description 8
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 8
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 7
- 239000002773 nucleotide Substances 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 6
- 241000238631 Hexapoda Species 0.000 description 6
- 241000282414 Homo sapiens Species 0.000 description 6
- 208000001132 Osteoporosis Diseases 0.000 description 6
- 108020004511 Recombinant DNA Proteins 0.000 description 6
- 239000000556 agonist Substances 0.000 description 6
- 239000005557 antagonist Substances 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 6
- 238000004590 computer program Methods 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 241000701447 unidentified baculovirus Species 0.000 description 6
- 208000004476 Acute Coronary Syndrome Diseases 0.000 description 5
- 101000908391 Homo sapiens Dipeptidyl peptidase 4 Proteins 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 5
- 238000002441 X-ray diffraction Methods 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 238000000205 computational method Methods 0.000 description 5
- 238000007876 drug discovery Methods 0.000 description 5
- 230000000144 pharmacologic effect Effects 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 208000010444 Acidosis Diseases 0.000 description 4
- 208000007848 Alcoholism Diseases 0.000 description 4
- 208000019901 Anxiety disease Diseases 0.000 description 4
- 201000001320 Atherosclerosis Diseases 0.000 description 4
- 208000010392 Bone Fractures Diseases 0.000 description 4
- 206010065687 Bone loss Diseases 0.000 description 4
- 208000002177 Cataract Diseases 0.000 description 4
- 208000000094 Chronic Pain Diseases 0.000 description 4
- 208000007342 Diabetic Nephropathies Diseases 0.000 description 4
- 208000032131 Diabetic Neuropathies Diseases 0.000 description 4
- 208000032781 Diabetic cardiomyopathy Diseases 0.000 description 4
- 206010012689 Diabetic retinopathy Diseases 0.000 description 4
- 206010061818 Disease progression Diseases 0.000 description 4
- 208000030814 Eating disease Diseases 0.000 description 4
- 208000019454 Feeding and Eating disease Diseases 0.000 description 4
- 208000036119 Frailty Diseases 0.000 description 4
- 208000002705 Glucose Intolerance Diseases 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 206010018473 Glycosuria Diseases 0.000 description 4
- 206010056438 Growth hormone deficiency Diseases 0.000 description 4
- 208000031226 Hyperlipidaemia Diseases 0.000 description 4
- 206010020772 Hypertension Diseases 0.000 description 4
- 208000031773 Insulin resistance syndrome Diseases 0.000 description 4
- 208000001145 Metabolic Syndrome Diseases 0.000 description 4
- 206010027417 Metabolic acidosis Diseases 0.000 description 4
- 208000029725 Metabolic bone disease Diseases 0.000 description 4
- 108090000189 Neuropeptides Proteins 0.000 description 4
- 206010049088 Osteopenia Diseases 0.000 description 4
- 208000002193 Pain Diseases 0.000 description 4
- 208000020221 Short stature Diseases 0.000 description 4
- 206010049416 Short-bowel syndrome Diseases 0.000 description 4
- 208000013738 Sleep Initiation and Maintenance disease Diseases 0.000 description 4
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 4
- 208000025865 Ulcer Diseases 0.000 description 4
- 201000000690 abdominal obesity-metabolic syndrome Diseases 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 230000036506 anxiety Effects 0.000 description 4
- 206010003246 arthritis Diseases 0.000 description 4
- 206010003549 asthenia Diseases 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 206010012601 diabetes mellitus Diseases 0.000 description 4
- 208000033679 diabetic kidney disease Diseases 0.000 description 4
- 230000006806 disease prevention Effects 0.000 description 4
- 235000014632 disordered eating Nutrition 0.000 description 4
- 206010015037 epilepsy Diseases 0.000 description 4
- 206010016256 fatigue Diseases 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 230000035780 glucosuria Effects 0.000 description 4
- 102000045598 human DPP4 Human genes 0.000 description 4
- 201000001421 hyperglycemia Diseases 0.000 description 4
- 208000000509 infertility Diseases 0.000 description 4
- 230000036512 infertility Effects 0.000 description 4
- 231100000535 infertility Toxicity 0.000 description 4
- 230000002757 inflammatory effect Effects 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 206010022437 insomnia Diseases 0.000 description 4
- 230000008991 intestinal motility Effects 0.000 description 4
- 208000002551 irritable bowel syndrome Diseases 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 230000004060 metabolic process Effects 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 201000010065 polycystic ovary syndrome Diseases 0.000 description 4
- 230000003389 potentiating effect Effects 0.000 description 4
- 201000009104 prediabetes syndrome Diseases 0.000 description 4
- 150000003384 small molecules Chemical class 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 231100000397 ulcer Toxicity 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 101710198884 GATA-type zinc finger protein 1 Proteins 0.000 description 3
- 108010004460 Gastric Inhibitory Polypeptide Proteins 0.000 description 3
- 102100039994 Gastric inhibitory polypeptide Human genes 0.000 description 3
- 102400000322 Glucagon-like peptide 1 Human genes 0.000 description 3
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 description 3
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 238000005481 NMR spectroscopy Methods 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 238000004617 QSAR study Methods 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 241000700618 Vaccinia virus Species 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 150000005829 chemical entities Chemical class 0.000 description 3
- 238000007385 chemical modification Methods 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 238000013480 data collection Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000002050 diffraction method Methods 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- 229920002521 macromolecule Polymers 0.000 description 3
- 229910021645 metal ion Inorganic materials 0.000 description 3
- 238000000302 molecular modelling Methods 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000007899 nucleic acid hybridization Methods 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Chemical group OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- -1 such as Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 241001515965 unidentified phage Species 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 2
- 102000055025 Adenosine deaminases Human genes 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 101800001415 Bri23 peptide Proteins 0.000 description 2
- 102400000107 C-terminal peptide Human genes 0.000 description 2
- 101800000655 C-terminal peptide Proteins 0.000 description 2
- 108010012236 Chemokines Proteins 0.000 description 2
- 102000019034 Chemokines Human genes 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 241001452028 Escherichia coli DH1 Species 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 239000007993 MOPS buffer Substances 0.000 description 2
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 2
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 2
- 102000003797 Neuropeptides Human genes 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 229920001030 Polyethylene Glycol 4000 Polymers 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- 230000006044 T cell activation Effects 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 230000003042 antagnostic effect Effects 0.000 description 2
- 235000021229 appetite regulation Nutrition 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- OWMVSZAMULFTJU-UHFFFAOYSA-N bis-tris Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000003833 cell viability Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000005352 clarification Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000005138 cryopreservation Methods 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 239000012595 freezing medium Substances 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000015654 memory Effects 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000000329 molecular dynamics simulation Methods 0.000 description 2
- 238000000324 molecular mechanic Methods 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 239000000813 peptide hormone Substances 0.000 description 2
- 239000000816 peptidomimetic Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 229940002612 prodrug Drugs 0.000 description 2
- 239000000651 prodrug Substances 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000000153 supplemental effect Effects 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 238000002424 x-ray crystallography Methods 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- IHBAVXVTGLANPI-QMMMGPOBSA-N 2-amino-3-methyl-1-pyrrolidin-1-yl-butan-1-one Chemical compound CC(C)[C@H](N)C(=O)N1CCCC1 IHBAVXVTGLANPI-QMMMGPOBSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 101710129138 ATP synthase subunit 9, mitochondrial Proteins 0.000 description 1
- 101710168506 ATP synthase subunit C, plastid Proteins 0.000 description 1
- 101710114069 ATP synthase subunit c Proteins 0.000 description 1
- 101710197943 ATP synthase subunit c, chloroplastic Proteins 0.000 description 1
- 101710187091 ATP synthase subunit c, sodium ion specific Proteins 0.000 description 1
- 102000006410 Apoproteins Human genes 0.000 description 1
- 108010083590 Apoproteins Proteins 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 102100025012 Dipeptidyl peptidase 4 Human genes 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 208000016169 Fish-eye disease Diseases 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 108010037255 Member 7 Tumor Necrosis Factor Receptor Superfamily Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108010067902 Peptide Library Proteins 0.000 description 1
- 101710182846 Polyhedrin Proteins 0.000 description 1
- 230000010799 Receptor Interactions Effects 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 229940124639 Selective inhibitor Drugs 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 229910052784 alkaline earth metal Inorganic materials 0.000 description 1
- 150000001342 alkaline earth metals Chemical class 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000003178 anti-diabetic effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000003472 antidiabetic agent Substances 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 239000013602 bacteriophage vector Substances 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 239000012867 bioactive agent Substances 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000008512 biological response Effects 0.000 description 1
- 230000036983 biotransformation Effects 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 150000001860 citric acid derivatives Chemical class 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000002288 cocrystallisation Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000008602 contraction Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 210000004246 corpus luteum Anatomy 0.000 description 1
- 239000002577 cryoprotective agent Substances 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229910001873 dinitrogen Inorganic materials 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000009881 electrostatic interaction Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007247 enzymatic mechanism Effects 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000014101 glucose homeostasis Effects 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 150000004820 halides Chemical class 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 239000004030 hiv protease inhibitor Substances 0.000 description 1
- 239000012510 hollow fiber Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 239000000859 incretin Substances 0.000 description 1
- MGXWVYUBJRZYPE-YUGYIWNOSA-N incretin Chemical class C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)[C@@H](C)O)[C@@H](C)CC)C1=CC=C(O)C=C1 MGXWVYUBJRZYPE-YUGYIWNOSA-N 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 102000019758 lipid binding proteins Human genes 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- VDXZNPDIRNWWCW-JFTDCZMZSA-N melittin Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(N)=O)CC1=CNC2=CC=CC=C12 VDXZNPDIRNWWCW-JFTDCZMZSA-N 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 230000004001 molecular interaction Effects 0.000 description 1
- 208000031225 myocardial ischemia Diseases 0.000 description 1
- 239000006225 natural substrate Substances 0.000 description 1
- 239000002858 neurotransmitter agent Substances 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000011164 ossification Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- GCYXWQUSHADNBF-AAEALURTSA-N preproglucagon 78-108 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 GCYXWQUSHADNBF-AAEALURTSA-N 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 210000000512 proximal kidney tubule Anatomy 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 238000007670 refining Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000036186 satiety Effects 0.000 description 1
- 235000019627 satiety Nutrition 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 150000003467 sulfuric acid derivatives Chemical class 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 229910052723 transition metal Inorganic materials 0.000 description 1
- 150000003624 transition metals Chemical class 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 1
- 230000036967 uncompetitive effect Effects 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2299/00—Coordinates from 3D structures of peptides, e.g. proteins or enzymes
Definitions
- the present invention relates to crystalline compositions of mammalian dipeptidyl peptidase IV (DPP-IV), methods of preparing said compositions, methods of determining the three-dimensional (3-D) X- ray atomic coordinates of said composition, methods of identifying ligands of DPP-IV using structure based drug design, the use of 3-D crystal structures to design, modify and assess the activity of potential inhibitors, and to the use of such inhibitors for the treatment of, for example, diabetes, glucose tolerance obesity, appetite regulation, hpidemia, osteoporosis neuropeptide metabolism and T-cell activation, among others
- DPP-IV se ⁇ ne peptidase dipeptidyl peptidase IV
- DPP-IV is a multifunctional type II cell surface glycoprotein, which is widely expressed in a variety of cell types, particularly on differential epithelial cells of the intestine, liver, prostate tissue, corpus luteum, and kidney proximal tubules (Thoma et al , Structure, 11 , 947-959, 947 (2003) citing Hartel et al , Histochemistry 89, 151 -161 (1988), cCaughan et al , Hepatology 11 , 534-544 (1990) as well as leukocytes subsets (Thoma et al , (2003) citing Gorrell et al , Cell Immunol 134, 205-215 (1991 )) DPP-IV has roles in many biological processes including its ability to modulate the biological activity of several peptide hormones, chemokines and neuropeptides by specifically cleaving
- DPPs such as DPP-IV as a single molecule of an asymmetric unit
- X-ray or NMR studies or from homology models
- analyzing the structures using computational methods facilitates such discovery efforts.
- SUMMARY OF THE INVENTION The present invention provides crystalline compositions of DPP-IV, and specifically of DPP-IV, having one molecule per asymmetric unit.
- the invention further provides methods of preparing said compositions, methods of determining the 3-D X-ray atomic coordinates of said crystalline compositions, methods of using the atomic coordinates in conjunction with computational methods to identify binding site(s), methods to elucidate the 3-D structure of homologues of DPP-IV, and methods to identify ligands which interact with the binding site(s) to agonize or antagonize the biological activity of DPP-IV, methods for identifying inhibitors of DPP-IV, pharmaceutical compositions of inhibitors, and methods of treatment of Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute
- the invention provides crystalline compositions of the extracellular domain of DPP-IV (residues 31 -766 of SEQ ID NO:1 ), whereby the crystal structure is derived from 5 mammal, preferably human.
- One aspect of the present invention provides methods for crystallizing a DPP-IV polypeptide.
- the methods for crystallizing the DPP-IV polypeptide comprising an amino acid sequence spanning the amino acids 31 to 766 listed in SEQ ID NO:1 comprise the steps of: (a) preparing solutions of the polypeptide and precipitant; (b) growing a crystal comprising molecules of the polypeptide from said 10 mixture solution; and (c) separating said crystal from said solution.
- the crystallization growth can be carried out by various techniques known by those skilled in the art, such as for example, batch crystallization, liquid bridge, vapor diffusion, crystallization, or dialysis crystallization. Preferably, the crystallization growth is achieved using vapor diffusion techniques.
- the present invention provides vectors useful in methods for preparing a substantially purified extracellular domain of DPP-IV comprising the polypeptide with an amino acid 20 sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1.
- Yet another embodiment of the present invention provides a DPP-IV crystal of SEQ ID NO:2, or a homologue, analogue or variant thereof.
- the present invention provides methods for determining the X-ray atomic coordinates of the crystalline compositions at a 2.7A resolution. 25
- the present invention provides a molecule or molecular complex crystal, wherein the crystal has substantially similar atomic coordinates to the atomic coordinates listed in FIG. 2 or portions thereof, or any scalable variations thereof.
- the present invention provides a molecule or molecular complex crystal, wherein the crystal comprises a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 30 listed in SEQ ID NO:1.
- a further embodiment of the invention provides a crystal comprising an amino acid sequence that is at least 98% or 95% homologous to a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 listed in SEQ ID NO:1.
- An even further embodiment of the invention provides a crystal comprising an amino acid sequence that is at least 98% or 95% homologous to a polypeptide with an amino acid sequence spanning the amino 35 acids Gly31 to Pro766 listed in SEQ ID NO:1 , and having the atomic coordinates listed in FIG. 2.
- the present invention provides a molecule or molecular complex crystal, wherein the crystal comprises a polypeptide, or a portion thereof, with atomic coordinates having a root mean square deviation from the protein backbone atoms (N, C ⁇ , C, and O) listed in FIG. 2 of less than 0.2, 0.5, 0.7, 1.0, 1.2, 1.5, 2.0 or 2.5 A. 40.
- the present invention provides a scalable, or translatable, three dimensional configuration of points derived from structural coordinates of at least a portion of a DPP-IV molecule or molecular complex comprising a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 listed in SEQ ID NO:1.
- the invention also comprises the structural coordinates of at least a portion of a molecule or a molecular complex that is structurally homologous to a DPP-IV molecule or molecular complex.
- the configuration of points derived from a homologous molecule or molecular complex has a root mean square deviation of less than about 0.2, 0.5, 0.7, 1.0, 1.2, 1.5, 2.0 or 2.5 A from the structural coordinates provided in FIG. 2.
- the invention provides computers for producing a three-dimensional respresentation of aspect eight of the present invention can be used to design and identify potential ligands or inhibitors of DPP-IV by, for example commercially available molecular modeling software in conjunction with structure-based drug design as provided herein.
- the present invention provides computer for producing three-dimensional representations of: a. a molecule or molecular complex comprising a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , or a homologue, or a variant thereof; b.
- a molecule or molecular complex wherein the atoms of the molecule or molecular complex are represented by atomic coordinates that are substantially similar to, or are subsets of, the atomic coordinates listed in FIG. 2; c. a molecule or molecular complex, wherein the molecule or molecular complex comprises atomic coordinates having a root mean square deviation of less than 0.2, 0.5, 0.7, 1.0, 1.2 , 1.5, 2.0 or even 2.5 A from the atomic coordinates for the carbon backbone atoms listed in FIG. 2; or d. a molecule or molecular complex, wherein the molecule or molecular complex comprises a binding pocket or site defined by the structure coordinates that are substantially similar to the atomic coordinates listed in FIG.
- said computer comprises: (i) a computer-readable data storage medium comprising a data storage medium encoded with computer-readable data, wherein said data comprises the structure coordinates of FIG.
- the present invention provides methods involving molecular replacement to obtain structural information about a molecule or molecular complex of unknown structure.
- the method includes crystallizing the molecule or molecular complex, generating an x-ray diffraction pattern from the crystallized molecule or molecular complex, and applying at least a portion of the structure coordinates set forth in FIG. 2 to the x-ray diffraction pattern to generate a three-dimensional electron density map of at least a portion of the molecule or molecular complex.
- the present invention provides methods for generating 3-D atomic coordinates of protein homologues, analogues, or variants of DPP-IV using the x-ray coordinates of DPP- IV described in FIG. 2, comprising: a. identifying one or more homologous polypeptide sequences to DPP-IV; b.
- DPP-IV which comprises a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 ; c identifying structurally conserved and structurally variable regions between said homologous sequence(s) and DPP-IV; d. generating 3-D coordinates for structurally conserved residues of the said homologous sequence(s) from those of DPP-IV using the coordinates listed in FIG. 2; e. generating conformations for the loops in the structurally variable regions of said homologous sequence(s); f. building the side-chain conformations for said homologous sequence(s); and g.
- Embodiments of the ninth aspect provide methods which further comprise refining and evaluating the full or partial 3-D coordinates. These methods may thus be used, for example, to generate 3- dimensional structures for proteins for which 3-dimensional atomic coordinates have not been determined. As such, the newly generated structure can help to elucidate enzymatic mechanisms, or be used in conjunction with other molecular modeling techniques in structure based drug design.
- the present invention provides methods for identifying inhibitors, ligands, and the like, of DPP-IV by providing the coordinates of a molecule of DPP-IV to a computerized modeling system; identifying chemical entities that are likely to bind or interact with the molecule (e.g., by screening a small molecule library); and, optionally, procuring or synthesizing and assaying the compounds or analogues derived thereof for bioactivity.
- Further aspects of the present invention relate to methods for identifying potential ligands for DPP-IV or homologues, or analogue or variants thereof comprising: a.
- DPP-IV enzyme or homologue or analogue or variant thereof as defined by atomic coordinates that are substantially similar to the atomic coordinates listed in FIG. 2 on a computer display screen; b. optionally replacing one or more the enzyme amino acid residues listed in SEQ ID NO:1 , or preferably one or more amino acid residues selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740, in said three-dimensional structure with a different naturally occurring amino acid or an unnatural amino acid to display a variant structure; c.
- the structural aspects of the ligand may be modified to generate a structural analog of the ligand.
- This analog can then be used in the above methods to identify binding ligands.
- One of ordinary skill in the art will know the various ways a structure may be modified.
- the methods further comprise computationally modifying the structure of the ligand; computationally determining the fit of the modified ligand using the three-dimensional coordinates described in FIG. 2, or portions thereof; contacting said modified ligand with said enzyme, or homologue, or variant thereof in an in vitro or in vivo setting; and measuring the ability of said ligand to modulate the activity of said enzyme.
- the present invention provides compositions, such as, pharmaceutical compositions comprising the inhibitors or ligands designed according to any of the methods of the present invention.
- a composition is provided that includes an inhibitor or ligand designed or identified by any of the above methods.
- the composition is a pharmaceutical composition.
- the twelfth aspect of the present invention are methods for treating Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome, inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes, comprising administering pharmaceutical compositions, identified by structure based design using the atomic coordinates, or portions thereof,
- Figure 1 is an orthogonal view of an embodiment of DPP-IV shown in a ribbon representation.
- FIG. 2 is a list of the X-ray coordinates of a DPP-IV crystal as described in the Examples.
- DETAILED DESCRIPTION OF THE INVENTION The present invention relates to crystalline compositions of DPP-IV, 3-D X-ray atomic coordinates of said crystalline compositions, methods of preparing said compositions, methods of determining the 3-D X-ray atomic coordinates of said crystalline compositions, and methods of using said atomic coordinates in conjunction with computational methods to identify binding site(s), or identify ligands which interact with said binding site(s) to agonize or antagonize DPP-IV.
- affinity refers to the tendency of a molecule to associate with another.
- the affinity of a drug is its ability to bind to its biological target (receptor, enzyme, transport system, etc.)
- affinity can be thought of as the frequency with which the drug, when brought into the proximity of a receptor by diffusion, will reside at a position of minimum free energy within the force field of that receptor.
- agonist refers to an endogenous substance or a drug that can interact with a receptor and initiate a physiological or a pharmacological response characteristic of that receptor (contraction, relaxation, secretion, enzyme activation, etc.)
- analog refers to a drug or chemical compound whose structure is related in some way to that of another drug or chemical compound, but whose chemical and biological properties may be quite different.
- antagonist refers to a drug or a compound that opposes the physiological effects of another. At the receptor level, it is a chemical entity that opposes the receptor- associated responses normally induced by another bioactive agent.
- asymmetric unit refers to the basic motif, which is repeated in 3-D space by the symmetry operators of the crystallographic space group, of which the coordinated of the structure are determined. It is the smallest part of the crystal structure from which the complete structure can be built using space group symmetry.
- binding site refers to a specific region (or atom) in a molecular entity that is capable of entering into a stabilizing interaction with another molecular entity. In certain embodiments the term also refers to the reactive parts of a macromolecule that directly participate in its specific combination with another molecule. In other embodiments, a binding site may be comprised or defined by the three dimensional arrangement of one or more amino acid residues within a folded polypeptide.
- the binding site further comprises prosthetic groups, water molecules or metal ions which may interact with one or more amino acid residues.
- Prosthetic groups, water molecules, or metal ions may be apparent from X-ray crystallographic data, or may be added to an apo protein or enzyme using in silico methods.
- bioactivity refers to DPP-IV activity that exhibits a biological property conventionally associated with a DPP-IV agonist or antagonist, such as a property that would allow treatment of one or more of the various diseases such as, for example, diabetes, glucose tolerance, obesity, appetite regulation, lipidemia, osteoporosis, neuropeptide metabolism and T-cell activation, among others.
- catalytic domain refers to the catalytic domain of the DPP-IV class of enzymes, which features a conserved segment of amino acids in the carboxy-terminal portion of the proteins, wherein this segment has been demonstrated to include the catalytic site of these enzymes. This conserved catalytic domain extends approximately from residue 552 to 766 of the full-length enzyme of DPP-IV (SEQ ID NO:1 ). "To clone” as used herein, means obtaining exact copies of a given polynucleotide molecule using recombinant DNA technology.
- to clone into may be meant as inserting a given first polynucleotide sequence into a second polynucleotide sequence, preferably such that a functional unit combining the functions of the first and the second polynucleotides results.
- a polynucleotide from which a fusion protein may be translationally provided which fusion protein comprises amino acid sequences encoded by the first and the second polynucleotide sequences.
- co-crystallization is taken to mean crystallization of a preformed protein/I igand complex.
- complex or “co-complex” are used interchangeably and refer to a DPP-IV molecule, or a variant, or homologue of DPP-IV in covalent or non-covalent association with a substrate, ligand, inhibitor.
- contacting as used herein applies to in silico, in vitro, and/or in vivo experiments.
- Diseases and particularly “diseases that are associated with proteins that are subject to processing by DPP-IV”, include, but are not limited to, for example, Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome, inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes.
- extracellular domain refers to the extracellular domain of DPP-IV, which features a conserved segment of amino acids, whereby this segment has been demonstrated to include glycosylation sites, a cysteine-rich region and the catalytic active site. This conserved extracellular domain extends approximately from residue Gly31 to Pro766 of the full length enzyme (SEQ ID NO:1 ).
- gene refers to a nucleic acid comprising an open reading frame encoding a polypeptide, including both exon and (optionally) intron sequences.
- hDPP-IV Human DPP-IV
- ADA adenosine deaminase binding protein
- hDPP-IV is a single polypeptide chain of 766 amino acids, which consists if five regions: a cytoplasmic region (residues about 1 -6), a transmembrane region (residues about 7-28), a highly gycosylated region (residues about 29-323), a cysteine-rich region (residues about 324-551 ), and a catalytic region (residues about 552-766) (Hiramatsu et al., (2003)).
- the term "high affinity” as used herein means strong binding affinity between molecules with a dissociation constant K D of no greater than 1 ⁇ M.
- the K D is less than 100 nM, 10 nM, 1 nM, 100 pM, or even 10 pM or less.
- the two molecules can be covalently linked (K D is essentially 0).
- the term "homologue” as used herein means a protein, polypeptide, oligopeptide, or portion thereof, having preferably at least 95% amino acid sequence identity with DPP-IV enzyme as described in SEQ ID NO: 1 or SEQ ID NO:2 or with any extracellular domain described herein, or with any functional or structural domain of lipid binding protein.
- SEQ ID NO:1 is the full-length amino acid sequence of a wild- type human DPP-IV
- SEQ ID NO:2 is the hDPP-IV construct including residues 31 -766 which, as described in the Examples, was purified, expressed and crystallized.
- residues 31 -766 which, as described in the Examples, was purified, expressed and crystallized.
- the term "substantially similar atomic coordinates" or atomic coordinates that are “substantially similar” refers to any set of structure coordinates of DPP-IV or DPP-IV homologues, or DPP-IV variants, polypeptide fragments, described by atomic coordinates that have a root mean square deviation for the atomic coordinates of protein backbone atoms (N, C ⁇ , C, and O) of less than about 2.5, 2.0 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A when superimposed using backbone atoms of structure coordinates listed in FIG. 2.
- structures that have substantially similar coordinates as those listed in FIG. 2 shall be considered identical to the coordinates listed in FIG. 2.
- substantially similar also applies to an assembly of amino acid residues that may or may not form a contiguous polypeptide chain, but whose three dimensional arrangement of atomic coordinates have a root mean square deviation for the atomic coordinates of protein backbone atoms (N, C ⁇ , C, and O), or the side chain atoms, of less than about 2.5, 2.0, 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A when superimposed-using backbone atoms, or the side chain atoms- of the atomic coordinates of similar or the same amino acids from the coordinates listed in FIG. 2.
- Those skilled in the art further understand that the coordinates listed in FIG.
- the coordinates listed in Fig. 2, or portions thereof may be transformed by algorithms which translate or rotate the atomic coordinates.
- molecular mechanics, molecular dynamics or ab intio algorithms may modify the atomic coordinates.
- Atomic coordinates generated from the coordinates listed in FIG. 2, or portions thereof, using any of the aforementioned algorithms shall be considered identical to the coordinates listed in FIG. 2.
- the term "in silico" as used herein refers to experiments carried out using computer simulations.
- the in silico methods are molecular modeling methods wherein 3-dimensional models of macromolecules or ligands are generated.
- the in silico methods comprise computationally assessing ligand binding interactions.
- ligand describes any molecule, e.g., protein, peptide, peptidomimetics, oligopeptide, small organic molecule, polysaccharide, polynucleotide, etc., which binds or interacts, generally but not necessarily specifically to or with another molecule.
- the ligand is an agonist, whereby the molecule upregulates (i.e., activates or stimulates, e.g., by agonizing or potentiating) activity, while in another aspect of the invention the ligand is an inhibitor or antagonist, whereby the molecule down regulates (i.e., inhibits or suppresses, e.g. by antagonizing, decreasing or inhibiting) the activity.
- modulate refers to both upregulation (i.e., activation or stimulation, e.g., by agonizing or potentiating) and down-regulation (i.e., inhibition or suppression, e.g., by antagonizing, decreasing or inhibiting) of a bioactivity.
- pharmacophore refers to the " ensemble of steric and electronic features of a particular structure that is necessary to ensure the optimal supramolecular interactions with a specific biological target structure and to trigger (or to block) its biological response.
- a pharmacophore is an abstract concept that accounts for the common molecular interaction capacities of a group of compounds towards their target structure.
- the term can be considered as the largest common denominator shared by a set of active molecules.
- Pharmacophoric descriptors are used to define a pharmacophore, including H-bonding, hydrophobic and electrostatic interaction sites, defined by atoms, ring centers and virtual points.
- a pharmacophore may represent an ensemble of steric and electronic factors which are necessary to insure supramolecular interactions with a specific biological target structure.
- a pharmacophore may represent a template of chemical properties for an active site of a protein/enzyme representing these properties' spatial relationship to one another that theoretically defines a ligand that would bind to that site.
- the term "precipitant" as used herein includes any substance that, when added to a solution, causes a precipitate to form or crystals to grow.
- Suitable precipitants include, but are not limited to, alkali (e.g., Li, Na, or K), or alkaline earth metal (e.g., Mg, or Ca) salts, and transition metal (e.g., Mn, or Zn) salts.
- Common counter ions to the metal ions include, but are not limited to, halides, phosphates, citrates and sulfates.
- prodrug refers to drugs that, once administered, are chemically modified by metabolic processes to become pharmaceutically active. In certain embodiments the term also refers to any compound that undergoes biotransformation before exhibiting its pharmacological effects.
- Prodrugs can thus be viewed as drugs containing specialized non-toxic protective groups used in a transient manner to alter or to eliminate properties, usually undesirable, in the parent molecule.
- the term "receptor” as used here in refers to a protein or a protein complex in or on a cell that specifically recognizes and binds to a compound acting as a molecular messenger (neurotransmitter, hormone, lymphokine, lectin, drug, etc.). In a broader sense, the term receptor is used interchangeably with any specific (as opposed to non-specific, such as binding to plasma proteins) drug binding site, also including nucleic acids such as DNA.
- recombinant protein refers to a polypeptide which is produced by recombinant DNA techniques, wherein generally, DNA encoding a polypeptide is inserted into a suitable expression vector which is, in turn, used to transform a host cell to produce the polypeptide encoded by the DNA.
- This polypeptide may be one that is naturally expressed by the host cell, or it may be heterologous to the host cell, or the host cell may have been engineered to have lost the capability to express the polypeptide which is otherwise expressed in wild type forms of the host cell.
- the polypeptide may also be, for example, a fusion polypeptide.
- the phrase "derived from”, with respect to a recombinant gene, is meant to include within the meaning of "recombinant protein” those proteins having an amino acid sequence of a native polypeptide, or an amino acid sequence similar thereto which is generated by mutations, including substitutions, deletions and truncation, of a naturally occurring form of the polypeptide.
- selective DPP-IV inhibitor refers to a substance, such as for example, an organic molecule, that effectively inhibits an enzyme from the DDP-IV family to a greater extent than any other DPP enzyme.
- a selective DPP-IV inhibitor is a substance, having a K, for inhibition of DPP-IV that is less than about one-half, one-fifth, or one-tenth the K, that the substance has for inhibition of any other DPP enzyme.
- the substance inhibits DPP-IV activity to the same degree at a concentration of about one-half, one-fifth, one-tenth or less than the concentration required for any other DPP enzyme.
- a substance is considered to effectively inhibit DPP-IV if it has an IC 50 or K, of less than or about 10 mM, 1 mM, 500 nM, 100 nM, 50 nM or 10 nM.
- small molecules refers to drugs as they are orally available (unlike proteins which must be administered by injection, topically or inhalation).
- the size of the small molecules is generally under 1000 Daltons, but many estimates seem to range between 300 to 700 Daltons.
- space group refers to the lattice and symmetry of the crystal. In a space group designation the capital letter indicates the lattice type and the other symbols represent symmetry operations that can be carried out on the contents of the asymmetric unit without changing its appearance.
- therapeutically effective amount is meant that amount which is capable of at least partially reversing and/or treat the symptoms of the disease.
- a therapeutically effective amount can be determined on an individual basis and will be based, at least in part, on a consideration of the species of the mammal, the size of the mammal, the type of delivery system used, and the type of administration relative to the progression of the disease.
- a therapeutically effective amount can be determined by one of ordinary skill in the arts.
- the term "transfection” means the introduction of a nucleic acid, e.g., via an expression vector, into a recipient cell by nucleic acid-mediated gene transfer.
- Transformation refers to a process in which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or RNA and, for example, the transformed cell expresses a recombinant form of a polypeptide or, in the case of anti-sense expression from the transferred gene, the expression of a naturally-occurring form of the polypeptide is disrupted.
- variants in relation to the polypeptide sequence in SEQ ID NO:1 or SEQ ID NO:2 include any substitution of, variation of, modification of, replacement of, deletion of, or addition of one or more amino acids from or to the sequence providing a resultant polypeptide sequence for an enzyme having DPP-IV activity.
- the variant, homologue, fragment or portion, of SEQ ID NO:1 or SEQ ID NO:2 comprises a polypeptide sequence of at least 5 contiguous amino acids, preferably at least 10 contiguous amino acids, preferably at least 15 contiguous amino acids, preferably at least 20 contiguous amino acids, preferably at least 25 contiguous amino acids, or preferably at least 30 contiguous amino acids.
- vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- One type of preferred vector is an episome, i.e., a nucleic acid capable of extra- chromosomal replication.
- Suitable host-vector systems include but are not limited to mammalian cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA.
- virus e.g., vaccinia virus, adenovirus, etc.
- insect cell systems infected with virus e.g., baculovirus
- microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA.
- the expression elements of vectors vary in their strengths and specificities. As those depending on the host-vector system utilized, any one of a number of suitable transcription and translation elements may be used.
- Preferred vectors are those capable of autonomous replication and/or expression of nucleic acids to which
- expression vectors In general, expression vectors of utility in recombinant DNA techniques are often in the form of "plasmids" which refer generally to circular double stranded DNA loops which, in their vector form are not bound to the chromosome.
- plasmid and “vector” are used interchangeably as the plasmid is the most commonly used form of vector.
- the invention is intended to include such othefr forms of expression vectors which serve equivalent functions and which become known in the art subsequently hereto.
- nucleotide sequence coding for a DPP-IV polypeptide, or a functional fragment, including the C-terminal peptide fragment of the catalytic domain of DPP-IV protein, and/or derivatives or analogs thereof, including a chimeric protein, thereof can be inserted into a suitable expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of the inserted protein-coding sequence.
- the nucleic acid encoding a DPP-IV polypeptide of the invention or a functional fragment comprising the extracellular domain of of the DPP-IV protein, or a homologue, an analog, or variant thereof is operationally associated with a promoter in an expression vector of the invention.
- the expression vector contains the nucleotide sequence coding for the polypeptide comprising the amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1. Both cDNA and genomic sequences can be cloned and expressed under the control of such regulatory sequences.
- An expression vector also preferably includes a replication origin.
- the necessary transcriptional and translational signals can be provided on a recombinant expression vector.
- all genetic manipulations described for the DPP-IV gene in this section may also be employed for genes encoding a functional fragment, including the C-terminal peptide fragment of the catalytic domain of the DPP-IV protein, derivatives or analogs thereof, including a chimeric protein thereof.
- Suitable host-vector systems include but are not limited to mammalian cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA.
- virus e.g., vaccinia virus, adenovirus, etc.
- insect cell systems infected with virus e.g., baculovirus
- microorganisms such as yeast containing yeast vectors
- bacteria transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA.
- the expression elements of vectors vary in their strengths and specificities. As those depending on the host-vector system utilized, any one of a number of suitable transcription and translation elements may be used.
- a recombinant DPP-IV protein of the invention may be expressed chromosom
- any of a number of amplification systems may be used to achieve high levels of stable gene expression.
- a suitable cell for purposes of this invention is one into which the recombinant vector comprising the nucleic acid encoding DPP-IV protein is cultured in an appropriate cell culture medium under conditions that provide for expression of DPP-IV protein by the cell.
- Any of the methods previously described for the insertion of DNA fragments into a cloning vector may be used to construct expression vectors containing a gene consisting of appropriate transcriptional/translational control signals and the protein coding sequences.
- DPP-IV protein expression may be controlled by any promoter/enhancer element known in the art, provided that these regulatory elements must be functional in the host selected for expression, as would be appreciated by those of skill in the art.
- Vectors containing a nucleic acid encoding a DPP-IV protein of the invention can be identified, for example, by four general approaches: (1 ) PCR amplification of the desired plasmid DNA or specific mRNA; (2) nucleic acid hybridization; (3) presence or absence of selection marker gene functions; and (4) expression of inserted sequences.
- the invention is further intended to include other forms of identification of vectors, containing a nucleic acid encoding a DPP-IV protein of the present invention, which serve equivalent functions and which become known in the art subsequently hereto.
- the nucleic acids can be amplified by PCR to provide for detection of the amplified product.
- the presence of a foreign gene inserted in an expression vector can be detected by nucleic acid hybridization using probes comprising sequences that are homologous to an inserted marker gene.
- the recombinant vector/host system can be identified and selected based upon the presence or absence of certain "selection marker" gene functions (e.g., beta-galactosidase activity, thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion body formation in baculovirus, etc.) caused by the insertion of foreign genes in the vector.
- selection marker e.g., beta-galactosidase activity, thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion body formation in baculovirus, etc.
- recombinant expression vectors can be identified by assaying for the activity, biochemical, or immunological characteristics of the gene product expressed by the recombinant vector, provided that the expressed protein assumes a functionally active conformation.
- a wide variety of host/expression vector combinations may be employed in expressing the DNA sequences of this invention as known by those of skill in the art. Once a particular recombinant DNA molecule is identified and isolated, several methods known in the art may be used to propagate it. Once a suitable host system and growth conditions are established, recombinant expression vectors can be propagated and prepared in quantity.
- the expression vectors which can be used include, but are not limited to, the following vectors or their derivatives: human or animal viruses such as vaccinia virus or adenovirus; insect viruses such as baculovirus; yeast vectors; bacteriophage vectors (e.g., lambda); and plasmid and cosmid DNA vectors, to name but a few.
- Vectors can be introduced into the desired host cells by methods known in the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun, or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621 -14624; Hartmut et al., Canadian Patent Application No. 2,012,311 , filed Mar. 15, 1990).
- X-ray structure coordinates define a unique configuration of points in space.
- a set of structure coordinates for a protein or a protein/ligand complex, or a portion thereof define a relative set of points that, in turn, define a configuration in three dimensions.
- a similar or identical configuration can be defined by an entirely different set of coordinates, provided the distances and angles between atomic coordinates remain essentially the same.
- a scalable configuration of points can be defined by increasing or decreasing the distances between coordinates by a scalar factor while keeping the angles essentially the same.
- One aspect of the present invention relates to a crystalline composition
- a crystalline composition comprising a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1.
- the crystallized complex is characterized by the structural coordinates listed in FIG. 2, or portions thereof.
- the atoms of the ligand are within about 5 angstroms of one or more DPP-IV amino acids in SEQ ID NO: 1 preferably selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740.
- One embodiment of the crystallized complex is characterized as belonging to the space group P4 3 2 !
- the ligand may be a small molecule which binds to DPP-IV extracelluar domain defined by SEQ ID NO:2, or portions thereof, with a Kj of less than about 10 mM, 1 mM, 500 nM, 100 nM, 50 nM, or 10 nM.
- Various computational methods can be used to determine whether a molecule or a binding pocket portion thereof is "structurally equivalent,” defined in terms of its three-dimensional structure, to all or part of DPP-IV or its binding pocket(s).
- Such methods may be carried out in current software applications, such as the molecular similarity application of QUANTA (Accelrys Inc., San Diego, Calif.).
- the molecular similarity application permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure.
- the procedure used in molecular similarity to compare structures is divided into four steps: (1 ) load the structures to be compared; (2) optionally define the atom equivalences in these structures; (3) perform a fitting operation; and (4) analyze the results.
- Each structure is identified by a name.
- One structure is identified as the target (i.e., the fixed structure), while all remaining structures are working structures (i.e., moving structures).
- equivalent atoms are defined as protein backbone atoms (N, C ⁇ , C, and O) for all conserved residues between the two structures being compared.
- a conserved residue is defined as a residue that is structurally or functionally equivalent (See Table 4 infra).
- rigid fitting operations are considered.
- flexible fitting operations may be considered.
- Particularly preferred structurally equivalent molecules or molecular complexes are those that are defined by the entire set of structural coordinates listed in FIG.
- root mean square deviation means the square root of the arithmetic mean of the squares of the deviations. It is a way to express the deviation or variation from a trend or object.
- root mean square deviation defines the variation in the backbone of a protein from the backbone of DPP-IV or a binding pocket portion thereof, as defined by the structural coordinates of DPP-IV described herein.
- the refined x-ray coordinates of the extracellular domain of DDP-IV (amino acids 38 to 766 as listed in SEQ ID NO:2), Zn 2+ , Mg 2+ , and 32 water molecules are as listed in FIG. 2.
- One orthogonal view of the molecule is shown in FIG. 1.
- the crystal structure of the extracellular domain (amino acids 31 -766 of SEQ ID NO:1 ) was solved to a resolution 2.7 A.
- the asymmetric unit is composed of one dimer.
- the structure includes two domains, the ⁇ -propeller domain (residues 55-497) and the catalytic domain (residues 508-766), together with a couple of linker regions (1 -54 and 498-507).
- the propeller domain packs against the hydrolase domain, and the catalytic triad of DPP-IV composed of residues Ser630, His740 and Asp708, which are located which the last 140 residues of the C-terminal region is at the interface of the two domains.
- the present invention provides a molecule or molecular complex that includes at least a portion of a DDP-IV and/or substrate binding pocket.
- the DDP-IV binding pocket includes the amino acids listed in Table 1 , the binding pocket being defined by a set of points having a root mean square deviation of less than about 2.5, 2.0, 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A, from points representing the backbone atoms of the amino acids in Table 1.
- the DPP-IV substrate binding pocket includes the amino acids selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740 from SEQ ID NO:1.
- Table 1 Identified residues 5 A away from the binding pocket of the DPP-IV crystal structure.
- One embodiment of the invention describes an isolated polypeptide consisting of a portion of DPP-IV which functions as the binding site when folded in a 3-D orientation.
- One embodiment is an isolated polypeptide comprising a portion of DPP-IV, wherein the portion starts at about amino acid residue Gly31 , and ends at about amino acid residue Pro766 as described in SEQ ID NO: , or a sequence that is at least 95% or 98% homologous to a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 as listed in SEQ ID NO:1.
- Another embodiment comprises crystalline compositions comprising variants of DPP-IV.
- Variants of the present invention may have an amino acid sequence that is different by one or more amino acid substitutions to the sequence disclosed in SEQ ID NO:1 or SED ID NO:2.
- Embodiments which comprise amino acid deletions and/or additions are also provided.
- the variant may, for example, have conservative changes (amino acid similarity), wherein a substituted amino acid has similar structural or chemical properties, for example, the replacement of leucine with isoleucine.
- determining which and how many amino acid residues may be substituted, inserted, or deleted without adversely affecting biological or pharmacological activity may be reasonably inferred in view of this disclosure, and may further be found using computer programs well known in the art, for example, DNAStar® software (DNAStar Inc.
- amino acid substitutions may be made, for instance, on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues provided that a biological and/or pharmacological activity of the native molecule is retained.
- Negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; amino acids, with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, and valine; amino acids with aliphatic head groups include glycine, alanine; asparagine, glutamine, serine; and amino acids with aromatic side chains include threonine, phenylalanine, and tyrosine. Examples of conservative substitutions are set forth in Table 4 as follows: Table 4:
- “Homology” is a measure of the identity of nucleotide sequences or amino acid sequences. To characterize the homology, subject sequences are aligned so that the highest percentage homology (match) is obtained, after introducing gaps, if necessary, to achieve maximum percent homology. N- or C- terminal extensions shall not be construed as affecting homology. "Identity" per se has an art-recognized meaning and can be calculated using published techniques. Computer program methods to determine identity between two sequences, for example, include DNAStar® software; the GCG® program package (Devereux, J., et al. Nucleic Acids Research (1984) 12(1 ): 387); BLASTP, BLASTN, FASTA (Atschul, S.F.
- the parameters are set such that the percentage of identity is calculated over the full length of the reference nucleotide sequence or amino acid sequence and that gaps in homology of up to about 90% of the total number of nucleotides in the reference sequence are allowed.
- Similarity between two sequences includes direct matches as well a conserved amino acid substitutes which possess similar structural or chemical properties, e.g., similar charge as described in Table 2. Percentage similarity (conservative substitutions) between two polypeptides may also be scored by comparing the amino acid sequences of the two polypeptides by using programs well known in the art, including the BESTFIT program, by employing default settings for determining similarity.
- a further embodiment of the invention is a crystal comprising the coordinates of FIG. 2, wherein the amino acid sequence is represented by SEQ ID NO:1.
- a further embodiment of the invention is a crystal comprising the coordinates of FIG.2, wherein the amino acid sequence is at least 95% or 98% homologous to the amino acid sequence represented by SEQ ID NO:1.
- Various methods for obtaining atomic coordinates of structurally homologous molecules and molecular complexes using homology modeling are disclosed in, for example, US Patent No: 6,356,845.
- a ligand (antagonist or agonist) may be examined through the use of computer modeling using a docking program such as GRAM, DOCK, or AUTODOCK (See for example, Morris et al., J. Computational Chemistry, 19:1639-1662 (1998)). This procedure can include in silico fitting of potential ligands to the DPP-IV crystal structure to ascertain how well the shape and the chemical structure of the potential ligand will complement or interfere with the catalytic domain of DPP-IV.
- One embodiment of the present invention relates to methods of identifying agents that bind to a binding site on DPP-IV extracellular domain wherein the binding site comprises amino acid residues Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740 of SEQ ID NO:1 , comprising: contacting DPP-IV with a test ligand under conditions suitable for binding of the test compound to the binding site, and determining whether the test ligand binds in the binding site, wherein if binding occurs, the test ligand is an agent that binds in the binding site.
- the testing may be carried out in silico using a variety of molecular modeling software algorithms including, but not limited to, DOCK, ALADDIN, CHARMM simulations, AFFINITY, C2-LIGAND FIT, Catalyst, LUDI, CAVEAT, and CONCORD.
- molecular modeling software algorithms including, but not limited to, DOCK, ALADDIN, CHARMM simulations, AFFINITY, C2-LIGAND FIT, Catalyst, LUDI, CAVEAT, and CONCORD.
- a potential ligand may be obtained by screening a random peptide library produced by a recombinant bacteriophage (Scott and Smith, Science, 249:386-390 (1990); Cwirla et al., Proc. Natl. Acad. Sci., 87:6378-6382 (1990); Devlin et al., Science, 249:404-406 (1990)) or a chemical library, or the like.
- a ligand selected in this manner can be then be systematically modified by computer modeling programs until one or more promising potential ligands are identified.
- Such analysis for example, has been shown to be effective in the development of HIV protease inhibitors.
- a potential ligand (agonist or antagonist)
- it can be either selected from a library of chemicals as are commercially available from most large chemical companies or, alternatively, the potential ligand may be synthesized de novo. As mentioned above, the de novo synthesis of one or even a relatively small group of specific compounds is reasonable in the art of drug design.
- the potential ligand can be placed into any standard binding assay as well known to those skilled in the art to test its effect on DPP-IV activity.
- a supplemental crystal can be grown comprising a protein- ligand complex formed between a DPP-IV protein and the drug.
- the crystal diffracts X-rays allowing the determination of the atomic coordinates of the protein-ligand complex to a resolution of less than 5.0 Angstroms, more preferably less than 3.0 Angstroms, and even more preferably less than 2.0 Angstroms.
- the three-dimensional structure of the supplemental crystal can be determined by Molecular Replacement Analysis. Molecular replacement uses a known three-dimensional structure as a search model to determine the structure of a closely related molecule or protein-ligand complex in a new crystal form.
- the measured X-ray diffraction properties of the new crystal are compared with the search model structure to compute the position and orientation of the protein in the new crystal.
- Computer programs that can be used include: X-PLOR and AMORE (J. Navaza, Acta Crystallographies ASO, 157-163 (1994)).
- X-PLOR and AMORE J. Navaza, Acta Crystallographies ASO, 157-163 (1994)
- an electron density map can be calculated using the search model to provide X-ray phases. Thereafter, the electron density is inspected for structural differences, and the search model is modified to conform to the new structure.
- the claimed structure of DPP-IV can be used to solve the three- dimensional structures of any such DPP-IV complexed with a ligand.
- STAT crystals include: QUANTA; CHARMM; INSIGHT; SYBYL; MACROMODEL; and ICM.
- Suitable in silico methods for screening, designing or selecting ligands are disclosed in, for example, U.S. Patent No. 6,356,845.
- the present invention discloses binding agents which interact with a binding site of DPP-IV defined by a set of points having a root mean square deviation of less than about 2.5, 2.0, 1.7, 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A from points representing the backbone atoms of the amino acids represented by the structure coordinates listed in FIG. 2. Such embodiments represent variants of the DPP-IV crystal.
- the present invention provides ligands which bind to a folded polypeptide comprising an amino acid sequence spanning amino acids 31 to 766 listed in SEQ ID NO:1 , or a homologue or variant thereof.
- the ligand is a competitive or uncompetitive inhibitor of DPP-IV.
- the ligand inhibits DPP-IV with an IC 50 or K, of less than about 10 mM, 1 mM, 500 nM, 100 nM, 50 nM or 10 nM. In yet further embodiments, the ligand inhibits DPP-IV with a K, that is less than about one-half, one-fifth, or one-tenth the K, that the substance has for inhibition of any other DPP-IV enzyme. In other words, the substance inhibits DPP-IV activity to the same degree at a concentration of about one-half, one-fifth, one-tenth or less than the concentration required for any other DPP enzyme.
- One embodiment of the present invention relates to ligands, such as proteins, peptides, peptidomimetics, small organic molecules, etc., designed or developed with reference to the crystal structure of DPP-IV as represented by the coordinates presented herein in FIG. 2, and portions thereof.
- Such binding agents interact with the binding site of the DPP-IV represented by one or more amino acid residues selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740.
- Machine Readable Storage Media Transformation of the structure coordinates for all or a portion of DPP-IV or one of its binding pockets, for structurally homologous molecules as defined below, or for the structural equivalents of any of these molecules or molecular complexes as defined above, into three-dimensional graphical representations of the molecule or complex can be conveniently achieved through the use of commercially-available software.
- the invention thus further provides a machine-readable storage medium comprising a data storage material encoded with machine readable data which, when using a machine programmed with instructions for using said data, is capable of displaying a graphical three-dimensional representation of any of the molecule or molecular complexes of this invention that have been described above.
- the machine-readable data storage medium comprises a data storage material encoded with machine readable data which, when using a machine programmed with instructions for using said data, is capable of displaying a graphical three-dimensional representation of a molecule or molecular complex comprising all or any parts of a DPP-IV binding pocket, as defined above.
- the machine-readable data storage medium is capable of displaying a graphical three-dimensional representation of a molecule or molecular complex defined by the structure coordinates of the amino acids listed in FIG. 4, plus or minus a root mean square deviation from the backbone atoms of said amino acids of not more than 2.0 .A.
- the machine-readable data storage medium comprises a data storage material encoded with a first set of machine readable data which comprises the Fourier transform of the structural coordinates set forth in FIG. 2, and which, when using a machine programmed with instructions for using said data, can be combined with a second set of machine readable data comprising the X-ray diffraction pattern of a molecule or molecular complex to determine at least a portion of the structural coordinates corresponding to the second set of machine readable data.
- a system for reading a data storage medium may include a computer comprising a central processing unit (“CPU"), a working memory which may be, e.g., RAM (random access memory) or “core” memory, mass storage memory (such as one or more disk drives or CD-ROM drives), one or more display devices (e.g., cathode-ray tube (“CRT”) displays, light emitting diode (“LED”) displays, liquid crystal displays (“LCDs”), electroluminescent displays, vacuum fluorescent displays, field emission displays (“FEDs”), plasma displays, projection panels, etc.), one or more user input devices (e.g., keyboards, microphones, mice, touch screens, etc.), one or more input lines, and one or more output lines, all of which are interconnected by a conventional bidirectional system bus.
- CPU central processing unit
- working memory which may be, e.g., RAM (random access memory) or “core” memory, mass storage memory (such as one or more disk drives or CD-ROM drives), one or more display devices (e
- the system may be a stand-alone computer, or may be networked (e.g., through local area networks, wide area networks, intranets, extranets, or the internet) to other systems (e.g., computers, hosts, servers, etc.).
- the system may also include additional computer controlled devices such as consumer electronics and appliances.
- Input hardware may be coupled to the computer by input lines and may be implemented in a variety of ways. Machine-readable data of this invention may be inputted via the use of a modem or modems connected by a telephone line or dedicated data line. Alternatively or additionally, the input hardware may comprise CD-ROM drives or disk drives. In conjunction with a display terminal, a keyboard may also be used as an input device.
- Output hardware may be coupled to the computer by output lines and may similarly be implemented by conventional devices.
- the output hardware may include a display device for displaying a graphical representation of a binding pocket of this invention using a program such as QUANTA as described herein.
- Output hardware might also include a printer, so that hard copy output may be produced, or a disk drive, to store system output for later use.
- a CPU coordinates the use of the various input and output devices, coordinates data accesses from mass storage devices, accesses to and from working memory, and determines the sequence of data processing steps.
- a number of programs may be used to process the machine-readable data of this invention. Such programs are discussed in reference to the computational methods of drug discovery as described herein.
- Machine-readable storage devices useful in the present invention include, but are not limited to, magnetic devices, electrical devices, optical devices, and combinations thereof.
- Examples of such data storage devices include, but are not limited to, hard disk devices, CD devices, digital video disk devices, floppy disk devices, removable hard disk devices, magneto-optic disk devices, magnetic tape devices, flash memory devices, bubble memory devices, holographic storage devices, and any other mass storage peripheral device.
- these storage devices include necessary hardware (e.g., drives, controllers, power supplies, etc.) as well as any necessary media (e.g., disks, flash cards, etc.) to enable the storage of data.
- compositions The present invention provides methods for treating certain diseases in a mammal, preferably a human being, in need of such treatment using the ligands, and preferably the inhibitors, as described herein.
- the ligand can be advantageously formulated into pharmaceutical compositions comprising a therapeutically effective amount of the ligand, a pharmaceutically acceptable carrier and other compatible ingredients, such as adjuvants, Freund's complete or incomplete adjuvant, suitable for formulating such pharmaceutical compositions as is known to those skilled in the art.
- compositions containing the ligand can be used for treatment of diseases that are associated with proteins that are subject to processing by DPP-IV, such as Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome, inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes.
- the pharmaceutical composition is administered to the mammal in a therapeutically effective amount such that treatment of the disease occurs.
- the present invention is further illustrated by the following examples, which should not be construed as limiting in any way.
- the contents of all cited references (including literature references, patents, published patent applications as cited throughout this application) are hereby expressly incorporated by reference in their entireties.
- the practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, microbiology and recombinant DNA, X-ray crystallography, and molecular modeling which are within the skill of the art. As those of skill in the art will understand, such techniques are explained fully in the literature.
- hDPP-IV Construction of hDPP-IV: Residues 31 to 766 of homo sapien (human) wild type DPP-IV (SEQ ID NO:1) were amplified by PCR using the following primers DPPIV-Fc31 - BamF (5'-
- TTAAGGATCCTGGCACAGATGATGCTACAGCTGAC-3' (SEQ ID NO: 3)), which introduced a Bam HI site at the N-terminus
- DPPIV-ChisT-XhoR (5'-AATTCTCGAGTTACTAGTGAT GATGGTGGTGATGGCTGCCGCGCGGCACCAGAGGTAAAGAGAAACATTGTTTTATGAAGTGGC-3' (SEQ ID NO: 4)), which introduced His 6 -tag, thrombin cleavage site and Xho I site at the C-terminus.
- the vector contained a polyhedrin promoter and the honeybee melittin secretion signal for efficient, high-level secretion of the recombinant protein.
- the transfection mixture was removed after 5 h of incubation (27°C) and the cells were overlaid with 25 ml of Sf-900 II SFM.
- the recombinant virus were harvested at 72h post-transfection and further amplification of the virus was achieved by infecting 100 ml of Sf-9 cells (1.2 X 10 6 cells/ml) with 2 ml of the recombinant virus for 65-72 h.
- BIIC Baculo Infected Insect Cells
- E coli clones with recombinant bacmid were obtained after transformation of E. coli DH1 OBac cells (Invitrogen Corp., Frederick, MD) with 5 ng of DPPIV-HBM31 -HT plasmid DNA and blue/white-screening according to manufacturer's protocol (Invitrogen).
- Monolayers of Sf9 cells (20 X 10 6 cells in a 162 cm 2 culture flask) were transfected by overlaying 20 ml of transfection mixture containing 100 ⁇ l of mini-prep bacmid DNA and 100 ⁇ l of CellFECTIN reagent (Gibco BRL) in Sf-900 II SFM.
- the transfection mixture was removed after 5 h of incubation (27 a C) and the cells were overlaid with 25 ml of Sf-900 II SFM.
- the recombinant virus were harvested at 72h post-transfection and further amplification of the virus was achieved by infecting 100 ml of Sf-9 cells (1.2 X 10 6 cells/ml) with 2 ml of the recombinant virus for 65-72 h.
- BIIC Baculo Infected Insect Cells
- Example 2 Purification of His6-tagged DPP-IV wild type extracellular domain After clarification by centrifugation and filtration, 10 liters of culture media containing secreted human DPP-IV-31-766-C-his 6 was concentrated 10-20 fold using a hollow fiber filter unit which had been washed with exchange Buffer A (50 mM Tris, 0.3 M NaCI, 1 mM TCEP, pH 8). The concentrated media was exchanged with 5 volumes of Buffer A. After clarification by filtration, imidazole was added to 10 mM by addition of Buffer B (50 mM TrisCI, 0.3 M NaCI, 0.25 M imidazole, 1 mM TCEP, pH 8).
- Buffer A 50 mM Tris, 0.3 M NaCI, 1 mM TCEP, pH 8.
- the sample was applied to a 40 mL immobilized metal affinity column (Ni-NTA Superflow, Qiagen), which had been equilibrated in Buffer A (50 mM TrisCI, 0.3 M NaCI, 1 mM TCEP, pH 8) at 6-8 mL/min.
- Buffer A 50 mM TrisCI, 0.3 M NaCI, 1 mM TCEP, pH 8
- the column was washed with Buffer A to achieve a stable baseline at 280 nm.
- Bound protein was eluted at a lower flow rate in a linear gradient from 0 - 20%B in 4 column volumes (cv) (5%B / cv) followed by a step to 100% B, held isocratic ally for 4 cv.
- Fractions were analyze by SDS-PAGE on 4-12% bis-tris in MOPS buffer using the NuPAGE system (Invitrogen). Fractions containing DPP-IV were pooled and dialyzed at 4 degrees C against 2 changes of dialysis buffer (50 mM TrisCI, 0.1 M NaCI, 1 mM TCEP, pH 8. After dialysis, the sample was concentrated to 6-10 mg/mL and fractionated by size exclusion chromatography on Superdex 200 prep grade HiLoad 16/60 (Amersham Biosciences). DPP-IV eluted as an apparent dimer. Fractions were analyze by SDS-PAGE on 4-12% bis-tris in MOPS buffer using the NuPAGE system (Invitrogen).
- Example 3 Crystallization of DPP-IV DPP-IV of Example 2 was concentrated into buffer containing 50 mM TrisCI, 25 mM NaCI, 1 mM TCEP, pH 8, to 8-10 mg/mL. Leads were obtained through sparse matrix screening at 22°. Optimized crystals grew in drops made from 1.5 ⁇ L of protein + 1.5 ⁇ L of reservoir solution (0.1 M TrisCI, pH 8.5, 0.2 M sodium acetate, 10-16% PEG 4000) equilibrating over the same reservoir solution.
- Crystals were transferred to a solution containing 0.1 M TrisCI, pH 8.5, 0.2 M sodium acetate, 14-16% PEG 4000, and 20% ethylene glycol. Crystals were flash frozen in gaseous or liquid nitrogen for data collection.
- Example 4 X-ray data collection, structure determination and refinement of DPP-IV The crystals prepared in Example 3 were transferred to a cryoprotectant solution, made up of the reservoir solution, with 15-25% ethylene glycol, and then flash-frozen in a stream of cold nitrogen gas at 100K. A full data set was collected from one crystal frozen in this manner at the Advanced Photon Sources of Argonne National Laboratory on a on a ADSC Quantum 210 CCD detector.
- the data is 96.5% complete to 2.7 A resolution with an R m ⁇ rge ⁇ f 0.062 and an average redundancy of 4.4.
- the final model was built with manual rebuilding on the graphics screen, using the program XtalView (McRee, .D. E., Practical Protein Crystallography, Academic Press, San Diego, 1993). Refinement in Refmac was carried out using all data in the resolution range 50.0-2.7A.
- the R-factor for the current model is 0.257 (free R-factor, 5% of the data, 0.340).
- the refinement statistics are summarized in Table 5b.
- the current model contains residues 38-766 (others are disordered in the crystal), 12 sugar and 32 water molecules. Table 5a -Data statistics
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
Abstract
A crystal structure of dipeptidyl peptidase IV (DPP-IV), and the three-dimensional atomic coordinates of the DPP-IV extracellular domain, as described and used for the identification of ligands, including DPP-IV inhibitors, used for the treatment of diseases that are associated with proteins that are subject to processing by DPP-IV.
Description
CRYSTAL STRUCTURE OF DIPEPTIDYL PEPTIDASE IV (DPP-IV) AND USES THEREOF FIELD OF THE INVENTION The present invention relates to crystalline compositions of mammalian dipeptidyl peptidase IV (DPP-IV), methods of preparing said compositions, methods of determining the three-dimensional (3-D) X- ray atomic coordinates of said composition, methods of identifying ligands of DPP-IV using structure based drug design, the use of 3-D crystal structures to design, modify and assess the activity of potential inhibitors, and to the use of such inhibitors for the treatment of, for example, diabetes, glucose tolerance obesity, appetite regulation, hpidemia, osteoporosis neuropeptide metabolism and T-cell activation, among others
BACKGROUND OF THE INVENTION The seπne peptidase dipeptidyl peptidase IV (DPP-IV) is a multifunctional type II cell surface glycoprotein, which is widely expressed in a variety of cell types, particularly on differential epithelial cells of the intestine, liver, prostate tissue, corpus luteum, and kidney proximal tubules (Thoma et al , Structure, 11 , 947-959, 947 (2003) citing Hartel et al , Histochemistry 89, 151 -161 (1988), cCaughan et al , Hepatology 11 , 534-544 (1990) as well as leukocytes subsets (Thoma et al , (2003) citing Gorrell et al , Cell Immunol 134, 205-215 (1991 )) DPP-IV has roles in many biological processes including its ability to modulate the biological activity of several peptide hormones, chemokines and neuropeptides by specifically cleaving after a proline or alanine at ammo acid position 2 from the N terminus, a rate-limiting step in the degradation of peptides (Mentlem, R Regul Pept 85, 9-24 (1999) Therefore, the natural substrates of DPP-IV include several chemokines, cytokines, neuropeptides, circulating hormones, and bioactive peptides which as those of skill in the art will appreciate, suggests a key regulatory role in the metabolism of peptide hormones and in ammo acid transport ((Lambeir et al , FEBS Lett 507, 327-330 (2001), (Hildebrandt et
DPP-IV has been implicated in the control of glucose homeostasis because its substrates include the incretin peptides glucagon-like peptide 1 (GLP-1 ) and gastric inhibitory polypeptide (GIP) Cleavage of the N-termmal ammo acids from these peptides renders them functionally inactive As such, GLP-1 has been shown to be an effective anti-diabetic therapy in Type 2 diabetic patients and to reduce the meal- related insulin requirement in Type 1 diabetic patients Moreover, GLP-1 and/or GIP are believed to regulate satiety, hpidemia and osteogenesis, and therefore exogenous GLP-1 has been proposed as a treatment for patients suffering from acute coronary syndrome, angina and ischemic heart disease As those of skill in the art will appreciate, several methods have been used in the past and continue to be used to discover selective inhibitors of biomolecular targets such as DPP-IV The various approaches include ligand-directed drug discovery (LDD), quantitative structure activity relationship (QSAR) analyses, and comparative molecular field analysis (CoMFA) CoMFA is a particular type of QSAR method that uses statistical correlation techniques for the analysis of the quantitative relationship between the biological activity of a set of compounds with a specified alignment, and their three- dimensional electronic and steπc properties Other properties such as hydrophobicity and hydrogen bonding can also be incorporated into the analysis
An invaluable component of these drug discovery approaches is structure based design, which is a design strategy for new chemical entities, or optimization of lead compounds identified by other methods, using the 3D structure of the biological macromolecule target obtained by for example, X-ray or nuclear magnetic resonance (NMR) studies, or from homology models. Analyzing 3-D structures of proteins provides crucial insights into the behavior and mechanics of drug binding and biological activity. Coupled with computational techniques including modeling and simulation, the study of biomolecular interactions provides details of events that may be difficult to investigate experimentally in the laboratory, and can help reveal topological features important for determining molecular recognition. As those skilled in the art will recognize, this information can, in turn, be used for predicting ligand-receptor complex formation, and for designing ligands and protein mutations that produce desired ligand receptor interactions. Although, crystal structures of DPP-IV have been previously disclosed, none have been able to solve a three-dimensional structure with only one molecule per asymmetric unit, in turn providing a greater advantage for iterative structure based design. (Aertgeets et al., Protein Science, 13:412-421 (2004) hDPP-IV complexed with a decapaptide; Engel et al., PNAS, 100:9:5063-5068 (2003) crystal structure of native porcine DPP-IV; Hiramatsu et al., BBRC, 302:849-854 (2003) crystal structure of hDPP-IV at 2.6A resolution; Rasmussen et al., Nature Structural Biology, 10:19-25 (2003) 2.5A structure of the extracellular region of DPP-IV in complex with the inhibitor valine-pyrrolidide; Thoma et al., Structure, 11 :947-959 (2003) expressed and purified the ectodomain of human DPP-IV in Pichia pastoris and determined X-ray structure at 2.1 A resolution.) To that end, the quest for specific and potent DPP-IV inhibitors for use in physiological studies and therapeutic settings continues. Thus, obtaining three-dimensional (3D) structures of DPPs, such as DPP-IV as a single molecule of an asymmetric unit, by, for example, X-ray or NMR studies, or from homology models, and analyzing the structures using computational methods, facilitates such discovery efforts. SUMMARY OF THE INVENTION The present invention provides crystalline compositions of DPP-IV, and specifically of DPP-IV, having one molecule per asymmetric unit. The invention further provides methods of preparing said compositions, methods of determining the 3-D X-ray atomic coordinates of said crystalline compositions, methods of using the atomic coordinates in conjunction with computational methods to identify binding site(s), methods to elucidate the 3-D structure of homologues of DPP-IV, and methods to identify ligands which interact with the binding site(s) to agonize or antagonize the biological activity of DPP-IV, methods for identifying inhibitors of DPP-IV, pharmaceutical compositions of inhibitors, and methods of treatment of Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome,
inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes, using said pharmaceutical compositions. In a preferred embodiment the invention provides crystalline compositions of the extracellular domain of DPP-IV (residues 31 -766 of SEQ ID NO:1 ), whereby the crystal structure is derived from 5 mammal, preferably human. One aspect of the present invention provides methods for crystallizing a DPP-IV polypeptide. Preferably the methods for crystallizing the DPP-IV polypeptide comprising an amino acid sequence spanning the amino acids 31 to 766 listed in SEQ ID NO:1 comprise the steps of: (a) preparing solutions of the polypeptide and precipitant; (b) growing a crystal comprising molecules of the polypeptide from said 10 mixture solution; and (c) separating said crystal from said solution. The crystallization growth can be carried out by various techniques known by those skilled in the art, such as for example, batch crystallization, liquid bridge, vapor diffusion, crystallization, or dialysis crystallization. Preferably, the crystallization growth is achieved using vapor diffusion techniques. An embodiment of the present invention provides crystalline compositions of DPP-IV comprising a 15 crystalline form of a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , wherein the crystalline composition has a space group P432!2 and unit cell dimensions a=b=68.7 A, c= 421.2 A. In a second aspect, the present invention provides vectors useful in methods for preparing a substantially purified extracellular domain of DPP-IV comprising the polypeptide with an amino acid 20 sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1. Yet another embodiment of the present invention provides a DPP-IV crystal of SEQ ID NO:2, or a homologue, analogue or variant thereof. In a third aspect, the present invention provides methods for determining the X-ray atomic coordinates of the crystalline compositions at a 2.7A resolution. 25 In a fourth aspect, the present invention provides a molecule or molecular complex crystal, wherein the crystal has substantially similar atomic coordinates to the atomic coordinates listed in FIG. 2 or portions thereof, or any scalable variations thereof. In a fifth aspect, the present invention provides a molecule or molecular complex crystal, wherein the crystal comprises a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 30 listed in SEQ ID NO:1. A further embodiment of the invention provides a crystal comprising an amino acid sequence that is at least 98% or 95% homologous to a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 listed in SEQ ID NO:1. An even further embodiment of the invention provides a crystal comprising an amino acid sequence that is at least 98% or 95% homologous to a polypeptide with an amino acid sequence spanning the amino 35 acids Gly31 to Pro766 listed in SEQ ID NO:1 , and having the atomic coordinates listed in FIG. 2. In a sixth aspect, the present invention provides a molecule or molecular complex crystal, wherein the crystal comprises a polypeptide, or a portion thereof, with atomic coordinates having a root mean square deviation from the protein backbone atoms (N, Cα, C, and O) listed in FIG. 2 of less than 0.2, 0.5, 0.7, 1.0, 1.2, 1.5, 2.0 or 2.5 A. 40. In a seventh aspect, the present invention provides a scalable, or translatable, three dimensional configuration of points derived from structural coordinates of at least a portion of a DPP-IV molecule or
molecular complex comprising a polypeptide with an amino acid sequence spanning the amino acids Gly31 to Pro766 listed in SEQ ID NO:1. In an embodiment of this aspect, the invention also comprises the structural coordinates of at least a portion of a molecule or a molecular complex that is structurally homologous to a DPP-IV molecule or molecular complex. On a molecular scale, the configuration of points derived from a homologous molecule or molecular complex has a root mean square deviation of less than about 0.2, 0.5, 0.7, 1.0, 1.2, 1.5, 2.0 or 2.5 A from the structural coordinates provided in FIG. 2. In an eighth aspect, the invention provides computers for producing a three-dimensional respresentation of aspect eight of the present invention can be used to design and identify potential ligands or inhibitors of DPP-IV by, for example commercially available molecular modeling software in conjunction with structure-based drug design as provided herein. In a further aspect, the present invention provides computer for producing three-dimensional representations of: a. a molecule or molecular complex comprising a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , or a homologue, or a variant thereof; b. a molecule or molecular complex, wherein the atoms of the molecule or molecular complex are represented by atomic coordinates that are substantially similar to, or are subsets of, the atomic coordinates listed in FIG. 2; c. a molecule or molecular complex, wherein the molecule or molecular complex comprises atomic coordinates having a root mean square deviation of less than 0.2, 0.5, 0.7, 1.0, 1.2 , 1.5, 2.0 or even 2.5 A from the atomic coordinates for the carbon backbone atoms listed in FIG. 2; or d. a molecule or molecular complex, wherein the molecule or molecular complex comprises a binding pocket or site defined by the structure coordinates that are substantially similar to the atomic coordinates listed in FIG. 2, or a subset thereof, or more preferably the structural coordinates in FIG. 2 corresponding to one or more DPP-IV amino acids, or conservative replacements thereof, in SEQ ID NO:1 selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740; wherein said computer comprises: (i) a computer-readable data storage medium comprising a data storage medium encoded with computer-readable data, wherein said data comprises the structure coordinates of FIG. 2, or portions thereof, or substantially similar coordinates thereof; (ii) a working memory for storing instructions for processing said computer-readable data; (iii) a central-processing unit coupled to said working memory and to said computer- readable data storage medium for processing said computer-machine readable data into said three- dimensional representation; and (iv) a display coupled to said central-processing unit for displaying said representation. In a ninth aspect, the present invention provides methods involving molecular replacement to obtain structural information about a molecule or molecular complex of unknown structure. In one embodiment, the method includes crystallizing the molecule or molecular complex, generating an x-ray diffraction pattern from the crystallized molecule or molecular complex, and applying at least a portion of
the structure coordinates set forth in FIG. 2 to the x-ray diffraction pattern to generate a three-dimensional electron density map of at least a portion of the molecule or molecular complex. In yet another aspect, the present invention provides methods for generating 3-D atomic coordinates of protein homologues, analogues, or variants of DPP-IV using the x-ray coordinates of DPP- IV described in FIG. 2, comprising: a. identifying one or more homologous polypeptide sequences to DPP-IV; b. aligning said sequences with the sequence of DPP-IV which comprises a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 ; c identifying structurally conserved and structurally variable regions between said homologous sequence(s) and DPP-IV; d. generating 3-D coordinates for structurally conserved residues of the said homologous sequence(s) from those of DPP-IV using the coordinates listed in FIG. 2; e. generating conformations for the loops in the structurally variable regions of said homologous sequence(s); f. building the side-chain conformations for said homologous sequence(s); and g. combining the 3-D coordinates of the conserved residues, loops and side-chain conformations to generate full or partial 3-D coordinates for said homologous sequences. Embodiments of the ninth aspect provide methods which further comprise refining and evaluating the full or partial 3-D coordinates. These methods may thus be used, for example, to generate 3- dimensional structures for proteins for which 3-dimensional atomic coordinates have not been determined. As such, the newly generated structure can help to elucidate enzymatic mechanisms, or be used in conjunction with other molecular modeling techniques in structure based drug design. In the tenth aspect, the present invention provides methods for identifying inhibitors, ligands, and the like, of DPP-IV by providing the coordinates of a molecule of DPP-IV to a computerized modeling system; identifying chemical entities that are likely to bind or interact with the molecule (e.g., by screening a small molecule library); and, optionally, procuring or synthesizing and assaying the compounds or analogues derived thereof for bioactivity. Further aspects of the present invention relate to methods for identifying potential ligands for DPP-IV or homologues, or analogue or variants thereof comprising: a. displaying the three dimensional structure of DPP-IV enzyme or homologue or analogue or variant thereof, as defined by atomic coordinates that are substantially similar to the atomic coordinates listed in FIG. 2 on a computer display screen; b. optionally replacing one or more the enzyme amino acid residues listed in SEQ ID NO:1 , or preferably one or more amino acid residues selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740, in said three-dimensional structure with a different naturally occurring amino acid or an unnatural amino acid to display a variant structure; c. optionally conducting ab intio, molecular mechanics or molecular dynamics calculations on the displayed three dimensional structure to generate a modified structure;
d. employing said three-dimensional structure, variant structure, or modified structure to design or select said ligand; d. synthesizing or obtaining said ligand; e. contacting said ligand with said enzyme in the presence of one or more substrates; and f. measuring the ability of said ligand to modulate the activity of said enzyme. Those of skill in the art can appreciate that the information obtained by the methods for identifying inhibitors and ligands of DPP-IV, as described above, can be used to iteratively refine or modify the structure of original ligand. Thus, once a ligand is found to modulate the activity of said enzyme, the structural aspects of the ligand may be modified to generate a structural analog of the ligand. This analog can then be used in the above methods to identify binding ligands. One of ordinary skill in the art will know the various ways a structure may be modified. In embodiments, the methods further comprise computationally modifying the structure of the ligand; computationally determining the fit of the modified ligand using the three-dimensional coordinates described in FIG. 2, or portions thereof; contacting said modified ligand with said enzyme, or homologue, or variant thereof in an in vitro or in vivo setting; and measuring the ability of said ligand to modulate the activity of said enzyme. In an eleventh aspect, the present invention provides compositions, such as, pharmaceutical compositions comprising the inhibitors or ligands designed according to any of the methods of the present invention. In one embodiment, a composition is provided that includes an inhibitor or ligand designed or identified by any of the above methods. In another embodiment, the composition is a pharmaceutical composition. The twelfth aspect of the present invention are methods for treating Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome, inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes, comprising administering pharmaceutical compositions, identified by structure based design using the atomic coordinates, or portions thereof, listed in FIG. 2, effective in treating the disorders or conditions.
BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 is an orthogonal view of an embodiment of DPP-IV shown in a ribbon representation.
The N- and C-termini of the polypeptide are also depicted. Figure 2 is a list of the X-ray coordinates of a DPP-IV crystal as described in the Examples.
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to crystalline compositions of DPP-IV, 3-D X-ray atomic coordinates of said crystalline compositions, methods of preparing said compositions, methods of determining the 3-D X-ray atomic coordinates of said crystalline compositions, and methods of using said atomic coordinates in conjunction with computational methods to identify binding site(s), or identify ligands which interact with said binding site(s) to agonize or antagonize DPP-IV. For convenience, certain terms employed in the specification, examples, and appendant claims are collected here. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the arts. The term "affinity" as used herein refers to the tendency of a molecule to associate with another. The affinity of a drug is its ability to bind to its biological target (receptor, enzyme, transport system, etc.) For pharmacological receptors, affinity can be thought of as the frequency with which the drug, when brought into the proximity of a receptor by diffusion, will reside at a position of minimum free energy within the force field of that receptor. The term "agonist" as used herein refers to an endogenous substance or a drug that can interact with a receptor and initiate a physiological or a pharmacological response characteristic of that receptor (contraction, relaxation, secretion, enzyme activation, etc.) The term "analog" as used herein refers to a drug or chemical compound whose structure is related in some way to that of another drug or chemical compound, but whose chemical and biological properties may be quite different. The term "antagonist" as used herein refers to a drug or a compound that opposes the physiological effects of another. At the receptor level, it is a chemical entity that opposes the receptor- associated responses normally induced by another bioactive agent. The term "asymmetric unit" refers to the basic motif, which is repeated in 3-D space by the symmetry operators of the crystallographic space group, of which the coordinated of the structure are determined. It is the smallest part of the crystal structure from which the complete structure can be built using space group symmetry. As used herein the term "binding site" refers to a specific region (or atom) in a molecular entity that is capable of entering into a stabilizing interaction with another molecular entity. In certain embodiments the term also refers to the reactive parts of a macromolecule that directly participate in its specific combination with another molecule. In other embodiments, a binding site may be comprised or defined by the three dimensional arrangement of one or more amino acid residues within a folded polypeptide. In yet further embodiments, the binding site further comprises prosthetic groups, water molecules or metal ions which may interact with one or more amino acid residues. Prosthetic groups, water molecules, or metal ions may be apparent from X-ray crystallographic data, or may be added to an apo protein or enzyme using in silico methods. The term "bioactivity" refers to DPP-IV activity that exhibits a biological property conventionally associated with a DPP-IV agonist or antagonist, such as a property that would allow treatment of one or more of the various diseases such as, for example, diabetes, glucose tolerance, obesity, appetite regulation, lipidemia, osteoporosis, neuropeptide metabolism and T-cell activation, among others.
The term "catalytic domain" as used herein, refers to the catalytic domain of the DPP-IV class of enzymes, which features a conserved segment of amino acids in the carboxy-terminal portion of the proteins, wherein this segment has been demonstrated to include the catalytic site of these enzymes. This conserved catalytic domain extends approximately from residue 552 to 766 of the full-length enzyme of DPP-IV (SEQ ID NO:1 ). "To clone" as used herein, means obtaining exact copies of a given polynucleotide molecule using recombinant DNA technology. Furthermore, "to clone into" may be meant as inserting a given first polynucleotide sequence into a second polynucleotide sequence, preferably such that a functional unit combining the functions of the first and the second polynucleotides results. For example, without limitation, a polynucleotide from which a fusion protein may be translationally provided, which fusion protein comprises amino acid sequences encoded by the first and the second polynucleotide sequences. Specifics of molecular cloning can be found in a number of commonly used laboratory protocol books such as Molecular Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989). The term "co-crystallization" as used herein is taken to mean crystallization of a preformed protein/I igand complex. The term "complex" or "co-complex" are used interchangeably and refer to a DPP-IV molecule, or a variant, or homologue of DPP-IV in covalent or non-covalent association with a substrate, ligand, inhibitor. The term "contacting" as used herein applies to in silico, in vitro, and/or in vivo experiments. "Diseases" and particularly "diseases that are associated with proteins that are subject to processing by DPP-IV", include, but are not limited to, for example, Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome, inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes. The term "extracellular domain" as used herein, refers to the extracellular domain of DPP-IV, which features a conserved segment of amino acids, whereby this segment has been demonstrated to include glycosylation sites, a cysteine-rich region and the catalytic active site. This conserved extracellular domain extends approximately from residue Gly31 to Pro766 of the full length enzyme (SEQ ID NO:1 ). As used herein, the terms "gene", "recombinant gene" and "gene construct" refer to a nucleic acid comprising an open reading frame encoding a polypeptide, including both exon and (optionally) intron sequences. The term "intron" refers to a DNA sequence present in a given gene which is not translated into protein and is generally found between exons. "Human DPP-IV" (hDPP-IV) is a cell surface type-ll membrane glycoprotein and is also called adenosine deaminase (ADA) binding protein or CD26, which is known as a T-cell activation antigen.
hDPP-IV is a single polypeptide chain of 766 amino acids, which consists if five regions: a cytoplasmic region (residues about 1 -6), a transmembrane region (residues about 7-28), a highly gycosylated region (residues about 29-323), a cysteine-rich region (residues about 324-551 ), and a catalytic region (residues about 552-766) (Hiramatsu et al., (2003)). The term "high affinity" as used herein means strong binding affinity between molecules with a dissociation constant KD of no greater than 1 μM. In a preferred case, the KD is less than 100 nM, 10 nM, 1 nM, 100 pM, or even 10 pM or less. In a most preferred embodiment, the two molecules can be covalently linked (KD is essentially 0). The term "homologue" as used herein means a protein, polypeptide, oligopeptide, or portion thereof, having preferably at least 95% amino acid sequence identity with DPP-IV enzyme as described in SEQ ID NO: 1 or SEQ ID NO:2 or with any extracellular domain described herein, or with any functional or structural domain of lipid binding protein. SEQ ID NO:1 is the full-length amino acid sequence of a wild- type human DPP-IV, while SEQ ID NO:2 is the hDPP-IV construct including residues 31 -766 which, as described in the Examples, was purified, expressed and crystallized. Those of skill in the art understand that a set of structure coordinates determined by X-ray crystallography is not without standard error. As used herein, and for the purpose of this invention, the term "substantially similar atomic coordinates" or atomic coordinates that are "substantially similar" refers to any set of structure coordinates of DPP-IV or DPP-IV homologues, or DPP-IV variants, polypeptide fragments, described by atomic coordinates that have a root mean square deviation for the atomic coordinates of protein backbone atoms (N, Cα, C, and O) of less than about 2.5, 2.0 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A when superimposed using backbone atoms of structure coordinates listed in FIG. 2. For the purpose of this invention, structures that have substantially similar coordinates as those listed in FIG. 2 shall be considered identical to the coordinates listed in FIG. 2. The term "substantially similar" also applies to an assembly of amino acid residues that may or may not form a contiguous polypeptide chain, but whose three dimensional arrangement of atomic coordinates have a root mean square deviation for the atomic coordinates of protein backbone atoms (N, Cα, C, and O), or the side chain atoms, of less than about 2.5, 2.0, 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A when superimposed-using backbone atoms, or the side chain atoms- of the atomic coordinates of similar or the same amino acids from the coordinates listed in FIG. 2. Those skilled in the art further understand that the coordinates listed in FIG. 2 or portions thereof may be transformed into a different set of coordinates using various mathematical algorithms without departing from the present invention. For example, the coordinates listed in Fig. 2, or portions thereof, may be transformed by algorithms which translate or rotate the atomic coordinates. Alternatively, molecular mechanics, molecular dynamics or ab intio algorithms may modify the atomic coordinates. Atomic coordinates generated from the coordinates listed in FIG. 2, or portions thereof, using any of the aforementioned algorithms shall be considered identical to the coordinates listed in FIG. 2. The term "in silico" as used herein refers to experiments carried out using computer simulations. In an embodiments, the in silico methods are molecular modeling methods wherein 3-dimensional models of macromolecules or ligands are generated. In other embodiments, the in silico methods comprise computationally assessing ligand binding interactions. The term "ligand" describes any molecule, e.g., protein, peptide, peptidomimetics, oligopeptide, small organic molecule, polysaccharide, polynucleotide, etc., which binds or interacts, generally but not
necessarily specifically to or with another molecule. In one aspect the ligand is an agonist, whereby the molecule upregulates (i.e., activates or stimulates, e.g., by agonizing or potentiating) activity, while in another aspect of the invention the ligand is an inhibitor or antagonist, whereby the molecule down regulates (i.e., inhibits or suppresses, e.g. by antagonizing, decreasing or inhibiting) the activity. The term "modulate" as used herein refers to both upregulation (i.e., activation or stimulation, e.g., by agonizing or potentiating) and down-regulation (i.e., inhibition or suppression, e.g., by antagonizing, decreasing or inhibiting) of a bioactivity. The term "pharmacophore" as used herein refers to the" ensemble of steric and electronic features of a particular structure that is necessary to ensure the optimal supramolecular interactions with a specific biological target structure and to trigger (or to block) its biological response. In an embodiments, a pharmacophore is an abstract concept that accounts for the common molecular interaction capacities of a group of compounds towards their target structure. In yet a further embodiments, the term can be considered as the largest common denominator shared by a set of active molecules. Pharmacophoric descriptors are used to define a pharmacophore, including H-bonding, hydrophobic and electrostatic interaction sites, defined by atoms, ring centers and virtual points. Accordingly, in the context of enzyme ligands, such as for example agonists or antagonists, a pharmacophore may represent an ensemble of steric and electronic factors which are necessary to insure supramolecular interactions with a specific biological target structure. As such, a pharmacophore may represent a template of chemical properties for an active site of a protein/enzyme representing these properties' spatial relationship to one another that theoretically defines a ligand that would bind to that site. The term "precipitant" as used herein includes any substance that, when added to a solution, causes a precipitate to form or crystals to grow. Examples of suitable precipitants include, but are not limited to, alkali (e.g., Li, Na, or K), or alkaline earth metal (e.g., Mg, or Ca) salts, and transition metal (e.g., Mn, or Zn) salts. Common counter ions to the metal ions include, but are not limited to, halides, phosphates, citrates and sulfates. The term "prodrug" as used herein refers to drugs that, once administered, are chemically modified by metabolic processes to become pharmaceutically active. In certain embodiments the term also refers to any compound that undergoes biotransformation before exhibiting its pharmacological effects. Prodrugs can thus be viewed as drugs containing specialized non-toxic protective groups used in a transient manner to alter or to eliminate properties, usually undesirable, in the parent molecule. The term "receptor" as used here in refers to a protein or a protein complex in or on a cell that specifically recognizes and binds to a compound acting as a molecular messenger (neurotransmitter, hormone, lymphokine, lectin, drug, etc.). In a broader sense, the term receptor is used interchangeably with any specific (as opposed to non-specific, such as binding to plasma proteins) drug binding site, also including nucleic acids such as DNA. The term "recombinant protein" refers to a polypeptide which is produced by recombinant DNA techniques, wherein generally, DNA encoding a polypeptide is inserted into a suitable expression vector which is, in turn, used to transform a host cell to produce the polypeptide encoded by the DNA. This polypeptide may be one that is naturally expressed by the host cell, or it may be heterologous to the host cell, or the host cell may have been engineered to have lost the capability to express the polypeptide which is otherwise expressed in wild type forms of the host cell. The polypeptide may also be, for
example, a fusion polypeptide. Moreover, the phrase "derived from", with respect to a recombinant gene, is meant to include within the meaning of "recombinant protein" those proteins having an amino acid sequence of a native polypeptide, or an amino acid sequence similar thereto which is generated by mutations, including substitutions, deletions and truncation, of a naturally occurring form of the polypeptide. As used herein, the term "selective DPP-IV inhibitor" refers to a substance, such as for example, an organic molecule, that effectively inhibits an enzyme from the DDP-IV family to a greater extent than any other DPP enzyme. A selective DPP-IV inhibitor is a substance, having a K, for inhibition of DPP-IV that is less than about one-half, one-fifth, or one-tenth the K, that the substance has for inhibition of any other DPP enzyme. In other words, the substance inhibits DPP-IV activity to the same degree at a concentration of about one-half, one-fifth, one-tenth or less than the concentration required for any other DPP enzyme. In general a substance is considered to effectively inhibit DPP-IV if it has an IC50 or K, of less than or about 10 mM, 1 mM, 500 nM, 100 nM, 50 nM or 10 nM. As used herein the term "small molecules" refers to drugs as they are orally available (unlike proteins which must be administered by injection, topically or inhalation). The size of the small molecules is generally under 1000 Daltons, but many estimates seem to range between 300 to 700 Daltons. The term "space group" refers to the lattice and symmetry of the crystal. In a space group designation the capital letter indicates the lattice type and the other symbols represent symmetry operations that can be carried out on the contents of the asymmetric unit without changing its appearance. By "therapeutically effective" amount is meant that amount which is capable of at least partially reversing and/or treat the symptoms of the disease. A therapeutically effective amount can be determined on an individual basis and will be based, at least in part, on a consideration of the species of the mammal, the size of the mammal, the type of delivery system used, and the type of administration relative to the progression of the disease. A therapeutically effective amount can be determined by one of ordinary skill in the arts. As used herein, the term "transfection" means the introduction of a nucleic acid, e.g., via an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. "Transformation" refers to a process in which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or RNA and, for example, the transformed cell expresses a recombinant form of a polypeptide or, in the case of anti-sense expression from the transferred gene, the expression of a naturally-occurring form of the polypeptide is disrupted. The term "variants" in relation to the polypeptide sequence in SEQ ID NO:1 or SEQ ID NO:2 include any substitution of, variation of, modification of, replacement of, deletion of, or addition of one or more amino acids from or to the sequence providing a resultant polypeptide sequence for an enzyme having DPP-IV activity. Preferably the variant, homologue, fragment or portion, of SEQ ID NO:1 or SEQ ID NO:2, comprises a polypeptide sequence of at least 5 contiguous amino acids, preferably at least 10 contiguous amino acids, preferably at least 15 contiguous amino acids, preferably at least 20 contiguous amino acids, preferably at least 25 contiguous amino acids, or preferably at least 30 contiguous amino acids.
The term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of preferred vector is an episome, i.e., a nucleic acid capable of extra- chromosomal replication. Suitable host-vector systems include but are not limited to mammalian cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression elements of vectors vary in their strengths and specificities. As those depending on the host-vector system utilized, any one of a number of suitable transcription and translation elements may be used. Preferred vectors are those capable of autonomous replication and/or expression of nucleic acids to which they are linked. Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as
"expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of "plasmids" which refer generally to circular double stranded DNA loops which, in their vector form are not bound to the chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such othefr forms of expression vectors which serve equivalent functions and which become known in the art subsequently hereto. The following amino acid abbreviations are used throughout this disclosure: A = Ala = Alanine T = Thr = Threonine V = Val = Valine C = Cys = Cysteine L = Leu = Leucine Y = Tyr = Tyrosine I = lie = Isoleucine N = Asn = Asparagine P = Pro = Proline Q = Gin = Glutamine F = Phe = Phenylalanin D = Asp = Aspartic Acid W = Trp = Tryptophan E = Glu = Glutamic Acid M = Met = Methionine K = Lys = Lysine G = Gly = Glycine R = Arg = Arginine S = Ser = Serine H = His = Histidine A. Clones and Expressions As would be appreciated by those skilled in the art, the nucleotide sequence coding for a DPP-IV polypeptide, or a functional fragment, including the C-terminal peptide fragment of the catalytic domain of DPP-IV protein, and/or derivatives or analogs thereof, including a chimeric protein, thereof, can be inserted into a suitable expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of the inserted protein-coding sequence. The elements mentioned above are termed herein a "promoter." Thus, the nucleic acid encoding a DPP-IV polypeptide of the invention or a functional fragment comprising the extracellular domain of of the DPP-IV protein, or a homologue, an analog, or variant thereof, is operationally associated with a promoter in an expression vector of the invention. In preferred embodiments, the expression vector contains the nucleotide sequence coding for the polypeptide comprising the amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1. Both cDNA and genomic sequences can be cloned and expressed under the control of such regulatory sequences. An expression vector also preferably includes a replication origin. The necessary transcriptional and translational signals can be provided on a recombinant expression vector. As detailed below, all genetic manipulations described for the DPP-IV gene in this section, may also be employed for genes encoding a functional fragment, including the C-terminal peptide fragment of the catalytic domain of the DPP-IV protein, derivatives or analogs thereof, including a chimeric protein thereof.
Suitable host-vector systems include but are not limited to mammalian cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression elements of vectors vary in their strengths and specificities. As those depending on the host-vector system utilized, any one of a number of suitable transcription and translation elements may be used. A recombinant DPP-IV protein of the invention may be expressed chromosomally, after integration of the coding sequence by recombination. In this regard, any of a number of amplification systems may be used to achieve high levels of stable gene expression. (See Sambrook et al., 1989, infra). A suitable cell for purposes of this invention is one into which the recombinant vector comprising the nucleic acid encoding DPP-IV protein is cultured in an appropriate cell culture medium under conditions that provide for expression of DPP-IV protein by the cell. Any of the methods previously described for the insertion of DNA fragments into a cloning vector may be used to construct expression vectors containing a gene consisting of appropriate transcriptional/translational control signals and the protein coding sequences. These methods may include in vitro recombinant DNA and synthetic techniques, and in vivo recombination (genetic recombination). Expression of the DPP-IV protein may be controlled by any promoter/enhancer element known in the art, provided that these regulatory elements must be functional in the host selected for expression, as would be appreciated by those of skill in the art. Vectors containing a nucleic acid encoding a DPP-IV protein of the invention can be identified, for example, by four general approaches: (1 ) PCR amplification of the desired plasmid DNA or specific mRNA; (2) nucleic acid hybridization; (3) presence or absence of selection marker gene functions; and (4) expression of inserted sequences. The invention is further intended to include other forms of identification of vectors, containing a nucleic acid encoding a DPP-IV protein of the present invention, which serve equivalent functions and which become known in the art subsequently hereto. In the first approach, the nucleic acids can be amplified by PCR to provide for detection of the amplified product. In the second approach, the presence of a foreign gene inserted in an expression vector can be detected by nucleic acid hybridization using probes comprising sequences that are homologous to an inserted marker gene. In the third approach, the recombinant vector/host system can be identified and selected based upon the presence or absence of certain "selection marker" gene functions (e.g., beta-galactosidase activity, thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion body formation in baculovirus, etc.) caused by the insertion of foreign genes in the vector. In another example, if the nucleic acid encoding DPP-IV protein is inserted within the "selection marker" gene sequence of the vector, recombinant vectors containing the DPP-IV protein insert can be identified by the absence of the DPP-IV protein gene function. In the fourth approach, recombinant expression vectors can be identified by assaying for the activity, biochemical, or immunological characteristics of the gene product expressed by the recombinant vector, provided that the expressed protein assumes a functionally active conformation. A wide variety of host/expression vector combinations may be employed in expressing the DNA sequences of this invention as known by those of skill in the art.
Once a particular recombinant DNA molecule is identified and isolated, several methods known in the art may be used to propagate it. Once a suitable host system and growth conditions are established, recombinant expression vectors can be propagated and prepared in quantity. As noted above, the expression vectors which can be used include, but are not limited to, the following vectors or their derivatives: human or animal viruses such as vaccinia virus or adenovirus; insect viruses such as baculovirus; yeast vectors; bacteriophage vectors (e.g., lambda); and plasmid and cosmid DNA vectors, to name but a few. Vectors can be introduced into the desired host cells by methods known in the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun, or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621 -14624; Hartmut et al., Canadian Patent Application No. 2,012,311 , filed Mar. 15, 1990).
B. Crystal and Space Groups X-ray structure coordinates define a unique configuration of points in space. Those skilled in the art understand that a set of structure coordinates for a protein or a protein/ligand complex, or a portion thereof, define a relative set of points that, in turn, define a configuration in three dimensions. A similar or identical configuration can be defined by an entirely different set of coordinates, provided the distances and angles between atomic coordinates remain essentially the same. In addition, a scalable configuration of points can be defined by increasing or decreasing the distances between coordinates by a scalar factor while keeping the angles essentially the same. One aspect of the present invention relates to a crystalline composition comprising a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1. In another embodiment, the crystallized complex is characterized by the structural coordinates listed in FIG. 2, or portions thereof. In yet a further embodiments, the atoms of the ligand are within about 5 angstroms of one or more DPP-IV amino acids in SEQ ID NO: 1 preferably selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740. One embodiment of the crystallized complex is characterized as belonging to the space group P432!2 and unit cell dimensions a=b=68.7 A, c= 421.2 A, α=β=γ=90.0°. This embodiment is encompassed by the structural coordinates of FIG. 2. The ligand may be a small molecule which binds to DPP-IV extracelluar domain defined by SEQ ID NO:2, or portions thereof, with a Kj of less than about 10 mM, 1 mM, 500 nM, 100 nM, 50 nM, or 10 nM. Various computational methods can be used to determine whether a molecule or a binding pocket portion thereof is "structurally equivalent," defined in terms of its three-dimensional structure, to all or part of DPP-IV or its binding pocket(s). Such methods may be carried out in current software applications, such as the molecular similarity application of QUANTA (Accelrys Inc., San Diego, Calif.). The molecular similarity application permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure. The procedure used in molecular similarity to compare structures is divided into four steps: (1 ) load the structures to be compared; (2) optionally define the atom equivalences in these structures; (3) perform a fitting operation; and (4) analyze the results. Each structure is identified by a name. One structure is identified as the target (i.e., the fixed structure), while all remaining structures are working structures (i.e., moving structures). Since atom equivalency
within molecular similarity applications is defined by user input, for the purpose of this invention equivalent atoms are defined as protein backbone atoms (N, Cα, C, and O) for all conserved residues between the two structures being compared. A conserved residue is defined as a residue that is structurally or functionally equivalent (See Table 4 infra). In further embodiments rigid fitting operations are considered. In other embodiments, flexible fitting operations may be considered. When a rigid fitting method is used, the working structure is translated and rotated to obtain an optimum fit with the target structure. The fitting operation uses an algorithm that computes the optimum translation and rotation to be applied to the moving structure, such that the root mean square deviation of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number, given in angstroms (A), is reported by the molecular similarity application. Any molecule or molecular complex or binding pocket thereof, or any portion thereof, that has a root mean square deviation of conserved residue backbone atoms (N, Cα, C, and O) of less than about 2.5, 2.0, 1.5 A, 1.0 A, 0.7 A, 0.5 A or even 0.2 A, when superimposed on the relevant backbone atoms described by the reference structure coordinates listed in FIG. 2, is considered "structurally equivalent" to the reference molecule. That is to say, the crystal structures of those portions of the two molecules are substantially identical, within acceptable error. Particularly preferred structurally equivalent molecules or molecular complexes are those that are defined by the entire set of structural coordinates listed in FIG. 2, plus or minus a root mean square deviation from the conserved backbone atoms of those amino acids of not more than 2.5 A. More preferably, the root mean square deviation is less than about 1.0 A. The term "root mean square deviation" means the square root of the arithmetic mean of the squares of the deviations. It is a way to express the deviation or variation from a trend or object. For purposes of this invention, the "root mean square deviation" defines the variation in the backbone of a protein from the backbone of DPP-IV or a binding pocket portion thereof, as defined by the structural coordinates of DPP-IV described herein. The refined x-ray coordinates of the extracellular domain of DDP-IV (amino acids 38 to 766 as listed in SEQ ID NO:2), Zn2+, Mg2+, and 32 water molecules are as listed in FIG. 2. One orthogonal view of the molecule is shown in FIG. 1. The crystal structure of the extracellular domain (amino acids 31 -766 of SEQ ID NO:1 ) was solved to a resolution 2.7 A. The asymmetric unit is composed of one dimer. In the structure of the present invention the structure includes two domains, the β-propeller domain (residues 55-497) and the catalytic domain (residues 508-766), together with a couple of linker regions (1 -54 and 498-507). The propeller domain packs against the hydrolase domain, and the catalytic triad of DPP-IV composed of residues Ser630, His740 and Asp708, which are located which the last 140 residues of the C-terminal region is at the interface of the two domains. The present invention provides a molecule or molecular complex that includes at least a portion of a DDP-IV and/or substrate binding pocket. In one embodiment, the DDP-IV binding pocket includes the amino acids listed in Table 1 , the binding pocket being defined by a set of points having a root mean square deviation of less than about 2.5, 2.0, 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A, from points representing the backbone atoms of the amino acids in Table 1. In another embodiment, the DPP-IV substrate binding pocket includes the amino acids selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740 from SEQ ID NO:1.
Table 1 : Identified residues 5 A away from the binding pocket of the DPP-IV crystal structure.
C. Isolated Polypeptides and Variants One embodiment of the invention describes an isolated polypeptide consisting of a portion of DPP-IV which functions as the binding site when folded in a 3-D orientation. One embodiment is an isolated polypeptide comprising a portion of DPP-IV, wherein the portion starts at about amino acid residue Gly31 , and ends at about amino acid residue Pro766 as described in SEQ ID NO: , or a sequence that is at least 95% or 98% homologous to a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 as listed in SEQ ID NO:1. Another embodiment comprises crystalline compositions comprising variants of DPP-IV. Variants of the present invention may have an amino acid sequence that is different by one or more amino acid substitutions to the sequence disclosed in SEQ ID NO:1 or SED ID NO:2. Embodiments which comprise amino acid deletions and/or additions are also provided. The variant may, for example, have conservative changes (amino acid similarity), wherein a substituted amino acid has similar structural or chemical properties, for example, the replacement of leucine with isoleucine. Those skilled in the art will understand that determining which and how many amino acid residues may be substituted, inserted, or deleted without adversely affecting biological or pharmacological activity may be reasonably inferred in view of this disclosure, and may further be found using computer programs well known in the art, for example, DNAStar® software (DNAStar Inc. Madison, Wl). As those silled in the art will appreciate, amino acid substitutions may be made, for instance, on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues provided that a biological and/or pharmacological activity of the native molecule is retained. Negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; amino acids, with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, and valine; amino acids with aliphatic head groups include glycine, alanine; asparagine, glutamine, serine; and amino acids with aromatic side chains include threonine, phenylalanine, and tyrosine. Examples of conservative substitutions are set forth in Table 4 as follows:
Table 4:
"Homology" is a measure of the identity of nucleotide sequences or amino acid sequences. To characterize the homology, subject sequences are aligned so that the highest percentage homology (match) is obtained, after introducing gaps, if necessary, to achieve maximum percent homology. N- or C- terminal extensions shall not be construed as affecting homology. "Identity" per se has an art-recognized meaning and can be calculated using published techniques. Computer program methods to determine identity between two sequences, for example, include DNAStar® software; the GCG® program package (Devereux, J., et al. Nucleic Acids Research (1984) 12(1 ): 387); BLASTP, BLASTN, FASTA (Atschul, S.F. et al., J. Molec Biol (1990) 215: 403). Homology (identity) as defined herein is determined conventionally using the well-known computer program, BESTFIT® (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wl, 53711 ). When using BESTFIT® or any other sequence alignment program (such as the Clustal algorithm from MegAlign software (DNAStar®)) to determine whether a particular sequence is, for example, about 95% homologous to a reference sequence, according to the present invention, the parameters are set such that the percentage of identity is calculated over the full length of the reference nucleotide sequence or amino acid sequence and that gaps in homology of up to about 90% of the total number of nucleotides in the reference sequence are allowed.
Ninety percent homology is therefore determined, for example, using the BESTFIT® program with parameters set such that the percentage of identity is calculated over the full length of the reference sequence, e.g., SEQ ID NO:1 , and wherein up to 5% of the amino acids in the reference sequence may be substituted with another amino acid. Percent homologies are likewise determined, for example, to identify preferred species, within the scope of the claims appended hereto, which reside within the range of about 95% to 100% homology to SEQ ID NO:1 as well as the binding site thereof. As noted above, N- or C-terminal extensions shall not be construed as affecting homology. Thus, when comparing two sequences, the reference sequence is generally the shorter of the two sequences. This means that, for example, if a sequence of 50 nucleotides in length with precise identity to a 50 nucleotide region within a 100 nucleotide polynucleotide is compared, there is 100% homology as opposed to only 50% homology. Although the natural polypeptide of SEQ ID NO:1 and a variant polypeptide may only possess a certain percentage identity, e.g., 95%, they are actually likely to possess a higher degree of similarity, depending on the number of dissimilar codons that are conservative changes. Conservative amino acid substitutions can frequently be made in a protein without altering either the conformation or function of the protein. Similarity between two sequences includes direct matches as well a conserved amino acid substitutes which possess similar structural or chemical properties, e.g., similar charge as described in Table 2. Percentage similarity (conservative substitutions) between two polypeptides may also be scored by comparing the amino acid sequences of the two polypeptides by using programs well known in the art, including the BESTFIT program, by employing default settings for determining similarity. A further embodiment of the invention is a crystal comprising the coordinates of FIG. 2, wherein the amino acid sequence is represented by SEQ ID NO:1. A further embodiment of the invention is a crystal comprising the coordinates of FIG.2, wherein the amino acid sequence is at least 95% or 98% homologous to the amino acid sequence represented by SEQ ID NO:1. Various methods for obtaining atomic coordinates of structurally homologous molecules and molecular complexes using homology modeling are disclosed in, for example, US Patent No: 6,356,845.
D. Structure Based Drug Design Once the three-dimensional structure of a crystal comprising a DPP-IV protein, a functional domain thereof, homologue, analogue or variant thereof, is determined, a ligand (antagonist or agonist) may be examined through the use of computer modeling using a docking program such as GRAM, DOCK, or AUTODOCK (See for example, Morris et al., J. Computational Chemistry, 19:1639-1662 (1998)). This procedure can include in silico fitting of potential ligands to the DPP-IV crystal structure to ascertain how well the shape and the chemical structure of the potential ligand will complement or interfere with the catalytic domain of DPP-IV. (Bugg et al., Scientific American, December:92-98 (1993); West et al., TIPS, 16:67-74 (1995)). As those of skill in the art can appreciate, computer programs can also be employed to estimate the attraction, repulsion, and steric hindrance of the ligand to the binding site. Generally the tighter the fit (e.g., the lower the steric hindrance, and/or the greater the attractive force) the more potent the potential drug will be since these properties are consistent with a tighter binding constant. Furthermore, the more specificity in the design of a potential drug the more likely that the drug will not interfere with the properties of other proteins. This will minimize potential side-effects due to unwanted interactions with other proteins.
One embodiment of the present invention relates to methods of identifying agents that bind to a binding site on DPP-IV extracellular domain wherein the binding site comprises amino acid residues Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740 of SEQ ID NO:1 , comprising: contacting DPP-IV with a test ligand under conditions suitable for binding of the test compound to the binding site, and determining whether the test ligand binds in the binding site, wherein if binding occurs, the test ligand is an agent that binds in the binding site. In further embodiments, the testing may be carried out in silico using a variety of molecular modeling software algorithms including, but not limited to, DOCK, ALADDIN, CHARMM simulations, AFFINITY, C2-LIGAND FIT, Catalyst, LUDI, CAVEAT, and CONCORD. (Brooks, et al. CHARMM: a program for macromolecular energy, minimization, and dynamics calculations. J Comp.Chem 1983, 4:187-217; E.C. Meng, B.K. Shoichet & I.D. Kuntz. Automated docking with grid-based energy evaluation. J Comp Chem 1992, 13:505-524. In another embodiment, a potential ligand may be obtained by screening a random peptide library produced by a recombinant bacteriophage (Scott and Smith, Science, 249:386-390 (1990); Cwirla et al., Proc. Natl. Acad. Sci., 87:6378-6382 (1990); Devlin et al., Science, 249:404-406 (1990)) or a chemical library, or the like. A ligand selected in this manner can be then be systematically modified by computer modeling programs until one or more promising potential ligands are identified. Such analysis, for example, has been shown to be effective in the development of HIV protease inhibitors. (Lam et al., Science 263:380-384 (1994); Wlodawer et al., Ann. Rev. Biochem. 62:543-585 (1993); Appelt, Perspectives in Drug Discovery and Design 1 :23-48 (1993); Erickson, Perspectives in Drug Discovery and Design 1 :109-128 (1993)). Computer modeling allows the selection of a finite number of rational chemical modifications, as opposed to the countless number of essentially random chemical modifications that could be made, of which any one might lead to a useful drug. Each chemical modification requires additional chemical steps, which while being reasonable for the synthesis of a finite number of compounds, quickly becomes overwhelming if all possible modifications needed to be synthesized are actually synthesized. Thus, through the use of the three-dimensional structure disclosed herein and computer modeling, a large number of these compounds can be rapidly screened on a computer monitor screen, and a few likely candidates can be determined without the laborious synthesis of untold numbers of compounds. Once a potential ligand (agonist or antagonist) is identified, it can be either selected from a library of chemicals as are commercially available from most large chemical companies or, alternatively, the potential ligand may be synthesized de novo. As mentioned above, the de novo synthesis of one or even a relatively small group of specific compounds is reasonable in the art of drug design. The potential ligand can be placed into any standard binding assay as well known to those skilled in the art to test its effect on DPP-IV activity. When a suitable drug is identified, a supplemental crystal can be grown comprising a protein- ligand complex formed between a DPP-IV protein and the drug. Preferably the crystal diffracts X-rays allowing the determination of the atomic coordinates of the protein-ligand complex to a resolution of less than 5.0 Angstroms, more preferably less than 3.0 Angstroms, and even more preferably less than 2.0 Angstroms. The three-dimensional structure of the supplemental crystal can be determined by Molecular Replacement Analysis. Molecular replacement uses a known three-dimensional structure as a search model to determine the structure of a closely related molecule or protein-ligand complex in a new crystal
form. The measured X-ray diffraction properties of the new crystal are compared with the search model structure to compute the position and orientation of the protein in the new crystal. Computer programs that can be used include: X-PLOR and AMORE (J. Navaza, Acta Crystallographies ASO, 157-163 (1994)). As those of skill in the art can appreciate, once the position and orientation are known, an electron density map can be calculated using the search model to provide X-ray phases. Thereafter, the electron density is inspected for structural differences, and the search model is modified to conform to the new structure. Using this approach, it is possible to use the claimed structure of DPP-IV can be used to solve the three- dimensional structures of any such DPP-IV complexed with a ligand. Other suitable computer programs that can be used to solve the structures of such STAT crystals include: QUANTA; CHARMM; INSIGHT; SYBYL; MACROMODEL; and ICM. Suitable in silico methods for screening, designing or selecting ligands are disclosed in, for example, U.S. Patent No. 6,356,845.
E. Ligands In one aspect, the present invention discloses binding agents which interact with a binding site of DPP-IV defined by a set of points having a root mean square deviation of less than about 2.5, 2.0, 1.7, 1.5, 1.2, 1.0, 0.7, 0.5, or even 0.2 A from points representing the backbone atoms of the amino acids represented by the structure coordinates listed in FIG. 2. Such embodiments represent variants of the DPP-IV crystal. In another aspect, the present invention provides ligands which bind to a folded polypeptide comprising an amino acid sequence spanning amino acids 31 to 766 listed in SEQ ID NO:1 , or a homologue or variant thereof. In further embodiments, the ligand is a competitive or uncompetitive inhibitor of DPP-IV. In yet further embodiments the ligand inhibits DPP-IV with an IC50 or K, of less than about 10 mM, 1 mM, 500 nM, 100 nM, 50 nM or 10 nM. In yet further embodiments, the ligand inhibits DPP-IV with a K, that is less than about one-half, one-fifth, or one-tenth the K, that the substance has for inhibition of any other DPP-IV enzyme. In other words, the substance inhibits DPP-IV activity to the same degree at a concentration of about one-half, one-fifth, one-tenth or less than the concentration required for any other DPP enzyme. One embodiment of the present invention relates to ligands, such as proteins, peptides, peptidomimetics, small organic molecules, etc., designed or developed with reference to the crystal structure of DPP-IV as represented by the coordinates presented herein in FIG. 2, and portions thereof. Such binding agents interact with the binding site of the DPP-IV represented by one or more amino acid residues selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740.
F. Machine Readable Storage Media Transformation of the structure coordinates for all or a portion of DPP-IV or one of its binding pockets, for structurally homologous molecules as defined below, or for the structural equivalents of any of these molecules or molecular complexes as defined above, into three-dimensional graphical representations of the molecule or complex can be conveniently achieved through the use of commercially-available software.
The invention thus further provides a machine-readable storage medium comprising a data storage material encoded with machine readable data which, when using a machine programmed with instructions for using said data, is capable of displaying a graphical three-dimensional representation of any of the molecule or molecular complexes of this invention that have been described above. In a preferred embodiment, the machine-readable data storage medium comprises a data storage material encoded with machine readable data which, when using a machine programmed with instructions for using said data, is capable of displaying a graphical three-dimensional representation of a molecule or molecular complex comprising all or any parts of a DPP-IV binding pocket, as defined above. In another preferred embodiment, the machine-readable data storage medium is capable of displaying a graphical three-dimensional representation of a molecule or molecular complex defined by the structure coordinates of the amino acids listed in FIG. 4, plus or minus a root mean square deviation from the backbone atoms of said amino acids of not more than 2.0 .A. In an alternative embodiment, the machine-readable data storage medium comprises a data storage material encoded with a first set of machine readable data which comprises the Fourier transform of the structural coordinates set forth in FIG. 2, and which, when using a machine programmed with instructions for using said data, can be combined with a second set of machine readable data comprising the X-ray diffraction pattern of a molecule or molecular complex to determine at least a portion of the structural coordinates corresponding to the second set of machine readable data. For example, a system for reading a data storage medium may include a computer comprising a central processing unit ("CPU"), a working memory which may be, e.g., RAM (random access memory) or "core" memory, mass storage memory (such as one or more disk drives or CD-ROM drives), one or more display devices (e.g., cathode-ray tube ("CRT") displays, light emitting diode ("LED") displays, liquid crystal displays ("LCDs"), electroluminescent displays, vacuum fluorescent displays, field emission displays ("FEDs"), plasma displays, projection panels, etc.), one or more user input devices (e.g., keyboards, microphones, mice, touch screens, etc.), one or more input lines, and one or more output lines, all of which are interconnected by a conventional bidirectional system bus. The system may be a stand-alone computer, or may be networked (e.g., through local area networks, wide area networks, intranets, extranets, or the internet) to other systems (e.g., computers, hosts, servers, etc.). The system may also include additional computer controlled devices such as consumer electronics and appliances. Input hardware may be coupled to the computer by input lines and may be implemented in a variety of ways. Machine-readable data of this invention may be inputted via the use of a modem or modems connected by a telephone line or dedicated data line. Alternatively or additionally, the input hardware may comprise CD-ROM drives or disk drives. In conjunction with a display terminal, a keyboard may also be used as an input device. Output hardware may be coupled to the computer by output lines and may similarly be implemented by conventional devices. By way of example, the output hardware may include a display device for displaying a graphical representation of a binding pocket of this invention using a program such as QUANTA as described herein. Output hardware might also include a printer, so that hard copy output may be produced, or a disk drive, to store system output for later use. In operation, a CPU coordinates the use of the various input and output devices, coordinates data accesses from mass storage devices, accesses to and from working memory, and determines the
sequence of data processing steps. A number of programs may be used to process the machine-readable data of this invention. Such programs are discussed in reference to the computational methods of drug discovery as described herein. References to components of the hardware system are included as appropriate throughout the following description of the data storage medium. Machine-readable storage devices useful in the present invention include, but are not limited to, magnetic devices, electrical devices, optical devices, and combinations thereof. Examples of such data storage devices include, but are not limited to, hard disk devices, CD devices, digital video disk devices, floppy disk devices, removable hard disk devices, magneto-optic disk devices, magnetic tape devices, flash memory devices, bubble memory devices, holographic storage devices, and any other mass storage peripheral device. It should be understood that these storage devices include necessary hardware (e.g., drives, controllers, power supplies, etc.) as well as any necessary media (e.g., disks, flash cards, etc.) to enable the storage of data.
G. Pharmaceutical Compositions The present invention provides methods for treating certain diseases in a mammal, preferably a human being, in need of such treatment using the ligands, and preferably the inhibitors, as described herein. The ligand can be advantageously formulated into pharmaceutical compositions comprising a therapeutically effective amount of the ligand, a pharmaceutically acceptable carrier and other compatible ingredients, such as adjuvants, Freund's complete or incomplete adjuvant, suitable for formulating such pharmaceutical compositions as is known to those skilled in the art. Pharmaceutical compositions containing the ligand can be used for treatment of diseases that are associated with proteins that are subject to processing by DPP-IV, such as Type 2 diabetes, Type 1 diabetes, impaired glucose tolerance, hyperglycemia, metabolic syndrome (syndrome X and/or insulin resistance syndrome), glucosuria, metabolic acidosis, arthritis, cataracts, diabetic neuropathy, diabetic nephropathy, diabetic retinopathy, diabetic cardiomyopathy, obesity, conditions exacerbated by obesity, hypertension, hyperlipidemia, atherosclerosis, osteoporosis, osteopenia, frailty, bone loss, bone fracture, acute coronary syndrome, short stature due to growth hormone deficiency, infertility due to polycystic ovary syndrome, anxiety, depression, insomnia, chronic fatigue, epilepsy, eating disorders, chronic pain, alcohol addiction, diseases associated with intestinal motility, ulcers, irritable bowel syndrome, inflammatory bowel syndrome; short bowel syndrome; and the prevention of disease progression in Type 2 diabetes. The pharmaceutical composition is administered to the mammal in a therapeutically effective amount such that treatment of the disease occurs. The present invention is further illustrated by the following examples, which should not be construed as limiting in any way. The contents of all cited references (including literature references, patents, published patent applications as cited throughout this application) are hereby expressly incorporated by reference in their entireties. The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, microbiology and recombinant DNA, X-ray crystallography, and molecular modeling which are within the skill of the art. As those of skill in the art will understand, such techniques are explained fully in the literature. See, for example, Molecular Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J.
Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); B. Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.), Crystallography Made Clear: A Guide For Users Of Macromolecular Models (Gales Rhodes, 2nd Ed. San Diego: Academic Press, 2000). EXAMPLES Example 1 : Cloning and Expression of human DPP-IV in Sf21 insect cells A. Construction of hDPP-IV: Residues 31 to 766 of homo sapien (human) wild type DPP-IV (SEQ ID NO:1) were amplified by PCR using the following primers DPPIV-Fc31 - BamF (5'-
TTAAGGATCCTGGCACAGATGATGCTACAGCTGAC-3' (SEQ ID NO: 3)), which introduced a Bam HI site at the N-terminus, and DPPIV-ChisT-XhoR (5'-AATTCTCGAGTTACTAGTGAT GATGGTGGTGATGGCTGCCGCGCGGCACCAGAGGTAAAGAGAAACATTGTTTTATGAAGTGGC-3' (SEQ ID NO: 4)), which introduced His6-tag, thrombin cleavage site and Xho I site at the C-terminus. The spin-column (Roche Applied Sciences Indianapolis, IN) purified PCR product (2208 bp) was digested with SamHI and Xho\ restriction enzymes and subcloned into a baculovirus transfer vector treated with the corresponding enzymes. The vector contained a polyhedrin promoter and the honeybee melittin secretion signal for efficient, high-level secretion of the recombinant protein. B. Production of recombinant baclovirus Cloning steps were monitored by restriction endonuclease mapping and sequencing analysis. E. coli clones with recombinant bacmid were obtained after transformation of E. coli DH1 OBac cells (Invitrogen Corp., Frederick, MD) with 5 ng of the final construct that encodes hDPP-IV residues 31-766 in pMCG243 (baculovirus transfer vector) (DPPIV-HBM31-HT plasmid DNA) and blue/white-screening according to manufacturer's protocol (Invitrogen Corp.). Monolayers of Sf9 cells (20 X 106 cells in a 162 cm2 culture flask) were transfected by overlaying 20 ml of transfection mixture containing 100 μl of mini-prep bacmid DNA and 100 μl of CellFECTIN reagent (Gibco BRL Gaithersburg, MD) in Sf-900 II SFM. The transfection mixture was removed after 5 h of incubation (27°C) and the cells were overlaid with 25 ml of Sf-900 II SFM. The recombinant virus were harvested at 72h post-transfection and further amplification of the virus was achieved by infecting 100 ml of Sf-9 cells (1.2 X 106 cells/ml) with 2 ml of the recombinant virus for 65-72 h. Baculo Infected Insect Cells (BIIC) stocks were prepared as follows; when the cell diameter increases by 2-4 Dm above the baseline (usually 65-72h) and while the cell viability is still > 80%, the BIIC's were gently spun down and re-suspended in a freezing medium (90% SF900 II, 1 % w/v BSA, 10% v/v DMSO) at 1 X 107 viable cells/ml. The BIIC's were frozen down as 1 ml aliquots using normal cryopreservation methods and stored in -70SC or liquid nitrogen for long-term storage. C. Expression of recombinant DPP-IV Cloning steps were monitored by restriction endonuclease mapping and sequencing analysis. E coli clones with recombinant bacmid were obtained after transformation of E. coli DH1 OBac cells (Invitrogen Corp., Frederick, MD) with 5 ng of DPPIV-HBM31 -HT plasmid DNA and blue/white-screening according to manufacturer's protocol (Invitrogen). Monolayers of Sf9 cells (20 X 106 cells in a 162 cm2 culture flask) were transfected by overlaying 20 ml of transfection mixture containing 100 μl of mini-prep bacmid DNA and 100
μl of CellFECTIN reagent (Gibco BRL) in Sf-900 II SFM. The transfection mixture was removed after 5 h of incubation (27aC) and the cells were overlaid with 25 ml of Sf-900 II SFM. The recombinant virus were harvested at 72h post-transfection and further amplification of the virus was achieved by infecting 100 ml of Sf-9 cells (1.2 X 106 cells/ml) with 2 ml of the recombinant virus for 65-72 h. Baculo Infected Insect Cells (BIIC) stocks were prepared as follows; when the cell diameter increases by 2-4 μm above the baseline (usually 65-72h) and while the cell viability is still > 80%, the BIIC's were gently spun down and resuspended in a freezing medium (90% SF900 II, 1% w/v BSA, 10% v v DMSO) at 1 X 107 viable cells/ml. The BIIC's were frozen down as 1 ml aliquots using normal cryopreservation methods and stored in -70SC or liquid nitrogen for long-term storage. Example 2: Purification of His6-tagged DPP-IV wild type extracellular domain After clarification by centrifugation and filtration, 10 liters of culture media containing secreted human DPP-IV-31-766-C-his6 was concentrated 10-20 fold using a hollow fiber filter unit which had been washed with exchange Buffer A (50 mM Tris, 0.3 M NaCI, 1 mM TCEP, pH 8). The concentrated media was exchanged with 5 volumes of Buffer A. After clarification by filtration, imidazole was added to 10 mM by addition of Buffer B (50 mM TrisCI, 0.3 M NaCI, 0.25 M imidazole, 1 mM TCEP, pH 8). The sample was applied to a 40 mL immobilized metal affinity column (Ni-NTA Superflow, Qiagen), which had been equilibrated in Buffer A (50 mM TrisCI, 0.3 M NaCI, 1 mM TCEP, pH 8) at 6-8 mL/min. The column was washed with Buffer A to achieve a stable baseline at 280 nm. Bound protein was eluted at a lower flow rate in a linear gradient from 0 - 20%B in 4 column volumes (cv) (5%B / cv) followed by a step to 100% B, held isocratic ally for 4 cv. Fractions were analyze by SDS-PAGE on 4-12% bis-tris in MOPS buffer using the NuPAGE system (Invitrogen). Fractions containing DPP-IV were pooled and dialyzed at 4 degrees C against 2 changes of dialysis buffer (50 mM TrisCI, 0.1 M NaCI, 1 mM TCEP, pH 8. After dialysis, the sample was concentrated to 6-10 mg/mL and fractionated by size exclusion chromatography on Superdex 200 prep grade HiLoad 16/60 (Amersham Biosciences). DPP-IV eluted as an apparent dimer. Fractions were analyze by SDS-PAGE on 4-12% bis-tris in MOPS buffer using the NuPAGE system (Invitrogen). DPP-IV isolated in this manner was used for crystallization. Example 3: Crystallization of DPP-IV DPP-IV of Example 2 was concentrated into buffer containing 50 mM TrisCI, 25 mM NaCI, 1 mM TCEP, pH 8, to 8-10 mg/mL. Leads were obtained through sparse matrix screening at 22°. Optimized crystals grew in drops made from 1.5 μL of protein + 1.5 μL of reservoir solution (0.1 M TrisCI, pH 8.5, 0.2 M sodium acetate, 10-16% PEG 4000) equilibrating over the same reservoir solution. Crystals were transferred to a solution containing 0.1 M TrisCI, pH 8.5, 0.2 M sodium acetate, 14-16% PEG 4000, and 20% ethylene glycol. Crystals were flash frozen in gaseous or liquid nitrogen for data collection. Example 4: X-ray data collection, structure determination and refinement of DPP-IV The crystals prepared in Example 3 were transferred to a cryoprotectant solution, made up of the reservoir solution, with 15-25% ethylene glycol, and then flash-frozen in a stream of cold nitrogen gas at 100K. A full data set was collected from one crystal frozen in this manner at the Advanced Photon Sources of Argonne National Laboratory on a on a ADSC Quantum 210 CCD detector. Data was processed using the HKL2000 suite of software (Otwinowski, Z. & Minor, W. Methods Enzymology 276, 307-326 (1997). Data collection statistics are summarized in Table 5a.
The crystals belong to space group P432!2 with unit cell dimensions a=b=68.7 A, c= 421.2 A, α= β= γ= 90°. They contain one molecule of the polypeptide per asymmetric unit. The structure was solved by the method of molecular replacement, using the program AMORE (Navaza, J., Acta Cryst, 157-163 (1994)). The crystal mosaicity is 0.6 A. The data is 96.5% complete to 2.7 A resolution with an RmΘrgeθf 0.062 and an average redundancy of 4.4. The final model was built with manual rebuilding on the graphics screen, using the program XtalView (McRee, .D. E., Practical Protein Crystallography, Academic Press, San Diego, 1993). Refinement in Refmac was carried out using all data in the resolution range 50.0-2.7A. The R-factor for the current model is 0.257 (free R-factor, 5% of the data, 0.340). The refinement statistics are summarized in Table 5b. The current model contains residues 38-766 (others are disordered in the crystal), 12 sugar and 32 water molecules. Table 5a -Data statistics
1 Numbers in parentheses refer to the highest resolution range (2.80-2.70A) 2 Rsym = Σ(l-<l>)/Σ <l>
Table 5b- Refinement statistics
J R = ∑||Fobs| - /jFcalc||/Σ|Fobs| Equivalents While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of this specification. The appended claims should be interpreted by reference to the claims, along with their full scope of equivalents, and the specification, along with such variations.
Claims
1. A crystalline composition of the extracellular domain of mammalian dipeptidyl peptidase IV (DPP-IV) comprising one molecule per crystal asymmetric unit. 2. A crystal of claim 1 , wherein said extracellular domain has a three dimensional structure characterized by the atomic structure coordinates of Fig.
2.
3. The DPP-IV crystal comprising SEQ ID NO:2, or a homologue, analogue or variant thereof.
4. A crystal having one molecule per asymmetric unit comprising a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , or a homologue, analogue or variant thereof.
5. The crystal of claim 4, wherein the homologue or variant has an amino acid identity of at least 95% with a polypeptide having an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID N0.1.
6. The crystal of claim 4 or 5, wherein the homologue or variant thereof has a protein backbone comprising the atomic coordinates, or portions thereof, that are within a root mean square deviation of less than 2.5, 2.0, 1.5, 1.2, 1.0, 0.7, 0.5, or 0.2 A of the atomic coordinates, or portions thereof, listed in FIG. 2. 7. A polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 1 or a homologue, or variant thereof, wherein the molecules are arranged in a crystalline manner in a space group of P43212 so as to form a unit cell of dimensions a=b=68.7 A, c= 421.2 A and which effectively diffracts X-rays for determination of the atomic coordinates of DPP-IV polypeptide to a resolution of about 2.
7 A.
8. A crystal of a protein-ligand molecule or molecular complex comprising: (a) a polypeptide with an amino acid sequence from Asp38 to Pro766 listed in SEQ ID NO:1 , or a homologue, or variant thereof; (b) a ligand; (c) and the crystal effectively diffracts X-rays for the determination of atomic coordinates of the protein-ligand complex to a resolution of greater than 2.7 Angstroms.
9. A method of designing a compound that binds to DPP-IV comprises the amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , or a homologue, or variant thereof using the crystal of claim 1 , comprising selecting a compound by performing structure-based drug design with the atomic coordinates determined for the crystal, wherein said selecting is performed in conjunction with computer modeling.
10. A method for crystallizing a DPP-IV polypeptide molecule or molecular complex comprising: (a) preparing a mixture of an aqueous solution comprising a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , or a homologue, or a variant thereof; (b) mixing said aqueous solution with a reservoir solution comprising a precipitant to from a mixed volume; and (c) crystallizing said mixed volume.
1 1. The method of claim 10 wherein step (c) is carried out by vapor diffusion crystallization, batch crystallization, liquid bridge crystallization, or dialysis crystallization.
12. A computer for producing a three-dimensional representation of a polypeptide with an amino acid sequence spanning amino acids Gly31 to Pro766 listed in SEQ ID NO:1 , or a homologue, or a variant thereof, comprising: a computer-readable data storage medium comprising a data storage material encoded with computer-readable data, wherein said data comprises the structure coordinates of FIG. 2, or portions thereof; a working memory for storing instructions for processing said computer-readable data; a central-processing unit coupled to said working memory and to said computer-readable data storage medium for processing said computer-machine readable data into said three-dimensional representation; and a display coupled to said central-processing unit for displaying said representation.
13. A computer for producing a three-dimensional representation of a molecule or molecular complex comprising the atomic coordinates having a root mean square deviation of less than 2.5, 2.0, 1.7, 1.5, 1.2, 1.0, 0.7, 0.5, or 0.2 A from the atomic coordinates for the carbon backbone atoms listed in FIG.2 comprising: a computer-readable data storage medium comprising a data storage material encoded with computer-readable data, wherein said data comprises the structure coordinates of FIG. 2, or portions thereof; a working memory for storing instructions for processing said computer-readable data; a central- processing unit coupled to said working memory and to said computer-readable data storage medium for processing said'computer-machine readable data into said three-dimensional representation; and a display coupled to said central-processing unit for displaying said representation.
14. A computer for producing a three-dimensional representation of a molecule or molecular complex comprising: a binding site defined by the structure coordinates in FIG. 2, or a the structural coordinates of a portion of the residues in FIG. 2, or the structural coordinates of one or more DPP-IV amino acids in SEQ ID NO:1 selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740 wherein said computer comprises; a computer-readable data storage medium comprising a data storage material encoded with computer-readable data, wherein said data comprises the structure coordinates of FIG. 2, or portions thereof; a working memory for storing instructions for processing said computer-readable data; a central-processing unit coupled to said working memory and to said computer-readable data storage medium for processing said computer-machine readable data into said three-dimensional representation; and a display coupled to said central-processing unit for displaying said representation.
15. A method for identifying potential ligands for DPP-IV, or homologues, analogues or variants thereof, comprising: displaying three dimensional structure of DPP-IV enzyme, or portions thereof, as defined by atomic coordinates in FIG. 2, on a computer display screen; optionally replacing one or more DPP-IV enzyme amino acid residues listed in SEQ ID NO:1 , or one or more of the amino acids listed in Table 1 , or one or more amino acid residues selected from Glu205, Glu206, Tyr547, Ser630, Tyr631 , Tyr662, Tyr666, Asp708, Asn710 and His740, in said three-dimensional structure with a different naturally occurring amino acid or an unnatural amino acid; employing said three-dimensional structure to design or select said ligand; contacting said ligand with DPP-IV, or variant thereof, in the presence of one or more substrates; and measuring the ability of said ligand to modulate the activity DPP- IV.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US57687704P | 2004-06-03 | 2004-06-03 | |
| PCT/IB2005/001630 WO2005119526A1 (en) | 2004-06-03 | 2005-05-26 | Crystal structure of dipeptidyl peptidase iv (dpp-iv) and uses thereof |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP1756740A1 true EP1756740A1 (en) | 2007-02-28 |
Family
ID=34969993
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP05744309A Withdrawn EP1756740A1 (en) | 2004-06-03 | 2005-05-26 | Crystal structure of dipeptidyl peptidase iv (dpp-iv) and uses thereof |
Country Status (4)
| Country | Link |
|---|---|
| EP (1) | EP1756740A1 (en) |
| CA (1) | CA2569095A1 (en) |
| MX (1) | MXPA06014083A (en) |
| WO (1) | WO2005119526A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2012170702A1 (en) | 2011-06-08 | 2012-12-13 | Arena Pharmaceuticals, Inc. | Modulators of the gpr119 receptor and the treatment of disorders related thereto |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| PE20080251A1 (en) | 2006-05-04 | 2008-04-25 | Boehringer Ingelheim Int | USES OF DPP IV INHIBITORS |
| KR101452915B1 (en) | 2006-05-04 | 2014-10-21 | 베링거 인겔하임 인터내셔날 게엠베하 | Polymorphism |
| EP1852108A1 (en) | 2006-05-04 | 2007-11-07 | Boehringer Ingelheim Pharma GmbH & Co.KG | DPP IV inhibitor formulations |
| US20200155558A1 (en) | 2018-11-20 | 2020-05-21 | Boehringer Ingelheim International Gmbh | Treatment for diabetes in patients with insufficient glycemic control despite therapy with an oral antidiabetic drug |
| US8513187B2 (en) | 2009-02-27 | 2013-08-20 | Trustees Of Tufts College | Soft protease inhibitors, and pro-soft forms thereof |
| US8716482B2 (en) * | 2009-09-25 | 2014-05-06 | Merck Sharp & Dohme Corp. | Substituted aminopiperidines as dipeptidyl peptidase-IV inhibitors for the treatment of diabetes |
| EP2366394A1 (en) * | 2010-03-17 | 2011-09-21 | IMTM GmbH | Characterization and validation of inhibitors and ligands of dipeptidyl aminopeptidase IV (DP IV) |
| WO2011146358A1 (en) * | 2010-05-21 | 2011-11-24 | Merck Sharp & Dohme Corp. | Substituted seven-membered heterocyclic compounds as dipeptidyl peptidase-iv inhibitors for the treatment of diabetes |
| US20130137852A1 (en) * | 2010-08-03 | 2013-05-30 | Cadila Corporate Campus | Novel compounds as dpp-iv inhibitors and process for preparation thereof |
| AR083878A1 (en) | 2010-11-15 | 2013-03-27 | Boehringer Ingelheim Int | VASOPROTECTORA AND CARDIOPROTECTORA ANTIDIABETIC THERAPY, LINAGLIPTINA, TREATMENT METHOD |
| WO2013171166A1 (en) | 2012-05-14 | 2013-11-21 | Boehringer Ingelheim International Gmbh | A xanthine derivative as dpp-4 inhibitor for use in the treatment of sirs and/or sepsis |
| MX390363B (en) | 2016-06-10 | 2025-03-20 | Boehringer Ingelheim Int | Combinations of linagliptin and metformin |
-
2005
- 2005-05-26 WO PCT/IB2005/001630 patent/WO2005119526A1/en not_active Ceased
- 2005-05-26 CA CA002569095A patent/CA2569095A1/en not_active Abandoned
- 2005-05-26 EP EP05744309A patent/EP1756740A1/en not_active Withdrawn
- 2005-05-26 MX MXPA06014083A patent/MXPA06014083A/en unknown
Non-Patent Citations (1)
| Title |
|---|
| See references of WO2005119526A1 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2012170702A1 (en) | 2011-06-08 | 2012-12-13 | Arena Pharmaceuticals, Inc. | Modulators of the gpr119 receptor and the treatment of disorders related thereto |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2005119526A1 (en) | 2005-12-15 |
| MXPA06014083A (en) | 2007-02-15 |
| CA2569095A1 (en) | 2005-12-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2005113762A1 (en) | CRYSTAL STRUCTURE OF PROTEIN KINASE B-α (AKT-1) AND USES THEREOF | |
| US20050075795A1 (en) | Crystal structure of 3', 5'-cyclic nucleotide phosphodiesterase (PDE1B) and uses thereof | |
| WO2008109871A2 (en) | Crystal structure of proprotein convertase 9 (pcsk9) and uses thereof | |
| EP1756740A1 (en) | Crystal structure of dipeptidyl peptidase iv (dpp-iv) and uses thereof | |
| US20040171019A1 (en) | PIN1 peptidyl-prolyl isomerase polypeptides, their crystal structures, and use thereof for drug design | |
| US20050202550A1 (en) | Crystal structure of 3', 5'-cyclic nucleotide phosphodiesterase (PDE10A) and uses thereof | |
| EP1265927B1 (en) | Crystal | |
| US20030165984A1 (en) | Hepatitis C virus helicase crystals, crystallographic structure and methods | |
| EP4367226B1 (en) | Crystal structures of alk and ltk receptor tyrosine kinases and their ligands | |
| US20040018560A1 (en) | Crystallized LXR polypeptide in complex with a ligand and screening methods employing same | |
| US20080064052A1 (en) | Crystal of a Receptor-Ligand Complex and methods of use | |
| WO2005083069A1 (en) | Pde2 crystal structures for structure based drug design | |
| US20060234293A1 (en) | Polypeptide methods and means | |
| AU781654B2 (en) | Crystallization and structure determination of staphylococcus aureus thymidylate kinase | |
| EP1393235A1 (en) | Crystallized human xenobiotic nuclear receptor pxr/sxr ligand binding domain polypeptide and screening methods employing same | |
| US20080020984A1 (en) | Crystal Structure of a Receptor-Ligand Complex and Methods of Use | |
| US7297508B1 (en) | Crystallization of fibroblast activation protein alpha (FAPα) | |
| US20050131209A1 (en) | Crystallized hnf4 gamma ligand binding domain polypeptide and screening methods employing same | |
| WO2005119525A1 (en) | 11βHSD1 CRYSTAL STRUCTURES FOR STRUCTURE BASED DRUG DESIGN | |
| WO2005103241A1 (en) | Crystal structure of 3', 5'-cyclic nucleotide phosphodiesterase 9a (pde9a) and uses thereof | |
| MXPA06009940A (en) | Crystal structure of 3',5'-cyclic nucleotide phosphodiesterase (pde10a) and uses thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20070103 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
| DAX | Request for extension of the european patent (deleted) | ||
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
| 18D | Application deemed to be withdrawn |
Effective date: 20071201 |