AU747300B2 - DNA binding pyrrole and imidazole polyamide derivatives - Google Patents
DNA binding pyrrole and imidazole polyamide derivatives Download PDFInfo
- Publication number
- AU747300B2 AU747300B2 AU71040/98A AU7104098A AU747300B2 AU 747300 B2 AU747300 B2 AU 747300B2 AU 71040/98 A AU71040/98 A AU 71040/98A AU 7104098 A AU7104098 A AU 7104098A AU 747300 B2 AU747300 B2 AU 747300B2
- Authority
- AU
- Australia
- Prior art keywords
- polyamide
- pypypy
- impypy
- pypypypy
- dna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 229920002647 polyamide Polymers 0.000 title claims description 505
- 239000004952 Polyamide Substances 0.000 title claims description 503
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 title description 60
- 230000004568 DNA-binding Effects 0.000 title description 28
- KAESVJOAVNADME-UHFFFAOYSA-N Pyrrole Chemical compound C=1C=CNC=1 KAESVJOAVNADME-UHFFFAOYSA-N 0.000 title description 18
- 108020004414 DNA Proteins 0.000 claims description 154
- 229910052739 hydrogen Inorganic materials 0.000 claims description 62
- 239000002253 acid Substances 0.000 claims description 45
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 23
- IUNMPGNGSSIWFP-UHFFFAOYSA-N dimethylaminopropylamine Chemical compound CN(C)CCCN IUNMPGNGSSIWFP-UHFFFAOYSA-N 0.000 claims description 22
- 102000053602 DNA Human genes 0.000 claims description 14
- 229960002685 biotin Drugs 0.000 claims description 14
- 239000011616 biotin Substances 0.000 claims description 14
- 235000020958 biotin Nutrition 0.000 claims description 14
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical group NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 claims description 14
- 125000000217 alkyl group Chemical group 0.000 claims description 9
- 125000000304 alkynyl group Chemical group 0.000 claims description 9
- OXHNLMTVIGZXSG-UHFFFAOYSA-N 1-Methylpyrrole Chemical group CN1C=CC=C1 OXHNLMTVIGZXSG-UHFFFAOYSA-N 0.000 claims description 8
- 125000003342 alkenyl group Chemical group 0.000 claims description 8
- 150000003839 salts Chemical class 0.000 claims description 6
- MCTWTZJPVLRJOU-UHFFFAOYSA-N 1-methyl-1H-imidazole Chemical group CN1C=CN=C1 MCTWTZJPVLRJOU-UHFFFAOYSA-N 0.000 claims description 4
- 230000027455 binding Effects 0.000 description 123
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 99
- 239000011347 resin Substances 0.000 description 96
- 229920005989 resin Polymers 0.000 description 96
- 239000000243 solution Substances 0.000 description 88
- 238000004128 high performance liquid chromatography Methods 0.000 description 78
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 71
- 238000006243 chemical reaction Methods 0.000 description 68
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 66
- 239000000843 powder Substances 0.000 description 60
- 230000015572 biosynthetic process Effects 0.000 description 58
- 238000005481 NMR spectroscopy Methods 0.000 description 53
- 238000011084 recovery Methods 0.000 description 53
- -1 imidazole amino acids Chemical class 0.000 description 50
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 48
- 238000003786 synthesis reaction Methods 0.000 description 48
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 46
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-dimethylformamide Substances CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 43
- 238000003776 cleavage reaction Methods 0.000 description 43
- 108090000623 proteins and genes Proteins 0.000 description 42
- 230000007017 scission Effects 0.000 description 42
- 238000010532 solid phase synthesis reaction Methods 0.000 description 41
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 40
- 238000000034 method Methods 0.000 description 38
- RMVRSNDYEFQCLF-UHFFFAOYSA-N thiophenol Chemical compound SC1=CC=CC=C1 RMVRSNDYEFQCLF-UHFFFAOYSA-N 0.000 description 36
- 239000000203 mixture Substances 0.000 description 34
- 238000005160 1H NMR spectroscopy Methods 0.000 description 30
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 30
- 238000013461 design Methods 0.000 description 30
- 239000012634 fragment Substances 0.000 description 30
- 238000000746 purification Methods 0.000 description 27
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 24
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 24
- 108091028043 Nucleic acid sequence Proteins 0.000 description 24
- 239000007790 solid phase Substances 0.000 description 24
- 238000005859 coupling reaction Methods 0.000 description 23
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 22
- 230000014509 gene expression Effects 0.000 description 22
- 150000001412 amines Chemical class 0.000 description 21
- 150000001875 compounds Chemical class 0.000 description 21
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 21
- 238000001914 filtration Methods 0.000 description 20
- 239000011521 glass Substances 0.000 description 20
- UOWVMDUEMSNCAV-WYENRQIDSA-N rachelmycin Chemical compound C1([C@]23C[C@@H]2CN1C(=O)C=1NC=2C(OC)=C(O)C4=C(C=2C=1)CCN4C(=O)C1=CC=2C=4CCN(C=4C(O)=C(C=2N1)OC)C(N)=O)=CC(=O)C1=C3C(C)=CN1 UOWVMDUEMSNCAV-WYENRQIDSA-N 0.000 description 20
- 238000004007 reversed phase HPLC Methods 0.000 description 20
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 19
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 19
- 230000001588 bifunctional effect Effects 0.000 description 19
- 239000003153 chemical reaction reagent Substances 0.000 description 19
- 239000000539 dimer Substances 0.000 description 19
- 238000004992 fast atom bombardment mass spectroscopy Methods 0.000 description 19
- 239000000126 substance Substances 0.000 description 19
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 18
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 18
- 230000008878 coupling Effects 0.000 description 18
- 238000010168 coupling process Methods 0.000 description 18
- UPBAOYRENQEPJO-UHFFFAOYSA-N n-[5-[[5-[(3-amino-3-iminopropyl)carbamoyl]-1-methylpyrrol-3-yl]carbamoyl]-1-methylpyrrol-3-yl]-4-formamido-1-methylpyrrole-2-carboxamide Chemical compound CN1C=C(NC=O)C=C1C(=O)NC1=CN(C)C(C(=O)NC2=CN(C)C(C(=O)NCCC(N)=N)=C2)=C1 UPBAOYRENQEPJO-UHFFFAOYSA-N 0.000 description 18
- 239000003446 ligand Substances 0.000 description 17
- 239000000178 monomer Substances 0.000 description 17
- 238000005804 alkylation reaction Methods 0.000 description 16
- 150000001413 amino acids Chemical class 0.000 description 16
- 108010042747 stallimycin Proteins 0.000 description 16
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 15
- 239000000975 dye Substances 0.000 description 15
- 238000002474 experimental method Methods 0.000 description 15
- 239000000499 gel Substances 0.000 description 15
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 14
- 230000029936 alkylation Effects 0.000 description 14
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 14
- 238000004448 titration Methods 0.000 description 14
- QOSSAOTZNIDXMA-UHFFFAOYSA-N Dicylcohexylcarbodiimide Chemical compound C1CCCCC1N=C=NC1CCCCC1 QOSSAOTZNIDXMA-UHFFFAOYSA-N 0.000 description 13
- CWYNVVGOOAEACU-UHFFFAOYSA-N Fe2+ Chemical compound [Fe+2] CWYNVVGOOAEACU-UHFFFAOYSA-N 0.000 description 13
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 13
- 210000004027 cell Anatomy 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 102000004169 proteins and genes Human genes 0.000 description 13
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical compound C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 13
- BBEAQIROQSPTKN-UHFFFAOYSA-N pyrene Chemical compound C1=CC=C2C=CC3=CC=CC4=CC=C1C2=C43 BBEAQIROQSPTKN-UHFFFAOYSA-N 0.000 description 13
- IAZDPXIOMUYVGZ-WFGJKAKNSA-N Dimethyl sulfoxide Chemical compound [2H]C([2H])([2H])S(=O)C([2H])([2H])[2H] IAZDPXIOMUYVGZ-WFGJKAKNSA-N 0.000 description 12
- 150000002148 esters Chemical class 0.000 description 12
- 239000001257 hydrogen Substances 0.000 description 12
- 230000001105 regulatory effect Effects 0.000 description 12
- KDLHZDBZIXYQEI-UHFFFAOYSA-N Palladium Chemical compound [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 11
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 11
- PXHVJJICTQNCMI-UHFFFAOYSA-N nickel Substances [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 11
- 150000007523 nucleic acids Chemical group 0.000 description 11
- 239000000047 product Substances 0.000 description 11
- 230000008685 targeting Effects 0.000 description 11
- 238000004809 thin layer chromatography Methods 0.000 description 11
- SECXISVLQFMRJM-UHFFFAOYSA-N N-Methylpyrrolidone Chemical compound CN1CCCC1=O SECXISVLQFMRJM-UHFFFAOYSA-N 0.000 description 10
- 108091023040 Transcription factor Proteins 0.000 description 10
- 102000040945 Transcription factor Human genes 0.000 description 10
- 102000039446 nucleic acids Human genes 0.000 description 10
- 108020004707 nucleic acids Proteins 0.000 description 10
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 10
- YZHUMGUJCQRKBT-UHFFFAOYSA-M sodium chlorate Chemical compound [Na+].[O-]Cl(=O)=O YZHUMGUJCQRKBT-UHFFFAOYSA-M 0.000 description 10
- 238000006467 substitution reaction Methods 0.000 description 10
- 238000012546 transfer Methods 0.000 description 10
- 238000003828 vacuum filtration Methods 0.000 description 10
- JMTMSDXUXJISAY-UHFFFAOYSA-N 2H-benzotriazol-4-ol Chemical compound OC1=CC=CC2=C1N=NN2 JMTMSDXUXJISAY-UHFFFAOYSA-N 0.000 description 9
- 125000003047 N-acetyl group Chemical group 0.000 description 9
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 244000309466 calf Species 0.000 description 9
- 229910052799 carbon Inorganic materials 0.000 description 9
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 9
- 230000001976 improved effect Effects 0.000 description 9
- 238000009830 intercalation Methods 0.000 description 9
- 230000002687 intercalation Effects 0.000 description 9
- 229920002401 polyacrylamide Polymers 0.000 description 9
- 239000002244 precipitate Substances 0.000 description 9
- 108090000765 processed proteins & peptides Proteins 0.000 description 9
- 239000011541 reaction mixture Substances 0.000 description 9
- 239000007787 solid Substances 0.000 description 9
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 8
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 8
- 150000003973 alkyl amines Chemical class 0.000 description 8
- 125000005599 alkyl carboxylate group Chemical group 0.000 description 8
- 238000013459 approach Methods 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 description 8
- 230000001965 increasing effect Effects 0.000 description 8
- 230000003993 interaction Effects 0.000 description 8
- 150000003384 small molecules Chemical class 0.000 description 8
- 238000003860 storage Methods 0.000 description 8
- DYHSDKLCOJIUFX-UHFFFAOYSA-N tert-butoxycarbonyl anhydride Chemical compound CC(C)(C)OC(=O)OC(=O)OC(C)(C)C DYHSDKLCOJIUFX-UHFFFAOYSA-N 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 7
- 239000007983 Tris buffer Substances 0.000 description 7
- 239000000370 acceptor Substances 0.000 description 7
- 125000004429 atom Chemical group 0.000 description 7
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 7
- 210000004899 c-terminal region Anatomy 0.000 description 7
- 230000000875 corresponding effect Effects 0.000 description 7
- 125000004122 cyclic group Chemical group 0.000 description 7
- 230000029087 digestion Effects 0.000 description 7
- 238000010494 dissociation reaction Methods 0.000 description 7
- 230000005593 dissociations Effects 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 239000000706 filtrate Substances 0.000 description 7
- GVEPBJHOBDJJJI-UHFFFAOYSA-N fluoranthrene Natural products C1=CC(C2=CC=CC=C22)=C3C2=CC=CC3=C1 GVEPBJHOBDJJJI-UHFFFAOYSA-N 0.000 description 7
- 239000000138 intercalating agent Substances 0.000 description 7
- 125000005647 linker group Chemical group 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 239000012071 phase Substances 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- QQONPFPTGQHPMA-UHFFFAOYSA-N propylene Natural products CC=C QQONPFPTGQHPMA-UHFFFAOYSA-N 0.000 description 7
- 125000004805 propylene group Chemical group [H]C([H])([H])C([H])([*:1])C([H])([H])[*:2] 0.000 description 7
- 125000000168 pyrrolyl group Chemical group 0.000 description 7
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 7
- 210000001541 thymus gland Anatomy 0.000 description 7
- HBRCDTRQDHMTDA-UHFFFAOYSA-N 2-[[4-(diethylamino)phenyl]diazenyl]benzoic acid Chemical compound C1=CC(N(CC)CC)=CC=C1N=NC1=CC=CC=C1C(O)=O HBRCDTRQDHMTDA-UHFFFAOYSA-N 0.000 description 6
- JUIKUQOUMZUFQT-UHFFFAOYSA-N 2-bromoacetamide Chemical compound NC(=O)CBr JUIKUQOUMZUFQT-UHFFFAOYSA-N 0.000 description 6
- VXGRJERITKFWPL-UHFFFAOYSA-N 4',5'-Dihydropsoralen Natural products C1=C2OC(=O)C=CC2=CC2=C1OCC2 VXGRJERITKFWPL-UHFFFAOYSA-N 0.000 description 6
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 6
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 6
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 6
- 239000004471 Glycine Substances 0.000 description 6
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 6
- 108091006629 SLC13A2 Proteins 0.000 description 6
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 6
- RJURFGZVJUQBHK-UHFFFAOYSA-N actinomycin D Natural products CC1OC(=O)C(C(C)C)N(C)C(=O)CN(C)C(=O)C2CCCN2C(=O)C(C(C)C)NC(=O)C1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)NC4C(=O)NC(C(N5CCCC5C(=O)N(C)CC(=O)N(C)C(C(C)C)C(=O)OC4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-UHFFFAOYSA-N 0.000 description 6
- 230000004913 activation Effects 0.000 description 6
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 108010009297 diglycyl-histidine Proteins 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- VFRSADQPWYCXDG-LEUCUCNGSA-N ethyl (2s,5s)-5-methylpyrrolidine-2-carboxylate;2,2,2-trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.CCOC(=O)[C@@H]1CC[C@H](C)N1 VFRSADQPWYCXDG-LEUCUCNGSA-N 0.000 description 6
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 6
- 229960002449 glycine Drugs 0.000 description 6
- 238000010438 heat treatment Methods 0.000 description 6
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 6
- 230000002829 reductive effect Effects 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 239000011975 tartaric acid Substances 0.000 description 6
- 235000002906 tartaric acid Nutrition 0.000 description 6
- 230000001225 therapeutic effect Effects 0.000 description 6
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 6
- PIEPQKCYPFFYMG-UHFFFAOYSA-N tris acetate Chemical compound CC(O)=O.OCC(N)(CO)CO PIEPQKCYPFFYMG-UHFFFAOYSA-N 0.000 description 6
- RLOQBKJCOAXOLR-UHFFFAOYSA-N 1h-pyrrole-2-carboxamide Chemical group NC(=O)C1=CC=CN1 RLOQBKJCOAXOLR-UHFFFAOYSA-N 0.000 description 5
- VRPJIFMKZZEXLR-UHFFFAOYSA-N 2-[(2-methylpropan-2-yl)oxycarbonylamino]acetic acid Chemical compound CC(C)(C)OC(=O)NCC(O)=O VRPJIFMKZZEXLR-UHFFFAOYSA-N 0.000 description 5
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 5
- FUSGACRLAFQQRL-UHFFFAOYSA-N N-Ethyl-N-nitrosourea Chemical compound CCN(N=O)C(N)=O FUSGACRLAFQQRL-UHFFFAOYSA-N 0.000 description 5
- 150000001408 amides Chemical class 0.000 description 5
- 238000007098 aminolysis reaction Methods 0.000 description 5
- 238000000211 autoradiogram Methods 0.000 description 5
- UMOAGHPTFVVSHR-UHFFFAOYSA-N benzotriazol-1-yl 1-methyl-4-[(2-methylpropan-2-yl)oxycarbonylamino]pyrrole-2-carboxylate Chemical compound CN1C=C(NC(=O)OC(C)(C)C)C=C1C(=O)ON1C2=CC=CC=C2N=N1 UMOAGHPTFVVSHR-UHFFFAOYSA-N 0.000 description 5
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 5
- 238000004587 chromatography analysis Methods 0.000 description 5
- 150000004985 diamines Chemical class 0.000 description 5
- 239000012039 electrophile Substances 0.000 description 5
- 238000011067 equilibration Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 5
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 4
- POLIXZIAIMAECK-UHFFFAOYSA-N 4-[2-(2,6-dioxomorpholin-4-yl)ethyl]morpholine-2,6-dione Chemical compound C1C(=O)OC(=O)CN1CCN1CC(=O)OC(=O)C1 POLIXZIAIMAECK-UHFFFAOYSA-N 0.000 description 4
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 4
- 229930024421 Adenine Natural products 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- QUSNBJAOOMFDIB-UHFFFAOYSA-N Ethylamine Chemical compound CCN QUSNBJAOOMFDIB-UHFFFAOYSA-N 0.000 description 4
- 102000020897 Formins Human genes 0.000 description 4
- 108091022623 Formins Proteins 0.000 description 4
- WMFOQBRAJBCJND-UHFFFAOYSA-M Lithium hydroxide Chemical compound [Li+].[OH-] WMFOQBRAJBCJND-UHFFFAOYSA-M 0.000 description 4
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 4
- GTDPSWPPOUPBNX-UHFFFAOYSA-N ac1mqpva Chemical compound CC12C(=O)OC(=O)C1(C)C1(C)C2(C)C(=O)OC1=O GTDPSWPPOUPBNX-UHFFFAOYSA-N 0.000 description 4
- 229960000643 adenine Drugs 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- 125000003277 amino group Chemical group 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 239000004202 carbamide Substances 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 238000010828 elution Methods 0.000 description 4
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 238000005304 joining Methods 0.000 description 4
- 239000010410 layer Substances 0.000 description 4
- 239000012160 loading buffer Substances 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 229930014626 natural product Natural products 0.000 description 4
- IDBIFFKSXLYUOT-UHFFFAOYSA-N netropsin Chemical compound C1=C(C(=O)NCCC(N)=N)N(C)C=C1NC(=O)C1=CC(NC(=O)CN=C(N)N)=CN1C IDBIFFKSXLYUOT-UHFFFAOYSA-N 0.000 description 4
- 150000003141 primary amines Chemical group 0.000 description 4
- 239000000651 prodrug Substances 0.000 description 4
- 229940002612 prodrug Drugs 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 230000003584 silencer Effects 0.000 description 4
- 239000002002 slurry Substances 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 238000003756 stirring Methods 0.000 description 4
- 239000011550 stock solution Substances 0.000 description 4
- 238000010189 synthetic method Methods 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- GOLAEMFBWAIGRX-UHFFFAOYSA-N 1-methyl-4-[(2-methylpropan-2-yl)oxycarbonylamino]pyrrole-2-carboxylic acid Chemical compound CN1C=C(NC(=O)OC(C)(C)C)C=C1C(O)=O GOLAEMFBWAIGRX-UHFFFAOYSA-N 0.000 description 3
- KYWMCFOWDYFYLV-UHFFFAOYSA-N 1h-imidazole-2-carboxylic acid Chemical compound OC(=O)C1=NC=CN1 KYWMCFOWDYFYLV-UHFFFAOYSA-N 0.000 description 3
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 3
- 108090001008 Avidin Proteins 0.000 description 3
- OGNSCSPNOLGXSM-GSVOUGTGSA-N D-2,4-diaminobutyric acid Chemical compound NCC[C@@H](N)C(O)=O OGNSCSPNOLGXSM-GSVOUGTGSA-N 0.000 description 3
- 108010092160 Dactinomycin Proteins 0.000 description 3
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 3
- 229930192392 Mitomycin Natural products 0.000 description 3
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 3
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 3
- 108010090804 Streptavidin Proteins 0.000 description 3
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 3
- 108700009124 Transcription Initiation Site Proteins 0.000 description 3
- 239000008351 acetate buffer Substances 0.000 description 3
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 3
- 239000012190 activator Substances 0.000 description 3
- 125000001931 aliphatic group Chemical group 0.000 description 3
- 229910052786 argon Inorganic materials 0.000 description 3
- 150000004982 aromatic amines Chemical class 0.000 description 3
- 150000001543 aryl boronic acids Chemical class 0.000 description 3
- 125000003118 aryl group Chemical group 0.000 description 3
- 239000012131 assay buffer Substances 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- LNQHREYHFRFJAU-UHFFFAOYSA-N bis(2,5-dioxopyrrolidin-1-yl) pentanedioate Chemical compound O=C1CCC(=O)N1OC(=O)CCCC(=O)ON1C(=O)CCC1=O LNQHREYHFRFJAU-UHFFFAOYSA-N 0.000 description 3
- 239000012267 brine Substances 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 230000009918 complex formation Effects 0.000 description 3
- 229960000640 dactinomycin Drugs 0.000 description 3
- 238000010511 deprotection reaction Methods 0.000 description 3
- 239000010432 diamond Substances 0.000 description 3
- BFMYDTVEBKDAKJ-UHFFFAOYSA-L disodium;(2',7'-dibromo-3',6'-dioxido-3-oxospiro[2-benzofuran-1,9'-xanthene]-4'-yl)mercury;hydrate Chemical compound O.[Na+].[Na+].O1C(=O)C2=CC=CC=C2C21C1=CC(Br)=C([O-])C([Hg])=C1OC1=C2C=C(Br)C([O-])=C1 BFMYDTVEBKDAKJ-UHFFFAOYSA-L 0.000 description 3
- 229960005501 duocarmycin Drugs 0.000 description 3
- 229930184221 duocarmycin Natural products 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 125000002883 imidazolyl group Chemical group 0.000 description 3
- AGBQKNBQESQNJD-UHFFFAOYSA-N lipoic acid Chemical compound OC(=O)CCCCC1CCSS1 AGBQKNBQESQNJD-UHFFFAOYSA-N 0.000 description 3
- FDZZZRQASAIRJF-UHFFFAOYSA-M malachite green Chemical compound [Cl-].C1=CC(N(C)C)=CC=C1C(C=1C=CC=CC=1)=C1C=CC(=[N+](C)C)C=C1 FDZZZRQASAIRJF-UHFFFAOYSA-M 0.000 description 3
- 229940107698 malachite green Drugs 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- BRZNAATZQZPGBQ-UHFFFAOYSA-N methyl 4-nitro-1h-pyrrole-2-carboxylate Chemical compound COC(=O)C1=CC([N+]([O-])=O)=CN1 BRZNAATZQZPGBQ-UHFFFAOYSA-N 0.000 description 3
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 3
- 125000001570 methylene group Chemical group [H]C([H])([*:1])[*:2] 0.000 description 3
- 229960004857 mitomycin Drugs 0.000 description 3
- 238000000329 molecular dynamics simulation Methods 0.000 description 3
- KMBPCQSCMCEPMU-UHFFFAOYSA-N n'-(3-aminopropyl)-n'-methylpropane-1,3-diamine Chemical compound NCCCN(C)CCCN KMBPCQSCMCEPMU-UHFFFAOYSA-N 0.000 description 3
- LBZQAEQLSRWNLM-UHFFFAOYSA-N n-methyl-1h-pyrrole-2-carboxamide Chemical group CNC(=O)C1=CC=CN1 LBZQAEQLSRWNLM-UHFFFAOYSA-N 0.000 description 3
- 229910052698 phosphorus Inorganic materials 0.000 description 3
- 229920002704 polyhistidine Polymers 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 230000035484 reaction time Effects 0.000 description 3
- 230000009257 reactivity Effects 0.000 description 3
- 229910052938 sodium sulfate Inorganic materials 0.000 description 3
- 235000011152 sodium sulphate Nutrition 0.000 description 3
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical compound O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- KRWRTUKGTQPOCZ-UHFFFAOYSA-N tert-butyl 1h-imidazole-2-carboxylate Chemical compound CC(C)(C)OC(=O)C1=NC=CN1 KRWRTUKGTQPOCZ-UHFFFAOYSA-N 0.000 description 3
- LGHAAAIUUYWURJ-UHFFFAOYSA-N tert-butyl 1h-pyrrole-2-carboxylate Chemical group CC(C)(C)OC(=O)C1=CC=CN1 LGHAAAIUUYWURJ-UHFFFAOYSA-N 0.000 description 3
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 3
- 229960002663 thioctic acid Drugs 0.000 description 3
- 239000001226 triphosphate Substances 0.000 description 3
- 235000011178 triphosphate Nutrition 0.000 description 3
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- HEOKCJUUKIPIMM-UHFFFAOYSA-N (5-methoxycarbonyl-1-methylpyrrol-3-yl)azanium;chloride Chemical compound Cl.COC(=O)C1=CC(N)=CN1C HEOKCJUUKIPIMM-UHFFFAOYSA-N 0.000 description 2
- FONKWHRXTPJODV-DNQXCXABSA-N 1,3-bis[2-[(8s)-8-(chloromethyl)-4-hydroxy-1-methyl-7,8-dihydro-3h-pyrrolo[3,2-e]indole-6-carbonyl]-1h-indol-5-yl]urea Chemical group C1([C@H](CCl)CN2C(=O)C=3NC4=CC=C(C=C4C=3)NC(=O)NC=3C=C4C=C(NC4=CC=3)C(=O)N3C4=CC(O)=C5NC=C(C5=C4[C@H](CCl)C3)C)=C2C=C(O)C2=C1C(C)=CN2 FONKWHRXTPJODV-DNQXCXABSA-N 0.000 description 2
- 108700000016 1-methylimidazole-2-carboxamide netropsin Proteins 0.000 description 2
- BBFDGMDENAEMKF-UHFFFAOYSA-N 2,2,2-trichloro-1-(1h-pyrrol-2-yl)ethanone Chemical compound ClC(Cl)(Cl)C(=O)C1=CC=CN1 BBFDGMDENAEMKF-UHFFFAOYSA-N 0.000 description 2
- HZAXFHJVJLSVMW-UHFFFAOYSA-N 2-Aminoethan-1-ol Chemical compound NCCO HZAXFHJVJLSVMW-UHFFFAOYSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- RSUDLACLOAKYCN-UHFFFAOYSA-N 4-amino-2-[(2-methylpropan-2-yl)oxycarbonyl]butanoic acid Chemical compound CC(C)(C)OC(=O)C(C(O)=O)CCN RSUDLACLOAKYCN-UHFFFAOYSA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- 101100107610 Arabidopsis thaliana ABCF4 gene Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Chemical compound CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- GSNUFIFRDBKVIE-UHFFFAOYSA-N DMF Natural products CC1=CC=C(C)O1 GSNUFIFRDBKVIE-UHFFFAOYSA-N 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 229920002527 Glycogen Polymers 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108010042309 Netropsin Proteins 0.000 description 2
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 229930183506 Pluramycin Natural products 0.000 description 2
- 239000004743 Polypropylene Substances 0.000 description 2
- 101100068078 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GCN4 gene Proteins 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- BQCHGWYGWQEMTJ-YLGMGCDCSA-N [(2r,3r,4s,6r)-4-(dimethylamino)-6-[8-[(2s,4s,5r,6r)-4-(dimethylamino)-5-hydroxy-6-methyloxan-2-yl]-11-hydroxy-5-methyl-2-[2-methyl-3-[(z)-prop-1-enyl]oxiran-2-yl]-4,7,12-trioxonaphtho[2,3-h]chromen-10-yl]-2,4-dimethyloxan-3-yl] acetate Chemical compound C\C=C/C1OC1(C)C1=CC(=O)C2=C(C)C=C(C(=O)C=3C(=C(O)C([C@@H]4O[C@H](C)[C@H](OC(C)=O)[C@](C)(C4)N(C)C)=CC=3[C@H]3O[C@H](C)[C@H](O)[C@H](C3)N(C)C)C3=O)C3=C2O1 BQCHGWYGWQEMTJ-YLGMGCDCSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 125000003368 amide group Chemical group 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- 229940045799 anthracyclines and related substance Drugs 0.000 description 2
- 230000000259 anti-tumor effect Effects 0.000 description 2
- 238000000376 autoradiography Methods 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 150000001615 biotins Chemical class 0.000 description 2
- 229950006844 bizelesin Drugs 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 125000001309 chloro group Chemical group Cl* 0.000 description 2
- 238000004440 column chromatography Methods 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000009146 cooperative binding Effects 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 231100000135 cytotoxicity Toxicity 0.000 description 2
- 230000003013 cytotoxicity Effects 0.000 description 2
- 125000001295 dansyl group Chemical group [H]C1=C([H])C(N(C([H])([H])[H])C([H])([H])[H])=C2C([H])=C([H])C([H])=C(C2=C1[H])S(*)(=O)=O 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000003795 desorption Methods 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 239000003480 eluent Substances 0.000 description 2
- 238000012869 ethanol precipitation Methods 0.000 description 2
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 2
- 229960005542 ethidium bromide Drugs 0.000 description 2
- NOTZYDYZBOBDFE-UHFFFAOYSA-N ethyl 1-methylimidazole-2-carboxylate Chemical compound CCOC(=O)C1=NC=CN1C NOTZYDYZBOBDFE-UHFFFAOYSA-N 0.000 description 2
- CQRVNNMJJDJXPE-UHFFFAOYSA-N ethyl 4-amino-1-methylimidazole-2-carboxylate;hydrochloride Chemical compound Cl.CCOC(=O)C1=NC(N)=CN1C CQRVNNMJJDJXPE-UHFFFAOYSA-N 0.000 description 2
- WBJINCZRORDGAQ-UHFFFAOYSA-N ethyl formate Chemical compound CCOC=O WBJINCZRORDGAQ-UHFFFAOYSA-N 0.000 description 2
- IMBKASBLAKCLEM-UHFFFAOYSA-L ferrous ammonium sulfate (anhydrous) Chemical compound [NH4+].[NH4+].[Fe+2].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O IMBKASBLAKCLEM-UHFFFAOYSA-L 0.000 description 2
- 239000006260 foam Substances 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 238000007429 general method Methods 0.000 description 2
- 229940096919 glycogen Drugs 0.000 description 2
- 125000000623 heterocyclic group Chemical group 0.000 description 2
- 239000000710 homodimer Substances 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 239000005457 ice water Substances 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 229940074320 iso-sulfan blue Drugs 0.000 description 2
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- GDOPTJXRTPNYNR-UHFFFAOYSA-N methyl-cyclopentane Natural products CC1CCCC1 GDOPTJXRTPNYNR-UHFFFAOYSA-N 0.000 description 2
- ZUHZZVMEUAUWHY-UHFFFAOYSA-N n,n-dimethylpropan-1-amine Chemical compound CCCN(C)C ZUHZZVMEUAUWHY-UHFFFAOYSA-N 0.000 description 2
- DMOMFSUPEGKPKO-UHFFFAOYSA-N n-[5-[[5-[3-(dimethylamino)propylcarbamoyl]-1-methylpyrrol-3-yl]carbamoyl]-1-methylpyrrol-3-yl]-1-methylimidazole-2-carboxamide Chemical compound CN1C(C(=O)NCCCN(C)C)=CC(NC(=O)C=2N(C=C(NC(=O)C=3N(C=CN=3)C)C=2)C)=C1 DMOMFSUPEGKPKO-UHFFFAOYSA-N 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- 125000004433 nitrogen atom Chemical group N* 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000007248 oxidative elimination reaction Methods 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 239000003208 petroleum Substances 0.000 description 2
- 210000002381 plasma Anatomy 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- BWHMMNNQKKPAPP-UHFFFAOYSA-L potassium carbonate Chemical compound [K+].[K+].[O-]C([O-])=O BWHMMNNQKKPAPP-UHFFFAOYSA-L 0.000 description 2
- 238000002953 preparative HPLC Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- WRHZVMBBRYBTKZ-UHFFFAOYSA-N pyrrole-2-carboxylic acid Chemical compound OC(=O)C1=CC=CN1 WRHZVMBBRYBTKZ-UHFFFAOYSA-N 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- NLUFDZBOHMOBOE-UHFFFAOYSA-M sodium;2-[[4-(diethylamino)phenyl]-(4-diethylazaniumylidenecyclohexa-2,5-dien-1-ylidene)methyl]benzene-1,4-disulfonate Chemical compound [Na+].C1=CC(N(CC)CC)=CC=C1C(C=1C(=CC=C(C=1)S([O-])(=O)=O)S([O-])(=O)=O)=C1C=CC(=[N+](CC)CC)C=C1 NLUFDZBOHMOBOE-UHFFFAOYSA-M 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 229950009902 stallimycin Drugs 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- VZGDMQKNWNREIO-UHFFFAOYSA-N tetrachloromethane Chemical compound ClC(Cl)(Cl)Cl VZGDMQKNWNREIO-UHFFFAOYSA-N 0.000 description 2
- ACOJCCLIDPZYJC-UHFFFAOYSA-M thiazole orange Chemical compound CC1=CC=C(S([O-])(=O)=O)C=C1.C1=CC=C2C(C=C3N(C4=CC=CC=C4S3)C)=CC=[N+](C)C2=C1 ACOJCCLIDPZYJC-UHFFFAOYSA-M 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 2
- PVFOMCVHYWHZJE-UHFFFAOYSA-N trichloroacetyl chloride Chemical compound ClC(=O)C(Cl)(Cl)Cl PVFOMCVHYWHZJE-UHFFFAOYSA-N 0.000 description 2
- RIOQSEWOXXDEQQ-UHFFFAOYSA-N triphenylphosphine Chemical compound C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 RIOQSEWOXXDEQQ-UHFFFAOYSA-N 0.000 description 2
- 238000002211 ultraviolet spectrum Methods 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- OGNSCSPNOLGXSM-UHFFFAOYSA-N (+/-)-DABA Natural products NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 description 1
- ADFXKUOMJKEIND-UHFFFAOYSA-N 1,3-dicyclohexylurea Chemical compound C1CCCCC1NC(=O)NC1CCCCC1 ADFXKUOMJKEIND-UHFFFAOYSA-N 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- ASOKPJOREAFHNY-UHFFFAOYSA-N 1-Hydroxybenzotriazole Chemical compound C1=CC=C2N(O)N=NC2=C1 ASOKPJOREAFHNY-UHFFFAOYSA-N 0.000 description 1
- GZBSJWBQXNCOFP-UHFFFAOYSA-N 1-methyl-4-[(2-methylpropan-2-yl)oxycarbonylamino]imidazole-2-carboxylic acid Chemical compound CN1C=C(NC(=O)OC(C)(C)C)N=C1C(O)=O GZBSJWBQXNCOFP-UHFFFAOYSA-N 0.000 description 1
- WLDPWZQYAVZTTP-UHFFFAOYSA-N 1-methyl-imidazole-2-carboxylic acid Chemical compound CN1C=CN=C1C(O)=O WLDPWZQYAVZTTP-UHFFFAOYSA-N 0.000 description 1
- RQPUFMWMEOSRRU-UHFFFAOYSA-N 1-n"-methyl-1-n"-propylpropane-1,1,1-triamine Chemical compound CCCN(C)C(N)(N)CC RQPUFMWMEOSRRU-UHFFFAOYSA-N 0.000 description 1
- YRBRVMGXTWJLBZ-UHFFFAOYSA-N 1h-imidazole;1h-pyrrole Chemical compound C=1C=CNC=1.C1=CNC=N1 YRBRVMGXTWJLBZ-UHFFFAOYSA-N 0.000 description 1
- WASYNLWEPOHNBM-UHFFFAOYSA-N 2,2,2-trichloro-1-(1-methyl-4-nitropyrrol-2-yl)ethanone Chemical compound CN1C=C([N+]([O-])=O)C=C1C(=O)C(Cl)(Cl)Cl WASYNLWEPOHNBM-UHFFFAOYSA-N 0.000 description 1
- BLCJBICVQSYOIF-UHFFFAOYSA-N 2,2-diaminobutanoic acid Chemical compound CCC(N)(N)C(O)=O BLCJBICVQSYOIF-UHFFFAOYSA-N 0.000 description 1
- OSBLTNPMIGYQGY-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid;boric acid Chemical compound OB(O)O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O OSBLTNPMIGYQGY-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- GLVYLTSKTCWWJR-UHFFFAOYSA-N 2-carbonoperoxoylbenzoic acid Chemical compound OOC(=O)C1=CC=CC=C1C(O)=O GLVYLTSKTCWWJR-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical group OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- YEDUAINPPJYDJZ-UHFFFAOYSA-N 2-hydroxybenzothiazole Chemical compound C1=CC=C2SC(O)=NC2=C1 YEDUAINPPJYDJZ-UHFFFAOYSA-N 0.000 description 1
- QCQCHGYLTSGIGX-GHXANHINSA-N 4-[[(3ar,5ar,5br,7ar,9s,11ar,11br,13as)-5a,5b,8,8,11a-pentamethyl-3a-[(5-methylpyridine-3-carbonyl)amino]-2-oxo-1-propan-2-yl-4,5,6,7,7a,9,10,11,11b,12,13,13a-dodecahydro-3h-cyclopenta[a]chrysen-9-yl]oxy]-2,2-dimethyl-4-oxobutanoic acid Chemical compound N([C@@]12CC[C@@]3(C)[C@]4(C)CC[C@H]5C(C)(C)[C@@H](OC(=O)CC(C)(C)C(O)=O)CC[C@]5(C)[C@H]4CC[C@@H]3C1=C(C(C2)=O)C(C)C)C(=O)C1=CN=CC(C)=C1 QCQCHGYLTSGIGX-GHXANHINSA-N 0.000 description 1
- UBOXAPSORVLBPX-UHFFFAOYSA-N 4-amino-1-methylpyrrole-2-carboxylic acid;hydrochloride Chemical compound Cl.CN1C=C(N)C=C1C(O)=O UBOXAPSORVLBPX-UHFFFAOYSA-N 0.000 description 1
- JVVRCYWZTJLJSG-UHFFFAOYSA-N 4-dimethylaminophenol Chemical compound CN(C)C1=CC=C(O)C=C1 JVVRCYWZTJLJSG-UHFFFAOYSA-N 0.000 description 1
- 229960000549 4-dimethylaminophenol Drugs 0.000 description 1
- VHYFNPMBLIVWCW-UHFFFAOYSA-N 4-dimethylaminopyridine Substances CN(C)C1=CC=NC=C1 VHYFNPMBLIVWCW-UHFFFAOYSA-N 0.000 description 1
- HVCNXQOWACZAFN-UHFFFAOYSA-N 4-ethylmorpholine Chemical compound CCN1CCOCC1 HVCNXQOWACZAFN-UHFFFAOYSA-N 0.000 description 1
- JCLFHZLOKITRCE-UHFFFAOYSA-N 4-pentoxyphenol Chemical compound CCCCCOC1=CC=C(O)C=C1 JCLFHZLOKITRCE-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 108020003591 B-Form DNA Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 0 C*1C(C(Nc2c[n](C)c(C(NCCCN(C)C)=O)c2)=O)=CC(NC(c2cc(NC(c3cc(NC(c4cc(NC(c5cc(NC(c6cc(NC(c7ncc[n]7C)=O)c[n]6C)=O)c[n]5C)=O)c[n]4C)=O)c[n]3C)=O)c[n]2C)=O)=C1 Chemical compound C*1C(C(Nc2c[n](C)c(C(NCCCN(C)C)=O)c2)=O)=CC(NC(c2cc(NC(c3cc(NC(c4cc(NC(c5cc(NC(c6cc(NC(c7ncc[n]7C)=O)c[n]6C)=O)c[n]5C)=O)c[n]4C)=O)c[n]3C)=O)c[n]2C)=O)=C1 0.000 description 1
- OPHHOJIFDDODSK-UHFFFAOYSA-N CN(C)CCCNC(c1cc(NC(c2cc(NC(c3cc(NC(c4cc(NC(c5ncc[n]5C)O)c[n]4C)=O)c[n]3C)O)c[n]2C)=O)c[n]1C)=O Chemical compound CN(C)CCCNC(c1cc(NC(c2cc(NC(c3cc(NC(c4cc(NC(c5ncc[n]5C)O)c[n]4C)=O)c[n]3C)O)c[n]2C)=O)c[n]1C)=O OPHHOJIFDDODSK-UHFFFAOYSA-N 0.000 description 1
- AUKLDGJGCSGWDR-UHFFFAOYSA-N CN(C)CCNC(c1cc(NC(c2cc(NC(c3cc(NC(c4ncc[n]4C)=O)c[n]3C)=O)c[n]2C)=O)c[n]1C)=O Chemical compound CN(C)CCNC(c1cc(NC(c2cc(NC(c3cc(NC(c4ncc[n]4C)=O)c[n]3C)=O)c[n]2C)=O)c[n]1C)=O AUKLDGJGCSGWDR-UHFFFAOYSA-N 0.000 description 1
- 101100440271 Caenorhabditis elegans ccf-1 gene Proteins 0.000 description 1
- 101100516554 Caenorhabditis elegans nhr-5 gene Proteins 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical group NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 239000012624 DNA alkylating agent Substances 0.000 description 1
- 229940126161 DNA alkylating agent Drugs 0.000 description 1
- 230000007118 DNA alkylation Effects 0.000 description 1
- 230000008836 DNA modification Effects 0.000 description 1
- 230000001482 DNA photocleavage Effects 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 241001475178 Dira Species 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- QTANTQQOYSUMLC-UHFFFAOYSA-O Ethidium cation Chemical class C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 QTANTQQOYSUMLC-UHFFFAOYSA-O 0.000 description 1
- PIICEJLVQHRZGT-UHFFFAOYSA-N Ethylenediamine Chemical compound NCCN PIICEJLVQHRZGT-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 238000005033 Fourier transform infrared spectroscopy Methods 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 102000009331 Homeodomain Proteins Human genes 0.000 description 1
- 108010048671 Homeodomain Proteins Proteins 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- RAXXELZNTBOGNW-UHFFFAOYSA-O Imidazolium Chemical compound C1=C[NH+]=CN1 RAXXELZNTBOGNW-UHFFFAOYSA-O 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 239000006391 Luria-Bertani Medium Substances 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 239000012901 Milli-Q water Substances 0.000 description 1
- 101100010166 Mus musculus Dok3 gene Proteins 0.000 description 1
- 101100030361 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) pph-3 gene Proteins 0.000 description 1
- 108091081548 Palindromic sequence Proteins 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- UIIMBOGNXHQVGW-DEQYMQKBSA-M Sodium bicarbonate-14C Chemical compound [Na+].O[14C]([O-])=O UIIMBOGNXHQVGW-DEQYMQKBSA-M 0.000 description 1
- 241000133426 Streptomyces zelensis Species 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 102000006467 TATA-Box Binding Protein Human genes 0.000 description 1
- 108010044281 TATA-Box Binding Protein Proteins 0.000 description 1
- 102000002463 Transcription Factor TFIIIB Human genes 0.000 description 1
- 108010068071 Transcription Factor TFIIIB Proteins 0.000 description 1
- NQPDZGIKBAWPEJ-UHFFFAOYSA-N Valeric acid Natural products CCCCC(O)=O NQPDZGIKBAWPEJ-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- IBZPKQZTZXRCKY-UHFFFAOYSA-N [5-(2-amino-6-oxo-3h-purin-9-yl)-2-(phosphonooxymethyl)oxolan-3-yl] [5-(4-amino-2-oxopyrimidin-1-yl)-3-hydroxyoxolan-2-yl]methyl hydrogen phosphate Chemical compound O=C1N=C(N)C=CN1C1OC(COP(O)(=O)OC2C(OC(C2)N2C3=C(C(N=C(N)N3)=O)N=C2)COP(O)(O)=O)C(O)C1 IBZPKQZTZXRCKY-UHFFFAOYSA-N 0.000 description 1
- YEVSRNMRARXQOG-UHFFFAOYSA-N [chloro(fluoro)-$l^{3}-chloranyl]formyl chloride Chemical compound FCl(Cl)C(Cl)=O YEVSRNMRARXQOG-UHFFFAOYSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- BHELZAPQIKSEDF-UHFFFAOYSA-N allyl bromide Chemical compound BrCC=C BHELZAPQIKSEDF-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- UIGRBLXJJABSCB-UHFFFAOYSA-N benzo[e]indol-4-one Chemical compound O=C1C=C2C=CC=CC2=C2C1=NC=C2 UIGRBLXJJABSCB-UHFFFAOYSA-N 0.000 description 1
- AGEZXYOZHKGVCM-UHFFFAOYSA-N benzyl bromide Chemical compound BrCC1=CC=CC=C1 AGEZXYOZHKGVCM-UHFFFAOYSA-N 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 230000003592 biomimetic effect Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- OWMVSZAMULFTJU-UHFFFAOYSA-N bis-tris Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000005013 brain tissue Anatomy 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- HXCHCVDVKSCDHU-LULTVBGHSA-N calicheamicin Chemical compound C1[C@H](OC)[C@@H](NCC)CO[C@H]1O[C@H]1[C@H](O[C@@H]2C\3=C(NC(=O)OC)C(=O)C[C@](C/3=C/CSSSC)(O)C#C\C=C/C#C2)O[C@H](C)[C@@H](NO[C@@H]2O[C@H](C)[C@@H](SC(=O)C=3C(=C(OC)C(O[C@H]4[C@@H]([C@H](OC)[C@@H](O)[C@H](C)O4)O)=C(I)C=3C)OC)[C@@H](O)C2)[C@@H]1O HXCHCVDVKSCDHU-LULTVBGHSA-N 0.000 description 1
- 229930195731 calicheamicin Natural products 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- PFKFTWBEEFSNDU-UHFFFAOYSA-N carbonyldiimidazole Chemical compound C1=CN=CN1C(=O)N1C=CN=C1 PFKFTWBEEFSNDU-UHFFFAOYSA-N 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- SMNPLAKEGAEPJD-UHFFFAOYSA-N chembl34922 Chemical compound Cl.Cl.Cl.C1CN(C)CCN1C1=CC=C(NC(=N2)C=3C=C4N=C(NC4=CC=3)C=3C=CC(O)=CC=3)C2=C1 SMNPLAKEGAEPJD-UHFFFAOYSA-N 0.000 description 1
- 150000001805 chlorine compounds Chemical class 0.000 description 1
- ZYVSOIYQKUDENJ-WKSBCEQHSA-N chromomycin A3 Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@@H]1OC(C)=O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1C)O[C@@H]1O[C@H](C)[C@@H](O)[C@H](O[C@@H]2O[C@H](C)[C@@H](O)[C@H](O[C@@H]3O[C@@H](C)[C@H](OC(C)=O)[C@@](C)(O)C3)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@@H]1C[C@@H](O)[C@@H](OC)[C@@H](C)O1 ZYVSOIYQKUDENJ-WKSBCEQHSA-N 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 238000006352 cycloaddition reaction Methods 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 229910003460 diamond Inorganic materials 0.000 description 1
- MHDVGSVTJDSBDK-UHFFFAOYSA-N dibenzyl ether Chemical compound C=1C=CC=CC=1COCC1=CC=CC=C1 MHDVGSVTJDSBDK-UHFFFAOYSA-N 0.000 description 1
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- VQNATVDKACXKTF-XELLLNAOSA-N duocarmycin Chemical class COC1=C(OC)C(OC)=C2NC(C(=O)N3C4=CC(=O)C5=C([C@@]64C[C@@H]6C3)C=C(N5)C(=O)OC)=CC2=C1 VQNATVDKACXKTF-XELLLNAOSA-N 0.000 description 1
- 230000001819 effect on gene Effects 0.000 description 1
- 238000005421 electrostatic potential Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000002024 ethyl acetate extract Substances 0.000 description 1
- RIFGWPKJUGCATF-UHFFFAOYSA-N ethyl chloroformate Chemical compound CCOC(Cl)=O RIFGWPKJUGCATF-UHFFFAOYSA-N 0.000 description 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000005281 excited state Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000003818 flash chromatography Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 238000011331 genomic analysis Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 150000008282 halocarbons Chemical class 0.000 description 1
- AEDIXYWIVPYNBI-UHFFFAOYSA-N heptanamide Chemical compound CCCCCCC(N)=O AEDIXYWIVPYNBI-UHFFFAOYSA-N 0.000 description 1
- 238000001511 high performance liquid chromatography nuclear magnetic resonance spectroscopy Methods 0.000 description 1
- 239000012456 homogeneous solution Substances 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 150000002460 imidazoles Chemical class 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000002329 infrared spectrum Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000003340 mental effect Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000302 molecular modelling Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- SWVGZFQJXVPIKM-UHFFFAOYSA-N n,n-bis(methylamino)propan-1-amine Chemical compound CCCN(NC)NC SWVGZFQJXVPIKM-UHFFFAOYSA-N 0.000 description 1
- VBTQNRFWXBXZQR-UHFFFAOYSA-N n-bromoacetamide Chemical compound CC(=O)NBr VBTQNRFWXBXZQR-UHFFFAOYSA-N 0.000 description 1
- YRDFLCZJNHSYIQ-UHFFFAOYSA-N n-iodonaphthalen-1-amine Chemical compound C1=CC=C2C(NI)=CC=CC2=C1 YRDFLCZJNHSYIQ-UHFFFAOYSA-N 0.000 description 1
- XOOMNEFVDUTJPP-UHFFFAOYSA-N naphthalene-1,3-diol Chemical compound C1=CC=CC2=CC(O)=CC(O)=C21 XOOMNEFVDUTJPP-UHFFFAOYSA-N 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 101150009274 nhr-1 gene Proteins 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 229910017604 nitric acid Inorganic materials 0.000 description 1
- 238000007339 nucleophilic aromatic substitution reaction Methods 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 239000012044 organic layer Substances 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- HUPQYPMULVBQDL-UHFFFAOYSA-N pentanoic acid Chemical compound CCCCC(O)=O.CCCCC(O)=O HUPQYPMULVBQDL-UHFFFAOYSA-N 0.000 description 1
- 239000012466 permeate Substances 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 238000003322 phosphorimaging Methods 0.000 description 1
- 230000000886 photobiology Effects 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920000515 polycarbonate Polymers 0.000 description 1
- 239000004417 polycarbonate Substances 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920000379 polypropylene carbonate Polymers 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 229910000027 potassium carbonate Inorganic materials 0.000 description 1
- 239000013636 protein dimer Substances 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 150000003220 pyrenes Chemical class 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- DOYOPBSXEIZLRE-UHFFFAOYSA-N pyrrole-3-carboxylic acid Natural products OC(=O)C=1C=CNC=1 DOYOPBSXEIZLRE-UHFFFAOYSA-N 0.000 description 1
- 150000003233 pyrroles Chemical class 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 239000012048 reactive intermediate Substances 0.000 description 1
- 238000010992 reflux Methods 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- RZWQDAUIUBVCDD-UHFFFAOYSA-M sodium;benzenethiolate Chemical compound [Na+].[S-]C1=CC=CC=C1 RZWQDAUIUBVCDD-UHFFFAOYSA-M 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- LXMSZDCAJNLERA-ZHYRCANASA-N spironolactone Chemical compound C([C@@H]1[C@]2(C)CC[C@@H]3[C@@]4(C)CCC(=O)C=C4C[C@H]([C@@H]13)SC(=O)C)C[C@@]21CCC(=O)O1 LXMSZDCAJNLERA-ZHYRCANASA-N 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 208000026082 sterile multifocal osteomyelitis with periostitis and pustulosis Diseases 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 125000004213 tert-butoxy group Chemical group [H]C([H])([H])C(O*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 238000006257 total synthesis reaction Methods 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- DBGVGMSCBYYSLD-UHFFFAOYSA-N tributylstannane Chemical compound CCCC[SnH](CCCC)CCCC DBGVGMSCBYYSLD-UHFFFAOYSA-N 0.000 description 1
- ILWRPSCZWQJDMK-UHFFFAOYSA-N triethylazanium;chloride Chemical compound Cl.CCN(CC)CC ILWRPSCZWQJDMK-UHFFFAOYSA-N 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 238000009424 underpinning Methods 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D403/00—Heterocyclic compounds containing two or more hetero rings, having nitrogen atoms as the only ring hetero atoms, not provided for by group C07D401/00
- C07D403/14—Heterocyclic compounds containing two or more hetero rings, having nitrogen atoms as the only ring hetero atoms, not provided for by group C07D401/00 containing three or more hetero rings
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/62—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being a protein, peptide or polyamino acid
- A61K47/64—Drug-peptide, drug-protein or drug-polyamino acid conjugates, i.e. the modifying agent being a peptide, protein or polyamino acid which is covalently bonded or complexed to a therapeutically active agent
- A61K47/645—Polycationic or polyanionic oligopeptides, polypeptides or polyamino acids, e.g. polylysine, polyarginine, polyglutamic acid or peptide TAT
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D207/00—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom
- C07D207/02—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom
- C07D207/30—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having two double bonds between ring members or between ring members and non-ring members
- C07D207/34—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having two double bonds between ring members or between ring members and non-ring members with hetero atoms or with carbon atoms having three bonds to hetero atoms with at the most one bond to halogen, e.g. ester or nitrile radicals, directly attached to ring carbon atoms
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D233/00—Heterocyclic compounds containing 1,3-diazole or hydrogenated 1,3-diazole rings, not condensed with other rings
- C07D233/54—Heterocyclic compounds containing 1,3-diazole or hydrogenated 1,3-diazole rings, not condensed with other rings having two double bonds between ring members or between ring members and non-ring members
- C07D233/66—Heterocyclic compounds containing 1,3-diazole or hydrogenated 1,3-diazole rings, not condensed with other rings having two double bonds between ring members or between ring members and non-ring members with hetero atoms or with carbon atoms having three bonds to hetero atoms with at the most one bond to halogen, e.g. ester or nitrile radicals, directly attached to ring carbon atoms
- C07D233/90—Carbon atoms having three bonds to hetero atoms with at the most one bond to halogen, e.g. ester or nitrile radicals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06139—Dipeptides with the first amino acid being heterocyclic
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06139—Dipeptides with the first amino acid being heterocyclic
- C07K5/06182—Dipeptides with the first amino acid being heterocyclic and Pristinamycin II; Derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
-
- C—CHEMISTRY; METALLURGY
- C08—ORGANIC MACROMOLECULAR COMPOUNDS; THEIR PREPARATION OR CHEMICAL WORKING-UP; COMPOSITIONS BASED THEREON
- C08G—MACROMOLECULAR COMPOUNDS OBTAINED OTHERWISE THAN BY REACTIONS ONLY INVOLVING UNSATURATED CARBON-TO-CARBON BONDS
- C08G69/00—Macromolecular compounds obtained by reactions forming a carboxylic amide link in the main chain of the macromolecule
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6839—Triple helix formation or other higher order conformations in hybridisation assays
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Polymers & Plastics (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Epidemiology (AREA)
- Pharmacology & Pharmacy (AREA)
- General Engineering & Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Polyamides (AREA)
Description
Preparation and Use of Bifunctional Molecules that Bind to Predetermined DNA Molecules Field of the Invention This invention relates to the fields of molecular biology, biochemistry, and drug design. More particularly, the present invention provides synthetic polyamides containing pyrrole and imidazole amino acids which bind specific base pair sequences of double helical DNA with affinities and specificities comparable to DNA binding proteins such as the transcription factors. A series of molecular templates are described which allow for rational targeting of any predetermined DNA sequence .00o e o* 0* [R:\LIBC]07376.doc:bav WO 98/49142 PCT/US98/06997 of therapeutic potential. This non-biological approach to DNA recognition provides an underpinning for the design of synthetic cell-permeable ligands for the control of gene-expression.
BACKGROUND OF THE INVENTION In every human cell, genetic information is stored on a string-like DNA polymer which is approximately 1 meter in length and contains 3 x 109 units of information in the form of base pairs, within which is encoded approximately 80,000 to 100,000 genes or sets of instructions. (Watson, J.D. Gene, 135, 309-315 (1993).) The specific interaction of proteins such as transcription factors with DNA controls the regulation of genes and hence cellular processes. (Roeder, R.G. TIBS, 9, 327-335 (1996).) A wide variety of human conditions ranging from cancer to viral infection arise from malfunctions in the biochemical machinery that regulates gene-expression. Tjian, Sci. Am., 2, 54-61 (1995).) Designed small molecules which target specific DNA sequences offer a potentially general approach for genespecific regulation. (Gottesfeld, et al. Nature Accepted.
(1997). Such molecules could be powerful therapeutics for combating life threatening diseases which result from misregulation in transcription.
Designed bifunctional small molecules which target specific DNA sequences offer a potentially general approach for gene-specific, sequence-specific, or organism specific modification, detection or capture of plasmids, genes, cDNA, cosmids, or chromosomes. More specifically, a life threatening disease may result from a single error within the 3 x 109 units of information stored within the double helix. Sequence-specific polyamides may discriminate such small errors, hence WO 98/49142 PCT/US98/06997 bifunctional polyamides could have broad diagnostic applications which range from determining the molecular basis of life threatening diseases to sequence-specific visualization of disease genes in living organisms.
The genetic information is in fact, stored on two stands of DNA (in antiparallel orientation) in a structure termed the double helix. The DNA double helix consists of A,T and G,C base pairs held together by specific Watson-Crick hydrogen bonds like rungs on a twisted ladder. (Dickerson, et al. Science, 216, 475 (1982). The common B-form of DNA is characterized by a wide (12A) and shallow major groove and a deep and narrow (4-6 A) minor. Individual sequences may be distinguished by the pattern of hydrogen bond donors and acceptors displayed on the edges of the base pairs. (Principles of Nucleic Acid Structure Sanger, Springer-Verlag, New York, 1984.) In the minor groove, the A,T base pair presents two symmetrically placed hydrogen bond acceptors in the minor groove, the purine N3 and the pyrimidine 02 atoms. The G,C base pair presents these two acceptors, but in addition presents a hydrogen bond donor, the 2amino group of guanine (Steitz, T.A. Quart. Rev. Biophys.
23, 205).
Small molecules isolated from natural sources which bind DNA are found to be a structurally diverse class, as evidenced by consideration of four representative molecules, chromomycin, distamycin, actinomycin D, and calicheamicin. (Gao, et al. J. Mol. Biol. 223, 259-279.
(1992); Kamitori, et al. J. Mol. Biol. 225, 445-456 (1992); Paloma, et al. J. Am. Chem. Soc. 116, 3697-3708 (1994); Coll, et al. Proc. Natl. Acad. Sci. U.S.A. 84, 8385-8389 (1987)). There is no simple natural recognition code for the readout of specific sequences of DNA.
WO 98/49142 PCT/US98/06997 The structures of four small molecules isolated from natural sources are shown in Figure 1. Among these DNAbinding molecules, distamycin is distinguished by its structural simplicity, having no chiral centers and an oligopyrrolecarboxamide core structure. (Zimmer, C.
Prog. Nucleic Acid Res. Mol. Biol. (1975) 15, 285; Baguley, B.C. Molecular and Cellular Biochemistry (1982) 43, 167-181; Zimmer, et al., Prog. Biophy. Mol. Biol. 47, 31 (1986)). Structural studies of distamycin-DNA complexes reveal modular complexes in which adjacent pyrrolecarboxamides makes similar contacts with adjacent DNA base pairs. The relative simplicity of distamycin, with respect both to its chemical structure and its complexes with DNA, guided the initial decision to use distamycin as a basis for designed polyamides having novel DNA-binding sequence specificity. (Dervan, P.B.
Science 232, 464-471 (1986).) A schematic representation of recognition of A,T rich sequences in the minor groove by Distamycin is shown below:
NHJ+
H
2 N NH 2 H 2
_H
T :-rHN O O N
N
T N T/ N H
P
N .H HN -N A 0' 0 NHN T r 0 3' 5' 3
H
+NH
2 1:I Distamycin*DNA Complex 2:1 Distamycin*DNA Complex Two distinct DNA binding modes exist for Distamycin A. In the first binding mode, a single molecule of Distamycin bindsin the middle of the minor groove of a WO 98/49142 PCT/US98/06997 ba.se pair A,T rich sequence. The amide hydrogens of the N-methylpyrrole-carboxamides form bifurcated hydrogen bonds with Adenine N3 and thymine 02 atoms on the floor of the minor groove.
10 In the second binding mode, 2 distamycin ligands form an antiparallel side-by-side dimer in the minor groove of a 5 base pair A,T rich site.
(Pelton, J.G. Wemmer, D.E. (1989) Proc. Natl. Acad.
Sci. 86, 5723-5727; Pelton, J.G. Wemmer, D.E. (1990) J.
Am. Chem. Soc. 112, 1393-1399; Chen, et al. (1994).
Nature Struct. Biol. 1, 169-175.) In the 2:1 model each polyamide subunit forms hydrogen bonds to a unique DNA strand in the minor groove.
Polyamides containing N-methylpyrrole (Py) and Nmethylimidazole (Im) amino acids provide a model for the design of artificial molecules for recognition of double helical DNA. For side-by-side complexes of Py/Impolyamides in the minor groove of DNA, the DNA binding sequence specificity depends on the sequence of side-byside amino acid pairings. (Wade, et al. (1992). J. Am.
Chem. Soc. 114, 8783-8794; Mrksich, et al. (1992). Proc.
Natl. Acad. Sci. U.S.A. 89, 7586-7590; Wade, W.S., Mrksich, M. Dervan, P.B. (1993); Biochemistry 32, 11385-11389 (1993)). A pairing of Im opposite Py targets a G*C base pair while a pairing of Py opposite Im targets a C*G base pair. A Py/Py combination is degenerate targeting both A*T and T*A base pairs. Specificity for G,C base pairs results from the formation of a putative hydrogen bond between the imidazole N3 and the exocyclic amine group of guanine. Validity of the pairing rules is supported by a variety of footprinting and NMR structure studies. (Mrksich, et al., J. Am. Chem. Soc., 115, 2572 (1993); Geierstanger, et al. Science, 266, 646 (1994); Mrksich et al., J. Am. Chem. Soc., 117, 3325 (1995).) WO 98/49142 PCT/US98/06997 A schematic representation of the polyamide pairing rules is shown below:
N-H
o o Py Im targets C.G N
HN
N
H N
N\
NH N Py Py targets A-T and T*A N .G Ha'-
HN
N
-N
.QNI Im Py targets G*C 0
SN-H
3' In parallel with the elucidation of the scope and limitations of the pairing rules, efforts have been made to increase the DNA-binding affinity and specificity of pyrrole-imidazole polyamides by covalently linking polyamide subunits. (Mrksich, M. Dervan, P.B. (1993).
J. Am. Chem. Soc. 115, 9892-9899; Dwyer, et al. (1993).
J. Am. Chem. Soc. 115, 9900-9906; Mrksich, M. Dervan, P.B. (1994). J. Am. Chem. Soc. 116, 3663-3664; Chen, Y.H.
and Lown, J.W. (1994) J. Am. Chem. Soc. 116, 6995-7005.
Chen, Y.H. and Lown, J.W. Heterocycles 41, 1691-1707 (1995). Geierstanger, et al., Nature Structural Biology, 3, 321 (1996). Chen, et al. J. Biomol. Struct. Dyn. 14, 341-355 (1996); Cho, et al. Proc. Natl. Acad. Sci. USA, 92, 10389 (1995)). A simple hairpin polyamide motif with y-aminobutyric acid serving as a turn-specific internal-guide-residue provides a synthetically accessible method of linking polyamide subunits within the 2:1 motif. The head-to-tail linked polyamide ImPyPy-y- PyPyPy-dimethylaminopropylamide (Dp) was shown to specifically bind the designated target site 5'-TGTTA-3' with an equilibrium association constant of Ka 8 x 10 7 WO 98/49142 PCT/US98/06997
M
1 an increase of 300-fold relative to the unlinked three-ring polyamide pair ImPyPy and PyPyPy. (Mrksich, et al. J. Am. Chem. Soc. 116, 7983-7988). The hairpin polyamide model is supported by footprinting, affinity cleaving and NMR structure studies. (Church, et al.
Biochemistry 1990, 29, 6827; He, et al. J. Am. Chem. Soc.
1993, 115, 7061; de Clairac, et al. J. Am. Chem. Soc.
submitted.) A schematic representation of recognition of a TGTTA-3' sequence by unlinked subunits (left) and yaminobutyric acid linked subunits (right) is shown below: NH-- HO 0 O 0 H H <NH- N H n
SNH---NH-A
N N 0 I T N T HN 'N T
-HN
iN^ 0 H v i G H iI G 0
H
2 N 0 3' N"
O
+NH2 5' 3' H ImPyPy-Dp PyPyPy TGTTA ImPyPy-y-PyPyPy-Dp
TGTTA
Ka 2 x 10 M K 8 x 10 7
M
Closing the ends of the hairpin to form a cyclic polyamide increases the overall energetics for DNAbinding presumably by restricting conformational space for the molecule. (Lown, J.W. and Krowicki, K. J. Org.
Chem. 1985, 50, 3774.) A cyclic polyamide cyclo-(ImPyPy-y- PyPyPy-y-) was shown to specifically bind the designated target site 5'-TGTTA-3' with an equilibrium association constant of Ka 2.9 x 109 M-1, an increase of relative to the corresponding hairpin polyamide of WO 98/49142 PCT/US98/06997 sequence composition ImPyPy-y-PyPyPy. The sequencespecificity versus single base pair mismatch sites drops from 30-fold for the hairpin polyamide to 2-fold for the cyclic polyamide.
A schematic representation of a cyclic polyamide recognizing the minor groove is shown below: 0 0 0 0 NH- .4 HN NH-A HN N O H N
-N
T H T H N N NH N 3 0 5 0 NH 2 ImyPy-y-PyPyPy-Dp 5'-TGTTA-3' cyclo-(ImPyPy-y-PyPyPy-y-) 5'-TGTTA-3' Ka 7 x 10 7 M- K 2 x 10 M- 1 specificity: 30-fold specificity: 2-fold Despite the design breakthrough in molecular recognition of DNA, the binding affinities of linked and unlinked polyamide dimers of the prior art are modest when compared to those found with natural DNA binding proteins. (Clemens, et al. J. Mol. Biol. 244, 23-35 (1994)). For example DNA-binding transcription factors recognize their cognate sites at subnanomolar concentrations. (Jamieson, et al. Biochemistry 33, 5689- 5695 (1994); Choo, Y. and Klug, A. Proc.Natl. Acad. Sci.
U.S.A. 91, 11168-11172 (1994); Greisman, H.A. and Pabo, C.O. Science 275, 657-661 (1997)). Six-ring hairpin polyamides require concentrations greater than 10 nM to occupy their target sites. The only class of polyamides described in the prior art with affinities similar to DNA-binding proteins are the 6-ring cyclic polyamides; however, this class of molecules lacks the sequence- WO 98/49142 PCT/US98/06997 specificity of proteins an energetic penalty for binding a single base pair mismatch site) and therefore currently has no potential for therapeutic applications.
Two prior approaches for the development of synthetic transcriptional antagonists have been reported.
Oligodeoxynucleotides which recognize the major groove of double helical DNA via triple helix formation bind a broad sequence repertoire with high affinity and specificity (Moser, H. E. Dervan, P. B. Science 238, 645-650 (1987); Thuong, et al. Angew. Chem. Int. Ed.
Engl. 32, 666-690 (1993)). Although oligonucleotides and their analogs have been shown to interfere with gene expression (Maher, et al. Biochemistry 31, 70-81 (1992); Duvalvalentin, et al. Proc. Natl. Acad. Sci. U.S.A. 89, 504-508 (1992)). The triple helix approach is limited to purine tracks and suffers from poor cellular uptake.
There are a few examples of cell-permeable carbohydrate based ligands that interfere with transcription factor function. (Ho, et al. Proc. Natl. Acad. Sci. USA 91, 9203-9207 (1994); Liu, C. et al. Proc. Natl. Acad. Sci.
USA 93, 940-944 (1996)). However oligosaccharides are not yet amenable to recognition of a broad range of predetermined DNA sequences.
Because of the small size and hydrophobic nature of polyamides (MW 1200) and because the parent ligand Distamycin is itself cell-permeable these ligands have the potential to underpin a new field of small molecule regulation of gene expression. It remained to be determined if low molecular weight (MW 1200) pyrroleimidazole polyamides could be constructed which recognize predetermined DNA sites at subnanomolar concentrations without compromising sequence-selectivity.
Summary of the Invention This invention provides improved polyamides for selectively binding a DNA molecule.
According to a first embodiment of the invention, there is provided a polyamide of the formula R z X Z R4
N
N
R1 O R* \1 0 /n RR n or a pharmaceutically acceptable salt thereof, wherein: n is 5-11; 10 each Y is independently selected from the group consisting of -NH- and -NH---(CH 2
)--CH
2
-C-NH
where p is 0 to 5, provided that at least one Y is not -NH-; Z is
II
CH2-Cwherein r is 0 to each X is independently selected from the group consisting CH, COH and N; each R is independently selected from the group consisting of H, unsubstituted C 1 6 alkyl, and Cl-6alkyl-L; each R 2 is independently selected from the group consisting of H, NH2, SH, Cl, Br, F, N-acetyl and N-formyl;
R
4 is -NH(CH 2 )0- 6
NR
5
R
6 or -NH(CH 2 )0-6CONH(CH 2 0 6NR 5
R
6 where R 5 and
R
6 are independently selected from the group consisting of H, Cl, NO, N-acetyl, benzyl, Cl-6alkyl, Ci-6alkylamine, Cl-6alkyldiamine, C -6alkylcarboxylate, C -6alkenyl, Ci-6alkynyl and C-l 6 alkyl-L; wherein L is a detectable label, and wherein at least one.R',
R
S or R 6 is Cl.
6 alkyl-L.
According to a second embodiment of the invention, there is provided a polyamide s selected from the group consisting of IMPyPYPY-Y-PyPyPyPy-fP-p, PyPylmPy-y-PyPyPyPy-p3-Dp, ImPyPyPy-y-JrnPyPyPy-p- Dp, PylmPyPy-y-PylmPyPy-p3-Dp, ImPylmPy-y-PyPyPyPy43 -Dp, Im~mPyPy-y- PyPyPyPy-p3-Dp, Im~mIrnPy-y-PyPyPyPy-f3-Dp, JmimPyPy-y-ImPyPyPy-P-Dp, ImPyPyPy-y-ImImPyPy-P -Dp, ImIn~yPy-y-Jm~mPyPy- j3-Dp, JrnPylmPy-7-ImPylmPy.
j3-Dp, TmImImPy-y-IniPyPyPyPy.P3Dp, Inmmlm-y-PyPyPyPy-p3-Dp, Im-P3-PyPy-y-Imf3-PyPy-f3-Dp, Im-I 3 -ImIm-y-Py-P-PyPy-P-Dp, Im-f3-ImPy-y-Im--JmPyfP-p, IMPYPyPyPy-y-ImPyPyPyPypDp, Im~mPyPyPy-y-ImPyPyPyPy-P-Dp, LmPyImPyPy-y- ImPyPyPyPy-p-Dp, ImImPylrnIm-y-PyPyPyPyPy-P-Dp, ImPyPylmPy-y-TmPyPyImPyP Dp, ImnPy-f3-PyPy-y-ImPy-p-PyPy-p-Dp, ImIm-j3-Jmnm-y-PyPy-f-PyPyPDp, ImPy-p3- ImPy-y-ImPy-j3-ImPy-f3-Dp, ImPy-f3-PyPyPy-y-ImPyPy-3 -PyPy-f3-Dp, TmIm-j3-PyPyPy- 'y-PyPyPy-f3-PyPy-3-Dp, ImPy-f-ImPyPy-yImPyPyqPyPypDp, ImIm-j3-PyPyPy-y- ImImPy-j3-PyPy-f3-Dp, ImPy-f-PyPyPy-y-PyPyPy-]34mPy-4-Dp, Jyyyyy-7- ImPyPyPyPyPy-f3-Dp, InlPyPy-f3-PyPy-y-ImPyPy-3-PyPy-p3Dp, ImPyPyPy-3-Py-y-lmf3P-PyPyPyPy-f3-Dp, TmImPyPyPyPy-7-ImImPyPyPyPy-jp-Dp, Im-f3-PyPyPyPy--m-P3i PyPyPyPy-p3-Dp, ImPyPyPy-o-Py-y-ImPyPyPy-f3.Py--Dp, ImPyImPyPyPy-Y- ImPyPyPyPyPy-3-Dp, ImPyPy-I3-PyPy-y-In1Py-f-PyPyPy--Dp, ImPYPYPYPY-3-y- ImPyPyPyPy-3-f-Dp, ImPy-f3-ImPyPy-y-Inl~y-f3-TmPyPy..j3Dp, Im-j3-PyPyPyPy-y- ImPyPyPy-f3-Py-P-Dp, Im-I3-ImPyPyPy-7-ImPyPyPy-f3Py-p-Dp, JmPyPy-f3-PyPyPy-3- Dp, Jm~mPy-13-PyPyPy-VKDp, Jmhm~m-f3-PyPyPy-P3-Dp, JmPyPyPyPy-f3-PyPyPy-j3-Dp, 20 ImPyPyPy-13-PyPyPy-f3-Dp, ImPyPy-f3-PyPyPyPyPy-3-Dp, ImPyPyPy-f3-PyPyPyPy-3- Dp, JmImPyPy-P-PyPyPyPy. 1-Dp, ImImImPy-P-PyPyPyPy-P3.Dp, ImPyPyPy-3- 0 ImPyPyPy-p3-Dp, ImImPyPy-13-ImPyPyPy-j3-Dp, ImhnPyPyPy-I-PyPyPyPyPyPDp, I mImImPyPy-t-PyPyPyPyPyP3Dp, ImIm-f3-PyPy- P-PyPy-f3-PyPy-f3-Dp, ImmPy-p- PyPyPy-I3-PyPyPy-j3-Dp, ImImPyPy-j3-Py-j3-PyPyPyPy -3-Dp, JmPyPy-7-ImPyPy-p~- PyPyPy-p-Dp, ImPyPy-y-PyPyPy-P-PyPyPy 13-Dp, PylrnPy-y-ImPyPy-PJ-PyPyPy-P-Dp, PylmPy-y-ImPyPy-p3-PyPyPy f3-PyPyPy-f3-Dp, LmImPy-y-JmPyPy-Pf-PyPyPy-P3-Dp, ImPyPy-y-IrnPyPy-G-PyPyPy-P-Dp, JmPyPyPy-y-JmImImPy-3-PyPyPyPy-3-Dp, ImImPyPy-y-JmImPyPy-3-PyPyPyp3-Dp, and hlmPyPy-y-PyPyPyPy f3-PyPyPyPy-3- Dp, bound to a detectable label, wherein Im is N-methyl-imidazole, Py is methylpyrrole, P3 is f3-alanine, Dp is dimethylaminopropylamide, and ~y is y-aminobutyric acid.
Brief Description of the Figures FIGURE 1. Small molecules isolated from natural sources.
FIGURE 2. Hairpin polyamides.
FIGURE 3. Chemical structures of polyamides.
-A [1A\ yLib\LIB3FF]39058spec.doc:gcc FIGURE 4.
FIGURE 5.
FIGURE 6.
FIGURE 7.
FIGURE 8.
FIGURE 9.
FIGURE 10.
FIGURE 11.
FIGURE 12.
FIGURE 13.
FIGURE 14.
FIGURE 15.
FIGURE 16.
FIGURE 17.
15 FIGURE 18.
FIGURE 19.
FIGURE 20.
12 Solid phase synthesis of polyamides.
Extended hairpin polyamides.
Association profile of extended hairpin polyamides.
Binding models for polyamides.
Schematic binding models for eight ring hairpin polyamide.
Eight-residue hairpin polyamides.
Structure of 4-P-4 polyamides.
Recognition of DNA by 4-P-4 polyamides.
Placement of p/p pairs.
P-linked fully overlapped polyamide complexes.
10-ring hairpin polyamides.
Discrimination of seven base pair sequence by polyamides.
Hairpin polyamides that recognize seven base pair sequence.
Dnase I footprint titration.
Ni(II)-Gly-Gly-His modified polyamide.
Bromoacetylated hairpin polyamide.
Structure CC-1065 and duocarmycins.
a .a
S
0** 0 000000 0 000 0 0000 0 0000 00 0 *s 00 0000 *000 0 0 S 0 00..
00 0 00 0 00 This page left blank intentionally.
[1:\DayLib\LIBFF]39058spec.doc:gcc WO 98/49142 PCT/US98/06997 FIGURE 21.Alkylation mechanism of CC-1065.
FIGURE 22.Structure of Bizlesin and CBI.
FIGURE 23.Synthesis of CBI-polyamide conjugate.
FIGURE 24.Synthesis of bifunctional methidium-polyamide conjugates.
FIGURE 25.Synthesis of polyamide-rhodamine conjugate.
FIGURE 26.Structure of polyamide-DYE conjugates.
FIGURE 27.Synthesis of biotin-polyamide conjugates.
FIGURE 28.Bifunctional biotin-polyamide conjugates.
FIGURE 29.Affinity capture using bifunctional biotinpolyamide conjugates.
FIGURE 30.Psoralen-polyamide conjugate.
FIGURE 31.Cooperative dimerization of polyamides.
FIGURE 32.Binding of polyamides to mismatched sites.
FIGURE 33.Footprint titration of polyamides.
FIGURE 34.Generalizable polyamide motifs.
FIGURE 35.Examples of polyamides.
FIGURE 36.Determination of polyamide affinity.
FIGURE 37.N-terminally extended polyamides.
FIGURE 38.Polyamides binding 16 base pair sequence.
FIGURE 39.Determination of 16 base pair sequence.
FIGURE 40.Binding of polyamides to mismatched sites.
FIGURE 41. P-substitution in polyamides.
FIGURE 42.Affinity determinations for P-substituted polyamides.
FIGURE 43.Binding of polyamides to TATA box.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Within this _pplication, unless otherwise stated, definitions of the terms and illustration of the techniques of this application may be found in any of several well-known references such as: Sambrook, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press (1989); Goeddel, ed., Gene WO 98/49142 PCT/US98/06997 Expression Technology, Methods in Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in Deutshcer, ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, et al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA (1990); Freshney, Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed., Alan Liss, Inc. New York, NY (1987); Murray, ed., Gene Transfer and Expression Protocols, pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, Genes VI, Oxford University Press, New York (1997).
For the purposes of this application, a promoter is a regulatory sequence of DNA that is involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a segment of DNA involved in producing a peptide, polypeptide or protein, including the coding region, non-coding regions preceding ("leader") and following ("trailer") the coding region, as well as intervening non-coding sequences ("introns") between individual coding segments ("exons"). Coding refers to the representation of amino acids, start and stop signals in a three base "triplet" code. Promoters are often upstream '5 to") the transcription initiation site of the corresponding gene. Other regulatory sequences of DNA in addition to promoters are known, including sequences involved with the binding of transcription factors, including response elements that are the DNA sequences bound by inducible factors. Enhancers comprise yet another group of regulatory sequences of DNA that can increase the utilization of promoters, and can function in either orientation or and in' any location (upstream or downstream) relative to the promoter. Preferably, the regulatory sequence has a WO 98/49142 PCT/US98/06997 positive activity, binding of an endogeneous ligand a transcription factor) to the regulatory sequence increases transcription, thereby resulting in increased expression of the corresponding target gene. In such a case, interference with transcription by binding a polyamide to a regulatory sequence would reduce or abolish expression of a gene.
The promoter may also include or be adjacent to a regulatory sequence known in the art as a silencer. A silencer sequence generally has a negative regulatory effect on expression of the gene. In such a case, expression of a gene may be increased directly by using a polyamide to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a polyamide to block transcription of a factor to a silencer regulatory sequence.
It is to be understood that the polyamides of this invention bind to double stranded DNA in a sequence specific manner. The function of a segment of DNA of a given sequence, such as 5'-TATAAA-3', depends on its position relative to other functional regions in the DNA sequence. In this case, if the sequence 5'-TATAAA-3' on the coding strand of DNA is positioned about 30 base pairs upstream of the transcription start site, the sequence forms part of the promoter region (Lewin, Genes VI, pp. 831-835). On the other hand, if the sequence TATAAA-3' is downstream of the transcription start site in a coding region and in proper register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues (Lewin, Genes VI, pp. 213-215).
While not being held to one hypothesis, it is believed that the binding of the polyamides of this invention modulate gene expression by altering the binding of DNA binding proteins, such as RNA polymerase, WO 98/49142 PCT/US98/06997 transcription factors, TBF, TFIIIB and other proteins.
The effect on gene expression of polyamide binding to a segment of double stranded DNA is believed to be related to the function, promoter, of that segment of DNA.
It is to be understood by one skilled in the art that the improved polyamides of the present invention may bind to any of the above-described DNA sequences or any other sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 5,578,444 describes numerous promoter targeting sequences from which base pair sequences for targeting an improved polyamide of the present invention may be identified.
It is generally understood by those skilled in the art that the basic structure of DNA in a living cell includes both major and a minor groove. For the purposes of describing the present invention, the minor groove is the narrow groove of DNA as illustrated in common molecular biology references such as Lewin, Genes VI, Oxford University Press, New York (1997).
To affect gene expression in a cell, which may include causing an increase or a decrease in gene expression, a effective quantity of one or more polyamide is contacted with the cell and internalized by the cell.
The cell may be contacted in vivo or in vitro. Effective extracellular concentrations of polyamides that can modulate gene expression range from about 10 nanomolar to about 1 micromolar. Gottesfeld, et al., Nature 387 202-205 (1997). To determine effective amounts and concentrations of polyamides in vitro, a suitable number of cells is plated on tissue culture plates and various quantities of one or more polyamide are added to separate wells. Gene expression following exposure to a polyamide can be monitored in the cells or medium by detecting the amount of the protein gene product present as determined WO 98/49142 PCT/US98/06997 by various techniques utilizing specific antibodies, including ELISA and western blot. Alternatively, gene expression following exposure to a polyamide can be monitored by detecting the amount of messenger RNA present as determined by various techniques, including northern blot and RT-PCR.
Similarly, to determine effective amounts and concentrations of polyamides for in vivo administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is analyzed. Gene expression following exposure to a polyamide can be monitored by detecting the amount of the protein gene product present as determined by various techniques utilizing specific antibodies, including ELISA and western blot. Alternatively, gene expression following exposure to a polyamide can be monitored by the detecting the amount of messenger RNA present as determined by various techniques, including northern blot and RT-PCR.
The polyamides of this invention may be formulated into diagnostic and therapeutic compositions for in vivo or in vitro use. Representative methods of formulation may be found in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton, PA (1995).
For in vivo use, the polyamides may be incorporated into a physiologically acceptable pharmaceutical composition that is administered to a patient in need of treatment or an animal for medical or research purposes.
The polyamide composition comprises pharmaceutically acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in solid, liquid, gel, or aerosol form. The polyamide composition of the WO 98/49142 PCT/US98/06997 present invention may be administered in various dosage forms orally, parentally, by inhalation spray, rectally, or topically. The term parenteral as used herein includes, subcutaneous, intravenous, intramuscular, intrasternal, infusion techniques or intraperitoneally.
The selection of the precise concentration, composition, and delivery regimen is influenced by, inter alia, the specific pharmacological properties of the particular selected compound, the intended use, the nature and severity of the condition being treated or diagnosed, the age, weight, gender, physical condition and mental acuity of the intended recipient as well as the route of administration. Such considerations are within the purview of the skilled artisan. Thus, the dosage regimen may vary widely, but can be determined routinely using standard methods.
Polyamides of the present invention are also useful for detecting the presence of double stranded DNA of a specific sequence for diagnostic or preparative purposes.
The sample containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, thereby isolating DNA comprising a desired sequence.
Alternatively, polyamides linked to a suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be contacted by a sample containing double stranded DNA.
The design of bifunctional sequence specific DNA binding molecules requires the integration of two separate entities: recognition and functional activity.
Polyamides that specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence of double stranded DNA are linked to a functional molecule, providing the corresponding bifunctional conjugates useful in molecular biology, genomic WO 98/49142 PCT/US98/06997 sequencing, and human medicine. Polyamides of this invention can be conjugated to a variety of functional molecules, which can be independently chosen from but is not limited to arylboronic acids, biotins, polyhistidines comprised from about 2 to 8 amino acids, haptens to which an antibody binds, solid phase supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 4-(psoraen-8yloxy)-butyrate, tartaric acid, (+)-a-tocopheral, psoralen, EDTA, methidium, acridine, Ni(II)*Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles.
Such bifunctional polyamides are useful for DNA affinity capture, covalent DNA modification, oxidative DNA cleavage, DNA photocleavage. Such bifunctional polyamides are useful for DNA detection by providing a polyamide linked to a detectable label. DNA complexed to a labeled polyamide can then be determined using the appropriate detection system as is well known to one skilled in the art. For example, DNA associated with a polyamide linked to biotin can be detected by a streptavidin alkaline phosphatase system.
The present invention also describes a diagnostic system, preferably in kit form, for assaying for the presence of the double stranded DNA sequence bound by the polyamide of this invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body fluid samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, and preferably the amount, of the double stranded DNA sequence bound by the polyamide in the sample according to the diagnostic methods described herein.
WO 98/49142 PCT/US98/06997 The diagnostic system includes, in an amount sufficient to perform at least one assay, a specific polyamide as a separately packaged reagent. Instructions for use of the packaged reagent(s) are also typically included. As used herein, the term "package" refers to a solid matrix or material such as glass, plastic polyethylene, polypropylene or polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of the present invention. Thus, for example, a package can be a glass vial used to contain milligram quantities of a contemplated polyamide or it can be a microliter plate well to which microgram quantities of a contemplated polypamide have been operatively affixed, linked so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically include a tangible expression describing the reagent concentration or at least one assay method parameter such as the relative amounts of reagent and sample to be admixed, maintenance time periods for reagent or sample admixtures, temperature, buffer conditions and the like.
A diagnostic system of the present invention preferably also includes a detectable label and a detecting or indicating means capable of signaling the binding of the contemplated polyamide of the present invention to the target DNA sequence. As noted above, numerous detectable labels, such as biotin, and detecting or indicating means, such as enzyme-linked (direct or indirect) streptavidin, are well known in the art.
Trauger,et al. (Nature, 382: 559-561) and Swalley, et al. Am. Chem. Soc. 119: 6953-6961) have described recognition of DNA by certain polyamides at subnanomolar concentrations. Pairing specific carboxyamide groups allows for recognition of specific DNA sequences (Swalley, et al. supra). Polyamides comprising Hp, Im, WO 98/49142 PCT/US98/06997 and Py provide for coded targeting of pre-determined
DNA
sequences with high affinity and specificity. Im and Py polyamides may be combined to form Im/Py, Py/Im, Py/Py binding pairs which complement the four Watson-Crick base pairs A, C, G, and T. Table 1 illustrates such pairings.
WO 98/49142 PCT/US98/06997 TABLE 1 Pairing Codes for Base Pair Recognition* Pair G*C C*G T*A A*T Im/Py Py/Im Im/P P/Im Py/Py *favored disfavored The basic polyamide pairing rules of the prior art are insufficient for design of ligands recognizing target sites having subnanomolar binding affinities. Additional second generation rules for polyamide design are provided herein. Each additional rule alone may not be sufficient for design of polyamides with subnanomolar affinity.
However, simultaneous application of the second generation design rules provided herein allows for the construction of a number of versatile molecular templates for polyamide design.
It has been found that a hairpin polyamide synthesized from Boc-P-alanine-Pam-Resin, ImPyPy-y-PyPyPy- P-Dp binds with both enhanced affinity and specificity relative to the parent compound, ImPyPy-y-PyPyPy-Dp, which lacks the C-terminal P-alanine residue. (Optimization of the Hairpin Polyamide Design for Recognition of the Minor Groove of DNA. M. E. Parks, E. E. Baird and P. B.
Dervan, J. Am. Chem. Soc., 118, 6147 (1996).) More specifically ImPyPy-y-PyPyPy-p-Dp binds with an apparent first order association constant, Ka 3 x 108 M 1 a factor of four greater than the parent polyamide, ImPyPy- Y-PyPyPy-Dp, Ka 8 x 107 M Furthermore, ImPyPy-y-PyPyPy- P-Dp binds the target 5'-TGTTA-3' match site with WO 98/49142 PCT/US98/06997 specificity relative to a single base pair 5'-TGACA-3' mismatch site. This can be compared with the parent polyamide ImPyPy-y-PyPyPy-Dp which has a 24-fold specific binding relative to the same two DNA sites. The modest increased binding affinity of the C-terminal -alanine polyamide, may result from an additional hydrogen bond between the P-alanine carboxamide and a 'sixth' base pair of the binding site.
Three or four-ring improved polyamides of the present invention are covalently coupled to form six or eight-ring structures, respectively, that bind specifically to four or six base pair targets, respectively, at subnanomolar concentrations. As such, the improved polyamides of the present invention may be directed to any DNA sequence comprised of A, C, G, or T.
In one embodiment, the present invention comprises improved polyamides having three or four-ring polyamide structures covalently coupled to form six or eight-ring hairpin structures, respectively, of the general structures I XXVIII:
X
1
X
2
X
3
X
4 yX 5
X
6
X
7
X
8 XiX 2
X
3
X
4 Xs'X 6
X
7
X
8
X
9 XIo I II
XIX
2
X
3 pX 4
X
5
X
6
X
1 X2X 3 X4PXsX 6
X
7
X
8 III IV XlX 2
X
3
X
4
X
5
X
6
X
7
X
8
X
1
X
2
X
3 X4pX 5
X
6
X
7 V VI
XIX
2
X
3 pX 4
X
5
X
6
X
7
X
8
XIX
2
X
3
X
4 XsX 6
X
7
X
8 VII VIII XIX2X 3 X4X 5
X
6
X
7
XX
9 XIo XIX2X 3 X4X5X6yX7X8X 9
X
0 I IX X 12 IX X
XIX
2
X
3
X
4
X
5 PyX 6
X
7 XsX 9 Xio X 1
X
2
X
3 YX4PX5X 6 WO 98/49142 WO 9849142PCT/US98/06997
XI
XIX
2 f3X 3 X4YX 5
X
6 f3X 7
X
8
XII
X
1
X
2
X
3 f3X 4 XsYX 6
X
7 XsI3X 9
XIO
Xv
X
1 O3X 2
X
3
X
4
X
5 7X 6 f3X 7
X
8
X
9
X]
0
XVII
X
1
X
2
X
3
PX
4
X
5
YX
6
X
7 I3X 8
X
9 X]o xIX
X
1 P3X 2
X
3
XC
4 X5YX 6
X
7
X
8
X
9
PX
10
XXI
X
1
X
2
X
3 P3X 4
X
5
X
6 I3X 7
X
8
X
9 XX11
X
1
X
2
X
3
YX
4
X
5
X
6
PX
7
X
8
X
9 XXV1
XIX
2
X
3 yX4X5X6GX 7
X
8
X
9
XXVII
X~i
X
1
X
2 I3X 3
X
4
X
5
YX
6
X
7
X
8 I3X 9
X
10
X
1
X
2
X
3 X(4PX 5
YX
6 I3X 7
X
8
X
9
X
10
XVI
X
1
X
2
X
3 X4PX 5
YX
6
X
7 X8X 9 P3XIO
X
1 X2PX 3
X
4
X
5
YX
6 X7P3X8X 9
X
1
O
X
1 X2DX 3 X4PX 5
X
6
PX
7
XC
8
XMI
XiX 2
X
3
X
4 I3X5PYX 6
X
7
X
8
X
9
XIX
2
X
3
YX
4
X
5
X
6
PX
7 X8X 9
PX
0
X
1
IX
12
X
1
X
2
X
3
X
4
YX
5
X
6
X
7
X
8 3X 9
X
0
X
1
IX
12
XXVII
where Xl- 12 is a substituted imidazole such as Nmethylimjidazolecarboxamide or a substituted pyrrole such as N-methylpyrrolecarboxamide An improved polyamide of the present invention may also include a Cterminal aliphatic amino acid such as a P-alanine residue (P3) joined to an amide group such as dimethylaminopropylamide In addition, an improved polyamide of the present invention may further include a aliphatic amino acid such as J-alanine residue or glycine an amide group such as dime thyl ami nopropyl amide (Dp) an alcohol such as EtOH, an acid such as ethyl enediaminetetraacet ic acid (EDTA) or any derivative thereof joined to the y-axinobutyric acid residue.
WO 98/49142 PCT/US98/06997 The use of P-alanine in the synthetic methods provides aromatic/aliphatic pairing (Im/p, P/Im, Py/, and P/Py) and aliphatic/aliphatic pairing (p/p) substitution. The use of y-aminobutyric acid, or a substituted y-aminobutyric acid such as (R)-2,4 diaminobutyric acid, provides for preferred hairpin turns. Many other groups suitable for the purposes of practicing this invention are well known and widely available to one skilled in the art.
The polyamide subunit structures I-XXVIII above, and XXIX below may be covalently coupled through the y residue which represents a -NH-CH 2 -CH2-CH 2 -CONH- hairpin linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric acid. The present invention provides the reagents and methodologies for substituting the y-residue of certain polyamides with a moiety such as (R)-2,4,-diaminobutyric acid H2N).
The NMR structure of a hairpin polyamide of sequence composition ImPyPy-y-Pypypy complexed with a 5'-TGTTA-3' target site indicated that it was possible to substitute the a-position of the y-aminobutyric acid residue within the hairpin-DNA complex (de Claire, et al. J. Am. Chem.
Soc. 1997, 119, 7909). Modeling indicated that replacing the a-H of y with an amino group that may confer an Rconfiguration at the a-carbon and could be accommodated within the floor and walls of the minor groove.
A polyamide of Formulas I-XXIX may also be conjugated to a bifunctional group including but not limited to arylboronic acid, biotins, polyhistidine of 2 to 8 amino acids, hapten to which an antibody binds, solid phase support, oligodeoxynucleotide,
N-
ethylnitrosourea, fluorescein, bromoacetamide, WO 98/49142 WO 9849142PCT/US98/06997 iodoacetamide, DL-ct-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 4- (psoraen-8-yloxy) -butyrate, tartaric acid, or -c-tocopheral. Many other groups suitable for the purposes of practicing this invention are well known and widely available to one skilled in the art.
As used herein "polyamide" refers to a polymer comprising the subunits listed below:
H
2 N R 2 H2N -rIN 2 OH X KOH XN O 0 R 3 0 R 3 0
H
2 N Y-1-OH
H
2 N -)AOH
H
2 N Y- OH
R
3
R
3 Rl 3
R
3 a-amino acid P-amino acid y-amino acid where R 1 is Cl-.
10 alkyl (preferably C 1 1 alkyl such as methyl, ethyl, isopropyl), C 1 100 alkylamine (preferably C 1
I
0 alkylamine such as ethylamine) C1I-oo alkyldiamine (preferably Cl- 10 alkyldiamine such as N,N-dimethylpropylamine), Cl 1 00 alkylcarboxylate (preferably a C 1 io alkylcarboxylate such as-CH 2 COOH)
CI..
100 alkenyl (preferably Ci-jo alkenyl such as
CH
2
CH=CH
2 CI-loo alkynyl (preferably Cl-lo alkynyl such as CH 2
C=-CH
3 or C 1 10 0
L;
L includes but is not limited to an arylboronic acid, biotin, polyhistidine comprising from 2 to 8 amino acids, hapten to which an antibody binds, solid phase support, oligodeoxynucleotide, Nethylnitrosourea, fluorescein, bromoacetamide, WO 98/49142 WO 9849142PCT/US98/06997 iodoacetanide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, anthracene, anthrinilic acid, avidin, DAPI, isosuif an blue, malachite green, psoralen, ethyl red, 4-(psoralen-8-yloxy)-butyrate, tartaric acid, and -c-tocopheral; m is an integer value ranging from 0 to 12; R 2is H, NH 2 SH, Cl, Br, F, N-acetyl, or N-formyl; R3 is H, NH 2 OH, SH, Br, Cl, F, OMe, CH- 2 0H, CH 2 SH, or
CH
2
NH
2 and, X is N, CH, COH, CCH 3
CNH
2 CCl, or CF.
In a preferred embodiment, R 5 and R 6 are H.
The compounds of the present invention may comprise a compound of Formula XXIX or XXX:
XXIX
1 a b e f o where R1, R R R R R Ri, Rn, and R' are chosen independently from H, Cl, NO, N-acetyl, benzyl, C 16 WO 98/49142 WO 9849142PCT/US98/06997 alkyl, C 1 -6 alkylamine, CI6 alkyldiamine,
C-
alkylcarboxylate,
C
16 alkenyl, and C 1 -6 alkynyl; R2 is selected from the group consisting of H, NH 2 SH, Cl, Br, F, N-acetyl, and N-formyl; R 3, R d, R1 and R are selected independently from the group consisting of H, NH 2 OH, SH, Br, Cl, F, OMe, CH 2
OH,
CH
2 SH, CH 2
NH
2 R4is -NH (CH 2 o.
6
NR
5 R 6 or NH (CH 2 rCO NH (CH 2 0 6 NRSR 6 or NHR 5 or NH (CH 2 rCONHR, 5 where R 5 and R 6 are independently chosen from H, Cl, NO, N-acetyl, benzyl, C.1 6 alkyl, C 1 6 alkylamine,
C
1 -6 alkyldiamine,
C
1 -6 alkylcarboxylate,
C
1 -6 alkenyl, C 1 6 L, where L groups are independently chosen from biotin, oligodeoxynucleotide, N-ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide, DL-ct-lipoic acid, acridine, ethyl red, 4-(psoralen-8-yloxy)-butyrate, tartaric acid, (+)-ct-tocopheral, and C 1 -6 alkynyl, where r is an integer having a value ranging from 0 to 6; X, Xa ,Xb Xe ,Xf xi Xi, Xn X 0 are chosen independently from the group consisting of N, CH, COH,
CCH
3
CNN
2 CM, CF; and a, b, c, d, e, f, i, j, k, and m are integers chosen independently, having values ranging from 0 to or a pharmaceutically acceptable salt thereof.
The invention further comprises a polyamide having the formula: WO 98/49142 WO 9849142PCTIUS98/06997 mx where R 1 Ralim, and R blj,m) are chosen independently from H, C11 NO, N-acetyl, benzyl, CI-6 alkyl, C1-6 alkylamine, Cl-( alkyldiamine, Cl-c 6 alkylcarboxylate, Cj- 6 alkenyl, and C 1 -6 alkynyl; R2 is selected from the group consisting of H, NH 2 SH, Cl, Br, F, N-acetyl, and N-formyl; Rf (M)and Rc (km) are selected independently from the group consisting of Nil 2 OH, SH, Br, Cl, F, OMe, CH 2
OH,
CH
2 SH, CH 2
NH
2 R 4 is -NH(CH 2 ).-6NRSRE or NH(CH)CO NH(CH2) NH 5
,R
6 or NI{R' or NH(CH 2 )rCONHR 5 where R 5 and R 6 are independently chosen from H, Cl, NO, N-acetyl, benzyl, C 1 6 alkyl, C 1 -6 alkylamine,
C
1 -6 alkyldiamine,
C
1 -6 alkylcarboxylate, C1alkenyl, C 1 6 L, where L groups are independently chosen from biotin, oligodeoxynucleotide, N-ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide, DL-a-lipoic acid, acridine, ethyl red, 4 -(psoralen-8-yloxy)-butyrate, tartaric acid, -a-tocopheral, and C 1 6 alkynyl, where r is an integer having a value ranging from 0 to 6; WO 98/49142 PCT/US98/06997 X, Xa i and Xb(j,m) are chosen independently from the group consisting of N, CH, COH, CCH 3 CNH2, CC1, CF; and a, b, c, d, e, f, g, h, i, j, k, 1, m, n, o and p are integers chosen independently, having values ranging from 0 to or a pharmaceutically acceptable salt thereof.
Baird, et al. Am. Chem. Soc. 118: 6141-6146) and PCT/US97/003332 describe methods for synthesis of polyamides which are suitable for preparing polyamides of this invention. Polyamides of the present invention may be synthesized by solid phase methods using compounds such as Boc-protected 3-methoxypyrrole, imidazole, and pyrrole aromatic amino acids, which are cleaved from the support by aminolysis, deprotected with sodium thiophenoxide, and purified by reverse-phase HPLC. The identity and purity of the polyamides may be verified using any of a variety of analytical techniques available to one skilled in the art such as 1H-NMR, analytical HPLC, and/or matrix-assisted laser-desorption ionization time-of-flight mass spectrometry (MALDI-TOF MSmonoisotropic).
In addition, the above polyamide subunits can be synthesized in small scale by methods known in the art.
The synthesis of Boc-Py-OBt ester 7 (Grehn, L. and Ragnarsson, U. J. Org. Chem. 1981, 46, 3492.) and Boc-Im acid 11 (Grehn, et al. Acta. Chim. Scand. 1990, 44, 67.) has been previously described. Available procedures provide only milligram to gram quantities of monomer (J.
Org. Chem. 52, 3493-3500 (1987); Bailey, et al. Org.
Synth. 51, 101 (1971); Nishsiwaki, et al. Heterocycles 27, 1945 (1988). Bailey, et al. J. Pharm. Sci. 78, 910.
(1989)). while requiring difficult column chromatography and the use of toxic chlorofluorophosgene for WO 98/49142 PCT/US98/06997 introduction of the Boc group. An optimized synthesis, using inexpensive starting materials, has been developed by the present inventor allowing Boc-Py-OBt ester and Boc-Im acid monomers to be prepared on 50 g scale without the use of column chromatography. Two dimeric building blocks have also been prepared, Boc-Py-Im acid and Boc-y-Im acid.
A general method for preparation of these compounds is as follows: OzN 0 2 N *CIH 3 N BocNH BocNH OMe 2) HCI.Pd/C(0 a
OM
e 1) Boc 2 CO3 OH DCC/HOBT
N
O N 2) LiOH/MeOH 86 N N 91%I o 3 1 0 o OzN -Cl*HN BocNH N 1) H. Pd/C (l atm) -N I) BocDIEA N OMe 2) HCl/EO 2 N" O 2) LiOH/McOH N OH 0 81% 0 8s8 1 0 The polyamide polymer can be a homopolymer of Py and Im subunits or a copolymer with strategically placed aliphatic amino acid monomers such as a-amino acids (including but not limited to the naturally occurring amino acids and preferably being glycine); amino acids of the formula -NH-(CH)n-CO-, where n is an integer from 1-12 (preferably n being 1 as in P-alanine or 2 as in yaminobutyric acid).
The carboxy terminus of the polyamide may comprise
NH(CH
2 0
NR
1
R
2 or NH(CH 2 )bCO NH(CH 2 0 6
NR
1
R
2
NHR
1 or
NH(CH
2 )bCO NHR 1 where b is an integer from 1-6 (preferably 1) and R 1 and R 2 are independently chosen from C1-6 alkyl (preferably C 1 -3 alkyl such as methyl, ethyl, isopropyl), Ci-6 alkylamine (preferably C1-3 alkylamine such as ethylamine), Ci_. alkyldiamine (preferably C1- 3 alkyldiamine such as N,N-dimethylpropylamine)
C
1 -6 alkylcarboxylate (preferably a C 1 -3 alkylcarboxylate such as-CH 2 COOH), Ci-6 alkenyl (preferably C 1 -3 alkenyl such as CH 2
CH=CH
2 C1-6 alkynyl (preferably C1- 3 alkynyl such as -CH 2
C=CH
3 or a WO 98/49142 WO 9849142PCTIUS98/06997
CI
1 6 L where L includes but is not limited to biotin, oligodeoxyriucleotide,, N-ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide, DL-ct-lipoic acid, acridine, ethyl red, 4- (psoraen-8-yloxy) -butyrate, tartaric acid, (+)-at-tocopheral.
Most preferred compounds of the instant invention are polyamides core sequence composition: ImPyPyPy-y- PyPyPyPy, P yPyIMPY-Y-PyPyPYPy, IMPyPyPy-Y--IMPypypy, PyIMPyPy-Y-PYIMPyPY, IMPyIMPy-Y-PyPYPyPy, ImImPYPy-Y- PyPyPyPy, IMIMIMPy-Y-PYPYPYPy, IrnlmPYPy-Y- ImPYPYPY, IMPyPyPy-Y-IMIMPYPy, IMIMPYPy-Y-IMImPyPy, IMPyImPY-Y- IMPyIMPy, ImIMIMPy-Y- IMPYPyPyPy, ImIMIMIM-Y-PyPyPyPy, Im- J-PyPy-Y-Im-P3-PyPy, IM-j3-ImIm-Y-Py-P-PyPy, IM-P3-ImPy-Y-Imf3-IMPy, IrPYPyPyPy-Y-IMPyPyPyPy, IMImPyPyPy-Y- IMPyPYPYPy, IMPyIMPyPy-Y-ImPyPYPyPy, ImImPyIMIM-Y-PyPyPyPYPy, IMPyPyImPy-Y-IrnPyPyIMPy, ImPy-P-PyPy-Y-ImPy-V3PyPy, ImIm- I-IMIM-Y-PYPY-P-PyPY, IMPy-P-IMPy-Y-IMPY-f3-IMPy IMPy-13- PyPyPy-Y-ImPyPy-3-PyPy, ImIm-fP-PyPyPy-y-PyPyPy-V3PyPy, IMPy-P3-IMPyPy-Y-IMPyPY-P3-PYPY, ImIm-P3-PyPyPy-Y-ImImPY-3- PyPy, ImPy-]3-PYPyPy-Y-PyPYPy-3-IrnPY, IMPYPYPyPYPy-y- IMPyPyPyPyPy, ImPyPy-I 3 -PyPy-y-ImPyPy-P-Pypy, IMPYPYPY-D3- Py-y-Im-P-PyPyPyPy, ImImPyPyPyPy-Y-ImImPyPyPyPy, Im-13- PyPyPyPy-Y-Im-V3PyPyPyPy, ImPyPYPy-P-PY-Y-ImPYPyPY-P3-Py, IMPyIMPyPyPy-Y-ImPYPyPYPypy, ImPyPy-P3-PYPy-Y-IMPY-P- PyPyPy, ImPyPyPyPy-j3-Y-IMPPyPyPyp-P, IMPy-P-ImPyPy-Y-IMPyf-IMPyPy, Im-P-PyPyPyPy-Y-ImPyPyPY-P3-PY, Im-13-ImPyPyPy-7- IMPyPyPy-P3-Py, IMPyPy-P3-PyPyPy, ImIrnPy-P-PyPyPy, ImImIm-P- PyPyPy, IMPyPyPyPy-Pj- PyPyPy, ImPyPyPy- f3-PyPyPy, ImPyPY-P- PyPyPyPyPy, IMPyPyPy-P- PyPyPyPy, IMlnPypy-P~- PyPyPyPy, ImImImPy-f3-PyPyPyPy, IMPyPyPY-P-IMPYPyP, IMIMPyPy-3- IMPyPyPy, IMIMPyPyPy-P3-PyPyPyPypy, ImImImPYPY-P3- WO 98/49142 PCT/US98/06997 PyPyPyPyPy, ImIm-P-PyPy-P-PyPy-- PyPy, ImImPy- -Pypypy-p- PyPyPy, ImImPyPy-P-Py-P-PyPypypy, ImPyPy-y-ImPyPy-p- PyPyPy, ImPyPy-y-PyPyPy-P-Pypypy, PyImPy-y-ImPyPy-p- PyPyPy, PyImPy-y-ImPyPy-P-PyPyPy-P-pypypy, ImImPy-y- ImPyPy-p-PyPyPy, ImPyPy-y-ImPyPy-G-PyPyPy, ImPyPyPy-y- ImlmlmPy-P-PyPyPyPy, ImImPyPy-y-ImImPyPy-p-PyPypyPy, and ImImPyPy-y-PyPypypy-p-pypypypy.
The compounds of the following invention may be synthesized by any of several well-known and widely available techniques. Distamycin and its analogs can be produced by traditional multi-step synthetic organic chemistry (Weiss, et al. J. Am. Chem. Soc. 1957, 79, 1266; Arcamone, et al. Gazz. Chim. Ital. 1967, 97, 1097; Penco, et al Gazz. Chim. Ital. 1967, 97, 1110; Bailer, et al. Tetrahedron 1978, 34, 2389.) The repeating amide of distamycin is formed from an aromatic carboxylic acid and an aromatic amine, both of which have proven problematic for solution phase coupling reactions. The aromatic acid is often unstable to decarboxylation and the aromatic amines have been found to be air and light sensitive.
(Lown, et al. J. Org. Chem. 1985, 50, 3774.) Variable coupling yields, long reaction times (often 24 h), numerous side products, and reactive intermediates (acid chlorides and trichloroketones) are characteristic of the traditional solution phase coupling reactions. (Church, et al. Biochemistry 1990, 29, 6827. He, et al. J. Am.
Chem. Soc. 1993, 115, 7061.) The process of expanding the 2:1 polyamide-DNA motif to include longer sequences recognized by increasingly complex polyamides is demanding. For example, using previously described multi- step solution phase chemistry, the total synthesis of the hairpin polyamides ImPyPy-y-PyPyPy-Dp required more than a month's effort.
WO 98/49142 PCT/US98/06997 The chemical structures of the polyamide of the prior art ImPyPy-y-PyPyPy-Dp, and the optimized hairpin polyamide ImPyPy-y-PyPyPy-P-Dp provided by the instant invention are shown below: o I
N
H H 0 0N N)/ 1 ImPyy-y.PyPyPyDp Ka 7 x 10 7
M'
1 specificty: 24-fold H
N
0 N N NN o
N
0 0 ImPyPy-y-PPyPyPy--Dp Ka x 10 s
M'
specificty: Hereinafter hairpins may be shown as chemical structures binding to a schematic representation of the minor groove. An abbreviated representation may alternatively be used wherein, imidazole rings are represented as filled circles, pyrrole rings are represented as unfilled circles, p-alanine is represented as a diamond, Glycine is represented as a triangle, amide bonds are represented as lines, y-aminobutyric acid is represented as a curved line, and the positively charged dimthylaminopropylamide is represented with a An example of both notations is shown below for the optimized 6-ring hairpin polyamide ImPyPy-y-PyPyPy-p-Dp binding to a cognate 5'-TGTTA-3' site: WO 98/49142 PCT/US98/06997 No T HN 5'-T G T A-3' NH-
-H
N 3-A C A N o 0 ImPyPy-y-PyPyPy-3-Dp 5'-TGTTA-3'
H-
It has been shown that the Py/Py pair is approximately degenerate for recognition of A,T base pairs, affording generality with regard to targeting sequences of mixed A*T/T*A composition. (White, et al.
Biochemistry 35, 12532-12537 (1996)) To test the extent of this degeneracy, the affinity of the hairpin polyamide ImPyPy-y-PyPyPy-p-Dp was measured for eight possible five base pair 5'-TG(A,T) 3 match sites. Quantitative DNase I footprint titration experiments reveal that ImPyPy-y- PyPyPy-P-Dp binds all eight 5'-TG(A,T) 3 target sites with only a '12-fold difference in the equilibrium association constants between the strongest site, TGTTT-3' (Ka 2.1 x 108 M- 1 and the weakest site, TGAAT-3' (Ka 1.8 x 107 M- 1 (10 mM Tris.HCl, 10 mM KC1, mM MgC12, 5 mM CaC12, pH 7.0, 22 0
C).
Sites are recognized with decreasing affinity: TGTTT-3' 5'-TGTTA-3' 5'-TGTAA-3' 5'-TGTAT-3' TGATT-3' 5'-TGATA-3' 5'-TGAAA-3' 5'-TGAAT-3' as WO 98/49142 PCT/US98/06997 shown in schematic form below: G T T T-31 5'-T G T T A-3' 5'-T G T A A-3- 5'-T G T A T-3' 3'-A C A A A-5, 3'-A C A A T-5' 3'-A C A T T-5- 3'-A C A T Ka =2.1 x 10 8
M-
1 Ka =1.5x10 8
M-
1 Ka 7.3 x 10 7
M-
1 Ka 4.7 x 10 7
M-
1 G A T T-3' 5'-T G A T A-3 5-T G A A A-3' 5'-T G A A T-3' 3'-A C T A A-5' 3'-A C T A T-5' 3'-A C T T T-5 3'-AC T T Ka 3.9 x 107 M- 1 Ka 2.5 x 10 7
M-
1 Ka 2.2 x 10 7
M-
1 Ka =1.8x 10 7
M-
1 These results indicate that all sites of the form 3 are structurally compatible with polyamide- DNA complex formation. However, the affinities of ImPyPy- Y-PYPyPy-P-Dp for 5'-TG(A,T) 3 binding sites may be grouped into two sets according to sequence composition: 2 and 5'-TGA(A,T) 2 ImPyPy-y-PyPyPy-P-Dp binds 5'-TGT(A,T) 2 sites with between 2-fold and 12fold higher affinity than 5'-TGA(A,T) 2 sites. Therefore binding sites containing 5'-GT-3' steps may be preferred over those containing 5'-GA-3' steps for therapeutic targets.
These results indicate that at least a 10-fold range of binding affinities and sequence specificities will be observed for a polyamide binding to a designated set of match sites containing A*T base pairs. This relatively small range indicates that, in contrast to the Im/Py pair which may distinguish G*C from C*G and both of these from A-T/ T*A base pairs, the Py/Py pair appears not to distinguish A*T from T*A base pairs. The similarity of the polyamide binding affinities for the eight
TG(A,T)
3 match sites reflects a limit to the specificity of the hairpin polyamide binding motif.
Because G*C is distinct from C*G, the most specific recognition will be observed for GeC rich sequences.
WO 98/49142 PCT/US98/06997 In principle, individual polyamide subunits can recognize DNA with two possible binding orientations.
Recognition of 5'-TGTTA-3' by a polyamide of core sequence composition ImPyPy-y-PyPyPy places the N-terminus of each polyamide subunit at the 5'-side of each recognized DNA strand. Placement of the polyamide Nterminus at the 3' side of each recognized strand would result in targeting of a 5'-TCTTA-3' sequence. Each binding orientation represents a unique and distinguishable hairpin fold. Subunit orientation preference was not defined by the prior art, however, in order to successfully apply the pairing rules towards polyamide design, a single predictable subunit binding orientation must be preferred.
A schematic model of two possible hairpin polyamide DNA-binding orientations is shown below: G T T A-3' 3-A C A A N to C rEecognition G T T A-3' C A A C to N rrecognition It has been observed that a 30-fold (2 kcal/mol) binding-orientation preference exists for a 6-ring hairpin polyamide binding with the N-terminal end of each subunit located towards the 5'-side of the respective targeted DNA strand. The pyrrole-imidazole polyamide DNAbinding orientation preference defines a second order design rule which must be considered for successful application of the pairing rules for polyamide design.
The potential degeneracy of the Im/Py and Py/Im pairs for recognition of G*C and C*G has not been WO 98/49142 PCT/US98/06997 sufficiently addressed by the prior art. The exocyclic amine group of guanine is symmetrically placed in the floor of the minor groove, and will therefore be displayed in the same location for COG and G*C base pairs. Single mismatch binding sites described in prior art were exclusively G*C to A*T substitutions. It was unclear to those skilled in the art whether sequences which differ by a single GoC to COG substitution would be discriminated by the pyrrole-imidazole polyamide-DNA motif. The rapid design of new polyamides for elucidation complete pairing rules was aided by the discovery that the hairpin- polyamide motif is compatible with solid phase synthetic methods.
A series of four polyamides were prepared: ImPyPy-y- PyPyPy-P-Dp, ImImPy-y-PyPyPy-p-Dp, ImPyPy-y-PyImPy-P-Dp, and ImImPy-y-PyImPy-P-Dp. Each polyamide places a Py/Py, Im/Py, Py/Im, or Im/Im pair opposite either a T/A or G/C base pair in eight possible ring pairing-base pair combinations. The structure of four hairpin polyamides, which differ in the central ring pairings, are shown in Figure 2.
It was determined that Im/Py and Py/Im pairs effectively discriminate GoC from COG base pairs, respectively and that a Im/Im pairing represents an energetically unfavored pairing. Quantitative DNaseI footprinting experiments reveal energetics of the four possible pyrrole-imidazole polyamide ring pairings. Py/Py is found to bind preferably to AOT/T*A G*C/CoG, Im/Py binds GoC T*A/A.T >COG, and Im/Im does not bind GoC/CeG or AoT/ToA. A schematic representation of the eight possible ring pairing-base pair interactions. is shown below: WO 98/49142 PCT/US98/06997 7T-- G T A-3 5'-T G T A-3 Im/Py vs. G-C Im/Py vs. T-A X 108 M- 1 .5.0 X 106 M- 1 3'-A C 'GA T-5' 3'-A C A A T A-3' 5'-T G T A-3' Im/m vs. G*C Im/lm vs. T-A "s 51.0 X 106 M- 1 1.0 X 106 M- 1 3'-A C A T-5' 3'-A C A G 4T A-3' 5'-T G T A-3' Py/Im vs. G-C *0X Py/Im vs. T.A 1.0 X 106 M-1 2.5 X 106 M-1 3'-A C A T-5' 3'-A C A G T A-3' 5'-T G T A-3' Py/Py vs. G-C Py/Py vs. T-A +s 1.0 X 10 6
M-
1 1.4 X 108 M- 1 3'-A C |t A T-5' 3'-A C A These results show that G*C and C*G base pairs may be distinguished in the minor groove, while the energetic penalty for formation of an Im/Im pairing provides a basis for design of specific unlinked overlapped polyamide complexes as will become evident below.
It has been determined that the 6-ring hairpin polyamide motif provides a versatile template for recognition of a wide variety of sequences in the DNA minor groove. (Parks, et al. J. Am. Chem. Soc., 118, 6153 (1996); Szewczyk, et al. Angew. Chemie, 35, 1487-1489 (1996) Swalley, et al. J. Am. Chem. Soc. 118, 8198- 8206(1996)). Six-ring hairpin polyamides recognize their cognate sites with affinities ranging from 1 x 107 M to 1 x 108 M-1 and specificity against single base pair mismatch sites ranging from 2-fold to A schematic of nine 6-ring hairpin polyamides recognizing cognate 5 base pair sites is shown below: WO 98/49142 PCT/US98/06997 G W W W-3' 5'-W G G W W-3' 5'-W G G G W-3' 3'-W C W W W-5' 3'-W C C W W-5' 3'-W C CC 1 x 10 8
M
1 1 x 108 M 1 1 x 107M- 1 W W C W-3' 5'-W W C C W-3' 5'-W G G W W-3' 3'-W W W G W-5' 3'-W W G G W-5' 3'-W C C W 1 x 10 7 M- 1 x 10 7
M
1 1 x 105 M-1 G W C W-3' 5'-W G G C W-3' 5'-W G C W W-3' 3'-W C W G W-5' 3'-W C C G W-5' 3'-W C G W 1 x 10 8
M-
1 1 x 10 8
M
1 1 x 10 8
M-
1 The broad sequence repertoire recognized by the 6ring hairpin motif represents a significant advance in ligand design. However, no 6-ring hairpin polyamide has been identified which recognizes a target site with subnanomolar affinity.
To determine the effect of polyamide length on binding site size, binding affinity, and sequence specificity, a series of six polyamides containing three to eight rings was synthesized. (Kelly, et al. Proc.
Natl. Acad. Sci. U.S.A. 93, 6981-6985 (1996).) The series is based on ImPyPy-Dp with pyrrolecarboxamide moieties added sequentially to the C-termini to afford ImPyPyPy-Dp, ImPyPyPyPy-Dp, ImPyPyPyPyPy-Dp, ImPyPyPyPyPyPy-Dp, and ImPyPyPyPyPyPyPy-Dp which are designed to bind 5 to 10 base pair sites, respectively as side-by-side antiparellel dimers. DNA binding sites are based on a 5'-TGACA-3' core sequence and contain sequential A,T inserts in the center of the binding site that will be recognized by the additional pyrrole carboxamides. Chemical structures of the polyamides are shown in Figure 3.
WO 98/49142 PCT/US98/06997 It was determined that polyamides based on 4 or ring subunits are optimal, and that subunits must not contain more than 5 consecutive rings. Binding affinity reaches a maximum value for the five ring polyamide ImPyPyPyPy-Dp and addition of up to two additional pyrrolecarboxamides has no effect on the observed association constant (Table Furthermore, sequence specificity decreases as the length of the polyamides increases beyond five rings.
WO 98/49142 WO 9849142PCT[US98/06997 TABLE 2 Table 1* polyamide-DNA complemx association constaant specificity, G A C A-3' +)OC*1.3 x 105 M-1 3'-A C T G G T A C A-3' x10M- 5.3-fold 3'-A C A T G G T A A C A-3' X iOX M- 5.7-fold 3'-A C A T T G G T T A A C A-3' +XQ-Q.-C-Q 5.3 x iO -,2.7-fl 3'-A C A A T T G G T T A A A C A-3 *@000000>-( 4. 0 -128fl 3'-A CAATT T G T-5 G T T T A A A C A-B' <2 x 107 M-1 1-f old 3'-A C A A A T T T G T-S' Values reported are the nean values from at least threte footprint titration experiments. The assays werre performed at 22 *C pH 7.0, uin the presence of 10 mM TrisHCI, 10 mM KCl, 101DM McRC1 2 and 5mM CaC 2 t Defined as the ratio of tthe match site affinity to the! affinity of the single base pair mismatch site.
*Values reported are the mean values from at least thr footprint titration experiments. The assays we: performed at 22 0 C, pH 7.0, in the presence of 10 mM Tri~ HCl, 10 MM KCl, 10 MM MgC1 2 and 5 mM CaCl 2 3- These results, specifically the failure of an eightring polyamide to recognize a 10-base pair target site suggested that a new class of polyamides was needed was needed for extension of the 2:1 polyamide-DNA motif to sequences longer than 9 base pairs. The present invention WO 98/49142 PCT/US98/06997 provides for the replacement of a central pyrrole or imidazole amino acid with a more flexible amino acid subunit, thus allowing the antiparallel dimer to reset the register for continued gain in affinity and specificity.
To identify a flexible linker amino acid, four polyamides of the formula ImPyPy-X-PyPyPy-Dp where X Py, G (glycine), P, or y, respectively, were synthesized and their equilibrium association constants determined for 5'-TGTTAAACA-3' (9 base pair) sites. (Trauger, et al.
J. Am. Chem. Soc., 118, 6160 (1996).) The structures of polyamides based on ImPyPy and PyPyPy-Dp subunits linked by pyrrole or flexible glycine or P-alanine linkers are shown below: SN H N"N" y H' O N N H N N I 0 V H
H
ImPyPy-PyPyPyy-Dp' o N i- HI 0
O
N N Y H H 0 1 0 N N H H 1 1 N N
N
N l
H
ImPyPy-P-PyPyPy-Dp N I o It was determined that P-alanine is an optimal linker for joining polyamide subunits in an extended conformation, providing a useful structural motif for the design of new polyamides targeted to sequences longer than 7 base pairs. The P-alanine-linked compound ImPyPy- P-PyPyPy-Dp has the highest binding affinity of the four polyamides, binding the 9 bp site 5'-TGTTAAACA-3' (Ka 8 WO 98/49142 PCT/US98/06997 x 108 M 1 with affinities higher than the formally Nmethylpyrrole-linked polyamide ImPyPy-Py-PyPyPy-Dp by a factor of Solid phase synthesis involves the stepwise assembly of a molecule while one end is covalently anchored to an insoluble matrix at all stages of the synthesis.
(Merrifield, J. Am. Chem. Soc. 85, 2149-2154 (1963); Merrifield, Science 232, 341-347.) The solid phase approach has been successfully developed for a variety of proteins' (Gutte, et al. 246, 1922-1941 (1971)), oligonucleotides (Kent, S.B.H. Ann. Rev. Biochem. 57, 957-989 (1988); Caruthers, et al. Methods In Enzymology 154, 287-313 (1987); Caruthers, M.H. Acc. Chem. Res. 24, 278-284 (1991)) peptoids, (Simon, et al. Proc. Natl Acad.
Sci. 89, 9367-9371 (1992); Zuckermann, et al. J.
Am. Chem. Soc. 114 10646-10647 (1992)), oligosacharides (Science 269, 202-204 (1995); Science 260, 1307-1309 (1993)), and small non-polymeric molecules (Ellman, J.A.
Acc. Chem. Res. 29, 132-143.) General protocols have been developed for manual and machine-assisted Bocchemistry solid phase synthesis of pyrrole- imidazole polyamides. (Baird and Dervan, J. Am. Chem. Soc., 118, 6141 (1996)). More specifically, the following components were developed: a synthesis which provides large quantities of appropriately protected monomer or dimer building blocks in high purity, optimized protocols for forming an amide in high yield from a support bound aromatic amine and an aromatic carboxylic acid, (3) methods for monitoring reactions on the solid support, a stable resin linkage agent that can be cleaved in high yield upon completion of the synthesis. Solid phase synthesis protocols for pyrrole- imidazole polyamides reduce the synthetic investment from months to days.
WO 98/49142 PCT/US98/06997 A representative solid phase synthesis of a polyamide is shown in Figure 4. Polyamides containing more than 4 residues are preferably prepared by solid phase methodology. For solid phase synthesis, the polyamide is attached to an insoluble matrix by a linkage which is cleaved by a single step process which introduces a positive charge into the polyamide. The addition of an aliphatic amino acid at the C-terminus of the pyrrole- imidazole polyamides allows the use of Boc- 0-alanine-Pam-Resin resin which is commercially available in appropriate substitution levels (0.2 mmol/gram) (Mitchell, et al. J. Org. Chem. 1978, 43, 2845.) Aminolysis of the resin ester linkage provides a simple and efficient method for cleaving the polyamide from the support.
Solid phase polyamide synthesis protocols were modified from the in situ neutralization Boc-chemistry protocols recently reported by Kent and coworkers.
(Schnolzer, et al. Int. J. Peptide. Protein. Res. 1992, 40, 180; Milton, et al. Science 1992, 256,1445.) Coupling cycles are rapid, 72 min per residue for manual synthesis or 180 min per residue for machine-assisted synthesis, and require no special precautions beyond those used for ordinary solid phase peptide synthesis. The manual solid phase protocol for synthesis of pyrrole-imidazole polyamides has been adapted for use on a ABI 430A peptide synthesizer. Stepwise cleavage of a sample of resin and analysis by HPLC indicates that high stepwise yields 99%) are routinely achieved.
The large number of polyamides made available by solid phase synthetic methodology makes possible the elucidation of the rules necessary for development of polyamides which bind DNA with subnanomolar affinities.
Cleavage of the polyamide from the resin with a primary WO 98/49142 PCT/US98/06997 diamine provides a polyamide having an unmodified primary amine group. The amine group may then be modified with an activated carboxylic acid or by nucleophilic aromatic substitution to provide a bifunctional polyamide.
Standard techniques available to one skilled in the art may be used to determine the DNA binding properties of novel pyrrole-imidazole polyamides. Affinity cleaving titration experiments ((25 mM Tris-Acetate, 20 mM NaC1, 100 mM bp calf thymus DNA, pH 7, 22 0 C, 10 mM DTT, 10 mM Fe(II)) using polyamides modified with EDTA*Fe(II) at the C-terminus are used to determine oriented binding.
MPE*Fe(II) footprinting experiments (Hertzberg and Dervan, J. Am. Chem. Soc., 104, 313 (1982); Van Dyke and Dervan, Biochemistry, 22, 2373 (1983); Van Dyke and Dervan, Nucleic Acids Res., 11, 5555 (1983); Hertzberg and Dervan, Biochemistry, 23, 3934 (1984)) (25 mM Trisacetate, 10 mM NaCl, 100 AM calf thymus DNA, 5 mM DTT, pH and 22 OC) are used to determine binding site size.
Quantitative DNaseI footprinting (Brenowitz, et al.
(1986). Methods Enzymol. 130, 132-181.; Fox and Waring (1984). Nucleic Acids Res. 12, 9271-9285 Brenowitz, M., Senear, Shea, M.A. Ackers, G.K. (1986); Proc.
Natl. Acad. Sci. U.S.A. 83, 8462-8466.) (10 mM Tris-HC1, mM KC1, 10 mM MgC12, and 5mM CaC1 2 pH 7.0, 22 oC) reveals the equilibrium association constants for binding to match and mismatch sites. All footprinting experiments are performed on 3' and 5' 32 P end restriction fragments derived from plasmids. 3'-shifted cleavage patterns are consistent with location of the polyamide in the minor groove.
Tert-butoxycarbonylaminoacyl-4-(oxymethyl)-phenylacetamidomethyl-resin (PAM resin) is commercially available and cleaved in high yield by aminolysis with primary amines. (Mitchell, Kent, Engelhard, WO 98/49142 PCT/US98/06997 M. Merifield, J. Org. Chem. 43, 2845.) Insertion of a C-terminal aliphatic amino acid residue makes the hairpin- polyamide motif compatible with solid phase synthetic methods, allowing the rapid design of new polyamides. This result sets the stage for the elucidation of the limits of hairpin motif with regards to binding site size, binding affinity, and sequence specificity.
A schematic representation of the recognition of a nine base pair target site, by a polyamide containing a P-spring is shown below:
N-H
HH
N. A S-2(ImPyPy--PyPyPw-Dp) G N 5'-TGTTAAACA-3'
I
H
The binding data for ImPyPy-y-PyPyPy-Dp, which was shown previously to bind DNA in a "hairpin" conformation, indicates that y-aminobutyric acid does not effectively link polyamide subunits in an extended conformation. The discovery of -alanine as an effective linker for joining discovery of P-alanine as an effective linker for joining WO 98/49142 PCT/US98/06997 polyamide subunits in an extended conformation, provides a useful structural motif for the design of new polyamides based on subunits 5-rings targeted to sites longer than 7 bp.
At least two distinct binding modes are expected to form for the ImPyPy-X-PyPyPy-Dp polyamides described above that bind in an extended conformation. These binding modes as "slipped" and "overlapped". In the overlapped (9 base pair) binding mode, two ImPyPy-X- PyPyPy-Dp polyamides bind directly opposite one another.
The "slipped" (13 base pair) binding mode integrates the 2:1 and 1:1 polyamide-DNA binding motifs at a single site. In this binding mode, the ImPyPy moieties of two ImPyPy-X-PyPyPy-Dp polyamides bind the central 3' sequence in a 2:1 manner as in the ImPyPy homodimer, and the PyPyPy moieties of the polyamides bind to A,T flanking sequences as in the 1:1 complexes of distamycin.
A schematic model of the "slipped" and "overlapped" binding modes is shown below.
GTTAAAC A-3' 5'-A AAAAGACAAAA A-3'
GYOCO<OOCK+
3'-A C AA T TT G T-5' 3'-T T T T TC AG T T T T "overlapped" binding mode "slipped" binding mode The present invention provides 3-alanine as an optimal linker for joining polyamide subunits in a "slipped" extended conformation, providing a structural motif whereby a MW 900 polyamide recognizes a 13 basepair DNA sequence. The P-alanine-linked compound ImPyPy- P-PyPyPy-Dp binds to a 13 bp 5'-AAAAAGACAAAAA-3' site with an association constant. Ka 5 x 109 M 1 that is higher than the formally N-methylpyrrole-linked polyamide ImPyPy-Py-PyPyPy-Dp by a factor of WO 98/49142 PCT/US98/06997 As described above, y-aminobutyric acid, and preferably P-alanine, effectively link polyamides in hairpin and extended conformations, respectively. It has also been demonstrated that y-aminobutyric does not optimally link polyamide subunits in extended conformations, and that Palanine does not optimally link polyamide subunits in hairpin conformations. These results suggested that yaminobutyric acid and P-alanine could be combined within a single polyamide with predictable results. (Trauger, et al, Chem. Biol., 3, 369 (1996)).
It has been determined that the nine-ring "extended hairpin" polyamide ImPyPy-y-ImPyPy-p-PypyPy-G-Dp binds a 9-bp target site 5'-AAAAAGACA-3' at 0.05 nM concentration, an increase in affinity relative to the six-ring hairpin polyamide ImPyPy-y-ImPyPy-p-Dp of -400fold. These results provide a strategy for increasing the DNA-binding affinity of hairpin polyamides into the subnanomolar range. Furthermore, as will become evident below, many important DNA binding transcription factors such as TBP and homeodomain proteins have A,T rich consensus sequences. Extended hairpin polyamides provide a general method by which a polyamide may interfere with protein-DNA interactions by recognizing a unique sequence adjacent to certain protein binding sites. A schematic binding model of extended hairpin polyamide recognition of a 9 base pair sequence is shown below: WO 98/49142 PCT/US98/06997 \y hairpin turn G 0 AAAGAC A-3' :A 3 extend 3'-T T T T T C T G T-5' A4 A
A
A 0
N
ImPyPy-y-ImPyPy-P-PyPyPy-G-Dp Ka 2 x 1010 M- 1 Provided herein are extended hairpin polyamide motifs that provide versatile templates for recognition of a wide variety of sequences in the DNA minor groove.
Extended hairpin polyamides recognize their 9 to 13 base pair sites target site with affinities ranging from 1 x 108 M to >5 x 10 10 M and specificity against single base pair mismatch sites ranging from 5-fold to 60-fold. A schematic of nine extended hairpin polyamides containing 9 to 12 rings and recognizing 9 to 13 base pair target sites is shown in Figure Provided herein is an endonuclease protection assay to measure the rate of polyamide-DNA complex formation.
Such an assay may comprise a labeled restriction fragment comprising a polyamide binding site that overlaps a restriction endonuclease cleavage site. Cleavage by the cognate is prevented when the overlapping polyamide WO 98/49142 PCT/US98/06997 binding site is occupied by the polyamide. As a control, a second labeled DNA fragment may be that contains the restriction site, but lacks the overlapping polyamide binding site. The rate of polyamide association with its target binding site may be assessed by incubating the solutions of the polyamide with the labeled target and reference fragments for a sufficient timer period. Using the experimental conditions provided herein, the reference site is nearly completely digested, but protection at the target site is observed and can be correlated with polyamide concentration and the time of equilibration. Similarly, the dissociation rate is analyzed by adding an excess of unlabeled competitor DNA to an equilibrated solution of the labeled DNA fragments and polyamide. Addition of the competitor reduces the concentration of free polyamide to zero. The rate at with polyamide dissociation occurs from the target site on the labeled fragment can be followed by the rate of loss of protection from restriction enzyme digestion as the re-equilibration time is increased.
First generation six-ring hairpin polyamides bind DNA with association constants of approximately 1 x 108
M
1 (Figure 6) The observation that unlinked four-ring polyamides form 2:1 complexes with affinity relative to three-ring polyamides suggested an eight-ring hairpin polyamide motif for recognition of DNA at subnanomolar concentration. The present inventor has shown that two eight-ring pyrrole-imidazole polyamides differing in sequence by a single amino acid bind specifically to respective six base pair target sites which differ in sequence by a single base pair.
(Trauger, et al. Nature, 382, 559-561 (1996)). Binding is observed at subnanomolar concentrations of ligand.
WO 98/49142 PCT/US98/06997 DNA-binding affinities were determined for two eight-ring hairpin polyamides, ImPyPyPy-y-ImPyPyPy-P-Dp and ImPyPyPy-y-PyPyPyPy-p-Dp, which differ by a single amino acid, for two 6 base pair (bp) target sites, AGTACT-3' and 5'-AGTATT-3', which differ by a single base pair. Based on the pairing rules for polyamide-DNA complexes, the sites 5'-AGTACA-3' and 5'-AGTATT-3' are for ImPyPyPy-y-ImPyPyPy-p-Dp "match" and "single base pair mismatch" sites, respectively, and for polyamide ImPyPyPy-y-PyPyPyPy-P-Dp "single base pair mismatch" and "match" sites, respectively. Binding models for AGTACT-3' and 5'-AGTATT-3' in complex with ImPyPyPy-y- ImPyPyPy-p-Dp and ImPyPyPy-y-PyPyPyPy-P-Dp are shown in Figure 7.
ImPyPyPy-y-ImPyPyPy-P-Dp and ImPyPyPy-y-PyPyPyPy-P-Dp were synthesized by solid phase methods and purified by reversed phase HPLC. Equilibrium association constants for match and mismatch six base pair binding sites on a P-labeled 229 bp restriction fragment were determined by quantitative DNase I footprint titration experiments. ImPyPyPy-y-ImPyPyPy-p-Dp binds its match site 5'-AGTACT-3' at 0.03 nM concentration and its single base pair mismatch site 5'-AGTATT-3' with nearly 100-fold lower affinity. ImPyPyPy-y-PyPyPyPy-p-Dp binds its designated match site 5'-AGTATT-3' at 0.3 nM concentration and its single base pair mismatch site AGTACT-3' with nearly 10-fold lower affinity. The specificity of ImPyPyPy-y-ImPyPyPy-p-Dp and ImPyPyPy-y- PyPyPyPy-P-Dp for their respective match sites results from very small structural changes. Replacing a single nitrogen atom in ImPyPyPy-y-ImPyPyPy-p-Dp with C-H reduces the affinity of the polyamide*5'-AGTACT-3' complex by WO 98/49142 PCT/US98/06997 representing a free energy difference of kcal/mole. Similarly, replacing a C-H in ImPyPyPy-y- PyPyPyPy-P-Dp with N reduces the affinity of the polyamide*5'-AGTATT-3' complex -10-fold, a loss in binding energy of -1.3 kcal/mol.
These results show that using a simple molecular shape and a two letter aromatic amino acid code, pyrroleimidazole polyamides can achieve affinities and specificities comparable to DNA-binding proteins. It remained to be determined if additional motifs could be discovered to provide polyamides with subnanomolar binding affinities.
It has been suggested that pyrrole-imidazole polyamides would bind G/C rich sequences with low binding affinity due to steric hindrance with the exocyclic amines of the guanine bases. It has also been noted that the lower negative electrostatic potential of a G/C rich minor groove relative to an A,T rich minor groove might prohibit high affinity binding. (Pullman, et al.
Quarterly Reviews of Biophysics. (1981) 14, 289-380; Pullman, B. Advances in Drug Research. (1989) 18, 1- 113. Manning, G.S. Q. Rev. of Biophysics. (1978) 11, 179-246; Honig and Nicholls. Science (1995) 268, 1144.) It has been found that an 8-ring hairpin polyamide can recognize a G/C rich target sequence with subnanomolar affinity.
Schematic binding models of eight-ring hairpin polyamides designed for recognition of
(G/C)
4 sequences.
G G C C W-3' 5'-W G C G C W-3' 5'-W G G G G W-3' 3'-W C C G G W-5' 3'-W C G C G W-5' 3'-W CC C C Ka 1 x 10
M-
1 K 4 x 107 M'1 K 3 x 107 M-1 WO 98/49142 PCT/US98/06997 To examine whether a core sequence of purely G,C base pairs could be recognized with high affinity and specificity, three eight-ring hairpin polyamides which differ only by the arrangement of pyrrole and imidazole amino acids, ImImPyPy-y-ImImPyPy-P-Dp, ImPyImPy-y- ImPyImPy-P-Dp, and ImImImIm-y-PyPyPyPy-p-Dp were designed for recognition of three core sequences consisting of solely G,C base pairs. DNase I footprint titrations allow the determination of equilibrium association constants (Ka) for each polyamide. ImImPyPy-y-ImImPyPy-P- Dp binds the match site 5"-TGGCCA-3' with an equilibrium association constant of Ka 1 x 1010 M 1 (10 mM Tris.HCl, mM KC1, 10 mM MgC12 and 5 mM CaC12, pH 7.0 and 22 0
C).
The two designed double base pair mismatch sequences, TGCGCA-3' and 5'-TGGGGA-3', are bound with at least 200fold reduced affinity. ImPyImPy-y-ImPyImPy-p-Dp binds the site 5'-TGCGCA-3' with a Ka 4 x 107 M 1 with 4-fold specificity, and ImImImIm-y-PyPyPyPy-P-Dp binds the site 5'-TGGGGA-3' with a Ka 3 x 107 M with 6-fold specificity.
These results indicate that the positioning of the Im amino acids have a profound effect on the binding affinities of pyrrole-imidazole polyamides. More specifically these results indicate that binding affinity could be restored by the design of hairpin polyamides where a pyrrole ring has been substituted by more flexible spacer amino acid such as p-alanine.
It has been found that replacement of a pyrrole residue with a p-alanine spacer residue in each subunit of ImPyImPy-y-ImPyImPy-p-Dp provides an eight residue hairpin polyamide, Im-P-ImPy-y-Im-P-ImPy-P-Dp, which recognizes 5'-TGCGCA-3' sequences with subnanomolar affinities.
WO 98/49142 PCT/US98/06997 Structures and schematic binding models for the eight ring hairpin polyamide ImPyImPy-y-ImPyImPy-P-Dp and the eight residue hairpin polyamide Im-P-ImPy-y-Im-P-ImPy- P-Dp are shown in Figure 8.
It has been found that the four ring hairpin polyamide motif provides a versatile template for recognition of a wide variety of sequences in the DNA minor groove. Eight ring and residue hairpin polyamides recognize 6 base pair target sites with affinities ranging from 1 x 10 7
M
1 to >1 x 010 M' and specificity against single base pair mismatch sites ranging from 2fold to 100-fold. A schematic of fifteen 8-residue hairpin polyamides recognizing 6 base pair target sites is shown in Figure 9.
First generation fully overlapped P-linked polyamides based on three ring subunits bind DNA with association constants of approximately 8 x 108 M 1 The observation that unlinked four-ring polyamides form 2:1 complexes with 70-fold-higher affinity relative to threering polyamides suggested a fully overlapped 8-ring 4-P-4 polyamide motif for recognition of 11 base pairs of DNA at subnanomolar concentration. The chemical structures of three 4-P-4 "polyamides are shown in Figure It has been found that three eight ring 4-P-4 pyrrole-imidazole polyamide, ImImImPy-P-PyPyPyPy-P-Dp, ImImPyPy-P-PyPyPyPy-P-Dp and ImPyPyPy-P-PyPyPyPy-P-Dp specifically recognize targeted 5'-AGGGATTCCCT-3', AGGTATTATCCT-3' and 5'-AGTAATTTACT-3' sites, respectively. DNase I footprint titrations reveal that each polyamide binds its respective target site at subnanomolar concentrations with equilibrium association constants ranging from Ka 7 x 109 M 1 to Ka 5 x 1010 M" WO 98/49142 PCT/US98/06997 1 and with 7 to 30-fold specificity over double base pair mismatch sites.
The ability of 3-P-3 and 4-p-4 polyamides to recognize both "slipped" and "overlapped" complexes for recognition of two separate classes of target sites represents a limit to the sequence specificity of the 3extended polyamide motif. The discovery that a Im/Im polyamide pairing is disfavored, suggests that the 4-p-4 polyamide ImImImPy-P-PyPyPyPy-P-Dp should bind preferentially in the fully overlapped polyamide motif. A schematic representation of the recognition of three targeted DNA sites by three 4-P-4 polyamides is shown in Figure 11.
The 4 -y-4 polyamide ImImImPy-y-PyPyPyPy-p-Dp binds a 5'-AGGGAA-3' target site in a hairpin conformation with an association constant of Ka 4 x 108. The 4-y-4 polyamide ImImImPy-y-PyPyPyPy-p-Dp is related to the 4-p-4 polyamide ImImImPy-P-PPyPyyPy-P-Dp by deletion of a single methylene unit (MW 14) from the linker region.
The y and P linkers specificity turn and extended binding respectively and enlarge targeted binding site size from 6 to 11 base pairs, resulting in a 2.1 kcal/mol enhancement in binding energy. These results, the specific recognition of a G,C-rich 11 base pair sequence, represent a significant advance in the development of general DNA-binding that can recognize a single site in the human genome.
It has been determined that there exists at least a preference for placement of a P/p pair opposite an A*T or T*A base pair relative to a G*C or C*G base pair. Quantitative DNase I footprint titration experiments reveal that ImImImPy-P-PyPyPyPy-P-Dp binds the designed match site 5'-AGGGAATCCCT-3' with an WO 98/49142 PCT/US98/06997 equilibrium association constant of Ka 1.4 x 1010 M 1 and the single base pair p/P mismatch sequence AGGGAGTCCCT-3' with at least 20-fold lower affinity (Ka 6.9 x 108 These results implicate the P /P combination as both a flexible spacer unit and a sequence-specific DNA binding element. The specificity of the P/P pairing reveals an additional pairing rule pivotal to the design of polyamides for recognition of longer binding sites. A schematic model of placement of the P/P pair opposite G,C or A,T base pairs is shown in Figure 12.
It has been found that the extended, fully overlapped polyamide-DNA motif, provides a versatile template for recognition of symmetric sequences containing from 9 to 13 base pairs in the minor groove.
Equilibrium association constants for cooperative complex formation range from Ka 1 x 107 M 1 to Ka 1 x 10 11
M
1 Specificities have been found to range from 2-fold to for discrimination of single base pair mismatch sites. A schematic representation of several p-linked fully overlapped polyamide complexes is shown in Figure 13.
To further expand the targetable binding site size and sequence repertoire available to the hairpin polyamide motif, two polyamides containing either two or three Im amino acid residues, ImPyPyPyPy-y-ImPyPyPyPy-P-Dp and ImImPyPyPy-y-ImPyPyPyPy-p-Dp, were prepared by solid phase synthetic methodology and their DNA binding properties analyzed. The structures of two hairpin polyamides are shown in Figure 14.
It has been shown that that ImPyPyPyPy-y-ImPyPyPyPy- P-Dp binds the formal 7 bp match sequence 5'-TGTAACA-3' with an equilibrium association constant (Ka) of Ka 1.2 WO 98/49142 PCT/US98/06997 x 1010 M 1 and the single base pair mismatch sequence TGGACA-3' with Ka 6.8 x 10 8
M
1 (10 mM Tris*HC1, 10 mM KC1, 10 mM MgC1 2 5 mM CaC1 2 pH 7.0, 22 oC) ImImPyPyPy-y-ImPyPyPyPy-P-Dp, which differs from ImPyPyPyPy-y-ImPyPyPyPy-p-Dp by a single amino acid substitution binds its formal match sequence 3' with an equilibrium association constant of Ka 3.6 x 109 M" and its corresponding single base pair mismatch sequence 5'-TGTAACA-3' with Ka 1 x 107 M 1 The replacement of a single electron lone-pair with a hydrogen atom within a -1500 MW polyamide is found to modulate affinity and specificity by more than an order of magnitude. Sequence-specific recognition of a 7 bp target site by a ten-ring hairpin polyamide at subnanomolar concentration expands the effective targetable sequence repertoire of the pyrrole-imidazole polyamide-DNA motif.
A schematic model of two 10-ring hairpin polyamides recognizing match and mismatch 7 base pair sequences is shown below: G T A A C A-3' 5'-T G A A C A-3' 3'-A C A T T G T-5' 3'-A C A T T G Ka 1 x10 1 0 Ka 1x10 7 G A A C A-3' 5'-T G G A A C A-3' 3'-A C C T T G T-5' 3'-A C C T T G Ka 7 x 10 8 Ka 4 x 10 9 The specificity of that ImPyPyPyPy-y-ImPyPyPyPy-P-Dp and that ImImPyPyPy-y-ImPyPyPyPy-P-Dp for their respective match sites results from very small structural changes.
Replacing a single C-H in that ImPyPyPyPy-y-ImPyPyPyPy-P- WO 98/49142 PCT/US98/06997 Dp with a nitrogen atom as in that ImImPyPyPy-y- ImPyPyPyPy-p-Dp reduces the affinity of the ImImPyPyPy-y- ImPyPyPyPy-p-Dpe5'-TGTAACA-3' complex relative to the ImPyPyPyPy-y-ImPyPyPyPy-p-Dp*5'-TGTAACA-3' complex by 300-fold, a free energy difference of at least 4 kcal/mol. Similarly, replacing a N in that ImlmPyPyPy-y- ImPyPyPyPy-p-Dp with a C-H as in that ImPyPyPyPy-y- ImPyPyPyPy-P-Dp, reduces the affinity of the ImPyPyPyPy-y- ImPyPyPyPy--Dp.5'-TGGAACA-3' complex relative to the ImImPyPyPy-y-ImPyPyPyPy-p-Dp5' -TGGAACA-3' complex by a factor of 5-fold, a loss in binding energy of -1 kcal/mol. The reduced overall specificity and binding affinity of that ImImPyPyPy-y-ImPyPyPyPy-P-Dp relative to that ImPyPyPyPy-y-ImPyPyPyPy-P-Dp most likely results from the presence of a 5'-GA-3' step in the designated target site.
A polyamide, ImPy-P-ImPy-y-ImPy-P-ImPy-P-Dp, based on P-alanine linked 2-ring subunits was prepared to target a seven basepair region adjacent to a binding site for the transcription factor TBP in a conserved HIV gene-promoter sequence. The polyamide was designed based on the pairing rules described herein, and was found to recognize its designated 5'-TGCTGCA-3' target sequence with a binding affinity of Ka 3.6 x 109 M 1 An isomeric mismatched polyamide, ImPy-P-ImPy-y-ImPy-P-ImPy-P-Dp, which differs only by the position of the Py and Im amino acids within the 2-P-2-y-2-P-2 molecular template binds the targeted 5-TGCTGCA-3' sequence with 100-fold reduced affinity. A schematic representation of a polyamide and a control polyamide which are molecular isomers, yet discriminate a 7 -base pair sequence of an HIV gene WO 98/49142 PCT/US98/06997 promoter with a 100-fold specificity is shown in Figure These results reveal that hairpin polyamides based on 5-ring subunits provide a useful structural motif for the recognition of 7 bp binding sites at subnanomolar concentrations. For targeting 5'-WGWWWCW-3' sequences a polyamide, ImPyPyPyPy-y-ImPyPyPyPy-p-Dp based on 2 y-aminobutyric acid linked 5-ring subunits is preferred over the corresponding P-substituted, 2-p-2-y-2-P-2 polyamide ImPy-P-PyPy-y-ImPy-P-PyPy-P-Dp. For targeting 5'-WGCWGCW-3' and 5'-WGGWGGW-3' sequences the respective P-substituted, 2 -p-2-y-2-p-2 polyamides ImPy-P-ImPy-y- ImPy-P-ImPy-P-Dp and ImIm-P-ImIm-y-PyPy-p-PyPy-P-Dp are preferred over the respective 5-y-5 polyamides, ImPyPyImPy-y-ImPyPyImPy-P-Dp and ImImPyImIm-y-PyPypypypy- P-Dp based on y-aminobutyric acid linked 5-ring subunits.
A series of hairpin polyamides which recognize 7 base pair target sites are shown in Figure 16.
The present inventor has discovered that a p/p pairing is preferred to a Py/P pairing for extension of the targetable binding site size of the hairpin polyamide motif. Three "12-ring hairpin" polyamides, ImPyPyPyPyPyy-ImPyPyPyPyPy-p-Dp, ImPyPy-P-PyPy-y-ImPyPy-p-PyPy-P-Dp and ImPy-p-PyPyPy-y-ImPyPy-p-PyPy-p-Dp were synthesized by solid phase synthetic methodology.
TABLE 3 WO 98/49142 PCT/US98/06997 Mismatch Mismatch Specificity G T T A A C A-3' 5'-T GT A A C A-3' 6-fold 3'-A C A A T T G T-5' 3'-A C A C T T G T-5' Ka 2.5 x 10 9 Ka= 3.9 x 10 8 G T T A A C A-3' 5'-T GT A A C A-3' ,0 i6 6-fold 3'-A C A A T T G T-5' 3'-A C A C T T Gt T-5' Ka= 2.9 x 10 9 Ka= 4.5 x 10 9 G T T A A C A-3' 5'-TG T A C A-3' .0 3'-A C AA T T G T-5' 3'-A CA CT T G T-5' Ka= 1.2 x 10" 1 Ka 2.2 x 109 DNase I footprint titrations reveal that the hairpin polyamide based on 6-ring subunits, ImPyPyPyPyPy-y- ImPyPyPyPyPy-3-Dp, binds the formal 8 bp match sequence 5'-TGTTAACA-3' with an equilibrium association constant (Ka) of Ka 4 x 109 M- and the single base pair mismatch sequence 5'-TGTGAACA-3' with Ka 2 x 108 M 2 ImPyPy-P- PyPy-y-ImPyPy-P-PyPy-P-Dp which differs from ImPyPyPyPyPyy-ImPyPyPyPyPy-p-Dp by substitution of two flexible aliphatic amino acid residues for two pyrrole rings, binds a 5'-TGTTAACA-3' match site Ka 2 x 1010 M1 and a 5'-TGTGAACA-3' mismatch with Ka 1 x 109 M1. ImPy-P- PyPyPy-y-ImPyPy-P-PyPy-P-Dp binds a 5'-TGTTAACA-3' match site with an equilibrium association constant of Ka 1 x 101 and a single base pair mismatch sequence 3' with Ka 1 x 109. (Table II).These results expand the targetable binding site size accessible to the hairpin polyamide motif to 8 base pairs.
WO 98/49142 PCT/US98/06997 P/P pairing within the hairpin polyamide motif as shown below completely abolishes DNA-binding: G T T A AC A-3' 3'-A C AA T T G No Specific Binding The present inventor has found that a paired p/p substituted hairpin motif allows specific targeting of sequences of the form 5'-WGWGWWCW-3'. Substitution of a P/P pair for the second pyrrole pairing of a 12-ring hairpin polyamide, provides polyamides which target a wide variety of 8 base pair sequences of mixed sequence composition. Sequences are bound with subnanomolar affinity and 50-100 fold specificity versus single base pair mismatch sites as shown in Table 4.
TABLE 4 WO 98/49142 WO 9849142PCTIUS98/06997 Match Mismatch Specificity G TT A ACA- 3 G T AAC -3' 3- AC A A TT GT-5' 3'-A CA CT TGT-5'8fl Ka =3 x10 9 Ka =4 x10 8 GTG AA CA-3' 5'-T G TWAAC A-3' 8fl 3'-A CA A T TGT-5' 3'-A C AC T Ka=3X xi10 Ka =4 x10 8 TGTT ACA- 5 3'-A CA A T TGT-5 3'-A C ACT T Ka =5X xi10 Ka <1IX10 8 G TGA A CA-3 5 GT EA C A3' 3'-A CA CT T GT-5 3'-A CA AT T Ka1iX 10 1 0 Ka <IX 10 8 The following examples illustrate particular embodiments of the present invention and are not limiting of the specification and claims in any way.
WO 98/49142 PCT/US98/06997
EXAMPLES
Synthesis of Polyamides A. Materials Boc-P-alanine-(-4-carboxamidomethyl)-benzyl-estercopoly(styrene-divinylbenzene) resin (Boc-p-Pam-Resin), dicyclohexylcarbodiimide (DCC), hydroxybenzotriazole (HOBt), 2-(1H-benzotriazole-1-yl)-1,1,3,3tetramethyluronium hexa-fluorophosphate (HBTU), Bocglycine, and Boc-p-alanine were purchased from Peptides International. N,N-diisopropylethylamine (DIEA), N,Ndimethylformamide (DMF), N-methylpyrrolidone (NMP), and DMSO/NMP were purchased from Applied Biosystems. Boc-yaminobutyric acid was from NOVA Biochem, dichloromethane (DCM) and triethylamine (TEA) was reagent grade from EM, thiophenol (PhSH), dimethylaminopropylamine, trichloroacetyl chloride, N-methylpyrrole, and Nmethylimidazole from Aldrich, and trifluoroacetic acid (TFA) from Halocarbon. All reagents were used without further purification.
1H NMR were recorded on a GE 300 instrument operating at 300 MHz. Chemical shifts are reported in ppm relative to the solvent residual signal. UV spectra were measured on a Hewlett-Packard Model 8452A diode array spectrophotometer. IR spectra were recorded on a Perkin- Elmer FTIR spectrometer. High- resolution FAB mass spectra were recorded at the Mass Spectroscopy Laboratory at the University of California, Riverside. Matrix- assisted, laser desorption/ionization time of flight mass spectrometry was carried out at the Protein and Peptide Microanalytical Facility at the California Institute of Technology. HPLC analysis was performed either on a HP WO 98/49142 PCT/US98/06997 1090M analytical HPLC or a Beckman Gold system using a RAINEN C18, Microsorb MV, 5gm, 300 x 4.6 mm reversed phase column in 0.1% (wt/v) TFA with acetonitrile as eluent and a flow rate of 1.0 mL/min, gradient elution 1.25% acetonitrile/min. Preparatory HPLC was carried out on a Beckman HPLC using a Waters DeltaPak 25 x 100 mm 100pm C 18 column equipped with a guard, 0.1% (wt/v) TFA, 0.25% acetonitrile/min. 18MQ water was obtained from a Millipore MilliQ water purification system, and all buffers were 0.2gm filtered. Thin- layer chromatography (TLC) was performed on silica gel 60 F 2 54 precoated plates. Reagentgrade chemicals were used unless otherwise stated.
B. SYNTHESIS OF BOC-PROTECTED PYRROLE AND IMIDAZOLE
MONOMER
1. 4 -Nitro- 2 -trichloroacetyl-l-methylpyrrole:
O
2
N
CC
1 3
N
0
O
To a well stirred solution of trichloroacetyl chloride (1 kg, 5.5 mole) in 1.5 liter ethyl ether in a 12 liter flask was added dropwise over a period of 3 h a solution of N-methylpyrrole (0.45 kg, 5.5 mole) in 1.5 liter anhydrous ethyl ether. The reaction was stirred for an additional 3 hours and quenched by the dropwise addition of a solution of 400 g potassium carbonate in 1.5 liters water. The layers were separated and the ether layer concentrated in vacuo to provide 2 -(trichloroacetyl)pyrrole (1.2 kg, 5.1 mol) as a yellow crystalline solid sufficiently pure to be WO 98/49142 PCT/US98/06997 used without further purification. To a cooled (-40 0
C)
solution of 2-(trichloroacetyl) pyrrole (1.2 kg, 5.1 mol) in acetic anhydride (6 L) in a 12 L flask equipped with a mechanical stirrer was added 440 mL fuming nitric acid over a period of 1 hour while maintaining a temperature of 0 The reaction was carefully allowed to warm to room temperature and stir an additional 4 h. The mixture was cooled to -30 and isopropyl alcohol (6 L) added. The solution was stirred at -20 oC for 30 min during which time a white precipitate forms. The solution was allowed to stand for 15 min and the resulting precipitate collected by vacuum filtration to provide 4-Nitro-2-trichloroacetyl-lmethylpyrrole. (0.8 kg, 54 yield) TLC (7:2 benzene/ethyl acetate) Rf 0.7; 1 H NMR (DMSO-d 6 5 8.55 1 H, J 1.7 Hz), 7.77 1 H, J 1.7 Hz), 3.98 3 H) 1C NMR (DMSO-d 6 6 173.3, 134.7, 133.2, 121.1, 116.9, 95.0, 51.5; IR (KBr) 1694, 1516, 1423, 1314, 1183, 1113, 998, 750.
FABMS m/e 269.936 (M H 269.937 calc. for C 7 HsN 2 0 3 C1 3 2. Methyl 4-nitropyrrole-2-carboxylate: 0 2
N
N
To a solution of 4-Nitro-2-trichloroacetyl-lmethylpyrrole (800 g, 2.9 mol) in 2.5 L methanol in a 4 L Erlenmeyer flask equipped with a mechanical stirrer was added dropwise a solution of NaH (60% dispersion in oil) 0.25 mol) in 500 mL methanol. The reaction was stirred 2 h. at room temperature, and quenched by the addition of conc. sulfuric acid (25 mL). The reaction was WO 98/49142 PCT/US98/06997 then heated to reflux, allowed to slowly cool to room temperature as methyl 4 -nitropyrrole-2-carboxylate crystallizes as white needles which were collected by vacuum filtration and dried in vacuo. (450 g, 47% yield) TLC (ethyl acetate) Rf 0.8; 1H NMR (DMSO-d 6 6 8.22 1 H, J 1.7 Hz), 7.22 1 H, J 1.6 Hz), 3.88 3 3.75 3 1 C NMR (DMSO-d 6 6 37.8, 52.2, 112.0, 123.0, 129.9, 134.6, 160.3; IR(KBr) 3148, 1718, 1541, 1425, 1317, 1226, 1195, 1116, 753. FABMS m/e 184.048 (M H 184.048 calc. for C 7
H
8
N
2 0 4 3. Methyl 4 -amino-l-methyl-pyrrole-2-carboxylate hydrochloride
HCI-H
2
N
N
Methyl-4-nitropyrrole-2-carboxylate (450g, 2.8 mol) was dissolved in ethyl acetate (8 A slurry of 40 g of Pd/C in 800 mL ethyl acetate was then added and the mixture stirred under a slight positive pressure of hydrogen 1.1 atm) for 48 h. Pd/C was removed by filtration through Celite, washed 1 x 50 mL ethyl acetate, and the volume of the mixture reduced to c.a. 500 mL. 7 L of cold ethyl ether was added and HC1 gas gently bubbled through the mixture. The precipitated amine hydrochloride was then collected by vacuum filtration to yield (380 g, 81.6 of Methyl 4 -amino-l-methyl-pyrrole-2-carboxylate hydrochloride. as a white powder. TLC (ethyl acetate) Rf(amine) 0.6, Rf salt 1 H NMR (DMSO-d 6 6 10.23 (br s, 3H), 7.24 1H J 6.79 1H, J 3.83 WO 98/49142 PCT/US98/06997 3H), 3.72 3H) 13C NMR (DMSO-d 6 6 160.8, 124.3, 121.2, 113.4, 112.0, 51.8, 37.1; IR (KBr) 3095, 2693, 1709, 1548, 1448, 1266, 1102, 802, 751. FABMS m/e 154.075 (154.074 calc. for C 7 HIoN 2 0 2 4. 4-[(tert-Butoxycarbonyl)amino]-l-methylpyrrole-2carboxylic acid BocHN
OH
N
The hydrochloride salt of the pyrrole amine Methyl 4amino-l-methyl-pyrrole-2-carboxylate hydrochloride (340 g, 1.8 mol) was dissolved in 1 L of 10% aqueous sodium carbonate in a 3 L flask equipped with a mechanical stirrer, di-t-butyldicarbonate (400 g, 2.0 mmol) slurried in 500 mL of dioxane was added over a period of thirty min maintaining a temperature of 20 oC. The reaction was allowed to proceed for three h and was determined complete by TLC, cooled to 5 0 C for 2 h and the resulting white precipitate collected by vacuum filtration. The Boc-pyrrole ester contaminated with Boc-anhydride was dissolved in 700 mL MeOH, 700 mL of 2M NaOH was added and the solution heated at 60 OC for 6 h. The reaction was cooled to room temperature, washed with ethyl ether (4 x 1000 mL), the pH of the aqueous layer reduced to c.a. 3 with 10% H 2
SO
4 and extracted with ethyl acetate (4 x 2000 mL). The combined ethyl acetate extracts were dried (sodium sulfate) and concentrated in vacuo to provide a tan foam. The foam was dissolved in 500 mL of DCM and 2 L petroleum ether added, the resulting slurry was concentrated in vacuo. The WO 98/49142 PCT/US98/06997 reaction was redissolved and concentrated three additional times to provide (320 g, 78 yield) of 4- [(tert-Butoxycarbonyl)amino]-l-methylpyrrole-2-carboxylic acid as a fine white powder. TLC (7:2 benzene/ethyl acetate v/v) Rf (ester) 0.8, Rf (acid) 0.1. (ethyl acetate), Rf (acid) 0.6, 1 H NMR (DMSO-d 6 8 12.10 1H), 9.05 1H) 7.02 1H) 6.55 1H) 3.75 3H) 1.41 9H) 13C NMR (DMSO-d 6 6 162.4, 153.2, 123.3, 120.1, 119.2, 107.9, 78.9, 36.6, 28.7.; IR(KBr) 3350, 2978, 1700, 1670, 1586, 1458, 1368, 1247, 1112, 887, 779.
FABMS m/e 241.119 (M H 241.119 calc. for C 1
H
17
N
2 0 4 1,2,3-Benzotriazol-l-yl 4-[(tertbutoxycarbonyl)-amino]-l-methylpyrrole-2carboxylate
H
O N N Boc-Py-acid, 4-[(tert-Butoxycarbonyl)amino]-1methylpyrrole-2-carboxylic acid (31 g, 129 mmol) was dissolved in 500 mL DMF, HOBt (17.4 g, 129 mmol) was added followed by DCC (34 g, 129 mmol). The reaction was stirred for 24 h and then filtered dropwise into a well stirred solution of 5 L of ice water. The precipitate was allowed to sit for 15 min at 0 oC and then collected by filtration. The wet cake was dissolved in 500 mL DCM, and the organic layer added slowly to a stirred solution of cold petroleum ether (4 OC). The mixture was allowed to stand at -20 °C for 4 h and then collected by vacuum filtration and dried in vacuo to provide (39 g, yield) of 1,2,3- WO 98/49142 PCT/US98/06997 Benzotriazol-1-yl 4-[(tert-butoxycarbonyl)-amino]-1methylpyrrole-2-carboxylate as a finely divided white powder. TLC (7:2 benzene/ ethyl acetate v/v) Rf 0.6, 1H NMR (DMSO-d 6 6 9.43 1H), 8.12 1H, J 8.4 Hz), 7.80 1H, J 8.2Hz), 7.64 1H, J 7.0 Hz), 7.51 (m, 2H), 7.18 1H), 3.83 3H), 1.45 9H) 1C NMR (DMSO-d 6 6 156.5, 153.3, 143.2, 129.6, 129.2, 125.7, 125.2, 124.6, 120.3, 112.8, 110.3, 109.8, 79.5, 36.8, 28.6.; IR (KBr) 3246, 3095, 2979, 1764, 1711, 1588, 1389, 1365, 1274, 1227, 1160, 1101, 999, 824, 748.; FABMS m/e 358.152 (M H 358.151 calc. for C7H 20 NsO4) 6. Ethyl l-methylimidazole-2-carboxylate.
O
N
1 0 N-methylimidazole (320 g, 3.9 mol) was combined with 2 L acetonitrile and 1 L triethylamine in a 12 L flask equipped with a mechanical stirrer and the solution cooled to -20 OC. Ethyl chloroformate (1000 g, 9.2 mol) was added with stirring, keeping the temperature between -20 oC and 25 oC. The reaction was allowed to slowly warm to room temperature and stir for 36 h. Precipitated triethylamine hydrochloride was removed by filtration and the solution concentrated in vacuo. at 65 OC. The resulting oil was purified by distillation under reduced pressure (2 torr, 102 oC) to provide Ethyl 1-methylimidazole-2-carboxylate as a white solid (360 g, 82 yield). TLC (7:2 benzene/ ethyl acetate v/v) Rf 0.2; H NMR (DMSO-d 6 5 7.44 1 H, J 2.8 Hz), 7.04 1 H, J 2.8 Hz), 4.26 2 H, J WO 98/49142 PCT/US98/06997 Hz), 3.91 3 1.26 3 H, J 3.5 Hz); 13C NMR (DMSO-d 6 6 159.3, 129.1, 127.7, 61.0, 36.0, 14.5; IR(KBr) 3403, 3111, 2983, 1713, 1480, 1422, 1262, 1134, 1052, 922, 782, 666; FABMS m/e 155.083 (M H 155.083 calc. for C-HlN 2 0 2 7. Ethyl l-methyl-4-nitroimidazole-2-carboxylate: 0 2
N
N
Ethyl 1-methylimidazole-2-carboxylate was carefully dissolved in 1000 mL of concentrated sulfuric acid cooled to 0°C. 90% nitric acid (1 L) was slowly added maintaining a temperature of 0 OC. The reaction was then refluxed with an efficient condenser (-20 oC) in a well ventilated hood for 50 min. The reaction was cooled with an ice bath, and quenched by pouring onto 10 L ice. The resulting blue solution was then extracted with 20 L DCM, the combined extracts dried (sodium sulfate) and concentrated in vacuo to yield a tan solid which was recrystallized from 22 L of 21:1 carbon tetrachloride/ethanol. The resulting white crystals are collected by vacuum filtration to provide pure Ethyl l-methyl-4-nitroimidazole-2-carboxylate. (103 g, 22% yield). TLC (7:2 benzene/ ethyl acetate v/v) Rf 0.5, 1H NMR (DMSO-d 6 8 8.61 1 4.33 2 H, J 6.4 Hz), 3.97 3 1.29 3 H, J 6.0 Hz), "C NMR (DMSO-d 6 6 158.2, 145.4, 135.3, 127.4, 62.2, 37.3, 14.5; IR(KBr) 3139, 1719, 1541, 1498, 1381, 1310, 1260, 1122, 995, 860, 656.; FABMS m/e 200.066 (M +H 200.067 calc. for C 7
H,
0
N
3 0 4 WO 98/49142 PCT/US98/06997 8. Ethyl 4 -amino-l-methylimidazole-2-carboxylate hydrochloride
HCIH
2 N
N
N
o The nitro imidazole ethyl ester Ethyl l-methyl-4nitroimidazole-2-carboxylate (103g, 520 mmol) was dissolved in 5 L of 1:1 ethanol/ethyl acetate, 20g 10% Pd/C slurried in 500 mL ethyl acetate was added and the mixture stirred under a slight positive pressure of hydrogen 1.1 atm) for 48 h. The reaction mixture was filtered, concentrated in vacuo to a volume of 500 mL and 5 L of cold anhydrous ethyl ether added. Addition of HC1 gas provided a white precipitate. The solution was cooled at -20 oC for 4 h and the precipitate collected by vacuum filtration and dried in vacuo to provide (75 g, 78% yield) of ethyl 4-amino-lmethylimidazole-2-carboxylate hydrochloride as a fine white powder. TLC (7:2 benzene: ethyl acetate) Rf (amine) 0.3, Rf (salt) 0.0. H NMR (DMSO-d 6 8 10.11 (br s, 3H), 7.43 (s, 1H), 4.28 2H, J 7.1 Hz), 3.92 1H), 1.28 3H, J 7.1 Hz) 13C NMR (DMSO-d 6 6 157.6, 132.6, 117.4, 117.3, 61.8, 36.6, 14.5; IR(KBr) 3138, 2883, 1707, 1655, 1492, 1420, 1314, 1255, 1152, 1057, 837, 776.; FABMS m/e 169.085 (169.084 calc. for C-7HIN30 2 WO 98/49142 PCT/US98/06997 9. 4-[(tert-butoxycarbonyl)amino]-1-methylimidazole- 2-carboxylic acid
H
0 N
OH
1 0 The imidazole amine ethyl 4 -amino-l-methylimidazole- 2-carboxylate hydrochloride (75 g, 395 mmol) was dissolved in 200 mL DMF. DIEA (45 mL, 491 mmol) was added followed by di-t-butyldicarbonate (99 g, 491 mmol) The mixture was shaken at 60 OC for 18 h, allowed to assume room temperature, and partitioned between 500 mL brine, 500 mL ethyl ether. The ether layer was extracted (2 x 200 mL each) 10% citric acid, brine, satd. sodium bicarbonate, brine, dried over sodium sulfate and concentrated in vacuo to yield the Boc-ester contaminated with 20% Boc-anhydride as indicated by 1 H NMR. The Boc-ester, used without further purification, was dissolved in 200 mL 1M NaOH. The reaction mixture was allowed to stand for 3 h at 60 oC with occasional agitation. The reaction mixture was cooled to 0 oC, and carefully neutralized with 1 M HC1 to pH 2, at which time a white gel forms. The gel was collected by vacuum filtration, frozen before drying, and remaining water lyophilized to yield 4-[(tert-butoxycarbonyl)amino]l-methylimidazole-2-carboxylic acid as a white powder. (51 g, 54% yield). 1H NMR (DMSO-ds) 6 9.47 1H), 7.13 (s, 1H) 3.85 3H) 1.41 9H). "C NMR (DMSO-d 6 6 160.9, 152.9, 137.5, 134.5, 112.4, 79.5, 35.7, 28.6; IR(KBr) 3448, 2982, 1734, 1654, 1638, 1578, 1357, 1321, 1249, 1163, 799.; FABMS m/e 241.105 (241.106 calc. for C1 0
H
5 iN 3 0 4 WO 98/49142 PCT/US98/06997 y- (tert-butoxycarbonyl) amino] -butyric acid carboxamido-1-methyl-imidazole)-2-carboxylic acid O
H
O N N N H 0 N
OH
1 0 To a solution of Boc-y-aminobutyric acid (10 g, 49 mmol) in 40 mL DMF was added 1.2 eq HOBt (7.9 g, 59 mmol) followed by 1.2 eq DCC (11.9 g, 59 mmol). The solution was stirred for 24 h, and the DCU removed by filtration.
Separately, to a solution of ethyl 4-nitro-lmethylimidazole-2-carboxylate (9.8 g, 49 mmol) in 20 mL DMF was added Pd/C catalyst 1 g) and the mixture was hydrogenated in a Parr bom apparatus (500 psi H 2 for 2 h.
The catalyst was removed by filtration through celite and filtrate immediately added to the -OBt ester solution. An excess of DIEA (15 mL) was then added and the reaction stirred at 37 oC for 48 h. The reaction mixture was then added dropwise to a stirred solution of ice water and the resulting precipitate collected by vacuum filtration to provide crude ethyl (tert- butoxy) carbonyl] amino]butyric acid -(4-carboxamido-1-methyl-pyrrole)-2carboxylate (5 g, 14.1 mmol). To the crude ester dissolved in 50 mL methanol was added 50 mL 1M KOH and the resulting mixture allowed to stir for 6 h at 37 0 C. Excess methanol was removed in vacuo and the resulting solution acidified by the addition of 1 M HC1. The resulting precipitate was collected by vacuum filtration and dried in vacuo to yield y- (tert-butoxycarbonyl) amino] -butyric acid -(4-carboxamidol-methyl-imidazole)-2-carboxylic acid as a brown powder.
WO 98/49142 PCT/US98/06997 (4.4g, 89% yield). H NMR (DMSO-d 6 6 10.50 1 7.45 1 6.82 1 H, J 3.6 Hz), 3.86 3 2.86 2 H, J 4.6 Hz), 2.22 2 H, J 7.4 Hz), 1.57 (quintet, 2 H, J 5.9 Hz), 1.29 9 IR 3416, 2950, 2841, 1650,1538 1449, 1392, 1250, 1165, 1108; FABMS m/e 326.160 (326.159 calc. for C 14
H
22
N
4 0s) 11. 4-[(tert-butoxycarbonyl)amino]-l-methylpyrrole-2- 4 -carboxamido-1-methyl imidazole)-2-carboxylic acid
H
O N
H
N
I O 4-[(tert-butoxycarbonyl)amino]-l-methylpyrrole-2-(4carboxamido-1-methyl imidazole)-2-carboxylic acid was prepared as described below for y-[(tert-butoxycarbonyl)amino]-butyric acid 4 -carboxamido-1-methyl-imidazole)-2carboxylic acid substituting Boc-Pyrrole acid for Boc-yaminobutyric acid. (4.1 g, 91% yield). 'H NMR (DMSO-d) 6 10.58 1 9.08 1 7.57 1 6.97 1 6.89 1 3.89 3 3.75 3 1.35 9 13 C NMR (DMSO-d 6 8 160.36, 159.1, 153.4, 137.9, 132.3, 122.8, 122.3, 118.5, 115.5, 105.5, 105.4, 78.8, 28.7, 24.9; IR 3346, 2929, 1685, 1618, 1529, 1342, 1274, 1179, 997, 761. FABMS m/e 364.161 (364.162 calc. for C 16
H
22 NsOs).
WO 98/49142 PCT/US98/06997 Solution Phase Synthesis of Polyamides Using Boc- Protected Pyrrole and Imidazole Building Blocks.
1. Aminohexa- (N-methylpyrrolecarboxamide) ditrifluoroacetate.
BocHN
H
0 N N CCF 1 C O O H 0 OH 2) H2NPyPyPy-Dp. DIEA N I o To a solution of N- (tert-butoxycarbonyl) -tris (Nmethylpyrrolecarboxamide) (20 mg, 41 pmol) in DMF (1001) was added HBTU (26 mg, 69 imol) followed by DIEA (50 Al, 288 mol) The reaction was allowed to stand for minutes, agitated, and allowed to stand for an additional five minutes. Aminotris- (N-methylpyrrolecarboxamide) (24 mg, 41 pmol) was then added followed by DIEA (50 41, 288 Amol) and the reaction agitated for 2 hours. The reaction mixture was concentrated in vacuo and TFA (10 ml) added.
After 2 minutes the TFA was removed in vacuo. Purification of the resulting brown oil by reversed phase HPLC afforded the diamine aminohexa-(N-methylpyrrolecarboxamide) ditrifluoroacetate as a white powder. Yield: 26 mg 'H NMR (DMSOd 6 6 10.06 1 9.95 2 9.91 1 9.84 1 9.44 (br s, 1 8.16 1 H, J Hz), 7.22 4 7.16 1 H, J 1.7 Hz), 7.10 1 H, J 1.7 Hz), 7.07 3 6.98 1 H, J 1.7 Hz), 6.93 1 H, J 1.8 Hz), 3.88 6 3.84 12 H), 3.79 6 3.21 2 3.04 2 2.77 6 H, J 4.8 Hz), 1.80 2 FABMS m/e 835.412 (M H, 835.416 calc. for C 41 Hs 1 N1 4 0 6 WO 98/49142 PCT/US98/06997 2. ImPyPyPyPyPyPy-Dp N oN I O
N
N 0 N-methyl-Imidazole-2-carboxylic acid (100 mg, 741 imol) and HOBt (72 mg, 500 gmol) were suspended in 500 Al DMF. Upon addition of DCC (100 mg, 500 imol) the reaction mixture became a homogeneous solution. The activation was allowed to stand for 12 hours, precipitated dicyclohexylurea removed by filtration and Aminohexa-(Nmethylpyrrolecarboxamide) ditrifluoroacetate (10 mg, 9.4 ymol) added followed by DIEA (100 il, 576 imol), and the reaction allowed to stand for 2 hours. Reversed phase HPLC purification of the reaction mixture afforded ImPyPyPyPyPyPy-Dp as a white powder. Yield: 6.3 mg HPLC, r.t. 27.4 min; UV Xmax 246 (34,100), 304 (56,600) nm; 1H NMR (DMSO-d 6 8 10.46 1 9.55 1 9.94 3 9.90 1 9.20 (br s, 1 8,14 1 H, J 7.2 Hz), 7.38 1 7.28 1 H, J 1.4 Hz), 7.26 1 H, J 1.4 Hz), 7.23 4 7.08 5 7.04 1 H, J 1.2 Hz), 6.93 1 H, J 1.6 Hz), 3.98 3 3.84 15 3.83 1 3.30 2 H, J 7.4 Hz), 3.21 2 H, J 7.1 Hz), 2.77 6H, J 4.1 Hz), 1.74 2 MALDI-TOF MS 944.21 (M H 944.04 calc.); FABMS m/e 965.430 (M Na, 965.426 calc. for C 46 Hs 4 Ni 6 0 7 Na).
WO 98/49142 PCT/US98/06997 D. Solid Phase Syntheses 1. Activation of Imidazole-2-carboxylic acid, Boc-yaminobutyric acid, Boc-glycine, and Boc-p-alanine The appropriate amino acid or acid (2 mmol) was dissolved in 2 mL DMF. HBTU (720 mg, 1.9 mmol) was added followed by DIEA (1 mL) and the solution lightly shaken for at least 5 min.
2. Activation of Boc-Imidazole acid Boc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg, 1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) is then added and the solution allowed to stand for at least min.
3. Activation of Boc-y-Imidazole acid and Boc- Pyrrole-Imidazole acid The appropriate dimer (1 mmol) and HBTU (378 mg, 1 mmol) are combined in 2 mL DMF. DIEA (1 mL) is then added and the reaction mixture allowed to stand for 5 min.
4. Activation of Boc-Pyrrole acid. (for coupling to Imidazole amine) Boc-Pyrrole acid (514 mg, 2 mmol) was dissolved in 2 mL dichloromethane, DCC (420 mg, 2 mmol) added, and the solution allowed to stand for 10 min, DMAP (101 mg, 1 mmol) was added and the solution allowed to stand for 1 min.
Acetylation Mix.
2 mL DMF, DIEA (710 AL, 4.0 mmol), and acetic anhydride (380 pjL, 4.0 mmol) were combined immediately before use.
WO 98/49142 PCT/US98/06997 6. Manual Synthesis Protocol Boc-p-alanine-Pam-Resin (1.25 g, 0.25 mmol) is placed in a 20 mL glass reaction vessel, shaken in DMF for 5 min and the reaction vessel drained. The resin was washed with DCM (2 x 30 and the Boc group removed with PhSH, 1 x 30s., 1 x 20 min The resin was washed with DCM (2 x 30 followed by DMF (1 x 30 A resin sample (5 10 mg) was taken for analysis. The vessel was drained completely and activated monomer added, followed by DIEA if necessary. The reaction vessel was shaken vigorously to make a slurry. The coupling was allowed to proceed for 45 min, and a resin sample taken.
The reaction vessel was then washed with DCM, followed by
DMF.
7. Machine-Assisted Protocols Machine-assisted synthesis was performed on a ABI 430A synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 mmol/gram). Each cycle of amino acid addition involved: deprotection with approximately 80% TFA/DCM/0.4M PhSH for 3 minutes, draining the reaction vessel, and then deprotection for 17 minutes; 2 dichloromethane flow washes; an NMP flow wash; draining the reaction vessel; coupling for 1 hour with in situ neutralization, addition of dimethyl sulfoxide (DMSO)/NMP, coupling for 30 minutes, addition of DIEA, coupling for 30 minutes; draining the reaction vessel; washing with DCM, taking a resin sample for evaluation of the progress of the synthesis by HPLC analysis; capping with acetic anhydride/DIEA in DCM for 6 minutes; and washing with DCM. A double couple cycle is WO 98/49142 PCT/US98/06997 employed when coupling aliphatic amino acids to imidazole, all other couplings are performed with single couple cycles.
The ABI 430A synthesizer was left in the standard hardware configuration for NMP-HOBt protocols. Reagent positions 1 and 7 were DIEA, reagent position 2 was thiophenol, reagent position 3 was ethanolamine/methanol, reagent position 4 was acetic anhydride, reagent position 5 was DMSO/NMP, reagent position 6 was methanol, and reagent position 8 was DMF.
New activator functions were written, one for direct transfer of the cartridge contents to the concentrator (switch list 21, 25, 26, 35, 37, 44), and a second for transfer of reagent position 8 directly to the cartridge (switch list 37, 39, 45, 46).
Boc-Py-OBt ester (357 mg, 1 mmol) was dissolved in 2 ml DMF and filtered into a synthesis cartridge. Boc-Im acid monomer was activated (DCC/HOBt), filtered, and placed in a synthesis cartridge. Imidazole-2-carboxylic acid was added manually. At the initiation of the coupling cycle the synthesis was interrupted, the reaction vessel vented and the activated monomer added directly to the reaction vessel through the resin sampling loop via syringe. When manual addition was necessary an empty synthesis cartridge was used. Aliphatic amino acids (2 mmol) and HBTU (1.9 mmol) were placed in a synthesis cartridge. 3 ml of DMF was added using a calibrated delivery loop from reagent bottle 8, followed by calibrated delivery of 1 ml DIEA from reagent bottle 7, and a 3 minute mixing of the cartridge.
The activator cycle was written to transfer activated monomer directly from the cartridge to the concentrator vessel, bypassing the activator vessel. After transfer, 1 WO 98/49142 PCT/US98/06997 ml of DIEA was measured into the cartridge using a calibrated delivery loop, and the DIEA solution combined with the activated monomer solution in the concentrator vessel. The activated ester in 2:1 DMF/DIEA was then transferred to the reaction vessel. All lines were emptied with argon before and after solution transfers.
8. Stepwise HPLC analysis A resin sample 4 mg) was placed in a 4 mL glass test tube, 200iL of N,N-dimethylaminopropylamine was added and the mixture heated at 100 0 C for 5 min. The cleavage mixture was filtered and a 25gL sample analyzed by analytical HPLC at 254 nm.
9. Typical Manual Synthesis Protocol: PyPyPy-y- ImImPy- -Dp
N
HN 0
HH
O N N 0 oN N H H H 0 N 0 0 Boc-P-Pam-resin (1.25 g, 0.25 mmol amine) was shaken in DMF for 30 min and drained. The N-Boc group removed by washing with DCM for 2 x 30 s, followed by a 1 min shake in TFA/DCM/0.5M PhSH, draining the reaction vessel and a brief 80% TFA/DCM/ 0.5 M PhSH wash, and 20 min shaking in TFA/DCM/0.5M PhSH solution. The resin was washed 1 min with DCM and 30 s with DMF. A resin sample (8-10 mg) was taken for analysis. The resin was drained completely and WO 98/49142 PCT/US98/06997 Boc-pyrrole-OBt monomer (357 mg, 1 mmol) dissolved in 2 ml DMF added followed by DIEA (1 ml) and the resin shaken vigorously to make a slurry. The coupling was allowed to proceed for 45 min. A resin sample (8-10 mg) was taken after 40 min to check reaction progress. The reaction vessel was washed with DMF for 30 s and dichloromethane for 1 min to complete a single reaction cycle. Six additional cycles were performed adding, BocIm-OH (DCC/HOBt), BocIm-OH (DCC/HOBt), Boc-y-aminobutyric acid (HBTU/DIEA) and allowed to couple for 2 hours, BocPy-OBt, BocPy-OBt, and pyrrole-2carboxylic acid (HBTU/DIEA). The resin was washed with DMF, DCM, MeOH, and ethyl ether and then dried in vacuo.
PyPyPy-y-ImImPy-P-Pam-Resin (180 mg, 29 Jmol) 12 was weighed into a glass scintillation vial, 1.5 ml of N,Ndimethylaminopropylamine added, and the mixture heated at oC for 18 hours. The resin was removed by filtration through a disposable polypropylene filter and washed with ml of water, the amine solution and the water washes combined, and the solution loaded on a Cs preparatory
HPLC
column, the column allowed to wash for 4 min in 0.1% TFA at 8 ml/min, the polyamide was then eluted in 100 min. as a well defined peak with a gradient of 0.25% acetonitrile per min. The polyamide was collected in four separate 8 ml fractions, the purity of the individual fractions verified by HPLC and 1H NMR, to provide purified PyPyPy-y-ImImPy-p- Dp(11.2 mg, 39% recovery), UV 246 (31,100), 312 (51,200) HPLC, r.t. 23.6, 1 H NMR (DMSO-d 6 6 10.30 1 H), 10.26 1 9.88 1 9.80 1 9.30 1 9.2 (br s, 1 8.01 3 7.82 (br s 1 7.54 1 7.52 1 7.20 1 H, J 1.3 Hz), 7.18 1 H, J 1.2 Hz), 7.15 1 H, J 1.3 Hz), 7.01 1 WO 98/49142 PCT/US98/06997 H, J 1.4 Hz), 6.96 1 H, J 1.4 Hz), 6.92 1 H, J 1.8 Hz), 6.89 2 6.03 1 H, J 2.4 Hz), 3.97 3 H) 3.96 3 H) 3.85 3 H) 3.82 3 3.78 6 3.37 2 H) 3.20 2 H, J 5.7 Hz), 3.08 2 H J 6.6 Hz), 2.94 2 H J 5.3 Hz), 2.71 6 H J 5.8 Hz), 2.32 4 H) 1.83 4 MALDI-TOF-MS, 978.7 (979.1 calc. for M+H).
9. ImImPy-y-PyPyPy-p-Dp 0
N
HN S 0
N
H
H
N H I 0 N N T jr\
H
N
Y 1 J H H H I 0 .N
N.NN-
I 0 0 Polyamide was prepared by machine assisted solid phase synthesis protocols and 900 mg resin cleaved and purifed to provide ImImPy-y-PyPyPy-p-Dp as a white powder. (69 mg, 48% recovery), UV Xa, 246 (43,300), 308 (54,200) HPLC, r.t.
23.9, 1 H NMR (DMSO-d 6 6 10.31 1 9.91 1 9.90 1 9.85 1 9.75 1 9.34 (br s, 1 H), 8.03 3 7.56 1 7.46 1 7.21 2 H), 7.15 2 7.07 1 H J 1.2 Hz), 7.03 1 H, J 1.3 Hz), 6.98 1 H, J 1.2 Hz), 6.87 2 4.02 (m, 6 H) 3.96 6 H) 3.87 6 3.75 2 H, J 4.9 Hz), 3.36 2 H, J 4.0 Hz), 3.20 2 H, J 4.7 Hz), 3.01 2 H J 5.1 Hz), 2.71 6H, J 4.8 Hz), 2.42 4 H) 1.80 4 H) MALDI-TOF-MS 978.8, (979.1 calc.
for M H) WO 98/49142 PCT/US98/06997 AcImImPy-y-Pypypy--Dp 0o
I
N
O0 HN 0 oN -UY HA
H
O O N H N N0 N H H K HI Polyamide was prepared by manual solid phase protocols and isolated as a white powder. (8 mg, 28% recovery),
UV
max, 246 (43,400), 312 (50,200) HPLC, r.t. 24.8, H NMR (DMSO-d 6 6 10.35 1 10.30 1 9.97 1 H), 9.90 1 9.82 1 9.30 1 9.2 (br s 1H), 8.02 3 7.52 1 7.48 1 7.21 (m, 2H), 7.16 1 H, J 1.1 Hz), 7.11 1 H, J 1.2 Hz), 7.04 1 H, J 1.1 Hz), 6.97 1 H, J 1.3 Hz), 6.92 1 H, J 1.4 Hz), 6.87 1 H, J 1.2 Hz), 3.99 3 3.97 3 3.83 3 3.82 3 3.80 3 3.79 3 3.47 2 H, J 4.7 Hz), 3.30 2 H, J 4.6 Hz), 3.20 2 H, J 5.0 Hz), 3.05 2 H, J 5.1 Hz), 2.75 6 H, J 4.1 Hz), 2.27 4 2.03 (s, 3 1.74 4 H) MALDI-TOF-MS, 1036.4 (1036.1 calc. for
M+H).
WO 98/49142 PCT/US98/06997 11. AcPyPyPy-y-ImImPy-p-Dp O I CHN O
N
N N -N H H 1
N
O N 1 0 0 O O Polyamide was prepared by machine assisted solid phase methods protocols as a white powder. (14 mg, 48% recovery), UV max, 246 (44,400), 312 (52,300) HPLC, r.t.
23.8, H NMR (DMSO-d 6 10.32 1 10.28 1 9.89 2 9.82 1 9.18 1 9.10 (br s, 1 H), 8.03 3 7.55 1 7.52 1 7.21 1 H, J 1.1 Hz) 7.18 1 H, J 7.16) 7.15 1 H, J Hz), 7.12 1H, J 1.0 Hz), 7.02 1 H, J 1.0 Hz), 6.92 1 H, J 1.1 Hz), 6.87 1H, J 1.1 Hz), 6.84 1H, J 1.0 Hz), 3.97 3 3.93 3 3.87 (s, 3 3.80 3 3.78 6 3.35 2 H, J 5.6 Hz), 3.19 2 H, J 5.3 Hz), 3.08 2 H, J 5.7 Hz), 2.87 2 H, J 5.8 Hz), 2.71 6 H, J 4.0 Hz), 2.33 4 1.99 3 H) 1.74 4 H) MALDI-TOF-MS, 1036.2 (1036.1 calc for M+H).
WO 98/49142 PCT/US98/06997 12. ImPyPy-y-PyPyPy-P-Dp 0 1 HN 0 0N H
N
H F1 N 0 N N H H H I O N N N N-- I O o ImPyPy-y-PyPyPy-p-Pam-Resin was prepared by machineassisted synthesis protocols. A sample of resin (1 g, 0.17 mmol was placed in a 20 mL glass scintillation vial, 4 mL of dimethylaminopropylamine added, and the solution heated at 55 oC for 18 h. Resin substitution is calculated as Lnew(mmol/g) Lold/ (1 Lold(Wnew Wold) x 10- 3 L is the loading, and W is the molecular weight of the polyamide attached to the resin. (Barlos, et al. Int. J. Peptide Protein Res. 1991, 37, 513.) Resin is removed by filtration through a disposable propylene filter and 16 mL of water added. The polyamide/amine mixture was purified directly by preparatory HPLC and the appropriate fractions lyophylized to yield a white powder. (103 mg, 61 recovery) HPLC r.t. 24.1, UV nax(H20) 234 nm (39,300), 304 nm (52,000); H NMR (DMSO-d 6 10.47 1 9.91 (s, 1 9.89 1 9.87 1 9.84 1 9.2 (br s, 1 8.08 3 7.38 1 7.26 1 H, J 1.0 Hz), 7.20 1 H, J 1.0 Hz), 7.14 4 7.04 (d, 1 H, J 1.1 Hz), 7.02 1 H, J 1.1 Hz), 6.89 1 H, J 1.0 Hz), 6.85 2 3.97 3 3.82 6 3.81 3 H) 3.77 6 3.34 2 H, J 3.9 Hz), 3.18 2 H, J 5.5 Hz), 3.06 2 H, J 5.7 Hz), WO 98/49142 WO 9849142PCT[US98/06997 2.95 (in, 2 H, J 9 Hz), 2. 71 (di, 6 H, J 4 .6 Hz), 2. (in, 6 1. 75 (mn, 4 MALDI -TOF MS 9 78. 0 (9 78. 1 calic.
f or M+H).
13. ImiPYPy-y-PyPyPy-G-Dp
N
HN 0
N
N 01
HH
N H D N H 0 N 0 I 0 I0 H
H
ImPyPy-y-PyPyPy--G-Dp was prepared as described for ImPyPy-y-PyPyPy-P3-Dp. (12 mg, 40% recovery). HPLC, r.t.
2 6. 9, UV Xa (120) 246 (41,100), 306 (51,300) 'H NIVR (DMSOdr,) 8 10. 50 1 9.95 1 9.93 1 H) 9. 92 (s, 1 H) 9.86 1 9.2 (br s, 1H), 8.29 1 H, J =4.4 Hz), 8. 07 1 H, J 5.2 Hz) 8. 03 1 H, J =5.4 Hz), 7.39 1 7. 27, (di, 1 H, J 1 .6 H z) 7. 22 (in, 2 7. 16 (in, 2 H) 7. 04 (in, 2 H) 6. 92 1 H, J 1.6 Hz), 6. 89 1 H, J 1. 7 Hz) 6. 86 (di, 1 H, J 1. 6 Hz), 3.97 Cs, 3 H) 3.82 (mn, 6 H) 3. 81 3 3.78 (in, 6 3.70 (di, 2 H, J 5. 7 Hz) 3.20 2 H, J 3.11 2 H, J 4. 2 Hz) 3. 00 2 H, J 4.4 Hz), 2.76 (d, 6 H, J 4. 7 Hz) 2. 24 2 H, J 4. 8 Hz) 1. 77 (mn, 4 H) MALDI -TOF-MS, 964. 3 (964. 1 caic. f or M+H) WO 98/49142 WO 9849142PCTIUS98/06997 14. AcImPyPY-Y-PyPyPy-G-Dp
N
HN N
N
H N 0 0 H H N H
H
0 N 0 'N
H
S
N\
0 H
H
Ac ImPyPy-y- PyPyPy-G- Dp was prepared as described for ImPyPy-y-PyPyPy-P-Dp. (13.1 mg, 30% yield) HPLC, r.t. 24.0, UJV 4,a (H 2 0) 246 (35, 900) 312 (48, 800) 1 H NMR (DMSO-d 6 )6 10.23 1 H) 9.98 1 .9.32 1 H) 9. 90 (in, 2 H) 9. 84 1 H) 9. 2 (br s, 1 H) 8.27 1 H, J 5. 0) 8. 05 (in, 2 H) 7.41 1 7.25 1 H, J 1.4 Hz) 7. 22 (mn, 2 H) 7.16 (in, 2 7.12 1 H, J 1. 7 Hz) 7.05 1 H, J 1. 5 Hz) 6. 94 I H, J 1. 6 Hz) 6.89 1 H, J 1.7 Hz) 6.87 1 H, J 1. 6 Hz) ,3.93 3 3.83 3 H) 3. 82 (in, 6 H) 3. 81 3 3.79 3 3.71 2 H, J 5. 1 Hz) 3.19 2 H, J 5. 8 Hz) 3.12 2 H, J 5. 0 Hz) 3. 01 2 H, J 4. 2 Hz) 2.74 6 H, J 4.6 Hz), 2.26 (t 2 H, J 4.6 Hz), 2.00 3 1. 75 (in, 4 H) MALDI-TOF-MS, 1021.6 (1021.1 caic. for
M+H).
WO 98/49142 WO 9849142PCT/US98/06997 AcIzPyPy-y-pypypy-3pD HN N 0
N
N
H
N H N H H H 0N
N
1 0 0 AcImPyPy-y-PyPyyp -Dp was prepared as described for IMPyPy-Y-PyPyPy-p-Dp. (9.2 mg 31% yield), UV X. 0) 24 6 (42,800), 312 (50,400) HPLC, r.t. 24.9, 1H NNR (DMSO-d 6 10.-2 5 1 H) 10.0o1 (s 1 H) 9. 92 (in, 3 H) 9. 86 (s,1 H) 9.3 (br s, 1 H) 8. 10 (in, 3 7.42 1 7.25 (d, 1 H, J 1. 5 H 7. 20 1 H, J 6 H z) 7.16 (mn, 3 H), 7.12 1 H, J =1.4 Hz) 7.03 1 H J 6.89 (d, 1 H, J 1.6 Hz), 6.86 (mn, 2 H) 3 .92 3 3.83 3 H) 3.-8 2 3 H) 3. 80 6H), 3.78 3 3.35 2 H, J 5.5 Hz), 3.20 2 J 3.8 Hz), 3.08 2 H, J 3. 3 Hz) 2. 97 2 H, J 3.8 Hz), 2.75 6 H J 4. 8 Hz) 2. 34 2 H, J 5. 0 Hz), 2.24 2 H, J 4.4 Hz), 2. 00 3 H) 1. 71 (in, 4 MALDI-TOF-MS, 1035.4 (1035.1 caic. for M+H).
WO 98/49142 PCT/US98/06997 16. ImImIm-y-PyPyPy-P-Dp.
0 I HN iN 0 L N NF N H H N HH H H N N O O 0 The product was synthesized by manual solid phase protocols and recovered as a white powder (2.4 mg, 4% recovery). UV 4ax 312 (48,500); 1H NMR (DMSO-d 6 d 10.09 1 9.89 1 9.88 1 9,83 1 9.57 1 9.19 (br s, 1 8.36 1 H, J 5.6 Hz), 8.03 2 7.64 1 7.51 1 7.45 1 7.20 1 H, J 1.0 Hz), 7.15 1 H, J 2.0 Hz), 7.14 1 7.08 1 7.04 1 6.87 2 H, J 2.2 Hz), 4.01 3 3.99 3 3.95 3 3.82 (s, 3 H) 3.82 3 3.79 3 3.37 2 H, J 5.8 Hz), 3.26 2 H, J 6.1 Hz), 3.10 2 H, J 6.1 Hz), 2.99 2 2.73 6 H, J 4.8 Hz), 2.34 2 H, J 7.2 Hz), 2.27 2 H, J 7.3 Hz), 1.79 4 MALDI- TOF-MS, 980.1 (980.1 calc. for M+H).
WO 98/49142 PCT/US98/06997 17. ImImIm-y-PyPyPy- -Dp-NH 2 0 1 HN- 0 N N N SNN
N
H H H N
N
S
H
O O I 0 0 A sample of machine-synthesized resin (350 mg, 0.17 mmol/gram') was placed in a 20 mL glass scintillation vial, and treated with 2 mL 3 ,3 -diamino-N-methyldipropylamine at OC for 18 hours.
The resin was removed by filtration through a disposable propylene filter, and the resulting solution dissolved with water to a total volume of 8 mL, and purified directly by preparatory reversed phase HPLC to provide ImImIm-y-PyPyPy-P-Dp-NH2 (28 mg, 41% recovery) as a white powder. H NMR (DMSO-d 6 6 10.14 1 9.89 1 9.88 1 9.83 1 9.6 (br s, 1 9.59 (s, 1 8.36 1 H, J 5.5 Hz), 8.09 1 H, J Hz), 8.03 1 H, J 5.0 Hz), 7.9 (br s, 3 7.63 1 7.50 1 7.44 1 7.19 1 H, J 1.2 Hz), 7.13 2 7.08 1 H, J 1.3 Hz), 7.02 1 H, J 1.2 Hz), 6.85 2 4.01 3 3.99 3 3.97 6 3.80 3 3.77 3 3.34 2 H, J 5.3 Hz), 3.23 2 H, J 6.0 Hz), 3.05 6 H), 2.83 2 H, J 5.0 Hz), 2.70 3 H, J 4.0 Hz), 2.32 2 H, J 6.9 Hz), 2.25 2 H, J 6.9 Hz), 1.90 2 WO 98/49142 PCT/US98/06997 1,77 4 H) MALDI-TOF-MS, 1022.8 (1023.1 calc. for
M+H).
18. ImPyPy-G-PyPyPy-G-Dp-NH 2 Polyamide was prepared by manual solid phase methods as a white powder upon cleavage of 240 mg resin with Nmethyl-bis(aminopropyl)amine (2 ml, 55 OC) (19.0 mg, 44 recovery after HPLC purification). 1H NMR (DMSO-d 6 5 10.49 1 9.97 1 9.93 1 9.91 1 9.89 1 9.7 (br s, 1 8.27 2 8.04 1 H, J 5.1 Hz), 7.88 (br s, 3 7.39 1 7.27 1 H, J 1.6 Hz) 7.21 3 H) 7.15 2 H) 7.05 2 H) 6.93 3 3.97 3 3.96 6 3.92 9 H), 3.72 4 3.14 6 3.05 2 H, J 5.4 Hz), 2.73 3 H, J 3.3 Hz), 1.88 (quintet, 2 H, J 4.6 Hz), 1.75 (quintet, 2 H, J 6.3 Hz). MALDI-TOF-MS, 979.0 (979.1 calc for M+H).
19. ImPyPy-G-PyPyPy- -Dp-NH 2 Polyamide was prepared by manual solid phase methods as a white powder upon cleavage of 240 mg resin with Nmethyl-bis(aminopropyl)amine (2 ml, 55 oC) (25 mg, 55 recovery). HPLC, r.t. 22.0; H NMR (DMSO-d6) 5 10.53 1 10.00 1 9.98 1 9.93 1 9.92 (s, 1 9.7 (br s, 1 8.31 1 H, J 5.7 Hz), 8.12 (t, 1 H, J 5.5 Hz), 8.04 1 H, J 5.6 Hz), 7.9 (br s, 3 7.41 1 7.29 1 H, J 1.7 Hz), 7.23 1 H, J 1.5 Hz), 7.22 1 H, J 1.4 Hz), 7.16 3 H), 7.07 1 H, J 1.2 Hz), 7.03 1 H, J 1.3 Hz), 6.94 1 H, J 1.6 Hz), 6.93 1 H, J 1.5 Hz), 6.86 (d, 1 H, J 1.4 Hz), 3.98 3 3.88 2 H, J 5.6 WO 98/49142 PCT/US98/06997 Hz), 3.83 3 3.82 6 3.80 3 3.78 (s, 3 3.37 2 H, J 6.4 Hz), 3.11 6 2.86 2 H, J 6.1 Hz), 2.70 3 H, J 4.6 Hz), 2.32 2 H, J 7.2 Hz), 1.87 (quintet, 2 H, J 7.4 Hz), 1.75 (quintet, 2 H, J 6.0 Hz), MALDI-TOF-MS, 993.3 (993.1 calc for M+H) ImPyPy-P-PyPyPy-G-Dp-NH 2 Polyamide was prepared by automated solid phase methods as a white powder upon cleavage of 240 mg resin with N-methyl-bis(aminopropyl)amine (2 ml, 55 OC) (23.0 mg, 53 recovery). HPLC, r.t. 20.6; H NMR (DMSO-d 6 8 10.45 1 9.95 1 9.92 3 9.6 (br s, 1 H), 8.27 1 H, J 4.7 Hz), 8.11 2 7.9 3 H), 7.38 1 7.26 1 H, J 1.7 Hz), 7.21 2 H), 7.17 2 7.13 1 H, J 1.8 Hz), 7.05 2 H), 6.93 1 H, J 1.6 Hz), 6.88 1 H, J 1.6 Hz), 6.83 1 H, J 1.7 Hz), 3.97 3 3.82 9 3.81 3 3.79 3 3.73 2 3.44 2 H, J 5.5 Hz), 3.2 6 2.85 2 H, J 5.8 Hz), 2.73 (d, 3 H, J 4.5 Hz), 1.89 (quintet, 2 H, J 6.4 Hz), 1.77(quintet, 2 H, J 6.9 Hz) MALDI-TOF-MS, 992.9 (993.1 calc for M+H).
WO 98/49142 PCT/US98/06997 21. ImPyPy-y-ImPyPy-P-PyPyPy-G-Dp-NH 2 0 HNI O H HN N
N
N H H I o N
H
I 0 N N NH
HN
The polyamide was prepared by machine-assisted solid phase methods as a white powder. (29 mg 59 recovery).
HPLC r.t. 21.5, 1H NMR (DMSO-d 6 6 10.50 1 10.27 1 9.96 1 9.93 5 9.2 br s, 1 H), 8.27 1 H, J 5.1 Hz), 8.03 3 7.90 3 H), 7.45 1 7.40 1 7.27 1 H, J 1.3 Hz), 7.25 1 H, J 1.4 Hz), 7.22 2 7.18 2 H), 7.17 1 H, J 1.4 Hz), 7.14 1 H, J 1.3 Hz), 7.11 2 7.06 1 H, J 1.5 Hz), 6.94 1 H, J 1.3 Hz), 6.88 2 6.84 1 H, J 1.4 Hz), 3.97 3 3.93 3 3.83 9 3.80 6 3.76 6 3.72 2 H, J 5.2 Hz), 3.43 2 H, J 5.0 Hz), 3.17 6 3.11 2 H, J 5.3 Hz), 2.85 2 H, J 5.2 Hz), 2.73 3 H, J 3.9 Hz), 2.51 2 H, J Hz), 2.35 2 H, J 6.7 Hz), 1.92 (quintet, 2 H, J 6.8 Hz), 1.78 4 H) MALDI-TOF MS 1445.6 (1445.6 calc for
M+H).
WO 98/49142 PCT/US98/06997 22. ImImImPy-y-PyPyPyPy-P-Dp-NH 2 A sample of machine-synthesized resin (350 mg, 0.16 mmol/gram) was placed in a 20 mL glass scintillation vial, and treated with 2 mL 3 ,3'-diamino-N-methyldipropylamine at oC for 18 hours. The resin was removed by filtration through a disposable propylene filter, and the resulting solution dissolved with water to a total volume of 8 mL, and purified directly by preparatory reversed phase HPLC to provide ImImImPy-y-PyPyPyPy-P-Dp-NH 2 (31 mg, 40% recovery) as a white powder. 1H NMR (DMSO-d 6 6 10.37 1 10.16 1 9.95 1 9.93 1 9.91 1 9.86 1 9.49 (br s, 1 9.47 1 8.12 3 H), (br s 3 7.65 1 7.57 1 7.46 1 7.20 3 7.16 2 7.09 1 H, J Hz), 7.05 2 7.00 1 H, J 1.6 Hz), 6.88 2 4.01 3 3.99 3 3.98 3 3.83 3 3.82 3 3.81 3 3.79 3 3.78 3 3.36 2 H, J 5.3 Hz), 3.21- 3.05 8 2.85 2 H, J 4.9 Hz), 2.71 3 H, J 4.4 Hz), 2.34 (t, 2 H, J 5.9 Hz), 2.26 2 H, J 5.9 Hz), 1.85 (quintet, J 5.7 Hz), 1.72 4 MALDI-TOF-MS, 1267.1 (1267.4 calc. for M+H).
WO 98/49142 PCT/US98/06997 23. ImPyPyPyPy-y-ImPyPyPyPy-P-Dp-NH2 I 6 0 I 6 N f y H H I 0I N N NH 2 I 0 0 A sample of ImPyPyPyPy-y-ImPyPyPyPy-P-resin prepared by machine-assisted solid phase synthesis (240 mg, 0.16 mmol/gram) was placed in a 20 mL glass scintillation vial, and treated with 3 ,3-diamino-N-methyldipropylamine (2 mL) at 55 oC for 18 hours. Resin was removed by filtration, and the filtrate diluted to a total volume of 8 mL with 0.1 (wt/v) aqueous TFA. The resulting crude polyamide/amine solution was purified directly by reversed phase HPLC to provide the trifluoroacetate salt of ImPyPyPyPy-y- ImPyPyPyPy-p-NH 2 (31 mg, 40% recovery) as a white powder. UV kmax 241, 316 83300 (calculated based on e 8,333/ring 5 H NMR (DMSO-d6) 6 10.53 1 10.28 1 10.03 1 10.00 1 9.96 2 9.92 (m, 2 9.6 (br s, 1 8.07 4 7.89 3 7.45 1 7.41 1 7.27 2 H, J= 8.5 Hz), 7.23 (m, 4 7.16 4 7.06 4 6.87 2 3.98, 3 3.94 3 3.84, 6 3.79 3 H), 3.35 2 H, J= 5.7 Hz), 3.16 8 2.85 2 H, J= 5.6 Hz), 2.72 2 H, J= 4.2 Hz), 2.34 2 1.91 (m, WO 98/49142 WO 9849142PCT/US98/06997 4. H) 1.7 8 4 f or M+H) MALDI-TOF MS, 1510.4 (1510.7 caic.
24. ImImPyPyPy-yImPyPyPyPyoDp-NH~ 2 '6I 6 H HI 2 0 1 6 The polyamide was prepared as a white powder as described for ImPyPyPyPy-y-ImPyPyPyPy-g3-j 2 1NMR
(DMSO-
d 6 8 10.39 Cs, 1 H) 10.28 Cs, 1 H) 10.03 Cs, 1 H) 10.00 1 H) 9. 92 Cm, 2 H) 9.82 1 9.66 (br s, 1 H) 8.11 Cm, 4 H) 7.89 3 H) 7.57 Cs, 1 H) 7.46 2 H, J=2.4 Hz) 7.27 (dd, 2 H, J= 1. 0 Hz) 7. 23 Cm, 4 7. 16 Cm, 4 7.08 Cm, 4 H) 6. 88 (in, 1 H) 4.00 3 3. 94 3 H) 3.78 Cs, 3 H) 3.19 2 H, J=5.1 Hz) 3.05 (mn, 8 H) 2. 86 2 H, J= 4. 8 Hz) 2. 72 2 H, J= 4.4 Hz) 2.34 Cm, 4 H) 1.90 Cm, 4 1.78 Cm, 4 MALDI-TOF-MS, 1510.4 (1511.7 caic. for M+H).
WO 98/49142 PCT/US98/06997 ImImIm-y-PyPyPy-P-Dp-EDTA 0 HN N O H H I I N H N N N H H
H
SNN
NI Ni-I 0 EDTA-dianhydride (50 mg) was dissolved in 1 mL DMSO/NMP solution and 1 mL DIEA by heating at 55 oC for min. The dianhydride solution was added to ImImIm-y-PyPyPy- P-Dp-NH 2 (8.0 mg, 7 #mol) dissolved in 750 AL DMSO. The mixture was heated at 55 oC for 25 minutes, and treated with 3 mL 0.1M NaOH, and heated at 55 oC for 10 minutes.
0.1% TFA was added to adjust the total volume to 8 mL and the solution purified directly by preparatory HPLC chromatography to provide ImImIm-y-PyPyPy-P-Dp-EDTA as a white powder. (3.3 mg, 30% recovery) 1H NMR (DMSO-d 6 d 10.14 1 9.90 1 9.89 1 9.85 1 9.58 1 9.3 (br s, 1 8.40 2 8.02 (m, 2 7.65 1 7.51 1 7.45 1 7.20 (d, 1 H, J 1.5 Hz), 7.15 2 7.08 1 H, J 1.1 Hz), 7.04 1 H, J 1.5 Hz), 6.86 2 4.00 3 3.98 3 3.94 3 3.87 4 3.82 3 3.81 3 3.78 3 3.72 4 3.4-3.0 16 2.71 3 H, J 4.2 Hz), 2.33 2 H, J 5.1 Hz), 2.25 2 H, J 5.9 Hz), 1.75 6 MALDI-TOF- MS, 1298.4 (1298.3 calc. for M+H) The polyamide was loaded with Fe(II) by standard methods.
WO 98/49142 PCT/US98/06997 26. ImPyPy-y-ImPyPy-P-Dp O
I
IIN
HNN 0N ON N N N-- N
N
N i\
H
O O N H H H 1 0 0 The polyamide was prepared by machine-assisted solid phase methods as a white powder. (17 mg, 56% recovery).
HPLC r.t. 26.1, UV X, 234 (39,300), 312 (53,200) nm; 1H NMR (DMSO-d 6 d 10.53 1 10.27 1 H) 10.04 1 9.96 1 9.94 1 9.2 (br s, 1 H), 8.08 3 7.49 2 7.44 1 7.31 1 H, J 1.0 Hz), 7.23 1 H, J 1.1 Hz), 7.19 3 7.10 1 6.92 1 H, J 1.1 Hz), 6.90 1 H, J 1.1 Hz). 4.01 3 3.97 3 3.86 6 3.82 (m, 6 3.41 2 H, J 6.0 Hz), 3.22 2 H, J 5.9 Hz), 3.13 2 H, J 5.9 Hz), 3.0 2 H, J 5.6 Hz), 2.76 6 H, J 4.8 Hz), 2.37 4 1.78 4 MALDI- TOF MS 979.3 (979.1 calc. for M+H).
27. ImPyPy-G-PyPyPy-G-Dp Ni N-H- S N H 0 NN N N H H 0 N N H N N H 0 WO 98/49142 WO 98/91 42PCTIUS98/06997 Polyamide was prepared by manual solid phase methods and obtained as a white powder upon cleavage of 240 mg resin. (initial subsitution of 0.2 mrnol Boc -Glyc ine /gram) with dime thyl ami nopropyl ami ne (11.9 mg, 29% recovery).
HPLC, r.t. .26. 9 min.; UV Xm, 246 (41,100), 312 (48,400) nm; 1 H NNR CDMSO-d 6 8 10. 49 Cs, 1 H) 9.98 1 H) 9. Cs, 1 H) 9. 92 1 H) 9.89 1 H) 9.2 (br s, 1 H), 8. 30 Cm, 2 H) 8. 06 1 H, J 5.8 Hz) 7.40 1 H), 7.24, 1 H, J 1. 7 Hz) 7.23 Cm, 3 H) 7.17 (in, 2 H), 7.06 (in, 2 6.94 Cm, 3 H) 3. 99 Cs, 3 H) 3. 89 Cd, 2 H, J 5.8 Hz), 3.84 Cs, 3 H) 3. 84 3 H) 3.83 Cs, 3 H), 3.81 Cs, 3 H) 3.80 Cs, 3 3.72 Cd, 2 H, J =4.3 Hz), 3. 13 2 H, J 5. 7 Hz) 3. 01 2 H, J 5. 2 Hz), 2.76 Cd, 6 H, J 3 Hz) 1. 77 (quintet, 2 H, J 7. 4 Hz) MALDI -TOF MS 935. 7 CM H 936. 0 caic for C 44 H55Ns06); FABMS m/e 935.433 CM H 935.439 calcd. for C 44
H
55
N,
6 0 8 28. ImPyPy-G-PyPyPy-P3-Dp N
H
N
Nll
H
0 N N
H
0
NH
N
1 0 0 Polyamide was prepared by manual solid phase methods as a white powder upon cleavage of 180 mg resin Cinitial subsitution of 0.2 mmol B~oc-p-alanine/gram) with dimethylaminopropylamine (2 ml, 55 OC). C12.3 mng, 38 recovery after HPLC purification) .HPLC, r.t. 25.5, UV Xma Ws, 246 (39,500), 312 C52,000) nrn; 'H NNR CDMSO-d 6 10.46 WO 98/49142 PCT/US98/06997 1 9.96 1 9.90 1 9.88 2 9.21 (br s, 1 8.27 1 H, J 4.2 Hz), 8.06 2 7.39 1 7.28 1 H, J 1.6 Hz), 7.23 1 H, J 1.7 Hz), 7.20 1 H, J 1.5 Hz) 7.15 3 7.04 2 7.03 1 H, J 1.6 Hz), 6.94 1 H, J 1.7 Hz), 6.92 1 H, J 1.4 Hz), 3.98 3 3.88 2 H, J 5.6 Hz), 3.83 3 3.82 6 3.79 3 H), 3.78 3 3.36 2 H, J 5.3 Hz), 3.09 2 H, J Hz), 2.75 2 H, J 5.2 Hz), 2.72 6 H, J 4.8 Hz), 2.30 2 H, J 6.3 Hz), 1.72 (quintet, 2 H, J 5.7 Hz) MALDI-TOF MS 950.1 (950.0 calc for M FABMS m/e 949.462 (M H 949.455 calc. for C 4 sH 5 7N 16 0 8 29. ImPyPy-P-PyPyPy-G-Dp Polyamide was prepared by automated solid phase methods as a white powder upon cleavage of 180 mg resin (initial subsitution of 0.2 mmol Boc-Glycine/gram) with dimethylaminopropylamine (2 ml, 55 (17.2 mg, 57 recovery after HPLC purification). HPLC, r.t. 26.5; UV max 246 (46,500), 312 (54,800) nm; 'H NMR (DMSO-d 6 6 10.54 1 9.92 1 9.90 3 9.23 (br s, 1 H), 8.27 1 H, J 5.5 Hz), 8.06 1 H, J 6.3 Hz), 8.03 1 H, J 6.2 Hz), 7.39 1 7.26 1 H, J 1.7 Hz) 7.20 2 H) 7.17 3 7.13 2 7.04 (d, 1 H, J 1.5 Hz), 6.87 1 H, J 1.8 Hz), 6.83 1 H, J 1.8 Hz), 3.97 3 H) 3.82 15 H) 3.78 2 H, J 3.4 Hz) 3.27 4 H) 3.15 2 H) 3.79 2 H) 2.76 6 H, J 4.9 Hz), 1.78 (quintet, 2 H, J 6.6 Hz) MALDI-TOF MS 950.2 (950.0 calc. for M FABMS m/e 949.458 (M H 949.455 calc. for C 45
H
57
N,
6 0 8 s) WO 98/49142 PCT/US98/06997 ImPyPy-P-PyPyPy-p-Dp Polyamide was prepared by automated solid phase methods as a white powder upon cleavage of 240 mg resin (initial subsitution of 0.2 mmol Boc-p-alanine/gram) with dimethylaminopropylamine (2 ml, 55 oC) (19.0 mg, 43 recovery after HPLC purification). HPLC, r.t. 26.8; UV lax 246 (42,100), 312 (53,900) nm; 1H NMR (DMSO-d 6 6 10.56 1 9.90 1 9.89 2 9.87 1 9.21 (br s, 1 8.24 1 H, J 5.2 Hz), 8.04 1 H, J 6.1 Hz), 8.01 1 H, J 6.0 Hz), 7.35 1 7.26 (d, 1 H, J 1.6 Hz), 7.23 3 7.16 3 7.12 1 7.02 1 H, J 1.5 Hz), 6.85 1 H, J 1.9 Hz), 6.80 1 H, J 1.8 Hz), 3.96 3 3.79 3 H), 3.78 3 3.36 2 H, J 5.3 Hz), 3.09 2 H, J 6.0 Hz), 2.75 2 H, J 5.0 Hz), 2.72 6 H, J 4.7 Hz), 2.30 2 H, J 6.1 Hz), 1.72 (quintet, 2 H, J Hz); MALDI-TOF MS 964.2 (964.1 calc. for M H) 31. ImPyPy-Py-PyPyPy-G-Dp Polyamide was prepared by manual solid phase methods.
Recovery is based on cleavage of 180 mg resin (initial subsitution of 0.2 mmol Boc-Glycine/gram) with dimethylaminopropylamine (2 ml, 55 oC). (8 mg, 24% recovery after HPLC purification). A small quantity of the failure heptamide AcPyPyPyPyPyPy-Dp was found in the initial preparation and was removed by a second preparatory HPLC purification to afford pure ImPyPy-Py-PyPypy-G-Dp as a white powder (1.2 mg). HPLC, r.t. 28.5, UV Xma 246 (34,600), 312 (55,300) H NMR (DMSO-d 6 6 10.55 1 H), 10.02(s, 1 10.00 4 9.3 (br s, 1 8.32 1 H, J 6.2 Hz), 8.06 1 H, J 5.9 Hz), 7.44 1 H), WO 98/49142 PCT/US98/06997 7-.31 1 H, J 1.7 Hz), 7.26 5H), 7.19 1 H, J 1.8 Hz) 7.10 5 6.97 1 H, J 1.7 Hz) 4.01 3 3.87 15 3.82 3 H) 3.73 2 H, J Hz), 3.16 2 H, J 6.2 Hz), 3.03 2 H, J 5.2 Hz), 2.74 6 H, J 4.9 Hz), 1.77 (quintet, 2 H, J 6.7 Hz); MALDI-TOF MS 1000.5; FABMS m/e 1001.471 (M H 1001.473 calcd. for C 48
H
59
NI
7 0 8 32. ImPyPy-Py-PypyPy-P-Dp Polyamide was prepared by machine assisted solid phase synthesis to afford a white powder upon cleavage of 800 mg resin (initial subsitution of 0.2 mmol Boc-p-alanine/gram) with dimethylaminopropylamine (2 ml, 55 oC) (56 mg, 36 recovery after HPLC purification) 246 (34,800), 308 (57,000); HPLC r.t. 27.9 min.; 'H NMR (DMSO-d 6 8 10.47 (s, 1 9.95 4 9.89 1 H) 9.2 (br s, 1 8.03 2 H) 7.39 1 7.27 1 H, J 1.3 Hz), 7.22 4 7.15 2 7.07 4 H) 7.03 1 H, J 1.4 Hz), 6.86 1 H, J 1.0 Hz), 3.97 3 3.84 (m, 12 3.82 3 3.77 3 (r-ala quartet covered by water.), 3.11 2 H, J 5.1 Hz), 3.08 2 H, J Hz), 2.72 6 H, J 4.8 Hz) 2.34 2 H, J 4.4 Hz), 1.7 2 H) MALDI-TOF-MS, 1014.7 (1015.1 calc for
M+H).
33. ImPyPy-G-PyPyPy-G-Dp Polyamide was prepared by manual solid phase methods as a white powder upon cleavage of 240 mg resin with Nmethyl-bis(aminopropyl)amine (2 ml, 55 oC) (19.0 mg, 44 recovery after HPLC purification). 'H NMR (DMSO-d) 8 10.49 1 9.97 1 9.93 1 9.91 1 9.89 WO 98/491!42 PCTIUS98/06997 1 9.7 (br s, 1 8.27 2 8.04 1 H, J 5.1 Hz), 7.88 (br s, 3 7.39 1 7.27 1 H, J 1.6 Hz), 7.21 3 H) 7.15 2 7.05 2 H), 6.93 3 3.97 3 3.96 6 H) 3.92 9 H), 3.72 4 3.14 6 3.05 2 H, J 5.4 Hz), 2.73 3 H, J 3.3 Hz), 1.88 (quintet, 2 H, J 4.6 Hz) 1.75 (quintet, 2 H, J 6.3 Hz). MALDI-TOF-MS, 979.0 (979.1 calc for M+H) 34. ImPyPy-G-PyPyPy-P-Bp Polyamide was prepared by manual solid phase methods as a white powder upon cleavage of 240 mg resin with Nmethyl-bis(aminopropyl)amine (2 ml, 55 OC) (25 mng, 55 recovery). HPLC, r.t. 22.0; 'H NMR (DMSO-d 6 5 10.53 1 10.00 1 9.98 1 9.93 1 9.92 (s, 1 9.7 (br s, 1 8.31 1 H, J 5.7 Hz), 8.12 (t, 1 H, J 5.5 Hz), 8.04 1 H; J 5.6 Hz), 7.9 (br s, 3 7.41 1 7.29 1 H, J 1.7 Hz), 7.23 1 H, J 1.5 Hz) 7.22 1 H, J 1.4 Hz) 7.16 3 H) 7.07 1 H, J 1.2 Hz), 7.03 1 H, J 1.3 Hz), 6.94 1 H, J 1.6 Hz), 6.93 1 H, J 1.5 Hz), 6.86 (d, 1 H, J 1.4 Hz), 3.98 3 3.88 2 H, J 5.6 Hz) 3.83 3 H) 3.82 6 H) 3.80 3 H) 3.78 (s, 3 3.37 2 H, J 6.4 Hz), 3.11 6 2.86 2 H, J 6.1 Hz), 2.70 3 H, J 4.6 Hz), 2.32 2 H, J 7.2 Hz), 1.87 (quintet, 2 H, J 7.4 Hz), 1.75 (quintet, 2 H, J 6.0 Hz) MALDI-TOF-MS, 993.3 (993.1 calc for
M+H).
WO 98/49142 PCT/US98/06997 ImPyPy-P-PyPyPy-G-Dp Polyamide was prepared by automated solid phase methods as a white powder upon cleavage of 240 mg resin with N-methyl-bis(aminopropyl)amine (2 ml, 55 oC) (23.0 mg, 53 recovery). HPLC, r.t. 20.6; 1 H NMR (DMSO-d 6 6 10.45 1 9.95 1 9.92 3 9.6 (br s, 1 H), 8.27 1 H, J 4.7 Hz), 8.11 2 7.9 3 H), 7.38 1 7.26 1 H, J 1.7 Hz), 7.21 2 H), 7.17 2 7.13 1 H, J 1.8 Hz), 7.05 2 H), 6.93 1 H, J 1.6 Hz), 6.88 1 H, J 1.6 Hz), 6.83 1 H, J 1.7 Hz), 3.97 3 3.82 9 3.81 3 3.79 3 3.73 2 3.44 2 H, J Hz), 3.2 6 2.85 2 H, J 5.8 Hz), 2.73 (d, 3 H, J 4.5 Hz), 1.89 (quintet, 2 H, J 6.4 Hz), 1.77(quintet, 2 H, J 6.9 Hz) MALDI-TOF-MS, 992.9 (993.1 calc for M+H).
36. ImPyPy-G-PyPyPy-G-Dp-EDTA EDTA-dianhydride (50 mg) was dissolved in 1 mL DMSO/NMP solution and 1 mL DIEA by heating at 55 °C for 5 min. The dianhydride solution was added to ImPyPy-G-PyPyPy-G-Bp (12.0 mg, 11 pmol) dissolved in 750 pL DMSO. The mixture was heated at 55 °C for 25 minutes, and treated with 3 mL 0.1M NaOH, and heated at 55 °C for 10 minutes. 0.1% TFA was added to adjust the total volume to 8 mL and the solution purified directly by preparatory HPLC chromatography to provide ImPyPy-G-PyPyPy-G-Bp-EDTA as a white powder. (4.7 mg, 31% recovery after HPLC purification); HPLC, r.t. 28.8; 1H NMR (DMSO-ds) 6 10.49 1 9.97 1 9.91 1 9.89 2 9.4 (br s, 1 8.42 1 H, J Hz), 8.31 1 H, J 5.5 Hz), 8.00 2 7.38 1 WO 98/49142 WO 9849142PCTIUS98/06997 H) 7. 26 1 H, J 1. 5 H z) 7 .2 2 1 H, J =1 .4 H z), 7. 20 1 H, J 1. 4 H 7. 14 3 7. 03 (i,2 H), 6. 92 1 H, J 5 Hz), 3. 95 Cs, 3 3. 85 (i,4 H), 3.84 3 3. 80 (in, 6 H) 3. 78 3 3. 76 3 H), 3 .69 6 3. 55 2 H, J 7 Hz), 3. 3 0 Cm, 12 H) 2.69 3 H, J 3. 9 Hz) 2. 31 Ct, 2 H, J 6. 8 Hz), 1.73 Cm, 4 MALDI-TOF-MS, 1254.8 (1254.3 caic for M+H).
37. ImPyPy-G-PyPyPy-P-Bp-EDTA Polyamide was prepared from IinPyPy-G-PyPyPy-P-Bp mng) as described for ImPyPy-G-PyPyPy-G-Bp-EDTA. (13.0 mg, recovery after HPLC purification) HPLC, r.t. 27.3; 1
H
NMR CDMSO-d 6 8 10.49 Cs, 1 9.97 1 9.91 1 H) 9. 89 2 H) 9. 4 (br s, 1 H) 8. 42 Ct, 1 H, J Hz), 8. 31 Ct, 1 H, J 5 Hz) 8. 00 2 H) 7.38 1 7.26 Cd, 1 H, J 1. 5 Hz) 7.22 Cd, 1 H, J 1. 4 Hz) 7.20 Cd, 1 H, J 1. 4 Hz), 7. 14 Cm, 3 H) 7. 03 2 H) 6.92 Cd, 1 H, J 5 Hz), 3. 95 Cs, 3 H) 3. 85 Cm, 4 H) 3.84 3 3.80 Cm, 6 3.78 Cs, 3 3.76 Cs, 3 H), 3.69 6 H) 3.55 2 H, J 5.7 Hz), 3.3-3.0 Cm, 12 2.69 Cd, 3 H, J 3.9 Hz), 2.31 Ct, 2 H, J 6.8 Hz), 1.73 Cm, 4 MALDI-TOF-MS, 1268.5 (1268.3 caic for M+H).
38. ImPyPy-P-PyPyPy-G-Bp-EDTA Polyamide was prepared from ImPyPy-f3-PyPyPy-G-Bp (12 mg) as described for ImPyPy-G-PyPyPy-G-Bp-EDTA. (6 mg, 42 recovery af ter HPLC purif ication) HPLC, r. t. 28. 0; 1 H NMvR (DMSO-d 6 6 10.46 Cs, 1 H) 9.95 1 H) 9.93 Cm, 3 H) 9. 9 Cbr s, 1 H) 8. 43 Ct, 1 H, J 5. 1 Hz) 8. 28 Ct, 1 H, J 5.3 Hz) 8.03 Cm, 2 H) 7.38 Cs, 1 H) 7.26 Cm, 2 H) 7. 21 1 H, J 1. 6 Hz) 7. 17 1 H, J 1. 8 Hz) 7. 12 WO 98/49142 PCT/US98/06997 1 H, J 1.8 Hz), 7.10 1 7.04 1 H, J 1.6 Hz), 6.93 2 6.88 1 H, J 1.6 Hz), 6.84 1 H, J 1.4 Hz), 3.97 3 3.87 4 3.82 9 3.79 3 3.78 3 3.68 6 3.3-2.9 16 H) 2.71 3 H, J 4.1 Hz) 1.78 4 H); MALDI-TOF-MS, 1268.9 (1268.3 calc for M+H).
39. ImPyPy-y-ImPyPy-p-pypyPy-G-Dp 01 o N
N
H H NI H N N N SH
N
N H I 0 No H H+r
N
I N o S The polyamide was prepared by machine-assisted solid phase methods as a white powder. (12 mg 19 recovery) HPLC r.t. 29.5, UV ,max 238 (53,900), 312 (71,100) nm; 1H NMR (DMSO-d 6 d 10.46 1 10.24 1 9.96 (s, 1 9.90 5 9.2 br s, 1 8.25 1 8.00 3 7.44 1 7.39 1 7.26 1 H, J 1.3 Hz), 7.24 1 H, J 1.5 Hz), 7.20 2 7.16 (m, 2 7.13 2 7.11 1 H, J 1.4 Hz), 7.05 1 H, J 1.4 Hz), 7.03 1 H, J 1.5 Hz), 6.93 1 H, J 1.3 Hz), 6.87 2 6.84 1 H, J 1.5 Hz), 3.97 3 3.92 3 3.82 9 3.79 6 3.76 6 3.73 2 3.44 2 H, J 5.0 Hz), 3.17 4 3.03 2 2.74 6 H, J 4.8 Hz), 2.50 WO 98/49142 WO 9849142PCTIUS98/06997 (in, 2 2. 33 (t 2 H, J 6. 7 H z) 1. 77 (in, 4 MALDI TOF MS 1402.2 (1402.5 caic for M+H).
ImPyPy-y-ImPyPy-3-PyPyPy-.G..Dp-NH 2 HN 0 H H
N
N N 0 H H H
N
0 V N N N
N
0 N Y 0
N~
N
NH
2 0 H I The polyamide was prepared by machine -assisted solid phase methods as a white powder. (29 mng 59 %recovery).
HPLC r.t. 21.5, 'H NMR (DMSO-d 6 10.50 1 H) 10.27 1 H) 9. 96 1 H) 9. 93 (in, 5 H) 9.2 (br s, 1 H), 8.27 1 H, J 5. 1 Hz) 8. 03 (mn, 3 7.90 3 H), 7.45 1 H) 7. 40 1 H) 7. 27 1 H, J 1. 3 Hz), 7.25 1 H, J 4 Hz), 7. 22 (mn, 2 7.18 (in, 2 H), 7.17 1 H, J =1.4 Hz), 7.14 1 H, JT 1. 3 Hz), 7.11 (mn, 2 7.06 1 H, J 1.5 Hz), 6. 94 1 H, J 1.3 Hz), 6. 88 (in, 2 H) 6. 84 1 H, J 1. 4 H z) 3. 9 7 3 3.93 3 3. 83 (in, 9 H) 3. 80 (in, 6 H) 3. 76 (in, 6 3.72 2H, J =5.2Hz), 3 .43 2H, J 0Hz), 3.17 (mn, 6 H) 3. 11 2 H, J 5.3 Hz) 2.85 2 H, J= 5.2 Hz), 2. 73 3 H, J 3. 9 Hz) 2.51 2 H, J 6. Hz) 2. 35 2 H, J 6. 7 Hz) 1. 92 (quintet, 2 H, J 6 .8 H z) 1 78 (in, 4 H) MALD I- TQF MS 1445.6 (1445.6 caic for
M+H).
WO 98/49142 WO 9849142PCTIUS98/06997 41. ImPyPy-y-IniPyPy-I 3 PyPyPy-.G-.Dp-EDTA EDTA-dianhydride (50 mg) was dissolved in 1 ML DMSO/NMP solution and 1 mL DIRA by heating at 55 0 C for min. The dianhydride solution was added to ImPyPy-y-ImPyPyf-PyPyPy-G-Dp-NH 2 (9.0 Mg, 5 ymrol) dissolved in 750 giL DMSO.
The mixture was heated at 55 0 C for 25 minutes, and treated with 3 mL 0.1M NaOH, and heated, at 55 0 C for 10 minutes.
0.1% TFA was added to adjust the total volume to 8 mL and the solution purified directly by reversed-phase HPLC to provide ImPyPy-y-ImPyPy-f3-PyPyPy-G-Dp-EDTA as a white powder. (3 mg, 30% recovery after HPLC purification); MALDI-TOF MS 1720.1 (1719.8 caic for M+H).
42. Ac-PyImPy-y-ImPyPy-p-pypypy-J3..p 01 HN N ~HQ HN0 kN H H .yN N H H 1 0 N N H HI
H
0 0 0N WO 98/49142 PCT/US98/06997 The polyamide was prepared by machine assisted solid phase methods (29) as a white powder (5 mg, 20 recovery).
UV (H20) Xnax 242 nm, 310 nm e 75,000, calculated based on e 8333/ring 1H NMR (DMSO-d6): 6 10.27 2 H); 10.02 1 9.99 1 9.92 3 9.90 1 9.85 1 8.6 (br s, 1 8.08 4 7.52 (s, 1 7.43 1 7.24 2 7.22 1 H, J 1.7 Hz), 7.20 1 H, J 1.6 Hz), 7.19 1 H, J 1.6 Hz), 7.14 1 H, J 1.5 Hz), 7.11 1 H, J 1.6 Hz), 7.07 2 7.02 1 H, J 1.4 Hz), 6.96 1 H, J 1.7 Hz), 6.90 2 6.88 1 H, J 1.8 Hz), 6.83 1 H, J 1.6 Hz), 3.94 3 3.92 3 3.81 12 3.79 3 3.78 3 3.78 3 3.52 4 3.33 6 2.95 2 H) 2.71 6 H, J 4.7 Hz), 2.32 4 1.94 3 1.73 4 MALDI-TOF MS; 1472.1 (1472.5 calc for M+H).
43. DM-y-PyImPy-y-ImPyPy-P-PyPyPy-p-Dp 0 I HN oI 0O H N I O 0,,XNN N H 0 N N N N H H+ 0 0 The polyamide was prepared by machine assisted solid phase methods as a white powder (13 mg, 52 recovery). UV ima 242 nm, 310 nm (e 75,000, calculated based on E 8333/ring(30)); 1H NMR (DMSO-d6): 6 10.28 1 10.26 1 9.99 1 9.96 1 9.94 1 H), WO 98/49142 PCT/US98/06997 9.90(m, 3 9.88 1 9.3 (br s, 1 9.2 (br s, 1 8.05 4 7.52 1 7.43 1 7.27 1 H, J 1.6 Hz), 7.24 1 H, J 1.7 Hz), 7.20 1 H, J 1.6 Hz), 7.17 4 7.14 1 H, J 1.6 Hz), 7.12 1 H, J 1.5 Hz), 7.03 1 H, J 1.6 Hz), 6.96 1 H, J 1.6 Hz), 6.90 1 H, J 1.5 Hz), 6.86 2 H), 3.94 3 3.92 3 3.81 12 3.78 9 3.56 4 3.39 6 2.95 4 2.76 6 H, J 4.6 Hz), 2.71 6 H, J 4.6 Hz), 2.30 4 H), 1.88 2 1.73 4 H) MALDI-TOF MS; 1543.3 (1543.6 calc for M+H).
44. DM-y-ImPyPy-y-ImPyPy-p-PyPyPy-P-PyPyPy-p-Dp 0 1 N 0 oo
O
H
SO N o N. N H 0 N 0 The polyamide was prepared by machine assisted solid phase methods as a white powder (3 mg, 10 recovery). UV ,,ax 239 nm, 308 nm (E 100,000, calculated based on e 8333/ring); MALDI-TOF MS; 1981.3 (1981.1 calc for M+H).
WO 98/49142 WO 98/9 142PCTIUS98/06997 ImPyPyPy-y-ImpypyyP3Dp 0j
N
HHN
-NN
0 H H/
H
N N N N N NN N H H N N
'N'
Ia 0 Polyamide ImPYPYPY-7-IMPyPyPy-P3-Dp was prepared by machine -assisted solid phase methods as a white powder (17 mg, 56% recovery) HPLC, r. t. 2 6. 1 min; UV, Xa 234 nm (3 9, 3 00), 3 12 nm (5 3, 20 0 1H NR (DMSO -d 6 8 10. 53 (s, 1 10.27 1 10.04 Cs, 1 9.96 1 9.94 1 H) 9.2 (br s, 1 H) 8. 08 3 H) 7. 49 2 H), 7.44 1 7. 31 1 H, J 1 .0 H z) 7. 23 1 H, J= 1.1 Hz), 7.19 Cm, 3 H) 7. 10 1 H) 6. 92 1 H, J 1.1 Hz), 6.90 1 H, J 1.-1 Hz) 4. 01 3 3.97 Cs, 3 3. 86 Cm, 6 H) 3. 82 6 H) 3. 41 Cq, 2 H, J Hz), 3.22 2 H, J 5.9 Hz) 3.13 2 H, J 5.9 Hz) 3.0 2 H, J 5. 6 Hz) 2. 76 Cd, 6 H, J 4. 8 Hz) 2.37 Cm, 4 1. 78 (in, 4 H) MALDI-TOF MS: 1223.4 (1223. 3 caic. for M+H).
WO 98/49142 PCT/US98/06997 46. ImPyPyPy-y-PyPyPyPy-P-Dp: 0
N
NN 0 0 1 N 0
N
N
N
N
H
N H H
H+
O O The polyamide ImPyPyPy-y-PyPyPyPy-P-Dp was prepared by machine-assisted solid phase methods as a white powder (12 mg, 19 recovery). HPLC, 29.5 min; UV, max 238 nm (53,900), 312 nm (71,100); 1 H NMR (DMSO-d 6 8 10.46 (s, 1 10.24 1 9.96 1 9.90 5 9.2 br s, 1 8.25 1 8.00 3 7.44 1 H), 7.39 1 7.26 1 H, J 1.3 Hz), 7.24 1 H, J 1.5 Hz), 7.20 2 7.16 2 7.13 2 7.11 1 H, J 1.4 Hz), 7.05 1 H, J 1.4 Hz), 7.03 1 H, J 1.5 Hz), 6.93 1 H, J 1.3 Hz), 6.87 2 H), 6.84 1 H, J 1.5 Hz), 3.97 3 3.92 3 H), 3.82 9 3.79 6 3.76 6 3.73 2 H), 3.44 2 H, J 5.0 Hz), 3.17 4 3.03 2 H), 2.74 6 H, J 4.8 Hz), 2.50 2 H) 2.33 2 H, J 6.7 Hz), 1.77 4 MALDI-TOF MS: 1222.3 (1222.3 calc for M+H).
WO 98/49142 PCTIUS98/06997 47. ImImImPy-y-PyPyPyPy-P-Dp: O
I
QN
HN N H N 0 O N H N N N N' NH H N o ZN H H H N
NQ
O O The product was synthesized by manual solid phase protocols and recovered as a white powder (7.6 mg, 11% recovery) UV 248 (42,000), 312 (48,500); H NMR (DMSO-de) d 10.32 1 H) 10.13 1 H) 9.93 1 H), 9.90 1 9.89 1 H) 9.84 1 9.59 1 H), 9.23 (br s, 1 H) 8.09 1 H, J 5.3 Hz), 8.04 2 H), 7.65 1 7.57 1 7.46 1 H, J 0.6 Hz) 7.22 3 7.16 2 7.09 1 H, J 0.8 Hz), 7.06 2 H, J 1.1 Hz), 7.00 1 H, J 6.88 (d, 1 H, J 1.8) 6.87 1 H, J 1.8 Hz), 4.02 3 H) 4.00 3 3.99 3 3.84 3 3.83 3 H), 3.83 3 H) 3.80 3 H) 3.79 3 H) 3.37 2 H, J 6.2 Hz), 3.21 2 H, J 6.4 Hz), 3.10 2 H, J 6.2 Hz), 3.00 2 H) 2.73 6 H, J 4.9 Hz) 2.34 (t, 2 H, J 7.2 Hz), 2.28 2 H, J 7.0 Hz), 1.76 4 H); MALDI-TOF-MS, 1225.9 (1224.3 calc. for M+H).
WO 98/49142 PCT/US98/06997 48. ImImImPy-y-PyPyPyPy-P-Dp-NH 2 O I HN N0 O
N
Sy H H
I
0 A sample of polyamide machine-synthesized on resin (350 mg, 0.16 mmol/gram) was placed in a 20 mL glass scintillation vial, and treated with 2 mL 3,3'-diamino-Nmethyldipropylamine at 55 OC for 18 hours. The resin was removed by filtration through a disposable propylene filter, and the resulting solution dissolved with water to a total volume of 8 mL, and purified directly by preparatory reversed phase HPLC to provide ImImImPy-y- PyPyPyPy-P-Dp-NH 2 (31 mg, 40% recovery) as a white powder.
1H NMR (DMSO-d 6 6 10.37 1 10.16 1 9.95 (s, 1 9.93 1 9.91 1 9.86 1 9.49 (br s, 1 9.47 1 8.12 3 8.0 (br s 3 H), 7.65 1 7.57 1 7.46 1 7.20 3 H), 7.16 2 7.09 1 H, J 1.5 Hz), 7.05 2 H), 7.00 1 H, J 1.6 Hz), 6.88 2 4.01 3 H), 3.99 3 3.98 3 3.83 3 3.82 3 H), 3.81 3 3.79 3 3.78 3 3.36 2 H, J 5.3 Hz), 3.21- 3.05 8 2.85 2 H, J 4.9 Hz), 2.71 3 H, J 4.4 Hz), 2.34 2 H, J 5.9 Hz), 2.26 2 H, J 5.9 Hz), 1.85 (quintet, J 5.7 WO 98/49142 WO 9849142PCTIUS98/06997 Hz) 1.72 (in, 4 H) MALDI-TOF-MS, 1267.1 (1267.4 caic. for
M+H).
49. Im3:IMmPY-Y-PYPYPy-o-p-ED)TA.
N
HN 0~~0 H N' N H NI H
H
0 N NN~ N H H 0 1 0 N N D 1 N 01 0 0 Compound was prepared as described f or ImImIm-y-PyPypy- P-Dp-EDTA. (3.8 mg, 40%) 1H NMR (DMSO-d 6 6 10.34 1 H), 11 1 H) 9. 92 1 H) 9.90 1 H) 9. 89 1 H) 9. 84 1 9. 57 1 H) 8. 42 1 8.03 (m 3 7.64 1 H) 7.56 1 H) 7.44 1. 7.20 (in, 3 7.15 (mn, 2 7.07 1 H, J 1.6 Hz), 7.05 Cm, 2 6.99 1 H, J 1. 6 Hz) 6. 87 2 4.00 3 3.98 3 3. 97 3 H) 3. 83 (in, 4 3.82 6 3.79 3 H) 3.78 6 H) 3. 67 (in, 4 3.4- (in, 16 H) 2. 71 3 H, J 2 Hz) 2. 34 2 H, J= 5.4 Hz), 2.25 2 H, J 5.9 Hz), 1.72 (in, 6 MALDI- TOF-MS, 1542.2 (1542.6 caic. for The polyamide was loaded with Fe(II) by standard methods.
WO 98/49142 PCT/US98/06997 ImImPyPy-y-ImImPyPy-P-Dp: 0O oN HN 0
N
SN N/ N r
N
H F NA^ OI N Ni/ N N H S~ H H H+ S N NN N- 0 The polyamide ImImPyPy-y-ImImPyPy-p-PAM-Resin was assembled on 0.2 mmol/gram Boc-P-PAM-resin by machine assisted synthesis. The y-Im step was assembled using Boc-y- Im acid (HBTU, DIEA), all other residues was added as appropriate activated Boc protected monomer units. A sample of resin (250 mg, 0.16 mmol/gram 21 was placed in a 20 mL glass scintillation vial, 2 mL dimethylaminopropylamine added and the mixture allowed to stand at 55 oC for 18 hours. Resin was removed by filtration through a disposable propylene filter, and the resulting solution diluted with water to a total volume of 8 mL, and purified directly by preparatory reversed phase HPLC to provide ImImPyPy-y- ImImPyPy-P-Dp (26 mg, 45% recovery) as a white powder. UV km(H 2 0) 248, 312 (66,000); 'H NMR (DMSO-dg) d 10.34 2 10.32 2 9.73 2 9.5 (br s, 1 H) 9.32 1 8.10 3 7.55 2 7.52 1 7.44 1 7.23 2 7.14 4 7.06 1 H, J 1.4 Hz); 6.86 2 3.98 9 3.95 3 3.81 6 3.77 6 3.31 2 3.17 2 H, J Hz) 3.06 2 H, J 5.7 Hz); 2.93 2 H, J 4.7 WO 98/49142 PCT/US98/06997 Hz); 2.74 6 H, J 4.4 Hz); 2.30 4 1.74 4 MALDI-TOF-MS, 1224.9 (1225.3 calc. for M+H).
51. ImPyImPy-y-ImPyImPy-P-Dp 0 HN 0 O N N -N N H H N N N HN SN N HH jN-(N 0 N ZY l H H H+ O N N N- I O The polyamide ImPyImPy-y-ImPyImPy-p-PAM-Resin was assembled on 0.2 mmol/gram Boc-P-PAM-resin by manual polyamide synthesis. The Py-Im and y-Im steps were addedusing Boc-y-Im acid and Boc-Py-Im acid (HBTU, DIEA), all other residues were added as appropriate activated Boc protected monomer units. A sample of resin (250 mg, 0.16 mmol/gram 21 was placed in a 20 mL glass scintillation vial, 2 mL dimethylaminopropylamine added and the mixture allowed to stand at 55 oC for 18 hours. Resin was removed by filtration through a disposable propylene filter, and the resulting solution diluted with water to a total volume of 8 mL, and purified directly by preparatory reversed phase HPLC to provide ImPyImPy-y-ImPyImPy-P-Dp (19 mg, 32% recovery) as a white powder. UV ax(H 2 0) 246, 312 (66,000); 1 H NMR (DMSO-d 6 d 10.33 2 10.25 2 10.04 (m, 2 9.95 1 9.5 (br s, 1 8.10 3 7.57 2 7.48 1 7.42 1 7.40 1 7.23 2 7.17 1 H; J 1.5 Hz); 7.03 1 H, J WO 98/49142 PCT/US98/06997 Hz) 6.98 3 4.02 3 3.99 6 3.81 (m, 6 3.97 3 3.88 6 3.83 6 3.42 (m, 2 3.18 2 H, J 5.2 Hz) 3.06 2 H, J 5.5 Hz); 2.80 2 H, J 4.7 Hz); 2.76 6 H, J 4.4 Hz); 2.38 4 H) 1.93 4 H) MALDI-TOF-MS, 1225.2. (1225.3 calc. for M+H).
52. ImImImIm-y-PyPyPyPy-P-Dp 0 HN 0N N N
N
H H 01 N H H SN N H NN
N
O N f H, H H H+ 0 0 The polyamide ImImImIm-y-PyPyPyPy-P-PAM-Resin was assembled on 0.2 mmol/gram Boc-b-PAM-resin by manual polyamide synthesis. The y-Im step was added using Boc-y-Im acid (HBTU, DIEA) all other residues were added as appropriate activated Boc protected monomer units. A sample of resin (250 mg, 0.16 mmol/gram 21 was placed in a 20 mL glass scintillation vial, 2 mL dimethylaminopropylamine added and the mixture allowed to stand at 55 OC for 18 hours. Resin was removed by filtration through a disposable propylene filter, and the resulting solution diluted with water to a total volume of 8 mL, and purified directly by preparatory reversed phase HPLC to provide ImImImIm-y- PyPyPyPy-P-Dp (12 mg, 21% recovery) as a white powder. UV ax(H 2 0) 246, 314 (66,000) H NMR (DMSO-d 6 d 9.91 2 WO 98/49142 PCT/US98/06997 9.89 4 9.83 1 9.60 1 9.5 (br s, 1 8.34 1 8.10 2 7.63 2 7.50 (s, 1 7.42 1 7.19 2 7.13 2 7.04 (m, 2 6.86 2 3.98 6 3.96 3 3.93 (s, 3 3.81 6 3.77 3 3.73 3 3.30 (m, 2 3.10 2 H, J 5.3 Hz) 3.09 2 H, J 5.5 Hz); 2.91 2 H, J 4.6 Hz); 2.71 6 H, J 4.2 Hz); 2.32 4 1.70 4 MALDI-TOF-MS, 1225.6 (1225.3 calc.
for M+H).
53. ImImImPy- -PyPyPyPy- -Dp NNH H O Vv"N No N N 0
N
0
N
Y T N H H H+ 1 0 0 A sample of ImImImPy-P-PyPyPyPy--resin prepared by machine-assisted solid phase synthesis (240 mg, 0.16 mmol/gram) was placed in a 20 mL glass scintillation vial, and treated with dimethylaminopropylamine (2 mL) at 55 °C for 18 hours. Resin was removed by filtration, and the filtrate diluted to a total volume of 8 mL with 0.1 (wt/v) aqueous TFA. The resulting crude polyamide/amine solution was purified directly by reversed phase HPLC to provide the trifluoroacetate salt of ImImImPy-P-PyPyPyPy-p- Dp (31 mg, 40% recovery) as a white powder. 1 H NMR (300 MHz,
[D
6 ]DMSO, 20 oC): d 10.37 1H; NH), 10.12 1H; NH), 9,95 1H; NH), 9.94 1H; NH), 9.93 1H; NH), 9.92 1H; NH), 9.59 1H; NH), 9.4 (br s, 1H; CF 3 COOH), 8.09 WO 98/49142 PCT/US98/06997 3H; NH), 7.65 1H; CH), 7.56 1H; CH), 7.45 (s, 1H; CH) 7.27 2 J(H,H) 1.3 Hz, 1H; CH) 7.22 2H; CH) 7.18 J(H,H) 1.2 Hz, 1H; CH) 7.16 J(H,H) Hz, 1H; CH), 7.07 2H; CH) 6.95 2 J(H,H) 1.1 Hz, 1H; CH), 6.88 2 J(H,H) 1.4 Hz, 1H; CH), 6.86 (d, 2 J(H,H) 1.3 Hz, 1H; CH), 4.01 3H; NCH 3 3.98 2H;
NCH
3 3.83 3H; NCH 3 3.82 6H; NCH 3 3.80 3H;
NCH
3 3.78 3H; NCH 3 3.4 6H; CH 2 3.11 4
J(H,H)
5.2 Hz, 2H; CH 2 2.94 4 J(H,H) 5.3 Hz, 2H; CH 2 2.69 2 J(H,H) 4.4 Hz, 6H; N(CH 3 2 2.33 3 J(H,H) 5.4 Hz, 2H; CH 2 1.75 SJ(H,H) 7.1 Hz, 2H; CH 2 UV/VIS (H 2 0) .ax 304 (66,600, calculated from e 8,333/ring 1 4 c 241 nm; MALDI-TOF-MS 1210.4: calc.
1210.3.
54. ImImPyPy-P-PyPyPypy- -Dp 0
H
N H.
NY H
H
N N IN T V H 0 NO 0 I H H H 0 A sample of ImImPyPy--PyPyPyPy--resin prepared by machine-assisted solid phase synthesis (240 mg, 0.16 mmol/gram 5 was placed in a 20 mL glass scintillation vial, and treated with dimethylaminopropylamine (2 mL) at 55 °C for 18 hours. Resin was removed by filtration, and the filtrate diluted to a total volume of 8 mL with 0.1 (wt/v) aqueous TFA. The resulting crude polyamide/amine solution was purified directly by reversed phase HPLC to WO 98/49142 PCT/US98/06997 provide the trifluoroacetate salt of ImImPyPy-P-Pypypypy-P- Dp (31 mg, 40% recovery) as a white powder. 'H NMR (300 MHz,
[D
6 ]DMSO, 20 oC): 6 10.38 1H; NH), 9.95 1H; NH), 9,93 1H; NH), 9.91 1H; NH), 9.90 2H; NH), 9.76 1H; NH), 9.4 (br s, 1H; CF 3 COOH), 8.09 3H; NH), 7.56 1H; CH) 7.46 1H; CH) 7.27 2 J(H,H) 1.8 Hz, 1H; CH), 7.21 2 J(H,H) 1.7 Hz, 1H; CH), 7.20 (d, 2 J(H,H) =1.9 Hz, 1H; CH), 7.19 2 J(H,H) 1.9 Hz, 1H; CH), 7.16 2 J(H,H) 1.9 Hz, 1H; CH), 7.15 2 j(H,H) 1.6 Hz, 1H; CH), 7.14 2 J(H,H) 1.9 Hz, 1H; CH), 7.12 2 J(H,H) 1.6 Hz, 1H; CH), 7.07 1H; CH), 7.05 (d, 2 J(H,H) 1.5 Hz, 1H; CH), 6.87 2 J(H,H) 1.9 Hz, 1H; CH), 6.86 2 J(H,H) 1.6 Hz, 1H; CH), 6.84 2 j(H,H) 1.6 Hz, 1H; CH), 3.99 6H; NCH 3 3.82 12H; NCH 3 3.80 3H; NCH 3 3.78 3H; NCH 3 3.4 6H; CH 2 3.09 4 J(H,H) 5.6 Hz, 2H; CH 2 2.97 4 J(H,H) 5.2 Hz, 2H; CH 2 2.71 2 J(H,H) 4.2 Hz, 6H; N(CH 3 2 2.32 3 J(H,H) 5.1 Hz, 2H; CH 2 1.71 sJ(H,H) 7.4 Hz, 2H; CH 2 UV/VIS (H 2 0) %a 306 (66,600, calculated from e 8,333/ring), 243 nm; MALDI-TOF-MS 1209.1: calc. 1209.3 ImPyPyPy-P-PyPyPyPy-P-Dp NXI/
H
N
NO
H
N
HH
H
N NN
N
l
H
I 0 0 N N H 10
N
NY
0 N~ 0N H H H+
N
1 0 0 WO 98/49142 WO 9849142PCTIUS98/06997 A sample of ImPyPyPy-P3-PyPyPyPy-P3-resin prepared by machine-assisted solid phase synthesis (240 mg, 0.16 rnmol/gram) was placed in a 20 mL glass scintillation vial, and treated with dimethylaminopropylamine (2 mL) at 55 0
C
for 18 hours. Resin was removed by filtration, and the filtrate diluted to a total volume of 8 mL with 0.1 (wt/v) aqueous TFA. The resulting crude polyamide/amine solution was purified directly by reversed phase HPLC to provide the trifluoroacetate salt of ImPyPyPy-3-PyPyPyPy.f3 Dp (31 mg, 40% recovery) as a white powder. 1 H NMR (300 MHz,
[D
6 ]DMSO, 20 IC) 8 10.49 1H; NH) 9.97 1H; NH), 9.95 1H; NH), 9.94 1H; NHl), 9.93 (in, 2H; NH), 9.91 1H; NH), 9.4 (br s, 1H; CF 3 COOH) 8.10 (in, 3H; NH), 7.38 1H; CH), 7.28 (d 2 j 1.6 Hz, 1H; CH), 7.22 (in, 3H; CH), 7.19 (in, 2H; CH) 7.16 (mn, 2H; CH), 7.09 (in, 2H; CH), 7.04 (in, 2H; CH), 6. 87 'J H) 1.6 Hz, 1H; CH), 6.86 (d 2 j =1.6 Hz, 1H; CH), 6.84 (d 2 j H) Hz, 1H; CH), 3.97 3H; NCH 3 3.82 (in, 15H; NCH 3 3.80 3H; NCH 3 3.78 3H; NCH 3 3.4 (mn, 6H; CH 2 3.10 (q 4 J(H, H) 5. 4 Hz, 2H; CH 2 2. 98 (q 4 J(H,H) =5.3 Hz, 2H; CH 2 2. 72 (d 2 j H) 7 Hz, 6H; N (CH 3 2 2. 33 (t, 3 J(H,H) =7.0 Hz, 2H; CE 2 1.71 5 J(H,H) 6.4 Hz, 2H;
CH
2 tJV/VIS (H 2 0) Xa. =312 (66,600, calculated from 6 8,333/ring), 244 nin; MALDI-TOF-MS 1208.2: calc.
1208.3.
WO 98/49142 PCT/US98/06997 56. ImPyPyPyPy-y- ImPyPyPyPy- -Dp 9o 9o I I I 6 0 H H H+ 0 I 6 A sample of ImPyPyPyPy-y- ImPyPyPyPy-P-resin prepared by machine-assisted solid phase synthesis (240 mg, 0.16 mmol/gram) was placed in a 20 mL glass scintillation vial, and treated with dimethylaminopropylamine (2 mL) at 55 °C for 18 hours. Resin was removed by filtration, and the filtrate diluted to a total volume of 8 mL with 0.1 (wt/v) aqueous TFA. The resulting crude polyamide/amine solution was purified directly by reversed phase HPLC to provide the trifluoroacetate salt of ImPyPyPyPy-y- ImPyPyPyPy-P-Dp (13 mg, 18% recovery) as a white powder. UV
(H
2 0) Xax 241, 316 83300 (calculated based on E 8,333/ring); 1 H NMR (DMSO-d 6 6 10.52 1 10.29 1 10.04 1 10.00 1 9.97 3 9.92 (m, 2 9.22 (br s, 1 8.06 3 8.03 2 7.46 1 7.41 1 7.29 1 H, J= 1.0 Hz), 7.23 (m, 1 7.17 1 7.07 1 6.90 1 H, J= 6.9 Hz), 3.99 3 3.94 3 3.85 6 3.79 (s, 3 3.38 2 H, J= 3.2 Hz), 3.20 2 H, J=2.7 Hz), 3.11 2 H, J=1.8 Hz), 3.00 2 H, J=2.1 Hz), 2.72 (d, WO 98/49142 WO 9849142PCTIUS98/06997 6. H, J= 4. 8 H z) 2. 35 (in, 4 H) 1. 75 Cm, 4 MALD I -TOF -MS, 1466.1 (1467.6 caic. for M+H) 57. ImImPyPyPy-y-InPyPyPyPy-f3-Dp HN 0 0 NH
N
N H N 0 0 NN
N
N O HH N N
H
Z N N\ ,H
HH+
0
N
I~0 The polyamide was prepared as described for ImPyPyPyPy-y-ImPyPyPyPy-p-Dp as a white powder (28 mg, 34% recovery). UV Xma,, 310 (83,300); 1 H NNR (DMSO-d 6 d 10.38 1 10.28 Cs, 1 10.02 Cs, 1 9.99 1 H), 9.96 (in, 2 H) 9.91 2 9.76 Cs, 1 H) 9.18 (br s, 1 8.05 3 7.57 1 7.46 2 7.25 Cdd, 2 H, J= 5. 7. 23 4 H) 7. 16 4 H) 7. 07 (in, 4 H) 6. 88 1 H, J= 5. 4. 00 3 H) 3. 94 3 H) 3.8 (mn, 6H) 3. 79 Cs, 3 H) 2. 99 2 H, J 5. 1) 2. 73 6 H, J 4.8 Hz) 2.34 Cm, 4 H) 1.75 (mn, 4 H) MALDI-TOF-MS, 1468.2 (1468.6 caic. for M+H).
WO 98/49142 PCT/US98/06997 58. ImPyPPyPy-y--ImPyPyPyPy--Dp-NH 2 0 1 HN 0
H
N
N N N 0V H H N I 0 A sample of ImPyPyPyPy-y-ImPyPyPyPy-p-resin prepared by machine-assisted solid phase synthesis (240 mg, 0.16 mmol/gram) was placed in a 20 mL glass scintillation vial, and treated with 3 3 -diamino-N-methyldipropylamine (2 mL) at 55 oC for 18 hours. Resin was removed by filtration, and the filtrate diluted to a total volume of 8 mL with 0.1 (wt/v) aqueous TFA. The resulting crude polyamide/amine solution was purified directly by reversed phase HPLC to provide the trifluoroacetate salt of ImPyPyPyPy-y- ImPyPyPyPy-p-NH 2 (31 mg, 40% recovery) as a white powder. UV Lmax 241, 316 83300 (calculated based on 8,333/ring 5 1 H NMR (DMSO-d 6 6 10.53 1 10.28 1 10.03 1 10.00 1 9.96 2 9.92 (m, 2 9.6 (br s, 1 8.07 4 7.89 3 7.45 1 7.41 1 7.27 2 H, J= 8.5 Hz), 7.23 (m, 4 7.16 4 7.06 4 6.87 2 3.98, 3 3.94 3 3.84, 6 3.79 3 H), 3.35 2 H, J= 5.7 Hz), 3.16 8 2.85 2 H, J= 5.6 Hz), 2.72 2 H, J= 4.2 Hz), 2.34 2 1.91 (m, 4 1.78 4 MALDI-TOF MS, 1510.4 (1510.7 calc.
for M+H).
WO 98/49142 PCT/US98/06997 59. ImImPyPyPy-y-ImPyPyPyPy- -Dp-NH 2 O I HN 0 H N N 0 r. H I o N H N i N H H HH
U/
0 N
N
N H H N HII ,N NH I O
O
The polyamide was prepared as a white powder as described for ImPyPyPyPy-y-ImPyPyPyPy-p-NH 2 1 H NMR (DMSOd 6 6 10.39 1 10.28 1 10.03 1 10.00 1 9.92 2 9.82 1 9.66 (br s, 1 H), 8.11 4 7.89 3 7.57 1 7.46 2 H, J=2.4 Hz), 7.27 (dd, 2 H, J= 1.0 Hz) 7.23 4 7.16 4 7.08 4 6.88 1 4.00 3 3.94 3 3.78 3 3.19 2 H, J=5.1 Hz), 3.05 (m, 8 2.86 2 H, J= 4.8 Hz), 2.72 2 H, J= 4.4 Hz), 2.34 4 1.90 4 1.78 4 MALDI-TOF-MS, 1510.4 (1511.7 calc. for M+H).
ImPyPyPyPy-y-ImPyPyPyPy- -Dp -EDTA EDTA-dianhydride (50 mg) was dissolved by heating at oC for 5 min. in a solution of DMSO/NMP (1 ml) and DIEA (1 mL). The dianhydride solution was added to ImPyPyPyPy-y- ImPyPyPyPy-p-Dp-NH 2 (8.1 mg) dissolved in DMSO (750 pL).
The mixture was heated at 55 oC for 25 minutes, and treated with 0.1M NaOH (3 mL), and heated at 55 OC for 10 minutes.
Aqueous 0.1% (wt/v) TFA was added to adjust the total WO 98/49142 PCT/US98/06997 volume to 8 mL and the solution purified directly by preparatory HPLC chromatography to provide ImPyPyPyPy-y- ImPyPyPyPy-P-Dp-EDTA as a white powder. (2.4 mg, 22% recovery) MALDI-TOF-MS, 1766.4 (1766.7 calc. for M+H) E. Plasmids, Footprinting, Affinity Cleavage 1. Construction of plasmid DNA The experimental target plasmid pSES9hp was constructed by hybridization of the inserts:
GATCCTATGTCAGTCATGGGGATGACTGTCAGTCATGGCCATGACTGTCAGTCAT
GCGCATGACTGTCAGTCTTAAGC-3' and
GATACAGTCAGTACCCCTACTGACAGTCAGTACCGGTACTGACAGTCAGTACGCG
TACTGACAGTCAGAATTCGTCGA-3'.
The hybridized insert was ligated into linearized pUC19 BamHI/HindIII plasmid using T4 DNA ligase. The resultant constructs were used to transform ToplOF' OneShot competent cells from Invitrogen. Ampicillin-resistant white colonies were selected from 25 mL Luria-Bertani medium agar plates containing 50 ig/mL ampicillin and treated with XGAL and IPTG solutions. Large-scale plasmid purification was performed with Qiagen Maxi purification kits. Dideoxy sequencing was used to verify the presence of the desired insert. Concentration of the prepared plasmid was determined at 260 nm using the relationship of 1 OD unit 50 pg/mL duplex DNA.
2. Preparation of and 5'-End-Labeled Restriction Fragments The plasmid pSES9hp was linearized with EcoRI and PvuII and then treated with Klenow fragment, deoxyadenosine 32 p]triphosphate and thymidine 32 P] triphosphate WO 98/49142 PCT/US98/06997 for 3' labeling. Alternatively, pSES9hp was linearized with EcoRI, treated with calf alkaline phosphatase, and then 5' labeled with T4 polynucleotide kinase and deoxyadenosine 32 P]triphosphate. The 5" labeled fragment was then digested with PvuII. The labeled fragment or was loaded onto a 5% non-denaturing polyacrylamide gel, and the desired 282 base pair band was visualized by autoradiography and isolated. Chemical sequencing reactions were performed according to published methods. (Maxam, A.M. Gilbert, W.S. (1980). Sequencing End-Labeled DNA with Base-Specific Chemical Cleavages.
Methods Enzymol. 65, 499-560; Iverson, B.L. Dervan, P.B.
(1987). Adenine-specific DNA chemical sequencing reaction.
Methods Enzymol. 15, 7823-7830.) 3. MPEoFe(II) Footprinting All reactions were carried out in a volume of 40 ILL.
A polyamide stock solution or water (for reference lanes) was added to an assay buffer where the final concentrations were: 25 mM Tris-acetate buffer (pH 10 mM NaC1, 100 pM/base pair calf thymus DNA, and 30 kcpm or radiolabeled DNA. The solutions were allowed to equilibrate for 4 hours. A fresh 50 pM MPE*Fe(II) solution was made from 100 pL of a 100 pM MPE solution and 100 pL of a 100 pM ferrous ammonium sulfate (Fe (NH 4 2 (SO 4 2*6H 2 0) solution. MPE*Fe(II) solution (5 pM) was added to the equilibrated DNA, and the reactions were allowed to equilibrate for 5 minutes. Cleavage was initiated by the addition of dithiothreitol (5 mM) and allowed to proceed for 14 min. Reactions were stopped by ethanol precipitation, resuspended in 100 mM WO 98/49142 PCT/US98/06997 formamide loading buffer, denatured at 85 0 C for 5 min, placed on ice, and half of each tube 15 kcpm) was immediately loaded onto an 8% denaturing polyacrylamide gel crosslink, 7 M urea) at 2000 V.
4. Affinity Cleaving. All reactions were carried out in a volume of 40 pL. A polyamide stock solution or water (for reference lanes) was added to an assay buffer where the final concentrations were: 25 mM Tris-acetate buffer (pH 10 mM NaCl, 100 yM/base pair calf thymus DNA, and 20 kcpm or 5"-radiolabeled DNA. The solutions were allowed to equilibrate for 4 hours. A fresh solution of ferrous ammonium sulfate (Fe(NH 4 2
(SO
4 )2*6H 2 0) (10 yM) was added to the equilibrated DNA, and the reactions were allowed to equilibrate for 15 minutes. Cleavage was initiated by the addition of dithiothreitol (10 mM) and allowed to proceed for 30 min. Reactions were stopped by ethanol precipitation, resuspended in 100 mM tris-borateformamide loading buffer, denatured at 85 0 C for min, placed on ice, and the entire sample was immediately loaded onto an 8% denaturing polyacrylamide gel crosslink, 7 M urea) at 2000 V.
Identification of Binding Orientation by Affinity Cleaving.
Affinity cleavage assays (25 mM Tris-acetate, 10 mM NaCl, 100 iM/base pair calf thymus DNA, pH 7.0 and 22 0
C)
were performed in order to identify the binding orientations of the EDTA analogues of the three hairpin polyamides: ImImPyPy-y-ImImPyPy-P-Dp-EDTA, ImPyImPy-y- ImPyImPy-P-Dp-EDTA and ImImImIm-y-PyPyPyPy-P-Dp-EDTA WO 98/49142 PCT/US98/06997 The polyamides ImImPyPy-y-ImImPyPy-P-Dp-EDTA, ImPyImPy-y- ImPyImPy-p-Dp-EDTA recognize their respective palindromic match sequences, 5'-TGGCCA-3' and 5'-TGCGCA-3', in two equivalent orientations, consistent with hairpin formation.
In contrast, the polyamide ImImImIm-y-PyPyPyPy-p-Dp-EDTA recognizes a non-palindromic sequence, 5'-TGGGGA-3 in a single orientation with cleavage visible only on the side of the site, as predicted by the hairpin model.
Depicted in Figure 17 is a representative affinity cleaving experiment on a 3'-32P-labeled 282 bp EcoRI/PvuII restriction fragment from plasmid pSES9hp. The 5'-TGCGCA-3 and 5'-TGGGGA-3' sites are shown on the right side of the autoradiogram. Lane 1, A reaction; lane 2, C reaction; lanes 3-5, 1 iM, 2 AM and 5 iM ImImPyPy-y- ImImPyPy-p-Dp-EDTA lanes 6-8, 1 AM, 2 iM and 5 iM ImPyImPy-y-ImPyImPy-p-Dp-EDTA lanes 9-11, 1 AM, 2 iM and 5 pM ImImImIm-y-PyPyPyPy-p-Dp-EDTA lane 12, intact DNA. All lanes contain 15 kcpm 3'-radiolabeled DNA, mM Tris-acetate buffer (pH 10 mM NaC1, and 100 iM/base pair calf thymus DNA. (Right) Affinity cleavage patterns for ImImPyPy-y-ImImPyPy-P-Dp-EDTA and ImPyImPy-y- ImPyImPy-P-Dp-EDTA at 1 iM concentration, and ImImImIm-y- PyPyPyPy-P-Dp-EDTA at 5 pM concentration. Illustration of the 282 bp restriction fragment with the position of the sequence indicated. Bar heights are proportional to the relative protection from cleavage at each band. Boxes represent equilibrium binding sites determined by the published model, and only sites that were quantitated by DNase I footprint titrations are boxed.
WO 98/49142 PCT/US98/06997 6. DNase I Footprinting All reactions were carried out in a volume of 400 pL.
We note explicitly that no carrier DNA was used in these reactions. A polyamide stock solution or water (for reference lanes) was added to an assay buffer where the final concentrations were: 10 mM Tris*HCl buffer (pH mM KC1, 10 mM MgCl 2 5 mM CaC12, and 20 kcpm 3radiolabeled DNA. The solutions were allowed to equilibrate for a minimum of 12 hours at 22 0 C. Cleavage was initiated by the addition of 10 AL of a DNase I stock solution (diluted with 1 mM DTT to give a stock concentration of 0.28 u/mL) and was allowed to proceed for min at 22 0 C. The reactions were stopped by adding 50 mL of a solution containing 2.25 M NaC1, 150 mM EDTA, 0.6 mg/mL glycogen, and 30 mM base-pair calf thymus DNA, and then ethanol precipitated. The cleavage products were resuspended in 100 mM tris-borate-EDTA/80% formamide loading buffer, denatured at 85 0 C for 5 min, placed on ice, and immediately loaded onto an 8% denaturing polyacrylamide gel crosslink, 7 M urea) at 2000 V for 1 hour. The gels were dried under vacuum at 80 0 C, then quantitated using storage phosphor technology.
The data were analyzed by performing volume integrations of the 5-TGGCCA-3', 5'-TGCGCA-3', and TGGGGA-3' sites and a reference site. The apparent DNA target site saturation, Oapp, was calculated for each concentration of polyamide using the following equation: 1 Itot ref tot/I r ef WO 98/49142 PCT/US98/06997 where Itot and Iref are the integrated volumes of the target and reference sites, respectively, and Itot o and Iref correspond to those values for a DNase I control lane to which no polyamide has been added. The ([L]tot, Oapp) data points were fit to a Langmuir binding isotherm (eq 2, n=lor n=2) by minimizing the difference between Oapp and Ofit, using the modified Hill equation: Kan[L]nto t Ofit 8min (Kmaxmin)tot 1 Kan[L]ntot (2) where [L tot corresponds to the total polyamide concentration, Ka corresponds to the equilibrium association constant, and Omin and 8max represent the experimentally determined site saturation values when the site is unoccupied or saturated, respectively. Data were fit using a nonlinear least-squares fitting procedure of KaleidaGraph software (version 2.1, Abelbeck software) with Ka, Imax, and 8min as the adjustable parameters. All acceptable fits had a correlation coefficient of R 0.97.
At least three sets of acceptable data were used in determining each association constant. All lanes from each gel were used unless visual inspection revealed a data point to be obviously flawed relative to neighboring points. The data were normalized using the following equation: WO 98/49142 PCT/US98/06997 Onom app' Oin norm o -o max min (3) 7. Quantitation by Storage Phosphor Technology Autoradiography Photostimulable storage phosphorimaging plates (Kodak Storage Phosphor Screen S0230 obtained from Molecular Dynamics) were pressed flat against gel samples and exposed in the dark at 22 0 C for 12-20 h. A Molecular Dynamics 400S PhosphorImager was used to obtain all data from the storage screens. The data were analyzed by performing volume integrations of all bands using the ImageQuant v. 3.2.
EXAMPLE 2 Synthesis And Oxidative Cleavage Of Double-Helical Dna By Polyamides Modified With A Polyamide-Ni(Ii) Tripeptide Conjugate Many anticancer drugs act through their ability to modify DNA. Novel polyamide conjugates have been designed which modify double-helical DNA in a sequence specific manner. More specifically the metalopeptide Ni(II)*Gly-Gly- His has been covalently attached to a pyrrole-imidazole polyamide. The conjugate was synthesized using manual solid phase synthesis protocols developed by the Dervan group using Boc-pyrrole-OBt ester and Boc-imidazole acid monomers, activated esters of y-aminobutyric acid and Palanine, and Boc-P-alanine-Pam resin. Individual polyamides are purified by reversed phase HPLC and characterized by MALDI-TOF mass spectrometry.
The metallopeptide Ni(II)*Gly-Gly-His has been shown to promote the efficient oxidative cleavage of DNA in the WO 98/49142 PCT/US98/06997 presence of monoperoxyphthalic acid. (Mack and Dervan, J.
Am. Chem. Soc., 112, 4604 (1990); Mack and Dervan, Biochemistry, 31, 9399 (1992)).
The reaction is thought to proceed through a mechanism that involves abstraction of hydrogen atom(s) from the deoxyribose backbone of DNA by a nondiffusable high valent nickel bound oxygen. Bifunctional conjugates were designed in order to combine the ability of polyamides to recognize any predetermined DNA sequence with the Ni(II)*Gly-Gly-His chemistry. The symmetric anhydride of the amino acid His and the activated ester of Gly were coupled to the extended hairpin polyamide directly on the P-alanine-Pam resin employing solid phase chemistry protocols. Denaturing polyacrylamide gel electrophoresis of 32P end-labeled DNA treated with the Ni(II)*Gly-Gly-His modified polyamide at pH 7.5 demonstrated the ability of the conjugate to cleave the double helical DNA in a sequence selective manner in 77 and 72 yields on the 3'-end-labeled DNA (at 10 nM polyamide). The chemical structure of the Ni(II)*Gly-Gly- His modified polyamide is shown in Figure 18.
EXAMPLE 3 Sequence Specific Alkylation Of Dna By Pyrrole-Imidazole Polyamides Modified With Dna Reactive Agents The design of sequence specific DNA binding-modifying molecules requires the integration of two separate entities: recognition and functional reactivity. The present inventor has discovered ligands which combine pyrrole-imidazole polyamide DNA binding motifs with mechanism based reactive functionalities capable of electrophilic modification of bases in the minor groove.
WO 98/49142 PCT/US98/06997 The design of sequence specific molecules for alkylation of double helical DNA requires both a specific DNA binding molecule and an atom specific DNA cleaving moiety. Hairpin polyamides are sequence specific molecules that can bind to any predetermined DNA sequence.
Bromoacetyl and the prodrug analogue of the cyclopropyl electrophile of CC-1065 react in an atom specific manner with double helical DNA. By tethering a bromoacetyl moiety or the prodrug analogue of the cyclopropyl electrophile of CC-1065 to a hairpin polyamide the present inventor has discovered a sequence specific DNA alkylating agent which can be targeted to any predetermined DNA sequence at subnanomolar concentration.
The two criteria for successful bifunctional molecule design are sequence specific reactions at designated single atoms within the bound complex, and cleavage yields that are quantitative under physiological conditions (i.e.
neutral pH, 37° C, 100-200nM KC1/NaCl). In order to maximize stoichiometric reaction on the DNA, the 'cleaving functionality' must be sufficiently reactive with DNA at 37 C, be inert in aqueous media, and not react with buffer components, and not suffer unimolecular decomposition in competition with desired reactions on DNA. In order to design such bifunctional molecules, hairpin polyamides equipped with either an N-terminal bromoacetyl group or a prodrug analogue of the cyclopropyl electrophile of CC-1065 have been prepared.
A. Bromoacetylated polyamides The polyamide NH2PPyPyPy-g-ImPyPyPy-b-Dp was designed to target the sequence 5'-AGTTT*A-3'. T* indicates the WO 98/49142 PCT/US98/06997 thymine opposite the alkylated adenine. The polyamide was synthesized by solid phase protocols, cleaved from the solid support with dimethyl amino propylamine, and purified by reverse phase HPLC chromatography. The terminal pyrrole residue was deprotected and left unacetylated, leaving a free primary amine on the N-terminus.
In order to bromoacetylate the polyamide, bromoacetic acid was activated with HOBt and DCC in 1 ml DMF. After minutes, the DCU was filtered off and the solution added to the polyamide with DIEA. After 15 minutes, .the reaction mixture was purified directly by reversed phase HPLC to isolate the bromoacetylated polyamide. Short reaction times were used to avoid alkylation of the unprotected imidazole ring nitrogen. The purified N-bromoacetyl hairpin polyamide was characterized by mass spectrometry. The synthesis of a bromoacetylated hairpin polyamide is described in Figure 19.
Another set of polyamides was synthesized, based on an extended hairpin motif. This motif combines the y-turn of the hairpin motif with the P-alanine spacer of the extended motif, combining the 2:1 binding mode with the 1:1 binding mode. The following compounds were synthesized: PyPy-P- PyPyPy-y-ImPyPy-P-Dp, PyPyPy-P-PyPyPy-y-ImPyPy-P-Dp, PyPy-p- PyPyPy-y-PyPyIm-P-Dp, and PyPyPyPyPyPy-y-PyPyIm-P-Dp. The syntheses of the bromoacetylated extended hairpins were successful, and were prepared as described for the bromoacetylated hairpin polyamide. As controls, all four of the acetylated compounds were made as well.
WO 98/49142 PCT/US98/06997 B. Typical manual solid phase polyamide synthesis of PYPyPyPy-y- ImPyPyPy- -Dp.
Boc-P-alanine-Pam resin (1.25 g, 0.25 mmol) was placed in a 20 ml glass reaction vessel and shaken in DMF for minutes and drained. The resin was washed with DCM (2 volumes) and deprotected with 80% TFA/DCM/0.5 M PhSH (1 wash, 1 X 20 minutes). Following deprotection, the resin was washed 3 time with DCM and 1 time with DMF. Bocpyrrole-OBt ester (357 mg, 1 mmol) was added in 2 ml of DMF followed by 1 ml DIEA. The coupling reaction was shaken vigorously for 45 minutes. Resin samples (5 mg)were taken periodically to monitor the synthesis by HPLC. Successive cycles of the remaining monomers, Boc-Py-OBt Boc-y-Im- COOH, Boc-Py-OBt Boc-y-Im-COOH was activated by addition of HBTU (378 mg, 1 mmol) in 2 ml of DMF. DIEA (1 ml) was added and the solution was allowed to stand for minutes until clear. After completion of the synthesis, the resin was washed with DMF, DCM, methanol, and ethyl ether. The resin was then lyophilized to remove solvent.
The polyamide was cleaved off the resin with dimethylamino propylamine (2 ml) in a glass scintillation vial at 55 C for 12 hours. The polyamide was filtered and HPLC purified in 0.1% TFA with a 0.25% CH3CN min-1 gradient.
C. Synthesis of bromoacetylated polyamides Bromoacetic acid (65 mg, 0.5 mmol) and hydroxybenzotriazole (65 mg, 0.5 mmol) were dissolved in 1 ml DMF. DCC (102 mg, 0.5 mmol) was added. After minutes, the DCU was filtered off, and the solution added to the polyamide (10 mg, 0.1 mmol). The filter was washed WO 98/49142 WO 9849142PCTIUS98/06997 with 1 ml DMF, and 300 p1 DIEA which was added to the reaction. The reaction was allowed to sit at room temperature for 15 minutes. HPLC purification in 0.1% TFA with gradient elution of 0.25% CH3CN min- Bromoacetylated polyamide was recovered, (0.184 mg, 135.6 P mol). UV kmax 312 nm (66,600). MALDI-TOF MS: 1358.3 (1357.29 calculated for M+1).
E. AcPyPyPyPy-y-InipyPyPy-Dp Synthesized as above. UV mx(E): 318 nm (66,600).
MALDI-TOF MS: 1279.5 (1279.4 calculated for M+1) F. NH2 PYPY-P-PyPyPy- y -ImPyPy Dp Synthesized as above. UV kmax 310 nm (66,600) H NNR (DM so-d 6 8 10.25(s, 1H), 9.98 9.9 3H), 9. 72 2H) 9. 3 (1H, br s) 8. 04 02 (in, 4H), 7.44 (s, 1H), 7.23 1H) 7.20 1H-) 7.18 1H), 7.17 (d, 1H), 7.167 1H) 7. 145 1H), 7.119 1H), 7.08 Cs, 1H), 7. 0 25 1H), 6. 9 1H) 6.85 1H), 6.80 (d, 1H), 6.79 1H), 3.90 3H) 3.85 3H), 3.81 Cs, 3H), 3. 80 3 H) 3 77 3 H) 3. 66 (br, 12 H) 3.43-3.34 (in, 8H-) 3. 17 Cm, 2H) 3. 08 Cm, 2H) 2. 98 (in, 21-) 2. 32 (in, 6H), 1.74 Cm, 4H).
G. AcPyPy-fP-PyPyPy-y-Im~yPy-P3-Dp To a solution of N112PYPY-P-PYPYPY-Y-Impypy-f3-Dp in DMSO/NNP (500 pi) and DIEA (500 pl) was added acetic anhydride (400 g1). The reaction was heated at 55 0 C for minutes and HPLC purified as above. UV kmax( Cs): 310 nin WO 98/49142 PCT/US98/06997 (66,600). MALDI-TOF MS: 1351.0 (1350.5 calculated for M+1).
H. BrAcPyPy-p-PyPyPy-y-ImPyPy-p-Dp Synthesized as bromoacetylated polyamide above. UV k max(S): 314 nm (66,600). MALDI-TOF MS: 1429.3 (1429.4 calculated for M+1).
I. NH2PyPyPy-P-PyPyPy-y-ImPyPy- -Dp Synthesized as above. UV kmax(E): 314 nm (74,925). 1
H
NMR (DMSO-d 6 6 10.25 1H), 10.02 1H), 9.99 1H), 9.90 3H), 9.72 2H),8.03-8.06 4H), 7.44 1H), 7.23-7.21(m, 3H) 7.17-7.12 4H), 7.09 1H), 6.98-6.83 6H), 3.92 3H) 3.87 3H), 3.82 3H), 3.79 (s, 3H) 3.77 3H) 3.51 12 2.72 4H) 2.48 (m, 4H), 2.32 6H) 1.78 4H).
J. AcPyPyPy--PyPyPy-y-ImPyPy-p-Dp Acetylated as above. UV kmax() 314 nm (74,925).
MALDI-TOF MS: 1472.0 (1472.6 calculated for M+1) NH2PyPy- -PyPyPy-y-PyPyIm-P-Dp. Synthesized as above. UV X max(E): 310 nm (66,600). 1H NMR (DMSO-d 6 6 10.31 1H), 9.98 1H) 9.91 1H), 9.89 1H), 9.84 1H), 9.71 (br, 2H) 8.06-8.08 3H), 7.95 1H) 7.48 (s, 1H), 7.28-7.15 H) 7.08 1H), 7.02 2H), 6.91- 6.86 3H) 6.80 1H) 3.91 3H) 3.86 3H) 3.82-3.78 12H) 3.56-3.43(m, 12H) 3.22 2H) 3.10 2H), 3.0 2H), 2.4 2H), 2.3 6H), 1.7 4H) WO 98/49142 WO 9849142PCTIUS98/06997 K AcPyPy-P PYPYPY-Y PYPYIM- P-DP Acetylated as above. LTV Xmax(c): 310 nm (66,600).
MALDI-TOF MS: 1350.0 (1350.5 calculated for M+ 1) NH2PYPYPYPYPyPY-y-PyPY-Tm-PDp- Synthesized as above. LIVX max(c): 310 rim (74,925). 1HNMR (DMSO-d 6 8 10.31 1H) 10.04 1H), 9.95 3H1), 9.90 1H), 9.84 2H1), 8.08-8.06 (in, 3H), 7.48 1H), 7.28 1H1), 7.23-7.21 (in, 2H), 7.15 1H), 7.10-7.07 (mn, 8H1), 6.94-6.87 (in, 4H), 3.95-3.78 (inm, 3.58 (in, 2H), 3.43 (in, 2H), 3.19 (mn, 2H), 3. 07 (in, 2H) 2. 36 (in, 2H) 2. 32 (in, 6H1), 2. 25 (m, 2H), 1.74 (in, 4H).
L. AcPyPyPyPyPyPy-y-PyPym-P -Dp Acetylated as above. LIV kmax(SW 310 nin (74,925).
MALDI-TOF MS: 1401.0 (1401.5 calculated for M+1).
B-rACPYPYPYPYPYPY-Y-PyPy-13-Dp Bromoacetylated as above.
UIV Xmax(E) 310 rim (74,925). MALDI-TOF MS: 1480.7 (1480.4 calculated for M+1).
M. Alkylation reactions Alkylation was examined on a 262 bp restriction fragment (EcoRI/FspI) of pBR322, radiolabeled on the end (10,000 cpin/reaction) Polyainide or bromodistamycin were added at appropriate concentrations. Final reaction concentrations were 10 inN sodium phosphate (pH- 100 jiM sonicated calf thymnus DNA. The reactions were incubated at 37 0C for 0, 1, 5, 10, 20, and 40 hours. Following incubation, the reactions were ethanol precipitated and dissolved in 10 gl 10 mM sodium phosphate buffer and heated WO 98/49142 PCT/US98/06997 at 90 C for 15 minutes. Piperidine (40 til, 1.4 M) was added and the reaction heated again for 15 minutes at 90 C.
Piperidine was lyophilized off and the reactions were resuspended in 7 tl lx TBE/80% formamide loading buffer, denatured by heating at 85 C for 10 minutes and placed on ice. Reactions were electrophoresed on 8% polyacrylamide gels cross link, 7 M urea) in lx TBE at 2000V. Gels were dried and exposed to a storage phosphor screen (Molecular Dynamics).
N. NH2PyPyPyPy-y-ImPyPyPy-NH(CH 2 Polyamide was synthesized as above on glycine linked Pam resin. For cleavage, resin (500 mg) was weighed out into a 50 ml flask in 5 ml 100% EtOH. An equal weight of LiBH 4 (500 mg, 23 mmol) was slowly added. Resin was incubated at 55 C for 2 hours, adding more ethanol as needed. Polyamide was HPLC purified as above. UV kmax(s): 314 nm (66,600). MALDI-TOF MS: 1124.0 (1124.2 calculated). 1H NMR (DMSO-d 6 8 10.26 1H), 10.02 (s, 1H) 9.99 1H), 9.94 2H) 9.89 2H), 8.02 (m, 1H) 7.91 1H), 7.43 1H) 7.24-7.20 2H) 7.16- 7.12 2H), 7.09-7.02 4H) 6.92-6.84 4H) 3.92- 3.87 6H), 3.83-3.77 18H).
EXAMPLE 4 Polyamide CBI Unit (+)CC-1065 is a natural product isolated from Streptomyces zelensis. It binds in the minor groove and shows antitumor activity due to a reactive cyclopropyl WO 98/49142 PCT/US98/06997 moiety which alkylates preferentially at N3 of adenine (Boger and Johnson. Angew. Chem. Int. Ed. Eng. 1996, 1438-1474).
Also in this class of antitumor antibiotics are the duocarmycins. They are structurally very similar to CC- 1065, having the reactive cyclopropyl ring, but lacking the third conjugated ring system. These compounds bind in AT tracts, and display strong sequence selectivity for alkylation at adenines. Alkylation will occur at N3 of guanine as well, but only when other AT bp are protected in the minor groove. The flanking sequence preferences for alkylation by CC-1065 are 5'-AAA-3'>5'-TTA-3'>5'-TAA-3'>5,- ATA-3'. The alkylation reaction is reversible for the two duocarmycin compounds but irreversible for CC-1065. This discrepancy is explained by the more extensive non-covalent interactions of CC-1065 with the DNA minor groove. 1065 is the natural enantiomer. The unnatural enantiomer has been synthesized by Boger and coworkers and shown to alkylate DNA as well. Interestingly, where the natural enantiomer binds 3' to 5' from the site of alkylation, the unnatural enantiomer binds 5' to Structures of CC- 1065 and the duocarmycins are shown in Figure When compared to N-Bromoacetyldistamycin, CC-1065 shows very different reactivity. For reaction times of 1 hour at 370 C, N-Bromoacetyldistamycin shows almost no visible cleavage, while (+)CC-1065 shows intense cleavage at 13 adenines. After 10 hours at 370, N- Bromoacetyldistamycin shows a comparable amount of cleavage to CC-1065 at 1 hour, but at only one adenine. Despite the apparent similarities between these two molecules, being crescent-shaped with an electrophile that covalently WO 98/49142 PCT/US98/06997 binds DNA, the cyclopropyl electrophile of CC-1065 alkylation shows faster kinetics than that of N- Bromoacetyldistamycin. The alkylation mechanism for CC-1065 is shown in Figure 21.
Several pro-drug analogues of CC-1065 have also been made. One of the most popular is bizelesin, a bifunctional interstrand DNA crosslinker synthesized by Upjohn. It is believed to go through the same cyclopropyl intermediate as CC-1065, but is more stable than the cyclopropyl analogues.
The structures of Bizelesin and CBI are shown in Figure 22.
Boger et al have synthesized many modified versions of the A ring of (+)CC-1065 to examine the effects of steric changes on the alkylation potency of these drugs. In his work with CC-1065 derivatives, it has been shown that there is a direct linear correlation between drug stability and cytotoxicity. The more solvolytically stable compounds also show the highest degree of cytotoxicity. The most successful modification thus far, is the synthesis of 1,2,9,9a-tetrahydrocyclopropa[c]benz[e]indol-4-one
(CBI)
which replaces the fused pyrrole with a fused benzene ring, releasing ring strain in the system. (Boger, D. Yun, and Han, N. Bioorganic and Medicinal Chemistry 1995, 3, 1429-1453.) When coupled to the B and C rings of CC-1065, CBI showed greater stability, reactivity, and selectivity than (+)CC-1065 itself. Boger has also shown that a Boc protected CBI unit is sufficient for DNA alkylation. Its fast kinetics and efficient alkylation make CBI an ideal candidate to tether to a hairpin to generate a powerful sequence specific alkylator.
WO 98/49142 PCT/US98/06997 The CBI subunit was synthesized as described by Boger.
(Boger, D. L. a. McKie., J.A. J. Org. Chem. 1995, 60, 1271- 1275.) Briefly, N-Boc-4-hydroxy-2-napthylamine was synthesized by the condensation of ammonia and 1,3 dihydroxynaphthalene with immediate Boc protection by Boc anhydride. After protection of the alcohol with benzyl bromide, treatment with NIS provided the iodonaphthylamine.
Alkylation with allyl bromide provided a substrate for a favorable 5-exo-trig aryl radical-alkene cyclization to occur, using Bu3SnH and TEMPO radical trap. Cleavage of the TEMPO trap intermediate occurred upon heating with activated Zn powder. Treatment with PPh3/CC1 4 followed by hydrolysis of the benzyl ether gave the desired product.
In order to work out the coupling conditions of the polyamide-CBI unit, a simple three ring compound, ImPyPy-p- NH(CH2) 2
NH
2 was made. A new activation strategy used disuccinimidyl glutarate (DSG), a diacid activated with NHS esters. A P-alanine linker was added to the CBI unit to facilitate completion of the reaction, according to the procedure by Lukhtanov, E.A. and coworkers. (Lukhtanov, et al. Nucleic Acids Research 1996, 24, 683-687.) After HPLC purification, one major peak was isolated. This fraction was analyzed by mass spectrometry and NMR and could be identified as the polyamide-CBI (chloro) conjugate. The synthesis of a CBI-polyamide conjugate is shown in Figure 23.
A. ImPyPy-P-ED Polyamide was synthesized as above, and cleaved with ethylene diamine. HPLC purification as above. UV kmax(s): WO 98/49142 PCT/US98/06997 300 nm (24,975). 1H NMR (DMSO-d 6 8 10.47 1H) 9.91 (s, 1H) 8.06 1H) 7.67 1H) 7.38 1H) 7.25 (s, 1H), 7.15 1H), 7.12 1H) 7.02 1H) 6.84 (s, 1H), 3.96 3H) 3.80 3H), 3.76 3H) 3.37 (m, 2H), 3.35 2H), 2.83 2H), 2.46 2H).
B. ImPyPy-P-ED-succinimide-NHS ImPyPy-P-ED (10 mg) was dissolved in 2 ml DMF added 100 p.l at a time to a solution of disuccinimidyl glutarate (100 mg) and DIEA (10 p1) in 1 ml DMF at room temperature. The reaction was monitored by analytical HPLC and was complete within an hour after final addition of polyamide.
Preparative HPLC gave a white powder. MS (FAB) 695.2 (calculated 694.3). 1H NMR (DMSO-d6) 8 10.49 1H), 9.91 1H), 8.01 1H), 7.92 1H), 7.87 1H), 7.38 (s, 1H) 7.25 1H) 7.17 1H) 7.11 1H), 7.04 (s, 1H), 6.78 1H), 3.96 3H) 3.80 3H), 3.76 (s, 3H), 3.33 2H) 3.05 2H) 2.77 2H), 2.65 (m, 2H) 2.55 2H) 2.30 2H) 2.28 2H) 2.16 (m, 2H), 2.13 2H), 1.78 2H) C. Boc-P-alanine-CBI Deprotect alcohol (217 mg, 0.65 mmol) as above. After removing ethyl acetate, dissolve in dry DMF (10 ml). Add to Boc-P-alanine (245.98 mg, 1.3 mmol) and EDC (767 mg, 4 mmol). Reaction was stirred under argon overnight.
Solvent was removed in vacuo and precipitated in 20 ml of water. The precipitate was centrifuged, washed, with water, and lyophilized. Flash chromatography gave a yellow powder. 1H NMR (DMSO-d6) 8 9.43 (br s, 1H) 8.31 1H) WO 98/49142 PCT/US98/06997 8.29 1H), 7.65 1H), 7.53 1H), 7.40 1H), 5.56 1H).
D. CBI conjugated polyamide Boc-p-alanine CBI was deprotected as above. A solution of ImPyPy-p-ED-succinimide-NHS (20 mg) in 100 tl DMF was added with 10 gi DIEA. Reaction was stirred at room temperature under argon overnight. HPLC purification gave a white powder. MS(FAB): 884.1 (calculated 884.4) 1H NMR (DMSO-d 6 8 10.50 1H), 10.36 (br s, 1H), 9.91 1H), 8.04 2H) 7.92 2H) 7.87 1H) 7.81 1H) 7.73 2H), 7.44 2H), 7.41 1H), 7.27-7.25 (m, 3H), 7.16 1H), 7.10 1H), 7.06 1H).
EXAMPLE Polyamide-Intercalator Conjugates The artificial regulation of protein:DNA interactions is a potentially powerful therapeutic tool. Protein recognition of DNA, both specific and non-specific, is based heavily on the nearby DNA structure. (Luisi, B.
(1995) in DNA-Protein: Structural Interactions, ed. Lilley, D. M. J. (IRL Press, Oxford), p. 23.) For example, a bent sequence of DNA may recruit a non-specific protein, as in HMG-I, or prevent a protein from making the appropriate contacts for high-affinity binding. Small molecules designed to bind predetermined sequences of DNA and modulate the local DNA topology may be a general approach for regulation of the function of DNA binding proteins.
Intercalators are a class of molecules which are potent antibiotic and antitumor drugs. (Neidle and Abraham, (1984) CRC Crit. Rev. Biochem. 17, 73-121. Wang, A. H-J.
WO 98/49142 PCT/US98/06997 (1992) Curr. Opin. Struct. Biol. 2, 361-368.) Lerman first described intercalation as the insertion of a flat, aromatic chromophore between adjacent base pairs of the double helix. (Lerman, L. S. (1961) J. Mol. Biol. 3, 18-30.) The rise of B-form DNA is usually 3.4A/base pair. The stacking of the intercalator separates the adjacent base pairs by another 3.4A and extends the length of the helix and equivalent amount per bound intercalator. The base pairs neighboring the intercalation site are also unwound 10-260 with respect to one another. Generally, it is these structural distortions introduced by intercalation which are considered to be the basis for their therapeutic activity.
However, it is important to note that in most cases the DNA helix returns to its B-form structure within a few base pairs of the intercalation site.
Due to their nature of stacking between the base pairs, intercalators generally exhibit little or no sequence specificity. A few natural products, such as actinomycin D and the anthracycline and pluramycin families of intercalators, have added functionalitites which impart preference for certain dinucleotide steps. (Hansen and Hurley (1996) Acc. Chem. Res. 29, Actinomycin D, consists of an aromatic phenoxazone core coupled to two identical cyclic pentapeptides that make contacts to the exocyclic amine of guanine, granting specificity for intercalation at 5'-GC-3' steps. Similarly, carbohydrate moieties attached to the chromophore of the anthracycline and pluramycin intercalators interact with the DNA bases in both the major and minor grooves and grant these molecules their sequence preferences. In almost all cases, the sequence specificity of these natural products is limited to the two base pairs adjacent to the intercalation site.
WO 98/49142 PCT/US98/06997 Netropsin and distamycin A are pyrrole carboxamide natural products which bind in the minor groove of DNA at sites of 4-5 contiguous A,T base pairs. (Krylov, et al.
(1979) Nucleic Acids Res. 6, 289-304; Zasedatelev, et al.
(1974) Mol. Biol. Rep. 1, 337-342; Zasedatelev, et al.
(1976) Dokl. Akad. Nauk SSSR (1976) 231, 1006-1009; Zimmer, and Wanhert (1986) Prog. Biophys. Mol. Biol. 47, 31-112; Van Dyke, et al. (1982) Proc. Natl. Acad. Sci., USA 79, 5470-5474; Van Dyke and Dervan (1982) Cold Spring Harbor Symp. Quant. Biol. 47, 347-353; Van Dyke and Dervan, (1983) Biochemistry 47, 2373-2377; Harshman and Dervan (1985) Nucleic Acids Res. 13, 4825-4835; Fox and Waring, (1984) Nucleic Acids Res. 12, 9271-9285; Lane et al. (1983) Proc.
Natl. Acad. Sci., USA 80, 3260-3264). In an attempt to create an intercalator with designed sequence specificity, a number of researchers have linked analogs of distamycin or netropsin to a non-specific intercalator. (Bailly and Henichart, (1991) Bioconj. Chem. 2, 379-393; Bourdouxhe- Housiaux, et al. (1996) Biochemistry 35, 4251-4264; Bailly, et al. (1994) Biochemistry 33, 15348-15364; Subra, et al.
(1991) Biochemistry 30, 1642-1650; Eliadis, et al. (1988) J. Chem. Soc. Chem. Comm. 1049-1052; Wang, et al. (1994) Gene 149, 63-67; Arcamone, F. (1994) Gene 149, 57-61.) Although these efforts have met with some success, these compounds target mixed sequences of A*T and G*C base pairs.
More specifically none of these compounds can bind a broad range of predetermined DNA sequences. Even more specifically, none of these compounds can bind a predetermined sequence with subnanomolar affinity.
Linking a non-specific intercalator moiety to a polyamide may produce the sequence specific distortions of WO 98/49142 PCT/US98/06997 DNA structure required to regulate protein:DNA interactions. Ethidium bromide is a common intercalator which has been shown to bind DNA with a Ka of approximately 105 M-1 and unwind the DNA helix by 260. (LePecq and Paoletti (1967) J. Mol. Biol. 27, 87-106; Waring, M. (1970) J. Mol. Biol. 54, 247-279; Wang, J. C. (1974) J. Mol. Biol.
89, 783-801; Bresloff and Crothers (1975) J. Mol. Biol. 103-123.) A derivative of ethidium, methidium, has been used previously in the preparation of designed intercalators and serves as the basis of methidium-propyl- Fe*EDTA (MPE) footprinting. (Dervan and Becker (1978) J. Am.
Chem. Soc. 100, 1968-1970; Hertzberg and Dervan (1982) J.
Am. Chem. Soc. 104, 313-315.) The synthesis and DNAbinding properties of a series of methidium-polyamide conjugates have been discovered by the present inventor.
Methidium-polyamide conjugates are designed to sequence specifically induce helical unwinding and extension which may be sufficient to inhibit DNA binding by a wide variety of DNA binding proteins, such as the transcription factor, GCN-4, SP1, and NF-KB.
A. Design and Synthesis of Methidium-Polyamide Intercalators A series of methidium-polyamide conjugates of the general design DMy-ImPyPy-y-ImPyPy-p-Cn-Mdm (DMy=N,Ndimethyl-y-aminobutyric acid, Cn=diamine linker of n carbons, Mdm=p-carboxymethidium) were synthesized.
Polyamides generally contain a C-terminal positively charged dimethylaminopropyl amide. In this case, since the C-terminus is conjugated to the methidium, DMy was placed on the N-terminus to retain the net positive charge. This WO 98/49142 PCT/US98/06997 alteration has no significant effect on polyamide binding.
Boc-chemistry solid phase polyamide synthesis allows for the rapid preparation of milligram quantities' of purified polyamide suitable for methidium conjugation in solution.
DMy-ImPyPy-y-ImPyPy-p-Pam-resin was prepared from Boc-Py-OBt ester and Boc-Im acid monomers. Aminolysis with various diamines (NH2(CH 2 nNH2, n=2, 4, 6) followed by preparatory HPLC purification afforded free polyamide with a C-terminal primary amine suitable for coupling to methidium. Reaction of the polyamide amine with the acylimidazole ester ofpcarboxy methidium and HPLC purification produced a series of methidium-polyamide conjugates with various linker lengths. The polyamide/methidium coupling reaction was quantitative by analytical HPLC. An average recovery of purified conjugate of 14.5% from DMy-ImPyPy-y-ImPyPy-P-Cn- NH2 was achieved. The 1H NMR spectrum of each conjugate has resonances consistent with polyamide and methidium protons, as well as an additional broad triplet at 8.75 ppm resulting from the amide bond formed in the polyamide/methidium coupling reaction. MALDI-TOF mass spectrometry analysis of each conjugate reveals the presence of compounds consistent with the mass of the conjugated species, with no free polyamide or methidium observed. The synthesis of bifunctional methidium-polyamide conjugates is described in Figure 24.
DMy-ImPyPy-y-ImPyPy-P-Cn-Mdm conjugates are targeted to the 5'-TGACT-3' portion of the ARE and GCRE binding sites of GCN4. By CPK modeling, intercalation is expected to occur between the two base pairs at the 3' end of the GCN4 biding site, AT for ARE (5'-CTGACTAAT-3') and TT GCRE (5'-ATGACTCTT-3') (intercalation site bolded). Coupling of WO 98/49142 PCTIUS98/06997 the methidium (Ka 105 M and polyamide (Ka 105 M
I
moieties is also expected to produce a significant increase in binding affinity. Binding of a methidium-polyamide conjugate to a 5'-AGTGTA-3' site is depicted below. The methidium is represented as a grey rectangle and is placed between the base pairs where intercalation is predicted based on molecular modeling studies.
C A TAAGTGTAA A G C C-3' 32 P 3'-C G T A T T C A C A T T C G H NMR were recorded on a GE 300 instrument operating at 300MHz. Spectra were recorded in DMSO-d 6 with chemical shifts reported in parts per million relative to residual DMSO-d 5 UV spectra were measured on a Hewlett-Packard Model 8452A diode array spectrophotometer. Matrixassisted, laser desorption/ionization time of flight mass spectrometry was carried out at the Protein and Peptide Microanalytical Facility at the California Institute of Technology. HPLC analysis was performed either on a HP 1090 M analytical HPLC or a Beckman Gold system using a Rainen C18, Microsorb MV, 5 Am, 300 x 4.6 mm reversed phase column in 0.1% (wt/v) TFA with acetonitrile as eluent and a flow rate of 1.0 ml/min, gradient elution 1.25% acetonitrile/min. Preparatory HPLC was carried out on a Beckman instrument using a Waters DeltaPak 25 x 100 mm 100 Am C 18 column in 0.1% (wt/v) TFA, gradient elution 0.25%/min. CH3CN. Water was obtained from a Millipore Milli-Q water purification system. Reagent-grade chemicals WO 98/49142 PCT/US98/06997 were used unless otherwise stated. Restriction endonucleases were purchased from either New England Biolabs or Boeringher-Mannheim and used according to the manufacturer's protocol. Sequenase (version 2.0) was obtained from United States Biochemical, and DNase I (FPLCpure) was obtained from Boeringher-Mannheim. [a-32p]_ (3000 Ci/mmol), and [a-32p] (6000 Ci/mmol), were purchased from Du Pont/NEN.
B. Synthesis of Polyamide-Methidium Conjugates Boc-Im acid and Boc-Py-OBt were synthesized in 5 and 6 steps, respectively. DMy-ImPyPy-y-ImPyPy-P-Pam-resin was prepared using Boc-chemistry manual solid phase synthesis protocols. Polyamide was cleaved from the resin (400 mg) by aminolysis in neat diamine (2 mL, 24-48 hours, 600 C) and purified by preparative HPLC. p-Carboxy methidium acid mg) in DMSO (1 mL) was activated by reaction with carbonyl diimidazole (22 mg) and N-ethylmorpholine (15 jiL) in DMSO (200 iL) (250 C, 1 hour). Aliquots of this solution (375 L) and DIEA (150 4L) were added to DMy-ImPyPy-y- ImPyPy-p-Cn-NH2 4, 6) polyamides, each in DMSO (150 i After 12-24 hours the reaction was diluted with 0.1% (wt/v) TFA (5 mL) and purified by HPLC.
C. DM-y-ImPyPy--ImPyPy--C 2 -Mdm Coupling of p-carboxy methidium acid acid to DMy- ImPyPy--ImPyPy-p-C 2 -NH2 (20 mg, 19 imol) afforded DMy- ImPyPy-y-ImPyPy-P-C2-Mdm as a purple powder. (3.0 mg, 2 p WO 98/49142 PCT/US98/06997 mol, 10.5% recovery). HPLC r.t. 28.9, UV kmax 290 (93,000); H NMR (DMSO-d 6 d 10.38 1 10.25 1 9.99 1 9.95 1 9.89 2 9.4 (br s, 1 H) 8.80 1 H) 8.65 2 H) 8.18 2 H) 8.08 3 7.73 2 7.48 1 7.42 1 H), 7.41 1 7.30 2 7.23 2 7.15 2 H), 6.86 1 6.82 1 6.30 1 3.93 9 H), 3.82 3 3.80 3 3.78 6 3.38 2 H), 3.18 2 3.08 4 2.76 6 2.35 9 H), 1.92 2 1.76 2 MALDI-TOF MS 1390.8 (1390.6 calc. for M+H) D. DMy-ImPyPy-y-ImPyPy-p-C 4 -Mdm Coupling of p-carboxy methidium acid acid to DMy- ImPyPy-7-ImPypy-P3-C 4
-NH
2 (35 mg, 32 tmol) afforded DMy- ImPyPy-y-ImPyPy-p-C4-Mdm as a purple powder. (8.7 mg, 6 p mol, 19% recovery). HPLC r.t. 29.7, UV ?,max 290 (93,000); H NMR (DMSO-d 6 d 10.38 1 10.25 1 9.97 1 9.95 1 9.90 2 9.4 (br s, 1 8.74 1 H) 8.61 2 H) 8.17 2 8.02 2 H) 7.95 1 H) 7.73 2 H) 7.48 1 H), 7.43 1 7.42 1 7.30 2 7.23 2 H), 7.15 2 6.87 1 6.82 1 6.30 1 H), 3.93 6 3.92 3 3.82 3 3.81 3 H), 3.77 6 3.38 2 3.18 2 3.08 4 H), 2.76 6 2.40 9 1.92 2 1.78 2 H), 1.50 4 MALDI-TOF MS 1418.7 (1418.6 calc. for M+H).
WO 98/49142 PCT/US98/06997 E. DMy-ImPyPy-y-ImPypy-P-C 6 -Mdm Coupling of p-carboxy methidium acid acid to DMy- ImPyPy-y-ImPyPy-p-C 6
-NH
2 (30 mg, 27 mol) afforded DMy- ImPyPy-y-ImPyPy-P-C6-Mdm as a purple powder. (5.3 mg, 3.7 I mol, 14% recovery). HPLC r.t. 30.6, UV kmax 290 (93,000); H NMR (DMSO-d 6 d 10.38 1 10.26 1 9.98 1 9.95 1 9.90 2 9.4 (br s, 1 8.73 1 8.61 2 8.17 2 8.02 2 7.89 1 7.73 2 7.48 1 H), 7.44 1 7.43 1 7.30 2 7.24 2 H), 7.14 2 6.88 1 6.83 1 6.31 1 H), 3.93 9 3.83 6 3.78 6 3.31 2 H), 3.18 2 3.03 4 2.76 6 2.35 9 H), 1.92 2 1.78 2 1.50 6 MALDI-TOF
MS
1446.7 (1446.9 calc. for M+H).
EXAMPLE 6 Polyamide Dye Conjugates Solution methods for the sequence-specific detection of nucleic acids offer several advantages in terms of sample preparation and of time resolution of measurements.
Currently most efforts in this direction focus on hybridization methods of single stranded targets. The targeting of double helical DNA allow for the direct detection of biological DNA samples including plasmid, cosmid, or genomic DNA. DNA-binding pyrrole-imidazole polyamide will sequence-specifically deliver environmentally sensitive fluorochromes to the DNA. Several dyes show a markedly increased fluorescence upon binding to DNA, among these are Hoechst 33258, ethidium bromide, and WO 98/49142 PCT/US98/06997 most notably thiazole orange. More generally, dyes such as dansyl and mansyl show tremendous sensitivity to environment.
Conjugates have been prepared with a number of such dyes in order to develop sequence-specific, high affinity DNA fluorochromes. The polyamide portion of each dye was prepared using solid phase synthetic methodology and reacted with an amine reactive fluorochrome. A number of dyes and 'linker diamines' are being investigated. These conjugates are unique, in that they combine the ability to recognize any predetermined DNA sequence with the ability to signal binding events directly.
The synthesis of a polyamide-rhodamine conjugate is outlined below. 80% TFA/DCM, 0.4M PhSH; (ii) BocPy- OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4M PhSH; (iv) BocPy- OBt, DIEA, DMF; 80% TFA/DCM, 0.4M PhSH; (vi) BocPy-OBt, DIEA, DMF; (vii) 80% TFA/DCM, 0.4M PhSH; (viii) Boc-yaminobutyric acid-Im-OBt, HBTU, DIEA, DMF; (ix) TFA/DCM, 0.4M PhSH; BocPy-OBt, DIEA, DMF; (xi) TFA/DCM, 0.4M PhSH; (xii) BocPy-OBt, DIEA, DMF; (xiii) TFA/DCM, 0.4M PhSH; (xiv) BocPy-OBt, DIEA, DMF; (xv) TFA/DCM, 0.4M PhSH; (xvi) Imidazole-2-carboxylic acid (HBTU/DIEA); (xvii) diamino-N-methyldipropylamine, 55 oC; (xviii) 5-carboxyrhodamine 6G succinimidyl ester, HEPES, pH 7.5, 25 oC] (Figure 25). The chemical structures of a number of polyamide-DYE conjugates are shown in Figures 26A-D.
EXAMPLE #7 DNA Detection Through Energy Transfer Systems which show enhanced or specific fluorescence upon binding to a specific DNA sequence could be useful WO 98/49142 PCT/US98/06997 reagents for genomic analysis. Energy transfer between Dyes provides a means of detecting simultaneous binding of sequence-specific imidazole-pyrrole polyamides to proximal DNA binding sites. (Ju, et al. Proc. Natl. Acad. Sci. 92, 4347-4351; Nie et al., Science 266, 1018-1021.) Two DNA binding polyamides will be prepared to target adjacent DNA binding site, one conjugated to a donor dye, the other conjugated to an acceptor dye. Dye pairs will be chosen such that the donor dye can be excited without exciting the acceptor. With excitation at this energy, fluorescence of the acceptor fluorochrome will only occur while proximal to the donor fluorochrome through energy transfer from the donor. The required binding of the two polyamides will lengthen the effective recognition sequence to the levels appropriate for genomic level analysis and will improve the specificity of the technique.
Using dye conjugation chemistry developed by the present inventor, conjugates will be prepared purified, and characterized. Donor-acceptor pairs such as fluoresceinrhodamine or thiazole-orange/rhodamine will be analyzed for their computability in this system. This energy transfer system increases the currently accessible recognition sequence for polyamides and provides for a unique bindingdependent signal, applicable for both homogeneous and heterogeneous detection systems.
Pyrene and similar systems for excimers (excited state dimers) provide two or more molecules are close in three dimensional space. DNA-binding polyamides deliver pyrene to proximal positions on DNA. Binding is then monitored by the formation of the excimer.
WO 98/49142 PCTIUS98/06997 The structure of a pyrene polyamide conjugate is shown below.
S N
H
NN H H N 0/ H H H N0
O
0 0 o o I Pyrene-PyPyPy-P-ImPyPy--ImPyPy-P-Dp EXAMPLE 8 Polyamide Biotin Conjugates Conjugates prepared between sequence specific DNA binding polyamides and biotin are useful for a variety of applications. First, such compounds can be readily attached to a variety of matrices through the strong interaction of biotin with the protein streptavidin. (Weber, Ohlendorf, Wendoloski, Salemme, F.R. Science 243, 85-88) Readily available strepdavidin-derivatized matrices include magnetic beads for separations as well as resins for chromatography. (Ito, T., Smith, Cantor, C.R. Proc. Natl. Acad. Sci. 89, 495498; Tagle, Swaroop, Lovett, Collins, F.S. Nature 361, 751-753) A number of such polyamide-biotin conjugates have been synthesized by solid phase synthetic methods. Following resin cleavage with a variety of diamines, the polyamides were reacted with carious biotin carboxylic acid derivatives to yield conjugates. The conjugates were WO 98/49142 PCT/US98/06997 purified by HPLC and characterized by MALDI-TOF mass spectroscopy and 1H NMR. The synthesis of biotin-polyamide conjugate is shown in Figure 27. The chemical -structure of a number of bifunctional biotin-polyamide conjugates prepared by the present invention are shown in Figures 28A- D. A scheme for sequence specific affinity capture by a bifunctional polyamide-biotin conjugates is outlined in Figure 29.
EXAMPLE 9 Photoactivated Modification of DNA By A Polyamide-Psoralen Conjugate Photoactivated modification of DNA by a polyamidepsoralen conjugate. Psoralen and psoralen derivatives have been used as photoactive drugs in the treatment of cancer.
(Edelson, et al. N. Engl. J. Med. 316. 297 (1987)) These molecules intercalate into double-helical DNA and upon irradiation with UVA undergo a cycloaddition reaction with the 5,6 double bond of thymine residues to form both monoadducts and interstrand DNA cross-links. (Psoralen DNA Photobiology, Volumes 1 and 2; Gesparro, Ed. CRC Press, Inc., Boca Raton, Fl. 1988.) Our recent interests have focused on the synthesis and in vitro analysis of photoactive polyamide-psoralen conjugute B which is designed to form covalent attachments to DNA in a sequencespecific manner. The use of light as a trigger for the permanent covalent modification of DNA may prove to be attractive tool for potential in vivo applications such as the specific inhibition of transcription by minor groove binding polyamides. The extended hairpin polyamide-psoralen conjugate B was synthesized by coupling the OBt ester of (8-psoralenyloxy)pentanoic acid. (Lee, et al., J. Med.
WO 98/49142 PCT/US98/06997 Chem. 1994, 37, 1208.) to the extended hairpin polyamide directly on the P-alanine-Pam resin. Upon equilibration of the psoralen-polyamide conjugate at pH 7.5 with a 247 bp restriction fragment followed by irradiation at 360 nm, the extent of intrastrand cross-link formation was shown to be between 15-20% and 54-57% at 10 nM and 100 nM concentrations of polyamide respectively. Our current work involves the use of a polymerase stop assay as a tool to map the sites of intrastrand covalent modification as well as sites of potential monoadduct formation on doublehelical DNA. The structure of the psoralen-polyamide conjugate is shown in Figure EXAMPLE In vitro assay for polyamide binding An engineered, radiolabeled restriction fragment from pUC-19 was prepared in which a nine bp polyamide binding site overlaps by two base pairs with the cleavage site for the restriction endonuclease Pvu II. Cleavage by Pvu II is prevented when the overlapping polyamide binding site is occupied by the polyamide. As a control, a second radiolabeled DNA fragment was prepared which contains a Pvu II site, but lacks the overlapping polyamide binding site.
The rate of polyamide association with its target binding site was assessed by combining solutions of the polyamide with the radiolabeled target and reference fragments and allowing them to for 5 minutes to 5 hours before initiating a treatment (1-2 minutes) with the enzyme Pvu II. Under the experimental conditions, the reference site is nearly completely digested, but protection at the target site is observed and can be correlated with WO 98/49142 PCT/US98/06997 polyamide concentration and the time of equilibration.
Similarly, the dissociation rate is analyzed by adding an excess of unlabeled competitor DNA to an equilibrated solution of the labeled DNA fragments and polyamide.
Addition of the competitor reduces the concentration of free polyamide to zero. The rate at with polyamide dissociation occurs from the target site on the labeled fragment can be followed by the rate of loss of protection from Pvu II digestion as the re-equilibration time is increased. The association profile with respect to time for the 9-ring extended hairpin polyamide ImPyPy-y-ImPyPy-P- PyPyPy-G-Dp binding its cognate 9 base pair match site is shown in Figure 6.
The extended hairpin dissociation rate has been determined to be kd 3.1 ±0.2 x 10- 5 s- 1 this corresponds to a half-time of 6.2 h (10 mM Bis-Tris (pH 50 mM NaC1, 5 mM MgC12, 1 mM mercaptoethanol at 37 OC). Wherein half-time is defined as the time required for 50% of a population of DNA and polyamide to dissociate or associate.
The association rate has been determined at ka 1.3 +0.8 x 104 M-1s-1; this corresponds to a half-time of 3.0 h at nM. The determined value for the equilibrium association constant (Keq 6.3 +0.8 x 108 M- 1 correlates well with the kinetically determined ratio (ka/kd 4.2 2.6 x 10 8
M-
1 These results demonstrate that polyamides bind to a designated target site within seconds to minutes, but that it may take hours for dissociation to occur at such a site.
More specifically these results demonstrate that polyamides bind DNA with a combination of association and dissociation WO 98/49142 PCT/US98/06997 rates which provide effective modulation of the activity of DNA binding proteins.
EXAMPLE 11 Cooperative Hairpin Dimers for Recognition of DNA by Py-Im Polyamides Small molecules which permeate cells and bind predetermined DNA sequences have the potential to control the expression of specific genes. Trauger, et al. Nature 1996, 382, 559-561; Gottesfeld, et al. Nature 1997, 387, 202-205). Recently, an eight-ring polyamide which binds to a six base pair target site was shown to inhibit gene transcription in cell culture (Gottesfeld, et al. Nature 1997, 387, 202-205). Polyamides recognizing longer DNA sequences should provide more specific biological activity B. Dervan, Science 1986, 232, 464) which could be achieved by synthesizing larger hairpins (Turner, et al.
EJ. Am. Chem. Soc. 1997, 119, 7636-7644). However, the upper limit of polyamide size with regard to efficient cell permeation is not known.
Alternatively, a more biomimetic approach is to bind larger DNA sequences while maintaining the size of the polyamide. Nature's transcription factors often bind large DNA sequences by formation of cooperative protein dimers at adjacent half-sites (Ptashne, et al. A Genetic Switch, Blackwell Scientific Publications and Cell Press: Palo Alto, CA, 1986; Pabo, et al. Ann. Rev. Biochem. 1992, 61, 1053-1095; Marmorstein, et al. Nature 1992, 356, 408-414; Klemm, et al. Cell 1994, 77, 21-32; Bellon, et al. Nature Struct. Biol. 1997, 4, 586-591). For cooperatively binding extended Py-Im polyamide dimers, the two ligands can slip sideways with respect to one another, allowing recognition WO 98/49142 PCT/US98/06997 of other sequences (Trauger, et al. J. Am. Chem. Soc. 1996, 118, 6160-6166; Swalley, et al. Chem. Eur. J. 1997, 3, 1600-1607). Hairpin polyamides utilizing the turn-specific y-aminobutyric acid linker are constrained to be fully overlapped and preclude the "slipped motif" option (Mrksich, et al. J. Am. Chem. Soc. 1994, 116, 7983-7988; Parks, et al. ibid. 1996, 118, 6153-6159; Swalley, et al.
ibid. 1996, 118, 8198-8206; Swalley, et al. ibid. 1997, 119, 6953-6961; Trauger, et al. Chem. Biol. 1996, 3, 369- 377; Declairac, et al., J. Am. Chem. Soc. 1997, 119, 7909- 7916). Provided herein is a cooperative six-ring extended hairpin polyamide which dimerizes to specifically bind a predetermined ten base pair sequence.
A sequence contained in the regulatory region of the HIV-1 genome was selected as the target site (Jones, et al.
Ann. Rev. Biochem. 1994, 63, 717-743; Frech, et al.
Virology 1996, 224, 256-267). To design the ligand, the polyamide ring pairing rules provided herein, such as the inclusion of 0-alanine to relax ligand curvature, and the preference of y-aminobutyric acid for a "hairpin turn" conformation within polyamide-DNA complexes were considered. This analysis suggested that the six-ring polyamide having the core sequence ImPy-3-ImPy-y-ImPy might bind the target sequence 5'-AGCAGCTGCT-3' through formation of a cooperative hairpin dimer (Figure 31). To avoid a collision between the N-terminal end of one ligand and the C-terminal end of the second within the complex, the positively-charged P-alanine-dimethylaminoproplyamide
C-
terminus used in standard polyamides has been replaced with the shorter, uncharged (CH 2 2 0H group (C 2 The cationic "turn" residue (R)-2,4-diaminobutyric acid ((R)H 2
NY)
WO 98/49142 PCT/US98/06997 maintains the overall +1 charge for optimal solubility in water.
Polyamide ImPy- P-ImPy- (R)H 2 NY-ImPy-C 2 -OH was synthesized using solid-phase methods E. Baird, P. B.
Dervan, J. Am. Chem. Soc. 1996, 118, 6141-6146) on glycine- PAM resin (available in 0.3 mmol/g substitution from Peptides International, Louisville, KY), reductively cleaved from the solid support using LiBH 4 (Mitchell, et al., J. Org. Chem. 1978, 43, 2845; Stewart, et al. Solid Phase Peptide Synthesis, Pierce Chemical Company, Rockford, IL, 1984) and purified by HPLC (reverse-phase). The identity and purity of the polyamides was confirmed by 1H NMR, analytical HPLC, and MALDI-TOF MS. MALDI-TOF MS (monoisotopic) ImPy- -ImPy- H2N-ImPy-C 2 -OH, obsd 953.3, calcd (C 42
H
53
N
8 0 9 O) 953.4; ImPy-P-ImPyPy-(R) 2 Ny-PyImPy-
C
2 -OH, obsd 1197.5, calcd (C 54
H
65
N
22 0 11 1197.5.
Figure 32 illustrates the (ImPy- -ImPy-(R)H2N -ImPy-C2-
OH)
2 5'-AGCAGCTGCT-3' complex, demonstrating binding models for complexes of a 10 base pair match and singlebase pair mismatch sites (the mismatched base pair is highlighted by shading). The shaded and open circles represent imidazole and pyrrole rings, respectively, diamonds represent P-alanine, half-circles represent
(CH
2 2 0H groups, and curved lines represent diaminobutyric acid.
Figure 33a represents a storage phosphor autoradiogram of the 8% denaturing polyacrylamide gel used to separate the fragments generated by DNase I digestion in a quantitative footprint titration experiment with polyamide ImPy- -ImPy-(R)"2N -ImPy-C2-OH: lane 1, A lane; lane 2, DNase I digestion products obtained in the absence of WO 98/49142 PCT/US98/06997 polyamide; lanes 3-12, DNase I digestion products obtained in the presence of 0.1, 0.2, 0.5, 1, 2, 5, 10, 20, 50, and 100 nM polyamide ImPy- P-ImPy- (R)H 2 Ny-ImPy-C 2
-OH,
respectively. All reactions contain pJT-LTR 3'-32P-endlabeled EcoRI/HindIII restriction fragment (15 kcpm), 10 mM Tris*HCl, 10 mM KC1, 10 mM MgC12, and 5 mM CaC12 (pH 7.0, 24 OC). Plasmid pJT-LTR was prepared by ligating an insert having the sequence
AAAGCAGCTGCTTATATGCAGCATCTGAGGGACGCCACTCCCCAGTCCCGCCCAGGCCA
CGCCTCCCTGGAAAGTCCCCAGCGGAAAGTCCCTTGTAGAAAGCTCGATGTCAGCAGTC
TTTGTAGTACTCCGGATGCAGCTCTCGGGCCACGTGATGAAATGCTAGGCGGCTGTCAA
TCGA-3' to the large AvaI/SalI fragment of pUC19.
Quantitative DNase I footprinting on a 245 base pair 3'- 32 P-end-labeled restriction fragment showed that ImPy-P- ImPy-(R)H 2 Ny-ImPy-C 2 -OH binds its match site 5'-AGCAGCTGCT-3' at nanomolar concentrations (apparent monomeric association constant, Ka 1.9 x 108 M- 1 and also binds a single-base pair mismatch site 5'-AGATGCTGCA-3' with 9-fold lower affinity, Ka 2.2 x 107 M 1 (Figure 33b).
The binding data for match and single-base pair mismatch sites were well-fit by cooperative binding isotherms, consistent with formation of cooperative 2:1 polyamide-DNA complexes.[ 5 A double-base pair mismatch site, 5'-AGCTGCATCC-3', is also bound with 65-fold lower affinity. The fact that this mismatch site, which contains the "half-site" 5'-AGCTGCA-3', is not effectively bound indicates that recognition of the match site occurs through cooperative dimerization, and not due to formation of 1:1 hairpin complexes.
Further study of the generality and sequence specificity of this motif is in progress and will be WO 98/49142 PCT/US98/06997 reported in due course. For example, we found that the eight-ring polyamide ImPy-p-ImPyPy-(R) 2 Ny-PyImPy-C 2 -OH binds the twelve base pair match site 5'-AAGCAGCTGCTT-3' with fold higher affinity than ImPy- -ImPy-(R) H2y-ImPy-C 2
-OH,
and is approximately 100-fold specific for this site versus the double-base pair mismatch site 5'-CAGATGCTGCAT-3' The DNA-binding affinity and specificity of the sixring polyamide ImPy-P-ImPy-(R) "y-ImPy-C 2 -OH for its ten base pair binding site are typical of standard six-ring hairpins which recognize five base pairs. Thus, use of a the cooperative hairpin dimer motif doubles the binding site size relative to the standard hairpin motif without sacrificing affinity or specificity, and without increasing the molecular weight of the ligand. As provided herein, a novel cooperative hairpin dimer motif, relatively low molecular weight pyrrole-imidazole polyamides
(MW
approximately 950-1,200) can specifically recognize 10-12 base pairs of DNA.
Figure 34 provides general polyamide motifs for use in desigining polyamides having improved binding and specificity. Figure 35 provides five general formulas for polyamides of the present invention. Figure 36 illustrates the DNA footprint analysis and affinities of additional cooperatively-bound polyamides. Figure 37 demonstrates the N-terminal extension of the polyamide ImPyPy-X-ImImPy-y- PyPyPy-P-Dp where X is y, C5-8, P- or WO 98/49142 WO 98/91 42PCT/US98/06997 *Table 5 illustrates recognition of 15 Base-Pairs by ImPyPy-X-ImImPy-y-PyPyPy-P-Dp polyamides. Association constants (Ka) for the match site 5'-AACCAAGTCTTGGTA-31 and specificities for the match site versus center AACCAACTGTTGGTA- and edge -AACCAAGTCTTGgGA- 3') mismatch sites are Also illustrated.
TABLE Length of X K. Spedifiity X atoms) Match Center mismnatch Edge mnismatch Y 1x 10 1 1 6 4110O8 11 19 C6 7 2 x10 812 C7 8 2 10 82 2 1313 8 3 x10 82 2 C8 9 2x10 01 O-510 lxl10 2 2 Solution conditions: 10 mM Tris*HC1, 10 mM KC1, 10 MM MgCl 2 and 5 mM CaCl 2 at 24 OC and pH 7.0. The parent hairpin Ac- ImImPy-y-PyPyPy-P3-Dp binds both 5 base pair binding sites within the 15 base pair target site 5'-AACCAAGTCTTGGTA-3' with Ka =3 x l0*' WO 98/49142 PCTIUS98/06997 PAGE INTENTIONALLY LEFT BLANK Table 6 illustrates recognition of 15 Base-Pairs by ImImPy-y-PyPyPy-X-ImPyPy-3-Pp polyamides. Association constants for the match site 51-AACCAAGTCTTGGTA-31 and specificities for the match site versus center (5'-AACCAACTgTTGGTA-3') and edge AACCAAGTCTTGCGA-3') mismatch sites are shown.
TABLE 6 Length of X K. Specificity X (#atoms) Match Center mismatch Edge nmismatch 6 1X10 9 2 2 C6 72 x10 81 2 C7 8 1 x 109 11 PP8 lxIO 1 Solution conditions: 10 mM TrisoHCl, 10 mM KCl, 10 MM MgCl 2 and 5 mM CaC1 2 at 24 OC and pH 7. 0. The parent hairpin ImImPy-y-PyPyPy-3-Dp binds both 5 base pair binding sites within the 15 base pair target site 5'-AACCAAGTCTTGGTA-3' with K, 1 x 10' Table 7 illustrates recognition of 15 base pairs by ImPyPy-X-ImImPy-y-PyPyPy-C3-OH Association constants for the match site 5'-AACCAAGTCTTGGTA-3' and specificities for the match site versus center (5'-AACCAACTGTTGGTA-3') and edge (5'-AACCAAGTCTTGCGA-3') mismatch sites.
TABLE 7 K. Specificity Polyamide 5'-AACGAAGTCTTGGTA-3' 5'-AACCAACTGTTIGGTA-3' 5'-AACCAAGTCTTGCGA-3' IniPyPy-C5-Im~mPy-y-PyPyPy-C3-OH 3 x 3 3 ImPyPy-C6-ImIm.Py-y-PyPyPy-C3-OH 3 x 1'2 2 ImPyPy-13-f-ImlmPy-y-PyPyPy-C3-OH I x I 08 3 3 Ac-Mnmy-y-PyPyPy-C3-OH 7 x Table 8 illustrates recognition of 16 Base-Pairs by ImPyPyPy-X-ImImPy-y-PyPyPy-IbDp.
Association constants (Ka) for the match site 51-AAC CAAGTACTTGGTA-31 and specificities for the match site versus center (5'-AACCAACTAGTTGGTA-3') and edge (51-AACCAAGTACTTGCGA-3') mismatch sites.
TABLE 8 Ka Specificity Polyamide 5'-AACCAAGTACTrGGTA 5'-AACCAACTAGTrGGTA 5 t
-AACCAAGTACITGCGA
7 x 10' 3 >3 lrnPyPyPy-G6-ImImPy-y-PyPyPy-f3-Dp I x 10, 1 1 ImPyPyPy-f3-f-ImImPy--PyPyPy-f..Dp 3 x 10' 1, 1 Ac-ImImPy-y-PyPyPy- 3-Dp 3x 10 7 Dp.
Table 9 illustrates recognition of recognition of 9-11 base pairs by ImPyPy-X-ImPyPy-3-
OI
TABLE 9 Polyamide 5STGTCAGACA-3' 5'-TGTCAAGACA-3' 5'-TGTGAAAGACA-3' 5'-gcggtTGTCAacccg- 3' ImPyPy-G-lmPyPy-3-Dp I x 10, 3 x 107 3 x 107 3 x 107 ImPyPy-13-ImPyPy-f3-Dp 2 x 1'2 x 108 1 x 109 1 0 ImPyPy-y-ImPyPy-P3-Dp 5 X 107 1 x 101 1 x 10, 2 x 108 2 x 10' 1 x 109 1 x 109 1 x ImPyPy-C6-ImPyPy-3-Dp I x iO1 2 x 10' 5 X-107 5 x 107 ImPyPy-C7-ImPyPy-3-Dp 2 x 107 1 x 10, 3 x 107 7 x 107 ImPyPy-C8-ImPyPy-P-Dp 1 X 107 3 x 107 1 X 107 1 x ImPyPy-C1I-ImPyPy-P3-Dp 5 x 106 2 x 107 2 x 107 1 X 107 lmPyPy-I3.ImPyPy-13-Dp 2 x 108 2 x 108 1 x 109 1 x 109 ImPyPy-f3-P-ImPyPy-P-Dp 1 X 107 5 X 107 1 X 107 2 x 107 ImfPyPy-j3-P-P-ImPyPy-P-Dp I x 107 1 X 107 7 x 10' 1X 107 ImPyPy-f3-Py-I3-ImPyPy-P-Dp 3 x 101 2 x 108 1 x 109 108 IniPyPy-P-ImPyPy-3-Dp 2 x 108 2 x 10' 1 x 10 1x 0 ImPyPy-IP-PyPyPy-J3-Dp 2 x 108 5 x 10' 5 x 10' 3 x Solution conditions: 10 mM Tris*HCl, 10 mM KCl, 10 MM MgCl 2 and 5 mM CaCl 2 at 22 0 C and pH Table 10 illustrates recognition of 16 Base-Pairs by a polyamide having the formula ImPyPyPy-X-ImImPy-y-PyPyPy-C3-OH. Association constants (Kae) for the match site 51- AACCAAGTACTTGGTA-31 and specificities for the match site versus center (51-AACCAACTAGTTGGTA- 31) and edge (5'-AACCAAGTACTTGCGA-3') mismatch sites.
TABLE K, Specificity Polyamide 5'-AACCAAGTACTTGGTA 5'-AACCAACTAGflGGTA 5-AACCAAGTACTT1GCGA ImiPyPyPy-C6-ImImPy-y-PyPyPy-C3-OH 108 Ac-ImImPy-y-PyPyPy-C3-OH 7 x Table 11 illustrates recognition of 17 Base-Pairs by a polyamide of the formula ImPyPyj-X-P-ImImPy-y-PyPyPy-3-Dp, X=13 or Py. Association constants for the match site AACCATAGTCTATGGTA-31 and specificities for the match site versus center AACCATAGCGTATGGTA- 31) and edge -AAccAAGTCTTGCGA- mismatch sites.
TABLE 11 K. Specificity Polyamide 5'-AACCATAGTCTATGGTA 5'-AACCATAGCGTATGGTA ImPyPy--03-0-ImImPy-y-PyPyPy-f3-Dp9 ImPyPy-J3-Py-p-ImlmPy-Y-PyPyPy-f -Dp ?9 Ac-ImJmPy-y-PyPyPy-[3-Dp 3 WO 98/49142 PCT/US98/06997 EXAMPLE 12 Recognition of 16 Base Pairs in the Minor Groove of DNA by an Im-Py Polyamide Dimer Cell-permeable small molecules which bind predetermined DNA sequences with affinity and specificity comparable to natural DNA-binding proteins have the potential to regulate the expression of specific genes.
Recently, an 8-ring hairpin Py-Im polyamide which binds 6 base pairs of DNA was shown to inhibit transcription of a specific gene in cell culture (Gottesfeld, et al. Nature 1997, 387, 202-205). Polyamides recognizing longer DNA sequences should provide more specific biological activity. To specify a single site within the 3 billion base pair human genome, ligands which specifically recognize 15-16 base pairs are necessary. For this reason, recognition of 16 base pairs represents a milestone in the development of chemical approaches to DNA recognition (Dervan, P.B. Science 1986, 232, 464; Dervan, P.B. In The Robert A. Welch Foundation Conference on Chemical Research XXXI. Design of Enzymes and Enzyme Models; Houston, Texas, November 2-4, 1987; pp 93-109; Dervan, P.B. In Nucleic Acids and Molecular Biology, Vol. 2; Springer-Verlag: Heidelberg, 1988; pp 49-64; Moser, et al. Science 1987, 238, 645-650; Le Doan, et al. Nucleic Acids Res. 1987, 15, 7749; Strobel, et al.
Science 1991, 254, 1639-1642; Thuong, et al. Angew. Chem.
Int. Ed. Engl. 1993, 32, 666-690). A Py-Im polyamide dimer which targets 16 contiguous base pairs in the minor groove of DNA is provided herein.
As the length of a polyamide dimer having the general sequence ImPy2-6 increases beyond 5 rings (corresponding to a 7 base pair binding site), the DNAbinding affinity ceases to increase with polyamide length (Kelly, et al. Proc. Natl. Acad. Sci. U.S.A. 1996, 93, WO 98/49142 PCT/US98/06997 6981-6985). A structural basis for this observation is provided by the recently determined X-ray crystal structure structure of a 4-ring homodimer in complex with DNA, which reveals a perfect match of polyamide rise-perresidue with the pitch of the DNA duplex, but overwound ligand curvature (Keilkopf, et al. Nature Struct. Biol.
1998, 5, 104-109). The curvature mismatch explains the observation that flexible P-alanine residues reset an optimum fit of polyamide dimers with the DNA helix at long binding sites (Trauger, et al. J. Am. Chem. Soc.
1996, 118, 6160-6166; Swalley, et al. Chem. Eur. J.
1997, 3, 1600-1607).
The 16 base pair sequence 5'-ATAAGCAGCTGCTTTT-3' present in the regulatory region of the HIV-1 genome was utilized as a binding site (Jones, et al. Ann. Rev.
Biochem. 1994, 63, 717-743; Frech, et al. Virology 1996, 224, 256-267). Consideration of the previously published polyamide ring pairing rules (Wade, et al. J. Am. Chem.
Soc. 1992, 114, 8783-8794; Mrksich, et al. Proc. Natl.
Acad. Sci. 1993, 32, 11385-11389; Geierstanger, et al. J.
Am. Chem. Soc. 1993, 115, 4474-4482; White, et al. Chem.
Biol. 1997, 4, 569-578; Pelton, et al. Proc. Natl.
Acad. Sci. U.S.A. 1989, 86, 5723-5727; Pelton, et al. J.
Am. Chem. Soc. 1990, 112, 1393-1399; Chen, et al. Nature Struct. Biol. 1994, 1, 169-175; White, et al.
Biochemistry 1996, 35, 12532-12537), the A,T specificity of p/p pairs, and the "slipped" dimer motif (Geierstanger, et al. Nature Struct. Biol. 1996, 3, 321- 324; Trauger, et al. Chem. Biol. 1996, 3, 369-377) suggested that the 8-ring polyamide ImPy-P-ImPy-P-ImPy-P- PyPy-P-Dp would specifically bind the target sequence as a cooperative antiparallel dimer (Fig. 38).
WO 98/49142 PCT/US98/06997 Figure 38 illustrates a model of the complex of ImPy-p-ImPy-P-ImPy-P-PyPy-p-Dp R H) or ImPy-P-ImPy- P-ImPy-p-PyPy- -Dp-EDTA*Fe(II) R RE) (Im Nmethylimidazole, Py N-methylpyrrole, P P-alanine, Dp dimethylaminopropylamide) with 5'-ATAAGCAGCTGCTTTT-3'.
The shaded and open circles represent imidazole and pyrrole rings, respectively, and the diamonds represents P-alanine. Circles with dots represent lone pairs on N3 of purines and 02 of pyrimidines, and circles containing an H represent the N2 hydrogen of guanine. Putative hydrogen bonds are illustrated by dashed lines. The polyamides were synthesized using solid-phase methods (Baird, et al. J. Am. Chem. Soc. 1996, 118, 6141-6146), purified by HPLC, and the identity and purity confirmed by 1 H NMR, analytical HPLC and MALDI-TOF MS.
A quantitative DNase I footprinting experiment carried out on a 245 base pair 3'- 2 "P-end-labeled restriction fragment revealed that the polyamide specifically binds it target site at subnanomolar concentrations (apparent monomeric association constant, Ka 3.5 x 1010 (Fig. 39) (Baird, et al. J. Am. Chem.
Soc. 1996, 118, 6141-6146; Brenowitz, et al. Methods Enzymol. 1986, 130, 132-181; Cantor, Schimmel, Biophysical Chemistry, Part III: The Behavior of Biological Macromolecules; W.H. Freeman, New York, N.Y., 1980, p 863).
Figure 39 illustrates a storage phosphor autoradiogram of an 8% denaturing polyacrylamide gel used to separate the fragments generated by DNase I digestion in a quantitative footprint titration experiment with polyamide ImPy-P-ImPy-P-ImPy-P-PyPy- -Dp lane 1, A lane; lane 2, DNase I digestion products obtained in the absence of polyamide; lanes 3-10, DNase I digestion WO 98/49142 PCT/US98/06997 products obtained in the presence of 0.01, 0.02, 0.05, 0.1, 0.2, 0.5, and 1 nM polyamide ImPy-P-ImPy--ImPy-p- PyPy-P-Dp, respectively; lane 11, intact DNA. All reactions contain 3'-2P-end-labeled EcoRI/HindIII restriction fragment from plasmid pJT-LTR (15 kcpm), mM Tris*HC1, 10 mM KC1, 10 mM MgCl 2 and 5 mM CaC12 (pH 24 oC). b) Autoradiogram of a gel used to separate the fragments generated by an affinity cleavage reaction using polyamide ImPy-p-ImPy-P-ImPy-P-PyPy-p-Dp-EDTA*Fe(II) Lanes 1 and 5: A sequencing lanes; lanes 2-4: cleavage products obtained in the presence of 0.03, 0.1, 0.3 and 1 nM ImPy-p-ImPy-p-ImPy-P-PyPy-p-Dp-EDTA*Fe(II), respectively; lane 6: intact DNA. All reactions contain labeled restriction fragment (7 kcpm), 20 mM HEPES, 300 mM NaC1, 50 g/mL glycogen, 1 jM Fe(II), and 5 mM DTT (pH 7.3, 24 OC). The sequence of the restriction fragment in the region of the 16 base pair target site and a model of the (ImPy-P-ImPy-p-ImPy--PyyPy-p-Dp-EDTA*Fe(II) )2DNA complex are shown along the right side of the autoradiogram. Line heights are proportional to the observed cleavage intensity at the indicated base.
The method used for determining association constants involves the assumption that [L]tot [L]free, where [L]free is the concentration of polyamide free in solution (unbound). For very high association constants this assumption becomes invalid, resulting in underestimated association constants. In the experiments described here, the DNA concentration is estimated to be 5 pM. As a consequence, apparent association constants greater than 1-2 x 101 M- 1 represent a lower limit of the true association constant. The binding data were wellfit by a cooperative binding isotherm, consistent with formation of a cooperative 2:1 complex. To provide WO 98/49142 PCT/US98/06997 further evidence that ImPy-p-ImPy-(-ImPy-p-PyPy-P-Dp binds as an extended dimer, an affinity cleavage experiment was carried out with the polyamide-EDTA.Fe(II) conjugate of ImPy-P-ImPy-P-ImPy-P-PyPy-P-Dp shown in Fig. 39b.
Cleavage was observed at each end of the match sequence, consistent with a dimeric, antiparallel binding mode.
With regard to sequence specificity, there is a proximal two-base pair mismatch site, 5'-cAGATGCTGCATATa-3', to the 5' side of the "P-labeled strand which is bound with at least 35-fold lower affinity than the match site.
However, other mismatch sites on the restriction fragment are bound with 10-20-fold lower affinity, revealing limitations of this first effort at 16 base pair recognition. Undoubtedly there is ample room for further optimization of sequence specificity.
The high binding affinity and the affinity cleavage pattern observed for the 16 base pair polyamideeDNA complex inidcates that 8 pairs of amide residues form a fully overlapped core which properly positions the 6 Im/Py pairs for recognition of 6 G,C base pairs and 2 P/p pairs for recognition of 2 A,T base pairs. Polyamides composed of 2-ring subunits connected by P-alanine appear to be isohelical with B-DNA, and allow placement of imidazole residues at any ring position, thus providing a generalizable motif for recognition of predetermined
DNA
sequences. The data presented herein allows for the design of polyamides capable of binding 16 base pairs of DNA at subnanomolar concentrations of suitable size for permeating cells MW 1,200).
The references described throughout this specification are fully incorporated by reference. While a preferred form of the invention has been shown in the drawings and described, since variations in the preferred WO 98/49142 PCT[US98/06997 form will be apparent to those skilled in the art, the invention should not be construed as limited to the specific form shown and described, but instead is as set forth in the claims.
Claims (7)
1. A polyamnide of the formula R 2 N Ii N R 1 1 R n or a pharmnaceutically acceptable salt thereof, wherein: n is 5-11; :each Y is independently selected from the group consisting of 0 -NH- and -NH-(CH 2 CH 2 -C-NH where p is 0 to 5, provided that at least one Y is not -NIE-; Z is -N C H 2 C H 2 -C- wherein r isO0 to ea. X9sidpnetyslce rmth ru ossigCCHadN 1 each RXis independently selected from the group consisting o OHndN R is independently selected from the group consisting of H, H SunsulBstitute C 6 akyl and C 1 6 akyl-L eac 6 is independently selected from the group consisting of H NH 2 cey, SHzy, C 1 6 alkyl, C 1 -6alkylamine, C 1 -6alkyldiamine, C I 6alkylcarboxylate, C 1 6 alkenyl, CI- 6 alkynyl and Cl 6 alkyl-L; wherein L is a detectable label, and wherein at least one R 1 R 5 or R 6 is C 1 6 alkyl-L.
2. The polyamide of claim 1, wherein one or more X is CH, one or more R, is CH 3 and one or more X is N.
3. The polyamide of claim 1, selected from the group consisting of ImPyPyPy-y-PyPyPyPy-3 -Dp, PyPyImPy-y-PyPyPyyp4Dp, JmPyPyPy-y-IniPyPyPy-3- Dp, PyImPyPy-'y-PyImPyPy-P1-Dp, ImPylmPy-y-PyPyPyPy-p3-Dp, ImhnPyPy-y- [1:\DayLib\LIBFF]39058specdoc~gcc 182 PYPYPYPy-P3-Dp, ImlmImPy--y-PyPyPyPy-f3-Dp, JrnImPYPY-Y-ImPYPyPy-P-.Dp, JmPYPYPY-Y-JmImPyPy-P3-Dp, ImlmPyPy-y-JmlmPyPyJ3-Dp, ImPyIrnPy-y-JmPylmPy- P-Dp, LmImlmPy-y-ImPyPyPyPy-P-Dp, and lImmImm-y-PyPyPyPy.43.Dp, bound to a detectable label, wherein Im is N-methyl-imidazole, Py is niethylpyrrole, P3 is f3-alanine, Dp is dimethylaminopropylamide, and y is -y-arinobutyric acid.
4. The polyamide of claim 1, wherein said polyamide binds to a double-stranded DNA molecule with an association constant of at least 109~ M-1. The polyamide of claim 1, wherein the detectable label is a fluorescent molecule.
6. The polyamide of claim 1, wherein the detectable label is biotin.
7. A polyamide selected from the group consisting of S ImPyPyPy-y-PyPyPyPy-P3-Dp, PyPyImPy-y-PyPyPyPy-f3-Dp, JmPyPyPy-y-TmPyPyPy-3- Dp, PylmPyPy-y-PylmPyPy-p-Dp, JmPylmPy-y-PyPyPyPy-3-Dp, LmImPyPy-y- PyPyPyPy-43-Dp, ImlmlmPy-y-PyPyPyPy-j3-Dp, ImlrnPyPy-7-LmPyPyPy-P-Dp, l mPyPyPy--TmlmPyPy-P3-Dp, ImlmPyPy-y-ImnImPyPy-P3-Dp, JmPyImPy-y-JmPy~mPy- f3-Dp, JmlmImPy-y-ImPyPyPyPy-P3-Dp, ImlImmm-y-PyPyPyPy-f3-Dp, Im-j3-PyPy-y-Lm- f3-PyPy-p3-Dp, Im-f3-ImIm-y-Py-f3-PyPy-P3-Dp, Jm-P3-LmPy-y-Lm-f-mPy-f3-Dp, St ImPyPyPyPy-yTIPyPyPyPyPDp, ImImPyPyPy-y-JmiPyPyPyPy-f3-Dp, ImPylmPyPy-y- ImPyPyPyPy-j3-Dp, ImmPyIm-'-PyPyPyPyPy-P3Dp, ImPyPyImPy-y-ImPyPylmPy-3- Dp, LmPy-P-PyPy-y-ImPy-P-.PyPy-P-Dp, lIrnm-f3-Im~m-y-PyPy-P3-PyPy-P3-Dp, ImPy-f3- S JmPy-y-ImPy-I3-ImPyPDp, LmPy-f-PyPyPy-y-ImPyPy..]3PyPy--Dp, Tmlm-f3-PyPyPy- y-PyPyPy-f3-PyPy-f3-Dp, ImPy-I3.ImPyPy-y-ImPyPyf3 -PyPy-p3-Dp, Imlm-f3-PyPyPy-y- ImlmPy-P3-PyPy-P3-Dp, LmPy-I-PyPyPy-y-PyPyPy-p3-LmPy-f3Dp, IniPyPyPyPyPy-y- ImPyPyPyPyPy-f3-Dp, ImPyPy-o3-PyPy-y-ImPyPy-f3-PyPy-3-Dp, ImPyPyPy-P-Py-7y-Im- 1-PyPyPyPy-13-Dp, JmImPyPyPyPy-y-ImImPyPyPyPy.p3Dp, Tm-3-PyPyPyPy-y-m-3- PyPyPyPy-f3-Dp, JImPyPyPy-p-Py-y-ImPyPyPy-o-Py--Dp, ImPyIMPyPyPy-Y- hnPyPyPyPyPy-3-Dp, ImyypP~--myoP~~--p ImPyPyPyPy-3-y- ImPyPyPyPy-3-P-Dp, JmPy-f-ImPyPy-y-ImPy-1-hnPyPy.p-Dp, Jm-P-PyPyPyPy-y- ImPyPyPy-3-Py-P3-Dp, Im-1-ImPyPyPy-y-ImPyPyPyjpy-f3Dp, lmPyPy-f3-PyPyPy-3- Dp, ImlmPy-P3-PyPyPy-f3-Dp, lmlmIm-f3-PyPyPy-3-Dp, LmPyPyPyPy-f3-PyPyPy-f3-Dp, ImPyPyPy-3-PyPyPy-3-Dp, JmPyPy-j3-PyPyPyPyPy-P3-Dp, ImPyPyPy-f3-PyPyPyPy-3- Dp, JmhnPyPy-3-PyPyPyPy-P3-Dp, ImIm~mPy-P-PyPyPyPy-3-Dp, ImPyPyPy-3- ImPyPyPy-p-Dp, ImlmPyPy-3-ImPyPyPy-P-Dp, hnlmPyPyPy-3-PyPyPyPyPy-3-Dp, ImImlmPyPy-f-PyPyPyPyPyPDp, ImnIm-f3-PyPy-f-PyPy-P-PyPyPDp, LmImPy-f3- RA~ PyPyPy-I3-PyPyPy-P3-Dp, lImPyPy-P3-Py-I3-PyPyPyPy-P-.Dp, mnPyPy-y-ImPyPy-3- [I:\DayLib\LIBFF]39058spec.doc:gcc 183 PyPyPy-f3-Dp, JmPyPy-y-PyPyPy-P-PyPyPy-j.Dp, Py1rnPy-Y-JmPyPy-t3-PyPyPy-P3Dp, PylmPy-y-ImPyPy-p-PyPyPy-4-PyPyPy-oDp, Im1mPy-Y-ImPyPy-P3-PyPyPy-f3-Dp, ImPyPy-y-ImPyPy-G-PyPyPyPDp, TmPyPyPy-y-ImImImPy-p3-PyPyPyPy-f3-Dp, ImImPyPy-y-Jm~mPyPy-f3-PyPyyPy-pDp, and ImImPyPy-y-PyPyPyPy-f-PyPyPyPyP Dp, bound to a detectable label, wherein Im is N-methyl-imidazole, Py is methylpyrrole, P3 is f3-alanine, Dp is dimethylaminopropylamide, and y is y-aminobutyric acid.
8. A polyamide of claim 1, substantially as hereinbefore described with reference to any one of the examples. Dated 25 February, 2002 California Institute of Technology 0 Patent Attorneys for the Applicant/Nominated Person SPRUSON FERGUSON .0 0, 0: e 00 .00.0: so 0:
Applications Claiming Priority (13)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US4202297P | 1997-04-06 | 1997-04-06 | |
US4344697P | 1997-04-08 | 1997-04-08 | |
US4344497P | 1997-04-08 | 1997-04-08 | |
US60/043446 | 1997-04-08 | ||
US60/043444 | 1997-04-08 | ||
US60/042022 | 1997-04-16 | ||
US08/837,524 US6143901A (en) | 1996-07-31 | 1997-04-21 | Complex formation between dsDNA and pyrrole imidazole polyamides |
US08/837524 | 1997-04-21 | ||
US08/853522 | 1997-05-08 | ||
US08/853,522 US6635417B1 (en) | 1996-07-31 | 1997-05-08 | Complex formation between DSDNA and oligomer of cyclic heterocycles |
PCT/US1997/012722 WO1998050582A1 (en) | 1997-05-08 | 1997-07-21 | COMPLEX FORMATION BETWEEN dsDNA AND OLIGOMER OF HETEROCYCLES |
WOUS97/12722 | 1997-07-21 | ||
PCT/US1998/006997 WO1998049142A1 (en) | 1997-04-06 | 1998-04-08 | Dna-binding pyrrole and imidazole polyamide derivatives |
Publications (2)
Publication Number | Publication Date |
---|---|
AU7104098A AU7104098A (en) | 1998-11-24 |
AU747300B2 true AU747300B2 (en) | 2002-05-16 |
Family
ID=27534780
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU71040/98A Ceased AU747300B2 (en) | 1997-04-06 | 1998-04-08 | DNA binding pyrrole and imidazole polyamide derivatives |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU747300B2 (en) |
-
1998
- 1998-04-08 AU AU71040/98A patent/AU747300B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
AU7104098A (en) | 1998-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2286232A1 (en) | Dna-binding pyrrole and imidazole polyamide derivatives | |
US6506906B1 (en) | Preparation and use of bifunctional molecules having DNA sequence binding specificity | |
US6472537B1 (en) | Polyamides for binding in the minor groove of double stranded DNA | |
AU735022B2 (en) | Method for the synthesis of pyrrole and imidazole carboxamides on a solid support | |
Herman et al. | Tandem hairpin motif for recognition in the minor groove of DNA by pyrrole–imidazole polyamides | |
US7049061B1 (en) | Stereochemical control of the DNA binding affinity, sequence specificity, and orientation-preference of chiral hairpin polyamides in the minor groove | |
AU2002359679B2 (en) | Guanidinium transport reagents and conjugates | |
US6958240B1 (en) | Inhibition of major groove DNA binding proteins by modified polyamides | |
WO1998037066A9 (en) | Improved polyamides for binding in the minor groove of double stranded dna | |
JP2005508832A (en) | Transporter with arginine part at intervals | |
JP2011523415A (en) | Novel dual-targeting anti-tumor complex | |
US6555692B1 (en) | Preparation and use of bifunctional molecules having DNA sequence binding specificity | |
Melander et al. | Discrimination of A/T sequences in the minor groove of DNA within a cyclic polyamide motif | |
Greenberg et al. | A Comparison of H‐Pin and Hairpin Polyamide Motifs for the Recognition of the Minor Groove of DNA | |
EP4511385A1 (en) | Cyclic peptides for delivering therapeutics | |
US6559125B1 (en) | Polyamide-alkylator conjugates and related products and method | |
US7605125B2 (en) | DNA-binding polyamide drug conjugates | |
AU747300B2 (en) | DNA binding pyrrole and imidazole polyamide derivatives | |
CA2281947A1 (en) | Improved polyamides for binding in the minor groove of double stranded dna | |
JP2002514209A (en) | Stereochemical control of DNA binding affinity, sequence specificity, and orientation preference of chiral hairpin polyamide in minor groove | |
Szewczyk et al. | Recognition of the DNA Minor Groove by Pyrrole-Imidazole Polyamides: Comparison of Desmethyl-and N-Methylpyrrole | |
Dervan et al. | Cooperative Hairpin Dimers for DNA Recognition by Pyrrole-Imidazole Polyamides | |
Dervan | Strand Selective Cleavage of DNA by Diastereomers of Hairpin Polyamide-seco-CBI Conjugates |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
DA3 | Amendments made section 104 |
Free format text: THE NATURE OF THE AMENDMENT IS: AMEND THE INVENTION TITLE TO READ DNA BINDING PYRROLE AND IMIDAZOLEPOLYAMIDE DERIVATIVES |
|
FGA | Letters patent sealed or granted (standard patent) |