Combinatorial libraries of synthetic DNA are increasingly being used to identify and evolve prote... more Combinatorial libraries of synthetic DNA are increasingly being used to identify and evolve proteins with novel folds and functions. An effective strategy for maximizing the diversity of these libraries relies on the assembly of large genes from smaller fragments of synthetic DNA. To optimize library assembly and screening, it is desirable to remove from the synthetic libraries any sequences that contain unintended frameshifts or stop codons. Although genetic selection systems can be used to accomplish this task, the tendency of individual segments to yield misfolded or aggregated products can decrease the effectiveness of these selections. Furthermore, individual protein domains may misfold when removed from their native context. We report the development and characterization of an in vivo system to preselect sequences that encode uninterrupted gene segments regardless of the foldedness of the encoded polypeptide. In this system, the inserted synthetic gene segment is separated from an intein/thymidylate synthase (TS) reporter domain by a polyasparagine linker, thereby permitting the TS reporter to fold and function independently of the folding and function of the segment-encoded polypeptide. TS-deficient Escherichia coli host cells survive on selective medium only if the insert is uninterrupted and in-frame, thereby allowing selection and amplification of desired sequences. We demonstrate that this system can be used as a highly effective preselection tool for the production of large, diverse and high-quality libraries of de novo protein sequences.
DNA double-strand break repair involves phosphorylation of histone variant H2AX ('γH2AX')... more DNA double-strand break repair involves phosphorylation of histone variant H2AX ('γH2AX'), which accumulates in foci at sites of DNA damage. In current models, the recruitment of multiple DNA repair proteins to γH2AX foci depends mainly on recognition of this 'mark' by a single protein, MDC1. However, DNA repair proteins accumulate at γH2AX sites without MDC1, suggesting that other 'readers' of this mark exist. Here, we use a quantitative chemical proteomics approach to profile direct, phospho-selective γH2AX binders in native proteomes. We identify γH2AX binders, including the DNA repair mediator 53BP1, which we show recognizes γH2AX through its BRCT domains. Furthermore, we investigate the targeting of wild-type 53BP1, or a mutant form deficient in γH2AX binding, to chromosomal breaks resulting from endogenous and exogenous DNA damage. Our results show how direct recognition of γH2AX modulates protein localization at DNA damage sites, and suggest how specif...
Methods to evolve synthetic, rather than biological, polymers could significantly expand the func... more Methods to evolve synthetic, rather than biological, polymers could significantly expand the functional potential of polymers that emerge from in vitro evolution. Requirements for synthetic polymer evolution include (i) sequence-specific polymerization of synthetic building blocks on an amplifiable template, (ii) display of the newly translated polymer strand in a manner that allows it to adopt folded structures, (iii) selection of synthetic polymer libraries for desired binding or catalytic properties and (iv) amplification of template sequences that survive selection in a manner that allows subsequent translation. Here we report the development of such a system for peptide nucleic acids (PNAs) using a set of 12 PNA pentamer building blocks. We validated the system by performing six iterated cycles of translation, selection and amplification on a library of 4.3 x 10(8) PNA-encoding DNA templates and observed >1,000,000-fold overall enrichment of a template encoding a biotinylated (streptavidin-binding) PNA. These results collectively provide an experimental foundation for PNA evolution in the laboratory.
Protein kinases are attractive therapeutic targets, but their high sequence and structural conser... more Protein kinases are attractive therapeutic targets, but their high sequence and structural conservation complicates the development of specific inhibitors. We recently identified, in a DNA-templated macrocycle library, inhibitors with unusually high selectivity among Src-family kinases. Starting from these compounds, we developed and characterized in molecular detail potent macrocyclic inhibitors of Src kinase and its cancer-associated 'gatekeeper' mutant. We solved two cocrystal structures of macrocycles bound to Src kinase. These structures reveal the molecular basis of the combined ATP- and substrate peptide-competitive inhibitory mechanism and the remarkable kinase specificity of the compounds. The most potent compounds inhibit Src activity in cultured mammalian cells. Our work establishes that macrocycles can inhibit protein kinases through a bisubstrate-competitive mechanism with high potency and exceptional specificity, reveals the precise molecular basis for their desirable properties and provides new insights into the development of Src-specific inhibitors with potential therapeutic relevance.
The DNA-templated polymerization of synthetic building blocks provides a potential route to the l... more The DNA-templated polymerization of synthetic building blocks provides a potential route to the laboratory evolution of sequence-defined polymers with structures and properties not necessarily limited to those of natural biopolymers. We previously reported the efficient and sequence-specific DNA-templated polymerization of peptide nucleic acid (PNA) aldehydes. Here, we report the enzyme-free, DNA-templated polymerization of side-chain-functionalized PNA tetramer and pentamer aldehydes. We observed that polymerization of tetramer and pentamer PNA building blocks with a single lysine-based side chain at various positions in the building block could proceed efficiently and sequence specifically. In addition, DNA-templated polymerization also proceeded efficiently and in a sequence-specific manner with pentamer PNA aldehydes containing two or three lysine side chains in a single building block to generate more densely functionalized polymers. To further our understanding of side-chain compatibility and expand the capabilities of this system, we also examined the polymerization efficiencies of 20 pentamer building blocks each containing one of five different side-chain groups and four different side-chain regio- and stereochemistries. Polymerization reactions were efficient for all five different side-chain groups and for three of the four combinations of side-chain regio- and stereochemistries. Differences in the efficiency and initial rate of polymerization correlate with the apparent melting temperature of each building block, which is dependent on side-chain regio- and stereochemistry but relatively insensitive to side-chain structure among the substrates tested. Our findings represent a significant step toward the evolution of sequence-defined synthetic polymers and also demonstrate that enzyme-free nucleic acid-templated polymerization can occur efficiently using substrates with a wide range of side-chain structures, functionalization positions within each building block, and functionalization densities.
DNA-templated organic synthesis enables the translation of DNA sequences into synthetic small-mol... more DNA-templated organic synthesis enables the translation of DNA sequences into synthetic small-molecule libraries suitable for in vitro selection. Previously, we described the DNA-templated multistep synthesis of a 13,824-membered small-molecule macrocycle library. Here, we report the discovery of small molecules that modulate the activity of kinase enzymes through the in vitro selection of this DNA-templated small-molecule macrocycle library against 36 biomedically relevant protein targets. DNA encoding selection survivors was amplified by PCR and identified by ultra-high-throughput DNA sequencing. Macrocycles corresponding to DNA sequences enriched upon selection against several protein kinases were synthesized on a multimilligram scale. In vitro assays revealed that these macrocycles inhibit (or activate) the kinases against which they were selected with IC(50) values as low as 680 nM. We characterized in depth a family of macrocycles enriched upon selection against Src kinase, and showed that inhibition was highly dependent on the identity of macrocycle building blocks as well as on backbone conformation. Two macrocycles in this family exhibited unusually strong Src inhibition selectivity even among kinases closely related to Src. One macrocycle was found to activate, rather than inhibit, its target kinase, VEGFR2. Taken together, these results establish the use of DNA-templated synthesis and in vitro selection to discover small molecules that modulate enzyme activities, and also reveal a new scaffold for selective ATP-competitive kinase inhibition.
Microtubules are hollow tube-like biological polymers required for transport in diverse cellular ... more Microtubules are hollow tube-like biological polymers required for transport in diverse cellular contexts and are important drug targets. Microtubule function depends on interactions with associated proteins and post-translational modifications at specific sites located on its interior and exterior surfaces. However, we lack strategies to selectively perturb or probe these basic biochemical mechanisms. In this work, by combining amber suppression-mediated non-natural amino acid incorporation and tubulin overexpression in budding yeast, we demonstrate, for the first time, a general strategy for site-specific chemistry on microtubules. Probes and labels targeted to precise sites on the interior and exterior surfaces of microtubules will allow analysis and modulation of interactions with proteins and drugs, and elucidation of the functions of post-translational modifications.
Researchers seeking to improve the efficiency and cost effectiveness of the bioactive small-molec... more Researchers seeking to improve the efficiency and cost effectiveness of the bioactive small-molecule discovery process have recently embraced selection-based approaches, which in principle offer much higher throughput and simpler infrastructure requirements compared with traditional small-molecule screening methods. Since selection methods benefit greatly from an information-encoding molecule that can be readily amplified and decoded, several academic and industrial groups have turned to DNA as the basis for library encoding and, in some cases, library synthesis. The resulting DNA-encoded synthetic small-molecule libraries, integrated with the high sensitivity of PCR and the recent development of ultra high-throughput DNA sequencing technology, can be evaluated very rapidly for binding or bond formation with a target of interest while consuming minimal quantities of material and requiring only modest investments of time and equipment. In this tutorial review we describe the development of two classes of approaches for encoding chemical structures and reactivity with DNA: DNA-recorded library synthesis, in which encoding and library synthesis take place separately, and DNA-directed library synthesis, in which DNA both encodes and templates library synthesis. We also describe in vitro selection methods used to evaluate DNA-encoded libraries and summarize successful applications of these approaches to the discovery of bioactive small molecules and novel chemical reactivity.
Pharmacologic agents capable of increasing kinase function would be useful for treating diseases ... more Pharmacologic agents capable of increasing kinase function would be useful for treating diseases associated with reduced kinase activity, such as inherited forms of Parkinson's disease. In this issue, Hertz et al. report an innovative approach for activating the Parkinson's-associated kinase PINK1 in cells with an ATP-derived neo-substrate.
Despite decades of speculation that inhibiting endogenous insulin degradation might treat type-2 ... more Despite decades of speculation that inhibiting endogenous insulin degradation might treat type-2 diabetes, and the identification of IDE (insulin-degrading enzyme) as a diabetes susceptibility gene, the relationship between the activity of the zinc metalloprotein IDE and glucose homeostasis remains unclear. Although Ide(-/-) mice have elevated insulin levels, they exhibit impaired, rather than improved, glucose tolerance that may arise from compensatory insulin signalling dysfunction. IDE inhibitors that are active in vivo are therefore needed to elucidate IDE's physiological roles and to determine its potential to serve as a target for the treatment of diabetes. Here we report the discovery of a physiologically active IDE inhibitor identified from a DNA-templated macrocycle library. An X-ray structure of the macrocycle bound to IDE reveals that it engages a binding pocket away from the catalytic site, which explains its remarkable selectivity. Treatment of lean and obese mice with this inhibitor shows that IDE regulates the abundance and signalling of glucagon and amylin, in addition to that of insulin. Under physiological conditions that augment insulin and amylin levels, such as oral glucose administration, acute IDE inhibition leads to substantially improved glucose tolerance and slower gastric emptying. These findings demonstrate the feasibility of modulating IDE activity as a new therapeutic strategy to treat type-2 diabetes and expand our understanding of the roles of IDE in glucose and hormone regulation.
Combinatorial libraries of synthetic DNA are increasingly being used to identify and evolve prote... more Combinatorial libraries of synthetic DNA are increasingly being used to identify and evolve proteins with novel folds and functions. An effective strategy for maximizing the diversity of these libraries relies on the assembly of large genes from smaller fragments of synthetic DNA. To optimize library assembly and screening, it is desirable to remove from the synthetic libraries any sequences that contain unintended frameshifts or stop codons. Although genetic selection systems can be used to accomplish this task, the tendency of individual segments to yield misfolded or aggregated products can decrease the effectiveness of these selections. Furthermore, individual protein domains may misfold when removed from their native context. We report the development and characterization of an in vivo system to preselect sequences that encode uninterrupted gene segments regardless of the foldedness of the encoded polypeptide. In this system, the inserted synthetic gene segment is separated from an intein/thymidylate synthase (TS) reporter domain by a polyasparagine linker, thereby permitting the TS reporter to fold and function independently of the folding and function of the segment-encoded polypeptide. TS-deficient Escherichia coli host cells survive on selective medium only if the insert is uninterrupted and in-frame, thereby allowing selection and amplification of desired sequences. We demonstrate that this system can be used as a highly effective preselection tool for the production of large, diverse and high-quality libraries of de novo protein sequences.
DNA double-strand break repair involves phosphorylation of histone variant H2AX ('γH2AX')... more DNA double-strand break repair involves phosphorylation of histone variant H2AX ('γH2AX'), which accumulates in foci at sites of DNA damage. In current models, the recruitment of multiple DNA repair proteins to γH2AX foci depends mainly on recognition of this 'mark' by a single protein, MDC1. However, DNA repair proteins accumulate at γH2AX sites without MDC1, suggesting that other 'readers' of this mark exist. Here, we use a quantitative chemical proteomics approach to profile direct, phospho-selective γH2AX binders in native proteomes. We identify γH2AX binders, including the DNA repair mediator 53BP1, which we show recognizes γH2AX through its BRCT domains. Furthermore, we investigate the targeting of wild-type 53BP1, or a mutant form deficient in γH2AX binding, to chromosomal breaks resulting from endogenous and exogenous DNA damage. Our results show how direct recognition of γH2AX modulates protein localization at DNA damage sites, and suggest how specif...
Methods to evolve synthetic, rather than biological, polymers could significantly expand the func... more Methods to evolve synthetic, rather than biological, polymers could significantly expand the functional potential of polymers that emerge from in vitro evolution. Requirements for synthetic polymer evolution include (i) sequence-specific polymerization of synthetic building blocks on an amplifiable template, (ii) display of the newly translated polymer strand in a manner that allows it to adopt folded structures, (iii) selection of synthetic polymer libraries for desired binding or catalytic properties and (iv) amplification of template sequences that survive selection in a manner that allows subsequent translation. Here we report the development of such a system for peptide nucleic acids (PNAs) using a set of 12 PNA pentamer building blocks. We validated the system by performing six iterated cycles of translation, selection and amplification on a library of 4.3 x 10(8) PNA-encoding DNA templates and observed >1,000,000-fold overall enrichment of a template encoding a biotinylated (streptavidin-binding) PNA. These results collectively provide an experimental foundation for PNA evolution in the laboratory.
Protein kinases are attractive therapeutic targets, but their high sequence and structural conser... more Protein kinases are attractive therapeutic targets, but their high sequence and structural conservation complicates the development of specific inhibitors. We recently identified, in a DNA-templated macrocycle library, inhibitors with unusually high selectivity among Src-family kinases. Starting from these compounds, we developed and characterized in molecular detail potent macrocyclic inhibitors of Src kinase and its cancer-associated 'gatekeeper' mutant. We solved two cocrystal structures of macrocycles bound to Src kinase. These structures reveal the molecular basis of the combined ATP- and substrate peptide-competitive inhibitory mechanism and the remarkable kinase specificity of the compounds. The most potent compounds inhibit Src activity in cultured mammalian cells. Our work establishes that macrocycles can inhibit protein kinases through a bisubstrate-competitive mechanism with high potency and exceptional specificity, reveals the precise molecular basis for their desirable properties and provides new insights into the development of Src-specific inhibitors with potential therapeutic relevance.
The DNA-templated polymerization of synthetic building blocks provides a potential route to the l... more The DNA-templated polymerization of synthetic building blocks provides a potential route to the laboratory evolution of sequence-defined polymers with structures and properties not necessarily limited to those of natural biopolymers. We previously reported the efficient and sequence-specific DNA-templated polymerization of peptide nucleic acid (PNA) aldehydes. Here, we report the enzyme-free, DNA-templated polymerization of side-chain-functionalized PNA tetramer and pentamer aldehydes. We observed that polymerization of tetramer and pentamer PNA building blocks with a single lysine-based side chain at various positions in the building block could proceed efficiently and sequence specifically. In addition, DNA-templated polymerization also proceeded efficiently and in a sequence-specific manner with pentamer PNA aldehydes containing two or three lysine side chains in a single building block to generate more densely functionalized polymers. To further our understanding of side-chain compatibility and expand the capabilities of this system, we also examined the polymerization efficiencies of 20 pentamer building blocks each containing one of five different side-chain groups and four different side-chain regio- and stereochemistries. Polymerization reactions were efficient for all five different side-chain groups and for three of the four combinations of side-chain regio- and stereochemistries. Differences in the efficiency and initial rate of polymerization correlate with the apparent melting temperature of each building block, which is dependent on side-chain regio- and stereochemistry but relatively insensitive to side-chain structure among the substrates tested. Our findings represent a significant step toward the evolution of sequence-defined synthetic polymers and also demonstrate that enzyme-free nucleic acid-templated polymerization can occur efficiently using substrates with a wide range of side-chain structures, functionalization positions within each building block, and functionalization densities.
DNA-templated organic synthesis enables the translation of DNA sequences into synthetic small-mol... more DNA-templated organic synthesis enables the translation of DNA sequences into synthetic small-molecule libraries suitable for in vitro selection. Previously, we described the DNA-templated multistep synthesis of a 13,824-membered small-molecule macrocycle library. Here, we report the discovery of small molecules that modulate the activity of kinase enzymes through the in vitro selection of this DNA-templated small-molecule macrocycle library against 36 biomedically relevant protein targets. DNA encoding selection survivors was amplified by PCR and identified by ultra-high-throughput DNA sequencing. Macrocycles corresponding to DNA sequences enriched upon selection against several protein kinases were synthesized on a multimilligram scale. In vitro assays revealed that these macrocycles inhibit (or activate) the kinases against which they were selected with IC(50) values as low as 680 nM. We characterized in depth a family of macrocycles enriched upon selection against Src kinase, and showed that inhibition was highly dependent on the identity of macrocycle building blocks as well as on backbone conformation. Two macrocycles in this family exhibited unusually strong Src inhibition selectivity even among kinases closely related to Src. One macrocycle was found to activate, rather than inhibit, its target kinase, VEGFR2. Taken together, these results establish the use of DNA-templated synthesis and in vitro selection to discover small molecules that modulate enzyme activities, and also reveal a new scaffold for selective ATP-competitive kinase inhibition.
Microtubules are hollow tube-like biological polymers required for transport in diverse cellular ... more Microtubules are hollow tube-like biological polymers required for transport in diverse cellular contexts and are important drug targets. Microtubule function depends on interactions with associated proteins and post-translational modifications at specific sites located on its interior and exterior surfaces. However, we lack strategies to selectively perturb or probe these basic biochemical mechanisms. In this work, by combining amber suppression-mediated non-natural amino acid incorporation and tubulin overexpression in budding yeast, we demonstrate, for the first time, a general strategy for site-specific chemistry on microtubules. Probes and labels targeted to precise sites on the interior and exterior surfaces of microtubules will allow analysis and modulation of interactions with proteins and drugs, and elucidation of the functions of post-translational modifications.
Researchers seeking to improve the efficiency and cost effectiveness of the bioactive small-molec... more Researchers seeking to improve the efficiency and cost effectiveness of the bioactive small-molecule discovery process have recently embraced selection-based approaches, which in principle offer much higher throughput and simpler infrastructure requirements compared with traditional small-molecule screening methods. Since selection methods benefit greatly from an information-encoding molecule that can be readily amplified and decoded, several academic and industrial groups have turned to DNA as the basis for library encoding and, in some cases, library synthesis. The resulting DNA-encoded synthetic small-molecule libraries, integrated with the high sensitivity of PCR and the recent development of ultra high-throughput DNA sequencing technology, can be evaluated very rapidly for binding or bond formation with a target of interest while consuming minimal quantities of material and requiring only modest investments of time and equipment. In this tutorial review we describe the development of two classes of approaches for encoding chemical structures and reactivity with DNA: DNA-recorded library synthesis, in which encoding and library synthesis take place separately, and DNA-directed library synthesis, in which DNA both encodes and templates library synthesis. We also describe in vitro selection methods used to evaluate DNA-encoded libraries and summarize successful applications of these approaches to the discovery of bioactive small molecules and novel chemical reactivity.
Pharmacologic agents capable of increasing kinase function would be useful for treating diseases ... more Pharmacologic agents capable of increasing kinase function would be useful for treating diseases associated with reduced kinase activity, such as inherited forms of Parkinson's disease. In this issue, Hertz et al. report an innovative approach for activating the Parkinson's-associated kinase PINK1 in cells with an ATP-derived neo-substrate.
Despite decades of speculation that inhibiting endogenous insulin degradation might treat type-2 ... more Despite decades of speculation that inhibiting endogenous insulin degradation might treat type-2 diabetes, and the identification of IDE (insulin-degrading enzyme) as a diabetes susceptibility gene, the relationship between the activity of the zinc metalloprotein IDE and glucose homeostasis remains unclear. Although Ide(-/-) mice have elevated insulin levels, they exhibit impaired, rather than improved, glucose tolerance that may arise from compensatory insulin signalling dysfunction. IDE inhibitors that are active in vivo are therefore needed to elucidate IDE's physiological roles and to determine its potential to serve as a target for the treatment of diabetes. Here we report the discovery of a physiologically active IDE inhibitor identified from a DNA-templated macrocycle library. An X-ray structure of the macrocycle bound to IDE reveals that it engages a binding pocket away from the catalytic site, which explains its remarkable selectivity. Treatment of lean and obese mice with this inhibitor shows that IDE regulates the abundance and signalling of glucagon and amylin, in addition to that of insulin. Under physiological conditions that augment insulin and amylin levels, such as oral glucose administration, acute IDE inhibition leads to substantially improved glucose tolerance and slower gastric emptying. These findings demonstrate the feasibility of modulating IDE activity as a new therapeutic strategy to treat type-2 diabetes and expand our understanding of the roles of IDE in glucose and hormone regulation.
Uploads
Papers by Ralph Kleiner