[go: up one dir, main page]

WO2003061596A2 - Methods and reagents for generating chimeric serum peptide carri ers - Google Patents

Methods and reagents for generating chimeric serum peptide carri ers Download PDF

Info

Publication number
WO2003061596A2
WO2003061596A2 PCT/US2003/002085 US0302085W WO03061596A2 WO 2003061596 A2 WO2003061596 A2 WO 2003061596A2 US 0302085 W US0302085 W US 0302085W WO 03061596 A2 WO03061596 A2 WO 03061596A2
Authority
WO
WIPO (PCT)
Prior art keywords
test
protein
receptor
peptide
cell
Prior art date
Application number
PCT/US2003/002085
Other languages
French (fr)
Other versions
WO2003061596A3 (en
Inventor
Jeno Gyuris
Aaron J. Morris
Scott Wick
Original Assignee
Gpc Biotech, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gpc Biotech, Inc. filed Critical Gpc Biotech, Inc.
Priority to AU2003222199A priority Critical patent/AU2003222199A1/en
Publication of WO2003061596A2 publication Critical patent/WO2003061596A2/en
Publication of WO2003061596A3 publication Critical patent/WO2003061596A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/02Libraries contained in or displayed by microorganisms, e.g. bacteria or animal cells; Libraries contained in or displayed by vectors, e.g. plasmids; Libraries containing only microorganisms or vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1037Screening libraries presented on the surface of microorganisms, e.g. phage display, E. coli display
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • G01N33/5082Supracellular entities, e.g. tissue, organisms
    • G01N33/5085Supracellular entities, e.g. tissue, organisms of invertebrates

Definitions

  • High throughput screening has become a dominant tool in the pharmaceutical industry for the discovery of lead compounds that can be modified into candidates for drug development. For instance, it is abundantly used for identification of ligands with high affinity for receptors.
  • combinatorial techniques have provided approaches to generating and deconvoluting large libraries of test compounds in high throughput screens. It involves selection and amplification of a subset of molecules with desired biological properties from complex libraries.
  • One technique which has emerged for identification of peptide leads involves the use of peptide display methodologies such as phage display.
  • Phage-displayed peptide libraries can comprise vast collections of short, randomized polypeptides that are displayed on the surface of a filamentous bacteriophage particle.
  • each "peptide” is actually the N-terminal sequence of a phage-coat protein, that is encoded by a randomly-mutated region of the phage genome responsible for the production of the coat protein.
  • each unique peptide in the library is physically linked with the DNA molecule encoding it.
  • Antibodies and other binding molecules can be used as "targets” to specifically select rare phage clones bearing ligand peptides, and sequencing of the corresponding viral DNA will reveal their amino acid sequences.
  • Relatively high-affinity peptides for a variety of peptide- and non-peptide-binding targets have been affinity-isolated from epitope libraries. This technology has been used to map epitopes on proteins and to find peptide mimics for a variety of target molecules. Many powerful applications can be envisioned in the areas of drug design and the development of diagnostic markers, vaccines and toleragens.
  • RES reticuloendothelial system
  • many pharmaceutical companies and other institutions have devoted considerable time and resources into extending the duration of peptide drugs in the human body.
  • the advantages of having a patient take such a drug less often are numerous; such as, better compliance, more predictable concentrations in the body, and fewer side effects from the sudden rush of medication shortly after it is administered. All medications, especially those given prophylactically or for a long period of time, are more readily accepted by the patient if they need to be taken less often.
  • Peptides which are most active in such binding assays may not be the most active peptides from the library when formatted as part of a chimeric serum protein, e.g., as a consequence to a different steric, electronic and hydrophobicity environment. Indeed, in that context, other peptide sequences firom the library may in fact be more ideally suited for incorporation in a chimeric serum peptide, but are not selected due to the original format of the peptide library.
  • One aspect of the invention provides a method for generating a chimeric serum peptide (CSP) with a selected biological activity, comprising:
  • CSPs test chimeric serum peptides
  • test CSPs in a secretion mode, expressing, preferably simultaneously, the enriched test CSP sub-population under conditions wherein the test CSPs are secreted and are free of the display packages; and, (iv) assessing the ability of the secreted test CSPs to regulate a selected biological activity in a target cell;
  • the invention provides a method for generating a chimeric serum peptide (CSP) with a selected biological activity, comprising:
  • test peptide sequences are separated from the serum protein sequences by splice sites that are functional in eukaryotic cells, but not in prokaryotic cells;
  • test peptide sub-population in a secretion mode, expressing, preferably simultaneously, the enriched test peptide sub-population under conditions wherein the test peptide is flanked by serum protein sequences, such that the test peptide is in the form of a test chimeric serum peptide (CSP), and the test CSPs are secreted and are free of the display packages; and,
  • the display library can be a phage display library, e.g., which utilizes phage particles such as M13, fl, fd, Ifl, Ike, Xf, Pfl, Pf3, ⁇ , T4, T7, P2, P4, ⁇ X-174, MS2 or f2.
  • the phage display library is generated with a filamentous bacteriophage specific for Escherichia coli and the phage coat protein is coat protein III or coat protein VIII.
  • the filamentous bacteriophage can be Ml 3, fd, and fl.
  • the display library is a bacterial cell-surface display library or a spore display library.
  • the test CSPs are enriched from the display library in the display mode by a differential binding means comprising affinity separation of test CSPs which specifically bind the cell or component thereof from test CSPs which do not.
  • the differential binding means can include panning the display library on whole cells, affinity chromatographic means in which a component of a cell is provided as part of an insoluble matrix (e.g., a cell surface protein attached to a polymeric support), and/or immunoprecipitating the display packages.
  • the test CSPs in the display mode, can be enriched for those which bind to a cell-type specific marker and/or a cell surface receptor protein.
  • test CSP library can be enriched in the display mode for test CSPs which bind to a G-protein coupled receptor, such as a chemoattractant peptide receptor, a neuropeptide receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, or a polypeptide hormone receptor.
  • G-protein coupled receptor such as a chemoattractant peptide receptor, a neuropeptide receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, or a polypeptide hormone receptor.
  • the G-protein coupled receptor can also be ⁇ lA-adrenergic receptor, ⁇ lB-adrenergic receptor, ⁇ 2- adrenergic receptor, ⁇ 2B-adrenergic receptor, ⁇ l -adrenergic receptor, ⁇ 2- adrenergic receptor, ⁇ 3-adrenergic receptor, ml acetylcholine receptor (AChR), m2 ACliR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine receptor, 5-HTla, 5-HTlb, 5HTl-like, 5-HTld, 5HTld-like, 5HTld beta, substance K (neurokinin A), fMLP receptor, fMLP-like receptor, angiotensin II type
  • the test CSP library can be enriched in the display mode for test CSPs which bind to a receptor tyrosine kinase, such as an EPH receptor (for example, eph, elk, eck, sek, mek4, hek, hek2, eek, erk, tyrol, tyro4, tyro ⁇ , tyro ⁇ , tyroll, cek4, cek5, cek ⁇ , cek7, cek8, cek9, ceklO, bsk, rtkl, rtk2, rtk3, mykl, myk.2, ehkl, ehk2,pagliaccio, htk, erk ox nuk receptors).
  • EPH receptor for example, eph, elk, eck, sek, mek4, hek, hek2, eek, erk, tyrol, tyro4, t
  • test CSP library can be enriched in the display mode for test CSPs which bind to a cytokine receptor or an MIRR receptor. In certain embodiments, the test CSP library can be enriched in the display mode for test CSPs which bind to an orphan receptor.
  • test CSP library can be enriched in the display mode for test CSPs which bind to extracellular proteins, such as secreted proteins, or to infectious agents, such as viruses, fungi or bacteria.
  • display library includes at least about 10 different test CSPs.
  • test CSPs are from about 3 to about 100 amino acid residues in length. In more preferred embodiments, the test CSPs are from about 4 to about 20 amino acid residues in length.
  • each of the test CSPs are encoded by a chimeric gene comprising (i) a coding sequence for the test CSP, (ii) a coding sequence for a surface protein of the display package for displaying the test CSPs on the surface of a population of display packages, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in the display mode, the chimeric gene is expressed as fusion protein including the test CSP and the surface protein, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
  • the test CSPs are expressed by a eukaryotic cell, more preferably a mammalian cell, in the secretion mode.
  • the target cell is a eukaryotic cell, more preferably a mammalian cell such as a human cell.
  • the biological activity scored for in the secretion mode includes a change in cell proliferation, cell differentiation or cell death.
  • the biological activity which is detected is changes in intracellular calcium mobilization, intracellular protein phosphorylation, phospholipid metabolism, and/or expression of cell-specific marker genes.
  • the target cell includes a reporter gene construct containing a reporter gene in operative linkage with one or more transcriptional regulatory elements responsive to the signal transduction activity of the cell surface receptor protein, expression of the reporter gene providing the detectable signal.
  • the reporter gene can encode a gene product that gives rise to a detectable signal selected from: color, fluorescence, luminescence, cell viability relief of a cell nutritional requirement, cell growth, and drug resistance.
  • the reporter gene encodes a gene product selected from: chloramphenicol acetyl transferase, beta-galactosidase or secreted alkaline phosphatase.
  • the reporter gene encodes a gene product which confers a growth signal.
  • the secretion mode includes expression of the test CSPs by a host cell co-cultured with the target cell.
  • the co-cultured host and target cells are separated by a membrane which is permeable to the test CSP.
  • the secretion mode includes assessing the ability of the secreted test CSPs to inhibit the biological activity of an exogenously added compound on the target cells.
  • display packages which bind to endothelial cells are isolated; and in step (iv) above, the ability of the secreted test CSPs to inhibit proliferation of endothelial cells is assessed.
  • the ability of the secreted test CSPs to inhibit proliferation of endothelial cells in the presence of an angiogenic amount of an endogenous growth factor can be assessed.
  • the serum protein sequence is selected from: albumins, ⁇ -globulins, ⁇ -globulins, ⁇ -globulins, haptoglobin, transthyretin, collagen, 2 macroglobulin, ⁇ 2 microglobulin, C Reactive Protein, apolipoproteins, lipoproteins, cathepsins amylase, antichymotrypsin, ferritin, ⁇ fetoprotein, elastin and fibronectin and coagulation factors including fibrinogen, fibrin, thrombin, ceruloplasmin, antiplasmin and antithrombin III, or fragments thereof.
  • the CSPs are selected, at least in part, because the CSP is more potent than the test peptide sequence by itself, e.g., not fused to a serum protein.
  • a CSP library can be assigned to identify members that are at least 10 times, 100 times, or even 1000 times more active than the test peptide sequence alone, e.g., 1, 2, or even 3 orders of magnitude more active.
  • the IC 50 of the CSP may be 10 times lower, 100 times lower, or even 1000 times lower than the IC 50 of the test peptide alone; and in embodiments wherein the CSP induces or promotes a biological activity, the EC 50 of the CSP may be 10 times lower, 100 times lower, or even 1000 times lower than the EC 50 of the test peptide alone.
  • the dissociation constant K of the CSP and the biological molecule to which it binds may be 10 times lower, 100 times lower, or even 1000 times lower than the Kd for binding between the biological molecule and the test peptide alone, e.g., binding of the two entities is increasingly favored over their dissociation.
  • the CSPs are represented by the formula A-B-C, wherein A represents a first fragment of a serum protein or homolog tliereof, B represents a test peptide sequence, and C represents another fragment of a serum protein or a homolog thereof.
  • the method further comprises formulating, with a pharmaceutically acceptable carrier, one or more test CSPs which regulate the biological activity in the target cell or peptidomimetics thereof.
  • the chimeric peptides are selected, at least in part, based on having a half-life in bodily fluids, such as urine, lymph, CSF or blood of no less than 10 days, preferably no less than about 14 days, and most preferably no less than 50% of the serum half-life of the native form of the serum protein.
  • the chimeric polypeptide is capable of binding to an extracellular receptor or ion channel.
  • the chimeric polypeptide may be an agonist or an antagonist of an extracellular receptor or ion channel.
  • the chimeric polypeptide of this embodiment may, for example, induce apoptosis, modulate cell proliferation, or modulate differentiation of cell types.
  • the test CSP library can be enriched in the display mode for test CSPs which bind to extracellular proteins, such as secreted proteins, or to infectious agents, such as viruses, fungi or bacteria.
  • the subject invention also specifically contemplates that peptides identified in the secretion mode can be converted into peptidomimietics.
  • the subject method includes the further step of formulating, with a pharmaceutically acceptable carrier, one or more test CSPs which regulate the biological activity in the target cell or peptidomimetics thereof.
  • Another aspect of the present invention provides a display library enriched for test chimeric serum peptides (CSPs) having a desired binding specificity and/or affinity for a cell or a component thereof and which regulate a biological activity in a target cell.
  • CSPs test chimeric serum peptides
  • Still another aspect of the present invention relates to a vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as a fusion protein including the test CSP and the surface protein such that the test CSP can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
  • CSP chimeric serum peptide
  • the invention provides a vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C- terminal end or internal site of the serum protein sequence, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as a test peptide protein not including the serum protein sequence and the surface protein such that the test peptide can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
  • CSP chimeric serum peptide
  • the chimeric gene can include a secretion signal sequence for secretion of the test CSP in the secretion mode, e.g., secretion of the test CSP from eukaryotic cells, preferably mammalian cells.
  • the display package is a phage, such as Ml 3, fl, fd,
  • the phage is a filamentous bacteriophage specific for Escherichia coli and the surface protein is coat protein III or coat protein VIII.
  • the filamentous bacteriophage can be Ml 3, fd, or fl.
  • each vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence, which test peptide sequence is variegated amongst members of the library, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as fusion protein including the test CSP and the surface protein such that the test CSP can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing
  • the chimeric gene further comprises a secretion signal sequence for secretion of the test CSP in the secretion mode.
  • the secretion signal sequence causes secretion of the test CSP from eukaryotic cells, such as mammalian cells.
  • the display package is a phage, such as Ml 3, fl, fd, Ifl, Ike, Xf, Pfl, Pf3, ⁇ , T4, T7, P2, P4, ⁇ X-174, MS2 or f2.
  • the phage is a filamentous bacteriophage specific for Escherichia coli and the surface protein is coat protein III.
  • the filamentous bacteriophage can be M13, fd, or fl .
  • the vector library collectively encodes at least about 10 3 different test CSPs.
  • the test CSPs are from about 3 to about 100 amino acid in length, more preferably from about 4 to about 20 amino acid residues in length.
  • Another aspect of the present invention is a cell composition comprising a population of cells containing the vector library described above.
  • Another aspect of the present invention provides a construct as shown in Figure 1, 3, 9 or 15.
  • Still another aspect of the present invention provides a method for identifying a peptide with a selected antimicrobial activity, comprising: (i) providing a recombinant host cell population which expresses a soluble peptide library comprising a variegated population of test chimeric serum proteins (CSPs), which includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence;
  • CSPs test chimeric serum proteins
  • the target microorganism is a bacteria or a fungus.
  • the host cells are cultured on agar embedded with the target microorganisms.
  • antimicrobial activity of a test CSP can be determined by zone clearing in the agar.
  • the chimeric serum peptides of the present invention may also be useful in determining the three dimensional structure of a protein, for example, throughout the use of nuclear magnetic resonance (NMR) or x-ray crystallography.
  • NMR nuclear magnetic resonance
  • Another aspect of the present invention provides a method of conducting a drug discovery business comprising: i) identifying, by the method of the present invention, a chimeric serum peptides having a desired biological activity; ii) conducting therapeutic profiling of the chimeric serum protein identified in step (i) for efficacy and toxicity in mammals; and, formulating a pharmaceutical preparation including a chimeric serum protein identified in step (ii) as having an acceptable therapeutic profile.
  • the method can include an additional step of establishing a distribution system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a sales group for marketing the pharmaceutical preparation.
  • Another aspect of the present invention provides a method of conducting a drug discovery business comprising: i) identifying, by the method of the present invention, a chimeric serum peptides having a desired biological activity; ii) conducting therapeutic profiling of the chimeric serum protein identified in step (i) for efficacy and toxicity in mammals; and, iii) licensing, to a third party, the rights for further drug development of one or more a chimeric serum peptides identified in step (ii) as having an acceptable therapeutic profile.
  • Another aspect of the present invention provides a method of conducting a drug discovery business comprising: i) identifying, by the method of the present invention, a chimeric serum peptides having a desired biological activity; ii) licensing, to a third party, the rights for further drug development based on one or more chimeric serum peptides identified in step (i).
  • Figure 1 Schematic of pSWl expression plasmid.
  • Figure 2 Original vector sequence for pSWl phage construct.
  • Figure 3 Comparison of pSWl and pSW2 expression plasmids.
  • FIG. 4 Sequence for pSW2 (cassette version).
  • Figure 6 Reduction of flanking amino acids around peptide libraries.
  • Figure 7 Engineering of sites to allow fusion to carrier proteins in vitro.
  • Figure 8 Insertion of peptide libraries.
  • Figure 10 Sequences for pSW3, serum albumin construct.
  • Figure 11 Insertion of N-terminus of SA as Xbal/Sall PCR fragments.
  • Figure 12 Comparison of splice acceptor sequence in pSW2 and pSW3.
  • Figure 13 Certain vector sequences for pSW3.
  • Figure 14 Movement of peptide sequences to pCDNA3 + MSA for expression and purification of MSA-peptide protein.
  • Figure 15 Comparison of pSW2 and pSW4 expression vectors.
  • Figure 16 Sequences for pSW4, display of MSA + peptide domain I of phage coat.
  • Figure 17 Insert SA pre-pro secretion signal sequence as Xbal/Sall site.
  • Figure 18 Insert Domain I of MSA (aal-96) as Hindlll/BamHI with engineered Sphl and Haindlll within MSA to allow insertion of oligoes and encoding peptide libraries.
  • Figure 19 Insert libraries as Sphl/Hindlll oligomers.
  • Figure 20 Splice acceptor sequence for pSW4.
  • Figure 21 Reasoning behind two RBS signals for pill phage display.
  • Figure 22 Sequence for pSW2 + phage displayed serum albumin construct.
  • the present invention makes available a powerful directed approach for isolating biologically active peptides in the context of a chimeric serum protein.
  • One aspect of the present invention is the synthesis of a binary method that combines variegated display libraries, e.g., in a "display mode", with soluble secreted peptide libraries, e.g., in a "secretion mode", to yield a method for the efficient isolation of peptides having a desired biological activity.
  • the peptides of the library are provided as part of a chimeric serum protein or relevant fragment thereof.
  • the subject method accesses the relevant 3 -dimensional conformational space for the peptides such that there is a degree of confidence that isolation and testing is optimized for the final use of the peptide in a chimeric serum protein.
  • a chimeric serum protein library can first be reduced in complexity by panning or other affinity purification techniques.
  • the subject method selects peptides, in the context of a chimeric serum protein, having a certain affinity profile, e.g., a specificity and/or binding affinity for a discrete cell or protein or other cellular component thereof by (i) displaying the chimeric serum protein on the outer surface of a replicable genetic display package to create a display library, and (ii) using affinity selection teclmiques to emich the population of display packages for those containing chimeric serum peptides which have a desired binding specificity for the target cell or cellular component (herein collectively referred to as the "target").
  • affinity profile e.g., a specificity and/or binding affinity for a discrete cell or protein or other cellular component thereof by (i) displaying the chimeric serum protein on the outer surface of a replicable genetic display package to create a display library, and (ii) using affinity selection tec
  • the resulting sub-library is then utilized in a secretion mode whereby the chimeric serum peptides are secreted as soluble extracellular factors and their effect as a paracrine or autocrine factor is scored. That is, the secretion mode measures biological activity of the chimeric serum peptides in order to distinguish between agonist, antagonist, and inactive peptide sequences with regard to regulating a particular biological response of, e.g., a test cell or tissue.
  • the display mode and secretion mode can be carried out without the need to sub-clone the coding sequences for each test chimeric serum protein into another vector.
  • Figures 9 and 15 show exemplary vectors, pSW3 and pSW4, for sequential use in both the display and secretion modes.
  • vectors which can be used in preferred embodiments.
  • a test peptide is fused to a bacterial gene III, with mammalian splice donor and acceptor sites surrounding the variegated peptide.
  • the vectors will produce a fusion protein consisting of a secretion signal sequence, the test peptide and the remaining C-terminal portion of the bacterial gene III.
  • the resulting chimeric protein is capable of being incorporated into an Ml 3 phage particle (display mode).
  • the Ml 3 coding sequences are removed from the mature mRNA and the test peptide is inserted into an appropriate loop of serum albumin via splice sites.
  • the test peptide will be expressed and secreted as part of a chimeric serum protein, CSP, such as chimeric albumin.
  • CSP chimeric serum protein
  • the vector comprises the chimeric serum peptide - with the test peptide fused to bacterial gene III already displayed in an appropriate loop of a serum protein, such as serum albumin, surrounded by splice sites.
  • the vectors will produce the fusion protein consisting of a secretion signal sequence, the CSP, and the remaining C-terminal portion of the bacterial gene III protein, and incorporate it into an Ml 3 phage particle (display mode).
  • the CSP rather than the test peptide alone, will be displayed.
  • the Ml 3 coding sequences are removed from the mature mRNA by virtue of splice sites.
  • the CSP is left and will be secreted.
  • the serum protein or peptide such as serum albumin
  • the serum peptide is normally a smaller peptide fragment, such as a peptide of serum albumin.
  • the serum peptide may comprise one or more domains of serum albumin, such as domain 1 (approximately amino acids 1 to 96 or amino acids 1 to 106).
  • the mature mRNA in mammalian cells, encodes a secretion signal sequence and the chimeric serum protein alone, which is secreted as a soluble peptide from the cell.
  • the chimeric serum peptides of the invention include peptide sequences which make the protein useful as a therapeutic or diagnostic agent.
  • Non-limiting examples of activity which can be imparted by the chimeric peptide sequence include enzyme inhibition, hormone agonism or antagonism, cytokine agonism or antagonism, analgesics, antipyretic, anti-inflammatory, antibiotic, antiviral, antisepsis, anti-fungal, cardiovascular agonism or antagonism, angiogenic agonism or antagonism, renal function and electrolyte metabolism agonism or antagonism, and chemotherapeutic drugs, to name but a few.
  • the improved pharmacokinetic or pharmacodynamic properties of the chimeric serum peptides can, in certain embodiments, provide for low-dose pharmaceutical formulations and novel pharmaceutical compositions.
  • the invention provides for methods of using the novel compositions including the therapeutic or diagnostic use of the chimeric peptides.
  • peptide refers to an oligomer in which the monomers are amino acids (usually alpha-amino acids) joined together through amide bonds. Peptides are two or more amino acid monomers long, but more often are between three and 100 amino acids in length. In preferred embodiments, the peptides may be from about 4 to about 20 amino acids or more, although peptides longer than 20 amino acids can also be referred to as "polypeptides.”
  • protein is well known in the art and usually refers to a very large polypeptide, or set of associated homologous or heterologous polypeptides, that has some biological function.
  • peptide and “polypeptide” are largely interchangeable, and refer to domains or regions of proteins which may be primarily or secondarily responsible for some activity of a protein.
  • chimeric serum peptide and “CSP” are used interchangeably and refer to serum peptides which have been recombinantly engineered to include one or more heterologous amino acid sequences, peptides, at internal positions within the sequence of the serum protein, or appended at the amino- or carboxy-terminus of the protein. Unless otherwise apparent from the context, the term covers chimeric peptides which include the full-length sequence of a serum protein, as well as chimeric peptides in which only a fragment of the serum protein is retained.
  • inserted into is used herein to include both insertion of a first sequence between two amino acids of a second sequence, and replacement of one or more amino acids of the second sequence with the amino acids of the first sequence (e.g., replacing one or more amino acids of the second sequence with a first sequence of amino acids having the same or a different number of amino acids), unless the latter is clearly excluded.
  • test CSP refers to a chimeric serum peptide which is being investigated for activity, e.g., being selected form a library or tested for activity on a cell.
  • test peptide sequence refers to the peptide sequence(s) which may be inserted into a serum protein and included in a CSP.
  • fusion and chimeric, as used herein to describe proteins and polypeptides, relate to polypeptides or proteins wherein two individual polypeptides or portions thereof are fused to form a single amino acid chain. Such fusion may arise from the expression of a single continuous coding sequence formed by recombinant DNA techniques.
  • fusion polypeptides and “chimeric” polypeptides include contiguous polypeptides comprising a first polypeptide covalently linked via an amide bond to one or more amino acid sequences which define polypeptide domains that are foreign to and not substantially homologous with any domain of the first polypeptide.
  • Gene constructs encoding fusion proteins are likewise referred to a "chimeric genes" or “fusion genes”.
  • "Homology” and “identity” each refer to sequence similarity between two polypeptide sequences, with identity being a more strict comparison. Homology and identity can each be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When a position in the compared sequence is occupied by the same amino acid residue, then the polypeptides can be referred to as identical at that position; when the equivalent site is occupied by the same amino acid (e.g., identical) or a similar amino acid (e.g., similar in steric and/or electronic nature), then the molecules can be referred to as homologous at that position.
  • a percentage of homology or identity between sequences is a function of the number of matching or homologous positions shared by the sequences.
  • An "unrelated”, “heterologous”, or “non-homologous” sequence shares less than 40 percent identity, though preferably less than 25 percent identity, with a sequence to which it is compared.
  • a “heterologous peptide sequence” is a peptide sequence substantially dissimilar to a sequence to which it is compared.
  • serum albumin (S A) is intended to include (but not necessarily to be restricted to) serum albumin proteins of living organisms, preferably mammalian serum albumins, even more preferably known or yet-to-be-discovered polymorphic forms of human serum albumin (HSA), and fragments and variants thereof.
  • HSA human serum albumin
  • fragments and variants thereof domain 1 of serum albumin (aa 1-96, or aa 1-106) can be used in place of the full protein.
  • fragments is intended to include (but not necessarily be restricted to) one or more functional domains of serum albumin, including at least one loop selected from (Cys 53 -Cys 62 , Cys 75 -Cys 91 , Cys 90 -Cys 101 , Cys 245 -Cys 253 , Cys 266 -Cys 279 , Cys 360 -Cys 369 , Cys 461 -Cys 477 , Cys 476 -Cys 487 , and Cys 558 -Cys 567 ).
  • variants the human serum albumin Naskapi has Lys-372 in place of Glu-372, and albumin Wales has an altered pro-sequence.
  • variants is intended to include (but not necessarily be restricted to) homologs of SA proteins with minor artificial variations in sequence (such as molecules lacking one or a few residues, having conservative substitutions or minor insertions of residues, or having minor variations of amino acid structure).
  • polypeptides which have 80%, 85%, 90%, or 99% homology with a native SA are deemed to be “variants.” It is also preferred for such variants to share at least one pharmacological utility with a native SA.
  • Any putative variant which is to be used pharmacologically should be non-immunogenic in the animal (especially human) being treated.
  • Sequences of a number of contemplated serum albumin proteins can be obtained from GenBank (National Center for Biotechnology Information), including human, bovine, mouse, pig, horse, sheep, and chick serum albumins.
  • “native” is used to describe a protein which occurs naturally in a living organism. Wild-type proteins are thus native proteins. Proteins which are non- native are those which have been generated by artificial mutation, recombinant design, or other laboratory modification and are not known in natural populations. "Conservative substitutions" are those where one or more amino acids are substituted for others having similar properties such that one skilled in the art of polypeptide chemistry would expect at least the secondary structure, and preferably the tertiary structure, of the polypeptide to be substantially unchanged. For example, typical such substitutions include asparagine for glutamine, serine for asparagine, and arginine for lysine.
  • physiologically functional equivalents also encompasses larger molecules comprising the native sequence plus a further sequence at the N-terminus (for example, pro-HSA, pre-pre-HSA, and met-HSA).
  • Tertiary structure refers to the three-dimensional structure of a protein. Proteins which have similar tertiary structures will have similar shapes and surfaces, even if the amino acid sequences (the "secondary structure") is not identical. Tertiary structure is a consequence of the folding and twisting of an amino acid chain upon itself and can be disrupted by chemical means, e.g., strong acid or base, or by physical means, e.g., heating.
  • the term "simultaneously expressing” refers to the expression of a representative population of a peptide library, or chimeric peptide library as the case may be, e.g., at least 50 percent, more preferably 75, 80, 85, 90, 95 or 98 percent of all the different peptide sequences of a library. "Simultaneous expression” may be independent for each member of a library, but may share common inducing factors, such that the expression of each takes place at approximately the same time interval.
  • random peptide library refers to a set of random or semi-random peptide sequences, as well as sets of fusion proteins containing those random peptides (as applicable). As used herein, the term will be understood to include libraries of peptide sequences provided in the form of chimeric serum peptides.
  • the term "effective amount” refers to an amount sufficient to induce a statistically significant result.
  • ligand refers to a molecule that is recognized by a particular protein, e.g., a receptor. Any agent bound by or reacting with a protein is called a "ligand,” so the term encompasses the substrate of an enzyme and the reactants of a catalyzed reaction.
  • ligand does not imply any particular molecular size or other structural or compositional feature other than that the substance in question is capable of binding or otherwise interacting with a protein.
  • a “ligand” may serve either as the natural ligand to which the protein binds or as a functional analogue that may act as an agonist or antagonist.
  • the language “replicable genetic display package” or “display package” describes a biological particle which has genetic information providing the particle with the ability to replicate.
  • the package can display a chimeric serum protein, e.g., including a peptide derived from the variegated peptide library.
  • the test peptide portion of the chimeric serum protein is presented by the display package in a context which permits the test CSP portion of the CSP to bind to a target that is contacted with the display package.
  • the display package will generally be derived from a system that allows the sampling of very large variegated peptide libraries.
  • the display package can be, for example, derived from vegetative bacterial cells, bacterial spores, and bacterial viruses.
  • differential binding means refer to the separation of members of the display library based on the differing abilities of CSP sequences on the surface of each of the display packages of the library to bind to the target.
  • the differential binding of a target by different members of the display library can be used in the affinity separation of tliose cliimeric serum peptides which specifically bind the target from those which do not.
  • the affinity selection protocol can also include a pre- or post-enrichment step wherein display packages capable of binding "background targets", e.g., as a negative selection, are removed from the library.
  • affinity selection means include affinity chromatography, immunoprecipitation, fluorescence activated cell sorting, agglutination, and plaque lifts.
  • affinity chromatography includes bio-panning techniques using either purified, immobilized target proteins or the like, as well as whole cells.
  • the phrases “individually selective manner” and “individually selective binding”, with respect to binding of a test CSP with a target protein, refers to the binding of a peptide to a certain protein target which binding is specific for, and dependent on, the molecular identity of the protein target.
  • solid support refers to a material having a rigid or semi-rigid surface. Such materials will preferably take the form of small beads, pellets, disks, chips, dishes, multi-well plates, wafers or the like, although other forms may be used. In some embodiments, at least one surface of the substrate will be substantially flat.
  • surface refers to any generally two-dimensional structure on a solid substrate and may have steps, ridges, kinks, terraces, and the like without ceasing to be a surface.
  • vector refers to a DNA molecule, capable of replication in a host cell, into which a gene can be inserted to construct a recombinant DNA molecule.
  • phage vector and “phagemid” are art-recognized and generally refer to a vector derived by modification of a phage genome, containing an origin of replication for a bacteriophage, and preferably, though optional, an origin (ori) for a bacterial plasmid.
  • origin for a bacteriophage
  • origin for a bacterial plasmid.
  • the use of phage vectors rather than the phage genome itself provides greater flexibility to vary the ratio of chimeric peptide/coat protein to wild- type coat protein, as well as supplement the phage genes with additional genes encoding other heterologous polypeptides, such as "auxiliary polypeptides" which may be useful in the "dual" CSP display constructs described below.
  • helper phage describes a phage which is used to infect cells containing a defective phage genome or phage vector and which functions to complement the defect.
  • the defect can be one which results from removal or inactivation of phage genomic sequence required for production of phage particles.
  • helper phage are M13K07.
  • cell surface receptor refers to molecules that occur on the surface of cells, interact with the extracellular environment, and (directly or indirectly) transmit or transduce the information regarding the environment intracellularly in a manner that may modulate intracellular second messenger activities or transcription of specific promoters, resulting in transcription of specific genes.
  • extracellular signals include a molecule or other change in the extracellular environment that is transduced intracellularly via cell surface proteins that interact, directly or indirectly, with the signal.
  • An extracellular signal or effector molecule includes any compound or substance that in some manner alters the activity of a cell surface protein. Examples of such signals include, but are not limited to, molecules such as acetylcholine, growth factors and hormones, lipids, sugars and nucleotides that bind to cell surface and/or intracellular receptors and ion channels and modulate the activity of such receptors and channels.
  • the term also include as yet unidentified substances that modulate the activity of a cellular receptor, and thereby influence intracellular functions.
  • Such extracellular signals are potential pharmacological agents that may be used to treat specific diseases by modulating the activity of specific cell surface receptors.
  • Orphan receptors is a designation given to a receptors for which no specific natural ligand has been described and/or for which no function has been determined.
  • a "reporter gene construct” is a nucleic acid that includes a “reporter gene” operatively linked to at least one transcriptional regulatory sequence. Transcription of the reporter gene is controlled by these sequences to which they are linked. The activity of at least one or more of these control sequences can be directly or indirectly regulated by the target receptor protein. Exemplary transcriptional control sequences are promoter sequences.
  • a reporter gene is meant to include a promoter-reporter gene construct which is heterologously expressed in a cell.
  • indicator gene generically refers to an expressible (e.g., able to be transcribed and (optionally) translated) DNA sequence which is, for example, expressed in response to a signal transduction pathway modulated by a target receptor or ion channel.
  • exemplary indicator genes include unmodified endogenous genes of the host cell, modified endogenous genes, or a reporter gene of a heterologous construct, e.g., as part of a reporter gene construct.
  • Signal transduction is the processing of physical or chemical signals from the cellular environment through the cell membrane, and may occur through one or more of several mechanisms, such as activation/inactivation of enzymes (such as proteases, or other enzymes which may alter phosphorylation patterns or other post- translational modifications), activation of ion channels or intracellular ion stores, effector enzyme activation via guanine nucleotide binding protein intermediates, formation of inositol phosphate, activation or inactivation of adenylyl cyclase, direct activation (or inhibition) of a transcriptional factor and/or activation.
  • modulation of a signal transduction activity of a receptor protein in its various grammatical forms, as used herein, designates induction and/or potentiation, as well as inhibition of one or more signal transduction pathways downstream of a receptor.
  • Agonists and antagonists are "receptor effector" molecules that modulate signal transduction via a receptor.
  • Receptor effector molecules are capable of binding to the receptor, though not necessarily at the binding site of the natural ligand.
  • Receptor effectors can modulate signal transduction when used alone, i.e. can be surrogate ligands, or can alter signal transduction in the presence of the natural ligand, either to enhance or inhibit signaling by the natural ligand.
  • antagonists are molecules that block or decrease the signal transduction activity of receptor, e.g., they can competitively, noncompetitively, and/or allosterically inhibit signal transduction from the receptor, whereas "agonists” potentiate, induce or otherwise enhance the signal transduction activity of a receptor.
  • receptor activator and “surrogate ligand” refer to an agonist which induces signal transduction from a receptor.
  • compound as used herein is meant to include both exogenously added test compounds and peptides or chimeric serum peptides expressed from a library of the present invention.
  • biologically active refers to an entity which interacts in some way with a living organism on a molecular level. Entities which are biologically active may activate a receptor, provoke an immune reaction, interact with a membrane or ion channel, or otherwise induce a change in a biological function of an organism or any part of an organism.
  • target cells means cells, either in vivo or ex vivo, for which it is desired to modify the behavior tlirough interaction with a ligand.
  • the ligand may interact with a target cell via binding to a cellular receptor.
  • the ligand may be a peptide, or a synthetic molecule which is able to mimic the effects of such a peptide.
  • Target cells may be any type of cell, including blood cells, skeletal muscle cells, stem cells, skin cells, liver cells, secretory gland cells, hematopoietic cells, and marrow cells.
  • “Serum half-life” as used herein refers to the time required for half of a quantity of a peptide in the bloodstream to be degraded.
  • the subject peptide sequences are included as part of a fusion protein with a serum protein, being added at either the N- or C- terminus of the proteins, or at one or more internal sites.
  • serum proteins which can be used in the present invention include albumin, -globulins, ⁇ -globulins, ⁇ -globulins, haptoglobin, transthyretin, collagen, 2 macroglobulin, ⁇ 2 microglobulin, C Reactive Protein, apolipoproteins, lipoproteins, cathepsins amylase, antichymotrypsin, ferritin, ⁇ fetoprotein, elastin and fibronectin and coagulation factors including fibrinogen, fibrin, thrombin, ceruloplasmin, antiplasmin and antithrombin III, and the like.
  • the bioavailability of the various proteins may also be a consideration.
  • the CSP is selected to be one which is capable of crossing the blood-brain barrier by transcytosis.
  • the CSP can be generated by use of scaffold sequences from insulin, transferrin, IGF-I, IGF-II, basic albumin or prolactin.
  • the variegated peptide sequences of the CSP are in the range of from about 3 to about 100 amino acids in length, more preferably from about 4 to about 50 amino acids in length or from about 4 to about 20 amino acids, and even more preferably at least 5, 10, 13, 15, 20 or 25 amino acid residues in length.
  • test CSP is flanked by cysteine residues in order to provide a constrained environment.
  • test peptide portion of the CSP may be represented in the general formula Cys-(Xaa) 3- 3 -Cys, where Xaa is independently selected for each position in the peptide chain.
  • one or more heterologous peptide fragments are inserted into a serum albumin protein or a homolog thereof.
  • the heterologous peptide fragment may optionally replace a portion of the serum protein sequence.
  • a peptide fragment which replaces a portion of the serum protein sequence need not be of the same length as the fragment it replaces.
  • a chimeric serum protein according to this aspect may include more than one heterologous peptide fragments which replaces a portion of the serum protein sequence.
  • the included fragments may be identical or different, and may be random, semi-random or sequences from a protein unrelated to serum protein.
  • a CSP of this aspect may comprise the structure A-B-C, wherein A represents a first fragment of a serum protein or homolog thereof, B represents a test peptide sequence, and C represents another fragment of a serum protein or a homolog thereof.
  • a chimeric polypeptide may comprise the structure A-B-C-D-E, wherein A, C, and E represent fragments of a serum protein and B and D represent test peptide sequences, which may be the same or different.
  • the test peptide portion includes at least 6 amino acids, at least 12 amino acids, or at least 18 amino acids.
  • Serum albumin for example, is the major protein constituent of the circulatory system, has a half-life in the blood of about three weeks (Rothschild, M.A. et al. Hepatology 1988, 8, 385-401), and is present in quantity (40 g/L in the serum). It is also l ⁇ iown that the normal adult human liver produces approximately 15 grams of human serum albumin (HSA) per day, or about 200 mg per kilogram of body weight. Serum albumin has no immunological activity or enzymatic function, and is a natural carrier protein used to transport many natural and therapeutic molecules.
  • Fusion proteins wherem a therapeutic polypeptide has been covalently linked to serum albumin have been shown to have serum half-lives many times longer than the half- life of the therapeutic peptide itself (Syed, S. et al. Blood 1997, 89, 3243-3252; Yeh, P. et al. Proc. Natl. Acad. Sci. USA 1992, 89, 1904-1908).
  • the half-life of the fusion protein was more than 140 times greater than that of the therapeutic polypeptide itself, and approached the half-life of unfused serum albumin.
  • the amino-terminal portion of serum albumin has been found to favor particularly efficient translocation and export of the fusion proteins in eukaryotic cells (PCT publication WO 90/13653).
  • cysteine-constrained loops may be selected for replacement, e.g., on the presumption that structural changes to the loop are likely to minimally affect the tertiary structure of the protein as a whole.
  • Figures 4A-I of U.S.S.N. 09/768,183 show the locations of several such loops on the mouse serum albumen protein.
  • the present invention contemplates insertion into or replacement of any one of the loops in serum albumin that meet this criteria, or any combination of such loops.
  • a loop selected for insertion or replacement is located at or near the surface of the serum albumen protein to facilitate intermolecular interactions.
  • serum albumen proteins e.g., bovine, human, and other serum albumen proteins.
  • a chimeric polypeptide of the present invention may include a biologically active heterologous peptide sequence inserted into the peptide sequence of a serum albumin protein.
  • the inserted sequence may optionally replace a portion of the serum albumin sequence, whether that portion is of similar or dissimilar length. In some cases, more than one insertion may be required to obtain the desired biological activity.
  • a biologically active heterologous peptide sequence may be placed between two fragments of a serum albumin sequence to create such a chimeric polypeptide.
  • one or more additional biologically active peptide sequences may be placed between fragments of serum albumin protein.
  • Chimeric polypeptides of the present invention may also be described as a biologically active heterologous peptide sequence flanked on one side by an N-terminal fragment of serum albumin protein and on the other side by a C-terminal fragment of serum albumin protein.
  • a space-filling model of human serum albumin (HSA) has been previously generated (see Figure 1 of U.S.S.N. 09/768,183). The tertiary structure of HSA reveals the presence often approximate helical regions or loops, each constrained by disulfide bonded cysteine pairs. The space-filling model was used to predict loop regions that are exposed on the surface of the protein.
  • Two amino acid segments were chosen to represent surface exposed regions (loop 53-62 and loop 360-369) and a third to represent a region assumed to be buried within the protein (loop 450-463).
  • These and other candidate loops (Cys 53 -Cys 62 , Cys 75 -Cys 91 , Cys 90 -Cys 101 , Cys 245 -Cys 253 , Cys 266 -Cys 279 , Cys 360 -Cys 369 , Cys 461 - Cys 477 , Cys 476 -Cys 487 , and Cys 558 -Cys 567 ) are depicted in Figures 4A-I of U.S.S.N. 09/768,183. These amino acid segments represent surface exposed regions.
  • the variegated peptide libraries of the subject method can be generated by any of a number of methods, and, though not limited by, preferably exploit recent trends in the preparation of chemical libraries. For instance, chemical synthesis of a degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector. The purpose of a degenerate set of genes is to provide, in one mixture, all of the sequences encoding the desired set of potential test sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al.
  • variants refers to the fact that a population of peptides is characterized by having a peptide sequence which differ from one member of the library to the next. For example, in a given peptide library of n amino acids in length, the total number of different peptide sequences in the library is given by the product °f "!
  • the CSP display collectively produces a peptide library including at least 96 to 10 7 different chimeric serum peptides, so that diverse peptides may be simultaneously assayed for the ability to interact with the target protein.
  • the test CSP library is derived to express a combinatorial library of peptide sequences in the chimeric serum protein which are not based on any known sequence, nor derived from cDNA. That is, the sequences of the test peptides of the library are largely, if not entirely, random.
  • the peptide library is derived to express a combinatorial library of peptides sequences which are based at least in part on a known polypeptide sequence or a portion thereof (though preferably not a cDNA library). That is, the sequences of the library is semi-random, being derived by combinatorial mutagenesis of a known sequence(s). See, for example, Ladner et al. PCT publication WO 90/02909; Garrard et al, PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Griffths et al.
  • sequences for peptides that are l ⁇ iown ligands for a receptor can be mutagenized by standard techniques to derive a variegated library of polypeptide sequences which can be incorporated into the subject chimeric serum protein and screened for agonists and/or antagonists.
  • the harnessing of biological systems for the generation of peptide diversity is now a well established technique which can be exploited to generate the CSP libraries of the subject method.
  • the source of diversity is the combinatorial chemical synthesis of mixtures of oligonucleotides.
  • Oligonucleotide synthesis is a well- characterized chemistry that allows tight control of the composition of the mixtures created.
  • Degenerate DNA sequences produced are, in preferred embodiments, ligated with coding sequences for the serum protein.
  • the DNAs are synthesized a base at a time.
  • a suitable mixture of nucleotides is reacted with the nascent DNA, rather than the pure nucleotide reagent of conventional polynucleotide synthesis.
  • the second method provides more exact control over the amino acid variation.
  • trinucleotide reagents are prepared, each trinucleotide being a codon of one (and only one) of the amino acids to be featured in the peptide library.
  • a mixture is made of the appropriate trinucleotides and reacted with the nascent DNA.
  • chemical synthesis of a degenerate peptide coding sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes can then be ligated into an appropriate gene for expression.
  • the synthesis of degenerate oligonucleotides is well l ⁇ iown in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed.
  • a library of test peptides or test CSPs is expressed by a population of display packages to form a display library.
  • the display package will preferably be able to be (i) genetically altered to encode the test peptide or chimeric serum peptides, (ii) maintained and amplified in culture, (iii) manipulated to display the test peptide- containing gene product or CSP-containing gene product in a manner permitting the test peptide or CSP to interact with a target during an affinity separation step, and (iv) affinity separated while retaining the nucleotide sequence encoding the test peptide or test CSP (herein "test gene”) such that the sequence of the test gene can be obtained.
  • the display remains viable after affinity separation.
  • the display package comprises a system that allows the sampling of very large variegated display libraries, rapid sorting after each affinity separation round, and easy isolation of the test gene from purified display packages or further manipulation of that sequence in the secretion mode.
  • the most attractive candidates for this type of screening are prokaryotic organisms and viruses, as they can be amplified quickly, they are relatively easy to manipulate, and large number of clones can be created.
  • Preferred display packages include, for example, vegetative bacterial cells, bacterial spores, and most preferably, bacterial viruses (especially DNA viruses).
  • the present invention also contemplates the use of eukaryotic cells, including yeast and their spores, as potential display packages.
  • kits for generating phage display libraries e.g. the Pharmacia Recombinant Phage Antibody System, catalog no. 27- 9400-01; and the Stratagene SurfZAPTM phage display kit, catalog no. 240612
  • methods and reagents particularly amenable for use in generating the variegated display library of the present invention can be found in, for example, the Ladner et al. U.S. Patent No. 5,223,409; the Kang et al. International Publication No. WO 92/18619; the Dower et al. International Publication No. WO 91/17271; the Winter et al. International Publication WO 92/20791; the Markland et al.
  • the display means of the package will comprise at least two components.
  • the first component is a secretion signal which directs the recombinant test gene product to be localized on the extracellular side of the cell membrane (of the host cell when the display package is a phage). This secretion signal can be selected so as to be cleaved off by a signal peptidase to yield a processed, "mature" polypeptide, e.g., a mature test peptide or CSP.
  • the second component is a display anchor protein which directs the display package to associate the test gene product with its outer surface. As described below, this anchor protein can be derived from a surface or coat protein native to the genetic package.
  • the means for arraying the variegated peptide library comprises a derivative of a spore or phage coat protein amenable for use as a fusion protein.
  • an unstructured polypeptide linker region between portions of the chimeric peptide, e.g., between the test gene product and anchor polypeptide.
  • This linker can facilitate enhanced flexibility of the chimeric peptide allowing the test gene product to freely interact with a target by reducing steric hindrance between the two fragments, as well as allowing appropriate folding of each portion to occur.
  • the linker can be of natural origin, such as a sequence determined to exist in random coil between two domains of a protein.
  • the linker can be of synthetic origin.
  • the sequence (Gly4Ser)3 can be used as a synthetic unstructured linker. Linkers of this type are described in Huston et al.
  • the number of possible combinations in a peptide library can get larger as the length is increased and selection criteria for degenerating at each position is relaxed.
  • To sample as many combinations as possible depends, in part, on the ability to recover large numbers of transformants.
  • electrotransformation provides an efficiency comparable to that of phage-transfection with in vitro packaging, in addition to a very high capacity for DNA input. This allows large amounts of vector DNA to be used to obtain very large numbers of transformants.
  • Nucleic Acids Res., 16:6127-6145 may be used to transform fd-tet derived recombinants at the rate of about 10 7 transformants/ ⁇ g of ligated vector into E. coli (such as strain MCI 061), and libraries may be constructed in fd-tet Bl of up to about 3 x 10 8 members or more.
  • Increasing DNA input and making modifications to the cloning protocol within the ability of the skilled artisan may produce increases of greater than about 10-fold in the recovery of transformants, providing libraries of up to 10 10 or more recombinants.
  • an important criteria for the present selection method can be that it is able to discriminate between peptides of different affinity for a particular target, and preferentially enrich for the peptides of highest affinity. Again, this is in the context of a chimeric serum protein.
  • manipulating the display package to be rendered effectively monovalent can allow affinity enrichment to be carried out for generally higher binding affinities (i.e. binding constants in the range of 10 6 to 10 10 M" 1 ) as compared to the broader range of affinities isolable using a multivalent display package.
  • the natural (i.e. wild-type) form of the surface or coat protein used to anchor the test peptide or CSP to the display can be added at a high enough level that it almost entirely eliminates inclusion of the test peptide or CSP in the display package.
  • a vast majority of the display packages can be generated to include no more than one copy of the test peptide or CSP (see, for example, Garrad et al. (1991) Bio/Technology 9:1373-1377).
  • the library of display packages will comprise no more than 5 to 10% polyvalent displays, and more preferably no more than 2% of the display will be polyvalent, and most preferably, no more than 1% polyvalent display packages in the population.
  • the source of the wild-type anchor protein can be, for example, provided by a copy of the wild-type gene present on the same construct as the peptide fusion protein, or provided by a separate construct altogether.
  • polyvalent displays can be generated to isolate a broader range of binding affinities. Such embodiments can be useful, for example, in purification protocols where avidity can be desirable.
  • Bacteriophage are attractive prokaryotic-related organisms for use in the subject method. Bacteriophage are excellent candidates for providing a display system of the variegated peptide library as there is little or no enzymatic activity associated with intact mature phage, and because their genes are inactive outside a bacterial host, rendering the mature phage particles metabolically inert. In general, the phage surface is a relatively simple structure. Phage can be grown easily in large numbers, they are amenable to the practical handling involved in many potential mass screening programs, and they carry genetic information for their own synthesis within a small, simple package.
  • the appropriate phage to be employed in the subject method will generally depend most on whether (i) the genome of the phage allows introduction of the test gene either by tolerating additional genetic material or by having replaceable genetic material; (ii) the virion is capable of packaging the genome after accepting the insertion or substitution of genetic material; and (iii) the display of the peptide on the phage surface does not disrupt virion structure sufficiently to interfere with phage propagation.
  • phage One concern presented with the use of phage is that the morphogenetic pathway of the phage determines the environment in which the peptide will have opportunity to fold.
  • Periplasmically assembled phage are preferred as the displayed peptides may contain essential disulfides, and such peptides may not fold correctly within a cell.
  • the display package forms intracellularly (e.g., where ⁇ phage are used)
  • it has been demonstrated in other instances that disulfide-containing peptides can assume proper folding after the phage is released from the cell.
  • the preferred display means is a protein that is present on the phage surface (e.g. a coat protein).
  • Filamentous phage can be described by a helical lattice; isometric phage, by an icosahedral lattice.
  • each monomer of each major coat protein sits on a lattice point and makes defined interactions with each of its neighbors. Proteins that fit into the lattice by making some, but not all, of the normal lattice contacts are likely to destabilize the virion by aborting formation of the virion as well as by leaving gaps in the virion so that the nucleic acid is not protected. Thus in bacteriophage, unlike the cases of bacteria and spores, it is generally important to retain in the fusion anchor proteins those residues of the coat protein that interact with other proteins in the virion.
  • the entire mature protein will generally be retained with the test peptide or CSP sequence being added to the N-terminus of cpVIII, while on the other hand it can suffice to retain only the last 100 carboxy terminal residues (or even fewer) of the Ml 3 cpIII coat protein in the peptide fusion protein.
  • the test gene library is expressed and exported, as part of the fusion protein, to the bacterial cytoplasm, such as when the ⁇ phage is employed.
  • the induction of the fusion protein(s) may be delayed until some replication of the phage genome, synthesis of some of the phage structural-proteins, and assembly of some phage particles has occurred.
  • the assembled protein chains then interact with the phage particles via the binding of the anchor protein on the outer surface of the phage particle.
  • the cells are lysed and the phage bearing the library-encoded test peptide or CSP (that corresponds to the specific library sequences carried in the DNA of that phage) are released and isolated from the bacterial debris.
  • phage harvested from the bacterial debris are affinity purified. As described below, when a test peptide or CSP which specifically binds a particular target is desired, the target can be used to retrieve phage displaying the desired test peptide or CSP.
  • the phage so obtained may then be amplified by infecting into host cells. Additional rounds of affinity enrichment followed by amplification may be employed until the desired level of enrichment is reached.
  • the enriched peptide-phage can also be screened with additional detection- techniques such as expression plaque (or colony) lift (see, e.g., Young and Davis, Science (1983) 222:778-782) whereby a labeled target is used as a probe.
  • Filamentous bacteriophages which include Ml 3, fl, fd, Ifl, Ike, Xf, Pfl, and
  • Pf3 are a group of related viruses that infect bacteria. They are termed filamentous because they are long, thin particles comprised of an elongated capsule that envelopes the deoxyribonucleic acid (DNA) that forms the bacteriophage genome.
  • the F pili filamentous bacteriophage (Ff phage) infect only gram-negative bacteria by specifically adsorbing to the tip of F pili, and include fd, fl and Ml 3.
  • filamentous phage in general are attractive and Ml 3 in particular is especially attractive because: (i) the 3-D structure of the virion is known; (ii) the processing of the coat protein is well understood; (iii) the genome is expandable; (iv) the genome is small; (v) the sequence of the genome is l ⁇ iown; (vi) the virion is physically resistant to shear, heat, cold, urea, guanidinium chloride, low pH, and high salt; (vii) the phage is a sequencing vector so that sequencing is especially easy; (viii) antibiotic-resistance genes have been cloned into the genome with predictable results (Hines et al.
  • Ml 3 is a plasmid and transformation system in itself, and an ideal sequencing vector. Ml 3 can be grown on Rec-strains of E. coli. The Ml 3 genome is expandable (Messing et al. in The Single-Stranded DNA Phages, eds Denhardt et al. (NY: CSHL Press, 1978) pages 449-453; and Fritz et al., supra) and Ml 3 does not lyse cells. Extra genes can be inserted into Ml 3 and will be maintained in the viral genome in a stable manner.
  • the mature capsule or Ff phage is comprised of a coat of five phage-encoded gene products: cpVIII, the major coat protein product of gene VIII that forms the bulk of the capsule; and four minor coat proteins, cpIII and cpIV at one end of the capsule and cpVII and cpIX at the other end of the capsule.
  • the length of the capsule is formed by 2500 to 3000 copies of cpVIII in an ordered helix array that forms the characteristic filament structure.
  • the gene Ill-encoded protein (cpIII) is typically present in 4 to 6 copies at one end of the capsule and serves as the receptor for binding of the phage to its bacterial host in the initial phase of infection.
  • the phage particle assembly involves extrusion of the viral genome through the host cell's membrane. Prior to extrusion, the major coat protein cpVIII and the minor coat protein cpIII are synthesized and transported to the host cell's membrane. Both cpVIII and cpIII are anchored in the host cell membrane prior to their incorporation into the mature particle. In addition, the viral genome is produced and coated with cpV protein. During the extrusion process, cpV-coated genomic DNA is stripped of the cpV coat and simultaneously recoated with the mature coat proteins.
  • Both cpIII and cpVIII proteins include two domains that provide signals for assembly of the mature phage particle.
  • the first domain is a secretion signal that directs the newly synthesized protein to the host cell membrane.
  • the secretion signal is located at the amino terminus of the polypeptide and targets the polypeptide at least to the cell membrane.
  • the second domain is a membrane anchor domain that provides signals for association with the host cell membrane and for association with the phage particle during assembly.
  • This second signal for both cpVIII and cpIII comprises at least a hydrophobic region for spanning the membrane.
  • the 50 amino acid mature gene VIII coat protein (cpVIII) is synthesized as a 73 amino acid precoat (Ito et al. (1979) PNAS 76:1199-1203).
  • cpVIII has been extensively studied as a model membrane protein because it can integrate into lipid bilayers such as the cell membrane in an asymmetric orientation with the acidic amino terminus toward the outside and the basic carboxy terminus toward the inside of the membrane.
  • the first 23 amino acids constitute a typical signal-sequence which causes the nascent polypeptide to be inserted into the inner cell membrane.
  • SP-I signal peptidase
  • the sequence of gene VIII is known, and the amino acid sequence can be encoded on a synthetic gene.
  • Mature gene VIII protein makes up the sheath around the circular ssDNA.
  • the gene VIII protein can be a suitable anchor protein because its location and orientation in the virion are known (Banner et al. (1981) Nature 289:814-816).
  • the peptide is attached to the amino terminus of the mature Ml 3 coat protein to generate the phage display library.
  • manipulation of the concentration of both the wild-type cpVIII and Ab/cpVIII fusion in an infected cell can be utilized to decrease the avidity of the display and thereby enhance the detection of high affinity peptides directed to the target(s).
  • Another vehicle for displaying the peptide is by expressing it as a domain of a chimeric gene containing part or all of gene III, e.g., encoding cpIII.
  • expressing the peptide as a fusion protein with cpIII can be a preferred embodiment, as manipulation of the ratio of wild-type cpIII to chimeric cpIII during formation of the phage particles can be readily controlled.
  • This gene encodes one of the minor coat proteins of Ml 3.
  • Genes VI, VII, and IX also encode minor coat proteins. Each of these minor proteins is present in about 5 copies per virion and is related to morphogenesis or infection. In contrast, the major coat protein is present in more than 2500 copies per virion.
  • the gene VI, VII, and IX proteins are present at the ends of the virion; these tliree proteins are not post- translationally processed (Rasched et al. (1986) Ann Rev. Microbiol. 41:507-541).
  • the single-stranded circular phage DNA associates with about five copies of the gene III protein and is then extruded through the patch of membrane-associated coat protein in such a way that the DNA is encased in a helical sheath of protein (Webster et al. in The Single-Stranded DNA Phages, eds Dressier et al. (NY:CSHL Press, 1978).
  • the CSP gene may be fused to gene III at the site used by Smith and by de la Cruz et al, at a codon corresponding to another domain boundary or to a surface loop of the protein, or to the amino terminus of the mature protein.
  • the successful cloning strategy utilizing a phage coat protein will provide expression of a peptide chain fused to the N-terminus of a coat protein (e.g., cpIII) and transport to the inner membrane of the host where the hydrophobic domain in the C-terminal region of the coat protein anchors the fusion protein in the membrane, with the N-terminus containing the peptide chain protruding into the periplasmic space.
  • a coat protein e.g., cpIII
  • Similar constructions could be made with other filamentous phage.
  • Pf3 is a well l ⁇ iown filamentous phage that infects Pseudomonos aerugenosa cells that harbor an IncP-I plasmid.
  • aerugenosa fused in-frame to a gene fragment encoding the peptide sequence, which is fused in-frame to DNA encoding the mature Pf3 coat protein.
  • DNA encoding a flexible linker of one to 10 amino acids is introduced between the CSP gene fragment and the Pf3 coat-protein gene. This tripartite gene is introduced into Pf3 so that it does not interfere with expression of any Pf3 genes.
  • the bacteriophage ⁇ X174 is a very small icosahedral virus which has been thoroughly studied by genetics, biochemistry, and electron microscopy (see The
  • G major spike protein, 60 copies per virion
  • H minor spike protein, 12 copies per virion
  • the G protein comprises 175 amino acids
  • H comprises 328 amino acids.
  • the F protein interacts with the single-stranded DNA of the virus.
  • the proteins F, G, and H are translated from a single mRNA in the viral infected cells.
  • ⁇ X174 is not typically used as a cloning vector due to the fact that it can accept very little additional DNA.
  • mutations in the viral G gene can be rescued by a copy of the wild-type G gene carried on a plasmid that is expressed in the same host cell (Chambers et al. (1982) Nuc Acid Res 10:6465- 6473).
  • one or more stop codons are introduced into the G gene so that no G protein is produced from the viral genome.
  • the variegated test or CSP gene library can then be fused with the nucleic acid sequence of the H gene.
  • the second plasmid can further include one or more copies of the wild-type H protein gene so that a mix of H and test/H proteins will be predominated by the wild-type H upon inco ⁇ oration into phage particles.
  • Phage such as ⁇ or T4 have much larger genomes than do M13 or ⁇ X174, and have more complicated 3-D capsid structures than M13 or ⁇ PX174, with more coat proteins to choose from.
  • bacteriophage ⁇ and derivatives thereof are examples of suitable vectors.
  • the intracellular morphogenesis of phage ⁇ can potentially prevent protein domains that ordinarily contain disulfide bonds from folding correctly.
  • variegated libraries expressing a population of functional peptides, which include such bonds have been generated in ⁇ phage. (Huse et al.
  • peptide sequences When used for expression of peptide sequences (isogenous nucleotide sequences), may be readily inserted into a ⁇ vector.
  • variegated peptide libraries can be constructed by modification of ⁇ ZAP II through use of the multiple cloning site of a ⁇ ZAP II vector (Huse et al. supra).
  • Recombinant peptides are able to cross bacterial membranes after the addition of appropriate secretion signal sequences to the N-terminus of the protein (Better et al (1988) Science 240:1041-1043; and Skerra et al. (1988) Science 240:1038-1041).
  • recombinant peptides have been fused to outer membrane proteins for surface presentation.
  • one strategy for displaying peptides on bacterial cells comprises generating a fusion protein by inserting the peptide into cell surface exposed portions of an integral outer membrane protein (Fuchs et al. (1991) Bio/Technology 9:1370-1372).
  • any well-characterized bacterial strain will typically be suitable, provided the bacteria may be grown in culture, engineered to display the test gene library on its surface, and is compatible with the particular affinity selection process practiced in the subject method.
  • the preferred display systems include Salmonella typhimurium, Bacillus subtilis, Pseudomonas aeruginosa, Vibrio cholerae, Klebsiella pneumonia, Neisseria gonorrhoeae, Neisseria meningitidis, Bacteroides nodosus, Moraxella bovis, and especially Escherichia coli.
  • LamB protein of E coli is a well understood surface protein that can be used to generate a variegated library of test peptides or CSPs on the surface of a bacterial cell (see, for example, Ronco et al. (1990) Biochemie 72:183-189; van der Weit et al. (1990) Vaccine 8:269-277; Charabit et al. (1988) Gene 70:181-189; and Ladner U.S. Patent No. 5,222,409). LamB of E.
  • LamB is transported to the outer membrane if a functional N-terminal signal sequence is present (Benson et al. (1984) PNAS 81:3830-3834). As with other cell surface proteins, LamB is synthesized with a typical signal-sequence which is subsequently removed.
  • the variegated test gene library can be cloned into the LamB gene such that the resulting library of fusion proteins comprise a portion of LamB sufficient to anchor the protein to the cell membrane with the test peptide or CSP fragment oriented on the extracellular side of the membrane.
  • E. coli LamB has also been expressed in functional form in S. typhimurium (Harkki et al. (1987) Mol Gen Genet 209:607-611), V. cholerae (Harkki et al. (1986) Microb Pathol 1:283-288), and K. pneumonia (Wehmeier et al. (1989) Mol Gen Genet 215:529-536), so that one could display a population of test peptides or CSPs in any of these species as a fusion to E. coli LamB.
  • K. pneumonia K.
  • LamB a maltoporin similar to LamB which could also be used.
  • the Dl protein (a homologue of LamB) can be used (Trias et al. (1988) Biochem Biophys Ada 938:493-496).
  • other bacterial surface proteins such as PAL, OmpA, OmpC, OmpF, PhoE, pilin, BtuB, FepA, FhuA, IutA, FecA and FhuE, may be used in place of LamB as a portion of the display means in a bacterial cell.
  • the fusion protein can be derived using the FliTrxTM Random Display library (Invitrogen). That library is a diverse population of random dodecapeptides inserted within the thioredoxin active-site loop inside the dispensable region of the bacterial flagellin gene (fliC). The resultant recombinant fusion protein (FLITRX) is exported and assembled into partially functional flagella on the bacterial cell surface, displaying the random peptide library.
  • FliTrxTM Random Display library Invitrogen. That library is a diverse population of random dodecapeptides inserted within the thioredoxin active-site loop inside the dispensable region of the bacterial flagellin gene (fliC).
  • FLITRX resultant recombinant fusion protein
  • phage display proteins are fused to the N- terminus of phage coat proteins in an unconstrained manner.
  • the unconstrained molecules possess many degrees of conformational freedom which may result in the lack of proper interaction with the target molecule. Without proper interaction, many potential protein-protein interactions may be missed.
  • phage display is limited by the low expression levels of bacteriophage coat proteins.
  • FliTrxTM and similar methods can overcome this limitation by using a strong promoter to drive expression of the test peptide or CSP fusions that are displayed as multiple copies.
  • the FliTrx vector can be modified to provide, similar to the illustrated vectors of the attached figures, a vector which is differentially spliced in mammalian cells to yield a secreted, soluble CSP.
  • Bacterial spores also have desirable properties as display package candidates in the subject method. For example, spores are much more resistant than vegetative bacterial cells or phage to chemical and physical agents, and hence permit the use of a great variety of affinity selection conditions. Also, Bacillus spores neither actively metabolize nor alter the proteins on their surface. However, spores have the disadvantage that the molecular mechanisms that trigger sporulation are less well worked out than is the formation of Ml 3 or the export of protein to the outer membrane of E. coli, though such a limitation is not a serious defractant from their use in the present invention.
  • Bacteria of the genus Bacillus form endospores that are extremely resistant to damage by heat, radiation, desiccation, and toxic chemicals (reviewed by Losick et al. (1986) Ann Rev Genet 20:625-669). This phenomenon is attributed to extensive intermolecular cross-linking of the coat proteins.
  • Bacillus spores can be the preferred display package. Endospores from the genus Bacillus are more stable than are, for example, exospores from Streptomyces.
  • Bacillus subtilis forms spores in 4 to 6 hours, whereas Streptomyces species may require days or weeks to sporulate.
  • genetic knowledge and manipulation is much more developed for B. subtilis than for other spore-forming bacteria.
  • the variegated test peptide or CSP display library is subjected to affinity enrichment in order to select for test peptides or CSPs which bind preselected targets.
  • affinity separation or “affinity enrichment” includes, but is not limited to: (1) affinity chromatography utilizing immobilized targets, (2) immunoprecipitation using soluble targets, (3) fluorescence activated cell sorting, (4) agglutination, and (5) plaque lifts.
  • the library of display packages are ultimately separated based on the ability of the associated test peptide or CSP to bind the target of interest. See, for example, the Ladner et al. U.S. Patent No. 5,223,409; the Kang et al.
  • the display library will be pre-enriched for test peptides or CSPs specific for the target by first contacting the display library with any negative controls or other targets for which differential binding by the test peptide or CSP is desired. Subsequently, the non-binding fraction from that pre-treatment step is contacted with the target and members from the display which are able to specifically bind the target are isolated.
  • chromatography techniques can be adapted for use in the present invention, ranging from column chromatography to batch elution, and including ELISA and biopanning techniques.
  • the target is a component of a cell, rather than a whole cell, the target is immobilized on an insoluble carrier, such as sepharose or polyacrylamide beads, or, alternatively, the wells of a microtitre plate.
  • an insoluble carrier such as sepharose or polyacrylamide beads, or, alternatively, the wells of a microtitre plate.
  • the cells on which the target is displayed may serve as the insoluble matrix carrier.
  • the population of display packages is applied to the affinity matrix under conditions compatible with the binding of the test peptide or CSP to a target.
  • the population is then fractionated by washing with a solute that does not greatly effect specific binding of test peptides or CSPs to the target, but which substantially disrupts any non-specific binding of the display package to the target or matrix.
  • a certain degree of control can be exerted over the binding characteristics of the peptides recovered from the display library by adjusting the conditions of the binding incubation and subsequent washing.
  • the temperature, pH, ionic strength, divalent cation concentration, and the volume and duration of the washing can select for peptides within a particular range of affinity and specificity.
  • peptides This is a useful characteristic for peptides to be used in affinity purification of various proteins when gentle conditions for removing the protein from the peptide are required.
  • Specific examples are peptides which depend on Ca ++ for binding activity and which lose or gain binding affinity in the presence of EGTA or other metal chelating agent.
  • Such peptides may be identified in the recombinant peptide library by a double screening technique isolating first those that bind the target in the presence of Ca ++ , and by subsequently identifying those in this group that fail to bind in the presence of EGTA.
  • specifically bound display packages can be eluted by either specific desorption (using excess target) or non-specific desorption (using pH, polarity reducing agents, or chaotropic agents).
  • the elution protocol does not kill the organism used as the display package such that the enriched population of display packages can be further amplified by reproduction.
  • the list of potential eluants includes salts (such as those in which one of the counter ions is Na + , NH , Rb + , SO 4 2 ⁇ H 2 PO 4- , citrate, K + , Li + , Cs + , HSO 4- , CO 3 2" , Ca 2+ , Sr 2+ , Cl “ , PO 4 2” , HCO 3” , Mg 2+ , Ba 2+ , Br “ , HPO 4 2” , or acetate), acid, heat, and, when available, soluble forms of the target (or analogs thereof).
  • salts such as those in which one of the counter ions is Na + , NH , Rb + , SO 4 2 ⁇ H 2 PO 4- , citrate, K + , Li + , Cs + , HSO 4- , CO 3 2" , Ca 2+ , Sr 2+ , Cl “ , PO 4 2” , HCO 3” , Mg 2+ , Ba 2+
  • buffer components especially eluates
  • Neutral solutes such as ethanol, acetone, ether, or urea, are examples of other agents useful for eluting the bound display packages.
  • affinity enriched display packages are iteratively amplified and subjected to further rounds of affinity separation until enrichment of the desired binding activity is detected.
  • the specifically bound display packages especially bacterial cells, need not be eluted per se, but rather, the matrix bound display packages can be used directly to inoculate a suitable growth media for amplification.
  • the display package is a phage particle
  • the fusion protein generated with the coat protein can interfere substantially with the subsequent amplification of eluted phage particles, particularly in embodiments wherein the cpIII protein is used as the display anchor.
  • the peptide can be derived on the surface of the display package so as to be susceptible to proteolytic cleavage which severs the covalent linkage of at least the target binding sites of the displayed test peptide or CSP from the remaining package.
  • such a strategy can be used to obtain infectious phage by treatment with an enzyme which cleaves between the test peptide or CSP portion and cpIII portion of a tail fiber fusion protein (e.g. such as the use of an enterokinase cleavage recognition sequence).
  • DNA prepared from the eluted phage can be transformed into host cells by electroporation or well known chemical means.
  • the cells are cultivated for a period of time sufficient for marker expression, and selection is applied as typically done for DNA transformation.
  • the colonies are amplified, and phage harvested for a subsequent round(s) of panning.
  • the CSPs for each of the purified display packages can be tested for biological activity in the secretion mode of the subject method.
  • the combinatorial peptide library which has been enriched in the display mode, is transfected into and expressed by eukaryotic cells.
  • the test CSPs are secreted by the host cells and screened for biological activity.
  • the subject vectors are constructed to include eukaryotic splice sites such that, in the mature mRNA, elements required for the display mode in prokaryotic cells are spliced out - at least those elements which would interfere with the secretion mode.
  • a variety of naturally and non-naturally occurring splice sites are available in the art and can be selected for, e.g., optimization in particular eukaryotic cells selected.
  • the vectors of the subject invention are used to transfect a cell that can be co-cultured with a target cell.
  • a biologically active protein secreted by the cells expressing the combinatorial library will diffuse to neighboring target cells and induce a particular biological response, such as to illustrate, proliferation or differentiation, or activation of a signal transduction pathway which is directly detected by other phenotypic criteria.
  • the pattern of detection of biological activity will resemble a gradient function, and will allow the isolation (generally after several repetitive rounds of selection) of cells producing CSPs having certain activity in the assay.
  • antagonists of a given factor can be selected in similar fashion by the ability of the cell producing a functional antagonist to protect neighboring cells from the effect of exogenous factor added to the culture media.
  • target cells are cultured in 24-well microtitre plates.
  • Other cells are transfected with the combinatorial CSP library, recovered after the display mode step, and cultured in cell culture inserts (e.g. Collaborative Biomedical Products, Catalog #40446) that are able to fit into the wells of the microtitre plate.
  • the cell culture inserts are placed in the wells such that recombinant test CSPs secreted by the cells in the insert can diffuse through the porous bottom of the insert and contact the target cells in the microtitre plate wells. After a period of time sufficient for a secreted test CSP to produce a measurable response in the target cells, the inserts are removed and the effect of the peptides on the target cells determined.
  • the target cell is a neural crest cell and the activity desired from the test CSPs is the induction of neuronal differentiation
  • fluorescently-labeled antibodies specific for Islet- 1 or other neuronal markers can be used to score for induction in the target cells as indicative of a functional neurotrophic peptide in that well.
  • Cells from the inserts corresponding to wells which score positive for activity can be split and re-cultured on several inserts, the process being repeated until the active CSP is identified.
  • intracellular second messenger generation can be measured directly.
  • intracellular effectors include adenylyl cyclase, cyclic GMP, phosphodiesterases, phosphoinositidases, phosphoinositol kinases, and phospholipases, as well as a variety of ions.
  • the GTPase enzymatic activity by G proteins can be measured in plasma membrane preparations by determining the breakdown of ⁇ 32 P GTP using techniques that are l ⁇ iown in the art (For example, see Signal Transduction: A Practical Approach. G. Milligan, Ed. Oxford University Press, Oxford England).
  • receptors that modulate cAMP are tested, it will be possible to use standard techniques for cAMP detection, such as competitive assays which quantitate [ ⁇ HJcAMP in the presence of unlabelled cAMP.
  • Inositol lipids can be extracted and analyzed using standard lipid extraction techniques. DAG can also be measured using thin-layer chromatography. Water soluble derivatives of all three inositol lipids (IPl, IP2, IP3) can also be quantitated using radiolabelling techniques or HPLC.
  • DAG can also be produced from phosphatidyl choline.
  • the breakdown of this phospholipid in response to receptor- mediated signaling can also be measured using a variety of radiolabelling techniques.
  • the activation of phospholipase A2 can easily be quantitated using known techniques, including, for example, the generation of arachadonate in the cell.
  • proteases are induced or activated in each of several arms of divergent signaling pathways. These may be independently monitored by following their unique activities with substrates specific for each protease.
  • Such assay formats may be useful when, for example, the assay is designed to detect an agonist or antagonist of a receptor kinase or phosphatase.
  • immunoblotting (Lyons and Nelson (1984) Proc. Natl. Acad. Sci. USA 81:7426-7430) using anti-phosphotyrosine, anti- phosphoserine or anti-phosphothreonine antibodies.
  • tests for phosphorylation could be also useful when the receptor itself may not be a kinase, but activates protein kinases or phosphatase that function downstream in the signal transduction pathway.
  • MAP kinase pathway that appears to mediate both mitogenic, differentiation and stress responses in different cell types. Stimulation of growth factor receptors results in Ras activation followed by the sequential activation of c-Raf, MEK, and p44 and p42 MAP kinases (ERK1 and ERK2). Activated MAP kinase then phosphorylates many key regulatory proteins, including p90RSK and Elk-1 that are phosphorylated when MAP kinase translocates to the nucleus. Homologous pathways exist in mammalian and yeast cells. For instance, an essential part of the S.
  • cerevisiae pheromone signaling pathway is comprised of a protein kinase cascade composed of the products of the STEl l, STE7, and FUS3/KSS1 genes (the latter pair are distinct and functionally redundant). Accordingly, phosphorylation and/or activation of members of this kinase cascade can be detected and used to quantitate receptor engagement.
  • Phosphotyrosine specific antibodies are available to measure increases in tyrosine phosphorylation and phospho-specific antibodies are commercially available (New England Biolabs, Beverly, MA).
  • the signal transduction pathway of interest may upregulate expression or otherwise activate an enzyme which is capable of modifying a substrate which can be added to the cell.
  • the signal can be detected by using a detectable substrate, in which case lose of the substrate signal is monitored, or alternatively, by using a substrate which produces a detectable product.
  • the conversion of the substrate to product by the activated enzyme produces a detectable change in optical characteristics of the test cell, e.g., the substrate and/or product is cl romogenically or fluorogenically active.
  • the signal transduction pathway causes a change in the activity of a proteolytic enzyme, altering the rate at which it cleaves a substrate peptide (or simply activates the enzyme towards the substrate).
  • the peptide includes a fluorogenic donor radical, e.g., a fluorescence emitting radical, and an acceptor radical, e.g., an aromatic radical which absorbs the fluorescence energy of the fluorogenic donor radical when the acceptor radical and the fluorogenic donor radical are covalently held in close proximity. See, for example, USSN 5,527,681, 5,506,115, 5,429,766, 5,424,186, and 5,316,691; and Capobianco et al.
  • the substrate peptide has a fluorescence donor group such as 1-aminobenzoic acid (anthranilic acid or ABZ) or aminomethylcoumarin (AMC) located at one position on the peptide and a fluorescence quencher group, such as lucifer yellow, methyl red or nitrobenzo-2- oxo-l,3-diazole (NBD), at a different position near the distal end of the peptide.
  • a cleavage site for the activated enzyme will be disposed between each of the sites for the donor and acceptor groups.
  • the intramolecular resonance energy transfer from the fluorescence donor molecule to the quencher will quench the fluorescence of the donor molecule when the two are sufficiently proximate in space, e.g., when the peptide is intact.
  • the quencher is separated from the donor group, leaving behind a fluorescent fragment.
  • activation of the enzyme results in cleavage of the detection peptide, and dequenching of the fluorescent group.
  • the detectable signal can be produced by use of enzymes or chromogenic/fluorescent probes whose activities are dependent on the concentration of a second messenger, e.g., such as calcium, hydrolysis products of inositol phosphate, cAMP, etc.
  • a second messenger e.g., such as calcium, hydrolysis products of inositol phosphate, cAMP, etc.
  • the mobilization of intracellular calcium or the influx of calcium from outside the cell can be measured using standard techniques.
  • the choice of the appropriate calcium indicator, fluorescent, bioluminescent, metallochromic, or Ca ⁇ -sensitive microelectrodes depends on the cell type and the magnitude and time constant of the event under study (Borle (1990) Environ Health Perspect 84:45-56).
  • Ca "1"1' detection cells could be loaded with the Ca "1"1" sensitive fluorescent dye fura-2 or indo-1, using standard methods, and any change in Ca "1"1” measured using a fluorometer.
  • the signal transduction activity of a receptor or ion channel pathway can be measured by detection of a transcription product, e.g., by detecting receptor/channel-mediated transcriptional activation (or repression) of a gene(s).
  • Detection of the transcription product includes detecting the gene transcript, detecting the product directly (e.g., by immunoassay) or detecting an activity of the protein (e.g., such as an enzymatic activity or chromogenic/fluorogenic activity); each of which is generally referred to herein as a means for detecting expression of the indicator gene.
  • the indicator gene may be an unmodified endogenous gene of the host cell, a modified endogenous gene, or a part of a completely heterologous construct, e.g., as part of a reporter gene construct. In one embodiment, the indicator gene is an unmodified endogenous gene.
  • the instant method can rely on detecting the transcriptional level of such endogenous genes as the c-fos gene (e.g., in mammalian cells) or the Barl or Fusl genes (e.g., in yeast cells) in response to such signal transduction pathways as originating from G protein coupled receptors.
  • endogenous genes as the c-fos gene (e.g., in mammalian cells) or the Barl or Fusl genes (e.g., in yeast cells) in response to such signal transduction pathways as originating from G protein coupled receptors.
  • the transcriptional activation ability of the signal pathway can be amplified by the overexpression of one or more of the proteins involved in the intracellular signal cascade, particularly enzymes involved in the pathway.
  • JNKs Jun kinases
  • increased expression of Jun kinases (JNKs) can potentiate the level of transcriptional activation by a signal in an MEKK/JNKK pathway.
  • overexpression of one or more signal transduction proteins in the yeast pheromone pathway can increase the level of Fusl and/or Barl expression. This approach can, of course, also be used to potentiate the level of transcription of a heterologous reporter gene as well.
  • the sensitivity of an endogenous indicator gene can be enhanced by manipulating the promoter sequence at the natural locus for the indicator gene. Such manipulation may range from point mutations to the endogenous regulatory elements to gross replacement of all or substantial portions of the regulatory elements.
  • manipulation of the genomic sequence for the indicator gene can be carried out using techniques known in the art, including homologous recombination.
  • the promoter (or other transcriptional regulatory sequences) of the endogenous gene can be "switched out" with a heterologous promoter sequence, e.g., to form a chimeric gene at the indicator gene locus. Again, using such techniques as homologous recombination, the regulatory sequence can be so altered at the genomic locus of the indicator gene.
  • a heterologous reporter gene construct can be used to provide the function of an indicator gene.
  • Reporter gene constructs are prepared by operatively linking a reporter gene with at least one transcriptional regulatory element. If only one transcriptional regulatory element is included it must be a regulatable promoter. At least one the selected transcriptional regulatory elements must be indirectly or directly regulated by the activity of the selected cell- surface receptor whereby activity of the receptor can be monitored via transcription of the reporter genes.
  • reporter genes include, but are not limited to CAT (chloramphenicol acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864- 869) luciferase, and other enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. (1987), Mol. Cell. Biol. 7:725-737); bacterial luciferase (Engebrecht and Silverman (1984), PNAS 1: 4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); alkaline phosphatase (Toh et al. (1989) Eur. J. Biochem.
  • CAT chloramphenicol acetyl transferase
  • Transcriptional control elements for use in the reporter gene constructs, or for modifying the genomic locus of an indicator gene include, but are not limited to, promoters, enhancers, and repressor and activator binding sites.
  • Suitable transcriptional regulatory elements may be derived from the transcriptional regulatory regions of genes whose expression is rapidly induced, generally within minutes, of contact between the cell surface protein and the effector protein that modulates the activity of the cell surface protein. Examples of such genes include, but are not limited to, the immediate early genes (see, Sheng et al. (1990) Neuron 4: 477-485), such as c-fos.
  • Immediate early genes are genes that are rapidly induced upon binding of a ligand to a cell surface protein.
  • the transcriptional control elements that are preferred for use in the gene constructs include transcriptional control elements from immediate early genes, elements derived from other genes that exhibit some or all of the characteristics of the immediate early genes, or synthetic elements that are constructed such that genes in operative linkage therewith exhibit such characteristics.
  • the characteristics of preferred genes from which the transcriptional control elements are derived include, but are not limited to, low or undetectable expression in quiescent cells, rapid induction at the transcriptional level within minutes of extracellular simulation, induction that is transient and independent of new protein synthesis, subsequent shut-off of transcription requires new protein synthesis, and mRNAs transcribed from these genes have a short half-life. It is not necessary for all of these properties to be present.
  • VIP vasoactive intestinal peptide
  • somatostatin cAMP responsive; Montminy et al. (1986), Proc. Natl. Acad. Sci. 8.3:6682-6686
  • proenlcephalin promoter responsive to cAMP, nicotinic agonists, and phorbol esters; Comb et al.
  • a transcriptional based readout can be constructed using the cyclic AMP response element binding protein, CREB, which is a transcription factor whose activity is regulated by phosphorylation at a particular serine (SI 33).
  • CREB cyclic AMP response element binding protein
  • SI 33 serine
  • CREB binds to a recognition sequence known as a CRE (cAMP Responsive Element) found to the 5' of promoters known to be responsive to elevated cAMP levels.
  • CRE cAMP Responsive Element
  • Phosphorylation of CREB is seen in response to both increased cAMP levels and increased intracellular Ca levels. Increased cAMP levels result in activation of PKA, which in turn phosphorylates CREB and leads to binding to CRE and transcriptional activation. Increased intracellular calcium levels results in activation of calcium / calmodulin responsive kinase II (CaM kinase II). Phosphorylation of CREB by CaM kinase II is effectively the same as phosphorylation of CREB by PKA, and results in transcriptional activation of CRE containing promoters.
  • CaM kinase II calcium / calmodulin responsive kinase II
  • a transcriptionally-based readout can be constructed in cells containing a reporter gene whose expression is driven by a basal promoter containing one or more CRE. Changes in the intracellular concentration of Ca ++ (a result of alterations in the activity of the receptor upon engagement with a ligand) will result in changes in the level of expression of the reporter gene if: a) CREB is also co-expressed in the cell, and b) either an endogenous or heterologous CaM kinase phosphorylates CREB in response to increases in calcium or if an exogenously expressed CaM kinase II is present in the same cell.
  • stimulation of PLC activity may result in phosphorylation of CREB and increased transcription from the CRE-construct, while inhibition of PLC activity may result in decreased transcription from the CRE-responsive construct.
  • CNTF treatment of SK-N-MC cells leads to the enhanced interaction of STAT/p91 and STAT related proteins with specific DNA sequences suggested that these proteins might be key regulators of changes in gene expression that are triggered by CNTF.
  • a reporter construct for use in the present invention for detecting signal transduction through STAT proteins can be generated by using -71 to +109 of the mouse c-fos gene fused to the bacterial chloramphenicol acetyltransferase gene (-71fosCAT) or other detectable marker gene.
  • cytokine receptor induces the tyrosine phosphorylation of STAT and STAT-related proteins, with subsequent translocation and binding of these proteins to the STAT-RE. This then leads to activation of transcription of genes containing this DNA element within their promoters.
  • the reporter gene is a gene whose expression causes a phenotypic change which is screenable or selectable. If the change is selectable, the phenotypic change creates a difference in the growth or survival rate between cells which express the reporter gene and those which do not. If the change is screenable, the phenotype change creates a difference in some detectable characteristic of the cells, by which the cells which express the marker may be distinguished from those which do not. Selection is preferable to screening in that it can provide a means for amplifying from the cell culture those cells which express a test polypeptide which is a receptor effector.
  • the marker gene is coupled to the receptor signaling pathway so that expression of the marker gene is dependent on activation of the receptor. This coupling may be achieved by operably linking the marker gene to a receptor-responsive promoter.
  • a receptor-responsive promoter indicates a promoter which is regulated by some product of the target receptor's signal transduction pathway. Alternatively, the promoter may be one which is repressed by the receptor pathway, thereby preventing expression of a product which is deleterious to the cell. With a receptor repressed promoter, one screens for agonists by linking the promoter to a deleterious gene, and for antagonists, by linking it to a beneficial gene.
  • Repression may be achieved by operably linking a receptor- induced promoter to a gene encoding mRNA which is antisense to at least a portion of the mRNA encoded by the marker gene (whether in the coding or flanking regions), so as to inhibit translation of that mRNA.
  • Repression may also be obtained by linking a receptor-induced promoter to a gene encoding a DNA binding repressor protein, and incorporating a suitable operator site into the promoter or other suitable region of the marker gene.
  • the marker gene may also be a screenable gene.
  • the screened characteristic may be a change in cell morphology, metabolism or other screenable features.
  • Suitable markers include ⁇ -galactosidase (Xgal, C 12 FDG, Salmon-gal, Magenta-Gal (latter two from Biosynth Ag)), alkaline phosphatase, horseradish peroxidase, exo-glucanase (product of yeast exbl gene; nonessential, secreted); luciferase; bacterial green fluorescent protein; (human placental) secreted alkaline phosphatase (SEAP); and chloramphenicol transferase (CAT).
  • Xgal Xgal, C 12 FDG, Salmon-gal, Magenta-Gal (latter two from Biosynth Ag)
  • alkaline phosphatase horseradish peroxidase
  • exo-glucanase product of yeast exbl gene; nonessential, secreted
  • a preferred screenable marker gene is beta-galactosidase; yeast cells expressing the enzyme convert the colorless substrate Xgal into a blue pigment.
  • the promoter may be receptor-induced or receptor-inhibited.
  • the pheromone signal pathway in wild-type yeast is growth arrest. If one is testing for an antagonist of a G protein-coupled receptor, such as a human receptor engineered into a yeast cell, this normal response of growth arrest can be used to select cells in which the pheromone response pathway is inhibited. That is, cells exposed to a test compound will be growth arrested if the compound is an agonist, but will grow normally if the compound is neutral or an antagonist. Thus, the growth arrest response can be used to advantage to discover compounds that function as agonists or antagonists.
  • a G protein-coupled receptor such as a human receptor engineered into a yeast cell
  • the effect of growth arrest can provide a selective advantage in the presence of an agent which is cytotoxic to mitotic cells.
  • the cytotoxic agent is added to the culture. Cells which proceed tlirough the cell-cycle, e.g., which are not growth arrested, will be killed. At some time after the addition of the cytotoxic agent, it can be washed from the culture, and surviving cells permitted to proceed with proliferation. Cells which were arrested by the test compound will be enriched in the surviving population.
  • the growth arrest consequent to activation of the pheromone response pathway is an undesirable effect since cells that bind agonists stop growing while surrounding cells that fail to bind peptides will continue to grow. The cells of interest, then, will be overgrown or their detection obscured by the background cells, confounding identification of agonistic peptides.
  • the present invention teaches engineering the cell such that: 1) growth arrest does not occur as a result of exogenous signal pathway activation (e.g., by inactivating the FAR1 gene); and/or 2) a selective growth advantage is conferred by activating the pathway (e.g., by transforming an auxotrophic mutant with a HIS3 gene under the control of a pheromone-responsive promoter, and applying selective conditions). It is, of course, desirable that the exogenous receptor be exposed on a continuing basis to the peptides. Unfortunately, this is likely to result in desensitization of the pheromone pathway to the stimulus.
  • the mating signal transduction pathway is l ⁇ iown to become desensitized by several mechanisms including pheromone degradation and modification of the function of the receptor, G-proteins and/or downstream elements of the pheromone signal transduction by the products of the SST2, STE50, AFR1 (Konopka, J.B. (1993) Mol. Cell. Biol. 13:6876-6888) and SGV1, MSG5, and SIG1 genes. Selected mutations in these genes can lead to hypersensitivity to pheromone and an inability to adapt to the presence of pheromone.
  • the assay will not be able to distinguish between peptides which interact with the endogenous receptor and those which interact with the exogenous receptor. It is therefore desirable that the endogenous gene be deleted or otherwise rendered nonfunctional.
  • Suitable host cells for generating the target cells of subject assay include prokaryotes, yeast, or higher eukaryotic cells, including plant and animal cells, especially mammalian cells.
  • Prokaryotes include gram negative or gram positive organisms.
  • suitable mammalian host cell lines include the COS-7 line of monkey kidney cells (ATCC CRL 1651) (Gluzman (1981) Cell 23:175) CV-1 cells (ATCC CCL 70), L cells, C127, 3T3, Chinese hamster ovary (CHO), HeLa, HEK-293, SWISS 3T3, and BHK cell lines.
  • yeast cells may be of any species which are cultivable and in which an exogenous receptor can be made to engage the appropriate signal transduction machinery of the host cell. Suitable species include Kluyverei lactis, Schizosaccharomyces pombe, and Ustilaqo maydis; Saccharomyces cerevisiae is preferred. Other yeast which can be used in practicing the present invention are Neurospora crassa, Aspergillus niger, Aspergillus nidulans, Pichia pastoris, Candida tropicalis, and Hansenula polymorpha.
  • yeast includes not only yeast in a strictly taxonomic sense, i.e., unicellular organisms, but also yeast-like multicellular fungi or filamentous fungi.
  • reporter constructs can provide a selectable or screenable trait upon transcriptional activation (or inactivation) in response to a signal transduction pathway coupled to the target receptor.
  • the reporter gene may be an unmodified gene already in the host cell pathway. It may be a host cell gene that has been operably linked to a "receptor-responsive" promoter. Alternatively, it may be a heterologous gene (e.g., a "reporter gene construct") that has been so linked. Suitable genes and promoters are discussed below.
  • second messenger generation can be measured directly in the detection step, such as mobilization of intracellular calcium or phospholipid metabolism are quantitated.
  • indicator genes can be used to detect receptor-mediated signaling.
  • the host cell must have an appropriate phenotype.
  • generating a pheromone-responsive chimeric HIS3 gene in a yeast that has a wild-type HIS3 gene would frustrate genetic selection.
  • an auxotrophic strain is wanted.
  • a variety of complementations for use in the subject assay can be constructed. Indeed, many yeast genetic complementation with mammalian signal transduction proteins have been described in the art. For example, Mosteller et al. (1994) Mol Cell Biol 14:1104-12 demonstrates that human Ras proteins can complement loss of ras mutations in S. cerevisiae. Moreover, Toda et al. (1986) Princess Takamatsu Symp 17: 253-60 have shown that human ras proteins can complement the loss of RAS1 and RAS2 proteins in yeast, and hence are functionally homologous. Both human and yeast RAS proteins can stimulate the magnesium and guanine nucleotide-dependent adenylate cyclase activity present in yeast membranes.
  • Ballester et al. (1989) Cell 59: 681-6 describe a vector to express the mammalian GAP protein in the yeast S. cerevisiae. When expressed in yeast, GAP inhibits the function of the human ras protein, and complements the loss of IRAl .
  • IRAl is a yeast gene that encodes a protein with homology to GAP and acts upstream of RAS. Mammalian GAP can therefore function in yeast and interact with yeast RAS.
  • Wei et al. (1994) Gene 151: 279-84 describes that a human Ras-specific guanine nucleotide-exchange factor, Cdc25GEF, can complement the loss of CDC25 function in S. cerevisiae.
  • Martegani et al. (1992) EMBO J 11: 2151-7 describe the cloning by functional complementation of a mouse cDNA encoding a homolog of CDC25, a Saccharomyces cerevisiae RAS activator.
  • Vojtek et al. (1993) J Cell Sci 105: 777-85 and Matviw et al. (1992) Mol Cell Biol 12: 5033-40 describe how a mouse CAP protein, e.g., an adenylyl cyclase associated protein associated with ras- mediated signal transduction, can complements defects in S. cerevisiae. Papasavvas et al.
  • the Ca(2+)- and phospholipid-dependent Ser/Thr kinase PKC plays important roles in the transduction of cellular signals in mammalian cells. Marcus et al. (1995) PNAS 92: 6180-4 suggests the complementation of shkl null mutations in S. pombe by the either the structurally related S. cerevisiae Ste20 or mammalian p65PAK protein kinases.
  • the subject method can be used in a broad range of applications, including for the selection of CSPs having effects on proliferation, differentiation, cell death, cell migration, etc.
  • the target used in the display mode is an extracellular component of a cell.
  • the target for subject method can be an intracellular component and, during the secretion mode, the system can be augmented with agents which promote the cellular uptake of the test CSPs.
  • the subject method is utilized to identify CSPs which have antiproliferative activity with respect to one or more types of cells.
  • the test peptide or CSP library can be panned with the target cells for which an antiproliferative is desired in order to enrich for peptides which bind to that cell.
  • the test peptide or CSP library can also be panned against one or more control cell lines in order to remove peptides which bind the control cells.
  • the CSP library which is then tested in the secretion mode can be enriched for CSPs which selectively bind target cell (relative to the control cells).
  • the display mode can produce a library enriched for test peptides or CSPs which preferentially bind tumor cells relative to normal cells, which preferentially bind p53- cells relative to p53+ cells, which preferentially bind hair follicle cells relative to other epithelial cells, or any other differential binding characteristic.
  • the peptides are tested for antiproliferative activity against the target cell using any of a number of techniques known in the art. For instance, BrdU or other nucleotide uptake can be measured as an indicator of proliferation.
  • the secretion mode can include negative controls in order to select for peptides with specific antiproliferative activity.
  • peptides can be isolated from the library based on their ability to induce apoptosis or cell lysis, e.g., in a cell selective manner.
  • the subject method can be used to identify peptides with angiogenic or anti-angiogenic activity.
  • the CSP library can be enriched for peptides that bind to endothelial cells but which do not bind to fibroblasts.
  • the resulting sub-library can be screened for peptides which inhibit capillary endothelial cell proliferation and/or endothelial cell migration. Peptides scoring positive for one or both of these activities can also be tested for activity against other cell types, such as smooth muscle cells or fibroblasts, in order to select peptides active only against endothelial cells.
  • the subject method can be used to identify anti- infective peptides, e.g., which are active as anti-fungal or antibacterial agents.
  • the assay of the present invention can be used for identifying effectors of a receptor protein or complex thereof.
  • the assay is characterized by the use of a test cell which includes a target receptor or ion channel protein whose signal transduction activity can be modulated by interaction with an extracellular signal, the transduction activity being able to generate a detectable signal.
  • such embodiments of the subject assay are characterized by the use of a mixture of cells expressing a target receptor protein or ion channel capable of transducing a detectable signal in the reagent cell.
  • the receptor/channel protein can be either endogenous or heterologous.
  • a culture of the instant reagent cells will provide means for detecting agonists or antagonists of receptor function.
  • the ability of particular peptides to modulate a signal transduction activity of the target receptor or channel can be scored for by detecting up or down-regulation of the detection signal.
  • second messenger generation e.g. GTPase activity, phospholipid hydrolysis, or protein phosphorylation patterns as examples
  • an indicator gene can provide a convenient readout.
  • a detection means consists of an indicator gene.
  • peptides which induce a signal pathway from a particular receptor or channel can be identified. If a test CSP does not appear to induce the activity of the receptor/channel protein, the assay may be repeated as described above, and modified by the introduction of a step in which the reagent cell is first contacted with a known activator of the target receptor/channel to induce signal transduction, and the test CSP can be assayed for its ability to inhibit the activated receptor/channel, e.g., to identify antagonists. In yet other embodiments, peptides can be screened for those which potentiate the response to a l ⁇ iown activator of the receptor.
  • the receptor or ion channel it may be endogenously expressed by the host cell, or it may be expressed from a heterologous gene that has been introduced into the cell.
  • Methods for introducing heterologous DNA into eukaryotic cells are of course well known in the art and any such method may be used.
  • DNA encoding various receptor proteins is known to those of skill in the art or it may be cloned by any method l ⁇ iown to those of skill in the art.
  • the assays can be used to test functional ligand-receptor or ligand-ion channel interactions for cell surface-localized receptors and channels.
  • the subject assay can be used to identify effectors of, for example, G protein-coupled receptors, receptor tyrosine kinases, cytokine receptors, and ion channels.
  • the method described herein is used for identifying ligands for "orphan receptors" for which no ligand is known.
  • the receptor is a cell surface receptor, such as: a receptor tyrosine kinase, e.g., an EPH receptor; an ion channel; a cytokine receptor; an multisubunit immune recognition receptor, a chemokine receptor; a growth factor receptor, or a G-protein coupled receptor, such as a chemoattractant peptide receptor, a neuropeptide receptor, a light receptor, a neuiOtransmitter receptor, or a polypeptide hormone receptor.
  • a receptor tyrosine kinase e.g., an EPH receptor
  • an ion channel e.g., an EPH receptor
  • a cytokine receptor e.g., an cytokine receptor
  • an multisubunit immune recognition receptor e.g., a chemokine receptor
  • a growth factor receptor e.g., a growth factor receptor
  • G-protein coupled receptor such as a chemoattractant peptide receptor,
  • Preferred G protein coupled receptors include ⁇ lA-adrenergic receptor, lB-adrenergic receptor, ⁇ 2-adrenergic receptor, ⁇ 2B-adrenergic receptor, ⁇ l- adrenergic receptor, ⁇ 2-adrenergic receptor, ⁇ 3-adrenergic receptor, ml acetylcholine receptor (AChR), m2 AChR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine receptor, 5- HTla receptor, 5-HTlb receptor, 5HTl-like receptor, 5-HTld receptor, 5HTld-like receptor, 5HTld beta receptor, substance K (neurokinin A) receptor, fMLP receptor, fMLP-like receptor, angioten
  • Preferred EPH receptors include eph, elk, eek, sek, mek4, hek, helc2, eek, erk, tyrol, tyro4, tyro5, tyro6, tyrol 1, cek4, cek5, cek6, cek7, cek8, cek9, ceklO, bsk, rtkl , rtk2, rtk3, mykl , myk2, ehkl , ehk2, pagliaccio, htk, erk and nuk receptors.
  • the target receptor is a cytokine receptor.
  • Cytokines are a family of soluble mediators of cell-to-cell communication that includes interleukins, interferons, and colony-stimulating factors. The characteristic features of cytokines lie in their functional redundancy and pleiotropy. Most of the cytokine receptors that constitute distinct superfamilies do not possess intrinsic protein tyrosine kinase domains, yet receptor stimulation usually invokes rapid tyrosine phosphorylation of intracellular proteins, including the receptors themselves. Many members of the cytokine receptor superfamily activate the Jak protein tyrosine kinase family, with resultant ' ⁇ ⁇ sphorylation of the STAT transcriptional activator factors.
  • IL-2, IL-7, IL-2 and Interferon ⁇ have all been shown to activate Jak kinases (Frank et al (1995) Proc Natl Acad Sci USA 92:7779-7783); Scharfe et al. (1995) Blood 86:2077-2085); (Bacon et al. (1995) Proc Natl Acad Sci USA 92:7307-7311); and (Sakatsume et al (1995) J. Biol Chem 270:17528-17534). Events downstream of Jak phosphorylation have also been elucidated.
  • STAT signal transducers and activators of transcription
  • STAT1 D signal transducers and activators of transcription
  • STAT2D signal transducers and activators of transcription
  • STAT3 two STAT-related proteins, p94 and p95.
  • the STAT proteins were found to translocate to the nucleus and to bind to a specific DNA sequence, thus suggesting a mechanism by which IL-2 may activate specific genes involved in immune cell function (Frank et al. supra).
  • Jak3 is associated with the gamma chain of the IL-2, IL-4, and IL-7 cytokine receptors (Fujii et al.
  • Detection means which may be scored for in the present assay, in addition to direct detection of second messengers, such as by changes in phosphorylation, includes reporter constructs or indicator genes which include transcriptional regulatory elements responsive to the STAT proteins. Described infra.
  • MIRR Multisubunit Immune Recognition Receptor
  • the receptor is a multisubunit receptor.
  • Receptors can be comprised of multiple proteins referred to as subunits, one category of which is referred to as a multisubunit receptor is a multisubunit immune recognition receptor (MIRR).
  • MIRRs include receptors having multiple noncovalently associated subunits and are capable of interacting with src-family tyrosine kinases.
  • MIRRs can include, but are not limited to, B cell antigen receptors, T cell antigen receptors, Fc receptors and CD22.
  • An MIRR is an antigen receptor on the surface of a B cell.
  • the MIRR on the surface of a B cell comprises membrane-bound immunoglobulin (mlg) associated with the subunits Ig- and Ig- ⁇ or Ig- ⁇ , which forms a complex capable of regulating B cell function when bound by antigen.
  • An antigen receptor can be functionally linked to an amplifier molecule in a manner such that the amplifier molecule is capable of regulating gene transcription.
  • Src-family tyrosine kinases are enzymes capable of phosphorylating tyrosine residues of a target molecule.
  • a src-family tyrosine kinase contains one or more binding domains and a kinase domain.
  • a binding domain of a src-family tyrosine kinase is capable of binding to a target molecule and a kinase domain is capable of phosphorylating a target molecule bound to the kinase.
  • Members of the src family of tyrosine kinases are characterized by an N-terminal unique region followed by three regions that contain different degrees of homology among all the members of the family.
  • src homology region 1 SHI
  • SH2 src homology region 2
  • SH3 src homology region 3
  • Both the SH2 and SH3 domains are believed to have protein association functions important for the formation of signal transduction complexes.
  • the amino acid sequence of an N-terminal unique region varies between each src-family tyrosine kinase.
  • An N- terminal unique region can be at least about the first 40 amino acid residues of the N-terminal of a src-family tyrosine kinase.
  • Syk-family ldnases are enzymes capable of phosphorylating tyrosine residues of a target molecule.
  • a syk-family kinase contains one or more binding domains and a kinase domain.
  • a binding domain of a syk-family tyrosine kinase is capable of binding to a target molecule and a kinase domain is capable of phosphorylating a target molecule bound to the kinase.
  • Members of the syk- family of tyrosine kinases are characterized by two SH2 domains for protein association function and a tyrosine kinase domain.
  • a primary target molecule is capable of further extending a signal transduction pathway by modifying a second messenger molecule.
  • Primary target molecules can include, but are not limited to, phosphatidylinositol 3 -kinase (PI-3K), p21 ras GAP-activating protein and associated pi 90 and p62 protein, phospholipases such as PLC ⁇ l and PLC ⁇ 2, MAP kinase, She and VAV.
  • PI-3K phosphatidylinositol 3 -kinase
  • p21 ras GAP-activating protein and associated pi 90 and p62 protein phospholipases such as PLC ⁇ l and PLC ⁇ 2, MAP kinase, She and VAV.
  • a primary target molecule is capable of producing second messenger molecule which is capable of further amplifying a transduced signal.
  • Second messenger molecules include, but are not limited to diacylglycerol and inositol 1,4,5-triphosphat
  • Second messenger molecules are capable of initiating physiological events which can lead to alterations in gene transcription. For example, production of IP3 can result in release of intracellular calcium, which can then lead to activation of calmodulin kinase II, which can then lead to serine phosphorylation of a DNA binding protein referred to as ets-1 proto-onco-protein.
  • Diacylglycerol is capable of activating the signal transduction protein, protein kinase C which affects the activity of the API DNA binding protein complex.
  • Signal transduction pathways can lead to transcriptional activation of genes such as c-fos, egr-1, and c-myc.
  • An adapter molecule comprises a protein that enables two other proteins to form a complex (e.g., a three molecule complex). She protein enables a complex to form which includes Grb2 and
  • SOS She comprises an SH2 domain that is capable of associating with the SH2 domain of Grb2.
  • Molecules of a signal transduction pathway can associate with one another using recognition sequences.
  • Recognition sequences enable specific binding between two molecules. Recognition sequences can vary depending upon the structure of the molecules that are associating with one another. A molecule can have one or more recognition sequences, and as such can associate with one or more different molecules.
  • Signal transduction pathways for MIRR complexes are capable of regulating the biological functions of a cell. Such functions can include, but are not limited to the ability of a cell to grow, to differentiate and to secrete cellular products.
  • MIRR- induced signal transduction pathways can regulate the biological functions of specific types of cells involved in particular responses by an animal, such as immune responses, inflammatory responses and allergic responses.
  • Cells involved in an immune response can include, for example, B cells, T cells, macrophages, dendritic cells, natural killer cells and plasma cells.
  • Cells involved in inflammatory responses can include, for example, basophils, mast cells, eosinophils, neutrophils and macrophages.
  • the detection signal is a second messengers, such as a phosphorylated src-like protein, includes reporter constructs or indicator genes which include transcriptional regulatory elements such as serum response element (SRE), 12-O-tetradecanoyl-phorbol- 13 -acetate response element, cyclic AMP response element, c- fos promoter, or a CREB-responsive element.
  • SRE serum response element
  • cyclic AMP response element c- fos promoter
  • CREB-responsive element a CREB-responsive element
  • the target receptor is a receptor tyrosine kinase.
  • the receptor tyrosine kinases can be divided into five subgroups on the basis of structural similarities in their extracellular domains and the organization of the tyrosine kinase catalytic region in their cytoplasmic domains. Sub-groups I (epidermal growth factor (EGF) receptor-like), II (insulin receptor-like) and the eph/eck family contain cysteine-rich sequences (Hirai et al., (1987) Science 238:1717-1720 and Lindberg and Hunter, (1990) Mol. Cell. Biol. 10:6316-6324).
  • EGF epidermal growth factor
  • the functional domains of the kinase region of these three classes of receptor tyrosine kinases are encoded as a contiguous sequence ( Hanks et al. (1988) Science 241 :42-52).
  • Subgroups III platelet-derived growth factor (PDGF) receptor-like) and IV (the fibro-blast growth factor (FGF) receptors) are characterized as having immunoglobulin (Ig)-like folds in their extracellular domains, as well as having their kinase domains divided in two parts by a variable stretch of unrelated amino acids (Yanden and Ullrich (1988) supra and Hanks et al. (1988) supra).
  • the family with by far the largest number of l ⁇ iown members is the EPH family.
  • the expression patterns determined for some of the EPH family receptors have implied important roles for these molecules in early vertebrate development.
  • the timing and pattern of expression of sek, mek4 and some of the other receptors during the phase of gastrulation and early organogenesis has suggested functions for these receptors in the important cellular interactions involved in patterning the embryo at this stage (Gilardi-Hebenrison et al. (1992) Oncogene 7:2499-2506; Nieto et al. (1992) Development 116:1137-1150; Henkemeyer et al, supra; Ruiz et al., supra; and Xu et al., supra).
  • Sek shows a notable early expression in the two areas of the mouse embryo that show obvious segmentation, namely the somites in the mesoderm and the rhombomeres of the hindbrain; hence the name sek, for segmentally expressed kinase (Gilardi- Hebenrison et al., supra; Nieto et al., supra).
  • these segmental structures of the mammalian embryo are implicated as important elements in establishing the body plan.
  • the observation that Sek expression precedes the appearance of morphological segmentation suggests a role for sek in forming these segmental structures, or in determining segment-specific cell properties such as lineage compartmentation (Nieto et al., supra).
  • EPH receptors have been implicated, by their pattern of expression, in the development and maintenance of nearly every tissue in the embryonic and adult body. For instance, EPH receptors have been detected throughout the nervous system, the testes, the cartilaginous model of the skeleton, tooth primordia, the infundibular component of the pituitary, various epithelia tissues, lung, pancreas, liver and kidney tissues. Observations such as this have been indicative of important and unique roles for EPH family kinases in development and physiology, but further progress in understanding their action has been severely limited by the lack of information on their ligands.
  • EPH receptor or "EPH-type receptor” refer to a class of receptor tyrosine kinases, comprising at least eleven paralogous genes, though many more orthologs exist within this class, e.g. homologs from different species.
  • EPH receptors in general, are a discrete group of receptors related by homology and easily recognizable, e.g., they are typically characterized by an extracellular domain containing a characteristic spacing of cysteine residues near the N-terminus and two fibronectin type III repeats (Hirai et al. (1987) Science 238:1717-1720; Lindberg et al. (1990) Mol Cell Biol 10:6316-6324; Chan et al.
  • EPH receptors include the eph, elk, eek, sek, mek4, hek, hek.2, eek, erk, tyrol, tyro4, tyroS, tyro ⁇ , tyrol 1, cek4, cek5, cek ⁇ , cek7, cek8, cek9, ceklO, bsk, rtkl, rtk2, rtk3, mykl, myk2, ehkl, ehk.2, pagliaccio, htk, erk and nuk receptors.
  • the term "EPH receptor” refers to the membrane form of the receptor protein, as well as soluble extracellular fragments which retain the ability to bind the ligand of the present invention.
  • the detection signal is provided by detecting phosphorylation of intracellular proteins, e.g., MEKKs, MEKs, or Map kinases, or by the use of reporter constructs or indicator genes which include transcriptional regulatory elements responsive to c-fos and/or c-jun. Described infra.
  • G proteins One family of signal transduction cascades found in eukaryotic cells utilizes heterofrimeric "G proteins." Many different G proteins are known to interact with receptors. G protein signaling systems include three components: the receptor itself, a GTP-binding protein (G protein), and an intracellular target protein.
  • G protein GTP-binding protein
  • the cell membrane acts as a switchboard. Messages arriving through different receptors can produce a single effect if the receptors act on the same type of G protein. On the other hand, signals activating a single receptor can produce more than one effect if the receptor acts on different kinds of G proteins, or ifthe G proteins can act on different effectors.
  • the G proteins which consist of alpha, beta and gamma subunits, are complexed with the nucleotide guanosine diphosphate (GDP) and are in contact with receptors.
  • GDP nucleotide guanosine diphosphate
  • the receptor changes conformation and this alters its interaction with the G protein. This spurs the D subunit to release GDP, and the more abundant nucleotide guanosine triphosphate (GTP), replaces it, activating the G protein.
  • GTP nucleotide guanosine triphosphate
  • the G protein then dissociates to separate the alpha subunit from the still complexed beta and gamma subunits. Either the G alpha subunit, or the G beta-gamma complex, depending on the pathway, interacts with an effector.
  • the effector (which is often an enzyme) in turn converts an inactive precursor molecule into an active "second messenger," which may diffuse through the cytoplasm, triggering a metabolic cascade.
  • the G alpha converts the GTP to GDP, thereby inactivating itself.
  • the inactivated G alpha may then reassociate with the G beta- gamma complex.
  • G protein-coupled receptors are comprised of a single protein chain that is threaded through the plasma membrane seven times. Such receptors are often referred to as seven-transmembrane receptors (STRs). More than a hundred different STRs have been found, including many distinct receptors that bind the same ligand, and there are likely many more STRs awaiting discovery.
  • STRs seven-transmembrane receptors
  • STRs have been identified for which the natural ligands are unknown; these receptors are termed "orphan" G protein-coupled receptors, as described above. Examples include receptors cloned by Neote et al. (1993) Cell 72,
  • the "exogenous receptors" of the present invention may be any G protein-coupled receptor which is exogenous to the cell wliich is to be genetically engineered for the purpose of the present invention.
  • This receptor may be a plant or animal cell receptor. Screening for binding to plant cell receptors may be useful in the development of, e.g., herbicides.
  • an animal receptor it may be of invertebrate or vertebrate origin. If an invertebrate receptor, an insect receptor is preferred, and would facilitate development of insecticides.
  • the receptor may also be a vertebrate, more preferably a mammalian, still more preferably a human, receptor.
  • the exogenous receptor is also preferably a seven transmembrane segment receptor.
  • ligands for G protein coupled receptors include: purines and nucleotides, such as adenosine, cAMP, ATP, UTP, ADP, melatonin and the like; biogenic amines (and related natural ligands), such as 5-hydroxytryptamine, acetylcholine, dopamine, adrenaline, adrenaline, adrenaline., histamine, noradrenaline, noradrenaline, noradrenaline., tyramine/octopamine and other related compounds; peptides such as adrenocorticofrophic hormone (acth), melanocyte stimulating hormone (msh), melanocortins, neurotensin (nt), bombesin and related peptides, endothelins, cholecystokinin, gastrin, neurokinin b (nk3), invertebrate tachykinin-like peptides, substance k (nk2), substance k
  • G-protein coupled receptors include, but are not limited to, dopaminergic, muscarinic cholinergic, a-adrenergic, b-adrenergic, opioid (including delta and mu), cannabinoid, serotoninergic, and GABAergic receptors.
  • Preferred receptors include the 5HT family of receptors, dopamine receptors, C5a receptor and FPRL-1 receptor, cyclo-histidyl-proline-diketoplperazine receptors, melanocyte stimulating hormone release inhibiting factor receptor, and receptors for neurotensin, thyrotropin releasing hormone, calcitonin, cholecytokinin-A, neurokinin-2, histamine-3, cannabinoid, melanocortin, or adrenomodulin, neuropeptide- Yl or galanin.
  • Other suitable receptors are listed in the art.
  • the term "receptor,” as used herein, encompasses both naturally occurring and mutant receptors.
  • G protein-coupled receptors like the yeast alpha- and beta-factor receptors, contain seven hydrophobic amino acid-rich regions which are assumed to lie within the plasma membrane.
  • Specific human G protein-coupled STRs for which genes have been isolated and for which expression vectors could be constructed include those listed herein and others known in the art.
  • the gene would be operably linked to a promoter functional in the cell to be engineered and to a signal sequence that also functions in the cell.
  • suitable promoters include Ste2, Ste3 and gallO.
  • Suitable signal sequences include those of Ste2, Ste3 and of other genes which encode proteins secreted by yeast cells.
  • the codons of the gene would be optimized for expression in yeast. See Hoekema et al.,(1987) Mol. Cell. Biol, 7:2914-24; Sharp, et al, (1986)14:5125-43.
  • N- and C-terminal regions, and the cytoplasmic loop connecting transmembrane segments V and VI are more divergent.
  • a foreign receptor when the host cell is a yeast cell, a foreign receptor will fail to functionally integrate into the yeast membrane, and there interact with the endogenous yeast G protein. More likely, either the receptor will need to be modified (e.g., by replacing its V-VI loop with that of the yeast STE2 or STE3 receptor), or a compatible G protein should be provided.
  • the wild-type exogenous G protein-coupled receptor cannot be made functional in yeast, it may be mutated for this purpose.
  • a comparison would be made of the amino acid sequences of the exogenous receptor and of the yeast receptors, and regions of high and low homology identified. Trial mutations would then be made to distinguish regions involved in ligand or G protein binding, from those necessary for functional integration in the membrane.
  • the exogenous receptor would then be mutated in the latter region to more closely resemble the yeast receptor, until functional integration was achieved. If this were insufficient to achieve functionality, mutations would next be made in the regions involved in G protein binding. Mutations would be made in regions involved in ligand binding only as a last resort, and then an effort would be made to preserve ligand binding by making conservative substitutions whenever possible.
  • the yeast genome is modified so that it is unable to produce the yeast receptors which are homologous to the exogenous receptors in functional form. Otherwise, a positive assay score might reflect the ability of a peptide to activate the endogenous G protein-coupled receptor, and not the receptor of interest.
  • N-formyl peptide receptor is a classic example of a calcium mobilizing G protein-coupled receptor expressed by neutrophils and other phagocytic cells of the mammalian immune system (Snyderman et al. (1988) In Inflammation: Basic Principles and Clinical Correlates, pp. 309-323).
  • N-formyl peptides of bacterial origin bind to the receptor and engage a complex activation program that results in directed cell movement, release of inflammatory granule contents, and activation of a latent NADPH oxidase which is important for the production of metabolites of molecular oxygen.
  • This pathway initiated by receptor-ligand interaction is critical in host protection from pyrogenic infections. Similar signal transduction occurs in response to the inflammatory peptides C5a and IL-8.
  • FPRL formyl peptide receptor like
  • FPRL2 was found to mediate calcium mobilization in mouse fibroblasts transfected with the gene and exposed to formyl peptide.
  • FPRLl was found to be 69% identical in amino acid sequence to NFPR, it did not bind prototype N-formyl peptides ligands when expressed in heterologous cell types. This lead to the hypothesis of the existence of an as yet unidentified ligand for the FPRLl orphan receptor (Murphy et al. supra).
  • the yeast cell In the case of an exogenous G protein-coupled receptor, the yeast cell must be able to produce a G protein which is activated by the exogenous receptor, and wliich can in turn activate the yeast effector(s).
  • the endogenous yeast G ⁇ subunit e.g., GPA
  • GPA the endogenous yeast G ⁇ subunit
  • the G ⁇ subunit of the yeast G protein may be replaced by the G ⁇ subunit natively associated with the exogenous receptor.
  • the G ⁇ subunit may be modified to improve coupling. These modifications often will take the form of mutations which increase the resemblance of the G ⁇ subunit to the yeast G ⁇ while decreasing its resemblance to the receptor-associated G ⁇ . For example, a residue may be changed so as to become identical to the corresponding yeast G ⁇ residue, or to at least belong to the same exchange group of that residue. After modification, the modified G ⁇ subunit might or might not be "substantially homologous" to the foreign and/or the yeast G ⁇ subunit.
  • the modifications are preferably concentrated in regions of the G ⁇ which are likely to be involved in G ⁇ binding.
  • the modifications will take the form of replacing one or more segments of the receptor-associated G ⁇ with the corresponding yeast G ⁇ segment(s), thereby forming a chimeric G ⁇ subunit.
  • segment refers to tliree or more consecutive amino acids.
  • point mutations may be sufficient.
  • This chimeric G ⁇ subunit will interact with the exogenous receptor and the yeast G ⁇ complex, thereby permitting signal transduction. While use of the endogenous yeast G ⁇ is preferred, if a foreign or chimeric G ⁇ is capable of transducing the signal to the yeast effector, it may be used instead. V. Pharmaceutical Preparations of Identified Agents
  • test CSPs in the subject assay e.g. as potential surrogate ligands, or receptor antagonists
  • the practitioner of the subject assay will continue to test the efficacy and specificity of the selected peptides both in vitro and in vivo.
  • peptides identified in the subject assay, or peptidomimetics thereof can be formulated in pharmaceutical preparations for in vivo administration to an animal, preferably a human.
  • the peptides selected in the subject assay, or a pharmaceutically acceptable salt thereof may accordingly be formulated for administration with a biologically acceptable medium, such as water, buffered saline, polyol (for example, glycerol, propylene glycol, liquid polyethylene glycol and the like) or suitable mixtures thereof.
  • a biologically acceptable medium such as water, buffered saline, polyol (for example, glycerol, propylene glycol, liquid polyethylene glycol and the like) or suitable mixtures thereof.
  • a biologically acceptable medium includes any and all solvents, dispersion media, and the like which may be appropriate for the desired route of administration of the pharmaceutical preparation. The use of such media for pharmaceutically active substances is known in the art.
  • Suitable vehicles and their formulation inclusive of other proteins are described, for example, in the book Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. Mack Publishing Company, Easton, Pa., USA 1985). These vehicles include injectable "deposit formulations". Based on the above, such pharmaceutical formulations include, although not exclusively, solutions or freeze-dried powders of the compound in association with one or more pharmaceutically acceptable vehicles or diluents, and contained in buffered media at a suitable pH and isosmotic with physiological fluids. In preferred embodiment, the peptide can be disposed in a sterile preparation for topical and/or systemic administration.
  • excipients such as, but not exclusively, mannitol or glycine may be used and appropriate buffered solutions of the desired volume will be provided so as to obtain adequate isotonic buffered solutions of the desired pH.
  • Similar solutions may also be used for the pharmaceutical compositions of compounds in isotonic solutions of the desired volume and include, but not exclusively, the use of buffered saline solutions with phosphate or citrate at suitable concentrations so as to obtain at all times isotonic pharmaceutical preparations of the desired pH, (for example, neutral pH).
  • vectors pSW3 and pSW4 Two distinct embodiments of the present invention are demonstrated in the preparation of vectors pSW3 and pSW4. These vectors can be used to display the CSPs useful in the instant application.
  • pSWl is the prototype vector for the KeyCode technology. Its backbone is a modified M13 bacteriophage (bp 1-4779 and 8585 to 9170) genome in which the wild type pill gene is deleted. Note that alternate embodiments are possible, for example the deletion (and relocation) of the wild type bacteriophage pVIII gene instead of pill. Inserted into this backbone is an Amp resistance gene (bp 4824- 5681) that allows for selection in E. coli.
  • the expression segment (bp 6046-8584) contains the CMV promoter, splice acceptor and donor sequences, and SV40 replication/termination elements which allow for expression of peptide sequences in eukaryotic hosts. These eukaryotic segments are interspersed with E.
  • coli expression elements lac promoter, Ml 3 phage gene III (or gene VIII), and E. coli termination signals
  • wliich allow the cloned library peptides to be displayed on the phage coat as a fusion to the pill phage protein.
  • Random oligonucleotides encoding the random peptides are directionally inserted into the phage as BstXI fragments.
  • pSW2 is a derivative of pSWl that was designed to aid the construction of albumin display vectors.
  • the relevant functional elements are present as cassettes that expedite the simple exchange of components.
  • pSW2 is another prototype vector for the KeyCode technology.
  • a Sail restriction site is introduced into pSWl through insertion of 4 bases (CGAC) at position 8728.
  • This insertion will provide a restriction site that allows the exchange of signal sequences and N-terminal protein sequences for expression in eukaryotic hosts (see diagrams 4 and 5).
  • the IgH secretion signal sequence could be excised and replaced with a serum albumin secretion signal sequence.
  • Hindlll, Sapl, and BamHl restriction sites are engineered between base pairs 6841 and 6975 of pSWl, thus eliminating 10 amino acids of flanking sequence around the peptide libraries in pSWl and eliminating the potential effects of flanking sequence on peptide binding during phage panning (see Figure 4 and 6).
  • a PflMI site is introduced at base pair 8264 by PCR to allow simple insertion of protein sequences, such as serum albumin, to which the peptide may be fused in eukaryotic expression ( Figure 4 and 7).
  • pSW2 can be further modified in order to house the libraries of test peptides fused to bacterial gene III (or gene VIII).
  • pSW2 was modified to form embodiments of vectors useful in the present invention, pSW3 and pSW4, as described below.
  • the vector allows for display of test peptides, followed by secretion of chimeric serum peptides.
  • pSW4 the vector allows for display and secretion of chimeric serum peptides.
  • the peptides are displayed on the surface of the page as pill (or pVIII) fusions without serum albumin.
  • the peptides are expressed in the cys53-cys62 loop of domain I (aa 1-96) or full length mouse serum albumin.
  • the serum albumin secretion signal sequence and amino acids 1 to 51 of the mature serum albumin protein are inserted between positions bp 6657 to 6725 of pSW2 by PCR into the Xbal/Sall sites. Due to requirements of the splice donor sequence at position 6721 of pSW2, amino acid 51 of serum albumin needs to be changed from Lys to Arg.
  • the serum albumin coding sequence (amino acids 65-608 or 65-96) is cloned by PCR as an XhoI/PflM fragment into pSW2 (8737 to 8763; see Figures 10 and 12). Due to the requirements of the splice acceptor sequence at position 8759, amino acid 65 of serum albumin changes from Ser to Ala.
  • Peptide libraries will be cloned as either Sapl or Hindlll/BamHI oligomers by gap duplex method. In cloning these peptides, oligos must contain the E. coli secretion signal/splice acceptor/splice donor as well as the peptide library flanked by amino acids 52 and 53 and amino acids 62 to 64 of mouse serum albumin (see
  • peptides inserted into domain I can be transferred into wild type albumin as an Nrul/Xcml fragment in the pcDNA 3 (NX)
  • the resulting clone will express the peptide in loop 53 in a wild type MSA protein (K51, S65) from eukaryotic host cells (see Figure 14) and can be used for the construction of CHO producer cell lines.
  • pSW4 allows the display the entire serum albumin domain 1 (aa 1-106) with the inserted peptides (for example, between Cys52-Cys63 and/or Cys91-Cysl00) on the surface of the virion as a pill fusion and can be used for the COS-7 expression of the albumin-peptide chimera.
  • pSW2 was modified to create pSW4 as described below:
  • the serum albumin secretion signal sequence is inserted as complimentary oligos between the Xbal and Sail sites of pSW2 at positions 6657 and 6725(see Figures 16 and 17).
  • amino acids 1 through 96 of serum albumin are inserted between the Hindlll and BamHl sites by PCR cloning.
  • the original Hindlll site of pSW2 is destroyed by silent mutation.
  • a new Hindlll site is engineered by silent mutagenesis in the serum albumin sequence at amino acids 65 and 66 (see diagrams 16 and 18).
  • the presence of a unique Sphl site in serum albumin at amino acids 39 and 40 allows insertion of peptide libraries as Sphl Hindlll oligos (see Figures 16 and 19).
  • SacII and Pstl sites are also engineered by silent mutagenesis in serum albumin at amino acids 86/87 and 105/106 respectively, which allow peptide library insertion between Cys 91 and Cys 100, in an additional surface loop of serum albumin.
  • the recovered phage must be amplified by re-infecting E. coli cells.
  • the expression cassette of pAMlO is the same as in pSW3except it is a phagemid.
  • pAMlO allows the display of the entire serum albumin domain I (aa 1- 106) with test peptides inserted between Cys52-Cys63 and/or Cys91-Cysl00 on the surface of the virion as a pill fusion.
  • the bacteriophage pVIII gene may be used in place of the pill gene. In such a case, a helper phage is used in order to produce wild-type pVIII, without the attached chimeric serum peptide.
  • the presence of the wild type pVIII allows formation of viable phage in which some of the phage pVIII coat protein will be attached to CSPs.
  • pAM9 When transfected into COS7 cells, it directs the expression and secretion of the albumin peptide chimera.
  • pAM9 was modified as described below to create pAMlO:
  • the pSW3 cassette comprised of the MSA signal sequence, lac promoter, E. coli signal sequence, MSA domain I (aa 1-106) and pill gene (or pVIII gene) with interspersed e ⁇ karyotic splice signals was used to replace the similar pAM9 cassette as an Xbal/Xhol fragment.
  • the expression cassette of pAMl 1 is the same as in pSW4 except it is a phagemid.
  • pAMl 1 allows the display of the test peptides on the- surface of the virion as a pill fusion.
  • the bacteriophage pVIII gene may be used in place of the pill gene.
  • a helper phage is used in order to produce wild-type pVIII, without the test peptide.
  • the presence of the wild type pVIII allows formation of viable phage in which some of the phage pVIII coat protein will be attached to test peptides. When transfected into COS7 cells, it directs the expression and secretion of the albumin peptide chimera.
  • mouse serum albumin was modified to include the myc epitope, EQKLISEEDL (SEQ ID NO: 1).
  • the myc epitope was inserted in the middle of each of three amino acid segments: between amino acids 57-58 for loop 53-62, amino acids 364-365 for loop 360-369 and amino acids 467- 468 for loop 450-467.
  • Cos7 cells were transfected with either wild type MSA or the various myc containing MSA constructs. The presence of the proteins in the medium was first determined by Western blot analysis using antibodies specific for MSA and the myc epitope.
  • the conditioned media was either mixed directly with the antibody (N, native) or first denatured in the presence of 0.1% SDS, 1 mM ⁇ - mercapthoethanol and heat (100°C for 10 min) and then antibody added (D, denatured).
  • D antibody added
  • the presence of the proteins that could be precipitated by the myc antibody were revealed by Western blot analysis using the MSA specific antibody.
  • the albumin proteins with myc inserted in loops 53-62 and 360-369 were bound by the myc antibody regardless of whether the protein was in its native or denatured form.
  • loop 450-463 the protein only bound the antibody when it was first denatured. This experiment clearly demonstrates that loops 53-62 and 360-369 are exposed on the surface of the MSA protein and therefore good for display. Additionally, the 450-463 loop is buried.
  • BCE bovine capillary endothelial cells
  • the goal of this experiment was determine the function of MSA with the RGD peptide (VRGDF, SEQ ID NO: 2) displayed on the surface of the protein in the loop 53-58 region (MSA-myc-RGD).
  • RGD was chosen, as this peptide can efficiently bind to *v*3 integrin receptors on endothelial cells and inhibit their proliferation.
  • Triplicate wells of Cos7 cells were transfected with the following constructs: MSA-myc (the myc epitope was added to the C-terminal tail of MSA in this iteration); MSA-myc-RGD; or pAM7-stuffer. These Cos7 cells were grown in the lower chamber of a Transwell ® tissue culture plate with BCE cells in the upper chamber.
  • FGF was added to the lower chamber or not in the case of no FGF control and the cells allowed to grow for 72 hours.
  • 6.25 *M c-RGD peptide was also added.
  • Cell growth was determined by a Calcein-binding fluorescence assay.
  • the left panel of Figure 3 is a graph of the optical density (OD) for each. The data reveals the addition of FGF results in a 2-fold stimulation of growth of the BCE cells. This growth was inhibited by the c-RGD peptide and also by the secreted MSA-myc-RGD protein.
  • the right panel is a different way of looking at the same data.
  • the degree of inhibition of growth is graphed for each.
  • the data shows that the MSA-Myc-RGD protein inhibited the growth of the BCE cell by 53% and the degree of inhibition was equivalent to that of the added RGD peptide.
  • the RGD peptide displayed on the surface of the MSA molecule inhibited BCE cell growth as efficiently as the endogenously added free RGD peptide demonstrating that the peptide retains its activity in the looped orientation.
  • MSA mouse serum albumin
  • EC endothelial cell binding
  • MSA full-length mouse serum albumin (negative control)
  • MSA-RGD in which the RGD sequence (VRGDF, SEQ ID NO: 2) replaces the MSA sequence between Cys 53 and Cys 62
  • MSA-11B3 in wliich the 11-B3 peptide sequence (PSTLRAQ, SEQ ID NO: 3) replaces the MSA sequence between Cys 53 and Cys 62
  • MSA-1H5 in wliich the 1-H5 peptide sequence (HTKQIPRHIYSA, SEQ ID NO: 4) is inserted between Glu 57 and Ser 58 within the Cys 53 and Cys 62 loop of MSA
  • MSA-9G5 in which the 9-G5 peptide sequence (DSHKRLK, SEQ ID NO: 5) replaces the MSA sequence between Cys 53 and Cys 62
  • MSA-myc in which the Myc epitope peptide sequence (EQKLISEEDL, SEQ ID NO: 1) is inserted between Glu 57 and Ser 58 within the Cys 53 and Cys 62 loop of MSA (negative control)
  • the transfected COS7-L cells were cultured in defined serum free media (VP-SFM). Each day for 5 days, the conditioned media were collected from the cells, centrifuged to remove dead cells and other cellular debris, and then frozen. The 5 days worth of cultured media were pooled and concentrated 500-fold using a Centiprep-80 with a molecular weight cut-off of 50 (for MSA, MSA-RGD, MSA- 9G5) or a molecular weight cut-off of 30 (for MSA-myc, MSA-11B3, MSA-1H5). The concentration of the albumin proteins was determined by Western blot analysis of each preparation using a rabbit anti-MSA antibody and using purified MSA of known concentration to generate a standard curve. Following development of the blot and exposure to film the autoradiographs were analyzed using the Gel Doc 1000 image analysis system and Molecular Analyst software (BioRad).
  • bovine capillary endothelial cells (BCE) at passage 11 were plated in 96 well tissue culture plates at a density of 2 X 103 cells per well in 100 ml 5% calf serum (CSV DMEM supplemented with penicillin/streptomycin (PS). The cells were then incubated overnight in an atmosphere of 10 % CO2, 37 °C.
  • the media was changed to 150 ml 2% CS/DMEM/PS.
  • the albumin proteins were added to the first well as 8.75 ml which contains an additional 150 ml of 2% CS/DMEM/PS.
  • 150 ml was then removed from this well and added to the next well resulting in a 1:2 dilution of the protein. This process was repeated for a total of six times each in triplicate.
  • 50 ml of 4 ng/ml FGF final concentration: 1 ng/ml FGF
  • a synthetic peptide of cyclic RGD (c-RGD) at a concentration of 4.1 mM was included to serve as a positive control for inhibition of proliferation.
  • Cells without addition of protein but with FGF added and without FGF added were included on each plate as additional controls.
  • the insertion of the EC binding peptides into MSA increased their inhibitory activity by approximately 1000-fold.
  • the MSA-EC binding peptide fusions inhibited BCE and HUVEC proliferation in the nanomolar (nM) range while the synthetic peptides were active in the micromolar (mM) range.
  • the control MSA and MSA- myc proteins did not significantly affect the proliferation of the target endothelial cells.
  • Cos7 cells leads to a 4.9 fold induction of apoptosis relative to the vector control in
  • NCI- 1869 cells NCI- 1869 cells. Incubation of these cells with purified RGD peptide also leads to the induction of apoptosis as assessed by microscopic analysis.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Genetics & Genomics (AREA)
  • Immunology (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Urology & Nephrology (AREA)
  • General Health & Medical Sciences (AREA)
  • Hematology (AREA)
  • Cell Biology (AREA)
  • Toxicology (AREA)
  • Biophysics (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Virology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Food Science & Technology (AREA)
  • Plant Pathology (AREA)
  • Analytical Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

One aspect of the present invention is the synthesis of a binary method that combines variegated display libraries, e.g., in a 'display mode' and a 'secretion mode', to yield a method for the efficient isolation of chimeric serum peptides in having a desired biological activity.

Description

Methods and Reagents for Generating Chimeric Serum Peptide Carriers
Reference to Related Application
This application claims priority to U.S. Provisional Application 60/351,225, filed on January 23, 2002, the entire content of which is incorporated herein by reference.
Background of the Invention
High throughput screening has become a dominant tool in the pharmaceutical industry for the discovery of lead compounds that can be modified into candidates for drug development. For instance, it is abundantly used for identification of ligands with high affinity for receptors. In this regard, combinatorial techniques have provided approaches to generating and deconvoluting large libraries of test compounds in high throughput screens. It involves selection and amplification of a subset of molecules with desired biological properties from complex libraries. One technique which has emerged for identification of peptide leads involves the use of peptide display methodologies such as phage display. Phage-displayed peptide libraries can comprise vast collections of short, randomized polypeptides that are displayed on the surface of a filamentous bacteriophage particle. Thus, each "peptide" is actually the N-terminal sequence of a phage-coat protein, that is encoded by a randomly-mutated region of the phage genome responsible for the production of the coat protein. In this manner, each unique peptide in the library is physically linked with the DNA molecule encoding it. Antibodies and other binding molecules can be used as "targets" to specifically select rare phage clones bearing ligand peptides, and sequencing of the corresponding viral DNA will reveal their amino acid sequences. Relatively high-affinity peptides for a variety of peptide- and non-peptide-binding targets have been affinity-isolated from epitope libraries. This technology has been used to map epitopes on proteins and to find peptide mimics for a variety of target molecules. Many powerful applications can be envisioned in the areas of drug design and the development of diagnostic markers, vaccines and toleragens.
For the purposes of drug discovery, there are potential advantages in the use of genetically encoded libraries, such as phage display (Scott et al, Science 249, 386 (1990); Devlin et al., Science 249, 386 (1990)), "peptide on plasmid" (Cull et al.
PNAS 89, 1865 (1992)), and in vitro translation-based systems (Mattheakis et al.
PNAS 91, 9022 (1994)), compared to the use of synthetic small molecule libraries
(Bunin et al. PNAS 91, 4708 (1994); Gordon et al. J. Med. Chem. 37, 1385 (1994); and Dooley et al., Science 266, 2019 (1994)). The genetic encoding of libraries allows the resynthesis and rescreening of molecules with a desired binding activity.
The resulting amplification of interacting molecules in subsequent rounds of selection can lead to the isolation of extremely rare, specific binders from a large
( pool of molecules.
However, despite the success of these methods, they suffer from numerous sources of error and bias, such as very low initial concentrations of species, nonspecific binding, and, significantly, the sampling of only a fraction of the library at the end of an experiment.
Once identified based on its binding activity, developing a therapeutic version of a peptide is further complicated be the often short half-life that peptides have in the circulation, being rapidly excreted through the kidneys or taken up by the reticuloendothelial system (RES) and other tissues. In recent years, many pharmaceutical companies and other institutions have devoted considerable time and resources into extending the duration of peptide drugs in the human body. The advantages of having a patient take such a drug less often are numerous; such as, better compliance, more predictable concentrations in the body, and fewer side effects from the sudden rush of medication shortly after it is administered. All medications, especially those given prophylactically or for a long period of time, are more readily accepted by the patient if they need to be taken less often.
One potential solution to these problems with serum-active proteins has been to provide the protein as part of a fusion protein with a serum protein. Serum albumin and other serum proteins have been demonstrated to provide a stable plasma transporter function for unstable proteins. However, using the prior art methods for isolating peptides based on binding activity, one cannot reasonably predict whether a selected peptide will retain that activity when presented in the context of a chimeric serum peptide. Further, the prior art does not demonstrate the use of fusion proteins in which a short peptide is fused to a serum protein to form a chimeric serum peptide (CSP). To illustrate, peptides isolated by phage display are isolated in the context of being a fusion protein with a viral coat protein. Peptides which are most active in such binding assays may not be the most active peptides from the library when formatted as part of a chimeric serum protein, e.g., as a consequence to a different steric, electronic and hydrophobicity environment. Indeed, in that context, other peptide sequences firom the library may in fact be more ideally suited for incorporation in a chimeric serum peptide, but are not selected due to the original format of the peptide library.
Summary of the Invention
One aspect of the invention provides a method for generating a chimeric serum peptide (CSP) with a selected biological activity, comprising:
(i) providing a display library comprising a variegated population of test chimeric serum peptides (CSPs) expressed on the surface of a population of display packages, each of which CSPs includes a serum protein sequence and at least one heterologous test peptide sequence which is variegated in the library and which is provided at an N- terminal end, C-terminal end or internal site of the serum protein sequence;
(ii) in a display mode, isolating, from the display library, a sub- population of display packages enriched for test CSPs which have a desired binding specificity and/or affinity for a cell or a component thereof;
(iii) in a secretion mode, expressing, preferably simultaneously, the enriched test CSP sub-population under conditions wherein the test CSPs are secreted and are free of the display packages; and, (iv) assessing the ability of the secreted test CSPs to regulate a selected biological activity in a target cell;
(v) selecting a chimeric serum peptide (CSP) possessing the ability to regulate the selected biological activity in the target cell, thereby generating a chimeric serum peptide (CSP) with a selected biological activity.
In a related aspect, the invention provides a method for generating a chimeric serum peptide (CSP) with a selected biological activity, comprising:
(i) providing a display library comprising a variegated population of test peptides expressed on the surface of a population of display packages, wherein the test peptide sequences are separated from the serum protein sequences by splice sites that are functional in eukaryotic cells, but not in prokaryotic cells;
(ii) in a display mode, isolating, from the display library, a sub- population of display packages enriched for test peptides which have a desired binding specificity and/or affinity for a cell or a component thereof;
(iii) in a secretion mode, expressing, preferably simultaneously, the enriched test peptide sub-population under conditions wherein the test peptide is flanked by serum protein sequences, such that the test peptide is in the form of a test chimeric serum peptide (CSP), and the test CSPs are secreted and are free of the display packages; and,
(iv) assessing the ability of the secreted test CSPs to regulate a selected biological activity in a target cell; (v) selecting a chimeric serum peptide (CSP) possessing the ability to regulate the selected biological activity in the target cell, thereby generating a chimeric serum peptide (CSP) with a selected biological activity.
For instance, the display library can be a phage display library, e.g., which utilizes phage particles such as M13, fl, fd, Ifl, Ike, Xf, Pfl, Pf3, λ, T4, T7, P2, P4, φX-174, MS2 or f2. In preferred embodiments, the phage display library is generated with a filamentous bacteriophage specific for Escherichia coli and the phage coat protein is coat protein III or coat protein VIII. For instance, the filamentous bacteriophage can be Ml 3, fd, and fl. In other embodiments, the display library is a bacterial cell-surface display library or a spore display library.
In certain embodiments, the test CSPs are enriched from the display library in the display mode by a differential binding means comprising affinity separation of test CSPs which specifically bind the cell or component thereof from test CSPs which do not. For example, the differential binding means can include panning the display library on whole cells, affinity chromatographic means in which a component of a cell is provided as part of an insoluble matrix (e.g., a cell surface protein attached to a polymeric support), and/or immunoprecipitating the display packages. In certain embodiments, in the display mode, the test CSPs can be enriched for those which bind to a cell-type specific marker and/or a cell surface receptor protein. For example, the test CSP library can be enriched in the display mode for test CSPs which bind to a G-protein coupled receptor, such as a chemoattractant peptide receptor, a neuropeptide receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, or a polypeptide hormone receptor. The G-protein coupled receptor can also be αlA-adrenergic receptor, αlB-adrenergic receptor, α2- adrenergic receptor, α2B-adrenergic receptor, βl -adrenergic receptor, β2- adrenergic receptor, β3-adrenergic receptor, ml acetylcholine receptor (AChR), m2 ACliR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine receptor, 5-HTla, 5-HTlb, 5HTl-like, 5-HTld, 5HTld-like, 5HTld beta, substance K (neurokinin A), fMLP receptor, fMLP-like receptor, angiotensin II type 1, endothelin ETA, endothelin ETB, thrombin, growth hormone-releasing hormone (GHRH), vasoactive intestinal peptide, oxytocin, somatostatin SSTRl and SSTR2, SSTR3, cannabinoid, follicle stimulating hormone (FSH), leutropin (LH/HCG), thyroid stimulating hormone (TSH), thromboxane A2, platelet-activating factor (PAF), C5a anaphylatoxin, Interleukin 8 (IL-8) IL-8RA, IL-8RB, Delta Opioid, Kappa Opioid, mip-1/RANTES, Rhodopsin, Red opsin, Green opsin, Blue opsin, metabotropic glutamate m.GluRl-6, histamine H2, ATP, neuropeptide Y, amyloid protein precursor, insulin-like growth factor II, bradykinin, gonadotropin-releasing hormone, cholecystokinin, melanocyte stimulating hormone receptor, antidiuretic hormone receptor, glucagon receptor, or adrenocorticotropic hormone II.
In other embodiments, the test CSP library can be enriched in the display mode for test CSPs which bind to a receptor tyrosine kinase, such as an EPH receptor (for example, eph, elk, eck, sek, mek4, hek, hek2, eek, erk, tyrol, tyro4, tyroδ, tyroό, tyroll, cek4, cek5, cekό, cek7, cek8, cek9, ceklO, bsk, rtkl, rtk2, rtk3, mykl, myk.2, ehkl, ehk2,pagliaccio, htk, erk ox nuk receptors).
In still other embodiments, the test CSP library can be enriched in the display mode for test CSPs which bind to a cytokine receptor or an MIRR receptor. In certain embodiments, the test CSP library can be enriched in the display mode for test CSPs which bind to an orphan receptor.
In other embodiments, the test CSP library can be enriched in the display mode for test CSPs which bind to extracellular proteins, such as secreted proteins, or to infectious agents, such as viruses, fungi or bacteria. In preferred embodiments, the display library includes at least about 10 different test CSPs.
In preferred embodiments, the test CSPs are from about 3 to about 100 amino acid residues in length. In more preferred embodiments, the test CSPs are from about 4 to about 20 amino acid residues in length. In certain embodiments, each of the test CSPs are encoded by a chimeric gene comprising (i) a coding sequence for the test CSP, (ii) a coding sequence for a surface protein of the display package for displaying the test CSPs on the surface of a population of display packages, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in the display mode, the chimeric gene is expressed as fusion protein including the test CSP and the surface protein, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
In preferred embodiments, the test CSPs are expressed by a eukaryotic cell, more preferably a mammalian cell, in the secretion mode. In preferred embodiments, the target cell is a eukaryotic cell, more preferably a mammalian cell such as a human cell.
In certain embodiments, the biological activity scored for in the secretion mode includes a change in cell proliferation, cell differentiation or cell death. In other embodiments, the biological activity which is detected is changes in intracellular calcium mobilization, intracellular protein phosphorylation, phospholipid metabolism, and/or expression of cell-specific marker genes.
In certain embodiments, the target cell includes a reporter gene construct containing a reporter gene in operative linkage with one or more transcriptional regulatory elements responsive to the signal transduction activity of the cell surface receptor protein, expression of the reporter gene providing the detectable signal. For instance, the reporter gene can encode a gene product that gives rise to a detectable signal selected from: color, fluorescence, luminescence, cell viability relief of a cell nutritional requirement, cell growth, and drug resistance. In preferred embodiments, the reporter gene encodes a gene product selected from: chloramphenicol acetyl transferase, beta-galactosidase or secreted alkaline phosphatase. In other preferred embodiments, the reporter gene encodes a gene product which confers a growth signal.
In certain embodiments, the secretion mode includes expression of the test CSPs by a host cell co-cultured with the target cell. In a preferred embodiment, the co-cultured host and target cells are separated by a membrane which is permeable to the test CSP.
In certain embodiments, the secretion mode includes assessing the ability of the secreted test CSPs to inhibit the biological activity of an exogenously added compound on the target cells. In an exemplary embodiment: in step (ii) above, display packages which bind to endothelial cells are isolated; and in step (iv) above, the ability of the secreted test CSPs to inhibit proliferation of endothelial cells is assessed. For example, in step (iv) the ability of the secreted test CSPs to inhibit proliferation of endothelial cells in the presence of an angiogenic amount of an endogenous growth factor can be assessed.
In certain embodiments, the serum protein sequence is selected from: albumins, α-globulins, β-globulins, γ-globulins, haptoglobin, transthyretin, collagen, 2 macroglobulin, β2 microglobulin, C Reactive Protein, apolipoproteins, lipoproteins, cathepsins amylase, antichymotrypsin, ferritin, α fetoprotein, elastin and fibronectin and coagulation factors including fibrinogen, fibrin, thrombin, ceruloplasmin, antiplasmin and antithrombin III, or fragments thereof.
In certain embodiments, the CSPs are selected, at least in part, because the CSP is more potent than the test peptide sequence by itself, e.g., not fused to a serum protein. For example, a CSP library can be assigned to identify members that are at least 10 times, 100 times, or even 1000 times more active than the test peptide sequence alone, e.g., 1, 2, or even 3 orders of magnitude more active. Thus, in embodiments wherein the CSP inhibits a biological activity, the IC50 of the CSP may be 10 times lower, 100 times lower, or even 1000 times lower than the IC50 of the test peptide alone; and in embodiments wherein the CSP induces or promotes a biological activity, the EC50 of the CSP may be 10 times lower, 100 times lower, or even 1000 times lower than the EC50 of the test peptide alone. In embodiments wherein the CSP sequence binds to a biological molecule, such as a nucleic acid, peptide, or carbohydrate, the dissociation constant K of the CSP and the biological molecule to which it binds may be 10 times lower, 100 times lower, or even 1000 times lower than the Kd for binding between the biological molecule and the test peptide alone, e.g., binding of the two entities is increasingly favored over their dissociation.
In certain embodiments, the CSPs are represented by the formula A-B-C, wherein A represents a first fragment of a serum protein or homolog tliereof, B represents a test peptide sequence, and C represents another fragment of a serum protein or a homolog thereof.
In certain embodiments, the method further comprises formulating, with a pharmaceutically acceptable carrier, one or more test CSPs which regulate the biological activity in the target cell or peptidomimetics thereof.
In certain embodiments, the chimeric peptides are selected, at least in part, based on having a half-life in bodily fluids, such as urine, lymph, CSF or blood of no less than 10 days, preferably no less than about 14 days, and most preferably no less than 50% of the serum half-life of the native form of the serum protein. In another embodiment, the chimeric polypeptide is capable of binding to an extracellular receptor or ion channel. The chimeric polypeptide may be an agonist or an antagonist of an extracellular receptor or ion channel. The chimeric polypeptide of this embodiment may, for example, induce apoptosis, modulate cell proliferation, or modulate differentiation of cell types. In other embodiments, the test CSP library can be enriched in the display mode for test CSPs which bind to extracellular proteins, such as secreted proteins, or to infectious agents, such as viruses, fungi or bacteria.
The subject invention also specifically contemplates that peptides identified in the secretion mode can be converted into peptidomimietics. Moreover, in certain embodiments, the subject method includes the further step of formulating, with a pharmaceutically acceptable carrier, one or more test CSPs which regulate the biological activity in the target cell or peptidomimetics thereof.
Another aspect of the present invention provides a display library enriched for test chimeric serum peptides (CSPs) having a desired binding specificity and/or affinity for a cell or a component thereof and which regulate a biological activity in a target cell.
Still another aspect of the present invention relates to a vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as a fusion protein including the test CSP and the surface protein such that the test CSP can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing. In a related aspect, the invention provides a vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C- terminal end or internal site of the serum protein sequence, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as a test peptide protein not including the serum protein sequence and the surface protein such that the test peptide can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
In certain embodiments, the chimeric gene can include a secretion signal sequence for secretion of the test CSP in the secretion mode, e.g., secretion of the test CSP from eukaryotic cells, preferably mammalian cells. In certain embodiments, the display package is a phage, such as Ml 3, fl, fd,
Ifl, Ike, Xf, Pfl, Pf3, λ, T4, T7, P2, P4, φX-174, MS2 or f2. In other embodiments, the phage is a filamentous bacteriophage specific for Escherichia coli and the surface protein is coat protein III or coat protein VIII. The filamentous bacteriophage can be Ml 3, fd, or fl. Yet another aspect of the present invention provides a vector library, each vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence, which test peptide sequence is variegated amongst members of the library, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as fusion protein including the test CSP and the surface protein such that the test CSP can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing, the vector library collectively encodes a variegated population of test CSPs.
In preferred embodiments, the chimeric gene further comprises a secretion signal sequence for secretion of the test CSP in the secretion mode. In most preferred embodiments, the secretion signal sequence causes secretion of the test CSP from eukaryotic cells, such as mammalian cells.
In preferred embodiments, the display package is a phage, such as Ml 3, fl, fd, Ifl, Ike, Xf, Pfl, Pf3, λ, T4, T7, P2, P4, φX-174, MS2 or f2. Alternatively, the phage is a filamentous bacteriophage specific for Escherichia coli and the surface protein is coat protein III. The filamentous bacteriophage can be M13, fd, or fl .
In preferred embodiments, the vector library collectively encodes at least about 103 different test CSPs. In preferred embodiments, the test CSPs are from about 3 to about 100 amino acid in length, more preferably from about 4 to about 20 amino acid residues in length. Another aspect of the present invention is a cell composition comprising a population of cells containing the vector library described above.
Another aspect of the present invention provides a construct as shown in Figure 1, 3, 9 or 15.
Still another aspect of the present invention provides a method for identifying a peptide with a selected antimicrobial activity, comprising: (i) providing a recombinant host cell population which expresses a soluble peptide library comprising a variegated population of test chimeric serum proteins (CSPs), which includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence;
(ii) culturing the host cells with a target microorganism under conditions wherein the peptide library is secreted and diffuses to the target microorganism; and, (iii) selecting a host cell expressing a test CSP that inhibits growth of the target microorganism, thereby identifying a peptide with a selected antimicrobial activity.
For example, the target microorganism is a bacteria or a fungus. In certain embodiments, the host cells are cultured on agar embedded with the target microorganisms. For example, antimicrobial activity of a test CSP can be determined by zone clearing in the agar.
Because the insertion of a relatively small peptide into a serum protein loop may stabilize the structure of the serum protein, the chimeric serum peptides of the present invention may also be useful in determining the three dimensional structure of a protein, for example, throughout the use of nuclear magnetic resonance (NMR) or x-ray crystallography.
Another aspect of the present invention provides a method of conducting a drug discovery business comprising: i) identifying, by the method of the present invention, a chimeric serum peptides having a desired biological activity; ii) conducting therapeutic profiling of the chimeric serum protein identified in step (i) for efficacy and toxicity in mammals; and, formulating a pharmaceutical preparation including a chimeric serum protein identified in step (ii) as having an acceptable therapeutic profile. The method can include an additional step of establishing a distribution system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a sales group for marketing the pharmaceutical preparation.
Another aspect of the present invention provides a method of conducting a drug discovery business comprising: i) identifying, by the method of the present invention, a chimeric serum peptides having a desired biological activity; ii) conducting therapeutic profiling of the chimeric serum protein identified in step (i) for efficacy and toxicity in mammals; and, iii) licensing, to a third party, the rights for further drug development of one or more a chimeric serum peptides identified in step (ii) as having an acceptable therapeutic profile.
Another aspect of the present invention provides a method of conducting a drug discovery business comprising: i) identifying, by the method of the present invention, a chimeric serum peptides having a desired biological activity; ii) licensing, to a third party, the rights for further drug development based on one or more chimeric serum peptides identified in step (i).
Brief Description of the Drawings
Figure 1: Schematic of pSWl expression plasmid.
Figure 2: Original vector sequence for pSWl phage construct.
Figure 3 : Comparison of pSWl and pSW2 expression plasmids.
Figure 4: Sequence for pSW2 (cassette version).
Figure 5: Exchange of eukaryotic signal sequences by Xbal/Sall.
Figure 6: Reduction of flanking amino acids around peptide libraries.
Figure 7: Engineering of sites to allow fusion to carrier proteins in vitro.
Figure 8: Insertion of peptide libraries.
Figure 9: Comparison of pSW2 and pSW3 expression plasmids.
Figure 10: Sequences for pSW3, serum albumin construct. Figure 11 : Insertion of N-terminus of SA as Xbal/Sall PCR fragments.
Figure 12: Comparison of splice acceptor sequence in pSW2 and pSW3.
Figure 13 : Certain vector sequences for pSW3.
Figure 14: Movement of peptide sequences to pCDNA3 + MSA for expression and purification of MSA-peptide protein.
Figure 15: Comparison of pSW2 and pSW4 expression vectors.
Figure 16: Sequences for pSW4, display of MSA + peptide domain I of phage coat.
Figure 17: Insert SA pre-pro secretion signal sequence as Xbal/Sall site.
Figure 18: Insert Domain I of MSA (aal-96) as Hindlll/BamHI with engineered Sphl and Haindlll within MSA to allow insertion of oligoes and encoding peptide libraries.
Figure 19: Insert libraries as Sphl/Hindlll oligomers.
Figure 20: Splice acceptor sequence for pSW4.
Figure 21 : Reasoning behind two RBS signals for pill phage display.
Figure 22: Sequence for pSW2 + phage displayed serum albumin construct.
Detailed Description of the Invention
L Overview
The present invention makes available a powerful directed approach for isolating biologically active peptides in the context of a chimeric serum protein. One aspect of the present invention is the synthesis of a binary method that combines variegated display libraries, e.g., in a "display mode", with soluble secreted peptide libraries, e.g., in a "secretion mode", to yield a method for the efficient isolation of peptides having a desired biological activity. In each mode, the peptides of the library are provided as part of a chimeric serum protein or relevant fragment thereof. By providing the peptides as part of a chimeric serum protein in each of the steps, the subject method accesses the relevant 3 -dimensional conformational space for the peptides such that there is a degree of confidence that isolation and testing is optimized for the final use of the peptide in a chimeric serum protein.
Utilizing peptide display techniques, a chimeric serum protein library can first be reduced in complexity by panning or other affinity purification techniques. In particular, the subject method selects peptides, in the context of a chimeric serum protein, having a certain affinity profile, e.g., a specificity and/or binding affinity for a discrete cell or protein or other cellular component thereof by (i) displaying the chimeric serum protein on the outer surface of a replicable genetic display package to create a display library, and (ii) using affinity selection teclmiques to emich the population of display packages for those containing chimeric serum peptides which have a desired binding specificity for the target cell or cellular component (herein collectively referred to as the "target").
After the affinity enrichment step, the resulting sub-library is then utilized in a secretion mode whereby the chimeric serum peptides are secreted as soluble extracellular factors and their effect as a paracrine or autocrine factor is scored. That is, the secretion mode measures biological activity of the chimeric serum peptides in order to distinguish between agonist, antagonist, and inactive peptide sequences with regard to regulating a particular biological response of, e.g., a test cell or tissue. The display mode and secretion mode can be carried out without the need to sub-clone the coding sequences for each test chimeric serum protein into another vector. To illustrate, Figures 9 and 15 show exemplary vectors, pSW3 and pSW4, for sequential use in both the display and secretion modes.
In the present invention, there are several forms of vectors which can be used in preferred embodiments. In the vectors of one preferred embodiment (exemplified by pSW3), a test peptide is fused to a bacterial gene III, with mammalian splice donor and acceptor sites surrounding the variegated peptide. In bacterial cells, the vectors will produce a fusion protein consisting of a secretion signal sequence, the test peptide and the remaining C-terminal portion of the bacterial gene III. The resulting chimeric protein is capable of being incorporated into an Ml 3 phage particle (display mode). In mammalian cells (such as COS cells), the Ml 3 coding sequences are removed from the mature mRNA and the test peptide is inserted into an appropriate loop of serum albumin via splice sites. Thus, in mammalian cells, the test peptide will be expressed and secreted as part of a chimeric serum protein, CSP, such as chimeric albumin. In alternate embodiments (exemplified by pSW4), the vector comprises the chimeric serum peptide - with the test peptide fused to bacterial gene III already displayed in an appropriate loop of a serum protein, such as serum albumin, surrounded by splice sites. Thus, in bacterial cells, the vectors will produce the fusion protein consisting of a secretion signal sequence, the CSP, and the remaining C-terminal portion of the bacterial gene III protein, and incorporate it into an Ml 3 phage particle (display mode). In this case, the CSP, rather than the test peptide alone, will be displayed.
In mammalian cells (such as COS cells), the Ml 3 coding sequences are removed from the mature mRNA by virtue of splice sites. In this case, the CSP is left and will be secreted. In this, embodiment the serum protein or peptide, such as serum albumin, is normally a smaller peptide fragment, such as a peptide of serum albumin. For example, the serum peptide may comprise one or more domains of serum albumin, such as domain 1 (approximately amino acids 1 to 96 or amino acids 1 to 106). Thus, in both cases, the mature mRNA, in mammalian cells, encodes a secretion signal sequence and the chimeric serum protein alone, which is secreted as a soluble peptide from the cell.
Certain of these embodiments are appropriate for cells which do not have albumin receptors. However, in cells which do have albumin receptors, the chimeric serum peptide would not necessarily be detectable, as it could bind, as would native serum albumin protein. However, the second embodiment, in which only a fragment of the serum protein is used, can still be used in those cases.
One advantage to the vectors, compositions and methods of such embodiments of the subject method is the ability to reduce loss of peptide sequences from the sub-library by eliminating sub-cloning steps. The chimeric serum peptides of the invention include peptide sequences which make the protein useful as a therapeutic or diagnostic agent. Non-limiting examples of activity which can be imparted by the chimeric peptide sequence include enzyme inhibition, hormone agonism or antagonism, cytokine agonism or antagonism, analgesics, antipyretic, anti-inflammatory, antibiotic, antiviral, antisepsis, anti-fungal, cardiovascular agonism or antagonism, angiogenic agonism or antagonism, renal function and electrolyte metabolism agonism or antagonism, and chemotherapeutic drugs, to name but a few.
The improved pharmacokinetic or pharmacodynamic properties of the chimeric serum peptides can, in certain embodiments, provide for low-dose pharmaceutical formulations and novel pharmaceutical compositions. In certain aspects, the invention provides for methods of using the novel compositions including the therapeutic or diagnostic use of the chimeric peptides.
II. Definitions
Before further description of the invention, certain terms employed in the specification, examples and appended claims are, for convenience, collected here.
The term "peptide" refers to an oligomer in which the monomers are amino acids (usually alpha-amino acids) joined together through amide bonds. Peptides are two or more amino acid monomers long, but more often are between three and 100 amino acids in length. In preferred embodiments, the peptides may be from about 4 to about 20 amino acids or more, although peptides longer than 20 amino acids can also be referred to as "polypeptides." The term "protein" is well known in the art and usually refers to a very large polypeptide, or set of associated homologous or heterologous polypeptides, that has some biological function. For purposes of the present invention the terms "peptide" and "polypeptide" are largely interchangeable, and refer to domains or regions of proteins which may be primarily or secondarily responsible for some activity of a protein. For purposes of the present invention, it is preferred to fuse peptides or polypeptides to a serum protein to form "chimeric serum peptides" which can be used to generate the display library and so are collectively referred to as peptides. As used herein, the terms "chimeric serum peptide" and "CSP" are used interchangeably and refer to serum peptides which have been recombinantly engineered to include one or more heterologous amino acid sequences, peptides, at internal positions within the sequence of the serum protein, or appended at the amino- or carboxy-terminus of the protein. Unless otherwise apparent from the context, the term covers chimeric peptides which include the full-length sequence of a serum protein, as well as chimeric peptides in which only a fragment of the serum protein is retained.
The phrase "inserted into", as in the phrase "a biologically active peptide sequence inserted into a chimeric serum protein", is used herein to include both insertion of a first sequence between two amino acids of a second sequence, and replacement of one or more amino acids of the second sequence with the amino acids of the first sequence (e.g., replacing one or more amino acids of the second sequence with a first sequence of amino acids having the same or a different number of amino acids), unless the latter is clearly excluded.
A "test CSP" refers to a chimeric serum peptide which is being investigated for activity, e.g., being selected form a library or tested for activity on a cell. A "test peptide sequence" refers to the peptide sequence(s) which may be inserted into a serum protein and included in a CSP. The interchangeable terms "fusion" and "chimeric", as used herein to describe proteins and polypeptides, relate to polypeptides or proteins wherein two individual polypeptides or portions thereof are fused to form a single amino acid chain. Such fusion may arise from the expression of a single continuous coding sequence formed by recombinant DNA techniques. Thus, "fusion" polypeptides and "chimeric" polypeptides include contiguous polypeptides comprising a first polypeptide covalently linked via an amide bond to one or more amino acid sequences which define polypeptide domains that are foreign to and not substantially homologous with any domain of the first polypeptide.
Gene constructs encoding fusion proteins are likewise referred to a "chimeric genes" or "fusion genes". "Homology" and "identity" each refer to sequence similarity between two polypeptide sequences, with identity being a more strict comparison. Homology and identity can each be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When a position in the compared sequence is occupied by the same amino acid residue, then the polypeptides can be referred to as identical at that position; when the equivalent site is occupied by the same amino acid (e.g., identical) or a similar amino acid (e.g., similar in steric and/or electronic nature), then the molecules can be referred to as homologous at that position. A percentage of homology or identity between sequences is a function of the number of matching or homologous positions shared by the sequences. An "unrelated", "heterologous", or "non-homologous" sequence shares less than 40 percent identity, though preferably less than 25 percent identity, with a sequence to which it is compared. Thus, a "heterologous peptide sequence" is a peptide sequence substantially dissimilar to a sequence to which it is compared. The term "serum albumin" (S A) is intended to include (but not necessarily to be restricted to) serum albumin proteins of living organisms, preferably mammalian serum albumins, even more preferably known or yet-to-be-discovered polymorphic forms of human serum albumin (HSA), and fragments and variants thereof. As one example of "fragments," domain 1 of serum albumin (aa 1-96, or aa 1-106) can be used in place of the full protein. The term "fragments" is intended to include (but not necessarily be restricted to) one or more functional domains of serum albumin, including at least one loop selected from (Cys53-Cys62, Cys75-Cys91, Cys90-Cys101, Cys245-Cys253, Cys266-Cys279, Cys360-Cys369, Cys461 -Cys477, Cys476-Cys487, and Cys558-Cys567). As examples of "variants," the human serum albumin Naskapi has Lys-372 in place of Glu-372, and albumin Christchurch has an altered pro-sequence. The term "variants" is intended to include (but not necessarily be restricted to) homologs of SA proteins with minor artificial variations in sequence (such as molecules lacking one or a few residues, having conservative substitutions or minor insertions of residues, or having minor variations of amino acid structure). Thus, polypeptides which have 80%, 85%, 90%, or 99% homology with a native SA are deemed to be "variants." It is also preferred for such variants to share at least one pharmacological utility with a native SA. Any putative variant which is to be used pharmacologically should be non-immunogenic in the animal (especially human) being treated. Sequences of a number of contemplated serum albumin proteins can be obtained from GenBank (National Center for Biotechnology Information), including human, bovine, mouse, pig, horse, sheep, and chick serum albumins.
The term "native" is used to describe a protein which occurs naturally in a living organism. Wild-type proteins are thus native proteins. Proteins which are non- native are those which have been generated by artificial mutation, recombinant design, or other laboratory modification and are not known in natural populations. "Conservative substitutions" are those where one or more amino acids are substituted for others having similar properties such that one skilled in the art of polypeptide chemistry would expect at least the secondary structure, and preferably the tertiary structure, of the polypeptide to be substantially unchanged. For example, typical such substitutions include asparagine for glutamine, serine for asparagine, and arginine for lysine.
The term "physiologically functional equivalents" also encompasses larger molecules comprising the native sequence plus a further sequence at the N-terminus (for example, pro-HSA, pre-pre-HSA, and met-HSA).
"Tertiary structure" refers to the three-dimensional structure of a protein. Proteins which have similar tertiary structures will have similar shapes and surfaces, even if the amino acid sequences (the "secondary structure") is not identical. Tertiary structure is a consequence of the folding and twisting of an amino acid chain upon itself and can be disrupted by chemical means, e.g., strong acid or base, or by physical means, e.g., heating. The term "simultaneously expressing" refers to the expression of a representative population of a peptide library, or chimeric peptide library as the case may be, e.g., at least 50 percent, more preferably 75, 80, 85, 90, 95 or 98 percent of all the different peptide sequences of a library. "Simultaneous expression" may be independent for each member of a library, but may share common inducing factors, such that the expression of each takes place at approximately the same time interval. The term "random peptide library" refers to a set of random or semi-random peptide sequences, as well as sets of fusion proteins containing those random peptides (as applicable). As used herein, the term will be understood to include libraries of peptide sequences provided in the form of chimeric serum peptides. The term "effective amount" refers to an amount sufficient to induce a statistically significant result.
The term "ligand" refers to a molecule that is recognized by a particular protein, e.g., a receptor. Any agent bound by or reacting with a protein is called a "ligand," so the term encompasses the substrate of an enzyme and the reactants of a catalyzed reaction. The term "ligand" does not imply any particular molecular size or other structural or compositional feature other than that the substance in question is capable of binding or otherwise interacting with a protein. A "ligand" may serve either as the natural ligand to which the protein binds or as a functional analogue that may act as an agonist or antagonist. The language "replicable genetic display package" or "display package" describes a biological particle which has genetic information providing the particle with the ability to replicate. The package can display a chimeric serum protein, e.g., including a peptide derived from the variegated peptide library. The test peptide portion of the chimeric serum protein is presented by the display package in a context which permits the test CSP portion of the CSP to bind to a target that is contacted with the display package. The display package will generally be derived from a system that allows the sampling of very large variegated peptide libraries. The display package can be, for example, derived from vegetative bacterial cells, bacterial spores, and bacterial viruses. The language "differential binding means", as well as "affinity selection" and "affinity enrichment", refer to the separation of members of the display library based on the differing abilities of CSP sequences on the surface of each of the display packages of the library to bind to the target. The differential binding of a target by different members of the display library can be used in the affinity separation of tliose cliimeric serum peptides which specifically bind the target from those which do not. For example, the affinity selection protocol can also include a pre- or post-enrichment step wherein display packages capable of binding "background targets", e.g., as a negative selection, are removed from the library. Examples of affinity selection means include affinity chromatography, immunoprecipitation, fluorescence activated cell sorting, agglutination, and plaque lifts. As described below, the affinity chromatography includes bio-panning techniques using either purified, immobilized target proteins or the like, as well as whole cells.
The phrases "individually selective manner" and "individually selective binding", with respect to binding of a test CSP with a target protein, refers to the binding of a peptide to a certain protein target which binding is specific for, and dependent on, the molecular identity of the protein target.
The term "solid support" refers to a material having a rigid or semi-rigid surface. Such materials will preferably take the form of small beads, pellets, disks, chips, dishes, multi-well plates, wafers or the like, although other forms may be used. In some embodiments, at least one surface of the substrate will be substantially flat. The term "surface" refers to any generally two-dimensional structure on a solid substrate and may have steps, ridges, kinks, terraces, and the like without ceasing to be a surface.
The term "vector" refers to a DNA molecule, capable of replication in a host cell, into which a gene can be inserted to construct a recombinant DNA molecule.
The terms "phage vector" and "phagemid" are art-recognized and generally refer to a vector derived by modification of a phage genome, containing an origin of replication for a bacteriophage, and preferably, though optional, an origin (ori) for a bacterial plasmid. The use of phage vectors rather than the phage genome itself provides greater flexibility to vary the ratio of chimeric peptide/coat protein to wild- type coat protein, as well as supplement the phage genes with additional genes encoding other heterologous polypeptides, such as "auxiliary polypeptides" which may be useful in the "dual" CSP display constructs described below.
The language "helper phage" describes a phage which is used to infect cells containing a defective phage genome or phage vector and which functions to complement the defect. The defect can be one which results from removal or inactivation of phage genomic sequence required for production of phage particles. Examples of helper phage are M13K07.
As used herein, "cell surface receptor" refers to molecules that occur on the surface of cells, interact with the extracellular environment, and (directly or indirectly) transmit or transduce the information regarding the environment intracellularly in a manner that may modulate intracellular second messenger activities or transcription of specific promoters, resulting in transcription of specific genes.
As used herein, "extracellular signals" include a molecule or other change in the extracellular environment that is transduced intracellularly via cell surface proteins that interact, directly or indirectly, with the signal. An extracellular signal or effector molecule includes any compound or substance that in some manner alters the activity of a cell surface protein. Examples of such signals include, but are not limited to, molecules such as acetylcholine, growth factors and hormones, lipids, sugars and nucleotides that bind to cell surface and/or intracellular receptors and ion channels and modulate the activity of such receptors and channels. The term also include as yet unidentified substances that modulate the activity of a cellular receptor, and thereby influence intracellular functions. Such extracellular signals are potential pharmacological agents that may be used to treat specific diseases by modulating the activity of specific cell surface receptors.
"Orphan receptors" is a designation given to a receptors for which no specific natural ligand has been described and/or for which no function has been determined.
As used herein, a "reporter gene construct" is a nucleic acid that includes a "reporter gene" operatively linked to at least one transcriptional regulatory sequence. Transcription of the reporter gene is controlled by these sequences to which they are linked. The activity of at least one or more of these control sequences can be directly or indirectly regulated by the target receptor protein. Exemplary transcriptional control sequences are promoter sequences. A reporter gene is meant to include a promoter-reporter gene construct which is heterologously expressed in a cell. The term "indicator gene" generically refers to an expressible (e.g., able to be transcribed and (optionally) translated) DNA sequence which is, for example, expressed in response to a signal transduction pathway modulated by a target receptor or ion channel. Exemplary indicator genes include unmodified endogenous genes of the host cell, modified endogenous genes, or a reporter gene of a heterologous construct, e.g., as part of a reporter gene construct.
"Signal transduction" is the processing of physical or chemical signals from the cellular environment through the cell membrane, and may occur through one or more of several mechanisms, such as activation/inactivation of enzymes (such as proteases, or other enzymes which may alter phosphorylation patterns or other post- translational modifications), activation of ion channels or intracellular ion stores, effector enzyme activation via guanine nucleotide binding protein intermediates, formation of inositol phosphate, activation or inactivation of adenylyl cyclase, direct activation (or inhibition) of a transcriptional factor and/or activation. The term "modulation of a signal transduction activity of a receptor protein" in its various grammatical forms, as used herein, designates induction and/or potentiation, as well as inhibition of one or more signal transduction pathways downstream of a receptor.
Agonists and antagonists are "receptor effector" molecules that modulate signal transduction via a receptor. Receptor effector molecules are capable of binding to the receptor, though not necessarily at the binding site of the natural ligand. Receptor effectors can modulate signal transduction when used alone, i.e. can be surrogate ligands, or can alter signal transduction in the presence of the natural ligand, either to enhance or inhibit signaling by the natural ligand. For example, "antagonists" are molecules that block or decrease the signal transduction activity of receptor, e.g., they can competitively, noncompetitively, and/or allosterically inhibit signal transduction from the receptor, whereas "agonists" potentiate, induce or otherwise enhance the signal transduction activity of a receptor.
The terms "receptor activator" and "surrogate ligand" refer to an agonist which induces signal transduction from a receptor. The term "compound" as used herein is meant to include both exogenously added test compounds and peptides or chimeric serum peptides expressed from a library of the present invention.
The term "biologically active" refers to an entity which interacts in some way with a living organism on a molecular level. Entities which are biologically active may activate a receptor, provoke an immune reaction, interact with a membrane or ion channel, or otherwise induce a change in a biological function of an organism or any part of an organism.
The term "target cells" as used herein means cells, either in vivo or ex vivo, for which it is desired to modify the behavior tlirough interaction with a ligand. For example, the ligand may interact with a target cell via binding to a cellular receptor. The ligand may be a peptide, or a synthetic molecule which is able to mimic the effects of such a peptide. Target cells may be any type of cell, including blood cells, skeletal muscle cells, stem cells, skin cells, liver cells, secretory gland cells, hematopoietic cells, and marrow cells.
"Serum half-life" as used herein refers to the time required for half of a quantity of a peptide in the bloodstream to be degraded.
III. Exemplary Embodiments
A. Chimeric Serum Peptides
In general, the subject peptide sequences are included as part of a fusion protein with a serum protein, being added at either the N- or C- terminus of the proteins, or at one or more internal sites. Examples of serum proteins which can be used in the present invention include albumin, -globulins, β-globulins, γ-globulins, haptoglobin, transthyretin, collagen, 2 macroglobulin, β2 microglobulin, C Reactive Protein, apolipoproteins, lipoproteins, cathepsins amylase, antichymotrypsin, ferritin, α fetoprotein, elastin and fibronectin and coagulation factors including fibrinogen, fibrin, thrombin, ceruloplasmin, antiplasmin and antithrombin III, and the like. In addition to serum half-life issues and other characteristics which may influence the choice of serum protein to use as the scaffold, the bioavailability of the various proteins may also be a consideration. For instance, in certain embodiments the CSP is selected to be one which is capable of crossing the blood-brain barrier by transcytosis. To illustrate, the CSP can be generated by use of scaffold sequences from insulin, transferrin, IGF-I, IGF-II, basic albumin or prolactin.
Chimeric serum albumin polypeptides and their uses are also described in detail in co-pending U.S. application 09/768,183, filed on January 23, 2001, the entire content of which is incorporated herein by reference. U.S.S.N. 09/768,183 describes in detail, and provides ample working examples of chimeric serum proteins, their general structures, production from suitable vectors, and especially the preferred insertion positions (Cysteine loops, see below) for heterologous polypeptides, such as the ones used in the instant application.
In preferred embodiments, the variegated peptide sequences of the CSP are in the range of from about 3 to about 100 amino acids in length, more preferably from about 4 to about 50 amino acids in length or from about 4 to about 20 amino acids, and even more preferably at least 5, 10, 13, 15, 20 or 25 amino acid residues in length.
In certain embodiments, the test CSP is flanked by cysteine residues in order to provide a constrained environment. For example, the test peptide portion of the CSP may be represented in the general formula Cys-(Xaa)3- 3-Cys, where Xaa is independently selected for each position in the peptide chain.
Merely to illustrate, in certain embodiments one or more heterologous peptide fragments are inserted into a serum albumin protein or a homolog thereof. The heterologous peptide fragment may optionally replace a portion of the serum protein sequence. A peptide fragment which replaces a portion of the serum protein sequence need not be of the same length as the fragment it replaces. A chimeric serum protein according to this aspect may include more than one heterologous peptide fragments which replaces a portion of the serum protein sequence. The included fragments may be identical or different, and may be random, semi-random or sequences from a protein unrelated to serum protein. A CSP of this aspect, for example, may comprise the structure A-B-C, wherein A represents a first fragment of a serum protein or homolog thereof, B represents a test peptide sequence, and C represents another fragment of a serum protein or a homolog thereof. Similarly, a chimeric polypeptide may comprise the structure A-B-C-D-E, wherein A, C, and E represent fragments of a serum protein and B and D represent test peptide sequences, which may be the same or different. In certain embodiments, the test peptide portion includes at least 6 amino acids, at least 12 amino acids, or at least 18 amino acids.
The systems and methods disclosed herein are directed towards increasing the lifetime of therapeutic peptides in the bloodstream by creating cliimeric polypeptides containing segments of serum proteins and the peptide. Serum albumin, for example, is the major protein constituent of the circulatory system, has a half-life in the blood of about three weeks (Rothschild, M.A. et al. Hepatology 1988, 8, 385-401), and is present in quantity (40 g/L in the serum). It is also lαiown that the normal adult human liver produces approximately 15 grams of human serum albumin (HSA) per day, or about 200 mg per kilogram of body weight. Serum albumin has no immunological activity or enzymatic function, and is a natural carrier protein used to transport many natural and therapeutic molecules. Fusion proteins wherem a therapeutic polypeptide has been covalently linked to serum albumin have been shown to have serum half-lives many times longer than the half- life of the therapeutic peptide itself (Syed, S. et al. Blood 1997, 89, 3243-3252; Yeh, P. et al. Proc. Natl. Acad. Sci. USA 1992, 89, 1904-1908). In both cited publications, the half-life of the fusion protein was more than 140 times greater than that of the therapeutic polypeptide itself, and approached the half-life of unfused serum albumin. Furthermore, the amino-terminal portion of serum albumin has been found to favor particularly efficient translocation and export of the fusion proteins in eukaryotic cells (PCT publication WO 90/13653). Generally, this means that such proteins are more efficiently secreted by a cell manufacturing such proteins than are the free therapeutic polypeptides themselves. The three-dimensional structure and the chemistry of SA have been well studied (Carter, D.C. et al. Eur. J. Biochem. 1994, 226, 1049-1052; He, X.M. et al. Nature 1992, 358, 209-215; Carter, D.C. et al. Science 1989, 244, 1195-1198). Thus, rather than relying on simple, binary fusion proteins as discussed above, portions of the SA protein may be strategically or combinatorially replaced by therapeutic polypeptides. For example, cysteine-constrained loops may be selected for replacement, e.g., on the presumption that structural changes to the loop are likely to minimally affect the tertiary structure of the protein as a whole. Figures 4A-I of U.S.S.N. 09/768,183 (supra) show the locations of several such loops on the mouse serum albumen protein. The present invention contemplates insertion into or replacement of any one of the loops in serum albumin that meet this criteria, or any combination of such loops. In certain embodiments, a loop selected for insertion or replacement is located at or near the surface of the serum albumen protein to facilitate intermolecular interactions. One of skill in the art will readily be able to adapt these techniques to other serum albumen proteins, e.g., bovine, human, and other serum albumen proteins.
Techniques of combinatorial mutagenesis combined with structurally motivated grafting procedures allow the random preparation of a library of many related polypeptides which carry a biologically active peptide fragment and are substantially similar to serum albumin in tertiary structure. For example, a chimeric polypeptide of the present invention may include a biologically active heterologous peptide sequence inserted into the peptide sequence of a serum albumin protein. The inserted sequence may optionally replace a portion of the serum albumin sequence, whether that portion is of similar or dissimilar length. In some cases, more than one insertion may be required to obtain the desired biological activity. Alternatively, a biologically active heterologous peptide sequence may be placed between two fragments of a serum albumin sequence to create such a chimeric polypeptide. Optionally, one or more additional biologically active peptide sequences may be placed between fragments of serum albumin protein. Chimeric polypeptides of the present invention may also be described as a biologically active heterologous peptide sequence flanked on one side by an N-terminal fragment of serum albumin protein and on the other side by a C-terminal fragment of serum albumin protein. A space-filling model of human serum albumin (HSA) has been previously generated (see Figure 1 of U.S.S.N. 09/768,183). The tertiary structure of HSA reveals the presence often approximate helical regions or loops, each constrained by disulfide bonded cysteine pairs. The space-filling model was used to predict loop regions that are exposed on the surface of the protein.
Two amino acid segments were chosen to represent surface exposed regions (loop 53-62 and loop 360-369) and a third to represent a region assumed to be buried within the protein (loop 450-463). These and other candidate loops (Cys53-Cys62, Cys75-Cys91, Cys90-Cys101, Cys245-Cys253, Cys266-Cys279, Cys360-Cys369, Cys461- Cys477, Cys476-Cys487, and Cys558-Cys567) are depicted in Figures 4A-I of U.S.S.N. 09/768,183. These amino acid segments represent surface exposed regions.
The variegated peptide libraries of the subject method can be generated by any of a number of methods, and, though not limited by, preferably exploit recent trends in the preparation of chemical libraries. For instance, chemical synthesis of a degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector. The purpose of a degenerate set of genes is to provide, in one mixture, all of the sequences encoding the desired set of potential test sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11 :477. Such techniques have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). As used herein, "variegated" refers to the fact that a population of peptides is characterized by having a peptide sequence which differ from one member of the library to the next. For example, in a given peptide library of n amino acids in length, the total number of different peptide sequences in the library is given by the product °f "! l 2 •••• Vn-1 Vn\ wnere eacn vn represents the number different amino acid residues occurring at position n of the peptide. In a preferred embodiment of the present invention, the CSP display collectively produces a peptide library including at least 96 to 107 different chimeric serum peptides, so that diverse peptides may be simultaneously assayed for the ability to interact with the target protein.
In one embodiment, the test CSP library is derived to express a combinatorial library of peptide sequences in the chimeric serum protein which are not based on any known sequence, nor derived from cDNA. That is, the sequences of the test peptides of the library are largely, if not entirely, random.
In another embodiment, the peptide library is derived to express a combinatorial library of peptides sequences which are based at least in part on a known polypeptide sequence or a portion thereof (though preferably not a cDNA library). That is, the sequences of the library is semi-random, being derived by combinatorial mutagenesis of a known sequence(s). See, for example, Ladner et al. PCT publication WO 90/02909; Garrard et al, PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Griffths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461. Accordingly, sequences for peptides that are lαiown ligands for a receptor can be mutagenized by standard techniques to derive a variegated library of polypeptide sequences which can be incorporated into the subject chimeric serum protein and screened for agonists and/or antagonists. The harnessing of biological systems for the generation of peptide diversity is now a well established technique which can be exploited to generate the CSP libraries of the subject method. The source of diversity is the combinatorial chemical synthesis of mixtures of oligonucleotides. Oligonucleotide synthesis is a well- characterized chemistry that allows tight control of the composition of the mixtures created. Degenerate DNA sequences produced are, in preferred embodiments, ligated with coding sequences for the serum protein.
There are two principal ways in which to prepare the required degenerate mixture. In one method, the DNAs are synthesized a base at a time. When variation is desired at a base position dictated by the genetic code a suitable mixture of nucleotides is reacted with the nascent DNA, rather than the pure nucleotide reagent of conventional polynucleotide synthesis. The second method provides more exact control over the amino acid variation. First, trinucleotide reagents are prepared, each trinucleotide being a codon of one (and only one) of the amino acids to be featured in the peptide library. When a particular variable residue is to be synthesized, a mixture is made of the appropriate trinucleotides and reacted with the nascent DNA. Whatever the method may be for generating diversity at the codon level, chemical synthesis of a degenerate peptide coding sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes can then be ligated into an appropriate gene for expression. The synthesis of degenerate oligonucleotides is well lαiown in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477.
B. Display Mode
In its "display mode", a library of test peptides or test CSPs is expressed by a population of display packages to form a display library. With respect to the display package on which the variegated peptide library is manifest, it will be appreciated from the discussion provided herein that the display package will preferably be able to be (i) genetically altered to encode the test peptide or chimeric serum peptides, (ii) maintained and amplified in culture, (iii) manipulated to display the test peptide- containing gene product or CSP-containing gene product in a manner permitting the test peptide or CSP to interact with a target during an affinity separation step, and (iv) affinity separated while retaining the nucleotide sequence encoding the test peptide or test CSP (herein "test gene") such that the sequence of the test gene can be obtained. In preferred embodiments, the display remains viable after affinity separation.
Ideally, the display package comprises a system that allows the sampling of very large variegated display libraries, rapid sorting after each affinity separation round, and easy isolation of the test gene from purified display packages or further manipulation of that sequence in the secretion mode. The most attractive candidates for this type of screening are prokaryotic organisms and viruses, as they can be amplified quickly, they are relatively easy to manipulate, and large number of clones can be created. Preferred display packages include, for example, vegetative bacterial cells, bacterial spores, and most preferably, bacterial viruses (especially DNA viruses). However, the present invention also contemplates the use of eukaryotic cells, including yeast and their spores, as potential display packages.
In addition to commercially available kits for generating phage display libraries (e.g. the Pharmacia Recombinant Phage Antibody System, catalog no. 27- 9400-01; and the Stratagene SurfZAP™ phage display kit, catalog no. 240612), examples of methods and reagents particularly amenable for use in generating the variegated display library of the present invention can be found in, for example, the Ladner et al. U.S. Patent No. 5,223,409; the Kang et al. International Publication No. WO 92/18619; the Dower et al. International Publication No. WO 91/17271; the Winter et al. International Publication WO 92/20791; the Markland et al. International Publication No. WO 92/15679; the Breitling et al. International Publication WO 93/01288; the McCafferty et al. International Publication No. WO 92/01047; the Garrard et al. International Publication No. WO 92/09690; the Ladner et al. International Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum Antibod Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffths et al. (1993) EMBO J 12:725- 734; Hawkins et al. (1992) J Mol Biol 226:889-896; Clackson et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Garrad et al. (1991) Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc Acid Res 19:4133- 4137; and Barbas et al. (1991) PNAS 88:7978-7982. These systems can, with modifications described herein, be adapted for use in the subject method. When the display is based on a bacterial cell, or a phage which is assembled periplasmically, the display means of the package will comprise at least two components. The first component is a secretion signal which directs the recombinant test gene product to be localized on the extracellular side of the cell membrane (of the host cell when the display package is a phage). This secretion signal can be selected so as to be cleaved off by a signal peptidase to yield a processed, "mature" polypeptide, e.g., a mature test peptide or CSP. The second component is a display anchor protein which directs the display package to associate the test gene product with its outer surface. As described below, this anchor protein can be derived from a surface or coat protein native to the genetic package.
When the display package is a bacterial spore, or a phage whose protein coating is assembled intracellularly, a secretion signal directing the test gene product to the inner membrane of the host cell is unnecessary. In these cases, the means for arraying the variegated peptide library comprises a derivative of a spore or phage coat protein amenable for use as a fusion protein.
In some instances it may be necessary to introduce an unstructured polypeptide linker region between portions of the chimeric peptide, e.g., between the test gene product and anchor polypeptide. This linker can facilitate enhanced flexibility of the chimeric peptide allowing the test gene product to freely interact with a target by reducing steric hindrance between the two fragments, as well as allowing appropriate folding of each portion to occur. The linker can be of natural origin, such as a sequence determined to exist in random coil between two domains of a protein. Alternatively, the linker can be of synthetic origin. For instance, the sequence (Gly4Ser)3 can be used as a synthetic unstructured linker. Linkers of this type are described in Huston et al. (1988) PNAS 85:4879; and U.S. Patent Nos. 5,091,513 and 5,258,498. Naturally occurring unstructured linkers of human origin are preferred as they reduce the risk of immunogenicity. In the instance wherein the display package is a phage, the cloning site for the test gene sequences in the phagemid should be placed so that it does not substantially interfere with normal phage function. One such locus is the intergenic region as described by Zinder and Boeke, (1982) Gene 19:1-10.
The number of possible combinations in a peptide library can get larger as the length is increased and selection criteria for degenerating at each position is relaxed. To sample as many combinations as possible depends, in part, on the ability to recover large numbers of transformants. For phage with plasmid-like forms (as filamentous phage), electrotransformation provides an efficiency comparable to that of phage-transfection with in vitro packaging, in addition to a very high capacity for DNA input. This allows large amounts of vector DNA to be used to obtain very large numbers of transformants. The method described by Dower et al. (1988) Nucleic Acids Res., 16:6127-6145, for example, may be used to transform fd-tet derived recombinants at the rate of about 107 transformants/μg of ligated vector into E. coli (such as strain MCI 061), and libraries may be constructed in fd-tet Bl of up to about 3 x 108 members or more. Increasing DNA input and making modifications to the cloning protocol within the ability of the skilled artisan may produce increases of greater than about 10-fold in the recovery of transformants, providing libraries of up to 1010 or more recombinants.
As will be apparent to those skilled in the art, in embodiments wherein high affinity peptides are sought, an important criteria for the present selection method can be that it is able to discriminate between peptides of different affinity for a particular target, and preferentially enrich for the peptides of highest affinity. Again, this is in the context of a chimeric serum protein. Applying the well known principles of peptide affinity and valence (i.e. avidity), it is understood that manipulating the display package to be rendered effectively monovalent can allow affinity enrichment to be carried out for generally higher binding affinities (i.e. binding constants in the range of 106 to 1010 M"1) as compared to the broader range of affinities isolable using a multivalent display package. To generate the monovalent display, the natural (i.e. wild-type) form of the surface or coat protein used to anchor the test peptide or CSP to the display can be added at a high enough level that it almost entirely eliminates inclusion of the test peptide or CSP in the display package. Thus, a vast majority of the display packages can be generated to include no more than one copy of the test peptide or CSP (see, for example, Garrad et al. (1991) Bio/Technology 9:1373-1377). In a preferred embodiment of a monovalent display library, the library of display packages will comprise no more than 5 to 10% polyvalent displays, and more preferably no more than 2% of the display will be polyvalent, and most preferably, no more than 1% polyvalent display packages in the population. The source of the wild-type anchor protein can be, for example, provided by a copy of the wild-type gene present on the same construct as the peptide fusion protein, or provided by a separate construct altogether. However, it will be equally clear that by similar manipulation, polyvalent displays can be generated to isolate a broader range of binding affinities. Such embodiments can be useful, for example, in purification protocols where avidity can be desirable. i) Phages As Display Packages
Bacteriophage are attractive prokaryotic-related organisms for use in the subject method. Bacteriophage are excellent candidates for providing a display system of the variegated peptide library as there is little or no enzymatic activity associated with intact mature phage, and because their genes are inactive outside a bacterial host, rendering the mature phage particles metabolically inert. In general, the phage surface is a relatively simple structure. Phage can be grown easily in large numbers, they are amenable to the practical handling involved in many potential mass screening programs, and they carry genetic information for their own synthesis within a small, simple package. As the test gene is inserted into the phage genome, choosing the appropriate phage to be employed in the subject method will generally depend most on whether (i) the genome of the phage allows introduction of the test gene either by tolerating additional genetic material or by having replaceable genetic material; (ii) the virion is capable of packaging the genome after accepting the insertion or substitution of genetic material; and (iii) the display of the peptide on the phage surface does not disrupt virion structure sufficiently to interfere with phage propagation.
One concern presented with the use of phage is that the morphogenetic pathway of the phage determines the environment in which the peptide will have opportunity to fold. Periplasmically assembled phage are preferred as the displayed peptides may contain essential disulfides, and such peptides may not fold correctly within a cell. However, in certain embodiments in which the display package forms intracellularly (e.g., where λ phage are used), it has been demonstrated in other instances that disulfide-containing peptides can assume proper folding after the phage is released from the cell.
Another concern related to the use of phage, but also pertinent to the use of bacterial cells and spores as well, is that multiple infections could generate hybrid displays that carry the gene for one particular test peptide or CSP yet have two or more different test peptides or CSPs on their surfaces. Therefore, it can be preferable, though optional, to minimize this possibility by infecting cells with phage under conditions resulting in a low multiple-infection. For a given bacteriophage, the preferred display means is a protein that is present on the phage surface (e.g. a coat protein). Filamentous phage can be described by a helical lattice; isometric phage, by an icosahedral lattice. Each monomer of each major coat protein sits on a lattice point and makes defined interactions with each of its neighbors. Proteins that fit into the lattice by making some, but not all, of the normal lattice contacts are likely to destabilize the virion by aborting formation of the virion as well as by leaving gaps in the virion so that the nucleic acid is not protected. Thus in bacteriophage, unlike the cases of bacteria and spores, it is generally important to retain in the fusion anchor proteins those residues of the coat protein that interact with other proteins in the virion. For example, when using the Ml 3 cpVIII protein, the entire mature protein will generally be retained with the test peptide or CSP sequence being added to the N-terminus of cpVIII, while on the other hand it can suffice to retain only the last 100 carboxy terminal residues (or even fewer) of the Ml 3 cpIII coat protein in the peptide fusion protein. Under the appropriate induction, the test gene library is expressed and exported, as part of the fusion protein, to the bacterial cytoplasm, such as when the λ phage is employed. The induction of the fusion protein(s) may be delayed until some replication of the phage genome, synthesis of some of the phage structural-proteins, and assembly of some phage particles has occurred. The assembled protein chains then interact with the phage particles via the binding of the anchor protein on the outer surface of the phage particle. The cells are lysed and the phage bearing the library-encoded test peptide or CSP (that corresponds to the specific library sequences carried in the DNA of that phage) are released and isolated from the bacterial debris. To enrich for and isolate phage which encodes a selected test peptide or CSP, and thus to ultimately isolate the nucleic acid sequences (the test gene) themselves, phage harvested from the bacterial debris are affinity purified. As described below, when a test peptide or CSP which specifically binds a particular target is desired, the target can be used to retrieve phage displaying the desired test peptide or CSP. The phage so obtained may then be amplified by infecting into host cells. Additional rounds of affinity enrichment followed by amplification may be employed until the desired level of enrichment is reached. The enriched peptide-phage can also be screened with additional detection- techniques such as expression plaque (or colony) lift (see, e.g., Young and Davis, Science (1983) 222:778-782) whereby a labeled target is used as a probe.
a) Filamentous Phage
Filamentous bacteriophages, which include Ml 3, fl, fd, Ifl, Ike, Xf, Pfl, and
Pf3, are a group of related viruses that infect bacteria. They are termed filamentous because they are long, thin particles comprised of an elongated capsule that envelopes the deoxyribonucleic acid (DNA) that forms the bacteriophage genome. The F pili filamentous bacteriophage (Ff phage) infect only gram-negative bacteria by specifically adsorbing to the tip of F pili, and include fd, fl and Ml 3.
Compared to other bacteriophage, filamentous phage in general are attractive and Ml 3 in particular is especially attractive because: (i) the 3-D structure of the virion is known; (ii) the processing of the coat protein is well understood; (iii) the genome is expandable; (iv) the genome is small; (v) the sequence of the genome is lαiown; (vi) the virion is physically resistant to shear, heat, cold, urea, guanidinium chloride, low pH, and high salt; (vii) the phage is a sequencing vector so that sequencing is especially easy; (viii) antibiotic-resistance genes have been cloned into the genome with predictable results (Hines et al. (1980) Gene 11:207-218); (ix) it is easily cultured and stored, with no unusual or expensive media requirements for the infected cells, (x) it has a high burst size, each infected cell yielding 100 to 1000 Ml 3 progeny after infection; and (xi) it is easily harvested and concentrated (Salivar et al. (1964) Virology 24: 359-371). The entire life cycle of the filamentous phage Ml 3, a common cloning and sequencing vector, is well understood. The genetic structure of M13 is well known, including the complete sequence (Schaller et al. in The Single-Stranded DNA Phages eds. Denhardt et al. (NY: CSHL Press, 1978)), the identity and function of the ten genes, and the order of transcription and location of the promoters, as well as the physical structure of the virion (Smith et al. (1985) Science 228:1315-1317; Raschad et al. (1986) Microbiol Dev 50:401-427; Kuhn et al. (1987) Science 238:1413-1415; Zimmerman et al. (1982) JBiol Chem 257:6529- 6536; and Banner et al. (1981) Nature 289:814-816). Because the genome is small (6423 bp), cassette mutagenesis is practical on RF Ml 3 (Current Protocols in Molecular Biology, eds. Ausubel et al. (NY: John Wiley & Sons, 1991)), as is single-stranded oligonucleotide directed mutagenesis (Fritz et al. in DNA Cloning, ed by Glover (Oxford, UK: IRC Press, 1985)). Ml 3 is a plasmid and transformation system in itself, and an ideal sequencing vector. Ml 3 can be grown on Rec-strains of E. coli. The Ml 3 genome is expandable (Messing et al. in The Single-Stranded DNA Phages, eds Denhardt et al. (NY: CSHL Press, 1978) pages 449-453; and Fritz et al., supra) and Ml 3 does not lyse cells. Extra genes can be inserted into Ml 3 and will be maintained in the viral genome in a stable manner.
The mature capsule or Ff phage is comprised of a coat of five phage-encoded gene products: cpVIII, the major coat protein product of gene VIII that forms the bulk of the capsule; and four minor coat proteins, cpIII and cpIV at one end of the capsule and cpVII and cpIX at the other end of the capsule. The length of the capsule is formed by 2500 to 3000 copies of cpVIII in an ordered helix array that forms the characteristic filament structure. The gene Ill-encoded protein (cpIII) is typically present in 4 to 6 copies at one end of the capsule and serves as the receptor for binding of the phage to its bacterial host in the initial phase of infection. For detailed reviews of Ff phage structure, see Rasched et al., Microbiol. Rev., 50:401-427 (1986); and Model et al., in The Bacteriophages, Volume 2, R. Calendar, Ed., Plenum Press, pp. 375-456 (1988). The phage particle assembly involves extrusion of the viral genome through the host cell's membrane. Prior to extrusion, the major coat protein cpVIII and the minor coat protein cpIII are synthesized and transported to the host cell's membrane. Both cpVIII and cpIII are anchored in the host cell membrane prior to their incorporation into the mature particle. In addition, the viral genome is produced and coated with cpV protein. During the extrusion process, cpV-coated genomic DNA is stripped of the cpV coat and simultaneously recoated with the mature coat proteins.
Both cpIII and cpVIII proteins include two domains that provide signals for assembly of the mature phage particle. The first domain is a secretion signal that directs the newly synthesized protein to the host cell membrane. The secretion signal is located at the amino terminus of the polypeptide and targets the polypeptide at least to the cell membrane. The second domain is a membrane anchor domain that provides signals for association with the host cell membrane and for association with the phage particle during assembly. This second signal for both cpVIII and cpIII comprises at least a hydrophobic region for spanning the membrane.
The 50 amino acid mature gene VIII coat protein (cpVIII) is synthesized as a 73 amino acid precoat (Ito et al. (1979) PNAS 76:1199-1203). cpVIII has been extensively studied as a model membrane protein because it can integrate into lipid bilayers such as the cell membrane in an asymmetric orientation with the acidic amino terminus toward the outside and the basic carboxy terminus toward the inside of the membrane. The first 23 amino acids constitute a typical signal-sequence which causes the nascent polypeptide to be inserted into the inner cell membrane. An E. coli signal peptidase (SP-I) recognizes amino acids 18, 21, and 23, and, to a lesser extent, residue 22, and cuts between residues 23 and 24 of the precoat (Kuhn et al. (1985) J Biol. Chem. 260:15914-15918; and Kuhn et al. (1985) J Biol. Chem. 260:15907-15913). After removal of the signal sequence, the amino terminus of the mature coat is located on the periplasmic side of the inner membrane; the carboxy terminus is on the cytoplasmic side. About 3000 copies of the mature coat protein associate side-by-side in the inner membrane.
The sequence of gene VIII is known, and the amino acid sequence can be encoded on a synthetic gene. Mature gene VIII protein makes up the sheath around the circular ssDNA. The gene VIII protein can be a suitable anchor protein because its location and orientation in the virion are known (Banner et al. (1981) Nature 289:814-816). Preferably, the peptide is attached to the amino terminus of the mature Ml 3 coat protein to generate the phage display library. As set out above, manipulation of the concentration of both the wild-type cpVIII and Ab/cpVIII fusion in an infected cell can be utilized to decrease the avidity of the display and thereby enhance the detection of high affinity peptides directed to the target(s).
Another vehicle for displaying the peptide is by expressing it as a domain of a chimeric gene containing part or all of gene III, e.g., encoding cpIII. When monovalent displays are required, expressing the peptide as a fusion protein with cpIII can be a preferred embodiment, as manipulation of the ratio of wild-type cpIII to chimeric cpIII during formation of the phage particles can be readily controlled. This gene encodes one of the minor coat proteins of Ml 3. Genes VI, VII, and IX also encode minor coat proteins. Each of these minor proteins is present in about 5 copies per virion and is related to morphogenesis or infection. In contrast, the major coat protein is present in more than 2500 copies per virion. The gene VI, VII, and IX proteins are present at the ends of the virion; these tliree proteins are not post- translationally processed (Rasched et al. (1986) Ann Rev. Microbiol. 41:507-541). In particular, the single-stranded circular phage DNA associates with about five copies of the gene III protein and is then extruded through the patch of membrane-associated coat protein in such a way that the DNA is encased in a helical sheath of protein (Webster et al. in The Single-Stranded DNA Phages, eds Dressier et al. (NY:CSHL Press, 1978).
Manipulation of the sequence of cpIII has demonstrated that the C-terminal 23 amino acid residue stretch of hydrophobic amino acids normally responsible for a membrane anchor function can be altered in a variety of ways and retain the capacity to associate with membranes. Ff phage-based expression vectors were first described in which the cpIII amino acid residue sequence was modified by insertion of polypeptide "targets" (Parmely et al., Gene (1988) 73:305-318; and Cwirla et al., PNAS (1990) 87:6378-6382) or an amino acid residue sequence defining a single chain peptide domain (McCafferty et al., Science (1990) 348:552-554). It has been demonstrated that insertions into gene III can result in the production of novel protein domains on the virion outer surface. (Smith (1985) Science 228:1315-1317; and de la Cruz et al. (1988) J Biol. Chem. 263:4318-4322). The CSP gene may be fused to gene III at the site used by Smith and by de la Cruz et al, at a codon corresponding to another domain boundary or to a surface loop of the protein, or to the amino terminus of the mature protein.
Generally, the successful cloning strategy utilizing a phage coat protein, such as cpIII of filamentous phage fd, will provide expression of a peptide chain fused to the N-terminus of a coat protein (e.g., cpIII) and transport to the inner membrane of the host where the hydrophobic domain in the C-terminal region of the coat protein anchors the fusion protein in the membrane, with the N-terminus containing the peptide chain protruding into the periplasmic space. Similar constructions could be made with other filamentous phage. Pf3 is a well lαiown filamentous phage that infects Pseudomonos aerugenosa cells that harbor an IncP-I plasmid. The entire genome has been sequenced ((Luiten et al. (1985) J Virol. 56:268-276) and the genetic signals involved in replication and assembly are known (Luiten et al. (1987) DNA 6:129-137). The major coat protein of PF3 is unusual in having no signal peptide to direct its secretion. The sequence has charged residues ASP-7, ARG-37, LYS-40, and PHE44 which is consistent with the amino terminus being exposed. Thus, to cause a peptide to appear on the surface of Pf3, a tripartite gene can be constructed which comprises a signal sequence known to cause secretion in P. aerugenosa, fused in-frame to a gene fragment encoding the peptide sequence, which is fused in-frame to DNA encoding the mature Pf3 coat protein. Optionally, DNA encoding a flexible linker of one to 10 amino acids is introduced between the CSP gene fragment and the Pf3 coat-protein gene. This tripartite gene is introduced into Pf3 so that it does not interfere with expression of any Pf3 genes. Once the signal sequence is cleaved off, the peptide is in the periplasm and the mature coat protein acts as an anchor and phage-assembly signal.
b) Bacteriophage φKl 74
The bacteriophage φX174 is a very small icosahedral virus which has been thoroughly studied by genetics, biochemistry, and electron microscopy (see The
Single Stranded DNA Phages (eds. Denhardt et al. (NY:CSHL Press, 1978)). Three gene products of φX174 are present on the outside of the mature virion: F (capsid),
G (major spike protein, 60 copies per virion), and H (minor spike protein, 12 copies per virion). The G protein comprises 175 amino acids, while H comprises 328 amino acids. The F protein interacts with the single-stranded DNA of the virus. The proteins F, G, and H are translated from a single mRNA in the viral infected cells.
As the virus is so tightly constrained because several of its genes overlap, φX174 is not typically used as a cloning vector due to the fact that it can accept very little additional DNA. However, mutations in the viral G gene (encoding the G protein) can be rescued by a copy of the wild-type G gene carried on a plasmid that is expressed in the same host cell (Chambers et al. (1982) Nuc Acid Res 10:6465- 6473). In one embodiment, one or more stop codons are introduced into the G gene so that no G protein is produced from the viral genome. The variegated test or CSP gene library can then be fused with the nucleic acid sequence of the H gene. An amount of the viral G gene equal to the size of test gene fragment is eliminated from the φX174 genome, such that the size of the genome is ultimately unchanged. Thus, in host cells also transformed with a second plasmid expressing the wild-type G protein, the production of viral particles from the mutant virus is rescued by the exogenous G protein source. Where it is desirable that only one test peptide or CSP be displayed per φX174 particle, the second plasmid can further include one or more copies of the wild-type H protein gene so that a mix of H and test/H proteins will be predominated by the wild-type H upon incoφoration into phage particles.
c) Large DNA Phage
Phage such as λ or T4 have much larger genomes than do M13 or φX174, and have more complicated 3-D capsid structures than M13 or φPX174, with more coat proteins to choose from. In embodiments of the invention whereby the test peptide or CSP library is processed and assembled into a functional form and associates with the bacteriophage particles within the cytoplasm of the host cell, bacteriophage λ and derivatives thereof are examples of suitable vectors. The intracellular morphogenesis of phage λ can potentially prevent protein domains that ordinarily contain disulfide bonds from folding correctly. However, variegated libraries expressing a population of functional peptides, which include such bonds, have been generated in λ phage. (Huse et al. (1989) Science 246:1275-1281; Mullinax et al. (1990) PNAS 87:8095-8099; and Pearson et al. (1991) PNAS 88:2432-2436). Such strategies take advantage of the rapid construction and efficient transformation abilities of λ phage.
When used for expression of peptide sequences (isogenous nucleotide sequences), may be readily inserted into a λ vector. For instance, variegated peptide libraries can be constructed by modification of λ ZAP II through use of the multiple cloning site of a λ ZAP II vector (Huse et al. supra). ii) Bacterial Cells as Display Packages
Recombinant peptides are able to cross bacterial membranes after the addition of appropriate secretion signal sequences to the N-terminus of the protein (Better et al (1988) Science 240:1041-1043; and Skerra et al. (1988) Science 240:1038-1041). In addition, recombinant peptides have been fused to outer membrane proteins for surface presentation. For example, one strategy for displaying peptides on bacterial cells comprises generating a fusion protein by inserting the peptide into cell surface exposed portions of an integral outer membrane protein (Fuchs et al. (1991) Bio/Technology 9:1370-1372). In selecting a bacterial cell to serve as the display package, any well-characterized bacterial strain will typically be suitable, provided the bacteria may be grown in culture, engineered to display the test gene library on its surface, and is compatible with the particular affinity selection process practiced in the subject method. Among bacterial cells, the preferred display systems include Salmonella typhimurium, Bacillus subtilis, Pseudomonas aeruginosa, Vibrio cholerae, Klebsiella pneumonia, Neisseria gonorrhoeae, Neisseria meningitidis, Bacteroides nodosus, Moraxella bovis, and especially Escherichia coli. Many bacterial cell surface proteins useful in the present invention have been characterized, and works on the localization of these proteins and the methods of determining their structure include Benz et al. (1988) Ann Rev Microbiol 42: 359-393; Balduyck et al. (1985) Biol Chem Hoppe-Seyler 366:9-14; Ehrmann et al (1990) PNAS 87:7574-7578; Heijne et al. (1990) Protein Engineering 4:109-112; Ladner et al. U.S. Patent No. 5,223,409; Ladner et al. WO88/06630; Fuchs et al. (1991) Bio/technology 9:1370-1372; and Goward et al. (1992) TIBS 18:136-140. To further illustrate, the LamB protein of E coli is a well understood surface protein that can be used to generate a variegated library of test peptides or CSPs on the surface of a bacterial cell (see, for example, Ronco et al. (1990) Biochemie 72:183-189; van der Weit et al. (1990) Vaccine 8:269-277; Charabit et al. (1988) Gene 70:181-189; and Ladner U.S. Patent No. 5,222,409). LamB of E. coli is a porin for maltose and maltodextrin transport, and serves as the receptor for adsorption of bacteriophages λ and K10. LamB is transported to the outer membrane if a functional N-terminal signal sequence is present (Benson et al. (1984) PNAS 81:3830-3834). As with other cell surface proteins, LamB is synthesized with a typical signal-sequence which is subsequently removed. Thus, the variegated test gene library can be cloned into the LamB gene such that the resulting library of fusion proteins comprise a portion of LamB sufficient to anchor the protein to the cell membrane with the test peptide or CSP fragment oriented on the extracellular side of the membrane. Secretion of the extracellular portion of the fusion protein can be facilitated by inclusion of the LamB signal sequence, or other suitable signal sequence, as the N-terminus of the protein. The E. coli LamB has also been expressed in functional form in S. typhimurium (Harkki et al. (1987) Mol Gen Genet 209:607-611), V. cholerae (Harkki et al. (1986) Microb Pathol 1:283-288), and K. pneumonia (Wehmeier et al. (1989) Mol Gen Genet 215:529-536), so that one could display a population of test peptides or CSPs in any of these species as a fusion to E. coli LamB. Moreover, K. pneumonia expresses a maltoporin similar to LamB which could also be used. In P. aeruginosa, the Dl protein (a homologue of LamB) can be used (Trias et al. (1988) Biochem Biophys Ada 938:493-496). Similarly, other bacterial surface proteins, such as PAL, OmpA, OmpC, OmpF, PhoE, pilin, BtuB, FepA, FhuA, IutA, FecA and FhuE, may be used in place of LamB as a portion of the display means in a bacterial cell.
In another exemplary embodiment, the fusion protein can be derived using the FliTrx™ Random Display library (Invitrogen). That library is a diverse population of random dodecapeptides inserted within the thioredoxin active-site loop inside the dispensable region of the bacterial flagellin gene (fliC). The resultant recombinant fusion protein (FLITRX) is exported and assembled into partially functional flagella on the bacterial cell surface, displaying the random peptide library.
Peptides are fused in the middle of thioredoxin, therefore, both their N- and
C-termini are anchored by thioredoxin' s tertiary structure. This results in the display of a constrained peptide. By contrast, phage display proteins are fused to the N- terminus of phage coat proteins in an unconstrained manner. The unconstrained molecules possess many degrees of conformational freedom which may result in the lack of proper interaction with the target molecule. Without proper interaction, many potential protein-protein interactions may be missed.
Moreover, phage display is limited by the low expression levels of bacteriophage coat proteins. FliTrx™ and similar methods can overcome this limitation by using a strong promoter to drive expression of the test peptide or CSP fusions that are displayed as multiple copies.
According to the present invention, it is contemplated that the FliTrx vector can be modified to provide, similar to the illustrated vectors of the attached figures, a vector which is differentially spliced in mammalian cells to yield a secreted, soluble CSP.
iii) Bacterial Spores as Display Packages
Bacterial spores also have desirable properties as display package candidates in the subject method. For example, spores are much more resistant than vegetative bacterial cells or phage to chemical and physical agents, and hence permit the use of a great variety of affinity selection conditions. Also, Bacillus spores neither actively metabolize nor alter the proteins on their surface. However, spores have the disadvantage that the molecular mechanisms that trigger sporulation are less well worked out than is the formation of Ml 3 or the export of protein to the outer membrane of E. coli, though such a limitation is not a serious defractant from their use in the present invention.
Bacteria of the genus Bacillus form endospores that are extremely resistant to damage by heat, radiation, desiccation, and toxic chemicals (reviewed by Losick et al. (1986) Ann Rev Genet 20:625-669). This phenomenon is attributed to extensive intermolecular cross-linking of the coat proteins. In certain embodiments of the subject method, such as those which include relatively harsh affinity separation steps, Bacillus spores can be the preferred display package. Endospores from the genus Bacillus are more stable than are, for example, exospores from Streptomyces. Moreover, Bacillus subtilis forms spores in 4 to 6 hours, whereas Streptomyces species may require days or weeks to sporulate. In addition, genetic knowledge and manipulation is much more developed for B. subtilis than for other spore-forming bacteria.
Viable spores that differ only slightly from wild-type are produced in B. subtilis even if any one of four coat proteins is missing (Donovan et al. (1987) JMol Biol 196:1-10). Moreover, plasmid DNA is commonly included in spores, and plasmid encoded proteins have been observed on the surface of Bacillus spores (Debro et al. (1986) J Bacteriol 165:258-268). Thus, it can be possible during sporulation to express a gene encoding a chimeric coat protein comprising a peptide of the variegated gene library, without interfering materially with spore formation. To illustrate, several polypeptide components of B. subtilis spore coat
(Donovan et al. (1987) JMol Biol 196:1-10) have been characterized. The sequences of two complete coat proteins and amino-terminal fragments of two others have been determined. Fusion of the test peptide or CSP sequence to cotC or cotD fragments is likely to cause the peptide to appear on the spore surface. The genes of each of these spore coat proteins are preferred as neither cotC or cotD are post- translationally modified (see Ladner et al. U.S. Patent No. 5,223,409).
iv) Selecting Peptides from the Display Mode
Upon expression, the variegated test peptide or CSP display library is subjected to affinity enrichment in order to select for test peptides or CSPs which bind preselected targets. The term "affinity separation" or "affinity enrichment" includes, but is not limited to: (1) affinity chromatography utilizing immobilized targets, (2) immunoprecipitation using soluble targets, (3) fluorescence activated cell sorting, (4) agglutination, and (5) plaque lifts. In each embodiment, the library of display packages are ultimately separated based on the ability of the associated test peptide or CSP to bind the target of interest. See, for example, the Ladner et al. U.S. Patent No. 5,223,409; the Kang et al. International Publication No. WO 92/18619; the Dower et al. International Publication No. WO 91/17271; the Winter et al. International Publication WO 92/20791; the Markland et al. International Publication No. WO 92/15679; the Breitling et al. International Publication WO 93/01288; the McCafferty et al. International Publication No. WO 92/01047; the Garrard et al. International Publication No. WO 92/09690; and the Ladner et al. International Publication No. WO 90/02809. In most preferred embodiments, the display library will be pre-enriched for test peptides or CSPs specific for the target by first contacting the display library with any negative controls or other targets for which differential binding by the test peptide or CSP is desired. Subsequently, the non-binding fraction from that pre-treatment step is contacted with the target and members from the display which are able to specifically bind the target are isolated.
With respect to affinity chromatography, it will be generally understood by those skilled in the art that a great number of chromatography techniques can be adapted for use in the present invention, ranging from column chromatography to batch elution, and including ELISA and biopanning techniques. Typically, where the target is a component of a cell, rather than a whole cell, the target is immobilized on an insoluble carrier, such as sepharose or polyacrylamide beads, or, alternatively, the wells of a microtitre plate. As described below, in instances where no purified source of the target is readily available, such as the case with many cell surface receptors, the cells on which the target is displayed may serve as the insoluble matrix carrier.
The population of display packages is applied to the affinity matrix under conditions compatible with the binding of the test peptide or CSP to a target. The population is then fractionated by washing with a solute that does not greatly effect specific binding of test peptides or CSPs to the target, but which substantially disrupts any non-specific binding of the display package to the target or matrix. A certain degree of control can be exerted over the binding characteristics of the peptides recovered from the display library by adjusting the conditions of the binding incubation and subsequent washing. The temperature, pH, ionic strength, divalent cation concentration, and the volume and duration of the washing can select for peptides within a particular range of affinity and specificity. Selection based on slow dissociation rate, which is usually predictive of high affinity, is a very practical route. This may be done either by continued incubation in the presence of a saturating amount of free hapten (if available), or by increasing the volume, number, and length of the washes. In each case, the rebinding of dissociated display packages is prevented, and with increasing time, test peptides or CSPs of higher and higher affinity are recovered. Moreover, additional modifications of the binding and washing procedures may be applied to find peptides with special characteristics. The affinities of some peptides are dependent on ionic strength or cation concentration. This is a useful characteristic for peptides to be used in affinity purification of various proteins when gentle conditions for removing the protein from the peptide are required. Specific examples are peptides which depend on Ca++ for binding activity and which lose or gain binding affinity in the presence of EGTA or other metal chelating agent. Such peptides may be identified in the recombinant peptide library by a double screening technique isolating first those that bind the target in the presence of Ca++, and by subsequently identifying those in this group that fail to bind in the presence of EGTA.
After "washing" to remove non-specifically bound display packages, when desired, specifically bound display packages can be eluted by either specific desorption (using excess target) or non-specific desorption (using pH, polarity reducing agents, or chaotropic agents). In preferred embodiments, the elution protocol does not kill the organism used as the display package such that the enriched population of display packages can be further amplified by reproduction. The list of potential eluants includes salts (such as those in which one of the counter ions is Na+, NH , Rb+, SO4 2\ H2PO4-, citrate, K+, Li+, Cs+, HSO4-, CO3 2", Ca2+, Sr2+, Cl", PO4 2", HCO3", Mg2+, Ba2+, Br", HPO4 2", or acetate), acid, heat, and, when available, soluble forms of the target (or analogs thereof). Because bacteria continue to metabolize during the affinity separation step and are generally more susceptible to damage by harsh conditions, the choice of buffer components (especially eluates) can be more restricted when the display package is a bacteria rather than for phage or spores. Neutral solutes, such as ethanol, acetone, ether, or urea, are examples of other agents useful for eluting the bound display packages.
In preferred embodiments, affinity enriched display packages are iteratively amplified and subjected to further rounds of affinity separation until enrichment of the desired binding activity is detected. In certain embodiments, the specifically bound display packages, especially bacterial cells, need not be eluted per se, but rather, the matrix bound display packages can be used directly to inoculate a suitable growth media for amplification. Where the display package is a phage particle, the fusion protein generated with the coat protein can interfere substantially with the subsequent amplification of eluted phage particles, particularly in embodiments wherein the cpIII protein is used as the display anchor. Even though present in only one of the 5-6 tail fibers, some constructs because of their size and/or sequence, may cause severe defects in the infectivity of their carrier phage. This causes a loss of phage from the population during reinfection and amplification following each cycle of panning. In one embodiment, the peptide can be derived on the surface of the display package so as to be susceptible to proteolytic cleavage which severs the covalent linkage of at least the target binding sites of the displayed test peptide or CSP from the remaining package. For instance, where the cpIII coat protein of Ml 3 is employed, such a strategy can be used to obtain infectious phage by treatment with an enzyme which cleaves between the test peptide or CSP portion and cpIII portion of a tail fiber fusion protein (e.g. such as the use of an enterokinase cleavage recognition sequence).
To further minimize problems associated with defective infectivity, DNA prepared from the eluted phage can be transformed into host cells by electroporation or well known chemical means. The cells are cultivated for a period of time sufficient for marker expression, and selection is applied as typically done for DNA transformation. The colonies are amplified, and phage harvested for a subsequent round(s) of panning.
After isolation of display packages which encode test peptides or CSPs having a desired binding specificity for the target, the CSPs for each of the purified display packages can be tested for biological activity in the secretion mode of the subject method.
C. Secretion Mode
In the "secretion mode," the combinatorial peptide library, which has been enriched in the display mode, is transfected into and expressed by eukaryotic cells. In this mode, the test CSPs are secreted by the host cells and screened for biological activity. In preferred embodiments, and illustrated in the drawings, the subject vectors are constructed to include eukaryotic splice sites such that, in the mature mRNA, elements required for the display mode in prokaryotic cells are spliced out - at least those elements which would interfere with the secretion mode. A variety of naturally and non-naturally occurring splice sites are available in the art and can be selected for, e.g., optimization in particular eukaryotic cells selected.
In preferred embodiments, the vectors of the subject invention are used to transfect a cell that can be co-cultured with a target cell. A biologically active protein secreted by the cells expressing the combinatorial library will diffuse to neighboring target cells and induce a particular biological response, such as to illustrate, proliferation or differentiation, or activation of a signal transduction pathway which is directly detected by other phenotypic criteria. The pattern of detection of biological activity will resemble a gradient function, and will allow the isolation (generally after several repetitive rounds of selection) of cells producing CSPs having certain activity in the assay. Likewise, antagonists of a given factor can be selected in similar fashion by the ability of the cell producing a functional antagonist to protect neighboring cells from the effect of exogenous factor added to the culture media.
To further illustrate, target cells are cultured in 24-well microtitre plates. Other cells are transfected with the combinatorial CSP library, recovered after the display mode step, and cultured in cell culture inserts (e.g. Collaborative Biomedical Products, Catalog #40446) that are able to fit into the wells of the microtitre plate. The cell culture inserts are placed in the wells such that recombinant test CSPs secreted by the cells in the insert can diffuse through the porous bottom of the insert and contact the target cells in the microtitre plate wells. After a period of time sufficient for a secreted test CSP to produce a measurable response in the target cells, the inserts are removed and the effect of the peptides on the target cells determined. For example, where the target cell is a neural crest cell and the activity desired from the test CSPs is the induction of neuronal differentiation, then fluorescently-labeled antibodies specific for Islet- 1 or other neuronal markers can be used to score for induction in the target cells as indicative of a functional neurotrophic peptide in that well. Cells from the inserts corresponding to wells which score positive for activity can be split and re-cultured on several inserts, the process being repeated until the active CSP is identified.
When screening for bioactivity of test CSPs, intracellular second messenger generation can be measured directly. For instance, a variety of intracellular effectors have been identified as being receptor- or ion channel-regulated, including adenylyl cyclase, cyclic GMP, phosphodiesterases, phosphoinositidases, phosphoinositol kinases, and phospholipases, as well as a variety of ions.
In one embodiment, the GTPase enzymatic activity by G proteins can be measured in plasma membrane preparations by determining the breakdown of γ32P GTP using techniques that are lαiown in the art (For example, see Signal Transduction: A Practical Approach. G. Milligan, Ed. Oxford University Press, Oxford England). When receptors that modulate cAMP are tested, it will be possible to use standard techniques for cAMP detection, such as competitive assays which quantitate [^HJcAMP in the presence of unlabelled cAMP. Certain receptors and ion channels stimulate the activity of phospholipase C which stimulates the breakdown of phosphatidylinositol 4,5, bisphosphate to 1,4,5- IP3 (which mobilizes intracellular Ca++) and diacylglycerol (DAG) (which activates protein kinase C). Inositol lipids can be extracted and analyzed using standard lipid extraction techniques. DAG can also be measured using thin-layer chromatography. Water soluble derivatives of all three inositol lipids (IPl, IP2, IP3) can also be quantitated using radiolabelling techniques or HPLC.
The other product of PIP2 breakdown, DAG can also be produced from phosphatidyl choline. The breakdown of this phospholipid in response to receptor- mediated signaling can also be measured using a variety of radiolabelling techniques.
The activation of phospholipase A2 can easily be quantitated using known techniques, including, for example, the generation of arachadonate in the cell.
In various cells, e.g., mammalian cells, specific proteases are induced or activated in each of several arms of divergent signaling pathways. These may be independently monitored by following their unique activities with substrates specific for each protease.
In the case of certain receptors and ion channels, it may be desirable to screen for changes in cellular phosphorylation. Such assay formats may be useful when, for example, the assay is designed to detect an agonist or antagonist of a receptor kinase or phosphatase. For example, immunoblotting (Lyons and Nelson (1984) Proc. Natl. Acad. Sci. USA 81:7426-7430) using anti-phosphotyrosine, anti- phosphoserine or anti-phosphothreonine antibodies. In addition, tests for phosphorylation could be also useful when the receptor itself may not be a kinase, but activates protein kinases or phosphatase that function downstream in the signal transduction pathway.
One such cascade is the MAP kinase pathway that appears to mediate both mitogenic, differentiation and stress responses in different cell types. Stimulation of growth factor receptors results in Ras activation followed by the sequential activation of c-Raf, MEK, and p44 and p42 MAP kinases (ERK1 and ERK2). Activated MAP kinase then phosphorylates many key regulatory proteins, including p90RSK and Elk-1 that are phosphorylated when MAP kinase translocates to the nucleus. Homologous pathways exist in mammalian and yeast cells. For instance, an essential part of the S. cerevisiae pheromone signaling pathway is comprised of a protein kinase cascade composed of the products of the STEl l, STE7, and FUS3/KSS1 genes (the latter pair are distinct and functionally redundant). Accordingly, phosphorylation and/or activation of members of this kinase cascade can be detected and used to quantitate receptor engagement. Phosphotyrosine specific antibodies are available to measure increases in tyrosine phosphorylation and phospho-specific antibodies are commercially available (New England Biolabs, Beverly, MA).
In yet another embodiment, the signal transduction pathway of interest may upregulate expression or otherwise activate an enzyme which is capable of modifying a substrate which can be added to the cell. The signal can be detected by using a detectable substrate, in which case lose of the substrate signal is monitored, or alternatively, by using a substrate which produces a detectable product. In preferred embodiments, the conversion of the substrate to product by the activated enzyme produces a detectable change in optical characteristics of the test cell, e.g., the substrate and/or product is cl romogenically or fluorogenically active. In an illustrative embodiment the signal transduction pathway causes a change in the activity of a proteolytic enzyme, altering the rate at which it cleaves a substrate peptide (or simply activates the enzyme towards the substrate). The peptide includes a fluorogenic donor radical, e.g., a fluorescence emitting radical, and an acceptor radical, e.g., an aromatic radical which absorbs the fluorescence energy of the fluorogenic donor radical when the acceptor radical and the fluorogenic donor radical are covalently held in close proximity. See, for example, USSN 5,527,681, 5,506,115, 5,429,766, 5,424,186, and 5,316,691; and Capobianco et al. (1992) Anal Biochem 204:96-102. For example, the substrate peptide has a fluorescence donor group such as 1-aminobenzoic acid (anthranilic acid or ABZ) or aminomethylcoumarin (AMC) located at one position on the peptide and a fluorescence quencher group, such as lucifer yellow, methyl red or nitrobenzo-2- oxo-l,3-diazole (NBD), at a different position near the distal end of the peptide. A cleavage site for the activated enzyme will be disposed between each of the sites for the donor and acceptor groups. The intramolecular resonance energy transfer from the fluorescence donor molecule to the quencher will quench the fluorescence of the donor molecule when the two are sufficiently proximate in space, e.g., when the peptide is intact. Upon cleavage of the peptide, however, the quencher is separated from the donor group, leaving behind a fluorescent fragment. Thus, activation of the enzyme results in cleavage of the detection peptide, and dequenching of the fluorescent group. In still other embodiments, the detectable signal can be produced by use of enzymes or chromogenic/fluorescent probes whose activities are dependent on the concentration of a second messenger, e.g., such as calcium, hydrolysis products of inositol phosphate, cAMP, etc. For example , the mobilization of intracellular calcium or the influx of calcium from outside the cell can be measured using standard techniques. The choice of the appropriate calcium indicator, fluorescent, bioluminescent, metallochromic, or Ca^-sensitive microelectrodes depends on the cell type and the magnitude and time constant of the event under study (Borle (1990) Environ Health Perspect 84:45-56). As an exemplary method of Ca"1"1' detection, cells could be loaded with the Ca"1"1" sensitive fluorescent dye fura-2 or indo-1, using standard methods, and any change in Ca"1"1" measured using a fluorometer.
As certain embodiments described above suggest, in addition to directly measuring second messenger production, the signal transduction activity of a receptor or ion channel pathway can be measured by detection of a transcription product, e.g., by detecting receptor/channel-mediated transcriptional activation (or repression) of a gene(s). Detection of the transcription product includes detecting the gene transcript, detecting the product directly (e.g., by immunoassay) or detecting an activity of the protein (e.g., such as an enzymatic activity or chromogenic/fluorogenic activity); each of which is generally referred to herein as a means for detecting expression of the indicator gene. The indicator gene may be an unmodified endogenous gene of the host cell, a modified endogenous gene, or a part of a completely heterologous construct, e.g., as part of a reporter gene construct. In one embodiment, the indicator gene is an unmodified endogenous gene.
For example, the instant method can rely on detecting the transcriptional level of such endogenous genes as the c-fos gene (e.g., in mammalian cells) or the Barl or Fusl genes (e.g., in yeast cells) in response to such signal transduction pathways as originating from G protein coupled receptors. In certain instances, it may be desirable to increase the level of transcriptional activation of the endogenous indicator gene by the signal pathway in order to, for example, improve the signal-to-noise of the test system, or to adjust the level of response to a level suitable for a particular detection teclmique. In one embodiment, the transcriptional activation ability of the signal pathway can be amplified by the overexpression of one or more of the proteins involved in the intracellular signal cascade, particularly enzymes involved in the pathway. For example, increased expression of Jun kinases (JNKs) can potentiate the level of transcriptional activation by a signal in an MEKK/JNKK pathway. Likewise, overexpression of one or more signal transduction proteins in the yeast pheromone pathway can increase the level of Fusl and/or Barl expression. This approach can, of course, also be used to potentiate the level of transcription of a heterologous reporter gene as well.
In other embodiments, the sensitivity of an endogenous indicator gene can be enhanced by manipulating the promoter sequence at the natural locus for the indicator gene. Such manipulation may range from point mutations to the endogenous regulatory elements to gross replacement of all or substantial portions of the regulatory elements. In general, manipulation of the genomic sequence for the indicator gene can be carried out using techniques known in the art, including homologous recombination. In another exemplary embodiment, the promoter (or other transcriptional regulatory sequences) of the endogenous gene can be "switched out" with a heterologous promoter sequence, e.g., to form a chimeric gene at the indicator gene locus. Again, using such techniques as homologous recombination, the regulatory sequence can be so altered at the genomic locus of the indicator gene. In still another embodiment, a heterologous reporter gene construct can be used to provide the function of an indicator gene. Reporter gene constructs are prepared by operatively linking a reporter gene with at least one transcriptional regulatory element. If only one transcriptional regulatory element is included it must be a regulatable promoter. At least one the selected transcriptional regulatory elements must be indirectly or directly regulated by the activity of the selected cell- surface receptor whereby activity of the receptor can be monitored via transcription of the reporter genes.
Many reporter genes and transcriptional regulatory elements are known to those of skill in the art and others may be identified or synthesized by methods known to those of skill in the art.
Examples of reporter genes include, but are not limited to CAT (chloramphenicol acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864- 869) luciferase, and other enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. (1987), Mol. Cell. Biol. 7:725-737); bacterial luciferase (Engebrecht and Silverman (1984), PNAS 1: 4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); alkaline phosphatase (Toh et al. (1989) Eur. J. Biochem. 182: 231-238, Hall et al. (1983) J. Mol. Appl. Gen. 2: 101), human placental secreted alkaline phosphatase (Cullen and Malim (1992) Methods in Enzymol. 216:362-368); β-lactamase or GST.
Transcriptional control elements for use in the reporter gene constructs, or for modifying the genomic locus of an indicator gene include, but are not limited to, promoters, enhancers, and repressor and activator binding sites. Suitable transcriptional regulatory elements may be derived from the transcriptional regulatory regions of genes whose expression is rapidly induced, generally within minutes, of contact between the cell surface protein and the effector protein that modulates the activity of the cell surface protein. Examples of such genes include, but are not limited to, the immediate early genes (see, Sheng et al. (1990) Neuron 4: 477-485), such as c-fos. Immediate early genes are genes that are rapidly induced upon binding of a ligand to a cell surface protein. The transcriptional control elements that are preferred for use in the gene constructs include transcriptional control elements from immediate early genes, elements derived from other genes that exhibit some or all of the characteristics of the immediate early genes, or synthetic elements that are constructed such that genes in operative linkage therewith exhibit such characteristics. The characteristics of preferred genes from which the transcriptional control elements are derived include, but are not limited to, low or undetectable expression in quiescent cells, rapid induction at the transcriptional level within minutes of extracellular simulation, induction that is transient and independent of new protein synthesis, subsequent shut-off of transcription requires new protein synthesis, and mRNAs transcribed from these genes have a short half-life. It is not necessary for all of these properties to be present.
Other promoters and transcriptional control elements, in addition to those described above, include the vasoactive intestinal peptide (VIP) gene promoter (cAMP responsive; Fink et al. (1988), Proc. Natl. Acad. Sci. 85:6662-6666); the somatostatin gene promoter (cAMP responsive; Montminy et al. (1986), Proc. Natl. Acad. Sci. 8.3:6682-6686); the proenlcephalin promoter (responsive to cAMP, nicotinic agonists, and phorbol esters; Comb et al. (1986), Nature 323:353-356); the phosphoenolpyruvate carboxy-kinase gene promoter (cAMP responsive; Short et al. (1986), J. Biol. Chem. 261:9721-9726); the NGFI-A gene promoter (responsive to NGF, cAMP, and serum; Changelian et al. (1989). Proc. Natl. Acad. Sci. 86:377- 381); and others that may be lαiown to or prepared by those of skill in the art.
In the case of receptors which modulate cyclic AMP, a transcriptional based readout can be constructed using the cyclic AMP response element binding protein, CREB, which is a transcription factor whose activity is regulated by phosphorylation at a particular serine (SI 33). When this serine residue is phosphorylated, CREB binds to a recognition sequence known as a CRE (cAMP Responsive Element) found to the 5' of promoters known to be responsive to elevated cAMP levels. Upon binding of phosphorylated CREB to a CRE, transcription from this promoter is increased.
Phosphorylation of CREB is seen in response to both increased cAMP levels and increased intracellular Ca levels. Increased cAMP levels result in activation of PKA, which in turn phosphorylates CREB and leads to binding to CRE and transcriptional activation. Increased intracellular calcium levels results in activation of calcium / calmodulin responsive kinase II (CaM kinase II). Phosphorylation of CREB by CaM kinase II is effectively the same as phosphorylation of CREB by PKA, and results in transcriptional activation of CRE containing promoters.
Therefore, a transcriptionally-based readout can be constructed in cells containing a reporter gene whose expression is driven by a basal promoter containing one or more CRE. Changes in the intracellular concentration of Ca++ (a result of alterations in the activity of the receptor upon engagement with a ligand) will result in changes in the level of expression of the reporter gene if: a) CREB is also co-expressed in the cell, and b) either an endogenous or heterologous CaM kinase phosphorylates CREB in response to increases in calcium or if an exogenously expressed CaM kinase II is present in the same cell. In other words, stimulation of PLC activity may result in phosphorylation of CREB and increased transcription from the CRE-construct, while inhibition of PLC activity may result in decreased transcription from the CRE-responsive construct. As described in Bonni et al. (1993) Science 262:1575-1579, the observation that CNTF treatment of SK-N-MC cells leads to the enhanced interaction of STAT/p91 and STAT related proteins with specific DNA sequences suggested that these proteins might be key regulators of changes in gene expression that are triggered by CNTF. Consistent with this possibility is the finding that DNA sequence elements similar to the consensus DNA sequence required for STAT/p91 binding are present upstream of a number of genes previously found to be induced by CNTF (e.g., Human c-fos, Mouse c-fos, Mouse tisl l, Rat junB, Rat SOD-1, and CNTF). Those authors demonstrated the ability of STAT/p91 binding sites to confer CNTF responsiveness to a non-responsive reporter gene. Accordingly, a reporter construct for use in the present invention for detecting signal transduction through STAT proteins, such as from cytokine receptors, can be generated by using -71 to +109 of the mouse c-fos gene fused to the bacterial chloramphenicol acetyltransferase gene (-71fosCAT) or other detectable marker gene. Induction by a cytokine receptor induces the tyrosine phosphorylation of STAT and STAT-related proteins, with subsequent translocation and binding of these proteins to the STAT-RE. This then leads to activation of transcription of genes containing this DNA element within their promoters.
In preferred embodiments, the reporter gene is a gene whose expression causes a phenotypic change which is screenable or selectable. If the change is selectable, the phenotypic change creates a difference in the growth or survival rate between cells which express the reporter gene and those which do not. If the change is screenable, the phenotype change creates a difference in some detectable characteristic of the cells, by which the cells which express the marker may be distinguished from those which do not. Selection is preferable to screening in that it can provide a means for amplifying from the cell culture those cells which express a test polypeptide which is a receptor effector.
The marker gene is coupled to the receptor signaling pathway so that expression of the marker gene is dependent on activation of the receptor. This coupling may be achieved by operably linking the marker gene to a receptor-responsive promoter. The term "receptor-responsive promoter" indicates a promoter which is regulated by some product of the target receptor's signal transduction pathway. Alternatively, the promoter may be one which is repressed by the receptor pathway, thereby preventing expression of a product which is deleterious to the cell. With a receptor repressed promoter, one screens for agonists by linking the promoter to a deleterious gene, and for antagonists, by linking it to a beneficial gene. Repression may be achieved by operably linking a receptor- induced promoter to a gene encoding mRNA which is antisense to at least a portion of the mRNA encoded by the marker gene (whether in the coding or flanking regions), so as to inhibit translation of that mRNA. Repression may also be obtained by linking a receptor-induced promoter to a gene encoding a DNA binding repressor protein, and incorporating a suitable operator site into the promoter or other suitable region of the marker gene.
The marker gene may also be a screenable gene. The screened characteristic may be a change in cell morphology, metabolism or other screenable features. Suitable markers include β-galactosidase (Xgal, C12FDG, Salmon-gal, Magenta-Gal (latter two from Biosynth Ag)), alkaline phosphatase, horseradish peroxidase, exo-glucanase (product of yeast exbl gene; nonessential, secreted); luciferase; bacterial green fluorescent protein; (human placental) secreted alkaline phosphatase (SEAP); and chloramphenicol transferase (CAT). Some of the above can be engineered so that they are secreted (although not β-galactosidase). A preferred screenable marker gene is beta-galactosidase; yeast cells expressing the enzyme convert the colorless substrate Xgal into a blue pigment. Again, the promoter may be receptor-induced or receptor-inhibited.
In certain assays it may be desirable to use changes in growth in the screening procedure. For example, one of the consequences of activation of the pheromone signal pathway in wild-type yeast is growth arrest. If one is testing for an antagonist of a G protein-coupled receptor, such as a human receptor engineered into a yeast cell, this normal response of growth arrest can be used to select cells in which the pheromone response pathway is inhibited. That is, cells exposed to a test compound will be growth arrested if the compound is an agonist, but will grow normally if the compound is neutral or an antagonist. Thus, the growth arrest response can be used to advantage to discover compounds that function as agonists or antagonists. Moreover, the effect of growth arrest can provide a selective advantage in the presence of an agent which is cytotoxic to mitotic cells. For example, during the growth arrest window, the cytotoxic agent is added to the culture. Cells which proceed tlirough the cell-cycle, e.g., which are not growth arrested, will be killed. At some time after the addition of the cytotoxic agent, it can be washed from the culture, and surviving cells permitted to proceed with proliferation. Cells which were arrested by the test compound will be enriched in the surviving population.
However, in certain embodiments the growth arrest consequent to activation of the pheromone response pathway is an undesirable effect since cells that bind agonists stop growing while surrounding cells that fail to bind peptides will continue to grow. The cells of interest, then, will be overgrown or their detection obscured by the background cells, confounding identification of agonistic peptides. To overcome this problem the present invention teaches engineering the cell such that: 1) growth arrest does not occur as a result of exogenous signal pathway activation (e.g., by inactivating the FAR1 gene); and/or 2) a selective growth advantage is conferred by activating the pathway (e.g., by transforming an auxotrophic mutant with a HIS3 gene under the control of a pheromone-responsive promoter, and applying selective conditions). It is, of course, desirable that the exogenous receptor be exposed on a continuing basis to the peptides. Unfortunately, this is likely to result in desensitization of the pheromone pathway to the stimulus. For example, the mating signal transduction pathway is lαiown to become desensitized by several mechanisms including pheromone degradation and modification of the function of the receptor, G-proteins and/or downstream elements of the pheromone signal transduction by the products of the SST2, STE50, AFR1 (Konopka, J.B. (1993) Mol. Cell. Biol. 13:6876-6888) and SGV1, MSG5, and SIG1 genes. Selected mutations in these genes can lead to hypersensitivity to pheromone and an inability to adapt to the presence of pheromone. For example, introduction of mutations that interfere with function into strains expressing heterologous G protein-coupled receptors constitutes a significant improvement on wild type strains and enables the development of extremely sensitive bioassays for compounds that interact with the receptors. Other mutations e.g. STE50, sgvl,bαri, ste2,ste3,pikl,msg5, sigl, and aftl, have the similar effect of increasing the sensitivity of the bioassay. Thus desensitization may be avoided by mutating (which may include deleting) the SST2 gene so that it no longer produces a functional protein, or by mutating one of the other genes listed above.
If the endogenous homolog of the receptor is produced by the yeast cell, the assay will not be able to distinguish between peptides which interact with the endogenous receptor and those which interact with the exogenous receptor. It is therefore desirable that the endogenous gene be deleted or otherwise rendered nonfunctional.
Suitable host cells for generating the target cells of subject assay include prokaryotes, yeast, or higher eukaryotic cells, including plant and animal cells, especially mammalian cells. Prokaryotes include gram negative or gram positive organisms. Examples of suitable mammalian host cell lines include the COS-7 line of monkey kidney cells (ATCC CRL 1651) (Gluzman (1981) Cell 23:175) CV-1 cells (ATCC CCL 70), L cells, C127, 3T3, Chinese hamster ovary (CHO), HeLa, HEK-293, SWISS 3T3, and BHK cell lines.
If yeast cells are used, the yeast may be of any species which are cultivable and in which an exogenous receptor can be made to engage the appropriate signal transduction machinery of the host cell. Suitable species include Kluyverei lactis, Schizosaccharomyces pombe, and Ustilaqo maydis; Saccharomyces cerevisiae is preferred. Other yeast which can be used in practicing the present invention are Neurospora crassa, Aspergillus niger, Aspergillus nidulans, Pichia pastoris, Candida tropicalis, and Hansenula polymorpha. The term "yeast", as used herein, includes not only yeast in a strictly taxonomic sense, i.e., unicellular organisms, but also yeast-like multicellular fungi or filamentous fungi.
The choice of appropriate host cell will also be influenced by the choice of detection signal. For instance, reporter constructs, as described below, can provide a selectable or screenable trait upon transcriptional activation (or inactivation) in response to a signal transduction pathway coupled to the target receptor. The reporter gene may be an unmodified gene already in the host cell pathway. It may be a host cell gene that has been operably linked to a "receptor-responsive" promoter. Alternatively, it may be a heterologous gene (e.g., a "reporter gene construct") that has been so linked. Suitable genes and promoters are discussed below. In other embodiments, second messenger generation can be measured directly in the detection step, such as mobilization of intracellular calcium or phospholipid metabolism are quantitated. In yet other embodiments indicator genes can be used to detect receptor-mediated signaling.
Accordingly, it will be understood that to achieve selection or screening, the host cell must have an appropriate phenotype. For example, generating a pheromone-responsive chimeric HIS3 gene in a yeast that has a wild-type HIS3 gene would frustrate genetic selection. Thus, to achieve nutritional selection, an auxotrophic strain is wanted.
A variety of complementations for use in the subject assay can be constructed. Indeed, many yeast genetic complementation with mammalian signal transduction proteins have been described in the art. For example, Mosteller et al. (1994) Mol Cell Biol 14:1104-12 demonstrates that human Ras proteins can complement loss of ras mutations in S. cerevisiae. Moreover, Toda et al. (1986) Princess Takamatsu Symp 17: 253-60 have shown that human ras proteins can complement the loss of RAS1 and RAS2 proteins in yeast, and hence are functionally homologous. Both human and yeast RAS proteins can stimulate the magnesium and guanine nucleotide-dependent adenylate cyclase activity present in yeast membranes. Ballester et al. (1989) Cell 59: 681-6 describe a vector to express the mammalian GAP protein in the yeast S. cerevisiae. When expressed in yeast, GAP inhibits the function of the human ras protein, and complements the loss of IRAl . IRAl is a yeast gene that encodes a protein with homology to GAP and acts upstream of RAS. Mammalian GAP can therefore function in yeast and interact with yeast RAS. Wei et al. (1994) Gene 151: 279-84 describes that a human Ras-specific guanine nucleotide-exchange factor, Cdc25GEF, can complement the loss of CDC25 function in S. cerevisiae. Martegani et al. (1992) EMBO J 11: 2151-7 describe the cloning by functional complementation of a mouse cDNA encoding a homolog of CDC25, a Saccharomyces cerevisiae RAS activator. Vojtek et al. (1993) J Cell Sci 105: 777-85 and Matviw et al. (1992) Mol Cell Biol 12: 5033-40 describe how a mouse CAP protein, e.g., an adenylyl cyclase associated protein associated with ras- mediated signal transduction, can complements defects in S. cerevisiae. Papasavvas et al. (1992) Biochem Biophys Res Commun 184:1378-85 also suggest that inactivated yeast adenyl cyclase can be complemented by a mammalian adenyl cyclase gene. Hughes et al. (1993) Nature 364: 349-52 describe the complementation of byrl in fission yeast by mammalian MAP kinase kinase (MEK). Parissenti et al. (1993) Mol Cell Endocrinol 98: 9-16 describes the reconstitution of bovine protein kinase C (PKC) in yeast. The Ca(2+)- and phospholipid-dependent Ser/Thr kinase PKC plays important roles in the transduction of cellular signals in mammalian cells. Marcus et al. (1995) PNAS 92: 6180-4 suggests the complementation of shkl null mutations in S. pombe by the either the structurally related S. cerevisiae Ste20 or mammalian p65PAK protein kinases.
IV. Exemplary Uses
Because of the flexibility of the system, the subject method can be used in a broad range of applications, including for the selection of CSPs having effects on proliferation, differentiation, cell death, cell migration, etc. In preferred embodiments, the target used in the display mode is an extracellular component of a cell. However, it will be appreciated that the target for subject method can be an intracellular component and, during the secretion mode, the system can be augmented with agents which promote the cellular uptake of the test CSPs.
In an illustrative embodiment, the subject method is utilized to identify CSPs which have antiproliferative activity with respect to one or more types of cells. For instance, in the display mode, the test peptide or CSP library can be panned with the target cells for which an antiproliferative is desired in order to enrich for peptides which bind to that cell. At that stage, the test peptide or CSP library can also be panned against one or more control cell lines in order to remove peptides which bind the control cells. In this manner, the CSP library which is then tested in the secretion mode can be enriched for CSPs which selectively bind target cell (relative to the control cells). Thus, for example, the display mode can produce a library enriched for test peptides or CSPs which preferentially bind tumor cells relative to normal cells, which preferentially bind p53- cells relative to p53+ cells, which preferentially bind hair follicle cells relative to other epithelial cells, or any other differential binding characteristic.
In the secretion mode, the peptides are tested for antiproliferative activity against the target cell using any of a number of techniques known in the art. For instance, BrdU or other nucleotide uptake can be measured as an indicator of proliferation. As above, the secretion mode can include negative controls in order to select for peptides with specific antiproliferative activity.
In similar fashion, peptides can be isolated from the library based on their ability to induce apoptosis or cell lysis, e.g., in a cell selective manner. In yet another embodiment, the subject method can be used to identify peptides with angiogenic or anti-angiogenic activity. For instance, the CSP library can be enriched for peptides that bind to endothelial cells but which do not bind to fibroblasts. The resulting sub-library can be screened for peptides which inhibit capillary endothelial cell proliferation and/or endothelial cell migration. Peptides scoring positive for one or both of these activities can also be tested for activity against other cell types, such as smooth muscle cells or fibroblasts, in order to select peptides active only against endothelial cells.
In still another embodiment, the subject method can be used to identify anti- infective peptides, e.g., which are active as anti-fungal or antibacterial agents. In one embodiment, the assay of the present invention can be used for identifying effectors of a receptor protein or complex thereof. In general, the assay is characterized by the use of a test cell which includes a target receptor or ion channel protein whose signal transduction activity can be modulated by interaction with an extracellular signal, the transduction activity being able to generate a detectable signal.
In general, such embodiments of the subject assay are characterized by the use of a mixture of cells expressing a target receptor protein or ion channel capable of transducing a detectable signal in the reagent cell. The receptor/channel protein can be either endogenous or heterologous. In combination with the disclosed detection means, a culture of the instant reagent cells will provide means for detecting agonists or antagonists of receptor function.
The ability of particular peptides to modulate a signal transduction activity of the target receptor or channel can be scored for by detecting up or down-regulation of the detection signal. For example, second messenger generation (e.g. GTPase activity, phospholipid hydrolysis, or protein phosphorylation patterns as examples) can be measured directly. Alternatively, the use of an indicator gene can provide a convenient readout. In other embodiments a detection means consists of an indicator gene. In any event, a statistically significant change in the detection signal can be used to facilitate identification of compounds which modulate receptor or ion channel activities.
By this method, peptides which induce a signal pathway from a particular receptor or channel can be identified. If a test CSP does not appear to induce the activity of the receptor/channel protein, the assay may be repeated as described above, and modified by the introduction of a step in which the reagent cell is first contacted with a known activator of the target receptor/channel to induce signal transduction, and the test CSP can be assayed for its ability to inhibit the activated receptor/channel, e.g., to identify antagonists. In yet other embodiments, peptides can be screened for those which potentiate the response to a lαiown activator of the receptor.
With respect to the receptor or ion channel, it may be endogenously expressed by the host cell, or it may be expressed from a heterologous gene that has been introduced into the cell. Methods for introducing heterologous DNA into eukaryotic cells are of course well known in the art and any such method may be used. In addition, DNA encoding various receptor proteins is known to those of skill in the art or it may be cloned by any method lαiown to those of skill in the art. In certain embodiments, such as when an exogenous receptor is expressed, it may be desirable to inactivate, such as by deletion, a homologous receptor present in the cell. In particular, the assays can be used to test functional ligand-receptor or ligand-ion channel interactions for cell surface-localized receptors and channels. As described in more detail below, the subject assay can be used to identify effectors of, for example, G protein-coupled receptors, receptor tyrosine kinases, cytokine receptors, and ion channels. In certain embodiments the method described herein is used for identifying ligands for "orphan receptors" for which no ligand is known. In preferred embodiments, the receptor is a cell surface receptor, such as: a receptor tyrosine kinase, e.g., an EPH receptor; an ion channel; a cytokine receptor; an multisubunit immune recognition receptor, a chemokine receptor; a growth factor receptor, or a G-protein coupled receptor, such as a chemoattractant peptide receptor, a neuropeptide receptor, a light receptor, a neuiOtransmitter receptor, or a polypeptide hormone receptor.
Preferred G protein coupled receptors include αlA-adrenergic receptor, lB-adrenergic receptor, α2-adrenergic receptor, α2B-adrenergic receptor, βl- adrenergic receptor, β2-adrenergic receptor, β3-adrenergic receptor, ml acetylcholine receptor (AChR), m2 AChR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine receptor, 5- HTla receptor, 5-HTlb receptor, 5HTl-like receptor, 5-HTld receptor, 5HTld-like receptor, 5HTld beta receptor, substance K (neurokinin A) receptor, fMLP receptor, fMLP-like receptor, angiotensin II type 1 receptor, endothelin ETA receptor, endothelin ETB receptor, thrombin receptor, growth hormone-releasing hormone (GHRH) receptor, vasoactive intestinal peptide receptor, oxytocin receptor, somatostatin SSTRl and SSTR2, SSTR3, cannabinoid receptor, follicle stimulating hormone (FSH) receptor, leutropin (LH/HCG) receptor, thyroid stimulating hormone (TSH) receptor, thromboxane A2 receptor, platelet-activating factor (PAF) receptor, C5a anaphylatoxin receptor, Interleukin 8 (IL-8) IL-8RA, IL-8RB, Delta Opioid receptor, Kappa Opioid receptor, mip-1/RANTES receptor, Rhodopsin, Red opsin, Green opsin, Blue opsin, metabotropic glutamate mGluRl-6, histamine H2 receptor, ATP receptor, neuropeptide Y receptor, amyloid protein precursor receptor, insulin-like growth factor II receptor, bradykinin receptor, gonadotropin- releasing hormone receptor, cholecystokinin receptor, melanocyte stimulating hormone receptor, antidiuretic hormone receptor, glucagon receptor, and adrenocorticotropic hormone II receptor.
Preferred EPH receptors include eph, elk, eek, sek, mek4, hek, helc2, eek, erk, tyrol, tyro4, tyro5, tyro6, tyrol 1, cek4, cek5, cek6, cek7, cek8, cek9, ceklO, bsk, rtkl , rtk2, rtk3, mykl , myk2, ehkl , ehk2, pagliaccio, htk, erk and nuk receptors.
A. Cytokine Receptors
In one embodiment the target receptor is a cytokine receptor. Cytokines are a family of soluble mediators of cell-to-cell communication that includes interleukins, interferons, and colony-stimulating factors. The characteristic features of cytokines lie in their functional redundancy and pleiotropy. Most of the cytokine receptors that constitute distinct superfamilies do not possess intrinsic protein tyrosine kinase domains, yet receptor stimulation usually invokes rapid tyrosine phosphorylation of intracellular proteins, including the receptors themselves. Many members of the cytokine receptor superfamily activate the Jak protein tyrosine kinase family, with resultant ' ~^sphorylation of the STAT transcriptional activator factors. IL-2, IL-7, IL-2 and Interferon γ have all been shown to activate Jak kinases (Frank et al (1995) Proc Natl Acad Sci USA 92:7779-7783); Scharfe et al. (1995) Blood 86:2077-2085); (Bacon et al. (1995) Proc Natl Acad Sci USA 92:7307-7311); and (Sakatsume et al (1995) J. Biol Chem 270:17528-17534). Events downstream of Jak phosphorylation have also been elucidated. For example, exposure of T lymphocytes to IL-2 has been shown to lead to the phosphorylation of signal transducers and activators of transcription (STAT) proteins STAT1 D, STAT2D, and STAT3, as well as of two STAT-related proteins, p94 and p95. The STAT proteins were found to translocate to the nucleus and to bind to a specific DNA sequence, thus suggesting a mechanism by which IL-2 may activate specific genes involved in immune cell function (Frank et al. supra). Jak3 is associated with the gamma chain of the IL-2, IL-4, and IL-7 cytokine receptors (Fujii et al. (1995) Proc Natl Acad Sci 92:5482-5486) and (Musso et al (1995) J Exp Med. 181:1425-1431). The Jak kinases have also been shown to be activated by numerous ligands that signal via cytokine receptors such as, growth hormone and erythropoietin and IL-6 (Kishimoto (1994) Stem cells Suppl 12:37-44). Detection means which may be scored for in the present assay, in addition to direct detection of second messengers, such as by changes in phosphorylation, includes reporter constructs or indicator genes which include transcriptional regulatory elements responsive to the STAT proteins. Described infra.
B. Multisubunit Immune Recognition Receptor (MIRR).
In another embodiment the receptor is a multisubunit receptor. Receptors can be comprised of multiple proteins referred to as subunits, one category of which is referred to as a multisubunit receptor is a multisubunit immune recognition receptor (MIRR). MIRRs include receptors having multiple noncovalently associated subunits and are capable of interacting with src-family tyrosine kinases. MIRRs can include, but are not limited to, B cell antigen receptors, T cell antigen receptors, Fc receptors and CD22. One example of an MIRR is an antigen receptor on the surface of a B cell. To further illustrate, the MIRR on the surface of a B cell comprises membrane-bound immunoglobulin (mlg) associated with the subunits Ig- and Ig-β or Ig-γ, which forms a complex capable of regulating B cell function when bound by antigen. An antigen receptor can be functionally linked to an amplifier molecule in a manner such that the amplifier molecule is capable of regulating gene transcription.
Src-family tyrosine kinases are enzymes capable of phosphorylating tyrosine residues of a target molecule. Typically, a src-family tyrosine kinase contains one or more binding domains and a kinase domain. A binding domain of a src-family tyrosine kinase is capable of binding to a target molecule and a kinase domain is capable of phosphorylating a target molecule bound to the kinase. Members of the src family of tyrosine kinases are characterized by an N-terminal unique region followed by three regions that contain different degrees of homology among all the members of the family. These three regions are referred to as src homology region 1 (SHI), src homology region 2 (SH2) and src homology region 3 (SH3). Both the SH2 and SH3 domains are believed to have protein association functions important for the formation of signal transduction complexes. The amino acid sequence of an N-terminal unique region, varies between each src-family tyrosine kinase. An N- terminal unique region can be at least about the first 40 amino acid residues of the N-terminal of a src-family tyrosine kinase. Syk-family ldnases are enzymes capable of phosphorylating tyrosine residues of a target molecule. Typically, a syk-family kinase contains one or more binding domains and a kinase domain. A binding domain of a syk-family tyrosine kinase is capable of binding to a target molecule and a kinase domain is capable of phosphorylating a target molecule bound to the kinase. Members of the syk- family of tyrosine kinases are characterized by two SH2 domains for protein association function and a tyrosine kinase domain.
A primary target molecule is capable of further extending a signal transduction pathway by modifying a second messenger molecule. Primary target molecules can include, but are not limited to, phosphatidylinositol 3 -kinase (PI-3K), p21rasGAP-activating protein and associated pi 90 and p62 protein, phospholipases such as PLCγl and PLCγ2, MAP kinase, She and VAV. A primary target molecule is capable of producing second messenger molecule which is capable of further amplifying a transduced signal. Second messenger molecules include, but are not limited to diacylglycerol and inositol 1,4,5-triphosphate (IP3). Second messenger molecules are capable of initiating physiological events which can lead to alterations in gene transcription. For example, production of IP3 can result in release of intracellular calcium, which can then lead to activation of calmodulin kinase II, which can then lead to serine phosphorylation of a DNA binding protein referred to as ets-1 proto-onco-protein. Diacylglycerol is capable of activating the signal transduction protein, protein kinase C which affects the activity of the API DNA binding protein complex. Signal transduction pathways can lead to transcriptional activation of genes such as c-fos, egr-1, and c-myc.
She can be thought of as an adapter molecule. An adapter molecule comprises a protein that enables two other proteins to form a complex (e.g., a three molecule complex). She protein enables a complex to form which includes Grb2 and
SOS. She comprises an SH2 domain that is capable of associating with the SH2 domain of Grb2.
Molecules of a signal transduction pathway can associate with one another using recognition sequences. Recognition sequences enable specific binding between two molecules. Recognition sequences can vary depending upon the structure of the molecules that are associating with one another. A molecule can have one or more recognition sequences, and as such can associate with one or more different molecules.
Signal transduction pathways for MIRR complexes are capable of regulating the biological functions of a cell. Such functions can include, but are not limited to the ability of a cell to grow, to differentiate and to secrete cellular products. MIRR- induced signal transduction pathways can regulate the biological functions of specific types of cells involved in particular responses by an animal, such as immune responses, inflammatory responses and allergic responses. Cells involved in an immune response can include, for example, B cells, T cells, macrophages, dendritic cells, natural killer cells and plasma cells. Cells involved in inflammatory responses can include, for example, basophils, mast cells, eosinophils, neutrophils and macrophages. Cells involved in allergic responses can include, for example mast cells, basophils, B cells, T cells and macrophages. In exemplary embodiments of the subject assay, the detection signal is a second messengers, such as a phosphorylated src-like protein, includes reporter constructs or indicator genes which include transcriptional regulatory elements such as serum response element (SRE), 12-O-tetradecanoyl-phorbol- 13 -acetate response element, cyclic AMP response element, c- fos promoter, or a CREB-responsive element.
C. Receptor tyrosine kinases.
In still another embodiment, the target receptor is a receptor tyrosine kinase. The receptor tyrosine kinases can be divided into five subgroups on the basis of structural similarities in their extracellular domains and the organization of the tyrosine kinase catalytic region in their cytoplasmic domains. Sub-groups I (epidermal growth factor (EGF) receptor-like), II (insulin receptor-like) and the eph/eck family contain cysteine-rich sequences (Hirai et al., (1987) Science 238:1717-1720 and Lindberg and Hunter, (1990) Mol. Cell. Biol. 10:6316-6324). The functional domains of the kinase region of these three classes of receptor tyrosine kinases are encoded as a contiguous sequence ( Hanks et al. (1988) Science 241 :42-52). Subgroups III (platelet-derived growth factor (PDGF) receptor-like) and IV (the fibro-blast growth factor (FGF) receptors) are characterized as having immunoglobulin (Ig)-like folds in their extracellular domains, as well as having their kinase domains divided in two parts by a variable stretch of unrelated amino acids (Yanden and Ullrich (1988) supra and Hanks et al. (1988) supra). The family with by far the largest number of lαiown members is the EPH family. Since the description of the prototype, the EPH receptor (Hirai et al. (1987) Science 238:1717-1720), sequences have been reported for at least ten members of this family, not counting apparently orthologous receptors found in more than one species. Additional partial sequences, and the rate at which new members are still being reported, suggest the family is even larger (Maisonpierre et al. (1993) Oncogene 8:3277-3288; Andres et al. (1994) Oncogene 9:1461-1467; Henkemeyer et al. (1994) Oncogene 9:1001-1014; Ruiz et al. (1994) Mech Dev 46:87-100; Xu et al. (1994) Development 120:287-299; Zhou et al. (1994) JNeurosci Res 37:129-143; and references in Tuzi and Gullick (1994) Br J Cancer 69:417-421). Remarkably, despite the large number of members in the EPH family, all of these molecules were identified as orphan receptors without known ligands.
The expression patterns determined for some of the EPH family receptors have implied important roles for these molecules in early vertebrate development. In particular, the timing and pattern of expression of sek, mek4 and some of the other receptors during the phase of gastrulation and early organogenesis has suggested functions for these receptors in the important cellular interactions involved in patterning the embryo at this stage (Gilardi-Hebenstreit et al. (1992) Oncogene 7:2499-2506; Nieto et al. (1992) Development 116:1137-1150; Henkemeyer et al, supra; Ruiz et al., supra; and Xu et al., supra). Sek, for example, shows a notable early expression in the two areas of the mouse embryo that show obvious segmentation, namely the somites in the mesoderm and the rhombomeres of the hindbrain; hence the name sek, for segmentally expressed kinase (Gilardi- Hebenstreit et al., supra; Nieto et al., supra). As in Drosophila, these segmental structures of the mammalian embryo are implicated as important elements in establishing the body plan. The observation that Sek expression precedes the appearance of morphological segmentation suggests a role for sek in forming these segmental structures, or in determining segment-specific cell properties such as lineage compartmentation (Nieto et al., supra). Moreover, EPH receptors have been implicated, by their pattern of expression, in the development and maintenance of nearly every tissue in the embryonic and adult body. For instance, EPH receptors have been detected throughout the nervous system, the testes, the cartilaginous model of the skeleton, tooth primordia, the infundibular component of the pituitary, various epithelia tissues, lung, pancreas, liver and kidney tissues. Observations such as this have been indicative of important and unique roles for EPH family kinases in development and physiology, but further progress in understanding their action has been severely limited by the lack of information on their ligands. As used herein, the terms "EPH receptor" or "EPH-type receptor" refer to a class of receptor tyrosine kinases, comprising at least eleven paralogous genes, though many more orthologs exist within this class, e.g. homologs from different species. EPH receptors, in general, are a discrete group of receptors related by homology and easily recognizable, e.g., they are typically characterized by an extracellular domain containing a characteristic spacing of cysteine residues near the N-terminus and two fibronectin type III repeats (Hirai et al. (1987) Science 238:1717-1720; Lindberg et al. (1990) Mol Cell Biol 10:6316-6324; Chan et al. (1991) Oncogene 6:1057-1061; Maisonpierre et al. (1993) Oncogene 8:3277-3288; Andres et al. (1994) Oncogene 9:1461-1467; Henkemeyer et al (1994) Oncogene 9:1001-1014; Ruiz et al. (1994) Mech Dev 46:87-100; Xu et al. (1994) Development 120:287-299; Zhou et al. (1994) JNeurosci Res 37:129-143; and references in Tuzi and Gullick (1994) Br J Cancer 69:417-421). Exemplary EPH receptors include the eph, elk, eek, sek, mek4, hek, hek.2, eek, erk, tyrol, tyro4, tyroS, tyroό, tyrol 1, cek4, cek5, cekό, cek7, cek8, cek9, ceklO, bsk, rtkl, rtk2, rtk3, mykl, myk2, ehkl, ehk.2, pagliaccio, htk, erk and nuk receptors. The term "EPH receptor" refers to the membrane form of the receptor protein, as well as soluble extracellular fragments which retain the ability to bind the ligand of the present invention.
In exemplary embodiments, the detection signal is provided by detecting phosphorylation of intracellular proteins, e.g., MEKKs, MEKs, or Map kinases, or by the use of reporter constructs or indicator genes which include transcriptional regulatory elements responsive to c-fos and/or c-jun. Described infra. D. G Protein-Coupled Receptors
One family of signal transduction cascades found in eukaryotic cells utilizes heterofrimeric "G proteins." Many different G proteins are known to interact with receptors. G protein signaling systems include three components: the receptor itself, a GTP-binding protein (G protein), and an intracellular target protein.
The cell membrane acts as a switchboard. Messages arriving through different receptors can produce a single effect if the receptors act on the same type of G protein. On the other hand, signals activating a single receptor can produce more than one effect if the receptor acts on different kinds of G proteins, or ifthe G proteins can act on different effectors.
In their resting state, the G proteins, which consist of alpha, beta and gamma subunits, are complexed with the nucleotide guanosine diphosphate (GDP) and are in contact with receptors. When a hormone or other first messenger binds to receptor, the receptor changes conformation and this alters its interaction with the G protein. This spurs the D subunit to release GDP, and the more abundant nucleotide guanosine triphosphate (GTP), replaces it, activating the G protein. The G protein then dissociates to separate the alpha subunit from the still complexed beta and gamma subunits. Either the G alpha subunit, or the G beta-gamma complex, depending on the pathway, interacts with an effector. The effector (which is often an enzyme) in turn converts an inactive precursor molecule into an active "second messenger," which may diffuse through the cytoplasm, triggering a metabolic cascade. After a few seconds, the G alpha converts the GTP to GDP, thereby inactivating itself. The inactivated G alpha may then reassociate with the G beta- gamma complex. Hundreds, if not thousands, of receptors convey messages through heterofrimeric G proteins, of which at least 17 distinct forms have been isolated. Although the greatest variability has been seen in the alpha subunit, several different beta and gamma structures have been reported. There are, additionally, several different G protein-dependent effectors. Most G protein-coupled receptors are comprised of a single protein chain that is threaded through the plasma membrane seven times. Such receptors are often referred to as seven-transmembrane receptors (STRs). More than a hundred different STRs have been found, including many distinct receptors that bind the same ligand, and there are likely many more STRs awaiting discovery.
In addition, STRs have been identified for which the natural ligands are unknown; these receptors are termed "orphan" G protein-coupled receptors, as described above. Examples include receptors cloned by Neote et al. (1993) Cell 72,
415; Kouba et al. FEBS Lett. (1993) 321, 173; Birkenbach et al.(1993) J Virol. 61,
2209.
The "exogenous receptors" of the present invention may be any G protein-coupled receptor which is exogenous to the cell wliich is to be genetically engineered for the purpose of the present invention. This receptor may be a plant or animal cell receptor. Screening for binding to plant cell receptors may be useful in the development of, e.g., herbicides. In the case of an animal receptor, it may be of invertebrate or vertebrate origin. If an invertebrate receptor, an insect receptor is preferred, and would facilitate development of insecticides. The receptor may also be a vertebrate, more preferably a mammalian, still more preferably a human, receptor. The exogenous receptor is also preferably a seven transmembrane segment receptor.
Known ligands for G protein coupled receptors include: purines and nucleotides, such as adenosine, cAMP, ATP, UTP, ADP, melatonin and the like; biogenic amines (and related natural ligands), such as 5-hydroxytryptamine, acetylcholine, dopamine, adrenaline, adrenaline, adrenaline., histamine, noradrenaline, noradrenaline, noradrenaline., tyramine/octopamine and other related compounds; peptides such as adrenocorticofrophic hormone (acth), melanocyte stimulating hormone (msh), melanocortins, neurotensin (nt), bombesin and related peptides, endothelins, cholecystokinin, gastrin, neurokinin b (nk3), invertebrate tachykinin-like peptides, substance k (nk2), substance p (nkl), neuropeptide y (npy), thyrotropin releasing-factor (trf), bradykinin, angiotensin ii, beta-endorphin, c5a anaphalatoxin, calcitonin, chemokines (also called intercrines), corticotrophic releasing factor (erf), dynorphin, endorphin, fmlp and other formylated peptides, follitropin (fsh), fungal mating pheremones, galanin, gastric inhibitory polypeptide receptor (gip), glucagon-like peptides (glps), glucagon, gonadotropin releasing hormone (gnrh), growth hormone releasing hormone(ghrh), insect diuretic hormone, interleukin- 8, leutropin (lh/hcg), met-enkephalin, opioid peptides, oxytocin, parathyroid hormone (pth) and ptlirp, pituitary adenylyl cyclase activiating peptide (pacap), secretin, somatostatin, thrombin, thyrotropin (tsh), vasoactive intestinal peptide (vip), vasopressin, vasotocin; eicosanoids such as ip-prostacyclin, pg- prostaglandins, tx-tl romboxanes; retinal based compounds such as vertebrate 11 -cis retinal, invertebrate 11 -cis retinal and other related compounds; lipids and lipid- based compounds such as cannabinoids, anandamide, lysophosphatidic acid, platelet activating factor, leukotrienes and the like; excitatory amino acids and ions such as calcium ions and glutamate.
Suitable examples of G-protein coupled receptors include, but are not limited to, dopaminergic, muscarinic cholinergic, a-adrenergic, b-adrenergic, opioid (including delta and mu), cannabinoid, serotoninergic, and GABAergic receptors. Preferred receptors include the 5HT family of receptors, dopamine receptors, C5a receptor and FPRL-1 receptor, cyclo-histidyl-proline-diketoplperazine receptors, melanocyte stimulating hormone release inhibiting factor receptor, and receptors for neurotensin, thyrotropin releasing hormone, calcitonin, cholecytokinin-A, neurokinin-2, histamine-3, cannabinoid, melanocortin, or adrenomodulin, neuropeptide- Yl or galanin. Other suitable receptors are listed in the art. The term "receptor," as used herein, encompasses both naturally occurring and mutant receptors.
Many of these G protein-coupled receptors, like the yeast alpha- and beta-factor receptors, contain seven hydrophobic amino acid-rich regions which are assumed to lie within the plasma membrane. Specific human G protein-coupled STRs for which genes have been isolated and for which expression vectors could be constructed include those listed herein and others known in the art. Thus, the gene would be operably linked to a promoter functional in the cell to be engineered and to a signal sequence that also functions in the cell. For example in the case of yeast, suitable promoters include Ste2, Ste3 and gallO. Suitable signal sequences include those of Ste2, Ste3 and of other genes which encode proteins secreted by yeast cells. Preferably, when a yeast cell is used, the codons of the gene would be optimized for expression in yeast. See Hoekema et al.,(1987) Mol. Cell. Biol, 7:2914-24; Sharp, et al, (1986)14:5125-43.
The homology of STRs is discussed in Dohlman et al., Ann. Rev. Biochem.,
(1991) 60:653-88. When STRs are compared, a distinct spatial pattern of homology is discernible. The transmembrane domains are often the most similar, whereas the
N- and C-terminal regions, and the cytoplasmic loop connecting transmembrane segments V and VI are more divergent.
The functional significance of different STR regions has been studied by introducing point mutations (both substitutions and deletions) and by constructing chimeras of different but related STRs. Synthetic peptides corresponding to individual segments have also been tested for activity. Affinity labeling has been used to identify ligand binding sites.
It is conceivable that when the host cell is a yeast cell, a foreign receptor will fail to functionally integrate into the yeast membrane, and there interact with the endogenous yeast G protein. More likely, either the receptor will need to be modified (e.g., by replacing its V-VI loop with that of the yeast STE2 or STE3 receptor), or a compatible G protein should be provided.
If the wild-type exogenous G protein-coupled receptor cannot be made functional in yeast, it may be mutated for this purpose. A comparison would be made of the amino acid sequences of the exogenous receptor and of the yeast receptors, and regions of high and low homology identified. Trial mutations would then be made to distinguish regions involved in ligand or G protein binding, from those necessary for functional integration in the membrane. The exogenous receptor would then be mutated in the latter region to more closely resemble the yeast receptor, until functional integration was achieved. If this were insufficient to achieve functionality, mutations would next be made in the regions involved in G protein binding. Mutations would be made in regions involved in ligand binding only as a last resort, and then an effort would be made to preserve ligand binding by making conservative substitutions whenever possible. Preferably, the yeast genome is modified so that it is unable to produce the yeast receptors which are homologous to the exogenous receptors in functional form. Otherwise, a positive assay score might reflect the ability of a peptide to activate the endogenous G protein-coupled receptor, and not the receptor of interest.
(i) Chemoattractant receptors
The N-formyl peptide receptor is a classic example of a calcium mobilizing G protein-coupled receptor expressed by neutrophils and other phagocytic cells of the mammalian immune system (Snyderman et al. (1988) In Inflammation: Basic Principles and Clinical Correlates, pp. 309-323). N-formyl peptides of bacterial origin bind to the receptor and engage a complex activation program that results in directed cell movement, release of inflammatory granule contents, and activation of a latent NADPH oxidase which is important for the production of metabolites of molecular oxygen. This pathway initiated by receptor-ligand interaction is critical in host protection from pyrogenic infections. Similar signal transduction occurs in response to the inflammatory peptides C5a and IL-8.
Two other formyl peptide receptor like (FPRL) genes have been cloned based on their ability to hybridize to a fragment of the NFPR cDNA coding sequence. These have been named FPRLl (Murphy et al. (1992) J Biol Chem.
261:1631-1643) and FPRL2 (Ye et al. (1992) Biochem Biophys Res. Comm.
184:582-589). FPRL2 was found to mediate calcium mobilization in mouse fibroblasts transfected with the gene and exposed to formyl peptide. In contrast, although FPRLl was found to be 69% identical in amino acid sequence to NFPR, it did not bind prototype N-formyl peptides ligands when expressed in heterologous cell types. This lead to the hypothesis of the existence of an as yet unidentified ligand for the FPRLl orphan receptor (Murphy et al. supra).
(ii) G proteins
In the case of an exogenous G protein-coupled receptor, the yeast cell must be able to produce a G protein which is activated by the exogenous receptor, and wliich can in turn activate the yeast effector(s). The art suggests that the endogenous yeast Gα subunit (e.g., GPA) will often be sufficiently homologous to the "cognate" Gα subunit which is natively associated with the exogenous receptor for coupling to occur. More likely, it will be necessary to genetically engineer the yeast cell to produce a foreign Gα subunit which can properly interact with the exogenous receptor. For example, the Gα subunit of the yeast G protein may be replaced by the Gα subunit natively associated with the exogenous receptor.
Dietzel and Kurjan, (1987) Cell, 50:1001) demonstrated that rat Gas can be functionally coupled to the yeast Gβγ complex. However, rat Gαi2 complemented only when substantially overexpressed, while Gα did not complement at all (Kang, et al., Mol. Cell. Biol, (1990)10:2582). Consequently, with some foreign Gα subunits, it is not feasible to simply replace the yeast Gα.
If the exogenous G protein coupled receptor is not adequately coupled to yeast Gβγ by the Gα subunit natively associated with the receptor, the Gα subunit may be modified to improve coupling. These modifications often will take the form of mutations which increase the resemblance of the Gα subunit to the yeast Gα while decreasing its resemblance to the receptor-associated Gα. For example, a residue may be changed so as to become identical to the corresponding yeast Gα residue, or to at least belong to the same exchange group of that residue. After modification, the modified Gα subunit might or might not be "substantially homologous" to the foreign and/or the yeast Gα subunit.
The modifications are preferably concentrated in regions of the Gα which are likely to be involved in Gβγ binding. In some examples, the modifications will take the form of replacing one or more segments of the receptor-associated Gα with the corresponding yeast Gα segment(s), thereby forming a chimeric Gα subunit. (For the purpose of the appended claims, the term "segment" refers to tliree or more consecutive amino acids). In other examples, point mutations may be sufficient.
This chimeric Gα subunit will interact with the exogenous receptor and the yeast Gβγ complex, thereby permitting signal transduction. While use of the endogenous yeast Gβγ is preferred, if a foreign or chimeric Gβγ is capable of transducing the signal to the yeast effector, it may be used instead. V. Pharmaceutical Preparations of Identified Agents
After identifying certain test CSPs in the subject assay, e.g. as potential surrogate ligands, or receptor antagonists, the practitioner of the subject assay will continue to test the efficacy and specificity of the selected peptides both in vitro and in vivo. Whether for subsequent in vivo testing, or for administration to an animal as an approved drug, peptides identified in the subject assay, or peptidomimetics thereof, can be formulated in pharmaceutical preparations for in vivo administration to an animal, preferably a human.
The peptides selected in the subject assay, or a pharmaceutically acceptable salt thereof, may accordingly be formulated for administration with a biologically acceptable medium, such as water, buffered saline, polyol (for example, glycerol, propylene glycol, liquid polyethylene glycol and the like) or suitable mixtures thereof. The optimum concentration of the active ingredient(s) in the chosen medium can be determined empirically, according to procedures well known to medicinal chemists. As used herein, "biologically acceptable medium" includes any and all solvents, dispersion media, and the like which may be appropriate for the desired route of administration of the pharmaceutical preparation. The use of such media for pharmaceutically active substances is known in the art. Except insofar as any conventional media or agent is incompatible with the activity of the compound, its use in the pharmaceutical preparation of the invention is contemplated. Suitable vehicles and their formulation inclusive of other proteins are described, for example, in the book Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. Mack Publishing Company, Easton, Pa., USA 1985). These vehicles include injectable "deposit formulations". Based on the above, such pharmaceutical formulations include, although not exclusively, solutions or freeze-dried powders of the compound in association with one or more pharmaceutically acceptable vehicles or diluents, and contained in buffered media at a suitable pH and isosmotic with physiological fluids. In preferred embodiment, the peptide can be disposed in a sterile preparation for topical and/or systemic administration. In the case of freeze- dried preparations, supporting excipients such as, but not exclusively, mannitol or glycine may be used and appropriate buffered solutions of the desired volume will be provided so as to obtain adequate isotonic buffered solutions of the desired pH. Similar solutions may also be used for the pharmaceutical compositions of compounds in isotonic solutions of the desired volume and include, but not exclusively, the use of buffered saline solutions with phosphate or citrate at suitable concentrations so as to obtain at all times isotonic pharmaceutical preparations of the desired pH, (for example, neutral pH).
Exemplification
The invention now being generally described, it will be more readily understood by reference to the following examples which are included merely for purposes of illustration of certain aspects and embodiments of the present invention, and are not intended to limit the invention.
Two distinct embodiments of the present invention are demonstrated in the preparation of vectors pSW3 and pSW4. These vectors can be used to display the CSPs useful in the instant application.
pSWl rFieure 1 and 2):
pSWl is the prototype vector for the KeyCode technology. Its backbone is a modified M13 bacteriophage (bp 1-4779 and 8585 to 9170) genome in which the wild type pill gene is deleted. Note that alternate embodiments are possible, for example the deletion (and relocation) of the wild type bacteriophage pVIII gene instead of pill. Inserted into this backbone is an Amp resistance gene (bp 4824- 5681) that allows for selection in E. coli. The expression segment (bp 6046-8584) contains the CMV promoter, splice acceptor and donor sequences, and SV40 replication/termination elements which allow for expression of peptide sequences in eukaryotic hosts. These eukaryotic segments are interspersed with E. coli expression elements (lac promoter, Ml 3 phage gene III (or gene VIII), and E. coli termination signals), wliich allow the cloned library peptides to be displayed on the phage coat as a fusion to the pill phage protein. Random oligonucleotides encoding the random peptides are directionally inserted into the phage as BstXI fragments. For the present invention, it is necessary to further modify pSWl in order to provide for display of test peptides or chimeric serum peptides, followed by secretion of chimeric serum peptides.
pSW2 fFigure 3 and 4):
pSW2 is a derivative of pSWl that was designed to aid the construction of albumin display vectors. In pSW2, the relevant functional elements are present as cassettes that expedite the simple exchange of components. pSW2 is another prototype vector for the KeyCode technology.
First, a Sail restriction site is introduced into pSWl through insertion of 4 bases (CGAC) at position 8728. This insertion will provide a restriction site that allows the exchange of signal sequences and N-terminal protein sequences for expression in eukaryotic hosts (see diagrams 4 and 5). For example, the IgH secretion signal sequence could be excised and replaced with a serum albumin secretion signal sequence. Second, Hindlll, Sapl, and BamHl restriction sites are engineered between base pairs 6841 and 6975 of pSWl, thus eliminating 10 amino acids of flanking sequence around the peptide libraries in pSWl and eliminating the potential effects of flanking sequence on peptide binding during phage panning (see Figure 4 and 6). Libraries cloned into these sites need to have the E. coli secretion signal / splice acceptor / splice donor sequences inserted on the same oligo as the library (Figure 8), resulting in correct translation and fusion of peptides to pill (or pVIII) in prokaryotic phage display while incorporating necessary splice elements for eukaryotic expression of peptides.
Third, a PflMI site is introduced at base pair 8264 by PCR to allow simple insertion of protein sequences, such as serum albumin, to which the peptide may be fused in eukaryotic expression (Figure 4 and 7).
Thus, pSW2 can be further modified in order to house the libraries of test peptides fused to bacterial gene III (or gene VIII). pSW2 was modified to form embodiments of vectors useful in the present invention, pSW3 and pSW4, as described below. In pSW3, the vector allows for display of test peptides, followed by secretion of chimeric serum peptides. In pSW4, the vector allows for display and secretion of chimeric serum peptides.
pSW3 fFigure 9 and 10 :
In pSW3, the peptides are displayed on the surface of the page as pill (or pVIII) fusions without serum albumin. However, in mammalian cells, such as COS- 7 cells, the peptides are expressed in the cys53-cys62 loop of domain I (aa 1-96) or full length mouse serum albumin.
For the construction of pSW3, the serum albumin secretion signal sequence and amino acids 1 to 51 of the mature serum albumin protein (Figures 10 and 11) are inserted between positions bp 6657 to 6725 of pSW2 by PCR into the Xbal/Sall sites. Due to requirements of the splice donor sequence at position 6721 of pSW2, amino acid 51 of serum albumin needs to be changed from Lys to Arg.
Second, the serum albumin coding sequence (amino acids 65-608 or 65-96) is cloned by PCR as an XhoI/PflM fragment into pSW2 (8737 to 8763; see Figures 10 and 12). Due to the requirements of the splice acceptor sequence at position 8759, amino acid 65 of serum albumin changes from Ser to Ala.
Peptide libraries will be cloned as either Sapl or Hindlll/BamHI oligomers by gap duplex method. In cloning these peptides, oligos must contain the E. coli secretion signal/splice acceptor/splice donor as well as the peptide library flanked by amino acids 52 and 53 and amino acids 62 to 64 of mouse serum albumin (see
Figures 10 and 13).
To further in vivo evaluation, peptides inserted into domain I can be transferred into wild type albumin as an Nrul/Xcml fragment in the pcDNA 3 (NX)
+ MSA WT plasmid. The resulting clone will express the peptide in loop 53 in a wild type MSA protein (K51, S65) from eukaryotic host cells (see Figure 14) and can be used for the construction of CHO producer cell lines.
pSW4 fFigures l5 and l6):
pSW4 allows the display the entire serum albumin domain 1 (aa 1-106) with the inserted peptides (for example, between Cys52-Cys63 and/or Cys91-Cysl00) on the surface of the virion as a pill fusion and can be used for the COS-7 expression of the albumin-peptide chimera. pSW2 was modified to create pSW4 as described below:
First, the serum albumin secretion signal sequence is inserted as complimentary oligos between the Xbal and Sail sites of pSW2 at positions 6657 and 6725(see Figures 16 and 17).
Second, amino acids 1 through 96 of serum albumin are inserted between the Hindlll and BamHl sites by PCR cloning. In this cloning, the original Hindlll site of pSW2 is destroyed by silent mutation. A new Hindlll site is engineered by silent mutagenesis in the serum albumin sequence at amino acids 65 and 66 (see diagrams 16 and 18). The presence of a unique Sphl site in serum albumin at amino acids 39 and 40 allows insertion of peptide libraries as Sphl Hindlll oligos (see Figures 16 and 19). SacII and Pstl sites are also engineered by silent mutagenesis in serum albumin at amino acids 86/87 and 105/106 respectively, which allow peptide library insertion between Cys 91 and Cys 100, in an additional surface loop of serum albumin. There are no amino acid changes in pSW4 in serum albumin (contrary to the K51R and S65A substitutions in pSW3). pill fusions longer than 30 amino acids will interfere with the infectivity of the phage. For the selective enrichment of peptides that bind to the target, the recovered phage must be amplified by re-infecting E. coli cells. Since in pSW4 all 5 copies of pill are fused to domain I of serum albumin, it is very likely to interfere with the infectivity of the pSW4 phage. To overcome this problem in pSW4, a 10 amino acid thrombin protease recognition site is inserted between the coding regions for MSA domain 1 and the phage pill (see Figures 16 and 18). Thrombin treatment of the phage prior to infection cleaves the MSA moiety off resulting in higher infectivity.
It is also possible to express the peptides as part of the entire serum albumin molecule. In this case a serum albumin fragment encoding aa97 to 608 is cloned between the XhoI/PflM sites. pAMlO:
The expression cassette of pAMlO is the same as in pSW3except it is a phagemid. pAMlO allows the display of the entire serum albumin domain I (aa 1- 106) with test peptides inserted between Cys52-Cys63 and/or Cys91-Cysl00 on the surface of the virion as a pill fusion. The bacteriophage pVIII gene may be used in place of the pill gene. In such a case, a helper phage is used in order to produce wild-type pVIII, without the attached chimeric serum peptide. The presence of the wild type pVIII allows formation of viable phage in which some of the phage pVIII coat protein will be attached to CSPs. When transfected into COS7 cells, it directs the expression and secretion of the albumin peptide chimera. pAM9 was modified as described below to create pAMlO: The pSW3 cassette comprised of the MSA signal sequence, lac promoter, E. coli signal sequence, MSA domain I (aa 1-106) and pill gene (or pVIII gene) with interspersed eμkaryotic splice signals was used to replace the similar pAM9 cassette as an Xbal/Xhol fragment.
pAMl l:
The expression cassette of pAMl 1 is the same as in pSW4 except it is a phagemid. pAMl 1 allows the display of the test peptides on the- surface of the virion as a pill fusion. The bacteriophage pVIII gene may be used in place of the pill gene. In such a case, a helper phage is used in order to produce wild-type pVIII, without the test peptide. The presence of the wild type pVIII allows formation of viable phage in which some of the phage pVIII coat protein will be attached to test peptides. When transfected into COS7 cells, it directs the expression and secretion of the albumin peptide chimera.
Myc epitope display in MSA loop regions.
In order to determine whether the predicted loops were indeed exposed on the surface of the albumin molecule, mouse serum albumin (MSA) was modified to include the myc epitope, EQKLISEEDL (SEQ ID NO: 1). The myc epitope was inserted in the middle of each of three amino acid segments: between amino acids 57-58 for loop 53-62, amino acids 364-365 for loop 360-369 and amino acids 467- 468 for loop 450-467. Cos7 cells were transfected with either wild type MSA or the various myc containing MSA constructs. The presence of the proteins in the medium was first determined by Western blot analysis using antibodies specific for MSA and the myc epitope. It was determined that only samples from media from cells transfected with MSA or MSA-Myc reveal the presence of the albumin protein. Additionally, only the samples from cells transfected with MSA-Myc are positive for the myc epitope. As the samples are all denatured by virtue of the SDS-PAGE system, this analysis does not allow for the differentiation of myc epitopes that would be exposed on the surface versus one that was buried within the protein. For this analysis immunoprecipitation with the myc-specific antibody was utilized. In this experiment, the conditioned media was either mixed directly with the antibody (N, native) or first denatured in the presence of 0.1% SDS, 1 mM β- mercapthoethanol and heat (100°C for 10 min) and then antibody added (D, denatured). Following immunoprecipitation the presence of the proteins that could be precipitated by the myc antibody were revealed by Western blot analysis using the MSA specific antibody. As predicted, the albumin proteins with myc inserted in loops 53-62 and 360-369 were bound by the myc antibody regardless of whether the protein was in its native or denatured form. On the other hand, when myc was inserted in the predicted buried region, loop 450-463, the protein only bound the antibody when it was first denatured. This experiment clearly demonstrates that loops 53-62 and 360-369 are exposed on the surface of the MSA protein and therefore good for display. Additionally, the 450-463 loop is buried.
Inhibition of bovine capillary endothelial cells (BCE) MSA-RGD.
The goal of this experiment was determine the function of MSA with the RGD peptide (VRGDF, SEQ ID NO: 2) displayed on the surface of the protein in the loop 53-58 region (MSA-myc-RGD). RGD was chosen, as this peptide can efficiently bind to *v*3 integrin receptors on endothelial cells and inhibit their proliferation. Triplicate wells of Cos7 cells were transfected with the following constructs: MSA-myc (the myc epitope was added to the C-terminal tail of MSA in this iteration); MSA-myc-RGD; or pAM7-stuffer. These Cos7 cells were grown in the lower chamber of a Transwell® tissue culture plate with BCE cells in the upper chamber. To stimulate growth of the BCE cells, FGF was added to the lower chamber or not in the case of no FGF control and the cells allowed to grow for 72 hours. To one set of wells, those with pAM7-stuffer, 6.25 *M c-RGD peptide was also added. Cell growth was determined by a Calcein-binding fluorescence assay. The left panel of Figure 3 is a graph of the optical density (OD) for each. The data reveals the addition of FGF results in a 2-fold stimulation of growth of the BCE cells. This growth was inhibited by the c-RGD peptide and also by the secreted MSA-myc-RGD protein. The right panel is a different way of looking at the same data. In this instance the degree of inhibition of growth is graphed for each. The data shows that the MSA-Myc-RGD protein inhibited the growth of the BCE cell by 53% and the degree of inhibition was equivalent to that of the added RGD peptide. The RGD peptide displayed on the surface of the MSA molecule inhibited BCE cell growth as efficiently as the endogenously added free RGD peptide demonstrating that the peptide retains its activity in the looped orientation.
Inhibition of BCE and HUVEC proliferation by serum albumin-ECBP fusions.
This experiment demonstrated the inhibition of BCE and HUNEC cell proliferation by purified mouse serum albumin (MSA) proteins that displayed endothelial cell binding (EC) peptides. In the MSA-peptide fusions the peptide sequence was inserted into a cysteine constrained loop between amino acids 53 and 62. The proteins were produced by COS-7 cells that were transfected with expression plasmids that directed the synthesis and secretion of the particular recombinant protein. As shown in Figure 5 of U.S.S.N. 09/768,183, in the MSA- 9G5, MSA-11B3 and MSA-RGD constructs the inserted peptides replaced the naturally occurring residues of MSA between cys53-cys62. In MSA-1H5 and MSA- myc constructs (negative control), the peptides were inserted into the loop at amino acid glu57. Figures 6 and 7 of U.S.S.N. 09/768,183 show the inhibitory effect of the purified proteins on the proliferation of BCE and HUVEC cells that were stimulated by FGF.
Experimental design of the EC proliferation experiments Protein production and concentration COS7-L cells were transfected with protein expression constructs expressing:
1. MSA, full-length mouse serum albumin (negative control)
2. MSA-RGD, in which the RGD sequence (VRGDF, SEQ ID NO: 2) replaces the MSA sequence between Cys 53 and Cys 62
3. MSA-11B3, in wliich the 11-B3 peptide sequence (PSTLRAQ, SEQ ID NO: 3) replaces the MSA sequence between Cys 53 and Cys 62
4. MSA-1H5, in wliich the 1-H5 peptide sequence (HTKQIPRHIYSA, SEQ ID NO: 4) is inserted between Glu 57 and Ser 58 within the Cys 53 and Cys 62 loop of MSA
5. MSA-9G5, in which the 9-G5 peptide sequence (DSHKRLK, SEQ ID NO: 5) replaces the MSA sequence between Cys 53 and Cys 62
6. MSA-myc, in which the Myc epitope peptide sequence (EQKLISEEDL, SEQ ID NO: 1) is inserted between Glu 57 and Ser 58 within the Cys 53 and Cys 62 loop of MSA (negative control)
The transfected COS7-L cells were cultured in defined serum free media (VP-SFM). Each day for 5 days, the conditioned media were collected from the cells, centrifuged to remove dead cells and other cellular debris, and then frozen. The 5 days worth of cultured media were pooled and concentrated 500-fold using a Centiprep-80 with a molecular weight cut-off of 50 (for MSA, MSA-RGD, MSA- 9G5) or a molecular weight cut-off of 30 (for MSA-myc, MSA-11B3, MSA-1H5). The concentration of the albumin proteins was determined by Western blot analysis of each preparation using a rabbit anti-MSA antibody and using purified MSA of known concentration to generate a standard curve. Following development of the blot and exposure to film the autoradiographs were analyzed using the Gel Doc 1000 image analysis system and Molecular Analyst software (BioRad).
BCE Proliferation Assays
On day zero, bovine capillary endothelial cells (BCE) at passage 11 were plated in 96 well tissue culture plates at a density of 2 X 103 cells per well in 100 ml 5% calf serum (CSV DMEM supplemented with penicillin/streptomycin (PS). The cells were then incubated overnight in an atmosphere of 10 % CO2, 37 °C.
On day one, the media was changed to 150 ml 2% CS/DMEM/PS. The albumin proteins were added to the first well as 8.75 ml which contains an additional 150 ml of 2% CS/DMEM/PS. 150 ml was then removed from this well and added to the next well resulting in a 1:2 dilution of the protein. This process was repeated for a total of six times each in triplicate. 50 ml of 4 ng/ml FGF (final concentration: 1 ng/ml FGF) was then added to each well and the plates incubated as above for 72 h. A synthetic peptide of cyclic RGD (c-RGD) at a concentration of 4.1 mM was included to serve as a positive control for inhibition of proliferation. Cells without addition of protein but with FGF added and without FGF added were included on each plate as additional controls.
After the 72 h incubation, the media was removed, and the plates were washed twice with PBS and frozen at -80 °C. Proliferation of the BCE cells was assessed using the CyQUANT® cell proliferation assay kit according to the manufacturer's recommendations. Conclusions
The insertion of the EC binding peptides into MSA increased their inhibitory activity by approximately 1000-fold. The MSA-EC binding peptide fusions inhibited BCE and HUVEC proliferation in the nanomolar (nM) range while the synthetic peptides were active in the micromolar (mM) range. The control MSA and MSA- myc proteins did not significantly affect the proliferation of the target endothelial cells.
Induction of tumor cell apoptosis by MSA-RGD fusions.
Peptides containing the RGD (Arg-Gly-Asp) motif have been shown to induce apoptosis in a caspase-3 dependent manner through the promotion of pro- caspase3 auto-cleavage and activation (Buckley et al., 1999). It was therefore of interest to determine if the MSA-RGD fusion was also capable of inducing apoptosis. To test this hypothesis, human non-small cell lung carcinoma cells (NCI 1869) were plated on the membrane of a transwell insert. These cells were incubated to allow attachment. Cos7 cells were transfected with a plasmid containing cDNA encoding (pcDNA MSA-RGD/53) for the expression and secretion of the respective fusion protein. An empty vector (pcDNA3) was transfected in parallel as a negative control. After 24 hours, the transwell insert carrying the NCI- 1869 cells was transferred to the plate containing the Cos7/MSA-RGD transfectants. The cells were co-incubated for an additional 24 hours. The NCI 1869 cells were then recovered and incubated in PBS/Mg"1""1" containing the fluorometric Caspase-3 substrate, DEVD-AFC. Cleavage of this fluorogenic tetrapeptide substrate by Caspase-3 generates a fluorescent signal, which is read in a fluorometric plate reader as a measure of the induction of apoptosis.
The results presented in Figure 8 of U.S.S.N. 09/768,183 (each bar is the average of 3 independent samples) demonstrate that the secretion of MSA-RGD by
Cos7 cells leads to a 4.9 fold induction of apoptosis relative to the vector control in
NCI- 1869 cells. Incubation of these cells with purified RGD peptide also leads to the induction of apoptosis as assessed by microscopic analysis.
All of the above-cited references and publications are hereby incorporated by reference.
Equivalents
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, numerous equivalents to the specific method and reagents described herein, including alternatives, variants, additions, deletions, modifications and substitutions. Such equivalents are considered to be within the scope of this invention and are covered by the following claims.

Claims

Claims:
1. A method for generating a chimeric serum peptide (CSP) with a selected biological activity, comprising:
(i) providing a display library comprising a variegated population of test chimeric serum peptides (CSPs) expressed on the surface of a population of display packages, each of which CSPs includes a serum protein sequence and at least one heterologous test peptide sequence which is variegated in the library and which is provided at an N- terminal end, C-terminal end or internal site of the serum protein sequence;
(ii) in a display mode, isolating, from the display library, a sub- population of display packages enriched for test CSPs which have a desired binding specificity and/or affinity for a cell or a component thereof; (iii) in a secretion mode, simultaneously expressing the enriched test CSP sub-population under conditions wherein the test CSPs are secreted and are free of the display packages; and, (iv) assessing the ability of the secreted test CSPs to regulate a selected biological activity in a target cell; (v) selecting a chimeric serum peptide (CSP) possessing the ability to regulate the selected biological activity in the target cell, thereby generating a chimeric serum peptide (CSP) with a selected biological activity.
2. A method for generating a chimeric serum peptide (CSP) with a selected biological activity, comprising:
(i) providing a display library comprismg a variegated population of test peptides expressed on the surface of a population of display packages, wherein the test peptide sequences are separated from the serum protein sequences by splice sites that are functional in eukaryotic cells, but not in prokaryotic cells; (ii) in a display mode, isolating, from the display library, a sub- population of display packages enriched for test peptides which have a desired binding specificity and/or affinity for a cell or a component thereof; (iii) in a secretion mode, simultaneously expressing the enriched test peptide sub-population under conditions wherein the test peptide is flanked by serum protein sequences, such that the test peptide is in the form of a test chimeric serum peptide (CSP), and the test CSPs are secreted and are free of the display packages; and, (iv) assessing the ability of the secreted test CSPs to regulate a selected biological activity in a target cell; (v) selecting a chimeric serum peptide (CSP) possessing the ability to regulate the selected biological activity in the target cell, thereby generating a chimeric serum peptide (CSP) with a selected biological activity.
3. The method of claim 1 or 2, wherein the display library is a phage display library.
4. The method of claim 3, wherein the display packages of the phage display library are phage particles selected from: M13, fl, fd, Ifl, Ike, Xf, Pfl, Pf3, λ, T4, T7, P2, P4, φX-174, MS2 or f2.
5. The method of claim 3, wherein the phage display library is generated with a filamentous bacteriophage specific for Escherichia coli and the phage coat protein is coat protein III.
6. The method of claim 5, wherein the filamentous bacteriophage is selected from: Ml 3, fd, or fl.
7. The method of claim 1 or 2, wherein the display library is a bacterial cell- surface display library or a spore display library.
8. The method of claim 3, wherein test CSPs are enriched from the display library in the display mode by a differential binding means comprising affinity separation of test CSPs which specifically bind the cell or component thereof from test CSPs which do not.
9. The method of claim 8, wherein the differential binding means comprises panning the display library on whole cells.
10. The method of claim 8, wherein the differential binding means comprises an affinity chromatographic means in which a component of a cell is provided as part of an insoluble matrix.
11. The method of claim 10, wherein the insoluble matrix comprises a cell surface protein attached to a polymeric support.
12. The method of claim 8, wherein the differential binding means comprises immunoprecipitating the display packages.
13. The method of claim 1 or 2, wherein the display mode emiches for test CSPs which bind to a cell-type specific marker.
14. The method of claim 1 or 2, wherein the display mode emiches for test CSPs which bind to a cell surface receptor protein.
15. The method of claim 14, wherein the receptor protein is a G-protein coupled receptor.
16. The method of claim 15, wherein the G-protein coupled receptor is selected from: a chemoattractant peptide receptor, a neuropeptide receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, or a polypeptide hormone receptor.
17. The method of claim 15, wherein the G-protein coupled receptor is selected from: αlA-adrenergic receptor, αlB-adrenergic receptor, α2-adrenergic receptor, α2B-adrenergic receptor, βl -adrenergic receptor, β2- adrenergic receptor, β3-adrenergic receptor, ml acetylcholine receptor (AChR), m2
AChR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine receptor, 5-HTla, 5-HTlb, 5HTl-like, 5-HTld, 5HTld-like, 5HTld beta, substance K (neurokinin A), fMLP receptor, fMLP-like receptor, angiotensin II type 1, endothelin ETA, endothelin ETB, thrombin, growth hormone-releasing hormone (GHRH), vasoactive intestinal peptide, oxytocin, somatostatin SSTRl and SSTR2, SSTR3, cannabinoid, follicle stimulating hormone (FSH), leutropin (LH/HCG), thyroid stimulating hormone (TSH), thromboxane A2, platelet-activating factor (PAF), C5a anaphylatoxin, Interleukin 8 (IL-8) IL-8RA, IL-8RB, Delta Opioid, Kappa Opioid, mip- 1/RANTES, Rhodopsin, Red opsin, Green opsin, Blue opsin, metabotropic glutamate mGluRl-6, histamine H2, ATP, neuropeptide Y, amyloid protein precursor, insulin-like growth factor II, bradykinin, gonadotropin-releasing hormone, cholecystokinin, melanocyte stimulating hormone receptor, antidiuretic hormone receptor, glucagon receptor, or adrenocorticotropic hormone II.
18. The method of claim 14, wherein the receptor protein is a receptor tyrosine kinase.
19. The method of claim 18, wherein the receptor tyrosine kinase is an EPH receptor.
20. The method of claim 19, wherein the receptor is selected from: eph, elk, eek, sek, mek4, hek, hek2, eek, erk, tyrol, tyro4, tyro5, tyroό, tyrol 1, cek4, cek5, cekό, cek7, cek8, cek9, ceklO, bsk, rtkl, rtk.2, rtk3, mykl, myk2, ehkl, ehk2, pagliaccio, htk, erk or nuk receptors.
21. The method of claim 14, wherein the receptor protein is a cytokine receptor.
22. The method of claim 14, wherein the receptor protein is an MIRR receptor.
23. The method of claim 14, wherein the receptor protein is an orphan receptor.
24. The method of claim 1 or 2, wherein the display library includes at least 10 different test CSPs.
25. The method of claim 1 or 2, wherein the test peptide portion of the CSPs are from about 3 to about 100 amino acid residues in length.
26. The metliod of claim 1 or 2, wherein the test peptide portion of the CSPs are from about 4 to about 20 amino acid residues in length.
27. The method of claim 1 or 2, wherein each of the test CSPs are encoded by a chimeric gene comprising (i) a coding sequence for the test CSP, (ii) a coding sequence for a surface protein of the display package for displaying the test CSPs on the surface of a population of display packages, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in the display mode, the chimeric gene is expressed as fusion protein including the test CSP and the surface protein, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
28. The method of claim 1 or 2, wherein the test CSPs are expressed by a eukaryotic cell in the secretion mode.
29. The method of claim 28, wherein the eukaryotic cell is a mammalian cell.
30. The method of claim 1 or 2, wherein the target cell is a eukaryotic cell.
31. The method of claim 30, wherein the eukaryotic cell is a mammalian cell.
32. The method of claim 31 , wherein the mammalian cell is a human cell.
33. The method of claim 1 or 2, wherein the biological activity includes a change in cell proliferation, cell differentiation or cell death.
34. The method of claim 1 or 2, wherein the biological activity is detected by changes in intracellular calcium mobilization.
35. The method of claim 1 or 2, wherein the biological activity is detected by changes in intracellular protein phosphorylation.
36. The method of claim 1 or 2, wherein the biological activity is detected by changes in phospholipid metabolism.
37. The method of claim 1 or 2, wherein the biological activity is detected by changes in expression of cell-specific marker genes.
38. The method of claim 13 , wherein the target cell further comprises a reporter gene construct containing a reporter gene in operative linkage with one or more transcriptional regulatory elements responsive to the signal transduction activity of the cell surface receptor protein, expression of the reporter gene providing the detectable signal.
39. The method of claim 38, wherein the reporter gene encodes a gene product that gives rise to a detectable signal selected from: color, fluorescence, luminescence, cell viability relief of a cell nutritional requirement, cell growth, or drug resistance.
40. The method of claim 39, wherein the reporter gene encodes a gene product selected from: chloramphenicol acetyl transferase, beta-galactosidase or secreted alkaline phosphatase.
41. The method of claim 39, wherein the reporter gene encodes a gene product which confers a growth signal.
42. The method of claim 1 or 2, wherein the secretion mode includes expression of the test CSPs by a host cell co-cultured with the target cell.
43. The method of claim 42, wherein the co-cultured host and target cells are separated by a membrane which is permeable to the test CSP.
44. The method of claim 1 or 2, wherein the secretion mode comprises assessing the ability of the secreted test CSPs to inhibit the biological activity of an exogenously added compound on the target cells.
45. The method of claim 1 or 2, wherein: in step (ii), display packages which bind to endothelial cells are isolated; and in step (iv), the ability of the secreted test CSPs to inhibit proliferation of endothelial cells is assessed.
46. The method of claim 45, wherein: in step (iv), the ability of the secreted test CSPs to inhibit proliferation of endothelial cells in the presence of an angiogenic amount of an endogenous growth factor is assessed.
47. The method of claim 1 or 2, wherein the serum protein sequence is selected from: albumins, α-globulins, β-globulins, γ-globulins, haptoglobin, transthyretin, collagen, α2 macroglobulin, β2 microglobulin, C Reactive Protein, apolipoproteins, lipoproteins, cathepsins amylase, antichymotrypsin, ferritin, α fetoprotein, elastin and fibronectin and coagulation factors including fibrinogen, fibrin, thrombin, ceruloplasmin, antiplasmin and antithrombin III, or fragments thereof.
48. The method of claim 1 or 2, wherein the CSPs are represented by the formula A-B-C, wherein A represents a first fragment of a serum protein or homolog thereof, B represents a test peptide sequence, and C represents another fragment of a serum protein or a homolog thereof.
49. The method of claim 1 or 2, comprising the further step of formulating, with a pharmaceutically acceptable carrier, one or more test CSPs which regulate the biological activity in the target cell or peptidomimetics thereof.
50. A display library enriched for test chimeric serum peptides (CSPs) having a desired binding specificity and/or affinity for a cell or a component thereof and which regulate a biological activity in a target cell.
51. A vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as a fusion protein including the test CSP and the surface protein such that the test CSP can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
52. A vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as a test peptide protein not including the serum protein sequence and the surface protein such that the test peptide can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing.
53. The vector of claim 51 or 52, wherein the chimeric gene further comprises a secretion signal sequence for secretion of the test CSP in the secretion mode.
54. The vector of claim 53, wherein the secretion signal sequence causes secretion of the test CSP from eukaryotic cells.
55. The vector of claim 54, wherein the eukaryotic cells are mammalian cells.
56. A vector library, each vector comprising a chimeric gene for a chimeric serum peptide (CSP), which chimeric gene comprises (i) a coding sequence for a test CSP, which CSP includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C- terminal end or internal site of the serum protein sequence, which test peptide sequence is variegated amongst members of the library, (ii) a coding sequence for a surface protein of a display package, and (iii) RNA splice sites flanking the coding sequence for the surface protein, wherein, in a display mode, the chimeric gene is expressed as fusion protein including the test CSP and the surface protein such that the test CSP can be displayed on the surface of a population of display packages, whereas in the secretion mode, the test CSP is expressed without the surface protein as a result of the coding sequence for the surface protein being removed by RNA splicing, the vector library collectively encodes a variegated population of test CSPs.
57. The vector library of claim 56, wherein the chimeric gene further comprises a secretion signal sequence for secretion of the test CSP in the secretion mode.
58. The vector library of claim 57, wherein the secretion signal sequence causes secretion of the test CSP from eukaryotic cells.
59. The vector library of claim 58, wherein the eukaryotic cells are mammalian cells.
60. A cell composition comprising a population of cells containing the vector library of claim 56.
61. A construct as shown in Figure 1, 3, 9 or 15.
62. A method for identifying a peptide with a selected antimicrobial activity, comprising:
(i) providing a recombinant host cell population which expresses a soluble peptide library comprising a variegated population of test chimeric serum proteins (CSPs), which includes a serum protein sequence and at least one heterologous test peptide sequence provided at an N-terminal end, C-terminal end or internal site of the serum protein sequence; (ii) culturing the host cells with a target microorganism under conditions wherein the peptide library is secreted and diffuses to the target microorganism; and, (iii) selecting a host cell expressing a test CSP that inhibits growth of the target microorganism, thereby identifying a peptide with a selected antimicrobial activity..
63. The method of claim 59, wherein the target microorganism is a bacteria or a fungus.
64. The method of claim 59, wherein the host cell population is cultured on agar embedded with the target microorganisms.
65. The method of claim 64, wherein the antimicrobial activity of a test CSP is determined by zone clearing in the agar.
PCT/US2003/002085 2002-01-23 2003-01-23 Methods and reagents for generating chimeric serum peptide carri ers WO2003061596A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003222199A AU2003222199A1 (en) 2002-01-23 2003-01-23 Methods and reagents for generating chimeric serum peptide carri ers

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US35122502P 2002-01-23 2002-01-23
US60/351,225 2002-01-23

Publications (2)

Publication Number Publication Date
WO2003061596A2 true WO2003061596A2 (en) 2003-07-31
WO2003061596A3 WO2003061596A3 (en) 2004-07-22

Family

ID=27613473

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/002085 WO2003061596A2 (en) 2002-01-23 2003-01-23 Methods and reagents for generating chimeric serum peptide carri ers

Country Status (2)

Country Link
AU (1) AU2003222199A1 (en)
WO (1) WO2003061596A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8101371B2 (en) 2007-10-18 2012-01-24 Musc Foundation For Research Development Methods for the diagnosis of genitourinary cancer
CN102517225A (en) * 2011-12-28 2012-06-27 浙江大学 Recombinant saccharymyces cerevisiae exhibiting amylase at surface as well as construction method and application

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5503993A (en) * 1987-12-02 1996-04-02 The Green Cross Corporation Method of preparing foreign protein in yeast, recombinant DNA, transformant
US5648254A (en) * 1988-01-15 1997-07-15 Zymogenetics, Inc. Co-expression in eukaryotic cells
US6054312A (en) * 1997-08-29 2000-04-25 Selective Genetics, Inc. Receptor-mediated gene delivery using bacteriophage vectors
US6232081B1 (en) * 1997-10-16 2001-05-15 Baylor College Of Medicine Method for the detection of NF-κB regulatory factors

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5503993A (en) * 1987-12-02 1996-04-02 The Green Cross Corporation Method of preparing foreign protein in yeast, recombinant DNA, transformant
US5648254A (en) * 1988-01-15 1997-07-15 Zymogenetics, Inc. Co-expression in eukaryotic cells
US6054312A (en) * 1997-08-29 2000-04-25 Selective Genetics, Inc. Receptor-mediated gene delivery using bacteriophage vectors
US6232081B1 (en) * 1997-10-16 2001-05-15 Baylor College Of Medicine Method for the detection of NF-κB regulatory factors

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8101371B2 (en) 2007-10-18 2012-01-24 Musc Foundation For Research Development Methods for the diagnosis of genitourinary cancer
CN102517225A (en) * 2011-12-28 2012-06-27 浙江大学 Recombinant saccharymyces cerevisiae exhibiting amylase at surface as well as construction method and application

Also Published As

Publication number Publication date
WO2003061596A3 (en) 2004-07-22
AU2003222199A1 (en) 2003-09-02

Similar Documents

Publication Publication Date Title
AU761648B2 (en) Methods and reagents for isolating biologically active peptides
US5733731A (en) Peptide library and screening method
US20020025536A1 (en) Methods and reagents for isolating biologically active antibodies
US7125679B2 (en) Methods to screen peptide libraries using minicell display
US7393318B2 (en) Methods and compositions for interaction trap assays
EP1696038B1 (en) Isolating biological modulators from biodiverse gene fragment libraries
CA2266543C (en) Methods and compositions for identifying receptor effectors
US20020081570A1 (en) System to detect protein-protein interactions
KR20020059370A (en) Methods and compositions for the construction and use of fusion libraries
EP0610448A1 (en) Peptide library and screening method
AU2002254134A1 (en) Method to screen peptide display libraries using minicell display
US7297491B2 (en) Methods and compositions for interaction trap assays
WO2003061596A2 (en) Methods and reagents for generating chimeric serum peptide carri ers
WO1998007845A1 (en) Interaction trap assay, reagents and uses thereof
AU752792B2 (en) Methods and compositions for the determination of protein function and identification of modulators thereof
Rodi et al. Filamentous bacteriophage structure and biology
US20100022402A1 (en) Methods and Compositions for the In Vitro High-Throughput Detection of Protein/Protein Interactions

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 200403010

Country of ref document: ZA

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP