CA2385460A1 - Rgs10b, a g-protein regulator expressed in osteoclasts - Google Patents
Rgs10b, a g-protein regulator expressed in osteoclasts Download PDFInfo
- Publication number
- CA2385460A1 CA2385460A1 CA002385460A CA2385460A CA2385460A1 CA 2385460 A1 CA2385460 A1 CA 2385460A1 CA 002385460 A CA002385460 A CA 002385460A CA 2385460 A CA2385460 A CA 2385460A CA 2385460 A1 CA2385460 A1 CA 2385460A1
- Authority
- CA
- Canada
- Prior art keywords
- seq
- nucleotides
- nucleic acid
- complement
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides nucleic acid molecules derived from a human osteoclastoma cDNA library. The nucleic acid molecules are used in applications such as diagnostic methods, screening assays and as hybridizati on probes and primers.
Description
RGS l OB, A G-PROTEIN REGULATOR EXPRESSED 1N OSTEOCLASTS
BACKGROUND OF THE INVENTION
The G-protein family plays an important role in signal transduction by linking extracellular receptors to intracellular signaling pathways. G
proteins are composed of cx, (3 and y subunits. In the inactive state, the a subunits bind to GDP.
They are activated by G-protein-coupled receptors (bound to ligand), which trigger the dissociation of GDP and the subsequent binding of GTP, to activate the cx subunit. The active state is terminated when the intrinsic GTPase activity contained within the cx subunit hydrolyzes GTP to GDP (Gold et al., J. Neurosci 17(20):8024-8037 (1997)).
A family of proteins, termed the regulator of G-protein signaling (RGS) proteins, has been identified which modulates G-protein function by activating the GTPase activity of oc subunits (Dohlman and Thorner, J. Biol. Chem. 272:3871-3874 (1997); Koelle, Curr. Opin. Cell. Biol. 9:143-147 (1997); Neer, Curr.
Biol.
7:831-R33 (1997)). These proteins reduce the duration of the active state of the tx subunit, which inhibits G-protein function.
SUMMARY OF THE INVENTION
Using a differential screening approach, partial cDNA sequences encoding genes which are selectively expressed in osteoclasts were identified. For one of these clones (designated clone 140), a full-length cDNA (designated RGS l OB) was identified from an osteoclastoma library. This clone encodes a 181 amino acid polypeptide which appears to represent an isoform of a previously described molecule, known as RGS 10. RGS 10 has been characterized as a selective activator of Ga 1 GTPase activity, and hence an inhibitor of G-protein signaling (Hunt et al., Nature 383:175-177 (1996). RGS10 is a member of a large family of G-protein inhibitors which is highly conserved throughout evolution.
_7_ The amino terminus of the RGS 1 OB sequence is distinct from that of RGS 10, with only 3 of 16 residues identical (Figure 1). In contrast, residues 17-181 are identical in the two molecules, and RGS 1 OB possess the highly conserved RGS
domain. Additionally, the 3' untranslated region (LTTR) of RGS 1 OB is also unique from that of RGS 10, indicating that RGS 1 OB is likely to be generated by alternative splicing of at least 2 novel exons.
Multiple tissue Northern analysis was performed using a 90-base pair probe to the amino terminus of RGS 1 OB. A 1 kb mRNA for RGS 1 OB was detected very strongly in the osteoclastoma tumor, but not in the stromal cell component of the tumor. RGS 1 OB mRNA was minimally expressed in HSB-2 (B lymphocyte cell line) and U937 (myelomonocytic cell line). All other human tissues and cell lines tested were negative for RGS 1 OB mRNA. These data demonstrate that RGS 1 OB
is selectively expressed in human osteoclasts. Thus, RGSlOB provides a marker for osteoclasts, and may also have an important regulatory function. Inhibiting RGS10B may up- or down-regulate bone resorption, depending on the pathway it normally inhibits.
Thus, the present invention relates to an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID
NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: l; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID
NO: 1; and the complement of nucleotides 547-831 of SEQ ID NO: 1. These nucleotide sequences are distinct from that of RGS 10 and can be utilized as probes specific for RGS l OB.
The invention also relates to an isolated nucleic acid molecule consisting essentially of or consisting of a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: l; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: l; nucleotides 547-831 of SEQ ID NO: 1; and the complement of nucleotides 547-831 of SEQ ID
NO: 1.
The invention further relates to a nucleic acid molecule which hybridizes under high stringency conditions to a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: l; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID NO: l; and the complement of nucleotides 547-831 of SEQ ID
NO: 1. The invention further relates to a nucleic acid molecule which is substantially identical to the nucleotide sequence of SEQ ID NO: 1 and which comprises nucleotides 1-48 of SEQ ID NO: 1.
The invention further provides a method for assaying the presence of a nucleic acid molecule (e.g., RGSlOB) in a sample, comprising contacting said sample with a second nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID
NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831of SEQ ID NO: l; and the complement of nucleotides 547-831of SEQ ID NO: l, under conditions appropriate for selective hybridization.
The invention also relates to a vector comprising an isolated nucleic acid molecule of the invention operatively linked to a regulatory sequence, as well as to a recombinant host cell comprising the vector. The invention also provides a method for preparing a polypeptide encoded by an isolated nucleic acid molecule, comprising culturing a recombinant host cell of the invention under conditions suitable for expression of said nucleic acid molecule.
The invention further provides an isolated polypeptide encoded by isolated nucleic acid molecules of the invention. In a particular embodiment, the polypeptide comprises an amino acid sequence selected from the group consisting of the amino acid sequence of SEQ ID NO: 2 and amino acids 1-16 of SEQ ID N0:2, or portion thereof. The invention also relates to an isolated polypeptide comprising an amino acid sequence which is substantially identical to the amino acid sequence of SEQ ID
NO: 2 and which comprises amino acids 1-16 of SEQ ID NO: 2.
The invention also relates to an antibody, or an antigen-binding fragment thereof, which selectively binds to the polypeptides of the invention, as well as to a method for assaying the presence of a polypeptide encoded by an isolated nucleic acid molecule of the invention in a sample, comprising contacting said sample with an antibody which specifically binds to the encoded polypeptide.
The invention also relates to methods of inhibiting G protein function in a cell comprising enhancing expression of RGS 1 OB or active portion thereof in the cell, whereby RGS l OB activates the intrinsic GTPase activity of a Gcx subunit of the G protein, thereby inhibiting G protein activity.
The invention further relates to a method of activating the GTPase activity of a Ga subunit comprising contacting said G« subunit with RGS 1 OB or active portion thereof. The invention further relates to a method of activating the GTPase activity of a Ga subunit in a cell comprising enhancing expression of RGS l OB or active portion thereof in the cell.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows a comparison of the amino acid sequences of RGS l OB
(upper row; SEQ ID NO: 2) and RGS 10 (lower row; SEQ ID NO: 3). Dashes 1 S indicate gaps introduced to maximize alignment.
Figure 2 shows the nucleotide sequence (SEQ ID NO: 1 ) of RGS l OB cDNA
and the deduced amino acid sequence (SEQ ID NO: 2) of RGS l OB.
DETAILED DESCRIPTION OF THE INVENTION
As described herein, a cDNA encoding RGS l OB was identified as follows:
First, human giant cell tumor of the bone was used to 1) construct a cDNA
library;
BACKGROUND OF THE INVENTION
The G-protein family plays an important role in signal transduction by linking extracellular receptors to intracellular signaling pathways. G
proteins are composed of cx, (3 and y subunits. In the inactive state, the a subunits bind to GDP.
They are activated by G-protein-coupled receptors (bound to ligand), which trigger the dissociation of GDP and the subsequent binding of GTP, to activate the cx subunit. The active state is terminated when the intrinsic GTPase activity contained within the cx subunit hydrolyzes GTP to GDP (Gold et al., J. Neurosci 17(20):8024-8037 (1997)).
A family of proteins, termed the regulator of G-protein signaling (RGS) proteins, has been identified which modulates G-protein function by activating the GTPase activity of oc subunits (Dohlman and Thorner, J. Biol. Chem. 272:3871-3874 (1997); Koelle, Curr. Opin. Cell. Biol. 9:143-147 (1997); Neer, Curr.
Biol.
7:831-R33 (1997)). These proteins reduce the duration of the active state of the tx subunit, which inhibits G-protein function.
SUMMARY OF THE INVENTION
Using a differential screening approach, partial cDNA sequences encoding genes which are selectively expressed in osteoclasts were identified. For one of these clones (designated clone 140), a full-length cDNA (designated RGS l OB) was identified from an osteoclastoma library. This clone encodes a 181 amino acid polypeptide which appears to represent an isoform of a previously described molecule, known as RGS 10. RGS 10 has been characterized as a selective activator of Ga 1 GTPase activity, and hence an inhibitor of G-protein signaling (Hunt et al., Nature 383:175-177 (1996). RGS10 is a member of a large family of G-protein inhibitors which is highly conserved throughout evolution.
_7_ The amino terminus of the RGS 1 OB sequence is distinct from that of RGS 10, with only 3 of 16 residues identical (Figure 1). In contrast, residues 17-181 are identical in the two molecules, and RGS 1 OB possess the highly conserved RGS
domain. Additionally, the 3' untranslated region (LTTR) of RGS 1 OB is also unique from that of RGS 10, indicating that RGS 1 OB is likely to be generated by alternative splicing of at least 2 novel exons.
Multiple tissue Northern analysis was performed using a 90-base pair probe to the amino terminus of RGS 1 OB. A 1 kb mRNA for RGS 1 OB was detected very strongly in the osteoclastoma tumor, but not in the stromal cell component of the tumor. RGS 1 OB mRNA was minimally expressed in HSB-2 (B lymphocyte cell line) and U937 (myelomonocytic cell line). All other human tissues and cell lines tested were negative for RGS 1 OB mRNA. These data demonstrate that RGS 1 OB
is selectively expressed in human osteoclasts. Thus, RGSlOB provides a marker for osteoclasts, and may also have an important regulatory function. Inhibiting RGS10B may up- or down-regulate bone resorption, depending on the pathway it normally inhibits.
Thus, the present invention relates to an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID
NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: l; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID
NO: 1; and the complement of nucleotides 547-831 of SEQ ID NO: 1. These nucleotide sequences are distinct from that of RGS 10 and can be utilized as probes specific for RGS l OB.
The invention also relates to an isolated nucleic acid molecule consisting essentially of or consisting of a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: l; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: l; nucleotides 547-831 of SEQ ID NO: 1; and the complement of nucleotides 547-831 of SEQ ID
NO: 1.
The invention further relates to a nucleic acid molecule which hybridizes under high stringency conditions to a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: l; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID NO: l; and the complement of nucleotides 547-831 of SEQ ID
NO: 1. The invention further relates to a nucleic acid molecule which is substantially identical to the nucleotide sequence of SEQ ID NO: 1 and which comprises nucleotides 1-48 of SEQ ID NO: 1.
The invention further provides a method for assaying the presence of a nucleic acid molecule (e.g., RGSlOB) in a sample, comprising contacting said sample with a second nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID
NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831of SEQ ID NO: l; and the complement of nucleotides 547-831of SEQ ID NO: l, under conditions appropriate for selective hybridization.
The invention also relates to a vector comprising an isolated nucleic acid molecule of the invention operatively linked to a regulatory sequence, as well as to a recombinant host cell comprising the vector. The invention also provides a method for preparing a polypeptide encoded by an isolated nucleic acid molecule, comprising culturing a recombinant host cell of the invention under conditions suitable for expression of said nucleic acid molecule.
The invention further provides an isolated polypeptide encoded by isolated nucleic acid molecules of the invention. In a particular embodiment, the polypeptide comprises an amino acid sequence selected from the group consisting of the amino acid sequence of SEQ ID NO: 2 and amino acids 1-16 of SEQ ID N0:2, or portion thereof. The invention also relates to an isolated polypeptide comprising an amino acid sequence which is substantially identical to the amino acid sequence of SEQ ID
NO: 2 and which comprises amino acids 1-16 of SEQ ID NO: 2.
The invention also relates to an antibody, or an antigen-binding fragment thereof, which selectively binds to the polypeptides of the invention, as well as to a method for assaying the presence of a polypeptide encoded by an isolated nucleic acid molecule of the invention in a sample, comprising contacting said sample with an antibody which specifically binds to the encoded polypeptide.
The invention also relates to methods of inhibiting G protein function in a cell comprising enhancing expression of RGS 1 OB or active portion thereof in the cell, whereby RGS l OB activates the intrinsic GTPase activity of a Gcx subunit of the G protein, thereby inhibiting G protein activity.
The invention further relates to a method of activating the GTPase activity of a Ga subunit comprising contacting said G« subunit with RGS 1 OB or active portion thereof. The invention further relates to a method of activating the GTPase activity of a Ga subunit in a cell comprising enhancing expression of RGS l OB or active portion thereof in the cell.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows a comparison of the amino acid sequences of RGS l OB
(upper row; SEQ ID NO: 2) and RGS 10 (lower row; SEQ ID NO: 3). Dashes 1 S indicate gaps introduced to maximize alignment.
Figure 2 shows the nucleotide sequence (SEQ ID NO: 1 ) of RGS l OB cDNA
and the deduced amino acid sequence (SEQ ID NO: 2) of RGS l OB.
DETAILED DESCRIPTION OF THE INVENTION
As described herein, a cDNA encoding RGS l OB was identified as follows:
First, human giant cell tumor of the bone was used to 1) construct a cDNA
library;
2) produce 3zP-labelled cDNA to use as a stromal cell+, osteoclast+ probe, and 3) produce (by culturing) a stromal cell population lacking osteoclasts. The presence of osteoclasts in the giant cell tumor was confirmed by histological staining for the osteoclast marker type S tartrate-resistant acid phosphatase (TRAP) and with the use of monoclonal antibody reagents.
The stromal cell population lacking osteoclasts was produced by dissociating cells of a giant cell tumor, then growing and passaging the cells in tissue culture until the cell population was homogeneous and appeared fibroblastic. The cultured W~ X1/21797 CA 02385460 2002-03-20 pCT~S00/25905 stromal cell population did not contain osteoclasts. The cultured stromal cells were then used to produce a stromal cell', osteoclast- 32P-labelled cDNA probe.
The cDNA library produced from the giant cell tumor of the bone was then screened in duplicate for hybridization to the cDNA probes: one screen was performed with the giant cell tumor cDNA probe (stromal cell+, osteoclast-), while a duplicate screen was performed using the cultured stromal cell cDNA probe (stromal cell+, osteoclasf). Hybridization to a stromal+, osteoclast+ probe, accompanied by failure to hybridize to a stromal+, osteoclast- probe, indicated that a clone contained nucleic acid sequences uniquely or selectively expressed by osteoclasts.
Selected clones were used to screen an osteoclastoma library and identify full length cDNA
sequences. As described herein, one of the selected clones (clone 140) was used to idenitfy a full length cDNA sequence termed RGS l OB.
The present invention has utility for the production and identification of nucleic acid probes useful for identifying RGS l OB DNA, including genomic DNA.
Nucleic acid molecules, polypeptides and antibodies described herein are also useful for identifying osteoclasts using well established techniques. Nucleic acid molecules of the present invention are also useful in gene therapy. For example, they can be used to alter the expression, e.g., in osteoclasts, of an aberrant gene product, or to correct aberrant expression of RGS l OB. The sequences described herein can further be used to cause RGS l OB expression in cells in which such expression does not ordinarily occur, e.g., in cells which are not osteoclasts.
The present invention relates to an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID NO: 1; and the complement of nucleotides 547-831 of SEQ ID NO: 1. These nucleotide sequences are distinct from that of RGS 10 and can be utilized as probes specific for RGS l OB. The invention also relates to an isolated nucleic acid molecule consisting essentially of or consisting of a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ >D NO: 1; nucleotides 547-831 of SEQ ID NO: 1; and the complement of nucleotides 547-831 of SEQ ID
NO: 1.
As appropriate, the isolated nucleic acid molecules of the present invention can be RNA, for example, mRNA, or DNA, such as cDNA and genomic DNA.
DNA molecules can be double-stranded or single-stranded; single stranded RNA
or DNA can be either the coding, or sense, strand or the non-coding, or antisense, strand. The nucleic acid molecule can include all or a portion of the coding sequence of a gene and can further comprise additional non-coding sequences such as introns and non-coding 3' and 5' sequences (including regulatory sequences, for example).
Additionally, the nucleic acid molecule can be fused to a marker sequence, for example, a sequence that encodes a polypeptide to assist in isolation or purification of the polypeptide. Such sequences include, but are not limited to, those which encode a glutathione-S-transferase (GST) fusion protein and those which encode a hemaglutin A (HA) polypeptide marker from influenza.
An "isolated" nucleic acid molecule, as used herein, is one that is separated from nucleic acids which normally flank the nucleic acid molecule in nature.
With regard to genomic DNA, the term "isolated" refers to nucleic acid molecules which are separated from the chromosome with which the genomic DNA is naturally associated. For example, the isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotides which flank the nucleic acid molecule in the genomic DNA of the cell from which the nucleic acid molecule is derived.
Moreover, an isolated nucleic acid molecule of the invention, such as a cDNA or RNA molecule, can be substantially free of other cellular material, or culture medium when produced by recombinant techniques, or chemical precursors or other chemicals when chemically synthesized. However, the nucleic acid molecule can be fused to other coding or regulatory sequences and still be considered isolated. In some instances, the isolated material will form part of a composition (for example, a crude extract containing other substances), buffer system or reagent mix. In other circumstances, the material may be purified to essential homogeneity, for example as determined by PAGE or column -7_ chromatography such as HPLC. Preferably, an isolated nucleic acid molecule comprises at least about 50, 80 or 90% (on a molar basis) of all macromolecular species present.
Further, recombinant DNA contained in a vector is included in the definition of "isolated" as used herein. Also, isolated nucleic acid molecules include recombinant DNA molecules in heterologous host cells, as well as partially or substantially purified DNA molecules in solution. "Isolated" nucleic acid molecules also encompass in vivo and in vitro RNA transcripts of the DNA molecules of the present invention.
In one embodiment, the variants hybridize under high stringency hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO: 1, the complement of SEQ ID NO:
l, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ
ID NO: l, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ ID NO: 1.
The invention further relates to a nucleic acid molecule which hybridizes under stringent, preferably highly stringent, conditions to a nucleotide sequence selected from the group consisting of SEQ ID NO: l; the complement of SEQ ID
NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID NO: l; and the complement of nucleotides 547-831 of SEQ ID NO: 1.
Stringent hybridization conditions for nucleic acid molecules are well known to those skilled in the art and can be found in standard texts such as Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1998), pp. 2.10.1-2.10.16 and 6.3.1-6.3.6, the teachings of which are hereby incorporated by reference.
As understood by those of ordinary skill, the exact conditions can be determined empirically and depend on ionic strength, temperature and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS. Other factors considered in determining the desired hybridization conditions include the length of the nucleic acid sequences, base composition, percent mismatch between _g_ the hybridizing sequences and the frequency of occurrence of subsets of the sequences within other non-identical sequences. Thus, equivalent conditions can be determined by varying one or more of these parameters while maintaining a similar degree of identity or similarity between the two nucleic acid molecules.
Typically, conditions are used such that sequences at least about 60%, at least about 70%, at least about 80%, at least about 90% or at least about 95% or more identical to each other remain hybridized to one another. In one non-limiting example, nucleic acid molecules are allowed to hybridize in 6X sodium chloride/sodium citrate (SSC) at about 4~ °C, followed by one or more low stringency washes in 0.2X
SSC/0.1 % SDS
at room temperature, or by one or more moderate stringency washes in 0.2X
SSC/0.1% SDS at 42°C, or washed in 0.2X SSC/0.1% SDSat 65°C
for high stringency.
The invention further relates to a nucleic acid molecule which is substantially identical (preferably greater than about 85 percent, more preferably greater than 1 S about 90 percent, and even more preferably greater than about 95 percent identical) to the nucleotide sequence of SEQ ID NO: 1 and which comprises nucleotides 1-of SEQ ID NO: 1. The percent identity of two nucleotide or amino acid sequences can be determined by aligning the sequences for optimal comparison purposes (e.g., gaps can be introduced in the sequence of a first sequence). The nucleotides or amino acids at corresponding positions are then compared, and the percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % identity = # of identical positions/total # of positions x 100). In certain embodiments, the length of a sequence aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 60%, and even more preferably at least 70%, 80% or 90% of the length of the reference sequence. The actual comparison of the two sequences can be accomplished by well-known methods, for example, using a mathematical algorithm. A preferred, non-limiting example of such a mathematical algorithm is described in Karlin et al., Proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993). Such an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) as described in Altschul et al., Nucleic Acids Res., 25:389-3402 (1997). When utilizing BLAST
and Gapped BLAST programs, the default parameters of the respective programs (e.g., NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. In one embodiment, parameters for sequence comparison can be set at score=100, wordlength=12, or can be varied (e.g., W=5 or W=20).
Another preferred, non-limiting example of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the CGC sequence alignment software package.
When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12 , and a gap penalty of 4 can be used.
Additional algorithms for sequence analysis are known in the art and include ADVANCE and ADAM as described in Torellis and Robotti (1994) Comput. Appl.
Biosci., 10:3-5; and FASTA described in Pearson and Lipman (1988) PNAS, 85:2444-8.
1 S In another embodiment, the percent identity between two amino acid sequences can be accomplished using the GAP program in the CGC software package (available at http://www.cgc.com) using either a Blossom 63 matrix or a PAM250 matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent identity between two nucleic acid sequences can be accomplished using the GAP program in the CGC software package (available at http://www.cgc.com), using a gap weight of 50 and a length weight of 3.
The present invention also provides isolated nucleic acid molecules that contain a fragment or portion that hybridizes under highly stringent conditions to a nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO:
1, the complement of SEQ ID NO: 1, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID NO: 1, nucleotides 54,7-831 of SEQ ID
NO: 1 and the complement of nucleotides 547-831 of SEQ ID NO: 1. The nucleic acid fragments of the invention are at least about 15, preferably at least about 18, 20, 23 or 25 nucleotides, and can be 30, 40, ~0, 100, 200 or more nucleotides in length.
Longer fragments, for example, 30 or more nucleotides in length, which encode antigenic proteins or polypeptides described herein are useful.
In a related aspect, the nucleic acid molecules of the invention are used as probes or primers in assays such as those described herein. "Probes" are oligonucleotides that hybridize in a base-specific manner to a complementary strand of nucleic acid molecules. Such probes include polypeptide nucleic acids, as described in Nielsen et al., Science, 254, 1497-1500 (1991). Typically, a probe comprises a region of nucleotide sequence that hybridizes under highly stringent conditions to at least about 15, typically about 20-25, and more typically about 40, 50 or 75, consecutive nucleotides of a nucleic acid molecule comprising a nucleotide sequence selected from SEQ ID NO: 1, the complement of SEQ ID NO: l, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID
NO: l, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ ID NO: 1. The probe may also comprise additional nucleotides on either or both ends of the molecule, which preferably do not interfere with the ability of the molecule to hybridize to a nucleotide sequence. More typically, the probe further comprises a label, e.g., radioisotope, fluorescent compound, enzyme, or enzyme co-factor.
As used herein, the term "primer" refers to a single-stranded oligonucleotide which acts as a point of initiation of template-directed DNA synthesis using well-known methods (e.g., PCR, LCR) including, but not limited to those described herein. The appropriate length of the primer depends on the particular use, but typically ranges from about 15 to 30 nucleotides.
The nucleic acid molecules of the invention such as those described above can be identified and isolated using standard molecular biology techniques and the sequence information provided in SEQ ID NO: 1. For example, nucleic acid molecules can be amplified and isolated by the polymerase chain reaction using synthetic oligonucleotide primers designed based on one or more of the sequences provided in SEQ ID NO: 1, the complement of SEQ ID NO: l, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID NO: l, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ ID
NO: 1. See generally PCR Technology: Principles and Applications for DNA
Amplification (ed. H.A. Erlich, Freeman Press, NY, NY, 1992); PCR Protocols: A
Guide to Methods and Applications (Eds. Innis, et al., Academic Press, San Diego, CA, 1990); Manila et al., Nucleic Acids Res., 19:4967 (1991); Eckert et al., PCR
Methods and Applications, 1:17 (1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Patent 4,683,202. The nucleic acid molecules can be amplified using cDNA, mRNA or genomic DNA as a template, cloned into an appropriate vector and characterized by DNA sequence analysis.
Other suitable amplification methods include the ligase chain reaction (LCR) (see Wu and Wallace, Genomics, 4:560 (1989), Landegren et al., Science, 241:1077 (1988), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA, 86:1173 (1989)), and self sustained sequence replication (Guatelli et al., Proc. Nat.
Acad. Sci. USA, 87:1874 (1990)) and nucleic acid based sequence amplification (NASBA). The latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of about or 100 to 1, respectively.
The amplified DNA can be radiolabelled and used as a probe for screening a cDNA library derived from osteoclasts, e.g., human osteoclasts, mRNA in zap express, ZIPLOX or other suitable vector. Corresponding clones can be isolated, DNA can obtained following in vivo excision, and the cloned insert can be sequenced in either or both orientations by art recognized methods to identify the correct reading frame encoding a protein of the appropriate molecular weight.
For example, the direct analysis of the nucleotide sequence of nucleic acid molecules of the present invention can be accomplished using well-known methods that are commercially available. See, for example, Sambrook et al., Molecular Cloning, A
Laboratory Manual (2nd Ed., CSHP, New York 1989); Zyskind et al., Recombinant DNA Laboratorv Manual, (Acad. Press, 1988)). Using these or similar methods, the protein and the DNA encoding the protein can be isolated, sequenced and further characterized.
Antisense nucleic acid molecules of the invention can be designed using the nucleotide sequences of SEQ ID NO: 1, the complement of SEQ ID NO: l, or a portion thereof, such as nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID NO: 1, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ >D NO: 1, and constructed using chemical synthesis and enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid molecule (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used. Alternatively, the antisense nucleic acid molecule can be produced biologically using an expression vector into which a nucleic acid molecule has been subcloned in an antisense orientation (i.e., RNA
transcribed from the inserted nucleic acid molecule will be of an antisense orientation to a target nucleic acid of interest).
In general, the isolated nucleic acid sequences can be used as molecular weight markers on Southern gels, and as chromosome markers which are labeled to map related gene positions. The nucleic acid sequences can also be used to compare with endogenous DNA sequences in patients to identify genetic disorders, and as probes, such as to hybridize and discover related DNA sequences or to subtract out known sequences from a sample. The nucleic acid sequences can further be used to derive primers for genetic fingerprinting, to raise anti-protein antibodies using DNA
immunization techniques, and as an antigen to raise anti-DNA antibodies or elicit immune responses. Additionally, the nucleotide sequences of the invention can be used to identify and express recombinant proteins for analysis, characterization or therapeutic use, or as markers for tissues in which the corresponding protein is expressed, either constitutively, during tissue differentiation, or in diseased states.
Another aspect of the invention pertains to nucleic acid constructs containing a nucleic acid molecule selected from the group consisting of SEQ m NO: 1 and the complement of SEQ ID NO: 1 (or a portion thereofj. The constructs comprise a vector (e.g., an expression vector) into which a sequence of the invention has been inserted in a sense or antisense orientation. As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
Moreover, certain vectors, expression vectors, are capable of directing the expression of genes to which they are operably linked. In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. However, the invention is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses) that serve equivalent functions.
Preferred recombinant expression vectors of the invention comprise a nucleic acid molecule of the invention in a form suitable for expression of the nucleic acid molecule in a host cell. This means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, which is operably linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequences) in a manner which allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). The term "regulatory sequence" is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals).
Such regulatory sequences are described, for example, in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
Regulatory sequences include those which direct constitutive expression of a nucleotide sequence in many types of host cell and those which direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed and the level of expression of protein desired. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acid molecules as described herein .
The recombinant expression vectors of the invention can be designed for expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g., bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic acid molecule of the invention can be expressed in bacterial cells (e.g., E.
coli), insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those skilled in the art.
Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing a foreign nucleic acid molecule (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAF-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al. (supra), and other laboratory manuals.
For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is generally introduced into the host cells along with the gene of interest.
Preferred selectable markers include those that confer resistance to drugs, such as 6418, hygromycin and methotrexate. Nucleic acid molecules encoding a selectable marker can be introduced into a host cell on the same vector as the nucleic acid molecule of the invention or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid molecule can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).
A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) a polypeptide of the invention.
Accordingly, the invention further provides methods for producing a polypeptide using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding a polypeptide of the invention has been introduced) in a suitable medium such that the polypeptide is produced. In another embodiment, the method further comprises isolating the polypeptide from the medium or the host cell.
The host cells of the invention can also be used to produce nonhuman transgenic animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem cell into which a nucleic acid molecule of the invention has been introduced. Such host cells can then be used to create non-human transgenic animals in which exogenous nucleotide sequences have been introduced into the genome or homologous recombinant animals in which endogenous nucleotide sequences have been altered. Such animals are useful for studying the function and/or activity of the nucleotide sequence and polypeptide encoded by the sequence and for identifying and/or evaluating modulators of their activity. As used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal includes a transgene. Other examples of transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens and amphibians.
A
transgene is exogenous DNA which is integrated into the genome of a cell from which a transgenic animal develops and which remains in the genome of the mature animal, thereby directing the expression of an encoded gene product in one or more cell types or tissues of the transgenic animal. As used herein, an "homologous recombinant animal" is a non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous gene has been altered by homologous recombination between the endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to development of the animal.
Methods for generating transgenic animals via embryo manipulation and microinjection, particularly animals such as mice, have become conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Methods for constructing homologous recombination vectors and homologous recombinant animals are described further in Bradley (1991) Current Opinion in BiolTechnology, 2:823-829 and in PCT Publication Nos. WO 90/11354, WO 91/01140, WO 92/0968, and WO 93/04169. Clones of the non-human transgenic animals described herein can also be produced according to the methods described in Wilmut et al.
(1997) Nature, 385:810-813 and PCT Publication Nos. WO 97/07668 and WO 97/07669.
The present invention also provides isolated polypeptides and variants and fragments thereof that are encoded by the nucleic acid molecules of the invention.
For example, as described above, the nucleotide sequences can be used to design primers to clone and express cDNAs encoding the polypeptides of the invention.
As used herein, a polypeptide is said to be "isolated" or "purified" when it is substantially free of cellular material when it is isolated from recombinant and non-recombinant cells, or free of chemical precursors or other chemicals when it is chemically synthesized. A polypeptide, however, can be joined to another polypeptide with which it is not normally associated in a cell and still be "isolated"
or "purified."
The polypeptides of the invention can be purified to homogeneity. It is understood, however, that preparations in which the polypeptide is not purified to homogeneity are useful. The critical feature is that the preparation allows for the desired function of the polypeptide, even in the presence of considerable amounts of other components. Thus, the invention encompasses various degrees of purity.
In one embodiment, the polypeptide is substantially free of cellular material, which includes preparations of the polypeptide having less than about 30% (by dry weight) other proteins (i.e., contaminating protein), less than about 20% other proteins, less than about 10% other proteins, or less than about 5% other proteins.
When a polypeptide is recombinantly produced, it can also be substantially free of culture medium, i.e., culture medium represents less than about 20%, less than about 10%, or less than about 5% of the volume of the protein preparation. In one embodiment, the polypeptide is substantially free of chemical precursors or other chemicals, which includes preparations of the polypeptide in which it is separated from chemical precursors or other chemicals that are involved in its synthesis. In one embodiment, the language "substantially free of chemical precursors or other chemicals" includes preparations of the polypeptide having less than about 30% (by dry weight) chemical precursors or other chemicals, less than about 20% chemical precursors or other chemicals, less than about 10% chemical precursors or other chemicals, or less than about 5% chemical precursors or other chemicals.
In one embodiment, a polypeptide comprises an amino acid sequence encoded by a nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1 and nucleotides 1-48 of SEQ ID NO: l and complements and portions thereof, e.g., SEQ ID NO: 2 or a portion thereof (e.g., amino acids 1-16 of SEQ ID NO: 2). However, the invention also encompasses sequence variants, provided that the sequence contains a characteristic feature of the RGSlOB sequence (e.g., amino acids 1-16 of SEQ ID NO: 2). Variants include a substantially homologous protein encoded by the same genetic locus in an organism, i.e., an allelic variant. Variants also encompass proteins derived from other genetic loci in an organism, but having substantial homology to a polypeptide encoded by a nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1 and complements and portions thereof. Variants also include proteins substantially homologous or identical to these polypeptides but derived from another organism, i.e., an ortholog. Variants also include proteins that are substantially homologous or identical to these polypeptides that are produced by chemical synthesis. Variants also include proteins that are substantially identical to these polypeptides that are produced by recombinant methods.
To determine the percent homology or identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the sequence of one protein or nucleic acid molecule for optimal alignment with the other protein or nucleic acid molecule).
The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in one sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the other sequence, then the molecules are homologous at that position. As used herein, amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity". The percent homology between the two sequences is a function of the number of identical positions shared by the sequences (i.e., percent homology or identity equals the number of identical positions/total number of positions times 100).
The invention also encompasses polypeptides having a lower degree of identity but having sufficient similarity so as to perform one or more of the same functions performed by a polypeptide encoded by a nucleic acid molecule of the invention. Similarity is determined by conserved amino acid substitution. Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of like characteristics. Conservative substitutions are likely to be phenotypically silent. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, and Ile; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gln, exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. Guidance concerning which amino acid changes are likely to be phenotypically silent are found in Bowie et al., Science 247:1306-1310 (1990).
A variant polypeptide can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these. Further, variant polypeptides can be fully functional or can lack function in one or more activities. Fully functional variants typically contain only conservative variation or variation in non-critical residues or in non-critical regions. Functional variants can also contain substitution of similar amino acids that result in no change or an insignificant change in function.
Alternatively, such substitutions may positively or negatively affect function to some degree. Non-functional variants typically contain one or more non-conservative amino acid substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical region.
Amino acids that are essential for function can be identified by methods known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al., Science, 244:1081-1085 (1989)). The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for biological activity in vitro, or in vitro proliferative activity. Sites that are critical for polypeptide activity can also be determined by structural analysis such as crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et al., J. Mol. Biol., 224:899-904 (1992); de Vos et al. Science, 255:306-312 (1992)).
The invention also includes polypeptide portions and fragments of the polypeptides of the invention. As used herein, the terms "fragment" and "portion"
are intended to a equivalents. Fragments can be derived from a polypeptide encoded by a nucleic acid molecule comprising SEQ ID NO: 1 or a portion thereof and the complements thereof. However, the invention also encompasses fragments of all variants of the polypeptides described herein. As used herein, a fragment or portion comprises at least 6 contiguous amino acids. Useful fragments include those that retain one or more of the biological activities of the polypeptide as well as fragments that can be used as an immunogen to generate polypeptide-specific antibodies.
Biologically active fragments (peptides which are, for example, 6, 9, 12, 15, 16, 20, 30, 35, 36, 37, 38, 39, 40, 50, 100 or more amino acids in length) can comprise a domain, segment, or motif that has been identified by analysis of the polypeptide sequence using well-known methods, e.g., signal peptides, extracellular domains, one or more transmembrane segments or loops, ligand binding regions, zinc finger domains, DNA binding domains, acylation sites, glycosylation sites, or phosphorylation sites.
Fragments can be discrete (not fused to other amino acids or polypeptides) or can be within a larger polypeptide. Further, several fragments can be comprised within a single larger polypeptide. In one embodiment a fragment designed for expression in a host can have heterologous pre- and pro-polypeptide regions fused to the amino terminus of the polypeptide fragment and an additional region fused to the carboxyl terminus of the fragment.
The invention thus provides chimeric or fusion proteins. These comprise a polypeptide of the invention operatively linked to a heterologous protein having an amino acid sequence not substantially homologous to the polypeptide.
"Operatively linked" indicates that the polypeptide protein and the heterologous protein are fused in-frame. The heterologous protein can be fused to the N-terminus or C-terminus of the polypeptide. In one embodiment the fusion protein does not affect function of the polypeptide per se. For example, the fusion protein can be a GST-fusion protein in which the polypeptide sequences are fused to the C-terminus of the GST
sequences. Other types of fusion proteins include, but are not limited to, enzymatic fusion proteins, for example beta-galactosidase fusions, yeast two-hybrid GAL
fusions, poly-His fusions and Ig fusions. Such fusion proteins, particularly poly-His fusions, can facilitate the purification of recombinant polypeptide. In certain host cells (e.g., mammalian host cells), expression and/or secretion of a protein can be increased by using a heterologous signal sequence. Therefore, in another embodiment, the fusion protein contains a heterologous signal sequence at its N-terminus.
EP-A-O 464 533 discloses fusion proteins comprising various portions of immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262).
In drug discovery, for example, human proteins have been fused with Fc portions for the purpose of high-throughput screening assays to identify antagonists.
Bennett et al., Journal ofMolecular Recognition, 8:52-58 (1995) and Johanson et al., The Journal ofBiological Chemistry, 270,16:9459-9471 (1995). Thus, this invention also encompasses soluble fusion proteins containing a polypeptide of the invention and various portions of the constant regions of heavy or light chains of immunoglobulins of various subclass (IgG, IgM, IgA, IgE).
A chimeric or fusion protein can be produced by standard recombinant DNA
techniques. For example, DNA fragments coding for the different protein sequences are ligated together in-frame in accordance with conventional techniques. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of nucleic acid fragments can be carned out using anchor primers which give rise to complementary overhangs between two consecutive nucleic acid fragments which can subsequently be annealed and re-amplified to generate a chimeric nucleic acid sequence (see Ausubel et al., Current Protocols in Molecular Biology, 1992).
Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST protein). A nucleic acid molecule encoding a polypeptide of the invention can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the polypeptide protein.
The isolated polypeptide can be purified from cells that naturally express it, such as from osteoclasts, purified from cells that have been altered to express it (recombinant), or synthesized using known protein synthesis methods. In one embodiment, the protein is produced by recombinant DNA techniques. For example, a nucleic acid molecule encoding the polypeptide is cloned into an expression vector, the expression vector introduced into a host cell and the protein expressed in the host cell. The protein can then be isolated from the cells by an appropriate purification scheme using standard protein purification techniques.
In general, polypeptides or proteins of the present invention can be used as a molecular weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns using art-recognized methods. The polypeptides of the present invention can be used to raise antibodies or to elicit an immune response. The polypeptides can also be used as a reagent, e.g., a labeled reagent, in assays to quantitatively determine levels of the protein or a molecule to which it binds (e.g., a receptor or a ligand) in biological fluids. The polypeptides can also be used as markers for cells or tissues in which the corresponding protein is preferentially expressed, either constitutively, during tissue differentiation, or in a diseased state. The polypeptides can be used to isolate a corresponding binding partner, e.g., receptor or ligand, such as, for example, in an interaction trap assay, and to screen for peptide or small molecule antagonists or agonists of the binding interaction.
In another aspect, the invention provides antibodies to the polypeptides and polypeptide fragments of the invention, e.g., having an amino acid sequence encoded by a nucleic acid molecule comprising all or a portion of SEQ ID NO:
1.
The term "antibody" as used herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site that specifically binds an antigen. A molecule that specifically binds to a polypeptide of the invention is a molecule that binds to that polypeptide or a fragment thereof, but does not substantially bind other molecules in a sample, e.g., a biological sample, which naturally contains the polypeptide.
Examples of immunologically active portions of immunoglobulin molecules include Flab) and F(ab')2 fragments which can be generated by treating the antibody with an enzyme such as pepsin. The invention provides polyclonal and monoclonal antibodies that bind to a polypeptide of the invention. The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, refers to a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope of a polypeptide of the invention. A monoclonal antibody composition thus typically displays a single binding affinity for a particular polypeptide of the invention with which it immunoreacts.
Polyclonal antibodies can be prepared as described above by immunizing a suitable subject with a desired immunogen, e.g., polypeptide of the invention or fragment thereof. The antibody titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) using immobilized polypeptide. If desired, the antibody molecules directed against the polypeptide can be isolated from the mammal (e.g., from the blood) and further purified by well-known techniques, such as protein A
chromatography to obtain the IgG fraction. At an appropriate time after immunization, e.g., when the antibody titers are highest, antibody-producing cells can be obtained from the subject and used to prepare monoclonal antibodies by standard techniques, such as the hybridoma technique originally described by Kohler and Milstein (1975) Nature, 256:495-497, the human B cell hybridoma technique (Kozbor et al. (1983) Immunol. Today, 4:72), the EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp.
77-96) or trioma techniques. The technology for producing hybridomas is well known (see generally Current Protocols in Immunology (1994) Coligan et al.
(eds.) John Wiley & Sons, Inc., New York, NY). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal immunized with an immunogen as described above, and the culture supernatants of the resulting hybridoma cells are screened to identify a hybridoma producing a monoclonal antibody that binds a polypeptide of the invention.
Any of the many well known protocols used for fusing lymphocytes and immortalized cell lines can be applied for the purpose of generating a monoclonal antibody to a polypeptide of the invention (see, e.g., Current Protocols in Immunology, supra; Galfre et al. (1977) Nature, 266:55052; R.H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing Corp., New York, New York (1980); and Lerner (1981) Yale J. Biol.
Med., 54:387-402. Moreover, the ordinarily skilled worker will appreciate that there are many variations of such methods that also would be useful.
Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal antibody to a polypeptide of the invention can be identified and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage display library) with the polypeptide to thereby isolate immunoglobulin library members that bind the polypeptide. Kits for generating and screening phage display libraries are commercially available (e.g., the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-O1; and the Stratagene SurfZAPTM Phage Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly amenable for use in generating and screening antibody display library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No.
WO
92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791;
PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT
Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT
Publication No. WO 90/02809; Fuchs et al. (1991) BiolTechnology, 9:1370-1372;
Hay et al. (1992) Hum. Antibod. Hybridomas, 3:81-85; Huse et al. (1989) Science, 246:1275-1281; Griffiths et al. (1993) EMBO J., 12:725-734.
Additionally, recombinant antibodies, such as chimeric and humanized monoclonal antibodies, comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are within the scope of the invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art.
In general, antibodies of the invention (e.g., a monoclonal antibody) can be used to isolate a polypeptide of the invention by standard techniques, such as affinity chromatography or immunoprecipitation. A polypeptide-specific antibody can facilitate the purification of natural polypeptide from cells and of recombinantly produced polypeptide expressed in host cells. Moreover, an antibody specific for a polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and pattern of expression of the polypeptide. Antibodies can be used diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can be facilitated by coupling the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, (-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidinlbiotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable radioactive material include'ZSI,'3'I, 35S or 3H.
Portions or fragments of the nucleotide sequences identified herein (and the corresponding complete gene sequences) can be used in numerous ways as polynucleotide reagents. For example, these sequences can be used to: (i) map their respective genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (ii) identify an individual from a minute biological sample (tissue typing); and (iii) aid in forensic identification of a biological sample.
The present invention also pertains to diagnostic assays for determining protein and/or nucleic acid expression as well as activity of proteins of the invention, in the context of a biological sample (e.g., blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with a disease or disorder, or is at risk of developing a disorder, associated with aberrant expression or activity. The invention also provides for prognostic (or predictive) assays for determining whether an individual is at risk of developing a disorder associated with activity or expression of proteins or nucleic acid molecules of the invention.
Disorders which may be treated or diagnosed by methods described herein include, but are not limited to, diseases characterized by an increase in bone density, including osteopetrosis, osteosclerosis, pyknodysostosis, osteomyelosclerosis, hyperphosphatasia, progressive diaphyseal dysplasia, melorheostosis, osteopoikilosis, hyperostosis frontalis interna, sclerostosis, McCune-Albright syndrome and spondyloepiphyseal dysplasia; disorders characterized by a decrease in bone density, including osteoarthritis, Maroteaux-Lamy syndrome and osteoporosis; and disorders of bone formation, including osteogenisis imperfecta, sutural craneosynostosis, osteomalacia and cleidocranial dysplasia, and bone cancers such as osteosarcomas. Additionally, the methods described herein may be useful for the treatment or diagnosis of bone disorders associated with cancer, anorexia nervosa, and auto-immune disorders.
For example, mutations in the gene can be assayed in a biological sample.
Such assays can be used for prognostic or predictive purpose to thereby prophylactically treat an individual prior to the onset of a disorder characterized by or associated with expression or activity of nucleic acid molecules or proteins of the invention.
An exemplary method for detecting the presence or absence of proteins or nucleic acid molecules of the invention in a biological sample involves obtaining a biological sample from a test subject and contacting the biological sample with a compound or an agent capable of detecting the protein, or nucleic acid molecule (e.g., mRNA, genomic DNA) that encodes the protein, such that the presence of the protein or nucleic acid molecule is detected in the biological sample. A
preferred agent for detecting mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to mRNA or genomic DNA sequences described herein. The nucleic acid probe can be, for example, a full-length nucleic acid molecule, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to appropriate mRNA or genomic DNA. For example, the nucleic acid probe can be all or a portion of SEQ ID NO: l, or the complement of SEQ ID NO: l, or a portion thereof. Other suitable probes for use in the diagnostic assays of the invention are described herein.
In one embodiment, the agent for detecting proteins of the invention is an antibody capable of binding to the protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')z) can be used. The term "labeled", with regard to the probe or antibody, is intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. The term "biological sample" is intended to include tissues, calls and biological fluids isolated from a subject, as well as tissues, cells and fluids present within a subject. That is, the detection method of the invention can be used to detect mRNA, protein, or genomic DNA of the invention in a biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for detection of protein include enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for detection of genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for detection of protein include introducing into a subject a labeled anti-protein antibody. For example, the antibody can be labeled with a radioactive marker whose presence and location in a subject can be detected by standard imaging techniques.
In one embodiment, the biological sample contains protein molecules from the test subject. Alternatively, the biological sample can contain mRNA
molecules from the test subject or genomic DNA molecules from the test subject. A
preferred biological sample is a serum sample or lung tissue biopsy isolated by conventional means from a subj ect.
In another embodiment, the methods further involve obtaining a control biological sample from a control subject, contacting the control sample with a compound or agent capable of detecting protein, mRNA, or genomic DNA of the invention, such that the presence of protein, mRNA or genomic DNA is detected in the biological sample, and comparing the presence of protein, mRNA or genomic DNA in the control sample with the presence of protein, mRNA or genomic DNA in the test sample.
The invention also encompasses kits for detecting the presence of proteins or nucleic acid molecules of the invention in a biological sample. For example, the kit can comprise a labeled compound or agent capable of detecting protein or mRNA
in a biological sample; means for determining the amount of in the sample; and means for comparing the amount of in the sample with a standard. The compound or agent can be packaged in a suitable container. The kit can further comprise instructions for using the kit to detect protein or nucleic acid molecules.
The diagnostic methods described herein can furthermore be utilized to identify subjects having or at risk of developing a disease or disorder associated with aberrant expression or activity of proteins and nucleic acid molecules of the invention. For example, the assays described herein, such as the preceding diagnostic assays or the following assays can be utilized to identify a subject having or at risk of developing a disorder associated with protein or nucleic acid expression or activity such as a proliferative disorder, a differentiative or developmental disorder, or a hematopoietic disorder. Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for developing a differentiative or proliferative disease (e.g., cancer) particularly of the bones. Thus, the present invention provides a method for identifying a disease or disorder associated with aberrant expression or activity of proteins or nucleic acid molecules of the invention, in which a test sample is obtained from a subject and protein or nucleic acid molecule (e.g., mRNA, genomic DNA) is detected, wherein the presence of protein or nucleic acid molecule is diagnostic for a subject having or at risk of developing a disease or disorder associated with aberrant expression or activity of the protein or nucleic acid sequence of the invention. As used herein, a "test sample" refers to a biological sample obtained from a subject of interest. For example, a test sample can be a biological fluid (e.g., serum), cell sample (e.g., osteoclasts), or tissue sample.
The invention provides a method (also referred to herein as a "screening assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g., antisense, polypeptides, peptidomimetics, small molecules or other drugs) which bind to nucleic acid molecules, polypeptides or proteins described herein or have a stimulatory or inhibitory effect on, for example, expression or activity of the nucleic acid molecules, polypeptides or proteins of the invention.
In one embodiment, the invention provides assays for screening candidate or test compounds that bind to or modulate the activity of protein or polypeptide described herein or biologically active portion thereof. The test compounds of the present invention can be obtained using any of the numerous approaches in combinatorial library methods known in the art, including: biological libraries;
spatially addressable parallel solid phase or solution phase libraries;
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library method;
and synthetic library methods using affinity chromatography selection. The biological library approach is limited to polypeptide libraries, while the other four approaches are applicable to polypeptide, non-peptide oligomer or small molecule libraries of compounds (Lam, K.S. (1997) Anticancer Drug Des., 12:145).
In another embodiment, an assay is a cell-based assay comprising contacting a cell expressing a particular target molecule described herein with a test compound and determining the ability of the test compound to modulate or alter (e.g.
stimulate or inhibit) the activity of the target molecule. Determining the ability of the test compound to modulate the activity of the target molecule can be accomplished, for example, by determining the ability of a known ligand to bind to or interact with the target molecule. For example, the ability of RGS 1 OB to interact with and/or activate the GTPase activity of a Gcx subunit can be determined.
In yet another embodiment, an assay of the present invention is a cell-free assay in which protein of the invention or biologically active portion thereof is contacted with a test compound and the ability of the test compound to bind to the protein or biologically active portion thereof is determined. Binding of the test compound to the protein can be determined either directly or indirectly as described above. In one embodiment, the assay includes contacting the protein or biologically active portion thereof with a known compound which binds the protein to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with the protein. Determining the ability of the test compound to interact with the protein comprises determining the ability of the test compound to preferentially bind to the protein or biologically active portion thereof as compared to the known compound.
In another embodiment, the assay is a cell-free assay in which a protein of the invention or biologically active portion thereof is contacted with a test compound and the ability of the test compound to modulate or alter (e.g., stimulate or inhibit) the activity of the protein or biologically active portion thereof is determined. Determining the ability of the test compound to modulate the activity of the protein can be accomplished, for example, by determining the ability of the protein to bind to a known target molecule by one of the methods described above for determining direct binding. In an alternative embodiment, determining the ability of the test compound to modulate the activity of a protein of the invention can be accomplished by determining the ability of the protein to further modulate the activity of a target molecule. For example, the catalytic/enzymatic activity of the target molecule on an appropriate substrate can be determined as previously described.
In yet another embodiment, the cell-free assay involves contacting a protein of the invention or biologically active portion thereof with a known compound which binds the protein to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with the protein, wherein determining the ability of the test compound to interact with the protein comprises determining the ability of the protein to preferentially bind to or modulate the activity of a target molecule.
In more than one embodiment of the above assay methods of the present invention, it may be desirable to immobilize either the protein or its target molecule to facilitate separation of complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate automation of the assay. Binding of a test compound to the protein, or interaction of the protein with a target molecule in the presence and absence of a candidate compound, can be accomplished in any vessel suitable for containing the reactants. Examples of such vessels include microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein (e.g., a glutathione-S-transferase fusion protein) can be provided which adds a domain that allows one or both of the proteins to be bound to a matrix.
In another embodiment, modulators of expression of nucleic acid molecules of the invention are identified in a method wherein a cell is contacted with a candidate compound and the expression of appropriate mRNA or protein in the cell is determined. The level of expression of appropriate mRNA or protein in the presence of the candidate compound is compared to the level of expression of mRNA or protein in the absence of the candidate compound. The candidate compound can then be identified as a modulator of expression based on this comparison. For example, when expression of mRNA or protein is greater (statistically significantly greater) in the presence of the candidate compound than in its absence, the candidate compound is identified as a stimulator or enhancer of the mRNA or protein expression. Alternatively, when expression of the mRNA or protein is less (statistically significantly less) in the presence of the candidate compound than in its absence, the candidate compound is identified as an inhibitor of the mRNA or protein expression. The level of mRNA or protein expression in the cells can be determined by methods described herein for detecting mRNA or protein.
This invention further pertains to novel agents identified by the above-described screening assays. Accordingly, it is within the scope of this invention to further use an agent identified as described herein in an appropriate animal model. For example, an agent identified as described herein (e.g., a modulating agent, an antisense nucleic acid molecule, a specific antibody, or a protein-binding partner) can be used in an animal model to determine the efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an agent identified as described herein can be used in an animal model to determine the mechanism of action of such an agent. Furthermore, this invention pertains to uses of novel agents identified by the above-described screening assays for treatments as described herein.
The present invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant expression or activity of proteins or nucleic acid molecules of the invention. In one aspect, the invention provides a method for preventing in a subject, a disease or condition associated with aberrant expression or activity of genes or proteins of the present invention, by administering to the subject an agent which modulates expression or at least one activity of a gene or protein of the invention. Subjects at risk for a disease that is caused or contributed to by aberrant gene expression or protein activity can be identified by, for example, any or a combination of diagnostic or prognostic assays as described herein.
Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its progression. Depending on the type of aberrancy, for example, an agonist or antagonist agent can be used for treating the subject. The appropriate agent can be determined based on screening assays described herein.
Another aspect of the invention pertains to methods of modulating expression or activity of genes or proteins of the invention for therapeutic purposes.
The modulatory method of the invention involves contacting a cell with an agent that modulates one or more of the activities of the specified protein associated with the cell. An agent that modulates protein activity can be an agent as described herein, such as a nucleic acid molecule or a protein, a naturally-occurring target molecule of a protein described herein, a polypeptide, a peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or more protein activities. Examples of such stimulatory agents include active protein as well as a nucleic acid molecule encoding the protein that has been introduced into the cell. In another embodiment, the agent inhibits one or more protein activities.
Examples of such inhibitory agents include antisense nucleic acid molecules and anti-protein antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present invention provides methods of treating an individual afflicted with a disease or disorder characterized by aberrant expression or activity of a protein or nucleic acid molecule of the invention. In one embodiment, the method involves administering an agent (e.g., an agent identified by a screening assay described herein), or combination of agents that modulates (e.g., upregulates or downregulates) expression or activity of a gene or protein of the invention. In another embodiment, the method involves administering a protein or nucleic acid molecule of the invention as therapy to compensate for reduced or aberrant expression or activity of the protein or nucleic acid molecule.
Stimulation of protein activity is desirable in situations in which the protein is abnormally downregulated and/or in which increased protein activity is likely to have a beneficial effect. Likewise, inhibition of protein activity is desirable in situations in which the protein is abnormally upregulated and/or in which decreased protein activity is likely to have a beneficial effect. One example of such a situation is where a subject has a disorder characterized by aberrant development or cellular differentiation. The molecules of the present invention, as well as agents, or modulators which have a stimulatory or inhibitory effect on the protein activity (e.g., gene expression) as identified by a screening assay described herein can be administered to individuals to treat (prophylactically or therapeutically) disorders (e.g., proliferative or developmental disorders) associated with aberrant protein activity.
The invention will be further described by the following non-limiting examples. The teachings of all publications cited herein are incorporated herein by reference in their entirety.
EXAMPLES
Osteoclast cDNA Library Construction Messenger RNA (mRNA) obtained from a human osteoclastoma (giant cell tumor of bone) was used to construct an osteoclastoma cDNA library.
Osteoclastomas are actively bone resorptive tumors, but are usually non-metastatic.
In cryostat sections, osteoclastomas consist of ~30% multinucleated cells positive for tartrate resistant acid phosphatase (TRAP), a widely utilized phenotypic marker specific in vivo for osteoclasts (Minkin, Calcif. Tissue Int. 34:285-290 (1982)). The remaining cells are uncharacterized stromal cells, a mixture of cell types with fibroblastic/mesenchymal morphology.
Monoclonal antibody reagents were used to partially characterize the surface phenotype of the multinucleated cells in the giant cell tumors of long bone.
In frozen sections, all multinucleated cells expressed CD68, which has previously been reported to define an antigen specific for both osteoclasts and macrophages (Norton, M.A. and M.H. Helfrich, In Biology and Physiology of the Osteoclast, B.R.
Rifkin and C.V. Gay, editors, CRC Press, Inc. Boca Raton, FL, 33-54 (1992)). In contrast, no staining of giant cells was observed for CD1 1b or CD14 surface antigens, which are present on monocyte/ macrophages and granulocytes (Arnaout, M.A. et al. J.
Cell. Physiol. 137:305 (1988); Haziot, A. et al. J. Immunol. 141:547 (1988)).
Cytocentrifuge preparations of human peripheral blood monocytes were positive for CD68, CDl 1b, and CD14. These results demonstrate that the multinucleated giant cells of osteoclastomas have a phenotype which is distinct from that of macrophages, and which is consistent with that of osteoclasts.
Osteoclastoma tissue was snap frozen in liquid nitrogen and used to prepare poly A+ mRNA according to standard methods. cDNA cloning into a pcDNAII
vector was carried out using a commercially-available kit (Librarian, InVitrogen).
Approximately 2.6 x 106 clones were obtained, >95% of which contained inserts of an average length 0.6 kB.
Stromal Cell mRNA Preparation A portion of each osteoclastoma was snap frozen in liquid nitrogen for mRNA preparation. The remainder of the tumor was dissociated using brief trypsinization and mechanical disaggregation, and placed into tissue culture.
These cells were expanded in Dulbecco's MEM (high glucose, Sigma) supplemented with 10% newborn calf serum (MA Bioproducts), gentamycin (0.5 mg/ml), 1-glutamine (2 mM) and non-essential amino acids (0.1 mM, Gibco). The stromal cell population was passaged at least five times, after which it showed a homogenous, fibroblastic-looking cell population that contained no multinucleated cells.
The stromal cells were mononuclear, tested negative for acid phosphatase, and tested variably alkaline phosphatase positive. These findings indicate that propagated stromal cells (i.e., stromal cells that are passaged in culture) are non-osteoclastic and non-activated.
Identification of DNA by Differential Screening of an Osteoclastoma cDNA
Library A total of 12,000 clones drawn from the osteoclastoma cDNA library were screened by differential hybridization, using mixed 32P labelled cDNA probes derived from (1) giant cell tumor mRNA (stromal cell+, OC~), and (2) mRNA from stromal cells (stromal cell+, OC-) cultivated from the same tumor. The probes were labelled with 32[P]dCTP by random priming to an activity of ~109CPM/~g. Of these 12,000 clones, 195 gave a positive hybridization signal with giant cell (i.e., osteoclast and stromal cell) mRNA, but not with stromal cell mRNA.
Additionally, these clones failed to hybridize to cDNA produced from mRNA derived from a variety of unrelated human cell types including epithelial cells, fibroblasts, lymphocytes, myelomonocytic cells, osteoblasts, and neuroblastoma cells. The failure of these clones to hybridize to cDNA produced from mRNA derived from other cell types supports the conclusion that these clones are either uniquely or selectively expressed in osteoclasts.
The osteoclast (0C) cDNA library was screened for differential hybridization to OC cDNA (stromal cell+, OC+) and stromal cell cDNA (stromal cell+, OC-) as follows:
NYTRAN filters (Schleicher & Schuell) were placed on agar plates containing growth medium and ampicillin. Individual bacterial colonies from the OC library were randomly picked and transferred, in triplicate, onto filters with preruled grids and then onto a master agar plate. Up to 200 colonies were inoculated onto a single 90-mm filter/plate using these techniques. The plates were inverted and incubated at 37°C until the bacterial inoculates had grown (on the filter) to a diameter of 0.5-1.0 mm.
The colonies were then lysed, and the DNA bound to the filters by first placing the filters on top of two pieces of Whatman 3MM paper saturated with 0.5 N
NaOH for 5 minutes. The filters were neutralized by placing on two pieces of Whatman 3MM paper saturated with 1 M Tris-HCL, pH 8.0 for 3-5 minutes.
S Neutralization was followed by incubation on another set of Whatman 3MM
papers saturated with 1M Tris-HCL, pH 8.0/1.5 M NaCI for 3-5 minutes. The filters were then washed briefly in 2 X SSC.
DNA was immobilized on the filters by baking the filters at 80°C
for 30 minutes. Filters were best used immediately, but they could be stored for up to one week in a vacuum jar at room temperature.
Filters were prehybridized in 5-8 ml of hybridization solution per filter, for 2-4 hours in a heat sealable bag. An additional 2 ml of solution was added for each additional filter added to the hybridization bag. The hybridization buffer consisted of 5 X SSC, 5 X Denhardt's solution, 1% SDS and 100 ~g/ml denatured heterologous DNA.
Prior to hybridization, labeled probe was denatured by heating in 1 X SSC
for 5 minutes at 100°C, then immediately chilled on ice. Denatured probe was added to the filters in hybridization solution, and the filters hybridized with continuous agitation for 12-20 hours at 65°C.
After hybridization, the filters were washed in 2 X SSC/0.2% SDS at 50-60°C for 30 minutes, followed by washing in 0.2 X SSC/0.2% SDS at 60°C for 60 minutes.The filters were then air dried and autoradiographed using an intensifying screen at -70°C overnight.
DNA Sequencing of Selected Clones Clones that hybridized to tumor cell cDNA, but not to stromal cell cDNA, were sequenced by the dideoxy chain termination method of Sanger et al., Proc.
Natl. Acad. Sci. USA 74:5463 (1977)) using sequenase (US Biochemical). The DNASIS (Hitatchi) program was used to carry out sequence analysis and a homology search in the GenBank/EMBL database.
For one of the clones identified by this method (designated clone 140), a full-length cDNA (designated RGS10B) was identified from an osteoclastoma cDNA
library. The nucleotide and predicted amino acid sequences of RGS l OB are shown in Figure 2. RGS l OB encodes a 181 amino acid polypeptide which appears to represent an isoform of a previously described molecule known as RGS10.
However, the amino terminus of the RGSlOB sequence is distinct from that of RGS 10, with only 3 of 16 residues identical (Figure 1 ). In contrast, residues 17-181 are identical in the two molecules, and RGS l OB possess the highly conserved RGS
domain. Additionally, the 3' untranslated region (UTR) of RGS l OB is also unique from that of RGS 10, indicating that RGS 1 OB is likely to be generated by alternative splicing of at least 2 novel exons.
Northern Blot Analysis Multiple tissue Northern analysis was performed using a 90-base pair probe to the amino terminus of RGS l OB. A 1 kb mRNA for RGS 1 OB was detected very strongly in the osteoclastoma tumor, but not in the stromal cell component of the tumor. RGS l OB mRNA was minimally expressed in HSB-2 (B lymphocyte cell line) and U937 (myelomonocytic cell line). All other human tissues and cell lines tested were negative for RGS 1 OB mRNA. These data demonstrate that RGS 1 OB
is selectively expressed in human osteoclasts.
While this invention has been particularly shown and described with reference to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
SEQUENCE LISTING
<110> Forsyth Dental Infirmary for Children <120> RGS10B, A G-Protein Regulator Expressed in Osteoclasts <130> 1564.1011-003 <150> US 60/154,930 <151> 1999-09-21 <160> 3 <170> FastSEQ for Windows Version 4.0 <210> 1 <211> 830 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <222> (1). .(830) <223> n = A,T,C or G
<400> 1 atgttcaacc gcgccgtgag ccggctgagc aggaagcggc cgccgtcaga catccacgac 60 agcgatggca gttccagcag cagccaccag agcctcaaga gcacagccaa atgggcggca 120 tccctggaga atctgctgga agacccagaa ggcgtgaaaa gatttaggga tttttaaaaa 180 aggaattcag tgaagaaaat gttttgtttt ggctagcatg tgaagatttt aagaaaatgc 240 aagataagac gcagatgcag gaaaaggcaa aggagatcta catgaccttt ctgtccagca 300 aggcctcatc acaggtcaac gtggaggggc agtctcggct caacgagaag atcctggaag 360 aaccgcaccc tctgatgttc cagaaactcc aggaccagat ctttaatctc atgaagtacg 420 acagctacag ccgcttctta aagtctgact tgtttttaaa acacaagcga accgaggaag 480 aggaagaaga tttgcctgat gctcaaactg cagctaaaag agcttccaga atttataaca 540 catgagcccc caaaaagccg ggactggsrg ctttaagawg caarggrmtt tcstntywgg 600 wccgtgscsg gttymtcatt gctytgtkat ttgtaaggac traartktrc aaaacccwwm 660 amygggawgt gtgtwttrtt aactgcttcc cccktaaact ttrmatgwtg swtraggkaw 720 cataaaaaan gaataatgat wacttggaag tttkagkttw caaaacagrg attcstkmaa 780 cmytggacac kkggngcatt ktgyagytta attaamccty atgymaggcc 830 <210> 2 <211> 181 <212> PRT
<213> Homo sapiens <400> 2 Met Phe Asn Arg Ala Val Ser Arg Leu Ser Arg Lys Arg Pro Pro Ser Asp Ile His Asp Ser Asp Gly Ser Ser Ser Ser Ser His Gln Ser Leu Lys Ser Thr Ala Lys Trp Ala Ala Ser Leu Glu Asn Leu Leu Glu Asp Pro Glu Gly Val Lys Arg Phe Arg Glu Phe Leu Lys Lys Glu Phe Ser Glu Glu Asn Val Leu Phe Trp Leu Ala Cys Glu Asp Phe Lys Lys Met Gln Asp Lys Thr Gln Met Gln Glu Lys Ala Lys Glu Ile Tyr Met Thr Phe Leu Ser Ser Lys Ala Ser Ser Gln Val Asn Val Glu Gly Gln Ser Arg Leu Asn Glu Lys Ile Leu Glu Glu Pro His Pro Leu Met Phe Gln Lys Leu Gln Asp Gln Ile Phe Asn Leu Met Lys Tyr Asp Ser Tyr Ser Arg Phe Leu Lys Ser Asp Leu Phe Leu Lys His Lys Arg Thr Glu Glu Glu Glu Glu Asp Leu Pro Asp Ala Gln Thr Ala Ala Lys Arg Ala Ser Arg Ile Tyr Asn Thr <210> 3 <211> 173 <212> PRT
<213> Unknown <220>
<223> Activator of Galphal GTPase activity <400> 3 Met Gln Ser Glu Leu Phe Phe Ala Asp Ile His Asp Ser Asp Gly Ser Ser Ser Ser Ser His Gln Ser Leu Lys Ser Thr Ala Lys Trp Ala Ala Ser Leu Glu Asn Leu Leu Glu Asp Pro Glu Gly Val Lys Arg Phe Arg Glu Phe Leu Lys Lys Glu Phe Ser Glu Glu Asn Val Leu Phe Trp Leu Ala Cys Glu Asp Phe Lys Lys Met Gln Asp Lys Thr Gln Met Gln Glu Lys Ala Lys Glu Ile Tyr Met Thr Phe Leu Ser Ser Lys Ala Ser Ser Gln Val Asn Val Glu Gly Gln Ser Arg Leu Asn Glu Lys Ile Leu Glu Glu Pro His Pro Leu Met Phe Gln Lys Leu Gln Asp Gln Ile Phe Asn Leu Met Lys Tyr Asp Ser Tyr Ser Arg Phe Leu Lys Ser Asp Leu Phe Leu Lys His Lys Arg Thr Glu Glu Glu Glu Glu Asp Leu Pro Asp Ala Gln Thr Ala Ala Lys Arg Ala Ser Arg Ile Tyr Asn Thr
The stromal cell population lacking osteoclasts was produced by dissociating cells of a giant cell tumor, then growing and passaging the cells in tissue culture until the cell population was homogeneous and appeared fibroblastic. The cultured W~ X1/21797 CA 02385460 2002-03-20 pCT~S00/25905 stromal cell population did not contain osteoclasts. The cultured stromal cells were then used to produce a stromal cell', osteoclast- 32P-labelled cDNA probe.
The cDNA library produced from the giant cell tumor of the bone was then screened in duplicate for hybridization to the cDNA probes: one screen was performed with the giant cell tumor cDNA probe (stromal cell+, osteoclast-), while a duplicate screen was performed using the cultured stromal cell cDNA probe (stromal cell+, osteoclasf). Hybridization to a stromal+, osteoclast+ probe, accompanied by failure to hybridize to a stromal+, osteoclast- probe, indicated that a clone contained nucleic acid sequences uniquely or selectively expressed by osteoclasts.
Selected clones were used to screen an osteoclastoma library and identify full length cDNA
sequences. As described herein, one of the selected clones (clone 140) was used to idenitfy a full length cDNA sequence termed RGS l OB.
The present invention has utility for the production and identification of nucleic acid probes useful for identifying RGS l OB DNA, including genomic DNA.
Nucleic acid molecules, polypeptides and antibodies described herein are also useful for identifying osteoclasts using well established techniques. Nucleic acid molecules of the present invention are also useful in gene therapy. For example, they can be used to alter the expression, e.g., in osteoclasts, of an aberrant gene product, or to correct aberrant expression of RGS l OB. The sequences described herein can further be used to cause RGS l OB expression in cells in which such expression does not ordinarily occur, e.g., in cells which are not osteoclasts.
The present invention relates to an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID NO: 1; and the complement of nucleotides 547-831 of SEQ ID NO: 1. These nucleotide sequences are distinct from that of RGS 10 and can be utilized as probes specific for RGS l OB. The invention also relates to an isolated nucleic acid molecule consisting essentially of or consisting of a nucleotide sequence selected from the group consisting of SEQ ID NO: 1; the complement of SEQ ID NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ >D NO: 1; nucleotides 547-831 of SEQ ID NO: 1; and the complement of nucleotides 547-831 of SEQ ID
NO: 1.
As appropriate, the isolated nucleic acid molecules of the present invention can be RNA, for example, mRNA, or DNA, such as cDNA and genomic DNA.
DNA molecules can be double-stranded or single-stranded; single stranded RNA
or DNA can be either the coding, or sense, strand or the non-coding, or antisense, strand. The nucleic acid molecule can include all or a portion of the coding sequence of a gene and can further comprise additional non-coding sequences such as introns and non-coding 3' and 5' sequences (including regulatory sequences, for example).
Additionally, the nucleic acid molecule can be fused to a marker sequence, for example, a sequence that encodes a polypeptide to assist in isolation or purification of the polypeptide. Such sequences include, but are not limited to, those which encode a glutathione-S-transferase (GST) fusion protein and those which encode a hemaglutin A (HA) polypeptide marker from influenza.
An "isolated" nucleic acid molecule, as used herein, is one that is separated from nucleic acids which normally flank the nucleic acid molecule in nature.
With regard to genomic DNA, the term "isolated" refers to nucleic acid molecules which are separated from the chromosome with which the genomic DNA is naturally associated. For example, the isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotides which flank the nucleic acid molecule in the genomic DNA of the cell from which the nucleic acid molecule is derived.
Moreover, an isolated nucleic acid molecule of the invention, such as a cDNA or RNA molecule, can be substantially free of other cellular material, or culture medium when produced by recombinant techniques, or chemical precursors or other chemicals when chemically synthesized. However, the nucleic acid molecule can be fused to other coding or regulatory sequences and still be considered isolated. In some instances, the isolated material will form part of a composition (for example, a crude extract containing other substances), buffer system or reagent mix. In other circumstances, the material may be purified to essential homogeneity, for example as determined by PAGE or column -7_ chromatography such as HPLC. Preferably, an isolated nucleic acid molecule comprises at least about 50, 80 or 90% (on a molar basis) of all macromolecular species present.
Further, recombinant DNA contained in a vector is included in the definition of "isolated" as used herein. Also, isolated nucleic acid molecules include recombinant DNA molecules in heterologous host cells, as well as partially or substantially purified DNA molecules in solution. "Isolated" nucleic acid molecules also encompass in vivo and in vitro RNA transcripts of the DNA molecules of the present invention.
In one embodiment, the variants hybridize under high stringency hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO: 1, the complement of SEQ ID NO:
l, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ
ID NO: l, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ ID NO: 1.
The invention further relates to a nucleic acid molecule which hybridizes under stringent, preferably highly stringent, conditions to a nucleotide sequence selected from the group consisting of SEQ ID NO: l; the complement of SEQ ID
NO: 1; nucleotides 1-48 of SEQ ID NO: 1; the complement of nucleotides 1-48 of SEQ ID NO: 1; nucleotides 547-831 of SEQ ID NO: l; and the complement of nucleotides 547-831 of SEQ ID NO: 1.
Stringent hybridization conditions for nucleic acid molecules are well known to those skilled in the art and can be found in standard texts such as Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1998), pp. 2.10.1-2.10.16 and 6.3.1-6.3.6, the teachings of which are hereby incorporated by reference.
As understood by those of ordinary skill, the exact conditions can be determined empirically and depend on ionic strength, temperature and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS. Other factors considered in determining the desired hybridization conditions include the length of the nucleic acid sequences, base composition, percent mismatch between _g_ the hybridizing sequences and the frequency of occurrence of subsets of the sequences within other non-identical sequences. Thus, equivalent conditions can be determined by varying one or more of these parameters while maintaining a similar degree of identity or similarity between the two nucleic acid molecules.
Typically, conditions are used such that sequences at least about 60%, at least about 70%, at least about 80%, at least about 90% or at least about 95% or more identical to each other remain hybridized to one another. In one non-limiting example, nucleic acid molecules are allowed to hybridize in 6X sodium chloride/sodium citrate (SSC) at about 4~ °C, followed by one or more low stringency washes in 0.2X
SSC/0.1 % SDS
at room temperature, or by one or more moderate stringency washes in 0.2X
SSC/0.1% SDS at 42°C, or washed in 0.2X SSC/0.1% SDSat 65°C
for high stringency.
The invention further relates to a nucleic acid molecule which is substantially identical (preferably greater than about 85 percent, more preferably greater than 1 S about 90 percent, and even more preferably greater than about 95 percent identical) to the nucleotide sequence of SEQ ID NO: 1 and which comprises nucleotides 1-of SEQ ID NO: 1. The percent identity of two nucleotide or amino acid sequences can be determined by aligning the sequences for optimal comparison purposes (e.g., gaps can be introduced in the sequence of a first sequence). The nucleotides or amino acids at corresponding positions are then compared, and the percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % identity = # of identical positions/total # of positions x 100). In certain embodiments, the length of a sequence aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 60%, and even more preferably at least 70%, 80% or 90% of the length of the reference sequence. The actual comparison of the two sequences can be accomplished by well-known methods, for example, using a mathematical algorithm. A preferred, non-limiting example of such a mathematical algorithm is described in Karlin et al., Proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993). Such an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) as described in Altschul et al., Nucleic Acids Res., 25:389-3402 (1997). When utilizing BLAST
and Gapped BLAST programs, the default parameters of the respective programs (e.g., NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. In one embodiment, parameters for sequence comparison can be set at score=100, wordlength=12, or can be varied (e.g., W=5 or W=20).
Another preferred, non-limiting example of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the CGC sequence alignment software package.
When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12 , and a gap penalty of 4 can be used.
Additional algorithms for sequence analysis are known in the art and include ADVANCE and ADAM as described in Torellis and Robotti (1994) Comput. Appl.
Biosci., 10:3-5; and FASTA described in Pearson and Lipman (1988) PNAS, 85:2444-8.
1 S In another embodiment, the percent identity between two amino acid sequences can be accomplished using the GAP program in the CGC software package (available at http://www.cgc.com) using either a Blossom 63 matrix or a PAM250 matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent identity between two nucleic acid sequences can be accomplished using the GAP program in the CGC software package (available at http://www.cgc.com), using a gap weight of 50 and a length weight of 3.
The present invention also provides isolated nucleic acid molecules that contain a fragment or portion that hybridizes under highly stringent conditions to a nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO:
1, the complement of SEQ ID NO: 1, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID NO: 1, nucleotides 54,7-831 of SEQ ID
NO: 1 and the complement of nucleotides 547-831 of SEQ ID NO: 1. The nucleic acid fragments of the invention are at least about 15, preferably at least about 18, 20, 23 or 25 nucleotides, and can be 30, 40, ~0, 100, 200 or more nucleotides in length.
Longer fragments, for example, 30 or more nucleotides in length, which encode antigenic proteins or polypeptides described herein are useful.
In a related aspect, the nucleic acid molecules of the invention are used as probes or primers in assays such as those described herein. "Probes" are oligonucleotides that hybridize in a base-specific manner to a complementary strand of nucleic acid molecules. Such probes include polypeptide nucleic acids, as described in Nielsen et al., Science, 254, 1497-1500 (1991). Typically, a probe comprises a region of nucleotide sequence that hybridizes under highly stringent conditions to at least about 15, typically about 20-25, and more typically about 40, 50 or 75, consecutive nucleotides of a nucleic acid molecule comprising a nucleotide sequence selected from SEQ ID NO: 1, the complement of SEQ ID NO: l, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID
NO: l, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ ID NO: 1. The probe may also comprise additional nucleotides on either or both ends of the molecule, which preferably do not interfere with the ability of the molecule to hybridize to a nucleotide sequence. More typically, the probe further comprises a label, e.g., radioisotope, fluorescent compound, enzyme, or enzyme co-factor.
As used herein, the term "primer" refers to a single-stranded oligonucleotide which acts as a point of initiation of template-directed DNA synthesis using well-known methods (e.g., PCR, LCR) including, but not limited to those described herein. The appropriate length of the primer depends on the particular use, but typically ranges from about 15 to 30 nucleotides.
The nucleic acid molecules of the invention such as those described above can be identified and isolated using standard molecular biology techniques and the sequence information provided in SEQ ID NO: 1. For example, nucleic acid molecules can be amplified and isolated by the polymerase chain reaction using synthetic oligonucleotide primers designed based on one or more of the sequences provided in SEQ ID NO: 1, the complement of SEQ ID NO: l, nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID NO: l, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ ID
NO: 1. See generally PCR Technology: Principles and Applications for DNA
Amplification (ed. H.A. Erlich, Freeman Press, NY, NY, 1992); PCR Protocols: A
Guide to Methods and Applications (Eds. Innis, et al., Academic Press, San Diego, CA, 1990); Manila et al., Nucleic Acids Res., 19:4967 (1991); Eckert et al., PCR
Methods and Applications, 1:17 (1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Patent 4,683,202. The nucleic acid molecules can be amplified using cDNA, mRNA or genomic DNA as a template, cloned into an appropriate vector and characterized by DNA sequence analysis.
Other suitable amplification methods include the ligase chain reaction (LCR) (see Wu and Wallace, Genomics, 4:560 (1989), Landegren et al., Science, 241:1077 (1988), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA, 86:1173 (1989)), and self sustained sequence replication (Guatelli et al., Proc. Nat.
Acad. Sci. USA, 87:1874 (1990)) and nucleic acid based sequence amplification (NASBA). The latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of about or 100 to 1, respectively.
The amplified DNA can be radiolabelled and used as a probe for screening a cDNA library derived from osteoclasts, e.g., human osteoclasts, mRNA in zap express, ZIPLOX or other suitable vector. Corresponding clones can be isolated, DNA can obtained following in vivo excision, and the cloned insert can be sequenced in either or both orientations by art recognized methods to identify the correct reading frame encoding a protein of the appropriate molecular weight.
For example, the direct analysis of the nucleotide sequence of nucleic acid molecules of the present invention can be accomplished using well-known methods that are commercially available. See, for example, Sambrook et al., Molecular Cloning, A
Laboratory Manual (2nd Ed., CSHP, New York 1989); Zyskind et al., Recombinant DNA Laboratorv Manual, (Acad. Press, 1988)). Using these or similar methods, the protein and the DNA encoding the protein can be isolated, sequenced and further characterized.
Antisense nucleic acid molecules of the invention can be designed using the nucleotide sequences of SEQ ID NO: 1, the complement of SEQ ID NO: l, or a portion thereof, such as nucleotides 1-48 of SEQ ID NO: 1, the complement of nucleotides 1-48 of SEQ ID NO: 1, nucleotides 547-831 of SEQ ID NO: 1 and the complement of nucleotides 547-831 of SEQ >D NO: 1, and constructed using chemical synthesis and enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid molecule (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used. Alternatively, the antisense nucleic acid molecule can be produced biologically using an expression vector into which a nucleic acid molecule has been subcloned in an antisense orientation (i.e., RNA
transcribed from the inserted nucleic acid molecule will be of an antisense orientation to a target nucleic acid of interest).
In general, the isolated nucleic acid sequences can be used as molecular weight markers on Southern gels, and as chromosome markers which are labeled to map related gene positions. The nucleic acid sequences can also be used to compare with endogenous DNA sequences in patients to identify genetic disorders, and as probes, such as to hybridize and discover related DNA sequences or to subtract out known sequences from a sample. The nucleic acid sequences can further be used to derive primers for genetic fingerprinting, to raise anti-protein antibodies using DNA
immunization techniques, and as an antigen to raise anti-DNA antibodies or elicit immune responses. Additionally, the nucleotide sequences of the invention can be used to identify and express recombinant proteins for analysis, characterization or therapeutic use, or as markers for tissues in which the corresponding protein is expressed, either constitutively, during tissue differentiation, or in diseased states.
Another aspect of the invention pertains to nucleic acid constructs containing a nucleic acid molecule selected from the group consisting of SEQ m NO: 1 and the complement of SEQ ID NO: 1 (or a portion thereofj. The constructs comprise a vector (e.g., an expression vector) into which a sequence of the invention has been inserted in a sense or antisense orientation. As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
Moreover, certain vectors, expression vectors, are capable of directing the expression of genes to which they are operably linked. In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. However, the invention is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses) that serve equivalent functions.
Preferred recombinant expression vectors of the invention comprise a nucleic acid molecule of the invention in a form suitable for expression of the nucleic acid molecule in a host cell. This means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, which is operably linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequences) in a manner which allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). The term "regulatory sequence" is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals).
Such regulatory sequences are described, for example, in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
Regulatory sequences include those which direct constitutive expression of a nucleotide sequence in many types of host cell and those which direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed and the level of expression of protein desired. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acid molecules as described herein .
The recombinant expression vectors of the invention can be designed for expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g., bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic acid molecule of the invention can be expressed in bacterial cells (e.g., E.
coli), insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those skilled in the art.
Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing a foreign nucleic acid molecule (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAF-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al. (supra), and other laboratory manuals.
For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is generally introduced into the host cells along with the gene of interest.
Preferred selectable markers include those that confer resistance to drugs, such as 6418, hygromycin and methotrexate. Nucleic acid molecules encoding a selectable marker can be introduced into a host cell on the same vector as the nucleic acid molecule of the invention or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid molecule can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).
A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) a polypeptide of the invention.
Accordingly, the invention further provides methods for producing a polypeptide using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding a polypeptide of the invention has been introduced) in a suitable medium such that the polypeptide is produced. In another embodiment, the method further comprises isolating the polypeptide from the medium or the host cell.
The host cells of the invention can also be used to produce nonhuman transgenic animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem cell into which a nucleic acid molecule of the invention has been introduced. Such host cells can then be used to create non-human transgenic animals in which exogenous nucleotide sequences have been introduced into the genome or homologous recombinant animals in which endogenous nucleotide sequences have been altered. Such animals are useful for studying the function and/or activity of the nucleotide sequence and polypeptide encoded by the sequence and for identifying and/or evaluating modulators of their activity. As used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal includes a transgene. Other examples of transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens and amphibians.
A
transgene is exogenous DNA which is integrated into the genome of a cell from which a transgenic animal develops and which remains in the genome of the mature animal, thereby directing the expression of an encoded gene product in one or more cell types or tissues of the transgenic animal. As used herein, an "homologous recombinant animal" is a non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous gene has been altered by homologous recombination between the endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to development of the animal.
Methods for generating transgenic animals via embryo manipulation and microinjection, particularly animals such as mice, have become conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Methods for constructing homologous recombination vectors and homologous recombinant animals are described further in Bradley (1991) Current Opinion in BiolTechnology, 2:823-829 and in PCT Publication Nos. WO 90/11354, WO 91/01140, WO 92/0968, and WO 93/04169. Clones of the non-human transgenic animals described herein can also be produced according to the methods described in Wilmut et al.
(1997) Nature, 385:810-813 and PCT Publication Nos. WO 97/07668 and WO 97/07669.
The present invention also provides isolated polypeptides and variants and fragments thereof that are encoded by the nucleic acid molecules of the invention.
For example, as described above, the nucleotide sequences can be used to design primers to clone and express cDNAs encoding the polypeptides of the invention.
As used herein, a polypeptide is said to be "isolated" or "purified" when it is substantially free of cellular material when it is isolated from recombinant and non-recombinant cells, or free of chemical precursors or other chemicals when it is chemically synthesized. A polypeptide, however, can be joined to another polypeptide with which it is not normally associated in a cell and still be "isolated"
or "purified."
The polypeptides of the invention can be purified to homogeneity. It is understood, however, that preparations in which the polypeptide is not purified to homogeneity are useful. The critical feature is that the preparation allows for the desired function of the polypeptide, even in the presence of considerable amounts of other components. Thus, the invention encompasses various degrees of purity.
In one embodiment, the polypeptide is substantially free of cellular material, which includes preparations of the polypeptide having less than about 30% (by dry weight) other proteins (i.e., contaminating protein), less than about 20% other proteins, less than about 10% other proteins, or less than about 5% other proteins.
When a polypeptide is recombinantly produced, it can also be substantially free of culture medium, i.e., culture medium represents less than about 20%, less than about 10%, or less than about 5% of the volume of the protein preparation. In one embodiment, the polypeptide is substantially free of chemical precursors or other chemicals, which includes preparations of the polypeptide in which it is separated from chemical precursors or other chemicals that are involved in its synthesis. In one embodiment, the language "substantially free of chemical precursors or other chemicals" includes preparations of the polypeptide having less than about 30% (by dry weight) chemical precursors or other chemicals, less than about 20% chemical precursors or other chemicals, less than about 10% chemical precursors or other chemicals, or less than about 5% chemical precursors or other chemicals.
In one embodiment, a polypeptide comprises an amino acid sequence encoded by a nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1 and nucleotides 1-48 of SEQ ID NO: l and complements and portions thereof, e.g., SEQ ID NO: 2 or a portion thereof (e.g., amino acids 1-16 of SEQ ID NO: 2). However, the invention also encompasses sequence variants, provided that the sequence contains a characteristic feature of the RGSlOB sequence (e.g., amino acids 1-16 of SEQ ID NO: 2). Variants include a substantially homologous protein encoded by the same genetic locus in an organism, i.e., an allelic variant. Variants also encompass proteins derived from other genetic loci in an organism, but having substantial homology to a polypeptide encoded by a nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1 and complements and portions thereof. Variants also include proteins substantially homologous or identical to these polypeptides but derived from another organism, i.e., an ortholog. Variants also include proteins that are substantially homologous or identical to these polypeptides that are produced by chemical synthesis. Variants also include proteins that are substantially identical to these polypeptides that are produced by recombinant methods.
To determine the percent homology or identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the sequence of one protein or nucleic acid molecule for optimal alignment with the other protein or nucleic acid molecule).
The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in one sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the other sequence, then the molecules are homologous at that position. As used herein, amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity". The percent homology between the two sequences is a function of the number of identical positions shared by the sequences (i.e., percent homology or identity equals the number of identical positions/total number of positions times 100).
The invention also encompasses polypeptides having a lower degree of identity but having sufficient similarity so as to perform one or more of the same functions performed by a polypeptide encoded by a nucleic acid molecule of the invention. Similarity is determined by conserved amino acid substitution. Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of like characteristics. Conservative substitutions are likely to be phenotypically silent. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, and Ile; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gln, exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. Guidance concerning which amino acid changes are likely to be phenotypically silent are found in Bowie et al., Science 247:1306-1310 (1990).
A variant polypeptide can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these. Further, variant polypeptides can be fully functional or can lack function in one or more activities. Fully functional variants typically contain only conservative variation or variation in non-critical residues or in non-critical regions. Functional variants can also contain substitution of similar amino acids that result in no change or an insignificant change in function.
Alternatively, such substitutions may positively or negatively affect function to some degree. Non-functional variants typically contain one or more non-conservative amino acid substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical region.
Amino acids that are essential for function can be identified by methods known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al., Science, 244:1081-1085 (1989)). The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for biological activity in vitro, or in vitro proliferative activity. Sites that are critical for polypeptide activity can also be determined by structural analysis such as crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et al., J. Mol. Biol., 224:899-904 (1992); de Vos et al. Science, 255:306-312 (1992)).
The invention also includes polypeptide portions and fragments of the polypeptides of the invention. As used herein, the terms "fragment" and "portion"
are intended to a equivalents. Fragments can be derived from a polypeptide encoded by a nucleic acid molecule comprising SEQ ID NO: 1 or a portion thereof and the complements thereof. However, the invention also encompasses fragments of all variants of the polypeptides described herein. As used herein, a fragment or portion comprises at least 6 contiguous amino acids. Useful fragments include those that retain one or more of the biological activities of the polypeptide as well as fragments that can be used as an immunogen to generate polypeptide-specific antibodies.
Biologically active fragments (peptides which are, for example, 6, 9, 12, 15, 16, 20, 30, 35, 36, 37, 38, 39, 40, 50, 100 or more amino acids in length) can comprise a domain, segment, or motif that has been identified by analysis of the polypeptide sequence using well-known methods, e.g., signal peptides, extracellular domains, one or more transmembrane segments or loops, ligand binding regions, zinc finger domains, DNA binding domains, acylation sites, glycosylation sites, or phosphorylation sites.
Fragments can be discrete (not fused to other amino acids or polypeptides) or can be within a larger polypeptide. Further, several fragments can be comprised within a single larger polypeptide. In one embodiment a fragment designed for expression in a host can have heterologous pre- and pro-polypeptide regions fused to the amino terminus of the polypeptide fragment and an additional region fused to the carboxyl terminus of the fragment.
The invention thus provides chimeric or fusion proteins. These comprise a polypeptide of the invention operatively linked to a heterologous protein having an amino acid sequence not substantially homologous to the polypeptide.
"Operatively linked" indicates that the polypeptide protein and the heterologous protein are fused in-frame. The heterologous protein can be fused to the N-terminus or C-terminus of the polypeptide. In one embodiment the fusion protein does not affect function of the polypeptide per se. For example, the fusion protein can be a GST-fusion protein in which the polypeptide sequences are fused to the C-terminus of the GST
sequences. Other types of fusion proteins include, but are not limited to, enzymatic fusion proteins, for example beta-galactosidase fusions, yeast two-hybrid GAL
fusions, poly-His fusions and Ig fusions. Such fusion proteins, particularly poly-His fusions, can facilitate the purification of recombinant polypeptide. In certain host cells (e.g., mammalian host cells), expression and/or secretion of a protein can be increased by using a heterologous signal sequence. Therefore, in another embodiment, the fusion protein contains a heterologous signal sequence at its N-terminus.
EP-A-O 464 533 discloses fusion proteins comprising various portions of immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262).
In drug discovery, for example, human proteins have been fused with Fc portions for the purpose of high-throughput screening assays to identify antagonists.
Bennett et al., Journal ofMolecular Recognition, 8:52-58 (1995) and Johanson et al., The Journal ofBiological Chemistry, 270,16:9459-9471 (1995). Thus, this invention also encompasses soluble fusion proteins containing a polypeptide of the invention and various portions of the constant regions of heavy or light chains of immunoglobulins of various subclass (IgG, IgM, IgA, IgE).
A chimeric or fusion protein can be produced by standard recombinant DNA
techniques. For example, DNA fragments coding for the different protein sequences are ligated together in-frame in accordance with conventional techniques. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of nucleic acid fragments can be carned out using anchor primers which give rise to complementary overhangs between two consecutive nucleic acid fragments which can subsequently be annealed and re-amplified to generate a chimeric nucleic acid sequence (see Ausubel et al., Current Protocols in Molecular Biology, 1992).
Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST protein). A nucleic acid molecule encoding a polypeptide of the invention can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the polypeptide protein.
The isolated polypeptide can be purified from cells that naturally express it, such as from osteoclasts, purified from cells that have been altered to express it (recombinant), or synthesized using known protein synthesis methods. In one embodiment, the protein is produced by recombinant DNA techniques. For example, a nucleic acid molecule encoding the polypeptide is cloned into an expression vector, the expression vector introduced into a host cell and the protein expressed in the host cell. The protein can then be isolated from the cells by an appropriate purification scheme using standard protein purification techniques.
In general, polypeptides or proteins of the present invention can be used as a molecular weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns using art-recognized methods. The polypeptides of the present invention can be used to raise antibodies or to elicit an immune response. The polypeptides can also be used as a reagent, e.g., a labeled reagent, in assays to quantitatively determine levels of the protein or a molecule to which it binds (e.g., a receptor or a ligand) in biological fluids. The polypeptides can also be used as markers for cells or tissues in which the corresponding protein is preferentially expressed, either constitutively, during tissue differentiation, or in a diseased state. The polypeptides can be used to isolate a corresponding binding partner, e.g., receptor or ligand, such as, for example, in an interaction trap assay, and to screen for peptide or small molecule antagonists or agonists of the binding interaction.
In another aspect, the invention provides antibodies to the polypeptides and polypeptide fragments of the invention, e.g., having an amino acid sequence encoded by a nucleic acid molecule comprising all or a portion of SEQ ID NO:
1.
The term "antibody" as used herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site that specifically binds an antigen. A molecule that specifically binds to a polypeptide of the invention is a molecule that binds to that polypeptide or a fragment thereof, but does not substantially bind other molecules in a sample, e.g., a biological sample, which naturally contains the polypeptide.
Examples of immunologically active portions of immunoglobulin molecules include Flab) and F(ab')2 fragments which can be generated by treating the antibody with an enzyme such as pepsin. The invention provides polyclonal and monoclonal antibodies that bind to a polypeptide of the invention. The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, refers to a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope of a polypeptide of the invention. A monoclonal antibody composition thus typically displays a single binding affinity for a particular polypeptide of the invention with which it immunoreacts.
Polyclonal antibodies can be prepared as described above by immunizing a suitable subject with a desired immunogen, e.g., polypeptide of the invention or fragment thereof. The antibody titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) using immobilized polypeptide. If desired, the antibody molecules directed against the polypeptide can be isolated from the mammal (e.g., from the blood) and further purified by well-known techniques, such as protein A
chromatography to obtain the IgG fraction. At an appropriate time after immunization, e.g., when the antibody titers are highest, antibody-producing cells can be obtained from the subject and used to prepare monoclonal antibodies by standard techniques, such as the hybridoma technique originally described by Kohler and Milstein (1975) Nature, 256:495-497, the human B cell hybridoma technique (Kozbor et al. (1983) Immunol. Today, 4:72), the EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp.
77-96) or trioma techniques. The technology for producing hybridomas is well known (see generally Current Protocols in Immunology (1994) Coligan et al.
(eds.) John Wiley & Sons, Inc., New York, NY). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal immunized with an immunogen as described above, and the culture supernatants of the resulting hybridoma cells are screened to identify a hybridoma producing a monoclonal antibody that binds a polypeptide of the invention.
Any of the many well known protocols used for fusing lymphocytes and immortalized cell lines can be applied for the purpose of generating a monoclonal antibody to a polypeptide of the invention (see, e.g., Current Protocols in Immunology, supra; Galfre et al. (1977) Nature, 266:55052; R.H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing Corp., New York, New York (1980); and Lerner (1981) Yale J. Biol.
Med., 54:387-402. Moreover, the ordinarily skilled worker will appreciate that there are many variations of such methods that also would be useful.
Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal antibody to a polypeptide of the invention can be identified and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage display library) with the polypeptide to thereby isolate immunoglobulin library members that bind the polypeptide. Kits for generating and screening phage display libraries are commercially available (e.g., the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-O1; and the Stratagene SurfZAPTM Phage Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly amenable for use in generating and screening antibody display library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No.
WO
92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791;
PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT
Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT
Publication No. WO 90/02809; Fuchs et al. (1991) BiolTechnology, 9:1370-1372;
Hay et al. (1992) Hum. Antibod. Hybridomas, 3:81-85; Huse et al. (1989) Science, 246:1275-1281; Griffiths et al. (1993) EMBO J., 12:725-734.
Additionally, recombinant antibodies, such as chimeric and humanized monoclonal antibodies, comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are within the scope of the invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art.
In general, antibodies of the invention (e.g., a monoclonal antibody) can be used to isolate a polypeptide of the invention by standard techniques, such as affinity chromatography or immunoprecipitation. A polypeptide-specific antibody can facilitate the purification of natural polypeptide from cells and of recombinantly produced polypeptide expressed in host cells. Moreover, an antibody specific for a polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and pattern of expression of the polypeptide. Antibodies can be used diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can be facilitated by coupling the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, (-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidinlbiotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable radioactive material include'ZSI,'3'I, 35S or 3H.
Portions or fragments of the nucleotide sequences identified herein (and the corresponding complete gene sequences) can be used in numerous ways as polynucleotide reagents. For example, these sequences can be used to: (i) map their respective genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (ii) identify an individual from a minute biological sample (tissue typing); and (iii) aid in forensic identification of a biological sample.
The present invention also pertains to diagnostic assays for determining protein and/or nucleic acid expression as well as activity of proteins of the invention, in the context of a biological sample (e.g., blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with a disease or disorder, or is at risk of developing a disorder, associated with aberrant expression or activity. The invention also provides for prognostic (or predictive) assays for determining whether an individual is at risk of developing a disorder associated with activity or expression of proteins or nucleic acid molecules of the invention.
Disorders which may be treated or diagnosed by methods described herein include, but are not limited to, diseases characterized by an increase in bone density, including osteopetrosis, osteosclerosis, pyknodysostosis, osteomyelosclerosis, hyperphosphatasia, progressive diaphyseal dysplasia, melorheostosis, osteopoikilosis, hyperostosis frontalis interna, sclerostosis, McCune-Albright syndrome and spondyloepiphyseal dysplasia; disorders characterized by a decrease in bone density, including osteoarthritis, Maroteaux-Lamy syndrome and osteoporosis; and disorders of bone formation, including osteogenisis imperfecta, sutural craneosynostosis, osteomalacia and cleidocranial dysplasia, and bone cancers such as osteosarcomas. Additionally, the methods described herein may be useful for the treatment or diagnosis of bone disorders associated with cancer, anorexia nervosa, and auto-immune disorders.
For example, mutations in the gene can be assayed in a biological sample.
Such assays can be used for prognostic or predictive purpose to thereby prophylactically treat an individual prior to the onset of a disorder characterized by or associated with expression or activity of nucleic acid molecules or proteins of the invention.
An exemplary method for detecting the presence or absence of proteins or nucleic acid molecules of the invention in a biological sample involves obtaining a biological sample from a test subject and contacting the biological sample with a compound or an agent capable of detecting the protein, or nucleic acid molecule (e.g., mRNA, genomic DNA) that encodes the protein, such that the presence of the protein or nucleic acid molecule is detected in the biological sample. A
preferred agent for detecting mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to mRNA or genomic DNA sequences described herein. The nucleic acid probe can be, for example, a full-length nucleic acid molecule, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to appropriate mRNA or genomic DNA. For example, the nucleic acid probe can be all or a portion of SEQ ID NO: l, or the complement of SEQ ID NO: l, or a portion thereof. Other suitable probes for use in the diagnostic assays of the invention are described herein.
In one embodiment, the agent for detecting proteins of the invention is an antibody capable of binding to the protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')z) can be used. The term "labeled", with regard to the probe or antibody, is intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. The term "biological sample" is intended to include tissues, calls and biological fluids isolated from a subject, as well as tissues, cells and fluids present within a subject. That is, the detection method of the invention can be used to detect mRNA, protein, or genomic DNA of the invention in a biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for detection of protein include enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for detection of genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for detection of protein include introducing into a subject a labeled anti-protein antibody. For example, the antibody can be labeled with a radioactive marker whose presence and location in a subject can be detected by standard imaging techniques.
In one embodiment, the biological sample contains protein molecules from the test subject. Alternatively, the biological sample can contain mRNA
molecules from the test subject or genomic DNA molecules from the test subject. A
preferred biological sample is a serum sample or lung tissue biopsy isolated by conventional means from a subj ect.
In another embodiment, the methods further involve obtaining a control biological sample from a control subject, contacting the control sample with a compound or agent capable of detecting protein, mRNA, or genomic DNA of the invention, such that the presence of protein, mRNA or genomic DNA is detected in the biological sample, and comparing the presence of protein, mRNA or genomic DNA in the control sample with the presence of protein, mRNA or genomic DNA in the test sample.
The invention also encompasses kits for detecting the presence of proteins or nucleic acid molecules of the invention in a biological sample. For example, the kit can comprise a labeled compound or agent capable of detecting protein or mRNA
in a biological sample; means for determining the amount of in the sample; and means for comparing the amount of in the sample with a standard. The compound or agent can be packaged in a suitable container. The kit can further comprise instructions for using the kit to detect protein or nucleic acid molecules.
The diagnostic methods described herein can furthermore be utilized to identify subjects having or at risk of developing a disease or disorder associated with aberrant expression or activity of proteins and nucleic acid molecules of the invention. For example, the assays described herein, such as the preceding diagnostic assays or the following assays can be utilized to identify a subject having or at risk of developing a disorder associated with protein or nucleic acid expression or activity such as a proliferative disorder, a differentiative or developmental disorder, or a hematopoietic disorder. Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for developing a differentiative or proliferative disease (e.g., cancer) particularly of the bones. Thus, the present invention provides a method for identifying a disease or disorder associated with aberrant expression or activity of proteins or nucleic acid molecules of the invention, in which a test sample is obtained from a subject and protein or nucleic acid molecule (e.g., mRNA, genomic DNA) is detected, wherein the presence of protein or nucleic acid molecule is diagnostic for a subject having or at risk of developing a disease or disorder associated with aberrant expression or activity of the protein or nucleic acid sequence of the invention. As used herein, a "test sample" refers to a biological sample obtained from a subject of interest. For example, a test sample can be a biological fluid (e.g., serum), cell sample (e.g., osteoclasts), or tissue sample.
The invention provides a method (also referred to herein as a "screening assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g., antisense, polypeptides, peptidomimetics, small molecules or other drugs) which bind to nucleic acid molecules, polypeptides or proteins described herein or have a stimulatory or inhibitory effect on, for example, expression or activity of the nucleic acid molecules, polypeptides or proteins of the invention.
In one embodiment, the invention provides assays for screening candidate or test compounds that bind to or modulate the activity of protein or polypeptide described herein or biologically active portion thereof. The test compounds of the present invention can be obtained using any of the numerous approaches in combinatorial library methods known in the art, including: biological libraries;
spatially addressable parallel solid phase or solution phase libraries;
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library method;
and synthetic library methods using affinity chromatography selection. The biological library approach is limited to polypeptide libraries, while the other four approaches are applicable to polypeptide, non-peptide oligomer or small molecule libraries of compounds (Lam, K.S. (1997) Anticancer Drug Des., 12:145).
In another embodiment, an assay is a cell-based assay comprising contacting a cell expressing a particular target molecule described herein with a test compound and determining the ability of the test compound to modulate or alter (e.g.
stimulate or inhibit) the activity of the target molecule. Determining the ability of the test compound to modulate the activity of the target molecule can be accomplished, for example, by determining the ability of a known ligand to bind to or interact with the target molecule. For example, the ability of RGS 1 OB to interact with and/or activate the GTPase activity of a Gcx subunit can be determined.
In yet another embodiment, an assay of the present invention is a cell-free assay in which protein of the invention or biologically active portion thereof is contacted with a test compound and the ability of the test compound to bind to the protein or biologically active portion thereof is determined. Binding of the test compound to the protein can be determined either directly or indirectly as described above. In one embodiment, the assay includes contacting the protein or biologically active portion thereof with a known compound which binds the protein to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with the protein. Determining the ability of the test compound to interact with the protein comprises determining the ability of the test compound to preferentially bind to the protein or biologically active portion thereof as compared to the known compound.
In another embodiment, the assay is a cell-free assay in which a protein of the invention or biologically active portion thereof is contacted with a test compound and the ability of the test compound to modulate or alter (e.g., stimulate or inhibit) the activity of the protein or biologically active portion thereof is determined. Determining the ability of the test compound to modulate the activity of the protein can be accomplished, for example, by determining the ability of the protein to bind to a known target molecule by one of the methods described above for determining direct binding. In an alternative embodiment, determining the ability of the test compound to modulate the activity of a protein of the invention can be accomplished by determining the ability of the protein to further modulate the activity of a target molecule. For example, the catalytic/enzymatic activity of the target molecule on an appropriate substrate can be determined as previously described.
In yet another embodiment, the cell-free assay involves contacting a protein of the invention or biologically active portion thereof with a known compound which binds the protein to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with the protein, wherein determining the ability of the test compound to interact with the protein comprises determining the ability of the protein to preferentially bind to or modulate the activity of a target molecule.
In more than one embodiment of the above assay methods of the present invention, it may be desirable to immobilize either the protein or its target molecule to facilitate separation of complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate automation of the assay. Binding of a test compound to the protein, or interaction of the protein with a target molecule in the presence and absence of a candidate compound, can be accomplished in any vessel suitable for containing the reactants. Examples of such vessels include microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein (e.g., a glutathione-S-transferase fusion protein) can be provided which adds a domain that allows one or both of the proteins to be bound to a matrix.
In another embodiment, modulators of expression of nucleic acid molecules of the invention are identified in a method wherein a cell is contacted with a candidate compound and the expression of appropriate mRNA or protein in the cell is determined. The level of expression of appropriate mRNA or protein in the presence of the candidate compound is compared to the level of expression of mRNA or protein in the absence of the candidate compound. The candidate compound can then be identified as a modulator of expression based on this comparison. For example, when expression of mRNA or protein is greater (statistically significantly greater) in the presence of the candidate compound than in its absence, the candidate compound is identified as a stimulator or enhancer of the mRNA or protein expression. Alternatively, when expression of the mRNA or protein is less (statistically significantly less) in the presence of the candidate compound than in its absence, the candidate compound is identified as an inhibitor of the mRNA or protein expression. The level of mRNA or protein expression in the cells can be determined by methods described herein for detecting mRNA or protein.
This invention further pertains to novel agents identified by the above-described screening assays. Accordingly, it is within the scope of this invention to further use an agent identified as described herein in an appropriate animal model. For example, an agent identified as described herein (e.g., a modulating agent, an antisense nucleic acid molecule, a specific antibody, or a protein-binding partner) can be used in an animal model to determine the efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an agent identified as described herein can be used in an animal model to determine the mechanism of action of such an agent. Furthermore, this invention pertains to uses of novel agents identified by the above-described screening assays for treatments as described herein.
The present invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant expression or activity of proteins or nucleic acid molecules of the invention. In one aspect, the invention provides a method for preventing in a subject, a disease or condition associated with aberrant expression or activity of genes or proteins of the present invention, by administering to the subject an agent which modulates expression or at least one activity of a gene or protein of the invention. Subjects at risk for a disease that is caused or contributed to by aberrant gene expression or protein activity can be identified by, for example, any or a combination of diagnostic or prognostic assays as described herein.
Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its progression. Depending on the type of aberrancy, for example, an agonist or antagonist agent can be used for treating the subject. The appropriate agent can be determined based on screening assays described herein.
Another aspect of the invention pertains to methods of modulating expression or activity of genes or proteins of the invention for therapeutic purposes.
The modulatory method of the invention involves contacting a cell with an agent that modulates one or more of the activities of the specified protein associated with the cell. An agent that modulates protein activity can be an agent as described herein, such as a nucleic acid molecule or a protein, a naturally-occurring target molecule of a protein described herein, a polypeptide, a peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or more protein activities. Examples of such stimulatory agents include active protein as well as a nucleic acid molecule encoding the protein that has been introduced into the cell. In another embodiment, the agent inhibits one or more protein activities.
Examples of such inhibitory agents include antisense nucleic acid molecules and anti-protein antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present invention provides methods of treating an individual afflicted with a disease or disorder characterized by aberrant expression or activity of a protein or nucleic acid molecule of the invention. In one embodiment, the method involves administering an agent (e.g., an agent identified by a screening assay described herein), or combination of agents that modulates (e.g., upregulates or downregulates) expression or activity of a gene or protein of the invention. In another embodiment, the method involves administering a protein or nucleic acid molecule of the invention as therapy to compensate for reduced or aberrant expression or activity of the protein or nucleic acid molecule.
Stimulation of protein activity is desirable in situations in which the protein is abnormally downregulated and/or in which increased protein activity is likely to have a beneficial effect. Likewise, inhibition of protein activity is desirable in situations in which the protein is abnormally upregulated and/or in which decreased protein activity is likely to have a beneficial effect. One example of such a situation is where a subject has a disorder characterized by aberrant development or cellular differentiation. The molecules of the present invention, as well as agents, or modulators which have a stimulatory or inhibitory effect on the protein activity (e.g., gene expression) as identified by a screening assay described herein can be administered to individuals to treat (prophylactically or therapeutically) disorders (e.g., proliferative or developmental disorders) associated with aberrant protein activity.
The invention will be further described by the following non-limiting examples. The teachings of all publications cited herein are incorporated herein by reference in their entirety.
EXAMPLES
Osteoclast cDNA Library Construction Messenger RNA (mRNA) obtained from a human osteoclastoma (giant cell tumor of bone) was used to construct an osteoclastoma cDNA library.
Osteoclastomas are actively bone resorptive tumors, but are usually non-metastatic.
In cryostat sections, osteoclastomas consist of ~30% multinucleated cells positive for tartrate resistant acid phosphatase (TRAP), a widely utilized phenotypic marker specific in vivo for osteoclasts (Minkin, Calcif. Tissue Int. 34:285-290 (1982)). The remaining cells are uncharacterized stromal cells, a mixture of cell types with fibroblastic/mesenchymal morphology.
Monoclonal antibody reagents were used to partially characterize the surface phenotype of the multinucleated cells in the giant cell tumors of long bone.
In frozen sections, all multinucleated cells expressed CD68, which has previously been reported to define an antigen specific for both osteoclasts and macrophages (Norton, M.A. and M.H. Helfrich, In Biology and Physiology of the Osteoclast, B.R.
Rifkin and C.V. Gay, editors, CRC Press, Inc. Boca Raton, FL, 33-54 (1992)). In contrast, no staining of giant cells was observed for CD1 1b or CD14 surface antigens, which are present on monocyte/ macrophages and granulocytes (Arnaout, M.A. et al. J.
Cell. Physiol. 137:305 (1988); Haziot, A. et al. J. Immunol. 141:547 (1988)).
Cytocentrifuge preparations of human peripheral blood monocytes were positive for CD68, CDl 1b, and CD14. These results demonstrate that the multinucleated giant cells of osteoclastomas have a phenotype which is distinct from that of macrophages, and which is consistent with that of osteoclasts.
Osteoclastoma tissue was snap frozen in liquid nitrogen and used to prepare poly A+ mRNA according to standard methods. cDNA cloning into a pcDNAII
vector was carried out using a commercially-available kit (Librarian, InVitrogen).
Approximately 2.6 x 106 clones were obtained, >95% of which contained inserts of an average length 0.6 kB.
Stromal Cell mRNA Preparation A portion of each osteoclastoma was snap frozen in liquid nitrogen for mRNA preparation. The remainder of the tumor was dissociated using brief trypsinization and mechanical disaggregation, and placed into tissue culture.
These cells were expanded in Dulbecco's MEM (high glucose, Sigma) supplemented with 10% newborn calf serum (MA Bioproducts), gentamycin (0.5 mg/ml), 1-glutamine (2 mM) and non-essential amino acids (0.1 mM, Gibco). The stromal cell population was passaged at least five times, after which it showed a homogenous, fibroblastic-looking cell population that contained no multinucleated cells.
The stromal cells were mononuclear, tested negative for acid phosphatase, and tested variably alkaline phosphatase positive. These findings indicate that propagated stromal cells (i.e., stromal cells that are passaged in culture) are non-osteoclastic and non-activated.
Identification of DNA by Differential Screening of an Osteoclastoma cDNA
Library A total of 12,000 clones drawn from the osteoclastoma cDNA library were screened by differential hybridization, using mixed 32P labelled cDNA probes derived from (1) giant cell tumor mRNA (stromal cell+, OC~), and (2) mRNA from stromal cells (stromal cell+, OC-) cultivated from the same tumor. The probes were labelled with 32[P]dCTP by random priming to an activity of ~109CPM/~g. Of these 12,000 clones, 195 gave a positive hybridization signal with giant cell (i.e., osteoclast and stromal cell) mRNA, but not with stromal cell mRNA.
Additionally, these clones failed to hybridize to cDNA produced from mRNA derived from a variety of unrelated human cell types including epithelial cells, fibroblasts, lymphocytes, myelomonocytic cells, osteoblasts, and neuroblastoma cells. The failure of these clones to hybridize to cDNA produced from mRNA derived from other cell types supports the conclusion that these clones are either uniquely or selectively expressed in osteoclasts.
The osteoclast (0C) cDNA library was screened for differential hybridization to OC cDNA (stromal cell+, OC+) and stromal cell cDNA (stromal cell+, OC-) as follows:
NYTRAN filters (Schleicher & Schuell) were placed on agar plates containing growth medium and ampicillin. Individual bacterial colonies from the OC library were randomly picked and transferred, in triplicate, onto filters with preruled grids and then onto a master agar plate. Up to 200 colonies were inoculated onto a single 90-mm filter/plate using these techniques. The plates were inverted and incubated at 37°C until the bacterial inoculates had grown (on the filter) to a diameter of 0.5-1.0 mm.
The colonies were then lysed, and the DNA bound to the filters by first placing the filters on top of two pieces of Whatman 3MM paper saturated with 0.5 N
NaOH for 5 minutes. The filters were neutralized by placing on two pieces of Whatman 3MM paper saturated with 1 M Tris-HCL, pH 8.0 for 3-5 minutes.
S Neutralization was followed by incubation on another set of Whatman 3MM
papers saturated with 1M Tris-HCL, pH 8.0/1.5 M NaCI for 3-5 minutes. The filters were then washed briefly in 2 X SSC.
DNA was immobilized on the filters by baking the filters at 80°C
for 30 minutes. Filters were best used immediately, but they could be stored for up to one week in a vacuum jar at room temperature.
Filters were prehybridized in 5-8 ml of hybridization solution per filter, for 2-4 hours in a heat sealable bag. An additional 2 ml of solution was added for each additional filter added to the hybridization bag. The hybridization buffer consisted of 5 X SSC, 5 X Denhardt's solution, 1% SDS and 100 ~g/ml denatured heterologous DNA.
Prior to hybridization, labeled probe was denatured by heating in 1 X SSC
for 5 minutes at 100°C, then immediately chilled on ice. Denatured probe was added to the filters in hybridization solution, and the filters hybridized with continuous agitation for 12-20 hours at 65°C.
After hybridization, the filters were washed in 2 X SSC/0.2% SDS at 50-60°C for 30 minutes, followed by washing in 0.2 X SSC/0.2% SDS at 60°C for 60 minutes.The filters were then air dried and autoradiographed using an intensifying screen at -70°C overnight.
DNA Sequencing of Selected Clones Clones that hybridized to tumor cell cDNA, but not to stromal cell cDNA, were sequenced by the dideoxy chain termination method of Sanger et al., Proc.
Natl. Acad. Sci. USA 74:5463 (1977)) using sequenase (US Biochemical). The DNASIS (Hitatchi) program was used to carry out sequence analysis and a homology search in the GenBank/EMBL database.
For one of the clones identified by this method (designated clone 140), a full-length cDNA (designated RGS10B) was identified from an osteoclastoma cDNA
library. The nucleotide and predicted amino acid sequences of RGS l OB are shown in Figure 2. RGS l OB encodes a 181 amino acid polypeptide which appears to represent an isoform of a previously described molecule known as RGS10.
However, the amino terminus of the RGSlOB sequence is distinct from that of RGS 10, with only 3 of 16 residues identical (Figure 1 ). In contrast, residues 17-181 are identical in the two molecules, and RGS l OB possess the highly conserved RGS
domain. Additionally, the 3' untranslated region (UTR) of RGS l OB is also unique from that of RGS 10, indicating that RGS 1 OB is likely to be generated by alternative splicing of at least 2 novel exons.
Northern Blot Analysis Multiple tissue Northern analysis was performed using a 90-base pair probe to the amino terminus of RGS l OB. A 1 kb mRNA for RGS 1 OB was detected very strongly in the osteoclastoma tumor, but not in the stromal cell component of the tumor. RGS l OB mRNA was minimally expressed in HSB-2 (B lymphocyte cell line) and U937 (myelomonocytic cell line). All other human tissues and cell lines tested were negative for RGS 1 OB mRNA. These data demonstrate that RGS 1 OB
is selectively expressed in human osteoclasts.
While this invention has been particularly shown and described with reference to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
SEQUENCE LISTING
<110> Forsyth Dental Infirmary for Children <120> RGS10B, A G-Protein Regulator Expressed in Osteoclasts <130> 1564.1011-003 <150> US 60/154,930 <151> 1999-09-21 <160> 3 <170> FastSEQ for Windows Version 4.0 <210> 1 <211> 830 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <222> (1). .(830) <223> n = A,T,C or G
<400> 1 atgttcaacc gcgccgtgag ccggctgagc aggaagcggc cgccgtcaga catccacgac 60 agcgatggca gttccagcag cagccaccag agcctcaaga gcacagccaa atgggcggca 120 tccctggaga atctgctgga agacccagaa ggcgtgaaaa gatttaggga tttttaaaaa 180 aggaattcag tgaagaaaat gttttgtttt ggctagcatg tgaagatttt aagaaaatgc 240 aagataagac gcagatgcag gaaaaggcaa aggagatcta catgaccttt ctgtccagca 300 aggcctcatc acaggtcaac gtggaggggc agtctcggct caacgagaag atcctggaag 360 aaccgcaccc tctgatgttc cagaaactcc aggaccagat ctttaatctc atgaagtacg 420 acagctacag ccgcttctta aagtctgact tgtttttaaa acacaagcga accgaggaag 480 aggaagaaga tttgcctgat gctcaaactg cagctaaaag agcttccaga atttataaca 540 catgagcccc caaaaagccg ggactggsrg ctttaagawg caarggrmtt tcstntywgg 600 wccgtgscsg gttymtcatt gctytgtkat ttgtaaggac traartktrc aaaacccwwm 660 amygggawgt gtgtwttrtt aactgcttcc cccktaaact ttrmatgwtg swtraggkaw 720 cataaaaaan gaataatgat wacttggaag tttkagkttw caaaacagrg attcstkmaa 780 cmytggacac kkggngcatt ktgyagytta attaamccty atgymaggcc 830 <210> 2 <211> 181 <212> PRT
<213> Homo sapiens <400> 2 Met Phe Asn Arg Ala Val Ser Arg Leu Ser Arg Lys Arg Pro Pro Ser Asp Ile His Asp Ser Asp Gly Ser Ser Ser Ser Ser His Gln Ser Leu Lys Ser Thr Ala Lys Trp Ala Ala Ser Leu Glu Asn Leu Leu Glu Asp Pro Glu Gly Val Lys Arg Phe Arg Glu Phe Leu Lys Lys Glu Phe Ser Glu Glu Asn Val Leu Phe Trp Leu Ala Cys Glu Asp Phe Lys Lys Met Gln Asp Lys Thr Gln Met Gln Glu Lys Ala Lys Glu Ile Tyr Met Thr Phe Leu Ser Ser Lys Ala Ser Ser Gln Val Asn Val Glu Gly Gln Ser Arg Leu Asn Glu Lys Ile Leu Glu Glu Pro His Pro Leu Met Phe Gln Lys Leu Gln Asp Gln Ile Phe Asn Leu Met Lys Tyr Asp Ser Tyr Ser Arg Phe Leu Lys Ser Asp Leu Phe Leu Lys His Lys Arg Thr Glu Glu Glu Glu Glu Asp Leu Pro Asp Ala Gln Thr Ala Ala Lys Arg Ala Ser Arg Ile Tyr Asn Thr <210> 3 <211> 173 <212> PRT
<213> Unknown <220>
<223> Activator of Galphal GTPase activity <400> 3 Met Gln Ser Glu Leu Phe Phe Ala Asp Ile His Asp Ser Asp Gly Ser Ser Ser Ser Ser His Gln Ser Leu Lys Ser Thr Ala Lys Trp Ala Ala Ser Leu Glu Asn Leu Leu Glu Asp Pro Glu Gly Val Lys Arg Phe Arg Glu Phe Leu Lys Lys Glu Phe Ser Glu Glu Asn Val Leu Phe Trp Leu Ala Cys Glu Asp Phe Lys Lys Met Gln Asp Lys Thr Gln Met Gln Glu Lys Ala Lys Glu Ile Tyr Met Thr Phe Leu Ser Ser Lys Ala Ser Ser Gln Val Asn Val Glu Gly Gln Ser Arg Leu Asn Glu Lys Ile Leu Glu Glu Pro His Pro Leu Met Phe Gln Lys Leu Gln Asp Gln Ile Phe Asn Leu Met Lys Tyr Asp Ser Tyr Ser Arg Phe Leu Lys Ser Asp Leu Phe Leu Lys His Lys Arg Thr Glu Glu Glu Glu Glu Asp Leu Pro Asp Ala Gln Thr Ala Ala Lys Arg Ala Ser Arg Ile Tyr Asn Thr
Claims (19)
1. An isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
a) SEQ ID NO: 1;
b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
a) SEQ ID NO: 1;
b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
2. An isolated nucleic acid molecule according to Claim 1 which is DNA.
3. An isolated nucleic acid molecule consisting essentially of a nucleotide sequence selected from the group consisting of:
a) SEQ ID NO: 1; and b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
a) SEQ ID NO: 1; and b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
4. An isolated nucleic acid molecule consisting of a nucleotide sequence selected from the group consisting of:
a) SEQ ID NO: 1; and b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
a) SEQ ID NO: 1; and b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
5. A nucleic acid molecule which hybridizes under high stringency conditions to a nucleotide sequence selected from the group consisting of:
a) SEQ ID NO: 1; and b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
a) SEQ ID NO: 1; and b) the complement of SEQ ID NO: 1;
c) nucleotides 1-48 of SEQ ID NO: 1;
d) the complement of nucleotides 1-48 of SEQ ID NO: 1;
e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1.
6. A method for assaying the presence of a nucleic acid molecule in a sample, comprising contacting said sample with a second nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
a) SEQ ID NO: 1;
b) the complement of SEQ ID NO: 1;
c) nucleotides 1-16 of SEQ ID NO: 1;
d) the complement of nucleotides 1-16 of SEQ ID NO: 1, e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1, under conditions appropriate for selective hybridization.
a) SEQ ID NO: 1;
b) the complement of SEQ ID NO: 1;
c) nucleotides 1-16 of SEQ ID NO: 1;
d) the complement of nucleotides 1-16 of SEQ ID NO: 1, e) nucleotides 547-831 of SEQ ID NO: 1; and f) the complement of nucleotides 547-831 of SEQ ID NO: 1, under conditions appropriate for selective hybridization.
7. A vector comprising an isolated nucleic acid molecule according to Claim 1 operatively linked to a regulatory sequence.
8. A recombinant host cell comprising the vector of Claim 7.
9. A method for preparing a polypeptide encoded by an isolated nucleic acid molecule, comprising culturing the recombinant host cell of Claim 8 under conditions suitable for expression of said nucleic acid molecule.
10. An isolated polypeptide encoded by an isolated nucleic acid molecule according to Claim 1.
11. An isolated polypeptide comprising an amino acid sequence of SEQ ID NO:
2.
2.
12. An isolated polypeptide comprising amino acids 1-16 of sequence of SEQ ID
NO: 2.
NO: 2.
13. An isolated polypeptide comprising an amino acid sequence which is greater than about 93 percent identical to the amino acid sequence of SEQ ID NO: 2.
14. An antibody, or an antigen-binding fragment thereof, which selectively binds to the polypeptide encoded by an isolated nucleic acid molecule according to Claim 1, or to a portion of said polypeptide.
15. An antibody, or an antigen-binding fragment thereof, which selectively binds to the amino acid sequence of SEQ ID NO: 2 or to a portion of said amino acid sequence.
16. A method for assaying the presence of a polypeptide encoded by an isolated nucleic acid molecule according to Claim 1 in a sample, comprising contacting said sample with an antibody which specifically binds to the encoded polypeptide.
17. A method of inhibiting G protein function in a cell comprising enhancing expression of RGS10B or active portion thereof in said cell, whereby RGS10B activates the intrinsic GTPase activity of a G.alpha. subunit of said G
protein, thereby inhibiting G protein activity.
protein, thereby inhibiting G protein activity.
18. A method of activating the GTPase activity of a G.alpha. subunit comprising contacting said G.alpha. subunit with RGS10B or active portion thereof.
19. A method of activating the GTPase activity of a G.alpha. subunit in a cell comprising enhancing expression of RGS10B or active portion thereof in said cell.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15493099P | 1999-09-21 | 1999-09-21 | |
US60/154,930 | 1999-09-21 | ||
PCT/US2000/025905 WO2001021797A1 (en) | 1999-09-21 | 2000-09-21 | Rgs10b, a g-protein regulator expressed in osteoclasts |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2385460A1 true CA2385460A1 (en) | 2001-03-29 |
Family
ID=22553427
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002385460A Abandoned CA2385460A1 (en) | 1999-09-21 | 2000-09-21 | Rgs10b, a g-protein regulator expressed in osteoclasts |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1214412A1 (en) |
JP (1) | JP2003510048A (en) |
CA (1) | CA2385460A1 (en) |
WO (1) | WO2001021797A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006096690A2 (en) * | 2005-03-07 | 2006-09-14 | University Of Rochester | Compositions and methods for inhibiting g protein signaling |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996038462A1 (en) * | 1995-06-02 | 1996-12-05 | Massachusetts Institute Of Technology | Regulators of g-protein signalling |
JPH09299092A (en) * | 1996-03-12 | 1997-11-25 | Takeda Chem Ind Ltd | New protein and its dna |
US6020142A (en) * | 1996-10-04 | 2000-02-01 | Millennium Pharmaceuticals, Inc. | Rath genes and polypeptides and methods for the treatment and diagnosis of immune disorders |
US5955314A (en) * | 1996-11-08 | 1999-09-21 | Incyte Pharmaceuticals, Inc. | Regulator of cell signaling |
US5882890A (en) * | 1997-03-31 | 1999-03-16 | Incyte Pharmaceuticals, Inc. | Nucleic acids encoding novel regulators of G-protein signaling |
CN1293709A (en) * | 1998-03-18 | 2001-05-02 | 昂尼克斯药物公司 | Identification of factors which mediate the interaction of heterotrimeric G proteins and monomeric G proteins |
-
2000
- 2000-09-21 WO PCT/US2000/025905 patent/WO2001021797A1/en not_active Application Discontinuation
- 2000-09-21 EP EP00966786A patent/EP1214412A1/en not_active Withdrawn
- 2000-09-21 CA CA002385460A patent/CA2385460A1/en not_active Abandoned
- 2000-09-21 JP JP2001525355A patent/JP2003510048A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
WO2001021797A1 (en) | 2001-03-29 |
EP1214412A1 (en) | 2002-06-19 |
JP2003510048A (en) | 2003-03-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2002095049A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US6815181B2 (en) | Nucleic acid molecules encoding human secreted hemopexin-related proteins | |
WO2003004676A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US6753174B2 (en) | Isolated human lactate dehydrogenase proteins | |
CA2385460A1 (en) | Rgs10b, a g-protein regulator expressed in osteoclasts | |
US6890717B1 (en) | OBP1: a novel protein selectively expressed in osteoblasts | |
WO2003006485A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US6489456B1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US20040038282A1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
EP1409522A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US20030022208A1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US20050095589A1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US20050075283A1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
US20030013151A1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
JP2003523202A (en) | Isolated human phosphatase protein, nucleic acid molecule encoding human phosphatase protein and method of using same | |
US20050048560A1 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
EP1392343A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
EP1451576A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof | |
WO2002077260A2 (en) | Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |