EP1007540A1 - DNA molecules encoding human nuclear receptor proteins
- Publication number
- EP1007540A1, EP98943441A
- Authority
- EP
- European Patent Office
- Prior art keywords
- nnr2
- protein
- human
- nnr1
- expression vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/70567—Nuclear receptors, e.g. retinoic acid receptor [RAR], RXR, nuclear orphan receptors
Definitions
- Figure 7A-C shows the nucleotide sequence (SEQ ID NO:5) which comprises the open reading frame encoding the human nuclear receptor protein, nNR2.
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- High Energy & Nuclear Physics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present invention discloses the isolation and characterization of cDNA molecules encoding two human nuclear receptor proteins, designated nNR1, nNR2 and/or nNR2-1. Also within the scope of the disclosure are recombinant vectors, recombinant host cells, methods of screening for modulators of nNR1, nNR2 and/or nNR2-1 activity, and production of antibodies against nNR1, nNR2 and/or nNR2-1, or epitopes thereof.
Description
TITLE OF THE INVENTION
DNA MOLECULES ENCODING HUMAN NUCLEAR
RECEPTOR PROTEINS
CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation of U.S. Provisional Application Serial No. 60/078,633, filed March 19, 1998 which is a continuation-in-part of U.S. Provisional Application Serial No. 60/062,902, filed October 21, 1997, which is a continuation-in-part of U.S. Provisional Application Serial No. 60/057,090, filed August 27, 1997.
STATEMENT REGARDING FEDERALLY-SPONSORED R&D
Not applicable
REFERENCE TO MICROFICHE APPENDIX
Not applicable.
FIELD OF THE INVENTION
The present invention relates in part to isolated nucleic acid molecules (polynucleotides) which encode human nuclear receptor proteins, referred to throughout as nNR1, nNR2 and/or nNR2-1. The present invention also relates to recombinant vectors and recombinant hosts which contain a DNA fragment encoding nNR1, nNR2 and/or nNR2-1, substantially purified forms of associated human nNR1, nNR2 and/or nNR2-1 protein, human mutant proteins, and methods associated with identifying compounds which modulate nNR1, nNR2 and/or nNR2-1 activity.
BACKGROUND OF THE INVENTION
Members of the nuclear receptor superfamily, which includes the steroid hormone receptors, are transcription factors inducible by small chemical ligands, and have been shown to play roles in controlling development, differentiation and physiological function. Isolation of cDNA clones encoding nuclear receptors reveals several characteristics. First, the NH2-terminal regions, which vary in length between receptors, are hypervariable, with low homology between family members. Second, there are three internal regions of conservation, referred to as regions I, II and III. Region I is a cysteine-rich region which is referred to as the DNA binding domain (DBD). Regions II and III lie within the COOH-terminal region of the protein, which is also referred to as the ligand binding domain (LBD). For a review, see Power et al. (1992, Trends in Pharmacological Sciences 13: 318-323).
The lipophilic hormones that activate steroid receptors are known to be associated with human diseases. Therefore, the respective nuclear receptors have been identified as possible targets for therapeutic intervention. For a review of the mechanism of action of various steroid hormone receptors, see Tsai and O'Malley (1994, Annu. Rev. Biochem. 63: 451-486).
Recent work with non-steroid nuclear receptors has also shown their potential as drug targets for therapeutic intervention. This work reports that peroxisome proliferator activated receptor γ (PPARγ), identified by a conserved DBD region, promotes adipocyte differentiation upon activation and that thiazolidinediones, a class of antidiabetic drugs, function through PPARγ (Tontonoz et al., 1994, Cell 79: 1147-1156; Lehmann et al., 1995, J. Biol. Chem. 270(22): 12953-12956; Teboul et al., 1995, J. Biol. Chem. 270(47): 28183-28187). This indicates that PPARγ plays a role in glucose homeostasis and lipid metabolism.
Giguere, et al. (1988, Nature 331: 91-94) isolated two cDNAs, each encoding a human nuclear receptor, referred to as hERR1 and hERR2. The authors did not assign a ligand, or any subsequent ligand-inducible function, to either of these human nuclear receptors.
Trapp and Holsboer (1996, J. Biol. Chem. 271(17): 9879-9882) show that hERR2 acts as a cell-specific inhibitor of glucocorticoid receptor-mediated gene expression.
It would be advantageous to identify a gene encoding an additional human nuclear receptor protein. A nucleic acid molecule expressing a human nuclear receptor protein will be useful in screening for compounds acting as a modulator of cell differentiation, cell development and physiological function. The present invention addresses and meets these needs by disclosing isolated nucleic acid molecules which express a human nuclear receptor protein which will have a role in cell differentiation and development.
SUMMARY OF THE INVENTION
The present invention relates to isolated nucleic acid molecules (polynucleotides) which encode novel nuclear receptor proteins, preferably human nuclear receptor proteins, such as the human nuclear receptor proteins exemplified and referred to throughout this specification as nNR1, nNR2 and/or nNR2-1.
The present invention also relates to isolated nucleic acid fragments of nNR1 (SEQ ID NO:1) and nNR2 (SEQ ID NO:3) which encode mRNA expressing a biologically active novel human nuclear receptor. Any such nucleic acid fragment will encode either a protein or protein fragment comprising at least an intracellular DNA-binding domain and/or ligand binding domain, domains which are conserved throughout the human nuclear receptor family and which exist in nNR1 (SEQ ID NO:2) and nNR2 (SEQ ID NO:4). Any such polynucleotide includes but is not necessarily limited to nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal truncations such that these mutations encode mRNA which expresses a protein or protein fragment of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists for nNR1, nNR2 and/or nNR2-1 function.
The isolated nucleic acid molecule of the present invention may include a deoxyribonucleic acid molecule (DNA), such as genomic DNA and complementary DNA (cDNA), which may be single (coding or noncoding strand) or double stranded, as well as synthetic DNA, such as a synthesized, single stranded polynucleotide. The isolated nucleic acid molecule of the present invention may also include a ribonucleic acid molecule (RNA).
The present invention also relates to recombinant vectors and recombinant hosts, both prokaryotic and eukaryotic, which contain the substantially purified nucleic acid molecules disclosed throughout this specification.
A preferred aspect of the present invention is disclosed in Figure 1A-C and SEQ ID NO:1, a human cDNA encoding a novel nuclear trans-acting receptor protein, nNR1.
Another preferred aspect of the present invention is disclosed in Figure 4A-C and SEQ ID NO:3, a human cDNA encoding a novel nuclear trans-acting receptor protein, nNR2.
Another preferred aspect of the present invention is disclosed in Figure 7A-C and SEQ ID NO:5, a human cDNA encoding a truncated version of nNR2, referred to as nNR2-1.
The present invention also relates to a substantially purified form of the novel nuclear trans-acting receptor protein, nNR1, which is disclosed in Figures 2A-F and Figure 3 and as set forth in SEQ ID NO:2.
The present invention also relates to biologically active fragments and/or mutants of nNR1 as set forth as SEQ ID NO:2, including but not necessarily limited to amino acid substitutions, deletions, additions, amino terminal truncations and carboxy-terminal truncations such that these mutations provide for proteins or protein fragments of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists of nNR1 function.
The present invention also relates to a substantially purified form of the novel nuclear trans-acting receptor protein, nNR2, which is disclosed in Figure 5A-H and Figure 6 and as set forth in SEQ ID NO:4.
The present invention also relates to biologically active fragments and/or mutants of nNR2 as set forth as SEQ ID NO:4, including but not necessarily limited to amino acid substitutions, deletions, additions, amino terminal truncations and carboxy-terminal truncations such that these mutations provide for proteins or protein fragments of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists of nNR2 function.
A preferred aspect of the present invention is disclosed in Figure 3 and is set forth as SEQ ID NO:2, the amino acid sequence of the novel nuclear trans-acting receptor protein, nNR1.
A preferred aspect of the present invention is disclosed in Figure 6 and is set forth as SEQ ID NO:4, the amino acid sequence of the novel nuclear trans-acting receptor protein, nNR2.
A preferred aspect of the present invention is disclosed in Figure 8 and is set forth as SEQ ID NO:6, the amino acid sequence of a truncated version of nNR2, referred to as nNR2-1.
The present invention also relates to polyclonal and monoclonal antibodies raised in response to either the human form of nNR1, nNR2 and/or nNR2-1 disclosed herein, or a biologically active fragment thereof. It will be especially preferable to raise antibodies against epitopes within the NH2-terminal domain of nNR1, nNR2 and/or nNR2-1, which show the least homology to other known proteins belonging to the human nuclear receptor superfamily.
To this end, the DNA molecules, RNA molecules, recombinant protein and antibodies of the present invention may be used to screen and measure levels of human nNR1, nNR2 and/or nNR2-1. The recombinant proteins, DNA molecules, RNA molecules and antibodies lend themselves to the formulation of kits suitable for the detection and typing of human nNR1, nNR2 and/or nNR2-1.
The present invention also relates to isolated nucleic acid molecules which are fusion constructions expressing fusion proteins useful in assays to identify compounds which modulate wild-type human nNR1, nNR2 and/or nNR2-1 activity. A preferred aspect of this portion of the invention includes, but is not limited to, glutathione S-transferase (GST) fusion constructs such as GST-nNR1 and/or GST-nNR2. These fusion constructs include, but are not limited to, all or a portion of the ligand-binding domain of nNR1, nNR2 and/or nNR2-1, respectively, as an in-frame fusion at the carboxy terminus of the GST gene. The disclosure of SEQ ID NOS: 1-4 allows the artisan of ordinary skill to construct any such nucleic acid molecule encoding a GST-nuclear receptor fusion protein. Soluble recombinant GST-nuclear receptor fusion proteins may be expressed in various expression systems, including Spodoptera frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus expression vector (e.g., Bac-N-Blue DNA from Invitrogen or pAcG2T from Pharmingen).
It is an object of the present invention to provide an isolated nucleic acid molecule which encodes a novel form of a nuclear receptor protein such as human nNR1 and/or human nNR2, human nuclear receptor protein fragments of full length proteins such as nNR1, nNR2 and/or nNR2-1, and mutants which are derivatives of SEQ ID NO:2 and SEQ ID NO:4. Any such polynucleotide includes but is not necessarily limited to nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal truncations such that these mutations encode mRNA which expresses a protein or protein fragment of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists for nNR1, nNR2 and/or nNR2-1 function.
It is a further object of the present invention to provide the human nuclear receptor proteins or protein fragments encoded by the nucleic acid molecules referred to in the preceding paragraph.
It is a further object of the present invention to provide recombinant vectors and recombinant host cells which comprise a nucleic acid sequence encoding human nNR1, nNR2 and/or nNR2-1 or a biological equivalent thereof.
It is an object of the present invention to provide a substantially purified form of nNR1, as set forth in SEQ ID NO:2.
It is an object of the present invention to provide for biologically active fragments and/or mutants of nNR1, including but not necessarily limited to amino acid substitutions, deletions, additions, amino terminal truncations and carboxy-terminal truncations such that these mutations provide for proteins or protein fragments of diagnostic, therapeutic or prophylactic use.
It is an object of the present invention to provide a substantially purified form of nNR2, as set forth in SEQ ID NO:4.
It is an object of the present invention to provide for biologically active fragments and/or mutants of nNR2, including but not necessarily limited to amino acid substitutions, deletions, additions, amino terminal truncations and carboxy-terminal truncations such that these mutations provide for proteins or protein fragments of diagnostic, therapeutic or prophylactic use.
It is also an object of the present invention to provide for nNR1- and/or nNR2-based in-frame fusion constructions, methods of expressing these fusion constructions and biological equivalents disclosed herein, related assays, recombinant cells expressing these constructs and agonistic and/or antagonistic compounds identified through the use of DNA molecules encoding human nuclear receptor proteins such as nNR1, nNR2 and/or nNR2-1.
As used herein, "DBD" refers to DNA binding domain. As used herein, "LBD" refers to ligand binding domain.
As used herein, the term "mammalian host" refers to any mammal, including a human being.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1A-C shows the nucleotide sequence (SEQ ID NO:1) which comprises the open reading frame encoding the human nuclear receptor protein, nNR1.
Figure 2A-F shows the nucleotide sequence of the double stranded cDNA molecule (SEQ ID NO:1 and SEQ ID NO:29) which encodes nNR1, and the amino acid sequence of nNR1 (SEQ ID NO:2). The region in bold and underline is the DNA binding domain.
Figure 3 shows the amino acid sequence of nNR1 (SEQ ID NO:2). The region in bold and underline is the DNA binding domain.
Figure 4A-C shows the nucleotide sequence (SEQ ID NO:3) which comprises the open reading frame encoding the human nuclear receptor protein, nNR2.
Figure 5A-H shows the nucleotide sequence of the double stranded cDNA molecule (SEQ ID NO:1 and SEQ ID NO:29) which encodes nNR2, and the amino acid sequence of nNR2 (SEQ ID NO:4). The region in bold and underline is the DNA binding domain.
Figure 6 shows the amino acid sequence of nNR2 (SEQ ID NO:4). The region in bold and underline is the DNA binding domain.
Figure 7A-C shows the nucleotide sequence (SEQ ID NO:5) which comprises the open reading frame encoding nNR2-1, the truncated version of the human nuclear receptor protein nNR2.
Figure 8 shows the amino acid sequence of nNR2-1, a carboxy-terminal truncated version of nNR2 (SEQ ID NO:6). The region in bold and underline is the DNA binding domain.
DETAILED DESCRIPTION OF THE INVENTION
The present invention relates to isolated nucleic acid and protein forms which represent nuclear receptors, preferably but not necessarily limited to human receptors. These expressed proteins are novel nuclear receptors which are useful in the identification of downstream target genes and ligands regulating their activity. The nuclear receptor superfamily is composed of a group of structurally related receptors which are regulated by chemically distinct ligands. The common structure for a nuclear receptor is a highly conserved DNA binding domain (DBD) located in the center of the peptide and the ligand-binding domain (LBD) at the COOH-terminus. Eight out of the nine non-variant cysteines form two type II zinc fingers which distinguish nuclear receptors from other DNA-binding proteins. The DBDs share at least 50% to 60% amino acid sequence identity even among the most distant members in vertebrates. The superfamily has been expanded within the past decade to contain approximately 25 subfamilies. An EST database search using whole peptide sequences of several representative subfamily members was used to identify two human ESTs (GenBank accession numbers h91890 and w26275, corresponding to nNR1 and to nNR2 and/or nNR2-1, respectively). The sequence information from each EST was utilized to isolate and characterize the full length cDNA for the gene corresponding to nNR1 (see Figure 1A-C and SEQ ID NO:1) and nNR2 (see Figure 4A-C and SEQ ID NO:3). The cDNA of SEQ ID NO:1 encodes nNR1, a protein 500 amino acids in length (Figure 3; SEQ ID NO:2), which has a distinctive DBD structure (Figure 2A-F). The cDNA of SEQ ID NO:3 encodes nNR2, a protein 458 amino acids in length (Figure 6; SEQ ID NO:4), which also has a distinctive DBD structure (Figure 5A-H). The cDNA of SEQ ID NO:5 encodes nNR2-1, a protein 418 amino acids in length (Figure 8; SEQ ID NO:6), which is a carboxy-terminal truncated version of nNR2. The protein nNR2-1 also has a distinctive DBD structure (Figure 8).
The nNR1 protein shows 95% homology to hERR2 (Giguere, et al., 1988, Nature 331: 91-94) in the overlapping peptide region. However, nNR1 contains an additional 67 amino acids at the carboxy-terminus in comparison to hERR2. The gene encoding nNR1 is located on locus 14q24.3 ~ 14q31, which is the Alzheimer disease gene 3 (AD3) locus. Therefore, nNR1 may be an endogenous modulator of glucocorticoid receptor (GR) in view of data showing that hERR2 represses GR activity. nNR2 and nNR2-1 share 77% and 75% homology, respectively, at the amino acid level to hERR2 (Giguere, et al., 1988, Nature 331: 91-94) in the overlapping region. The nNR2 and nNR1 proteins show 77% homology at the amino acid level. The gene encoding nNR2 is located on chromosome 1. Both genes are expressed at very low levels in the majority of the tissues examined via RT-PCR.
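As a rough illustration of how a percentage of this kind can be computed, the following Python sketch counts identical residues over an ungapped alignment. The two 37-residue segments are copied from SEQ ID NO:2 and SEQ ID NO:4 as listed in this specification and end at the start of the cysteine-rich region. The choice of segment, the function name `percent_identity` and the ungapped comparison are assumptions made for illustration only; the specification does not state which alignment method produced the homology values reported above, so the figure printed here is not expected to reproduce them.

```python
# Hypothetical helper (not from the specification): percent identity over an
# ungapped alignment of two equal-length protein segments.
def percent_identity(seq_a: str, seq_b: str) -> float:
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where both segments carry the same residue.
    identical = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * identical / len(seq_a)


# Segments copied from SEQ ID NO:2 (nNR1) and SEQ ID NO:4 (nNR2); the segment
# boundaries are an illustrative assumption.
NNR1_SEGMENT = "KSYEDCASGIMEDSAIKCEYMLNAIPKRLCLVCGDIA"
NNR2_SEGMENT = "KLYDDCSSTIVEDPQTKCEYMLNSMPKRLCLVCGDIA"

if __name__ == "__main__":
    print(f"{percent_identity(NNR1_SEGMENT, NNR2_SEGMENT):.1f}% identity")
```

A full-length comparison would normally use a gapped alignment, since the two proteins differ in length.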
Therefore, the present invention also relates to isolated nucleic acid fragments of nNR1 (SEQ ID NO:1) and nNR2 (SEQ ID NO:3) which encode mRNA expressing a biologically active novel human nuclear receptor. Any such nucleic acid fragment will encode either a protein or protein fragment comprising at least an intracellular DNA-binding domain and/or ligand binding domain, domains which are conserved throughout the human nuclear receptor family and which exist in nNR1 (SEQ ID NO:2) and nNR2 (SEQ ID NO:4). Any such polynucleotide includes but is not necessarily limited to nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal truncations such that these mutations encode mRNA which expresses a protein or protein fragment of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists for nNR1, nNR2 and/or nNR2-1 function. Such a nucleic acid fragment is exemplified as an altered version of the DNA fragment encoding nNR2. This DNA molecule (as set forth in SEQ ID NO:5) is identical to SEQ ID NO:3 save for a two nucleotide insertion at nucleotide 1352 of SEQ ID NO:3. This insertion results in a shifted reading frame and introduction of a TGA termination codon 33 nucleotides from the insertion site, resulting in an open reading frame which encodes the carboxy-truncated nNR2 protein, nNR2-1, as shown in Figure 8 and SEQ ID NO:6.
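The effect of the two-nucleotide insertion on the reading frame can be checked with a short Python sketch. The two excerpts below are copied from SEQ ID NO:3 and SEQ ID NO:5 as listed in this specification, each starting at the in-frame CAG codon shortly upstream of the insertion. The function name `translate`, the choice of excerpt and the use of the standard genetic code are assumptions made for illustration; the sketch shows only the shift in reading frame and the resulting early TGA stop, not the nucleotide numbering.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon (TAA, TAG or TGA)
            break
        protein.append(aa)
    return "".join(protein)


# Excerpt of SEQ ID NO:3 (nNR2), starting at an in-frame CAG codon.
NNR2_EXCERPT = "CAGCACATGGAAGACCCTCGTCGAGCTGGCAAGATGCTGATGACACTG"
# Same region of SEQ ID NO:5 (nNR2-1), which carries two extra nucleotides.
NNR2_1_EXCERPT = "CAGCACATGGAGAAGACCCTCGTCGAGCTGGCAAGATGCTGATGACACTG"

if __name__ == "__main__":
    print("nNR2  :", translate(NNR2_EXCERPT))
    print("nNR2-1:", translate(NNR2_1_EXCERPT))
```

In the nNR2 excerpt the frame reads through to the end, matching the QHMEDPRRAGKMLMTL stretch of SEQ ID NO:4, whereas in the nNR2-1 excerpt the shifted frame reaches a TGA codon after thirteen residues.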
The isolated nucleic acid molecule of the present invention may include a deoxyribonucleic acid molecule (DNA), such as genomic DNA and complementary DNA (cDNA), which may be single (coding or noncoding strand) or double stranded, as well as synthetic DNA, such as a synthesized, single stranded polynucleotide. The isolated nucleic acid molecule of the present invention may also include a ribonucleic acid molecule (RNA).
The present invention also relates to recombinant vectors and recombinant hosts, both prokaryotic and eukaryotic, which contain the substantially purified nucleic acid molecules disclosed throughout this specification.
A preferred aspect of the present invention is disclosed in Figure 1A-C and SEQ ID NO:l, a human cDNA encoding a novel nuclear trans-acting receptor protein, nNRl, disclosed as follows: GAATATGATG ACCCTAATGC AACAATATCT AACATACTAT CCGAGCTTCG GTCATTTGGA AGAACTGCAG ATTTTCCTCC TTCAAAATTA AAGTCAGGTT ATGGAGAACA TGTATGCTAT GTTCTTGATT GCTTCGCTGA AGAAGCATTG AAATATATTG GTTTCACCTG GAAAAGGCCA ATATACCCAG TAGAAGAATT AGAAGAAGAA AGCGTTGCAG AAGATGATGC AGAATTAACA TTAAATAAAG TGGATGAAGA ATTTGTGGAA GAAGAGACAG ATAATGAAGA AAACTTTATT GATCTCAACG TTTTAAAGGC CCAGACATAT CACTTGGATA TGAACGAGAC TGCCAAACAA GAAGATATTT TGGAATCCAC AACAGATGCT GCAGAATGGA GCCTAGAAGT GGAACGTGTA CTACCGCAAC TGAAAGTCAC GATTAGGACT GACAATAAGG ATTGGAGAAT CCATGTTGAC CAAATGCACC AGCACAGAAG TGGAATTGAA TCTGCTCTAA AGGAGACCAA GGGATTTTTG GACAAACTCC ATAATGAAAT TACTAGGACT TTGGAAAAGA TCAGCAGCCG AGAAAAGTAC ATCAACAATC AGCCGGGAGC CCATGGAGCA CTGTCCTCAG AGATGCGCAG GTTAGGCTCA CTGTCTAGGC CAGGCCCACC TTAGTCACTG TGGACTGGCA ATGGAAGCTC TTCCTGGACA CACCTGCCCT AGCCCTCACC CTGGGGTGGA AGAGAAATGA GCTTGGCTTG CAACTCAGAC CATTCCACGG AGGCATCCTC
CCCTTCCCTG GGCTGGTGAA TAAAAGTTTC CTGAGGTCAA GGACTTCCTT TTCCCTGCCA AAATGGTGTC CAGAACTTTG AGGCCAGAGG TGATCCAGTG ATTTGGGAGC TGCAGGTCAC ACAGGCTGCT CAGAGGGCTG CTGAACAGGA TGTCCTCGGA CGACAGGCAC CTGGGCTCCA GCTGCGGCTC CTTCATCAAG ACTGAGCCGT CCAGCCCGTC CTCGGGCATA GATGCCCTCA GCCACCACAG CCCCAGTGGC TCGTCCGACG CCAGCGGCGG CTTTGGCCTG GCCCTGGGCA CCCACGCCAA CGGTCTGGAC TCGCCACCCA TGTTTGCAGG CGCCGGGCTG GGAGGCACCC CATGCCGCAA GAGCTACGAG GACTGTGCCA GCGGCATCAT GGAGGACTCG GCCATCAAGT GCGAGTACAT GCTCAACGCC ATCCCCAAGC GCCTGTGCCT CGTGTGCGGG GACATTGCCT CTGGCTACCA CTACGGCGTG GCCTCCTGCG AGGCTTGCAA GGCCTTCTTC AAGAGGACTA TCCAAGGGAA CATTGAGTAC AGCTGCCCGG CCACCAACGA GTGCGAGATC ACCAAACGGA GGCGCAAGTC CTGCCAGGCC TGCCGCTTCA TGAAATGCCT CAAAGTGGGG ATGCTGAAGG AAGGTGTGCG CCTTGATCGA GTGCGTGGAG GCCGTCAGAA ATACAAGCGA CGGCTGGACT CAGAGAGCAG CCCATACCTG AGCTTACAAA TTTCTCCACC TGCTAAAAAG CCATTGACCA AGATTGTCTC ATACCTACTG GTGGCTGAGC CGGACAAGCT CTATGCCATG CCTCCCCCTG GTATGCCTGA GGGGGACATC AAGGCCCTGA CCACTCTCTG TGACCTGGCA GACCGAGAGC TTGTGGTCAT CATTGGCTGG GCCAAGCACA TCCCAGGCTT CTCAAGCCTC TCCCTGGGGG ACCAGATGAG CCTGCTGCAG AGTGCCTGGA TGGAAATCCT CATCCTGGGC ATCGTGTACC GCTCGCTGCC CTACGACGAC AAGCTGGTGT ACGCTGAGGA CTACATCATG GATGAGGAGC ACTCCCGCCT CGCGGGGCTG CTGGAGCTCT ACCGGGCCAT CCTGCAGCTG GTACGCAGGT ACAAGAAGCT CAAGGTGGAG AAGGAGGAGT TTGTGACGCT CAAGGCCCTG GCCCTCGCCA ACTCCGATTC CATGTACATC GAGGATCTAG AGGCTGTCCA GAAGCTGCAG GACCTGCTGC ACGAGGCACT GCAGGACTAC GAGCTGAGCC AGCGCCATGA GGAGCCCTGG AGGACGGGCA AGCTGCTGCT GACACTGCCG CTGCTGCGGC AGACGGCCGC CAAGGCCGTG CAGCACTTCT ATAGCGTCAA ACTGCAGGGC AAAGTGCCCA TGCACAAACT CTTCCTGGAG ATGCTGGAGG CCAAGGCCTG GGCCAGGGCT GACTCCCTTC AGGAGTGGAG GCCACTGGAG CAAGTGCCCT CTCCCCTCCA CCGAGCCACC AAGAGGCAGC ATGTGCATTT CCTAACTCCC TTGCCCCCTC CCCCATCTGT GGCCTGGGTG GGCACTGCTC AGGCTGGATA CCACCTGGAG GTTTTCCTTC CGCAGAGGGC AGGTTGGCCA AGAGCAGCTT AGAGGATCTC CCAAGGATGA AAGAATGTCA AGCCATGATG GAAAATGCCC CTTCCAATCA GCTGCCTTCA CAAGCAGGGA TCAGAGCAAC TCCCCGGGGA
TCCCCAATCC ACGCCCTTCT AGTCCAACCC CCCTCAATGA GAGAGGCAGG CAGATCTCAC CCAGCACTAG GACACCAGGA GGCCAGGGAA AGCATCTCTG GCTCACCATG TAACATCTGG CTTGGAGCAA GTGGGTGTTC TGCACACCAG GCAGCTGCAC CTCACTGGAT CTAGTGTTGC TGCGAGTGAC CTCACTTCAG AGCCCCTCTA GCAGAGTGGG GCGGAAGTCC TGATGGTTGG TGTCCATGAG GTGGAAG (SEQ ID NO : 1 ) .
Another preferred aspect of the present invention is disclosed in Figure 4A-C and SEQ ID NO:3, a human cDNA encoding a novel nuclear trans-acting receptor protein, nNR2, disclosed as follows: GCGGGCCGCC AGTGTGGTGG AATTCGGCTT GTCACTAGGA GAACATTTGT GTTAATTGCA CTGTGCTCTG TCAAGGAAAC TTTGATTTAT AGCTGGGGTG CACAAATAAT GGTTGCCGGT CGCACATGGA TTCGGTAGAA CTTTGCCTTC CTGAATCTTT TTCCCTGCAC TACGAGGAAG AGCTTCTCTG CAGAATGTCA AACAAAGATC GACACATTGA TTCCAGCTGT TCGTCCTTCA TCAAGACGGA ACCTTCCAGC CCAGCCTCCC TGACGGACAG CGTCAACCAC CACAGCCCTG GTGGCTCTTC AGACGCCAGT GGGAGCTACA GTTCAACCAT GAATGGCCAT CAGAACGGAC TTGACTCGCC ACCTCTCTAC CCTTCTGCTC CTATCCTGGG AGGTAGTGGG CCTGTCAGGA AACTGTATGA TGACTGCTCC AGCACCATTG TTGAAGATCC CCAGACCAAG TGTGAATACA TGCTCAACTC GATGCCCAAG AGACTGTGTT TAGTGTGTGG TGACATCGCT TCTGGGTACC ACTATGGGGT AGCATCATGT GAAGCCTGCA AGGCATTCTT CAAGAGGACA ATTCAAGGCA ATATAGAATA CAGCTGCCCT GCCACGAATG AATGTGAAAT CACAAAGCGC AGACGTAAAT CCTGCCAGGC TTGCCGCTTC ATGAAGTGTT TAAAAGTGGG CATGCTGAAA GAAGGGGTGC GTCTTGACAG AGTACGTGGA GGTCGGCAGA AGTACAAGCG CAGGATAGAT GCGGAGAACA GCCCATACCT GAACCCTCAG CTGGTTCAGC CAGCCAAAAA GCCATATAAC AAGATTGTCT CACATTTGTT GGTGGCTGAA CCGGAGAAGA TCTATGCCAT GCCTGACCCT ACTGTCCCCG ACAGTGACAT CAAAGCCCTC ACTACACTGT GTGACTTGGC CGACCGAGAG TTGGTGGTTA TCATTGGATG GGCGAAGCAT ATTCCAGGCT TCTCCACGCT GTCCCTGGCG GACCAGATGA GCCTTCTGCA GAGTGCTTGG ATGGAAATTT TGATCCTTGG TGTCGTATAC CGGTCTCTTT CATTTGAGGA TGAACTTGTC TATGCAGACG ATTATATAAT GGACGAAGAC CAGTCCAAAT TAGCAGGCCT TCTTGATCTA AATAATGCTA TCCTGCAGCT GGTAAAGAAA TACAAGAGCA TGAAGCTGGA AAAAGAAGAA TTTGTCACCC TCAAAGCTAT AGCTCTTGCT AATTCAGACT CCATGCACAT AGAAGATGTT GAAGCCGTTC AGAAGCTTCA
GGATGTCTTA CATGAAGCGC TGCAGGATTA TGAAGCTGGC CAGCACATGG AAGACCCTCG TCGAGCTGGC AAGATGCTGA TGACACTGCC ACTCCTGAGG CAGACCTCTA CCAAGGCCGT GCAGCATTTC TACAACATCA AACTAGAAGG CAAAGTCCCA ATGCACAAAC TTTTTTTGGA AATGTTGGAG GCCAAGGTCT GACTAAAAGC TCCCTGGGCC TTCCCATCCT TCATGTTGAA AAAGGGAAAA TAAACCCAAG AGTGATGTCG AAGAAACTTA GAGTTTAGTT AACAACATCA AAAATCAACA GACTGCACTG ATAATTTAGC AGCAAGACTA TGAAGCAGCT TTCAGATTCC TCCATAGGTT CCTGATGAGT TCTTTCTACT TTCTCCATCA TCTTCTTTCC TCTTTCTTCC CACATTTCTC TTTCTCTTTA TTTTTTCTCC TTTTCTTCTT TCACCTCCCT TATTTCTTTG CTTCTTTCAT TCCTAGTTCC CATTCTCCTT TATTTTCTTC CCGTCTGCCT GCCTTCTTTC TTTTCTTTAC CTACTCTCAT TCCTCTCTTT TCTCATCCTT CCCCTTTTTT CTAAATTTGA AATAGCTTTA GTTTAAAAAA AAAAATCCTC CCTTCCCCCT TTCCTTTCCC TTTCTTTCCT TTTTCCCTTT CCTTTTCCCT TTCCTTTCCT TTCCTCTTGA CCTTCTTTCC ATCTTTCTTT TTCTTCCTTC TGCTGCTGAA CTTTTAAAAG AGGTCTCTAA CTGAAGAGAG ATGGAAGCCA GCCCTGCCAA AGGATGGAGA TCCATAATAT GGATGCCAGT GAACTTATTG TGAACCATAC CGTCCCCAAT GACTAAGGAA TCAAAGAGAG AGAACCAACG TTCCTAAAAG TACAGTGCAA CATATACAAA TTGACTGAGT GCAGTATTAG ATTTCATGGG AGCAGCCTCT AATTAGACAA CTTAAGCAAC GTTGCATCGG CTGCTTCTTA TCATTGCTTT TCCATCTAGA TCAGTTACAG CCATTTGATT CCTTAATTGT TTTTTCAAGT CTTCCAGGTA TTTGTTAGTT TAGCTACTAT GTAACTTTTT CAGGGAATAG TTTAAGCTTT ATTCATTCAT GCAATACTAA AGAGAAATAA GAATACTGCA ATTTTGTGCT GGCTTTGAAC AATTACGAAC AATAATGAAG GACAAATGAA TCCTGAAGGA AGATTTTTAA AAATGTTTTG TTTCTTCTTA CAAATGGAGA TTTTTTTGTA CCAGCTTTAC CACTTTTCAG CCATTTATTA ATATGGGAAT TTAACTTACT CAAGCAATAG TTGAAGGGAA GGTGCATATT ATCACGGATG CAATTTATGT TGTGTGCCAG TCTGGTCCCA AACATCAATT TCTTAACATG AGCTCCAGTT TACCTAAATG TTCACTGACA CAAAGGATGA GATTACACCT ACAGTGACTC TGAGTAGTCA CATATATAAG CACTGCACAT GAGATATAGA TCCGTAGAAT TGTCAGGAGT GCACCTCTCT ACTTGGGAGG TACAATTGCC ATATGATTTC TAGCTGCCAT GGTGGTTAGG AATGTGATAC TGCCTGTTTG CAAAGTCACA GACCTTGCCT CAGAAGGAGC TGTGAGCCAG TATTCATTTA AGAGAATTCC ACCACACTGG CGGCCCGCGC TTGAT (SEQ ID NO: 3) .
The present invention also relates to an isolated and purified DNA molecule which encodes a truncated version of nNR2 referred to as nNR2-l. This cDNA molecule is set forth in SEQ ID NO:5 and is disclosed as follows: GCGGGCCGCC AGTGTGGTGG AATTCGGCTT GTCACTAGGA GAACATTTGT GTTAATTGCA CTGTGCTCTG TCAAGGAAAC TTTGATTTAT AGCTGGGGTG CACAAATAAT GGTTGCCGGT CGCACATGGA TTCGGTAGAA CTTTGCCTTC CTGAATCTTT TTCCCTGCAC TACGAGGAAG AGCTTCTCTG CAGAATGTCA AACAAAGATC GACACATTGA TTCCAGCTGT TCGTCCTTCA TCAAGACGGA ACCTTCCAGC CCAGCCTCCC TGACGGACAG CGTCAACCAC CACAGCCCTG GTGGCTCTTC AGACGCCAGT GGGAGCTACA GTTCAACCAT GAATGGCCAT CAGAACGGAC TTGACTCGCC ACCTCTCTAC CCTTCTGCTC CTATCCTGGG AGGTAGTGGG CCTGTCAGGA AACTGTATGA TGACTGCTCC AGCACCATTG TTGAAGATCC CCAGACCAAG TGTGAATACA TGCTCAACTC GATGCCCAAG AGACTGTGTT TAGTGTGTGG TGACATCGCT TCTGGGTACC ACTATGGGGT AGCATCATGT GAAGCCTGCA AGGCATTCTT CAAGAGGACA ATTCAAGGCA ATATAGAATA CAGCTGCCCT GCCACGAATG AATGTGAAAT CACAAAGCGC AGACGTAAAT CCTGCCAGGC TTGCCGCTTC ATGAAGTGTT TAAAAGTGGG CATGCTGAAA GAAGGGGTGC GTCTTGACAG AGTACGTGGA GGTCGGCAGA AGTACAAGCG CAGGATAGAT GCGGAGAACA GCCCATACCT GAACCCTCAG CTGGTTCAGC CAGCCAAAAA GCCATATAAC AAGATTGTCT CACATTTGTT GGTGGCTGAA CCGGAGAAGA TCTATGCCAT GCCTGACCCT ACTGTCCCCG ACAGTGACAT CAAAGCCCTC ACTACACTGT GTGACTTGGC CGACCGAGAG TTGGTGGTTA TCATTGGATG GGCGAAGCAT ATTCCAGGCT TCTCCACGCT GTCCCTGGCG GACCAGATGA GCCTTCTGCA GAGTGCTTGG ATGGAAATTT TGATCCTTGG TGTCGTATAC CGGTCTCTTT CATTTGAGGA TGAACTTGTC TATGCAGACG ATTATATAAT GGACGAAGAC CAGTCCAAAT TAGCAGGCCT TCTTGATCTA AATAATGCTA TCCTGCAGCT GGTAAAGAAA TACAAGAGCA TGAAGCTGGA AAAAGAAGAA TTTGTCACCC TCAAAGCTAT AGCTCTTGCT AATTCAGACT CCATGCACAT AGAAGATGTT GAAGCCGTTC AGAAGCTTCA GGATGTCTTA CATGAAGCGC TGCAGGATTA TGAAGCTGGC CAGCACATGG AGAAGACCCT CGTCGAGCTG GCAAGATGCT GATGACACTG CCACTCCTGA GGCAGACCTC TACCAAGGCC GTGCAGCATT TCTACAACAT CAAACTAGAA GGCAAAGTCC CAATGCACAA ACTTTTTTTG GAAATGTTGG AGGCCAAGGT CTGACTAAAA GCTCCCTGGG CCTTCCCATC CTTCATGTTG AAAAAGGGAA
AATAAACCCA AGAGTGATGT CGAAGAAACT TAGAGTTTAG TTAACAACAT CAAAAATCAA CAGACTGCAC TGATAATTTA GCAGCAAGAC TATGAAGCAG CTTTCAGATT CCTCCATAGG TTCCTGATGA GTTCTTTCTA CTTTCTCCAT CATCTTCTTT CCTCTTTCTT CCCACATTTC TCTTTCTCTT TATTTTTTCT CCTTTTCTTC TTTCACCTCC CTTATTTCTT TGCTTCTTTC ATTCCTAGTT CCCATTCTCC TTTATTTTCT TCCCGTCTGC CTGCCTTCTT TCTTTTCTTT ACCTACTCTC ATTCCTCTCT TTTCTCATCC TTCCCCTTTT TTCTAAATTT GAAATAGCTT TAGTTTAAAA AAAAAAATCC TCCCTTCCCC CTTTCCTTTC CCTTTCTTTC CTTTTTCCCT TTCCTTTTCC CTTTCCTTTC CTTTCCTCTT GACCTTCTTT CCATCTTTCT TTTTCTTCCT TCTGCTGCTG AACTTTTAAA AGAGGTCTCT AACTGAAGAG AGATGGAAGC CAGCCCTGCC AAAGGATGGA GATCCATAAT ATGGATGCCA GTGAACTTAT TGTGAACCAT ACCGTCCCCA ATGACTAAGG AATCAAAGAG AGAGAACCAA CGTTCCTAAA AGTACAGTGC AACATATACA AATTGACTGA GTGCAGTATT AGATTTCATG GGAGCAGCCT CTAATTAGAC AACTTAAGCA ACGTTGCATC GGCTGCTTCT TATCATTGCT TTTCCATCTA GATCAGTTAC AGCCATTTGA TTCCTTAATT GTTTTTTCAA GTCTTCCAGG TATTTGTTAG TTTAGCTACT ATGTAACTTT TTCAGGGAAT AGTTTAAGCT TTATTCATTC ATGCAATACT AAAGAGAAAT AAGAATACTG CAATTTTGTG CTGGCTTTGA ACAATTACGA ACAATAATGA AGGACAAATG AATCCTGAAG GAAGATTTTT AAAAATGTTT TGTTTCTTCT TACAAATGGA GATTTTTTTG TACCAGCTTT ACCACTTTTC AGCCATTTAT TAATATGGGA ATTTAACTTA CTCAAGCAAT AGTTGAAGGG AAGGTGCATA TTATCACGGA TGCAATTTAT GTTGTGTGCC AGTCTGGTCC CAAACATCAA TTTCTTAACA TGAGCTCCAG TTTACCTAAA TGTTCACTGA CACAAAGGAT GAGATTACAC CTACAGTGAC TCTGAGTAGT CACATATATA AGCACTGCAC ATGAGATATA GATCCGTAGA ATTGTCAGGA GTGCACCTCT CTACTTGGGA GGTACAATTG CCATATGATT TCTAGCTGCC ATGGTGGTTA GGAATGTGAT ACTGCCTGTT TGCAAAGTCA CAGACCTTGC CTCAGAAGGA GCTGTGAGCC AGTATTCATT TAAGAGAATT CCACCACACT GGCGGCCCGC GCTTGAT (SEQ ID NO: 5) The present invention also relates to a substantially purified form of the novel nuclear trans-acting receptor protein, nNRl, which is shown in Figures 2A-F and Figure 3 and as set forth in SEQ ID NO:2, disclosed as follows: MSSDDRHLGS SCGSFIKTEP SSPSSGIDAL SHHSPSGSSD ASGGFGLALG THANGLDSPP MFAGAGLGGT PCRKSYEDCA SGIMEDSAIK CEYMLNAIPK
RLCLVCGDIA SGYHYGVASC EACKAFFKRT IQGNIEYSCP ATNECEITKR RRKSCQACRF MKCLKVGMLK EGVR DRVRG GRQKYKRRLD SESSPYLSLQ ISPPAKKPLT KIVSYLLVAE PDKLYAMPPP GMPEGDIKA TTLCDLADRE LWIIG AKH IPGFSSLSLG DQMSLLQSAW MEILILGIVY RS PYDDKLV YAEDYIMDEE HSRLAGLLEL YRAILQLVRR YKK KVEKEE FVTLKALALA NSDSMYIED EAVQKLQDLL HEALQDYELS QRHEEPWRTG KLLLTLPLLR QTAAKAVQHF YSVKLQGKVP MHK FLEMLE AKAWARADSL QE RPLEQVP SPLHRATKRQ HVHFLTPLPP PPSVAWVGTA QAGYHLEVF PQRAGWPRAA ( SEQ ID NO : 2 ) . The present invention also relates to biologically active fragments and/or mutants of nNRl as set forth as SEQ ID NO:2, including but not necessarily limited to amino acid substitutions, deletions, additions, amino terminal truncations and carboxy-terminal truncations such that these mutations provide for proteins or protein fragments of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists of nNRl function.
The present invention also relates to a substantially purified form of the novel nuclear trans-acting receptor protein, nNR2, which is shown in Figure 5A-H and Figure 6 and as set forth in SEQ ID NO:4, disclosed as follows:
MDSVELCLPE SFSLHYEEEL LCRMSNKDRH IDSSCSSFIK TEPSSPASLT DSVNHHSPGG SSDASGSYSS TMNGHQNGLD SPPLYPSAPI LGGSGPVRKL YDDCSSTIVE DPQTKCEYML NSMPKRLCLV CGDIASGYHY GVASCEACKA FFKRTIQGNI EYSCPATNEC EITKRRRKSC QACRFMKCLK VGM KEGVRL DRVRGGRQKY KRRIDAENSP Y NPQLVQPA KKPYNKIVSH LLVAEPEKIY AMPDPTVPDS DIKALTTLCD LADRELWII GWAKHIPGFS TLSLADQMSL LQSAWMEI I LGWYRSLSF EDELVYADDY IMDEDQSKLA GLLDLNNAIL QLVKKYKSMK LEKEEFVTLK AIALANSDSM HIEDVEAVQK LQDVLHEALQ DYEAGQHMED PRRAGKMLMT LPLLRQTSTK AVQHFYNIK EGKVPMHK F LEMLEAKV ( SEQ ID NO : 4 ) .
The present invention also relates to biologically active fragments and/or mutants of nNR2 as set forth as SEQ ID NO:4, including but not necessarily limited to amino acid substitutions, deletions, additions, amino terminal truncations and carboxy-terminal truncations such that these mutations provide for proteins or protein fragments of diagnostic, therapeutic or prophylactic use and would be useful for screening for agonists and/or antagonists of nNR2 function. To this end, an example of such a protein is the carboxy-terminal truncated version of nNR2, referred to as nNR2-l and described in Figure 8 and set forth as SEQ ID NO:6, as follows:
MDSVELCLPE SFSLHYEEEL LCRMSNKDRH IDSSCSSFIK TEPSSPASLT DSVNHHSPGG SSDASGSYSS TMNGHQNGLD SPPLYPSAPI LGGSGPVRKL YDDCSSTIVE DPQTKCEYML NSMPKR CLV CGDIASGYHY GVASCEACKA FFKRTIQGNI EYSCPATNEC EITKRRRKSC QACRFMKCLK VGMLKEGVRL DRVRGGRQKY KRRIDAENSP YLNPQLVQPA KKPYNKIVSH LVAEPEKIY AMPDPTVPDS DIKALTTLCD ADRELWII GWAKHIPGFS TLSLADQMSL LQSA MEI I LGVV YRSLSF EDELVYADDY IMDEDQSKLA GLLDLNNAIL QLVKKYKSMK LEKEEFVTLK AIALANSDSM HIEDVEAVQK LQDVLHEALQ DYEAGQHMEK TLVELARC (SEQ ID NO:6).
The present invention also relates to isolated nucleic acid molecules which are fusion constructions expressing fusion proteins useful in assays to identify compounds which modulate wild-type human nNRl, nNR2 and/or nNR2-l activity. A preferred aspect of this portion of the invention includes, but is not limited to, glutathione S-transferase (GST)-nNRl and/or GST-nNR2 fusion constructs. These fusion constructs include, but are not limited to, all or a portion of the ligand-binding domain of nNRl, nNR2 and/or nNR2-l, respectively, as an in-frame fusion at the carboxy terminus of the GST gene. The disclosure of SEQ ID NOS: 1-4 allows the artisan of ordinary skill to construct any such nucleic acid molecule encoding a GST-nuclear receptor fusion protein. Soluble recombinant GST-nuclear receptor fusion proteins may be expressed in various expression systems, including Spodoptera frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus expression vector (e.g., Bac-N-Blue DNA from Invitrogen or pAcG2T from Pharmingen).
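The in-frame requirement for such a fusion can be illustrated with a minimal sketch. The helper name `fuse_in_frame`, the tag sequence and the linker below are hypothetical placeholders, not the GST gene or any construct of this specification; the insert is simply six codons taken from the open reading frame of SEQ ID NO:5, used only as a toy fragment.

```python
# Hypothetical helper: joins a tag coding sequence and an insert so that the
# insert is read in the same reading frame as the tag. All sequences are toy values.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(seq):
    """Split a sequence whose length is a multiple of 3 into codons."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def fuse_in_frame(tag_cds, insert_cds, linker=""):
    """Return tag + linker + insert, raising ValueError if the frame is broken."""
    tag_cds, linker, insert_cds = (s.upper() for s in (tag_cds, linker, insert_cds))
    # The tag's own stop codon must go, or translation would end before the insert.
    if tag_cds[-3:] in STOP_CODONS:
        tag_cds = tag_cds[:-3]
    for name, part in (("tag", tag_cds), ("linker", linker), ("insert", insert_cds)):
        if len(part) % 3 != 0:
            raise ValueError(f"{name} length {len(part)} is not a multiple of 3")
    fusion = tag_cds + linker + insert_cds
    # A stop codon anywhere except the final position would truncate the fusion.
    if any(codon in STOP_CODONS for codon in split_codons(fusion)[:-1]):
        raise ValueError("premature stop codon in fusion")
    return fusion


toy_tag = "ATGTCCCCTTAA"           # placeholder tag ORF ending in a stop codon
toy_linker = "GGATCC"              # placeholder 6-nt linker (two codons)
toy_insert = "GAACTTTGCCTTCCTGAA"  # six codons from the SEQ ID NO:5 reading frame
fusion = fuse_in_frame(toy_tag, toy_insert, toy_linker)
```

A linker whose length is not a multiple of three (for example a single nucleotide) is rejected, because it would shift the insert out of the tag's reading frame.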
The isolated nucleic acid molecule of the present invention may include a deoxyribonucleic acid molecule (DNA), such as genomic DNA and complementary DNA (cDNA), which may be single (coding or noncoding strand) or double stranded, as well as synthetic DNA, such as a synthesized, single stranded polynucleotide. The isolated nucleic acid molecule of the present invention may also include a ribonucleic acid molecule (RNA).
It is known that there is a substantial amount of redundancy in the various codons which code for specific amino acids. Therefore, this invention is also directed to those DNA sequences which encode RNA comprising alternative codons which code for the eventual translation of the identical amino acid, as shown below:
A=Ala=Alanine: codons GCA, GCC, GCG, GCU
C=Cys=Cysteine: codons UGC, UGU
D=Asp=Aspartic acid: codons GAC, GAU
E=Glu=Glutamic acid: codons GAA, GAG
F=Phe=Phenylalanine: codons UUC, UUU
G=Gly=Glycine: codons GGA, GGC, GGG, GGU
H=His=Histidine: codons CAC, CAU
I=Ile=Isoleucine: codons AUA, AUC, AUU
K=Lys=Lysine: codons AAA, AAG
L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU
M=Met=Methionine: codon AUG
N=Asn=Asparagine: codons AAC, AAU
P=Pro=Proline: codons CCA, CCC, CCG, CCU
Q=Gln=Glutamine: codons CAA, CAG
R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU
S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU
T=Thr=Threonine: codons ACA, ACC, ACG, ACU
V=Val=Valine: codons GUA, GUC, GUG, GUU
W=Trp=Tryptophan: codon UGG
Y=Tyr=Tyrosine: codons UAC, UAU
Therefore, the present invention discloses codon redundancy which may result in differing DNA molecules expressing an identical protein. For purposes of this specification, a sequence bearing one or more replaced codons will be defined as a degenerate variation. Also included within the scope of this invention are mutations either in the DNA sequence or the translated protein which do not substantially alter the ultimate physical properties of the expressed protein. For example, substitution of valine for leucine, arginine for lysine, or asparagine for glutamine may not cause a change in functionality of the polypeptide.
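The notion of a degenerate variation can be illustrated with a short sketch, assuming the standard genetic code. The function names are illustrative only; the input is the first ten codons of the open reading frame found in SEQ ID NO:5, and the codon-swapping rule (take the alphabetically first alternative codon) is an arbitrary choice made for the example.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG; '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def normalize(seq):
    """Upper-case the sequence and treat RNA (U) as DNA (T)."""
    return seq.upper().replace("U", "T")


def translate(dna):
    """Translate complete codons from position 0; trailing bases are ignored."""
    dna = normalize(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3))


def synonymous_codons(codon):
    """All codons that encode the same amino acid as the given codon."""
    amino_acid = CODON_TABLE[normalize(codon)]
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def degenerate_variation(dna):
    """Swap every codon for a different synonymous codon where one exists."""
    dna = normalize(dna)
    out = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        alternatives = [c for c in synonymous_codons(codon) if c != codon]
        out.append(alternatives[0] if alternatives else codon)  # Met and Trp stay as they are
    return "".join(out)


orf_start = "ATGGATTCGGTAGAACTTTGCCTTCCTGAA"  # first ten codons of the SEQ ID NO:5 ORF
variant = degenerate_variation(orf_start)
same_protein = translate(variant) == translate(orf_start)
```

Here `variant` differs from `orf_start` at the nucleotide level, yet both translate to the same ten residues, MDSVELCLPE, which are the first ten residues of SEQ ID NO:4.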
It is known that DNA sequences coding for a peptide may be altered so as to code for a peptide having properties that are different from those of the naturally occurring peptide. Methods of altering the DNA sequences include but are not limited to site directed mutagenesis. Examples of altered properties include but are not limited to changes in the affinity of an enzyme for a substrate or a receptor for a ligand.
As used herein, "purified" and "isolated" are utilized interchangeably to stand for the proposition that the nucleic acid, protein, or respective fragment thereof in question has been substantially removed from its in vivo environment so that it may be manipulated by the skilled artisan, by methods such as but not limited to nucleotide sequencing, restriction digestion, site-directed mutagenesis, and subcloning into expression vectors for a nucleic acid fragment, as well as obtaining the protein or protein fragment in pure quantities so as to afford the opportunity to generate polyclonal antibodies or monoclonal antibodies and to perform amino acid sequencing and peptide digestion. Therefore, the nucleic acids claimed herein may be present in whole cells or in cell lysates or in a partially purified or substantially purified form. A nucleic acid is considered substantially purified when it is purified away from environmental contaminants. Thus, a nucleic acid sequence isolated from cells is considered to be substantially purified when purified from cellular components by standard methods while a chemically synthesized nucleic acid sequence is considered to be substantially purified when purified from its chemical precursors.
The present invention also relates to recombinant vectors and recombinant hosts, both prokaryotic and eukaryotic, which contain the substantially purified nucleic acid molecules disclosed throughout this specification.
Therefore, the present invention also relates to methods of expressing nNRl, nNR2 and/or nNR2-l and biological equivalents disclosed herein, assays employing these recombinantly expressed gene products, cells expressing these gene products, and agonistic and/or antagonistic compounds identified through the use of assays utilizing these recombinant forms, including, but not limited to, one or more modulators of the human nNRl, nNR2 and/or nNR2-l either through direct contact with the LBD or through direct or indirect contact with a ligand which interacts either with the DBD or with the wild-type transcription complex with which nNRl, nNR2 and/or nNR2-l interacts in trans, thereby modulating cell differentiation or cell development.
As used herein, a "biologically active equivalent" or "functional derivative" of a wild-type human nNRl, nNR2 and/or nNR2- 1 possesses a biological activity that is substantially similar to the biological activity of the wild type human nNRl, nNR2 and/or nNR2-l. The term "functional derivative" is intended to include the "fragments," "mutants," "variants," "degenerate variants," "analogs" and "homologues" or to "chemical derivatives" of the wild type human nNRl, nNR2 and/or nNR2-l protein. The term "fragment" is meant to refer to any polypeptide subset of wild-type human nNRl or nNR2. The term "mutant" is meant to refer to a molecule that may be substantially similar to the wild-type form but possesses distinguishing biological characteristics. Such altered characteristics include but are in no way limited to altered substrate binding, altered substrate affinity and altered sensitivity to chemical compounds affecting biological activity of the human nNRl, nNR2 and/or nNR2-l or human nNRl, nNR2 and/or nNR2-l functional derivatives. The term "variant" is meant to refer to a molecule substantially similar in structure and function to either the entire wild-type protein or to a fragment thereof. A molecule is "substantially similar" to a wild-type human nNRl, nNR2 and/or nNR2-l-like protein if both molecules have substantially similar structures or if both molecules possess similar biological activity. Therefore, if the two molecules possess substantially similar activity, they are considered to be variants even if the structure of one of the molecules is not found in the other or even if the two amino acid sequences are not identical. The term "analog" refers to a molecule substantially similar in function to either the full-length human nNRl, nNR2 and/or nNR2-l protein or to a biologically active fragment thereof. Any of a variety of procedures may be used to clone human nNRl, nNR2 and/or nNR2-l. These methods include, but are not
limited to:
(1) a RACE PCR cloning technique (Frohman, et al., 1988, Proc. Natl. Acad. Sci. USA 85: 8998-9002). 5' and/or 3' RACE may be performed to generate a full-length cDNA sequence. This strategy involves using gene-specific oligonucleotide primers for PCR amplification of human nNRl, nNR2 and/or nNR2-l cDNA. These gene-specific primers are designed through identification of an expressed sequence tag (EST) nucleotide sequence which has been identified by searching any number of publicly available nucleic acid and protein databases;
(2) direct functional expression of the human nNRl, nNR2 and/or nNR2-l cDNA following the construction of a human nNRl, nNR2 and/or nNR2-l-containing cDNA library in an appropriate expression vector system;
(3) screening a human nNRl, nNR2 and/or nNR2-l-containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with a labeled degenerate oligonucleotide probe designed from the amino acid sequence of the human nNRl, nNR2 and/or nNR2-l protein;
(4) screening a human nNRl, nNR2 and/or nNR2-l-containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with a partial cDNA encoding the human nNRl, nNR2 and/or nNR2-l protein. This partial cDNA is obtained by the specific PCR amplification of human nNRl, nNR2 and/or nNR2-l DNA fragments through the design of degenerate oligonucleotide primers from the amino acid sequence known for other kinases which are related to the human nNRl, nNR2 and/or nNR2-l protein;
(5) screening a human nNRl, nNR2 and/or nNR2-l-containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with a partial cDNA encoding the human nNRl, nNR2 and/or nNR2-l protein. This strategy may also involve using gene-specific oligonucleotide primers for PCR amplification of human nNRl, nNR2 and/or nNR2-l cDNA identified as an EST as described above; or
(6) designing 5' and 3' gene specific oligonucleotides using SEQ ID NO: 1 as a template so that either the full-length cDNA may be generated by known PCR techniques, or a portion of the coding region may be generated by these same known PCR techniques to generate and isolate a portion of the coding region to use as a probe to screen one of numerous types of cDNA and/or genomic libraries in order to isolate a full-length version of the nucleotide sequence encoding human nNRl, nNR2 and/or nNR2-l.
It is readily apparent to those skilled in the art that other types of libraries, as well as libraries constructed from other cell types-or species types, may be useful for isolating a nNRl, nNR2 and/or nNR2-l- encoding DNA or a nNRl, nNR2 and/or nNR2-l homologue. Other types of libraries include, but are not limited to, cDNA libraries derived from other cells or cell lines other than human cells or tissue such as murine cells, rodent cells or any other such vertebrate host which may contain nNRl, nNR2 and/or nNR2-l-encoding DNA. Additionally a nNRl, nNR2 and/or nNR2-l gene and homologues may be isolated by oligonucleotide- or polynucleotide-based hybridization screening of a vertebrate genomic library, including but not limited to, a murine genomic library, a rodent genomic library, as well as concomitant human genomic DNA libraries.
It is readily apparent to those skilled in the art that suitable cDNA libraries may be prepared from cells or cell lines which have nNRl, nNR2 and/or nNR2-l activity. The selection of cells or cell lines for use in preparing a cDNA library to isolate a cDNA encoding nNRl, nNR2 and/or nNR2-l may be done by first measuring cell-associated nNRl, nNR2 and/or nNR2-l activity using any known assay available for such a purpose.
Preparation of cDNA libraries can be performed by standard techniques well known in the art. Well known cDNA library construction techniques can be found, for example, in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. Complementary DNA libraries may also be obtained from numerous commercial sources, including but not limited to Clontech Laboratories, Inc. and Stratagene.
It is also readily apparent to those skilled in the art that DNA encoding human nNRl, nNR2 and/or nNR2-l may also be isolated from a suitable genomic DNA library. Construction of genomic DNA libraries can be performed by standard techniques well known in the art. Well known genomic DNA library construction techniques can be found in Sambrook, et al., supra.
In order to clone the human nNRl, nNR2 and/or nNR2-l gene by one of the preferred methods, the amino acid sequence or DNA sequence of human nNRl, nNR2 and/or nNR2-l or a homologous protein may be necessary. To accomplish this, the nNRl, nNR2 and/or nNR2-l protein or a homologous protein may be purified and partial amino acid sequence determined by automated sequenators. It is not necessary to determine the entire amino acid sequence, but the linear sequence of two regions of 6 to 8 amino acids can be determined for the PCR amplification of a partial human nNRl, nNR2 and/or nNR2-l DNA fragment. Once suitable amino acid sequences have been identified, the DNA sequences capable of encoding them are synthesized. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and therefore, the amino acid sequence can be encoded by any of a set of similar DNA oligonucleotides. Only one member of the set will be identical to the human nNRl, nNR2 and/or nNR2-l sequence but others in the set will be capable of hybridizing to human nNRl, nNR2 and/or nNR2-l DNA even in the presence of DNA oligonucleotides with mismatches. The mismatched DNA oligonucleotides may still sufficiently hybridize to the human nNRl, nNR2 and/or nNR2-l DNA to permit identification and isolation of human nNRl, nNR2 and/or nNR2-l encoding DNA. Alternatively, the nucleotide sequence of a region of an expressed sequence may be identified by searching one or more available genomic databases. Gene- specific primers may be used to perform PCR amplification of a cDNA of interest from either a cDNA library or a population of cDNAs. 
As noted above, the appropriate nucleotide sequence for use in a PCR-based method may be obtained from SEQ ID NO: 1, either for the purpose of isolating overlapping 5' and 3' RACE products for generation of a full- length sequence coding for human nNRl, nNR2 and/or nNR2-l, or to isolate a portion of the nucleotide sequence coding for human nNRl, nNR2 and/or nNR2-l for use as a probe to screen one or more cDNA- or genomic-based libraries to isolate a full-length sequence encoding human nNRl, nNR2 and/or nNR2-l or human nNRl, nNR2 and/or nNR2-l-like proteins.
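How the degeneracy of the genetic code multiplies the number of oligonucleotides needed to cover a short peptide can be sketched as follows. This is an illustrative calculation only: the function names are hypothetical, the codon counts are those of the standard genetic code, and the example peptide is the first ten residues of SEQ ID NO:4, chosen for convenience rather than as a region used in this specification.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "C": 2, "D": 2, "E": 2, "F": 2, "G": 4, "H": 2, "I": 3, "K": 2, "L": 6,
    "M": 1, "N": 2, "P": 4, "Q": 2, "R": 6, "S": 6, "T": 4, "V": 4, "W": 1, "Y": 2,
}


def pool_degeneracy(peptide):
    """Number of distinct oligonucleotides able to encode the peptide."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())


def least_degenerate_window(protein, length=6):
    """Return the window of the given length needing the smallest oligonucleotide pool."""
    protein = protein.upper()
    if len(protein) < length:
        raise ValueError("protein is shorter than the requested window")
    windows = [protein[i:i + length] for i in range(len(protein) - length + 1)]
    return min(windows, key=pool_degeneracy)  # the first window wins on ties


n_terminus = "MDSVELCLPE"  # first ten residues of SEQ ID NO:4
best = least_degenerate_window(n_terminus, length=6)
pool_size = pool_degeneracy(best)
```

Windows rich in methionine or tryptophan (one codon each) and poor in leucine, serine or arginine (six codons each) give the smallest pools, which is why such regions are usually favoured when designing degenerate probes or primers.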
In an exemplified method, the human nNRl, nNR2 and/or nNR2-l full-length cDNAs of the present invention were generated by PCR scanning human cDNA libraries with oligonucleotide primers generated from ESTs showing homology to hERR2. Briefly, random and oligo dT primed cDNA libraries as described herein, which consist of approximately 4 million primary clones, were constructed in the plasmid vector pBluescript (Stratagene, La Jolla, CA). The primary clones were subdivided into 188 pools with each pool containing approximately 20,000 clones. Each pool was amplified separately and the resulting plasmid pools were collected and transferred into two 96-well plates. Primer pairs from the 5' and 3' portion of an EST were used to scan the respective cDNA library distributed in a 96-well plate. Initial positive pools were identified with EST primers. Corresponding full length cDNA clones were retrieved via inverse PCR using primer pairs designed from the EST and positioned back to back against each other. The primers therefore walk away from each other during the PCR reaction, resulting in amplification of a population of linearized plasmid DNA molecules corresponding to the EST. cDNA clones were obtained by ligating the linear DNA and transforming the circularized DNA into competent bacterial cells. Usually, four positive clones for each gene were used for sequence analysis because of the possibility of mutation during long PCR reactions. The consensus DNA sequence is considered to be the wild type DNA sequence. Recloning of the gene through PCR using gene specific primers covering the whole open reading frame was done so as to obtain a cDNA clone which has an identical DNA sequence to the consensus sequence. This procedure does not depend upon using a cDNA library with directionally cloned inserts, but does require cDNA libraries constructed in a plasmid vector, such as pBluescript. This procedure was utilized to identify full length cDNA molecules representing human nNRl, nNR2 and/or nNR2-l.
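The pool arithmetic and the back-to-back primer geometry described above can be sketched as follows. This is an illustration under stated assumptions, not the protocol actually used: the function names, the 20-nucleotide primer length and the choice of split point are hypothetical, and the 40-nucleotide "known" sequence is simply the start of the open reading frame in SEQ ID NO:5 standing in for an EST.

```python
from math import ceil

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def pool_layout(total_clones, pools, wells_per_plate=96):
    """Average clones per pool and the number of plates needed to hold the pools."""
    plates = ceil(pools / wells_per_plate)
    return {
        "clones_per_pool": round(total_clones / pools),
        "plates": plates,
        "spare_wells": plates * wells_per_plate - pools,
    }


def back_to_back_primers(known, split, length=20):
    """Two primers that abut at `split` and point away from each other.

    The forward primer copies the top strand downstream of the split; the
    reverse primer is the reverse complement of the stretch just upstream,
    so extension proceeds outward around a circular plasmid template.
    """
    known = known.upper()
    if split < length or split + length > len(known):
        raise ValueError("split point leaves too little sequence for both primers")
    forward = known[split:split + length]
    reverse = reverse_complement(known[split - length:split])
    return forward, reverse


layout = pool_layout(total_clones=4_000_000, pools=188)
known = "ATGGATTCGGTAGAACTTTGCCTTCCTGAATCTTTTTCCC"  # stand-in for an EST
forward, reverse = back_to_back_primers(known, split=20)
```

With the figures given in the text, 188 pools of about 21,000 clones each fit into two 96-well plates with four wells left over, consistent with the stated pool size of approximately 20,000 clones.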
A variety of mammalian expression vectors may be used to express recombinant human nNRl, nNR2 and/or nNR2-l in mammalian cells. Expression vectors are defined herein as DNA sequences that are required for the transcription of cloned DNA and the translation of their mRNAs in an appropriate host. Such vectors can be used to express eukaryotic DNA in a variety of hosts such as bacteria, blue green algae, plant cells, insect cells and animal cells. Specifically designed vectors allow the shuttling of DNA between hosts such as bacteria-yeast or bacteria-animal cells. An appropriately constructed expression vector should contain: an origin of replication for autonomous replication in host cells, selectable markers, a limited number of useful restriction enzyme sites, a potential for high copy number, and active promoters. A promoter is defined as a DNA sequence that directs RNA polymerase to bind to DNA and initiate RNA synthesis. A strong promoter is one which causes mRNAs to be initiated at high frequency. Expression vectors may include, but are not limited to, cloning vectors, modified cloning vectors, specifically designed plasmids or viruses.
Commercially available mammalian expression vectors which may be suitable for recombinant human nNRl, nNR2 and/or nNR2-l expression include, but are not limited to, pcDNA3.1 (Invitrogen), pLITMUS28, pLITMUS29, pLITMUS38 and pLITMUS39 (New England Biolabs), pcDNAI, pcDNAIamp (Invitrogen), pcDNA3 (Invitrogen), pMC1neo (Stratagene), pXT1 (Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV-1(8-2) (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), and λZD35 (ATCC 37565).
A variety of bacterial expression vectors may be used to express recombinant human nNRl, nNR2 and/or nNR2-l in bacterial cells. Commercially available bacterial expression vectors which may be suitable for recombinant human nNRl, nNR2 and/or nNR2-l expression include, but are not limited to, pQE (Qiagen), pET11a (Novagen), lambda gt11 (Invitrogen), and pKK223-3 (Pharmacia). A variety of fungal cell expression vectors may be used to express recombinant human nNRl, nNR2 and/or nNR2-l in fungal cells. Commercially available fungal cell expression vectors which may be suitable for recombinant human nNRl, nNR2 and/or nNR2-l expression include, but are not limited to, pYES2 (Invitrogen) and Pichia expression vector (Invitrogen).
A variety of insect cell expression vectors may be used to express recombinant receptor in insect cells. Commercially available insect cell expression vectors which may be suitable for recombinant expression of human nNRl, nNR2 and/or nNR2-l include but are not limited to pBlueBacIII and pBlueBacHis2 (Invitrogen), and pAcG2T (Pharmingen).
An expression vector containing DNA encoding a human nNRl, nNR2 and/or nNR2-l-like protein may be used for expression of human nNRl, nNR2 and/or nNR2-l in a recombinant host cell. Recombinant host cells may be prokaryotic or eukaryotic, including but not limited to bacteria such as E. coli, fungal cells such as yeast, mammalian cells including but not limited to cell lines of human, bovine, porcine, monkey and rodent origin, and insect cells including but not limited to Drosophila- and silkworm-derived cell lines. Cell lines derived from mammalian species which may be suitable and which are commercially available include, but are not limited to, L cells L-M(TK-) (ATCC CCL 1.3), L cells L-M (ATCC CCL 1.2), Saos-2 (ATCC HTB-85), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26), MRC-5 (ATCC CCL 171) and CPAE (ATCC CCL 209).
The expression vector may be introduced into host cells via any one of a number of techniques including but not limited to transformation, transfection, protoplast fusion, and electroporation. The expression vector-containing cells are individually analyzed to determine whether they produce human nNRl, nNR2 and/or nNR2-l protein. Identification of human nNRl, nNR2 and/or nNR2-l expressing cells may be done by several means, including but not limited to immunological reactivity with anti-human nNRl, nNR2 and/or nNR2-l antibodies, labeled ligand binding and the presence of host cell-associated human nNRl, nNR2 and/or nNR2-l activity.
The cloned human nNRl, nNR2 and/or nNR2-l cDNA obtained through the methods described above may be recombinantly expressed by molecular cloning into an expression vector (such as pcDNA3.1, pQE, pBlueBacHis2 and pLITMUS28) containing a suitable promoter and other appropriate transcription regulatory elements, and transferred into prokaryotic or eukaryotic host cells to produce recombinant human nNRl, nNR2 and/or nNR2-l. Techniques for such manipulations are described in Sambrook, et al., supra, are discussed at length in the Example section and are well known and easily available to the artisan of ordinary skill in the art.
Expression of human nNRl, nNR2 and/or nNR2-l DNA may also be performed using in vitro produced synthetic mRNA. Synthetic mRNA can be efficiently translated in various cell-free systems, including but not limited to wheat germ extracts and reticulocyte extracts, as well as efficiently translated in cell based systems, including but not limited to microinjection into frog oocytes, with microinjection into frog oocytes being preferred. To determine the human nNRl, nNR2 and/or nNR2-l cDNA sequence(s) that yields optimal levels of human nNRl, nNR2 and/or nNR2-l, cDNA molecules including but not limited to the following can be constructed: a cDNA fragment containing the full- length open reading frame for human nNRl, nNR2 and/or nNR2-l as well as various constructs containing portions of the cDNA encoding only specific domains of the protein or rearranged domains of the protein. All constructs can be designed to contain none, all or portions of the 5' and/or 3' untranslated region of a human nNRl, nNR2 and/or nNR2-l cDNA. The expression levels and activity of human nNRl, nNR2 and/or nNR2-l can be determined following the introduction, both singly and in combination, of these constructs into appropriate host cells. Following determination of the human nNRl, nNR2 and/or nNR2-l cDNA cassette yielding optimal expression in transient assays, this nNRl, nNR2 and/or nNR2-l cDNA construct is transferred to a variety of expression vectors (including recombinant viruses), including but not limited to those for mammalian cells, plant cells, insect cells, oocytes, bacteria, and yeast cells.
The present invention also relates to polyclonal and monoclonal antibodies raised in response to either the human form of nNRl, nNR2 and/or nNR2-l disclosed herein, or a biologically active fragment thereof. It will be especially preferable to raise antibodies against epitopes within the NH2-terminal domain of nNRl, nNR2 and/or nNR2-l, which show the least homology to other known proteins belonging to the human nuclear receptor superfamily.
Recombinant nNRl, nNR2 and/or nNR2-l protein can be separated from other cellular proteins by use of an immunoaffinity column made with monoclonal or polyclonal antibodies specific for full-length nNRl, nNR2 and/or nNR2-l protein, or polypeptide fragments of nNRl, nNR2 and/or nNR2-l protein. Additionally, polyclonal or monoclonal antibodies may be raised against a synthetic peptide (usually from about 9 to about 25 amino acids in length) from a portion of the protein as disclosed in SEQ ID NO:2. Monospecific antibodies to human nNRl, nNR2 and/or nNR2-l are purified from mammalian antisera containing antibodies reactive against human nNRl, nNR2 and/or nNR2-l or are prepared as monoclonal antibodies reactive with human nNRl, nNR2 and/or nNR2-l using the technique of Kohler and Milstein (1975, Nature 256: 495-497). Monospecific antibody as used herein is defined as a single antibody species or multiple antibody species with homogenous binding characteristics for human nNRl, nNR2 and/or nNR2-l. Homogenous binding as used herein refers to the ability of the antibody species to bind to a specific antigen or epitope, such as those associated with human nNRl, nNR2 and/or nNR2-l, as described above. Human nNRl, nNR2 and/or nNR2-l-specific antibodies are raised by immunizing animals such as mice, rats, guinea pigs, rabbits, goats, horses and the like, with an appropriate concentration of human nNRl, nNR2 and/or nNR2-l protein or a synthetic peptide generated from a portion of human nNRl, nNR2 and/or nNR2-l with or without an immune adjuvant.
Preimmune serum is collected prior to the first immunization. Each animal receives between about 0.1 mg and about 1000 mg of human nNRl, nNR2 and/or nNR2-l protein associated with an acceptable immune adjuvant. Such acceptable adjuvants include, but are not limited to, Freund's complete, Freund's incomplete, alum-precipitate, water in oil emulsion containing Corynebacterium parvum and tRNA. The initial immunization consists of human nNRl, nNR2 and/or nNR2-l protein or peptide fragment thereof in, preferably, Freund's complete adjuvant at multiple sites either subcutaneously (SC), intraperitoneally (IP) or both. Each animal is bled at regular intervals, preferably weekly, to determine antibody titer. The animals may or may not receive booster injections following the initial immunization. Those animals receiving booster injections are generally given an equal amount of human nNRl, nNR2 and/or nNR2-l in Freund's incomplete adjuvant by the same route. Booster injections are given at about three week intervals until maximal titers are obtained. At about 7 days after each booster immunization or about weekly after a single immunization, the animals are bled, the serum collected, and aliquots are stored at about -20°C.
Monoclonal antibodies (mAb) reactive with human nNRl, nNR2 and/or nNR2-l are prepared by immunizing inbred mice, preferably Balb/c, with human nNRl, nNR2 and/or nNR2-l protein. The mice are immunized by the IP or SC route with about 1 mg to about 100 mg, preferably about 10 mg, of human nNRl, nNR2 and/or nNR2-l protein in about 0.5 ml buffer or saline incorporated in an equal volume of an acceptable adjuvant, as discussed above. Freund's complete adjuvant is preferred. The mice receive an initial immunization on day 0 and are rested for about 3 to about 30 weeks. Immunized mice are given one or more booster immunizations of about 1 to about 100 mg of human nNRl, nNR2 and/or nNR2-l in a buffer solution such as phosphate buffered saline by the intravenous (IV) route. Lymphocytes from antibody positive mice, preferably splenic lymphocytes, are obtained by removing spleens from immunized mice by standard procedures known in the art. Hybridoma cells are produced by mixing the splenic lymphocytes with an appropriate fusion partner, preferably myeloma cells, under conditions which will allow the formation of stable hybridomas. Fusion partners may include, but are not limited to: mouse myelomas P3/NS1/Ag 4-1; MPC-11; S-194 and Sp 2/0, with Sp 2/0 being preferred. The antibody producing cells and myeloma cells are fused in polyethylene glycol, about 1000 mol. wt., at concentrations from about 30% to about 50%. Fused hybridoma cells are selected by growth in hypoxanthine, thymidine and aminopterin supplemented Dulbecco's Modified Eagles Medium (DMEM) by procedures known in the art. Supernatant fluids are collected from growth positive wells on about days 14, 18, and 21 and are screened for antibody production by an immunoassay such as solid phase immunoradioassay (SPIRA) using human nNRl, nNR2 and/or nNR2-l as the antigen. The culture fluids are also tested in the Ouchterlony precipitation assay to determine the isotype of the mAb. Hybridoma cells from antibody positive wells are cloned by a technique such as the soft agar technique of MacPherson, 1973, Soft Agar Techniques, in Tissue Culture Methods and Applications, Kruse and Paterson, Eds., Academic Press.
Monoclonal antibodies are produced in vivo by injection of pristane-primed Balb/c mice, approximately 0.5 ml per mouse, with about 2 x 10^6 to about 6 x 10^6 hybridoma cells about 4 days after priming. Ascites fluid is collected at approximately 8-12 days after cell transfer and the monoclonal antibodies are purified by techniques known in the art.
In vitro production of anti-human nNRl, nNR2 and/or nNR2-l mAb is carried out by growing the hybridoma in DMEM containing about 2% fetal calf serum to obtain sufficient quantities of the specific mAb. The mAb are purified by techniques known in the art. Antibody titers of ascites or hybridoma culture fluids are determined by various serological or immunological assays which include, but are not limited to, precipitation, passive agglutination, enzyme-linked immunosorbent assay (ELISA) technique and radioimmunoassay (RIA) techniques. Similar assays are used to detect the presence of human nNRl, nNR2 and/or nNR2-l in body fluids or tissue and cell extracts.
It is readily apparent to those skilled in the art that the above described methods for producing monospecific antibodies may be utilized to produce antibodies specific for human nNR1, nNR2 and/or nNR2-1 peptide fragments, or full-length human nNR1, nNR2 and/or nNR2-1.
Human nNR1, nNR2 and/or nNR2-1 antibody affinity columns are made, for example, by adding the antibodies to Affigel-10 (Biorad), a gel support which is pre-activated with N-hydroxysuccinimide esters such that the antibodies form covalent linkages with the agarose gel bead support. The antibodies are then coupled to the gel via amide bonds with the spacer arm. The remaining activated esters are then quenched with 1 M ethanolamine HCl (pH 8). The column is washed with water followed by 0.23 M glycine HCl (pH 2.6) to remove any non-conjugated antibody or extraneous protein. The column is then equilibrated in phosphate buffered saline (pH 7.3) and the cell culture supernatants or cell extracts containing full-length human nNR1, nNR2 and/or nNR2-1 or human nNR1, nNR2 and/or nNR2-1 protein fragments are slowly passed through the column. The column is then washed with phosphate buffered saline until the optical density (A280) falls to background, then the protein is eluted with 0.23 M glycine-HCl (pH 2.6). The purified human nNR1, nNR2 and/or nNR2-1 protein is then dialyzed against phosphate buffered saline.
Levels of human nNR1, nNR2 and/or nNR2-1 in host cells are quantified by a variety of techniques including, but not limited to, immunoaffinity and/or ligand affinity techniques. nNR1, nNR2 and/or nNR2-1-specific affinity beads or nNR1, nNR2 and/or nNR2-1-specific antibodies are used to isolate 35S-methionine labeled or unlabelled nNR1, nNR2 and/or nNR2-1. Labeled nNR1, nNR2 and/or nNR2-1 protein is analyzed by SDS-PAGE. Unlabelled nNR1, nNR2 and/or nNR2-1 protein is detected by Western blotting, ELISA or RIA assays employing either nNR1, nNR2 and/or nNR2-1 protein specific antibodies and/or antiphosphotyrosine antibodies.
Following expression of nNR1, nNR2 and/or nNR2-1 in a host cell, nNR1, nNR2 and/or nNR2-1 protein may be recovered to provide nNR1, nNR2 and/or nNR2-1 protein in active form. Several nNR1, nNR2 and/or nNR2-1 protein purification procedures are available and suitable for use. Recombinant nNR1, nNR2 and/or nNR2-1 protein may be purified from cell lysates and extracts, or from conditioned culture medium, by various combinations of, or individual application of, salt fractionation, ion exchange chromatography, size exclusion chromatography, hydroxylapatite adsorption chromatography and hydrophobic interaction chromatography.
The present invention is also directed to methods for screening for compounds which modulate the expression of DNA or RNA encoding a human nNR1, nNR2 and/or nNR2-1 protein. Compounds which modulate these activities may be DNA, RNA, peptides, proteins, or non-proteinaceous organic molecules.
Compounds may modulate by increasing or attenuating the expression of DNA or RNA encoding human nNR1, nNR2 and/or nNR2-1, or the function of human nNR1, nNR2 and/or nNR2-1. Compounds that modulate the expression of DNA or RNA encoding human nNR1, nNR2 and/or nNR2-1 or the biological function thereof may be detected by a variety of assays. The assay may be a simple "yes/no" assay to determine whether there is a change in expression or function. The assay may be made quantitative by comparing the expression or function of a test sample with the levels of expression or function in a standard sample. Kits containing human nNR1, nNR2 and/or nNR2-1, antibodies to human nNR1, nNR2 and/or nNR2-1, or modified human nNR1, nNR2 and/or nNR2-1 may be prepared by known methods for such uses.
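By way of illustration only, the quantitative comparison of a test sample with a standard sample may be expressed as a simple ratio. The sketch below is not part of the disclosed assays; the function names relative_level and classify, and the two-fold threshold, are hypothetical choices made for the example.

```python
# Illustrative sketch: compare an expression (or activity) readout from a
# compound-treated test sample with the readout from a standard sample.
# Function names and the 2-fold threshold are assumptions, not from the text.

def relative_level(test_signal: float, standard_signal: float) -> float:
    """Return the test readout as a multiple of the standard readout."""
    if standard_signal <= 0:
        raise ValueError("standard signal must be positive")
    return test_signal / standard_signal


def classify(ratio: float, threshold: float = 2.0) -> str:
    """Turn a ratio into a coarse call using an arbitrary fold threshold."""
    if ratio >= threshold:
        return "increase"
    if ratio <= 1.0 / threshold:
        return "decrease"
    return "no change"


if __name__ == "__main__":
    ratio = relative_level(test_signal=300.0, standard_signal=100.0)
    print(ratio, classify(ratio))
```

Reducing the ratio to a coarse call corresponds to the "yes/no" form of the assay, while the ratio itself corresponds to the quantitative form; any threshold used in practice would have to be established empirically.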
The DNA molecules, RNA molecules, recombinant protein and antibodies of the present invention may be used to screen and measure levels of human nNR1, nNR2 and/or nNR2-1. The recombinant proteins, DNA molecules, RNA molecules and antibodies lend themselves to the formulation of kits suitable for the detection and typing of human nNR1, nNR2 and/or nNR2-1. Such a kit would comprise a compartmentalized carrier suitable to hold in close confinement at least one container. The carrier would further comprise reagents such as recombinant nNR1, nNR2 and/or nNR2-1 or anti-nNR1, nNR2 and/or nNR2-1 antibodies suitable for detecting human nNR1, nNR2 and/or nNR2-1. The carrier may also contain a means for detection such as labeled antigen or enzyme substrates or the like.
Pharmaceutically useful compositions comprising modulators of human nNR1, nNR2 and/or nNR2-1 may be formulated according to known methods such as by the admixture of a pharmaceutically acceptable carrier. Examples of such carriers and methods of formulation may be found in Remington's Pharmaceutical Sciences. To form a pharmaceutically acceptable composition suitable for effective administration, such compositions will contain an effective amount of the protein, DNA, RNA, modified human nNR1, nNR2 and/or nNR2-1, or either nNR1, nNR2 and/or nNR2-1 agonists or antagonists.
Therapeutic or diagnostic compositions of the invention are administered to an individual in amounts sufficient to treat or diagnose disorders. The effective amount may vary according to a variety of factors such as the individual's condition, weight, sex and age. Other factors include the mode of administration.
The pharmaceutical compositions may be provided to the individual by a variety of routes such as subcutaneous, topical, oral and intramuscular.
The term "chemical derivative" describes a molecule that contains additional chemical moieties which are not normally a part of the base molecule. Such moieties may improve the solubility, half-life, absorption, etc. of the base molecule. Alternatively the moieties may attenuate undesirable side effects of the base molecule or decrease the toxicity of the base molecule. Examples of such moieties are described in a variety of texts, such as Remington's Pharmaceutical Sciences.
Compounds identified according to the methods disclosed herein may be used alone at appropriate dosages. Alternatively, co-administration or sequential administration of other agents may be desirable. The present invention also has the objective of providing suitable topical, oral, systemic and parenteral pharmaceutical formulations for use in the novel methods of treatment of the present invention. The compositions containing compounds identified according to this invention as the active ingredient can be administered in a wide variety of therapeutic dosage forms in conventional vehicles for administration. For example, the compounds can be administered in such oral dosage forms as tablets, capsules (each including timed release and sustained release formulations), pills, powders, granules, elixirs, tinctures, solutions, suspensions, syrups and emulsions, or by injection. Likewise, they may also be administered in intravenous (both bolus and infusion), intraperitoneal, subcutaneous, topical with or without occlusion, or intramuscular form, all using forms well known to those of ordinary skill in the pharmaceutical arts.
Advantageously, compounds of the present invention may be administered in a single daily dose, or the total daily dosage may be administered in divided doses of two, three or four times daily. Furthermore, compounds for the present invention can be administered in intranasal form via topical use of suitable intranasal vehicles, or via transdermal routes, using those forms of transdermal skin patches well known to those of ordinary skill in that art. To be administered in the form of a transdermal delivery system, the dosage administration will, of course, be continuous rather than intermittent throughout the dosage regimen.
For combination treatment with more than one active agent, where the active agents are in separate dosage formulations, the active agents can be administered concurrently, or they each can be administered at separately staggered times.
The dosage regimen utilizing the compounds of the present invention is selected in accordance with a variety of factors including type, species, age, weight, sex and medical condition of the patient; the severity of the condition to be treated; the route of administration; the renal, hepatic and cardiovascular function of the patient; and the particular compound thereof employed. A physician or veterinarian of ordinary skill can readily determine and prescribe the effective amount of the drug required to prevent, counter or arrest the progress of the condition. Optimal precision in achieving concentrations of drug within the range that yields efficacy without toxicity requires a regimen based on the kinetics of the drug's availability to target sites. This involves a consideration of the distribution, equilibrium, and elimination of a drug.
The following examples are provided to illustrate the present invention without, however, limiting the same hereto.
EXAMPLE 1: Isolation and Characterization of DNA Fragments Encoding nNR1, nNR2 and/or nNR2-1
The DNA sequences from several representative subfamilies (Giguere, et al., 1988, Nature 331: 91-94) were used to query the EST database by using the Blastn program. Two ESTs (Genbank accession numbers h91890 (nNR1) and w26275 (nNR2)) were identified with homology to human ERR2 at the DNA sequence level.
EST h91890 is disclosed herein as SEQ ID NO:7 and is as set forth: CTTTTTAGGA GGTGGAGAAA TTTGTAAGCT CAGGTATGGG CTGCTCTCTG AGTCCAGCCG TCGCTTGTAT TTCTGACGGC CTCCACGCAC TCGATCAAGG CGCACACCTT CCTTCAGCAT CCCCACTTTG AGGCATTTCA TGAAGCGGCA GGCCTGGCAG GACTTGCGCC TCCGTTTGGT GATCTCGCAC TCGTTGGTGG CCGGGCAGCT GTACTCAATG TTCCCTTGGA TAGTCCTCTT GAAGAAGGCC TTGCAAGCCT CGCAGGAGGC CCACGCGTNA GTGGTAGCCA GAGNAAATGT CCCCGCACAC GAGGCACAGG CGCTTGGGGA TGGCGTTGAG CATGTTACTT CGCACTTGGA TGGGCCGAGT CCTCCATGGA TGGCCGCTGG CAACAGTTCC TCG (SEQ ID NO:7) .
EST w26275 is disclosed herein as SEQ ID NO: 8 and is as set forth: CNNNNNNNN NNNTTTTNT GCCTAAAGTG GTACCCNGAA GNGATGTCAC CACACACTAA ACACAGTCTC TTGGGCATCG AGTTGAGCAT GTATTCACAC TTGGTCTGGG GATCTTCAAC AATGGTGCTG GAGCAGTCAT CATACAGTTT CCTGACAGGC CCACTACCTC CCAGGATAGG AGCAGAAGGG TAGAGAGGTG GCGAGTCAAG TCCGTTCTGA TGGCCATTCA TGGTTGAACT GTAGCTCCCA CTGGCGTCTG AAGAGCCACC AGGGCTGTGG TGGTTGACGC TGTCCGTCAG GGAGGCTGGG CTGGAAGGTT CCGTCTTGAT GAAGGACGAA CAGCTGGAAT CAATGTGTCG ATCTTTGTTT GGACATTCTG CAGAGAAGCT CTTCCTCCGT NGTGCAGGGA AAAAGATTCA GGAAGGCAAA GTTCTTCCCG AATCCATGTG CGACCGGAAA CCATTATTTG NGCACCCCAG CTATTAATCA AAGTTCCTTG ACAGAGACAG GGCAATTACA NAATGTCTCC TNTNGGGGAT CAACTGTTCN GTATTTvTNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
NMSπSINNNNNN NNNNI INNNNN TT ( SEQ ID NO : 8 ) .
Primers 5'-TGAGTCCAGCCGTCGCTTGTAT-3' (ERR4F1; SEQ ID NO:9), 5'-TGCAAGCCTCGCAGGAGGCC-3' (ERR4iF1; SEQ ID NO:10), and 5'-GGCCTTCTTCAAGAGGACTATC-3' (ERR4R1; SEQ ID NO:11) were designed from h91890; 5'-AAAGATCGACACATTGATTCC-3' (ERR5F; SEQ ID NO:12), 5'-GACTTGACTCGCCACCTCTC-3' (ERR5iF; SEQ ID NO:13) and 5'-GTTCTGATGGCCATTCATGGT-3' (ERR5R; SEQ ID NO:14) were designed from w26275. Primer pairs ERR4F/ERR4R and ERR5F/ERR5R were used to scan cDNA made from testis, fetal brain, prostate and placenta before scanning cDNA libraries made from those cDNAs and distributed in 96-well plates. Primers for nNR1 produced a PCR product from testis cDNA, while primers for nNR2 generated a PCR product from a cDNA library generated from fetal brain, prostate and placenta mRNA. Therefore, a cDNA library made from testis with >2.5 kb insert was used for nNR1 positive pool identification, and A4 and G8 gave the PCR product of expected size. Inverse PCR using ERR4iF1 and ERR4R1 was performed on positive pools and DNA fragments of about 6.0 kb were amplified. The DNA fragment was purified using the Qiagen gel extraction kit. Phosphorylation, self-ligation and transformation of the purified DNA were carried out. DNA mini-preps from four individual clones were used in automated sequencing with gene specific and vector primers. Since a PCR-induced mutation is possible in long PCR reactions, nNR1 was re-subcloned into the pCR2.1 vector (Invitrogen) using a PCR fragment amplified by a 5'-primer 5'-GAATATGATGACCCTAATGCA-3' (SEQ ID NO:15) and a 3'-primer 5'-CTTCCACCTCATGGACACCAA-3' (SEQ ID NO:16) on the positive A4 pool. One out of the four TA-clones showed no mutation through sequencing confirmation.
DNA sequence analysis was performed using the ABI PRISM™ dye terminator cycle sequencing ready reaction kit with AmpliTaq DNA polymerase, FS (Perkin Elmer, Norwalk, CT). DNA sequence analysis was performed with M13 forward/reverse primers and gene specific sequencing primers manufactured by GIBCO BRL (Gaithersburg, MD). Sequence assembly and analysis were performed with SEQUENCHER™ 3.0 (Gene Codes Corporation, Ann Arbor, MI). Ambiguities and/or discrepancies between automated base calling in sequencing reads were visually examined and edited to the correct base call. Several regions were resequenced after initial automated or visual calling. Four oligonucleotides close to the regions with potential sequence ambiguities were utilized ([R1F1] 5'-CAT TCC ACG GAG GCA TCC TC-3' (SEQ ID NO:23); [R1F2] 5'-CCA AGG CCG TGC AGC ACT TC-3' (SEQ ID NO:24); [R1R1] 5'-GAC AGC CTC TAG ATC CTC GAT-3' (SEQ ID NO:25); and [R1R2] 5'-ATC ATG GCT TGA CAT TCT TTC-3' (SEQ ID NO:26)) and automated sequencing was performed. The final nucleotide sequence encoding nNR1 is shown in Figure 1A-C and is set forth as SEQ ID NO:1.
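The placement of the inverse PCR primers on EST h91890 can be checked computationally. The following is a minimal sketch, offered for illustration and not as part of the disclosed method: it locates each primer on an excerpt of SEQ ID NO:7 (copied from the sequence as printed above, with spaces removed), reports which strand it matches, and tests whether the pair points outward, as inverse PCR on a self-ligated template requires. The helper names reverse_complement, locate_primer and orientation are hypothetical, and only exact matches with one hit per primer are assumed.

```python
# Illustrative sketch: locate primers on a template and check their orientation.
# Helper names are hypothetical; exact matching and a single hit are assumed.

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def locate_primer(template: str, primer: str):
    """Return (strand, start) for the first exact match, or None.

    '+' means the primer reads the same as the template as written;
    '-' means the primer matches the opposite strand. In both cases
    start is the 0-based index of the site on the template as written.
    """
    template, primer = template.upper(), primer.upper()
    pos = template.find(primer)
    if pos != -1:
        return ("+", pos)
    pos = template.find(reverse_complement(primer))
    if pos != -1:
        return ("-", pos)
    return None


def orientation(plus_hit, minus_hit) -> str:
    """'divergent' if the primers extend away from each other, else 'convergent'."""
    return "divergent" if minus_hit[1] < plus_hit[1] else "convergent"


# Excerpt of EST h91890 (SEQ ID NO:7) and two primers from Example 1.
EST_EXCERPT = "TTCCCTTGGATAGTCCTCTTGAAGAAGGCCTTGCAAGCCTCGCAGGAGGCCCACGCGT"
ERR4IF1 = "TGCAAGCCTCGCAGGAGGCC"    # SEQ ID NO:10
ERR4R1 = "GGCCTTCTTCAAGAGGACTATC"   # SEQ ID NO:11

inner_fwd = locate_primer(EST_EXCERPT, ERR4IF1)
inner_rev = locate_primer(EST_EXCERPT, ERR4R1)

if __name__ == "__main__":
    print(inner_fwd, inner_rev, orientation(inner_fwd, inner_rev))
```

On this excerpt ERR4iF1 matches the EST as written and ERR4R1 matches the opposite strand at an adjacent upstream site, so the two primers extend away from each other.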
For nNR2, a cDNA library made from fetal brain with >2.5 kb insert was used. Positive pools C1, F7 and G6 were identified and used in inverse PCR with primer pair ERR5iF/ERR5R. A PCR fragment of about 6.0 kb was amplified from C1. The same methodology as described herein for nNR1 was applied to isolation, characterization and sequencing of an nNR2 cDNA. The cDNA fragment cloned into the pCR2.1 vector was amplified by 5'-primer 5'-GTTAATTGCACTGTGCTCTG-3' (SEQ ID NO:17) and 3'-primer 5'-AGTGTGGTGGAATTCTCTTA-3' (SEQ ID NO:18).
Primer pair XR2F3 (5'-AGCTCTTGCTAATTCAGAC-3' [SEQ ID NO:27]) and XR2R4 (5'-TCAACATGAAGGATGGGAAGG-3' [SEQ ID NO:28]) was used in DNA sequence analysis (performed using the ABI PRISM™ dye terminator cycle sequencing ready reaction kit with AmpliTaq DNA polymerase, FS (Perkin Elmer, Norwalk, CT)) of the carboxy region of nNR2. DNA sequence analysis was performed with M13 forward/reverse primers and gene specific sequencing primers customarily manufactured by GIBCO BRL (Gaithersburg, MD). Sequence assembly and analysis were performed with SEQUENCHER™ 3.0 (Gene Codes Corporation, Ann Arbor, MI). Ambiguities and/or discrepancies between automated base calling in sequencing reads were visually examined and edited to the correct base call. Resequencing of the ligand binding domain showed a new open reading frame that was confirmed with the XR2F3/XR2R4 primers.
The nNR2 peptide encoded by the complete open reading frame has 40 extra amino acids at the C-terminus compared to nNR2-1 and is similar in length to its closest related member, hERR2.
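An open reading frame call of this kind can be spot-checked by translating the nucleotide sequence with the standard genetic code. The sketch below is illustrative only and is not part of the disclosed method: it translates the first 33 nucleotides of the nNR2 coding region, copied from SEQ ID NO:3 beginning at the ATG at about nucleotide 126. The name translate and the constant names are hypothetical.

```python
# Illustrative sketch: translate a DNA fragment with the standard genetic code.
# Names are hypothetical; codons containing ambiguous bases are reported as 'X'.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, stop_at_stop: bool = True) -> str:
    """Translate dna in frame 1; optionally stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the nNR2 open reading frame (from SEQ ID NO:3).
ORF_START = "ATGGATTCGGTAGAACTTTGCCTTCCTGAATCT"

if __name__ == "__main__":
    print(translate(ORF_START))
```

The translated fragment, MDSVELCLPES, agrees with the first eleven residues of the nNR2 amino acid sequence recited in the claims (SEQ ID NO:4).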
In order to identify the genome map position of the genes, primers in the 3' non-coding region were designed. Forward primer 5'-TCTAGTGTTGCTGCGAGTGAC-3' (SEQ ID NO:19) and reverse primer 5'-CTTCCACCTCATGGACACCAA-3' (SEQ ID NO:20) were used for nNR1, while forward primer 5'-GTCTGACTAAAAGCTCCCTG-3' (SEQ ID NO:21) and reverse primer 5'-GAAGATGATGGAGAAAGTAGA-3' (SEQ ID NO:22) were used for nNR2. PCR scanning was performed on the 83 clones of the Stanford radiation hybrid panel (Cox et al., 1990, Science, 250:245-250). The PCR results were scored and submitted to the Stanford Genome Center for linkage analysis. The results indicate that nNR1 is located at locus 14q24.3 to 14q31 and nNR2 is located on chromosome 1.
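The scoring step can be pictured as reducing the PCR result for each hybrid clone to one symbol in a retention vector. The following is a minimal sketch under stated assumptions, not the linkage analysis actually performed by the mapping center: the 0/1/R coding, the function names encode_rh_vector and concordance, and the simple agreement fraction are hypothetical simplifications, whereas real radiation hybrid mapping relies on statistical linkage methods.

```python
# Illustrative sketch: encode per-clone PCR scores as a retention vector and
# compare two vectors. The coding ('1' positive, '0' negative, 'R' ambiguous)
# and the agreement measure are assumptions made for this example.

PANEL_SIZE = 83  # number of hybrid clones in the panel, as stated in the text


def encode_rh_vector(scores) -> str:
    """Map True/False/None PCR calls to a '1'/'0'/'R' string."""
    symbols = {True: "1", False: "0", None: "R"}
    return "".join(symbols[s] for s in scores)


def concordance(vec_a: str, vec_b: str) -> float:
    """Fraction of clones, typed in both vectors, where the calls agree."""
    if len(vec_a) != len(vec_b):
        raise ValueError("vectors must cover the same clones")
    pairs = [(a, b) for a, b in zip(vec_a, vec_b) if a != "R" and b != "R"]
    if not pairs:
        raise ValueError("no clone is typed in both vectors")
    return sum(a == b for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Made-up calls for demonstration only; not experimental data.
    demo = encode_rh_vector([True, False, None] + [False] * (PANEL_SIZE - 3))
    print(len(demo), demo[:3])
```

Markers whose vectors agree across most clones tend to lie close together on a chromosome, which is the intuition behind placing a new marker relative to previously mapped ones.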
Claims
1. A purified DNA molecule encoding a human nNR1 protein wherein said protein comprises the amino acid sequence as follows:
MSSDDRHLGS SCGSFIKTEP SSPSSGIDAL SHHSPSGSSD ASGGFGLALG THANGLDSPP MFAGAGLGGT PCRKSYEDCA SGIMEDSAIK CEYMLNAIPK RLCLVCGDIA SGYHYGVASC EACKAFFKRT IQGNIEYSCP ATNECEITKR RRKSCQACRF MKCLKVGMLK EGVRLDRVRG GRQKYKRRLD SESSPYLSLQ ISPPAKKPLT KIVSYLLVAE PDKLYAMPPP GMPEGDIKAL TTLCDLADRE LWI IG AKH IPGFSSLSLG DQMSLLQSAW MEILILGIVY RSLPYDDKLV YAEDYI DEE HSRLAGLLEL YRAILQLVRR YKKLKVEKEE FVTLKALALA NSDSMYIEDL EAVQKLQDLL HEALQDYELS QRHEEP RTG KLLLTLPLLR QTAAKAVQHF YSVKLQGKVP MHKLFLEMLE AKA ARADSL QE RPLEQVP SPLHRATKRQ HVHFLTPLPP PPSVAWVGTA QAGYHLEVFL PQRAGWPRAA, as set forth in three-letter abbreviation in SEQ ID NO:2.
2. An expression vector for expressing a human nNR1 protein in a recombinant host cell wherein said expression vector comprises a DNA molecule of claim 1.
3. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 2.
4. A process for expressing a human nNR1 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 2 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said human nNR1 protein from said expression vector.
5. A purified DNA molecule encoding a human nNR1 protein wherein said protein consists of the amino acid sequence as follows:
MSSDDRHLGS SCGSFIKTEP SSPSSGIDAL SHHSPSGSSD ASGGFGLALG THANGLDSPP MFAGAGLGGT PCRKSYEDCA SGIMEDSAIK CEYMLNAIPK RLCLVCGDIA SGYHYGVASC EACKAFFKRT IQGNIEYSCP ATNECEITKR RRKSCQACRF MKCLKVGMLK EGVRLDRVRG GRQKYKRRLD SESSPYLSLQ ISPPAKKPLT KIVSYLLVAE PDKLYAMPPP GMPEGDIKAL TTLCDLADRE LWIIGWAKH IPGFSSLSLG DQMSLLQSA MEILILGIVY RSLPYDDKLV YAEDYIMDEE HSRLAGLLEL YRAILQLVRR YKKLKVEKEE FVTLKALALA NSDSMYIEDL EAVQKLQDLL HEALQDYELS QRHEEP RTG KLLLTLPLLR QTAAKAVQHF YSVKLQGKVP MHKLFLEMLE AKA ARADSL QEWRPLEQVP SPLHRATKRQ HVHFLTPLPP PPSVAWVGTA QAGYHLEVFL PQRAGWPRAA, as set forth in three-letter abbreviation in SEQ ID NO:2.
6. An expression vector for expressing a human nNR1 protein in a recombinant host cell wherein said expression vector comprises a DNA molecule of claim 5.
7. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 6.
8. A process for expressing a human nNR1 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 6 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said human nNR1 protein from said expression vector.
9. A purified DNA molecule encoding a human nNR1 protein wherein said DNA molecule comprises the nucleotide sequence as set forth in SEQ ID NO:1, as follows:
GAATATGATG ACCCTAATGC AACAATATCT AACATACTAT CCGAGCTTCG GTCATTTGGA AGAACTGCAG ATTTTCCTCC TTCAAAATTA AAGTCAGGTT ATGGAGAACA TGTATGCTAT GTTCTTGATT GCTTCGCTGA AGAAGCATTG AAATATATTG GTTTCACCTG GAAAAGGCCA ATATACCCAG TAGAAGAATT AGAAGAAGAA AGCGTTGCAG AAGATGATGC AGAATTAACA TTAAATAAAG TGGATGAAGA ATTTGTGGAA GAAGAGACAG ATAATGAAGA AAACTTTATT GATCTCAACG TTTTAAAGGC CCAGACATAT CACTTGGATA TGAACGAGAC TGCCAAACAA GAAGATATTT TGGAATCCAC AACAGATGCT GCAGAATGGA GCCTAGAAGT GGAACGTGTA CTACCGCAAC TGAAAGTCAC GATTAGGACT GACAATAAGG ATTGGAGAAT CCATGTTGAC CAAATGCACC AGCACAGAAG TGGAATTGAA TCTGCTCTAA AGGAGACCAA GGGATTTTTG GACAAACTCC ATAATGAAAT TACTAGGACT TTGGAAAAGA TCAGCAGCCG AGAAAAGTAC ATCAACAATC AGCCGGGAGC CCATGGAGCA CTGTCCTCAG AGATGCGCAG GTTAGGCTCA CTGTCTAGGC CAGGCCCACC TTAGTCACTG TGGACTGGCA ATGGAAGCTC TTCCTGGACA CACCTGCCCT AGCCCTCACC CTGGGGTGGA AGAGAAATGA GCTTGGCTTG CAACTCAGAC CATTCCACGG AGGCATCCTC CCCTTCCCTG GGCTGGTGAA TAAAAGTTTC CTGAGGTCAA GGACTTCCTT TTCCCTGCCA AAATGGTGTC CAGAACTTTG AGGCCAGAGG TGATCCAGTG ATTTGGGAGC TGCAGGTCAC ACAGGCTGCT CAGAGGGCTG CTGAACAGGA TGTCCTCGGA CGACAGGCAC CTGGGCTCCA GCTGCGGCTC CTTCATCAAG ACTGAGCCGT CCAGCCCGTC CTCGGGCATA GATGCCCTCA GCCACCACAG CCCCAGTGGC TCGTCCGACG CCAGCGGCGG CTTTGGCCTG GCCCTGGGCA CCCACGCCAA CGGTCTGGAC TCGCCACCCA TGTTTGCAGG CGCCGGGCTG GGAGGCACCC CATGCCGCAA GAGCTACGAG GACTGTGCCA GCGGCATCAT GGAGGACTCG GCCATCAAGT GCGAGTACAT GCTCAACGCC ATCCCCAAGC GCCTGTGCCT CGTGTGCGGG GACATTGCCT CTGGCTACCA CTACGGCGTG GCCTCCTGCG AGGCTTGCAA GGCCTTCTTC AAGAGGACTA TCCAAGGGAA CATTGAGTAC AGCTGCCCGG CCACCAACGA GTGCGAGATC ACCAAACGGA GGCGCAAGTC CTGCCAGGCC TGCCGCTTCA TGAAATGCCT CAAAGTGGGG ATGCTGAAGG AAGGTGTGCG CCTTGATCGA GTGCGTGGAG GCCGTCAGAA ATACAAGCGA CGGCTGGACT CAGAGAGCAG CCCATACCTG AGCTTACAAA TTTCTCCACC TGCTAAAAAG CCATTGACCA AGATTGTCTC ATACCTACTG GTGGCTGAGC CGGACAAGCT CTATGCCATG CCTCCCCCTG GTATGCCTGA GGGGGACATC AAGGCCCTGA CCACTCTCTG TGACCTGGCA GACCGAGAGC TTGTGGTCAT CATTGGCTGG GCCAAGCACA TCCCAGGCTT CTCAAGCCTC TCCCTGGGGG ACCAGATGAG CCTGCTGCAG AGTGCCTGGA TGGAAATCCT CATCCTGGGC 
ATCGTGTACC GCTCGCTGCC CTACGACGAC AAGCTGGTGT ACGCTGAGGA CTACATCATG GATGAGGAGC ACTCCCGCCT CGCGGGGCTG CTGGAGCTCT ACCGGGCCAT CCTGCAGCTG GTACGCAGGT ACAAGAAGCT CAAGGTGGAG AAGGAGGAGT TTGTGACGCT CAAGGCCCTG GCCCTCGCCA ACTCCGATTC CATGTACATC GAGGATCTAG AGGCTGTCCA GAAGCTGCAG GACCTGCTGC ACGAGGCACT GCAGGACTAC GAGCTGAGCC AGCGCCATGA GGAGCCCTGG AGGACGGGCA AGCTGCTGCT GACACTGCCG CTGCTGCGGC AGACGGCCGC CAAGGCCGTG CAGCACTTCT ATAGCGTCAA ACTGCAGGGC AAAGTGCCCA TGCACAAACT CTTCCTGGAG ATGCTGGAGG CCAAGGCCTG GGCCAGGGCT GACTCCCTTC AGGAGTGGAG GCCACTGGAG CAAGTGCCCT CTCCCCTCCA CCGAGCCACC AAGAGGCAGC ATGTGCATTT CCTAACTCCC TTGCCCCCTC CCCCATCTGT GGCCTGGGTG GGCACTGCTC AGGCTGGATA CCACCTGGAG GTTTTCCTTC CGCAGAGGGC AGGTTGGCCA AGAGCAGCTT AGAGGATCTC CCAAGGATGA AAGAATGTCA AGCCATGATG GAAAATGCCC CTTCCAATCA GCTGCCTTCA CAAGCAGGGA TCAGAGCAAC TCCCCGGGGA TCCCCAATCC ACGCCCTTCT AGTCCAACCC CCCTCAATGA GAGAGGCAGG CAGATCTCAC CCAGCACTAG GACACCAGGA GGCCAGGGAA AGCATCTCTG GCTCACCATG TAACATCTGG CTTGGAGCAA GTGGGTGTTC TGCACACCAG GCAGCTGCAC CTCACTGGAT CTAGTGTTGC TGCGAGTGAC CTCACTTCAG AGCCCCTCTA GCAGAGTGGG GCGGAAGTCC TGATGGTTGG TGTCCATGAG GTGGAAG (SEQ ID NO:l).
10. A DNA molecule of claim 9 which comprises from about nucleotide 950 to about nucleotide 2452 of SEQ ID NO:1.
11. An expression vector for expressing a human nNR1 protein wherein said expression vector comprises a DNA molecule of claim 9.
12. An expression vector for expressing a human nNR1 protein wherein said expression vector comprises a DNA molecule of claim 11.
13. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 11.
14. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 12.
15. A process for expressing a human nNR1 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 11 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said human nNR1 protein from said expression vector.
16. A purified DNA molecule encoding a human nNR1 protein wherein said DNA molecule consists of the nucleotide sequence as set forth in SEQ ID NO:1, as follows:
GAATATGATG ACCCTAATGC AACAATATCT AACATACTAT CCGAGCTTCG GTCATTTGGA AGAACTGCAG ATTTTCCTCC TTCAAAATTA AAGTCAGGTT ATGGAGAACA TGTATGCTAT GTTCTTGATT GCTTCGCTGA AGAAGCATTG AAATATATTG GTTTCACCTG GAAAAGGCCA ATATACCCAG TAGAAGAATT AGAAGAAGAA AGCGTTGCAG AAGATGATGC AGAATTAACA TTAAATAAAG TGGATGAAGA ATTTGTGGAA GAAGAGACAG ATAATGAAGA AAACTTTATT GATCTCAACG TTTTAAAGGC CCAGACATAT CACTTGGATA TGAACGAGAC TGCCAAACAA GAAGATATTT TGGAATCCAC AACAGATGCT GCAGAATGGA GCCTAGAAGT GGAACGTGTA CTACCGCAAC TGAAAGTCAC GATTAGGACT GACAATAAGG ATTGGAGAAT CCATGTTGAC CAAATGCACC AGCACAGAAG TGGAATTGAA TCTGCTCTAA AGGAGACCAA GGGATTTTTG GACAAACTCC ATAATGAAAT TACTAGGACT TTGGAAAAGA TCAGCAGCCG AGAAAAGTAC ATCAACAATC AGCCGGGAGC CCATGGAGCA CTGTCCTCAG AGATGCGCAG GTTAGGCTCA CTGTCTAGGC CAGGCCCACC TTAGTCACTG TGGACTGGCA ATGGAAGCTC TTCCTGGACA CACCTGCCCT AGCCCTCACC CTGGGGTGGA AGAGAAATGA GCTTGGCTTG CAACTCAGAC CATTCCACGG AGGCATCCTC CCCTTCCCTG GGCTGGTGAA TAAAAGTTTC CTGAGGTCAA GGACTTCCTT TTCCCTGCCA AAATGGTGTC CAGAACTTTG AGGCCAGAGG TGATCCAGTG ATTTGGGAGC TGCAGGTCAC ACAGGCTGCT CAGAGGGCTG CTGAACAGGA TGTCCTCGGA CGACAGGCAC CTGGGCTCCA GCTGCGGCTC CTTCATCAAG ACTGAGCCGT CCAGCCCGTC CTCGGGCATA GATGCCCTCA GCCACCACAG CCCCAGTGGC TCGTCCGACG CCAGCGGCGG CTTTGGCCTG GCCCTGGGCA CCCACGCCAA CGGTCTGGAC TCGCCACCCA TGTTTGCAGG CGCCGGGCTG GGAGGCACCC CATGCCGCAA GAGCTACGAG GACTGTGCCA GCGGCATCAT GGAGGACTCG GCCATCAAGT GCGAGTACAT GCTCAACGCC ATCCCCAAGC GCCTGTGCCT CGTGTGCGGG GACATTGCCT CTGGCTACCA CTACGGCGTG GCCTCCTGCG AGGCTTGCAA GGCCTTCTTC AAGAGGACTA TCCAAGGGAA CATTGAGTAC AGCTGCCCGG CCACCAACGA GTGCGAGATC ACCAAACGGA GGCGCAAGTC CTGCCAGGCC TGCCGCTTCA TGAAATGCCT CAAAGTGGGG ATGCTGAAGG AAGGTGTGCG CCTTGATCGA GTGCGTGGAG GCCGTCAGAA ATACAAGCGA CGGCTGGACT CAGAGAGCAG CCCATACCTG AGCTTACAAA TTTCTCCACC TGCTAAAAAG CCATTGACCA AGATTGTCTC ATACCTACTG GTGGCTGAGC CGGACAAGCT CTATGCCATG CCTCCCCCTG GTATGCCTGA GGGGGACATC AAGGCCCTGA CCACTCTCTG TGACCTGGCA GACCGAGAGC TTGTGGTCAT CATTGGCTGG GCCAAGCACA TCCCAGGCTT CTCAAGCCTC TCCCTGGGGG ACCAGATGAG CCTGCTGCAG AGTGCCTGGA TGGAAATCCT CATCCTGGGC 
ATCGTGTACC GCTCGCTGCC CTACGACGAC AAGCTGGTGT ACGCTGAGGA CTACATCATG GATGAGGAGC ACTCCCGCCT CGCGGGGCTG CTGGAGCTCT ACCGGGCCAT CCTGCAGCTG GTACGCAGGT ACAAGAAGCT CAAGGTGGAG AAGGAGGAGT TTGTGACGCT CAAGGCCCTG GCCCTCGCCA ACTCCGATTC CATGTACATC GAGGATCTAG AGGCTGTCCA GAAGCTGCAG GACCTGCTGC ACGAGGCACT GCAGGACTAC GAGCTGAGCC AGCGCCATGA GGAGCCCTGG AGGACGGGCA AGCTGCTGCT GACACTGCCG CTGCTGCGGC AGACGGCCGC CAAGGCCGTG CAGCACTTCT ATAGCGTCAA ACTGCAGGGC AAAGTGCCCA TGCACAAACT CTTCCTGGAG ATGCTGGAGG CCAAGGCCTG GGCCAGGGCT GACTCCCTTC AGGAGTGGAG GCCACTGGAG CAAGTGCCCT CTCCCCTCCA CCGAGCCACC AAGAGGCAGC ATGTGCATTT CCTAACTCCC TTGCCCCCTC CCCCATCTGT GGCCTGGGTG GGCACTGCTC AGGCTGGATA CCACCTGGAG GTTTTCCTTC CGCAGAGGGC AGGTTGGCCA AGAGCAGCTT AGAGGATCTC CCAAGGATGA AAGAATGTCA AGCCATGATG GAAAATGCCC CTTCCAATCA GCTGCCTTCA CAAGCAGGGA TCAGAGCAAC TCCCCGGGGA TCCCCAATCC ACGCCCTTCT AGTCCAACCC CCCTCAATGA GAGAGGCAGG CAGATCTCAC CCAGCACTAG GACACCAGGA GGCCAGGGAA AGCATCTCTG GCTCACCATG TAACATCTGG CTTGGAGCAA GTGGGTGTTC TGCACACCAG GCAGCTGCAC CTCACTGGAT CTAGTGTTGC TGCGAGTGAC CTCACTTCAG AGCCCCTCTA GCAGAGTGGG GCGGAAGTCC TGATGGTTGG TGTCCATGAG GTGGAAG (SEQIDNO:l).
17. A DNA molecule of claim 16 which consists of nucleotide 950 to about nucleotide 2452 of SEQ ID NO:1.
18. An expression vector for expressing a human nNR1 protein wherein said expression vector comprises a DNA molecule of claim 16.
19. An expression vector for expressing a human nNR1 protein wherein said expression vector comprises a DNA molecule of claim 17.
20. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 18.
21. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 19.
22. A process for expressing a human nNR1 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 18 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said human nNR1 protein from said expression vector.
23. A purified DNA molecule encoding a human nNR2 protein wherein said protein comprises the amino acid sequence as follows:
MDSVELCLPE SFSLHYEEEL LCRMSNKDRH IDSSCSSFIK TEPSSPASLT DSVNHHSPGG SSDASGSYSS TMNGHQNGLD SPPLYPSAPI LGGSGPVRKL YDDCSSTIVE DPQTKCEYML NSMPKRLCLV CGDIASGYHY GVASCEACKA FFKRTIQGNI EYSCPATNEC EITKRRRKSC QACRFMKCLK VGMLKEGVRL DRVRGGRQKY KRRIDAENSP YLNPQLVQPA KKPYNKIVSH LLVAEPEKIY AMPDPTVPDS DIKALTTLCD LADRELWII GWAKHIPGFS TLSLADQMSL LQSAWMEILI LGWYRSLSF EDELVYADDY IMDEDQSKLA GLLDLNNAIL QLVKKYKSMK LEKEEFVTLK AIALANSDSM HIEDVEAVQK LQDVLHEALQ DYEAGQHMED PRRAGKMLMT LPLLRQTSTK AVQHFYNIKL EGKVPMHKLF LEMLEAKV, as set forth in three-letter abbreviation in SEQ ID NO:4.
24. An expression vector for expressing a human nNR2 protein in a recombinant host cell wherein said expression vector comprises a DNA molecule of claim 23.
25. A host cell which expresses a recombinant human nNR2 protein wherein said host cell contains the expression vector of claim 24.
26. A process for expressing a human nNR2 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 24 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said human nNR1 protein from said expression vector.
27. A purified DNA molecule encoding a human nNR2 protein wherein said protein consists of the amino acid sequence as follows:
MDSVELCLPE SFSLHYEEEL LCRMSNKDRH IDSSCSSFIK TEPSSPASLT
DSVNHHSPGG SSDASGSYSS TMNGHQNGLD SPPLYPSAPI LGGSGPVRKL
YDDCSSTIVE DPQTKCEYML NSMPKRLCLV CGDIASGYHY GVASCEACKA
FFKRTIQGNI EYSCPATNEC EITKRRRKSC QACRFMKCLK VGMLKEGVRL DRVRGGRQKY KRRIDAENSP YLNPQLVQPA KKPYNKIVSH LLVAEPEKIY
AMPDPTVPDS DIKALTTLCD LADRELWII GWAKHIPGFS TLSLADQMSL
LQSAWMEILI LGWYRSLSF EDELVYADDY IMDEDQSKLA GLLDLNNAIL
QLVKKYKSMK LEKEEFVTLK AIALANSDSM HIEDVEAVQK LQDVLHEALQ
DYEAGQHMED PRRAGKMLMT LPLLRQTSTK AVQHFYNIKL EGKVPMHKLF LEMLEAKV, as set forth in three-letter abbreviation in SEQ ID NO:4.
28. An expression vector for expressing a human nNR2 protein in a recombinant host cell wherein said expression vector comprises a DNA molecule of claim 27.
29. A host cell which expresses a recombinant human nNR1 protein wherein said host cell contains the expression vector of claim 28.
30. A process for expressing a human nNR2 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 28 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said human nNR1 protein from said expression vector.
31. A purified DNA molecule encoding a human nNR2 protein wherein said DNA molecule comprises the nucleotide sequence as set forth in SEQ ID NO:3, as follows:
GCGGGCCGCC AGTGTGGTGG AATTCGGCTT GTCACTAGGA GAACATTTGT GTTAATTGCA CTGTGCTCTG TCAAGGAAAC TTTGATTTAT AGCTGGGGTG CACAAATAAT GGTTGCCGGT CGCACATGGA TTCGGTAGAA CTTTGCCTTC CTGAATCTTT TTCCCTGCAC TACGAGGAAG AGCTTCTCTG CAGAATGTCA AACAAAGATC GACACATTGA TTCCAGCTGT TCGTCCTTCA TCAAGACGGA ACCTTCCAGC CCAGCCTCCC TGACGGACAG CGTCAACCAC CACAGCCCTG GTGGCTCTTC AGACGCCAGT GGGAGCTACA GTTCAACCAT GAATGGCCAT CAGAACGGAC TTGACTCGCC ACCTCTCTAC CCTTCTGCTC CTATCCTGGG AGGTAGTGGG CCTGTCAGGA AACTGTATGA TGACTGCTCC AGCACCATTG TTGAAGATCC CCAGACCAAG TGTGAATACA TGCTCAACTC GATGCCCAAG AGACTGTGTT TAGTGTGTGG TGACATCGCT TCTGGGTACC ACTATGGGGT AGCATCATGT GAAGCCTGCA AGGCATTCTT CAAGAGGACA ATTCAAGGCA ATATAGAATA CAGCTGCCCT GCCACGAATG AATGTGAAAT CACAAAGCGC AGACGTAAAT CCTGCCAGGC TTGCCGCTTC ATGAAGTGTT TAAAAGTGGG CATGCTGAAA GAAGGGGTGC GTCTTGACAG AGTACGTGGA GGTCGGCAGA AGTACAAGCG CAGGATAGAT GCGGAGAACA GCCCATACCT GAACCCTCAG CTGGTTCAGC CAGCCAAAAA GCCATATAAC AAGATTGTCT CACATTTGTT GGTGGCTGAA CCGGAGAAGA TCTATGCCAT GCCTGACCCT ACTGTCCCCG ACAGTGACAT CAAAGCCCTC ACTACACTGT GTGACTTGGC CGACCGAGAG TTGGTGGTTA TCATTGGATG GGCGAAGCAT ATTCCAGGCT TCTCCACGCT GTCCCTGGCG GACCAGATGA GCCTTCTGCA GAGTGCTTGG ATGGAAATTT TGATCCTTGG TGTCGTATAC CGGTCTCTTT CATTTGAGGA TGAACTTGTC TATGCAGACG ATTATATAAT GGACGAAGAC CAGTCCAAAT TAGCAGGCCT TCTTGATCTA AATAATGCTA TCCTGCAGCT GGTAAAGAAA TACAAGAGCA TGAAGCTGGA AAAAGAAGAA TTTGTCACCC TCAAAGCTAT AGCTCTTGCT AATTCAGACT CCATGCACAT AGAAGATGTT GAAGCCGTTC AGAAGCTTCA GGATGTCTTA CATGAAGCGC TGCAGGATTA TGAAGCTGGC CAGCACATGG AAGACCCTCG TCGAGCTGGC AAGATGCTGA TGACACTGCC ACTCCTGAGG CAGACCTCTA CCAAGGCCGT GCAGCATTTC TACAACATCA AACTAGAAGG CAAAGTCCCA ATGCACAAAC TTTTTTTGGA AATGTTGGAG GCCAAGGTCT GACTAAAAGC TCCCTGGGCC TTCCCATCCT TCATGTTGAA AAAGGGAAAA TAAACCCAAG AGTGATGTCG AAGAAACTTA GAGTTTAGTT AACAACATCA AAAATCAACA GACTGCACTG ATAATTTAGC AGCAAGACTA TGAAGCAGCT TTCAGATTCC TCCATAGGTT CCTGATGAGT TCTTTCTACT TTCTCCATCA TCTTCTTTCC TCTTTCTTCC CACATTTCTC TTTCTCTTTA TTTTTTCTCC TTTTCTTCTT TCACCTCCCT TATTTCTTTG CTTCTTTCAT TCCTAGTTCC CATTCTCCTT 
TATTTTCTTC CCGTCTGCCT GCCTTCTTTC TTTTCTTTAC CTACTCTCAT TCCTCTCTTT TCTCATCCTT CCCCTTTTTT CTAAATTTGA AATAGCTTTA GTTTAAAAAA AAAAATCCTC CCTTCCCCCT TTCCTTTCCC TTTCTTTCCT TTTTCCCTTT CCTTTTCCCT TTCCTTTCCT TTCCTCTTGA CCTTCTTTCC ATCTTTCTTT TTCTTCCTTC TGCTGCTGAA CTTTTAAAAG AGGTCTCTAA CTGAAGAGAG ATGGAAGCCA GCCCTGCCAA AGGATGGAGA TCCATAATAT GGATGCCAGT GAACTTATTG TGAACCATAC CGTCCCCAAT GACTAAGGAA TCAAAGAGAG AGAACCAACG TTCCTAAAAG TACAGTGCAA CATATACAAA TTGACTGAGT GCAGTATTAG ATTTCATGGG AGCAGCCTCT AATTAGACAA CTTAAGCAAC GTTGCATCGG CTGCTTCTTA TCATTGCTTT TCCATCTAGA TCAGTTACAG CCATTTGATT CCTTAATTGT TTTTTCAAGT CTTCCAGGTA TTTGTTAGTT TAGCTACTAT GTAACTTTTT CAGGGAATAG TTTAAGCTTT ATTCATTCAT GCAATACTAA AGAGAAATAA GAATACTGCA ATTTTGTGCT GGCTTTGAAC AATTACGAAC AATAATGAAG GACAAATGAA TCCTGAAGGA AGATTTTTAA AAATGTTTTG TTTCTTCTTA CAAATGGAGA TTTTTTTGTA CCAGCTTTAC CACTTTTCAG CCATTTATTA ATATGGGAAT TTAACTTACT CAAGCAATAG TTGAAGGGAA GGTGCATATT ATCACGGATG CAATTTATGT TGTGTGCCAG TCTGGTCCCA AACATCAATT TCTTAACATG AGCTCCAGTT TACCTAAATG TTCACTGACA CAAAGGATGA GATTACACCT ACAGTGACTC TGAGTAGTCA CATATATAAG CACTGCACAT GAGATATAGA TCCGTAGAAT TGTCAGGAGT GCACCTCTCT ACTTGGGAGG TACAATTGCC ATATGATTTC TAGCTGCCAT GGTGGTTAGG AATGTGATAC TGCCTGTTTG CAAAGTCACA GACCTTGCCT CAGAAGGAGC TGTGAGCCAG TATTCATTTA AGAGAATTCC ACCACACTGG CGGCCCGCGC TTGAT (SEQ ID NO:3).
32. A DNA molecule of claim 31 which comprises from about nucleotide 126 to about nucleotide 1382 of SEQ ID NO:3.
33. An expression vector for expressing a human nNR2 protein wherein said expression vector comprises a DNA molecule of claim 31.
34. An expression vector for expressing a human nNR2 protein wherein said expression vector comprises a DNA molecule of claim 32.
35. A host cell which expresses a recombinant human nNR2 protein wherein said host cell contains the expression vector of claim 33.
36. A host cell which expresses a recombinant human nNR2 protein wherein said host cell contains the expression vector of claim 34.
37. A process for expressing a human nNR2 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 33 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said the human nNR1 protein from said expression vector.
38. A purified DNA molecule encoding a human nNR2 protein wherein said DNA molecule consists of the nucleotide sequence as set forth in SEQ ID NO:3, as follows:
GCGGGCCGCC AGTGTGGTGG AATTCGGCTT GTCACTAGGA GAACATTTGT GTTAATTGCA CTGTGCTCTG TCAAGGAAAC TTTGATTTAT AGCTGGGGTG CACAAATAAT GGTTGCCGGT CGCACATGGA TTCGGTAGAA CTTTGCCTTC CTGAATCTTT TTCCCTGCAC TACGAGGAAG AGCTTCTCTG CAGAATGTCA AACAAAGATC GACACATTGA TTCCAGCTGT TCGTCCTTCA TCAAGACGGA ACCTTCCAGC CCAGCCTCCC TGACGGACAG CGTCAACCAC CACAGCCCTG GTGGCTCTTC AGACGCCAGT GGGAGCTACA GTTCAACCAT GAATGGCCAT CAGAACGGAC TTGACTCGCC ACCTCTCTAC CCTTCTGCTC CTATCCTGGG AGGTAGTGGG CCTGTCAGGA AACTGTATGA TGACTGCTCC AGCACCATTG TTGAAGATCC CCAGACCAAG TGTGAATACA TGCTCAACTC GATGCCCAAG AGACTGTGTT TAGTGTGTGG TGACATCGCT TCTGGGTACC ACTATGGGGT AGCATCATGT GAAGCCTGCA AGGCATTCTT CAAGAGGACA ATTCAAGGCA ATATAGAATA CAGCTGCCCT GCCACGAATG AATGTGAAAT CACAAAGCGC AGACGTAAAT CCTGCCAGGC TTGCCGCTTC ATGAAGTGTT TAAAAGTGGG CATGCTGAAA GAAGGGGTGC GTCTTGACAG AGTACGTGGA GGTCGGCAGA AGTACAAGCG CAGGATAGAT GCGGAGAACA GCCCATACCT GAACCCTCAG CTGGTTCAGC CAGCCAAAAA GCCATATAAC AAGATTGTCT CACATTTGTT GGTGGCTGAA CCGGAGAAGA TCTATGCCAT GCCTGACCCT ACTGTCCCCG ACAGTGACAT CAAAGCCCTC ACTACACTGT GTGACTTGGC CGACCGAGAG TTGGTGGTTA TCATTGGATG GGCGAAGCAT ATTCCAGGCT TCTCCACGCT GTCCCTGGCG GACCAGATGA GCCTTCTGCA GAGTGCTTGG ATGGAAATTT TGATCCTTGG TGTCGTATAC CGGTCTCTTT CATTTGAGGA TGAACTTGTC TATGCAGACG ATTATATAAT GGACGAAGAC CAGTCCAAAT TAGCAGGCCT TCTTGATCTA AATAATGCTA TCCTGCAGCT GGTAAAGAAA TACAAGAGCA TGAAGCTGGA AAAAGAAGAA TTTGTCACCC TCAAAGCTAT AGCTCTTGCT AATTCAGACT CCATGCACAT AGAAGATGTT GAAGCCGTTC AGAAGCTTCA GGATGTCTTA CATGAAGCGC TGCAGGATTA TGAAGCTGGC CAGCACATGG AAGACCCTCG TCGAGCTGGC AAGATGCTGA TGACACTGCC ACTCCTGAGG CAGACCTCTA CCAAGGCCGT GCAGCATTTC TACAACATCA AACTAGAAGG CAAAGTCCCA ATGCACAAAC TTTTTTTGGA AATGTTGGAG GCCAAGGTCT GACTAAAAGC TCCCTGGGCC TTCCCATCCT TCATGTTGAA AAAGGGAAAA TAAACCCAAG AGTGATGTCG AAGAAACTTA GAGTTTAGTT AACAACATCA AAAATCAACA GACTGCACTG ATAATTTAGC AGCAAGACTA TGAAGCAGCT TTCAGATTCC TCCATAGGTT CCTGATGAGT TCTTTCTACT TTCTCCATCA TCTTCTTTCC TCTTTCTTCC CACATTTCTC TTTCTCTTTA TTTTTTCTCC TTTTCTTCTT TCACCTCCCT TATTTCTTTG CTTCTTTCAT TCCTAGTTCC CATTCTCCTT 
TATTTTCTTC CCGTCTGCCT GCCTTCTTTC TTTTCTTTAC CTACTCTCAT TCCTCTCTTT TCTCATCCTT CCCCTTTTTT CTAAATTTGA AATAGCTTTA GTTTAAAAAA AAAAATCCTC CCTTCCCCCT TTCCTTTCCC TTTCTTTCCT TTTTCCCTTT CCTTTTCCCT TTCCTTTCCT TTCCTCTTGA CCTTCTTTCC ATCTTTCTTT TTCTTCCTTC TGCTGCTGAA CTTTTAAAAG AGGTCTCTAA CTGAAGAGAG ATGGAAGCCA GCCCTGCCAA AGGATGGAGA TCCATAATAT GGATGCCAGT GAACTTATTG TGAACCATAC CGTCCCCAAT GACTAAGGAA TCAAAGAGAG AGAACCAACG TTCCTAAAAG TACAGTGCAA CATATACAAA TTGACTGAGT GCAGTATTAG ATTTCATGGG AGCAGCCTCT AATTAGACAA CTTAAGCAAC GTTGCATCGG CTGCTTCTTA TCATTGCTTT TCCATCTAGA TCAGTTACAG CCATTTGATT CCTTAATTGT TTTTTCAAGT CTTCCAGGTA TTTGTTAGTT TAGCTACTAT GTAACTTTTT CAGGGAATAG TTTAAGCTTT ATTCATTCAT GCAATACTAA AGAGAAATAA GAATACTGCA ATTTTGTGCT GGCTTTGAAC AATTACGAAC AATAATGAAG GACAAATGAA TCCTGAAGGA AGATTTTTAA AAATGTTTTG TTTCTTCTTA CAAATGGAGA TTTTTTTGTA CCAGCTTTAC CACTTTTCAG CCATTTATTA ATATGGGAAT TTAACTTACT CAAGCAATAG TTGAAGGGAA GGTGCATATT ATCACGGATG CAATTTATGT TGTGTGCCAG TCTGGTCCCA AACATCAATT TCTTAACATG AGCTCCAGTT TACCTAAATG TTCACTGACA CAAAGGATGA GATTACACCT ACAGTGACTC TGAGTAGTCA CATATATAAG CACTGCACAT GAGATATAGA TCCGTAGAAT TGTCAGGAGT GCACCTCTCT ACTTGGGAGG TACAATTGCC ATATGATTTC TAGCTGCCAT GGTGGTTAGG AATGTGATAC TGCCTGTTTG CAAAGTCACA GACCTTGCCT CAGAAGGAGC TGTGAGCCAG TATTCATTTA AGAGAATTCC ACCACACTGG CGGCCCGCGC TTGAT (SEQ ID NO: 3 ) .
39. A DNA molecule of claim 38 which consists of nucleotide 126 to about nucleotide 1382 of SEQ ID NO:3.
40. An expression vector for expressing a human nNR2 protein wherein said expression vector comprises a DNA molecule of claim 38.
41. An expression vector for expressing a human nNR2 protein wherein said expression vector comprises a DNA molecule of claim 39.
42. A host cell which expresses a recombinant human nNR2 protein wherein said host cell contains the expression vector of claim 40.
43. A host cell which expresses a recombinant human nNR2 protein wherein said host cell contains the expression vector of claim 41.
44. A process for expressing a human nNR2 protein in a recombinant host cell, comprising:
(a) transfecting the expression vector of claim 40 into a suitable host cell; and,
(b) culturing the host cells of step (a) under conditions which allow expression of said the human nNR2 protein from said expression vector.
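The nucleotide coordinates recited in claims 32 and 39 (nucleotide 126 to nucleotide 1382 of SEQ ID NO:3) can be checked with a short script. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the spaces in the printed sequence only group nucleotides in blocks of ten. The helper names `clean_sequence` and `subsequence` are hypothetical and are not part of the patent. Only the first 130 nucleotides of SEQ ID NO:3 are reproduced here, which is enough to inspect the codon at position 126.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group nucleotides in blocks of ten."""
    return "".join(raw.split()).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return nucleotides start..end using 1-based, inclusive coordinates (assumed)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# First 130 nucleotides of SEQ ID NO:3 as printed in the claims (illustration only).
SEQ3_PREFIX = clean_sequence(
    "GCGGGCCGCC AGTGTGGTGG AATTCGGCTT GTCACTAGGA GAACATTTGT GTTAATTGCA "
    "CTGTGCTCTG TCAAGGAAAC TTTGATTTAT AGCTGGGGTG CACAAATAAT GGTTGCCGGT "
    "CGCACATGGA"
)

start, end = 126, 1382

# Length of the claimed region and whether it divides evenly into codons.
region_length = end - start + 1
codon_count, remainder = divmod(region_length, 3)

# The three nucleotides beginning at position 126.
first_codon = subsequence(SEQ3_PREFIX, start, start + 2)

print(region_length, codon_count, remainder, first_codon)
```

Under these assumptions the region spans 1257 nucleotides, a whole number of codons (419), and the triplet at position 126 is ATG, which is consistent with the region being a coding sequence. To examine the full region, the complete SEQ ID NO:3 text can be passed to `clean_sequence` and then to `subsequence` with the same coordinates.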
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US5709097P | 1997-08-27 | 1997-08-27 | |
US57090P | 1997-08-27 | | |
US6290297P | 1997-10-21 | 1997-10-21 | |
US62902P | 1997-10-21 | | |
US7863398P | 1998-03-19 | 1998-03-19 | |
US78633P | 1998-03-19 | | |
PCT/US1998/017826 WO1999010367A1 (en) | 1997-08-27 | 1998-08-27 | DNA molecules encoding human nuclear receptor proteins |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1007540A1 (en) | 2000-06-14 |
EP1007540A4 (en) | 2003-06-18 |
Family
ID=27369158
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP98943441A Withdrawn EP1007540A4 (en) | 1997-08-27 | 1998-08-27 | DNA molecules encoding human nuclear receptor proteins |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1007540A4 (en) |
JP (1) | JP2001513984A (en) |
CA (1) | CA2301554A1 (en) |
WO (1) | WO1999010367A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0866127A3 (en) | 1997-03-17 | 1999-12-22 | Smithkline Beecham Plc | HE8AN36, a steroid hormone receptor homolog |
WO2000042180A1 (en) * | 1999-01-14 | 2000-07-20 | Kyowa Hakko Kogyo Co., Ltd. | Novel protein |
US20060148030A1 (en) * | 2002-03-25 | 2006-07-06 | Fujisawa Pharmaceutical Co., Ltd | Nuclear receptor ERRγ3 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5071773A (en) * | 1986-10-24 | 1991-12-10 | The Salk Institute For Biological Studies | Hormone receptor-related bioassays |
1998
- 1998-08-27: JP application JP2000507693A, published as JP2001513984A (en); status: Pending
- 1998-08-27: EP application EP98943441A, published as EP1007540A4 (en); status: Withdrawn
- 1998-08-27: CA application CA002301554A, published as CA2301554A1 (en); status: Abandoned
- 1998-08-27: WO application PCT/US1998/017826, published as WO1999010367A1 (en); status: Application Discontinuation
Non-Patent Citations (9)
Title |
---|
DATABASE EMBL [Online] 1 December 1995 (1995-12-01) "ys81d07.r1 Homo sapiens cDNA clone 221197 5' similar to gb:X51417_cds1 STEROID HORMONE RECEPTOR ERR2 (HUMAN)" retrieved from EBI Database accession no. H91890 XP002239023 * |
DATABASE EMBL [Online] 1 December 1995 (1995-12-01) "ys81d07.s1 Homo sapiens cDNA clone 221197 3' similar to gb:X51417_cds1 STEROID HORMONE RECEPTOR ERR2 (HUMAN)" retrieved from EBI Database accession no. H91842 XP002239026 * |
DATABASE EMBL [Online] 16 November 1995 (1995-11-16) "ys69h11.r1 Homo sapiens cDNA clone 220101 5' similar to gb:X51417_cds1 STEROID HORMONE RECEPTOR ERR2 (HUMAN)" retrieved from EBI Database accession no. H82542 XP002239024 * |
DATABASE EMBL [Online] 16 November 1995 (1995-11-16) "ys69h11.s1 Homo sapiens cDNA clone 220101 3' similar to gb:X51417_cds1 STEROID HORMONE RECEPTOR ERR2 (HUMAN)" retrieved from EBI Database accession no. H82543 XP002239025 * |
DATABASE EMBL [Online] 27 January 1996 (1996-01-27) "yw88d02.r1 Homo sapiens cDNA clone 259299 5' similar to PIR:JC2390 JC2390 nuclear receptor Rev-ErbA beta - rat [1]" retrieved from EBI Database accession no. N41813 XP002239020 * |
DATABASE EMBL [Online] 29 May 1995 (1995-05-29) "yg96e07.r1 Homo sapiens cDNA clone 41139 5' similar to SP:RNR1_RAT Q07917 REGENERATING LIVER NUCLEAR RECEPTOR 1" retrieved from EBI Database accession no. R59032 XP002239022 * |
DATABASE EMBL [Online] 8 April 1995 (1995-04-08) "yd21f09.r1 Homo sapiens cDNA clone 67817 5' similar to SP:ROR1_HUMAN P35397 NUCLEAR RECEPTOR" retrieved from EBI Database accession no. T79024 XP002239021 * |
ENMARK E ET AL: "ORPHAN NUCLEAR RECEPTORS - THE FIRST EIGHT YEARS" MOLECULAR ENDOCRINOLOGY, BALTIMORE, MD, US, vol. 10, no. 11, 1 November 1996 (1996-11-01), pages 1293-1307, XP000645166 ISSN: 0888-8809 * |
See also references of WO9910367A1 * |
Also Published As
Publication number | Publication date |
---|---|
WO1999010367A1 (en) | 1999-03-04 |
JP2001513984A (en) | 2001-09-11 |
CA2301554A1 (en) | 1999-03-04 |
EP1007540A4 (en) | 2003-06-18 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
EP1017811A1 (en) | G-protein coupled glycoprotein hormone receptor hg38 | |
US6054295A (en) | DNA molecules encoding human nuclear receptor proteins | |
US7029865B2 (en) | DNA molecules encoding the melanocortin 4 receptor protein from rhesus monkey | |
US6693184B1 (en) | DNA molecules encoding splice variants of the human melanocortin 1 receptor protein | |
WO1999010367A1 (en) | DNA molecules encoding human nuclear receptor proteins | |
JP2002508393A (en) | DNA molecules encoding human nuclear receptor proteins nNR7 and nNR7-1 | |
US20030119100A1 (en) | DNA molecules encoding human nuclear receptor proteins | |
US6645738B1 (en) | DNA molecules encoding the melanocortin 5 receptor protein from rhesus monkey | |
US7060463B2 (en) | DNA molecules encoding Macaca mulatta androgen receptor | |
CA2314434A1 (en) | DNA molecules encoding human nuclear receptor protein, nNR5 | |
EP1012155A1 (en) | Human uncoupling protein 3 | |
EP1037912A1 (en) | DNA MOLECULES ENCODING VERTEBRATE NUCLEAR RECEPTOR PROTEIN, nNR4 | |
EP1133515A1 (en) | DNA molecules encoding hg51, a g-protein-coupled receptor | |
WO2000027862A1 (en) | DNA molecules encoding the melanocortin 3 receptor protein from rhesus monkey | |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under Article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
17P | Request for examination filed | Effective date: 20000327 |
AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU NL PT SE |
A4 | Supplementary search report drawn up and despatched | Effective date: 20030508 |
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
17Q | First examination report despatched | Effective date: 20031211 |
18W | Application withdrawn | Effective date: 20031208 |