CA2340617A1 - Genes associated with neurotransmitter processing - Google Patents
Genes associated with neurotransmitter processing Download PDFInfo
- Publication number
- CA2340617A1 CA2340617A1 CA002340617A CA2340617A CA2340617A1 CA 2340617 A1 CA2340617 A1 CA 2340617A1 CA 002340617 A CA002340617 A CA 002340617A CA 2340617 A CA2340617 A CA 2340617A CA 2340617 A1 CA2340617 A1 CA 2340617A1
- Authority
- CA
- Canada
- Prior art keywords
- gene
- neurotransmitter
- genes
- processing
- polynucleotide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 215
- 238000012545 processing Methods 0.000 title claims description 60
- 239000002858 neurotransmitter agent Substances 0.000 title abstract description 47
- 238000000034 method Methods 0.000 claims abstract description 73
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 70
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 65
- 229920001184 polypeptide Polymers 0.000 claims abstract description 63
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 45
- 201000010099 disease Diseases 0.000 claims abstract description 37
- 239000013604 expression vector Substances 0.000 claims abstract description 17
- 102000040430 polynucleotide Human genes 0.000 claims description 63
- 108091033319 polynucleotide Proteins 0.000 claims description 63
- 239000002157 polynucleotide Substances 0.000 claims description 63
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 claims description 60
- 229960003638 dopamine Drugs 0.000 claims description 30
- 230000014509 gene expression Effects 0.000 claims description 28
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 20
- 239000000523 sample Substances 0.000 claims description 20
- 108700032673 cocaine- and amphetamine-regulated transcript Proteins 0.000 claims description 19
- 238000009396 hybridization Methods 0.000 claims description 18
- 150000001413 amino acids Chemical class 0.000 claims description 15
- 108010038439 Chromogranin B Proteins 0.000 claims description 12
- 102000008109 Mixed Function Oxygenases Human genes 0.000 claims description 12
- 108010074633 Mixed Function Oxygenases Proteins 0.000 claims description 12
- 101150004420 RAB3A gene Proteins 0.000 claims description 12
- 101001129803 Homo sapiens Paired mesoderm homeobox protein 2A Proteins 0.000 claims description 11
- 108010002533 Secretogranin II Proteins 0.000 claims description 11
- 239000002243 precursor Substances 0.000 claims description 11
- 229960004441 tyrosine Drugs 0.000 claims description 10
- 102000003823 Aromatic-L-amino-acid decarboxylases Human genes 0.000 claims description 9
- 108090000121 Aromatic-L-amino-acid decarboxylases Proteins 0.000 claims description 9
- 102000019315 Nicotinic acetylcholine receptors Human genes 0.000 claims description 9
- 108050006807 Nicotinic acetylcholine receptors Proteins 0.000 claims description 9
- 102000009659 Vesicular Monoamine Transport Proteins Human genes 0.000 claims description 9
- 108010020033 Vesicular Monoamine Transport Proteins Proteins 0.000 claims description 9
- 229940024606 amino acid Drugs 0.000 claims description 7
- 239000008194 pharmaceutical composition Substances 0.000 claims description 7
- 230000000295 complement effect Effects 0.000 claims description 6
- 239000003937 drug carrier Substances 0.000 claims description 6
- 239000012472 biological sample Substances 0.000 claims description 4
- 102000010791 Chromogranin B Human genes 0.000 claims 4
- 102000000705 Secretogranin II Human genes 0.000 claims 4
- 239000005557 antagonist Substances 0.000 abstract description 5
- 239000000556 agonist Substances 0.000 abstract description 4
- 210000004027 cell Anatomy 0.000 description 32
- 102000004169 proteins and genes Human genes 0.000 description 29
- 235000018102 proteins Nutrition 0.000 description 28
- 239000002299 complementary DNA Substances 0.000 description 27
- 150000007523 nucleic acids Chemical group 0.000 description 22
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 20
- 241000282414 Homo sapiens Species 0.000 description 20
- 229960002748 norepinephrine Drugs 0.000 description 20
- SFLSHLFXELFNJZ-UHFFFAOYSA-N norepinephrine Natural products NCC(O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-UHFFFAOYSA-N 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- 239000012634 fragment Substances 0.000 description 16
- 102000039446 nucleic acids Human genes 0.000 description 16
- 108020004707 nucleic acids Proteins 0.000 description 16
- 230000001225 therapeutic effect Effects 0.000 description 14
- 210000001519 tissue Anatomy 0.000 description 14
- 230000015572 biosynthetic process Effects 0.000 description 13
- 108030001679 Endothelin-converting enzyme 1 Proteins 0.000 description 12
- 239000002773 nucleotide Substances 0.000 description 12
- 125000003729 nucleotide group Chemical group 0.000 description 12
- 238000011282 treatment Methods 0.000 description 12
- 238000003786 synthesis reaction Methods 0.000 description 11
- 102000048186 Endothelin-converting enzyme 1 Human genes 0.000 description 10
- 102100020867 Secretogranin-1 Human genes 0.000 description 10
- 239000013598 vector Substances 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 208000018737 Parkinson disease Diseases 0.000 description 8
- 102100035835 Secretogranin-2 Human genes 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 230000004186 co-expression Effects 0.000 description 8
- 238000003745 diagnosis Methods 0.000 description 8
- 208000035475 disorder Diseases 0.000 description 8
- 239000003814 drug Substances 0.000 description 8
- 206010015037 epilepsy Diseases 0.000 description 8
- 238000004519 manufacturing process Methods 0.000 description 8
- 238000002560 therapeutic procedure Methods 0.000 description 8
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 108020004999 messenger RNA Proteins 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- -1 regulatory sequences Proteins 0.000 description 7
- 201000000980 schizophrenia Diseases 0.000 description 7
- 238000003556 assay Methods 0.000 description 6
- 210000004556 brain Anatomy 0.000 description 6
- 230000000875 corresponding effect Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 238000011156 evaluation Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 108020004635 Complementary DNA Proteins 0.000 description 5
- 238000000729 Fisher's exact test Methods 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 241000283973 Oryctolagus cuniculus Species 0.000 description 5
- 238000010171 animal model Methods 0.000 description 5
- 230000027455 binding Effects 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 210000002569 neuron Anatomy 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 4
- 102000007345 Chromogranins Human genes 0.000 description 4
- 108010007718 Chromogranins Proteins 0.000 description 4
- 108091035707 Consensus sequence Proteins 0.000 description 4
- 108010048671 Homeodomain Proteins Proteins 0.000 description 4
- 102000009331 Homeodomain Proteins Human genes 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 239000005556 hormone Substances 0.000 description 4
- 229940088597 hormone Drugs 0.000 description 4
- 230000000926 neurological effect Effects 0.000 description 4
- 230000002265 prevention Effects 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 230000001850 reproductive effect Effects 0.000 description 4
- 239000001509 sodium citrate Substances 0.000 description 4
- 229930182837 (R)-adrenaline Natural products 0.000 description 3
- UCTWMZQNUQWSLP-VIFPVBQESA-N (R)-adrenaline Chemical compound CNC[C@H](O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-VIFPVBQESA-N 0.000 description 3
- 241000282472 Canis lupus familiaris Species 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 102100029113 Endothelin-converting enzyme 2 Human genes 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108091060211 Expressed sequence tag Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 3
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 3
- 102100031686 Paired mesoderm homeobox protein 2A Human genes 0.000 description 3
- 108091093037 Peptide nucleic acid Proteins 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 229960005139 epinephrine Drugs 0.000 description 3
- 230000028023 exocytosis Effects 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 230000001900 immune effect Effects 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 238000002493 microarray Methods 0.000 description 3
- 230000026731 phosphorylation Effects 0.000 description 3
- 238000006366 phosphorylation reaction Methods 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 3
- 229940038773 trisodium citrate Drugs 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- SNICXCGAKADSCV-JTQLQIEISA-N (-)-Nicotine Chemical compound CN1CCC[C@H]1C1=CC=CN=C1 SNICXCGAKADSCV-JTQLQIEISA-N 0.000 description 2
- KWTSXDURSIMDCE-QMMMGPOBSA-N (S)-amphetamine Chemical compound C[C@H](N)CC1=CC=CC=C1 KWTSXDURSIMDCE-QMMMGPOBSA-N 0.000 description 2
- 208000006096 Attention Deficit Disorder with Hyperactivity Diseases 0.000 description 2
- 208000036864 Attention deficit/hyperactivity disease Diseases 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 101710139359 Death-associated protein kinase 3 Proteins 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- 102100029112 Endothelin-converting enzyme 1 Human genes 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 108010014173 Factor X Proteins 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 108010051696 Growth Hormone Proteins 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000956149 Homo sapiens Death-associated protein kinase 3 Proteins 0.000 description 2
- 101000841255 Homo sapiens Endothelin-converting enzyme 2 Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 208000012902 Nervous system disease Diseases 0.000 description 2
- 208000025966 Neurological disease Diseases 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 101710192385 Secretogranin-1 Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 101710138331 Somatostatin-1 Proteins 0.000 description 2
- 102100038803 Somatotropin Human genes 0.000 description 2
- 102100036407 Thioredoxin Human genes 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- OIPILFWXSMYKGL-UHFFFAOYSA-N acetylcholine Chemical compound CC(=O)OCC[N+](C)(C)C OIPILFWXSMYKGL-UHFFFAOYSA-N 0.000 description 2
- 229960004373 acetylcholine Drugs 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 229940025084 amphetamine Drugs 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 208000015802 attention deficit-hyperactivity disease Diseases 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000002869 basic local alignment search tool Methods 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 210000001124 body fluid Anatomy 0.000 description 2
- 239000010839 body fluid Substances 0.000 description 2
- 150000003943 catecholamines Chemical class 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 229960003920 cocaine Drugs 0.000 description 2
- 230000009918 complex formation Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 238000006114 decarboxylation reaction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000006806 disease prevention Effects 0.000 description 2
- 210000005064 dopaminergic neuron Anatomy 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 239000000122 growth hormone Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 238000003018 immunoassay Methods 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 208000035231 inattentive type attention deficit hyperactivity disease Diseases 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 210000003963 intermediate filament Anatomy 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 230000003472 neutralizing effect Effects 0.000 description 2
- 229960002715 nicotine Drugs 0.000 description 2
- SNICXCGAKADSCV-UHFFFAOYSA-N nicotine Natural products CN1CCCC1C1=CC=CN=C1 SNICXCGAKADSCV-UHFFFAOYSA-N 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 230000027758 ovulation cycle Effects 0.000 description 2
- 208000007312 paraganglioma Diseases 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 102000054765 polymorphisms of proteins Human genes 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 210000002307 prostate Anatomy 0.000 description 2
- 102000016914 ras Proteins Human genes 0.000 description 2
- 108010014186 ras Proteins Proteins 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 210000004739 secretory vesicle Anatomy 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 210000000225 synapse Anatomy 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 108060008226 thioredoxin Proteins 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- 101150072531 10 gene Proteins 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 1
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 208000000044 Amnesia Diseases 0.000 description 1
- 208000031091 Amnestic disease Diseases 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- GISFCCXBVJKGEO-QEJZJMRPSA-N Asp-Glu-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GISFCCXBVJKGEO-QEJZJMRPSA-N 0.000 description 1
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 208000020925 Bipolar disease Diseases 0.000 description 1
- 108010039209 Blood Coagulation Factors Proteins 0.000 description 1
- 102000015081 Blood Coagulation Factors Human genes 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000052052 Casein Kinase II Human genes 0.000 description 1
- 108010010919 Casein Kinase II Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 206010012289 Dementia Diseases 0.000 description 1
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 208000032131 Diabetic Neuropathies Diseases 0.000 description 1
- 201000010374 Down Syndrome Diseases 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 102000001301 EGF receptor Human genes 0.000 description 1
- 108060006698 EGF receptor Proteins 0.000 description 1
- 201000009273 Endometriosis Diseases 0.000 description 1
- 102000002045 Endothelin Human genes 0.000 description 1
- 108050009340 Endothelin Proteins 0.000 description 1
- 101710144982 Endothelin-converting enzyme 2 Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 208000000571 Fibrocystic breast disease Diseases 0.000 description 1
- 208000001287 Galactorrhea Diseases 0.000 description 1
- 206010017600 Galactorrhoea Diseases 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- 229920002683 Glycosaminoglycan Polymers 0.000 description 1
- 239000000579 Gonadotropin-Releasing Hormone Substances 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- MKWFGXSFLYNTKC-XIRDDKMYSA-N His-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N MKWFGXSFLYNTKC-XIRDDKMYSA-N 0.000 description 1
- 101000684275 Homo sapiens ADP-ribosylation factor 3 Proteins 0.000 description 1
- 101001130437 Homo sapiens Ras-related protein Rap-2b Proteins 0.000 description 1
- 208000023105 Huntington disease Diseases 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- 102000005741 Metalloproteases Human genes 0.000 description 1
- 108010006035 Metalloproteases Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102000029749 Microtubule Human genes 0.000 description 1
- 108091022875 Microtubule Proteins 0.000 description 1
- 101100054858 Mus musculus Adam12 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 208000009905 Neurofibromatoses Diseases 0.000 description 1
- 108010088373 Neurofilament Proteins Proteins 0.000 description 1
- 102000008763 Neurofilament Proteins Human genes 0.000 description 1
- 102000005665 Neurotransmitter Transport Proteins Human genes 0.000 description 1
- 108010084810 Neurotransmitter Transport Proteins Proteins 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 206010033266 Ovarian Hyperstimulation Syndrome Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 206010061332 Paraganglion neoplasm Diseases 0.000 description 1
- 208000027099 Paranoid disease Diseases 0.000 description 1
- 208000004362 Penile Induration Diseases 0.000 description 1
- 208000020758 Peyronie disease Diseases 0.000 description 1
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 1
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 1
- 229920000776 Poly(Adenosine diphosphate-ribose) polymerase Polymers 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- 102000003946 Prolactin Human genes 0.000 description 1
- 108010057464 Prolactin Proteins 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 102000003923 Protein Kinase C Human genes 0.000 description 1
- 108090000315 Protein Kinase C Proteins 0.000 description 1
- 108020004518 RNA Probes Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 239000003391 RNA probe Substances 0.000 description 1
- 101710140254 RUN domain-containing protein 3A Proteins 0.000 description 1
- 102100022665 RUN domain-containing protein 3A Human genes 0.000 description 1
- 102100031421 Ras-related protein Rap-2b Human genes 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 101800003628 Secretoneurin Proteins 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- 108010052164 Sodium Channels Proteins 0.000 description 1
- 102000018674 Sodium Channels Human genes 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 101710138332 Somatostatin-2 Proteins 0.000 description 1
- 101000857870 Squalus acanthias Gonadoliberin Proteins 0.000 description 1
- 102000005465 Stathmin Human genes 0.000 description 1
- 108050003387 Stathmin Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 206010043118 Tardive Dyskinesia Diseases 0.000 description 1
- 208000031320 Teratogenesis Diseases 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- 208000000323 Tourette Syndrome Diseases 0.000 description 1
- 208000016620 Tourette disease Diseases 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 206010044565 Tremor Diseases 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 206010044688 Trisomy 21 Diseases 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- 108091000117 Tyrosine 3-Monooxygenase Proteins 0.000 description 1
- 206010046798 Uterine leiomyoma Diseases 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 101710102828 Vesicle-associated protein Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000006986 amnesia Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000003143 atherosclerotic effect Effects 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 210000003445 biliary tract Anatomy 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 239000003114 blood coagulation factor Substances 0.000 description 1
- 229940019700 blood coagulation factors Drugs 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 208000011803 breast fibrocystic disease Diseases 0.000 description 1
- 210000000621 bronchi Anatomy 0.000 description 1
- 239000007975 buffered saline Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 206010007776 catatonia Diseases 0.000 description 1
- 230000005779 cell damage Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 208000037887 cell injury Diseases 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 208000026106 cerebrovascular disease Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000000546 chi-square test Methods 0.000 description 1
- 210000002477 chromaffin system Anatomy 0.000 description 1
- 238000003200 chromosome mapping Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000009137 competitive binding Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 208000029078 coronary artery disease Diseases 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- SLPJGDQJLTYWCI-UHFFFAOYSA-N dimethyl-(4,5,6,7-tetrabromo-1h-benzoimidazol-2-yl)-amine Chemical compound BrC1=C(Br)C(Br)=C2NC(N(C)C)=NC2=C1Br SLPJGDQJLTYWCI-UHFFFAOYSA-N 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 208000010118 dystonia Diseases 0.000 description 1
- 201000003511 ectopic pregnancy Diseases 0.000 description 1
- 210000003372 endocrine gland Anatomy 0.000 description 1
- 230000002357 endometrial effect Effects 0.000 description 1
- 208000023965 endometrium neoplasm Diseases 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 210000003499 exocrine gland Anatomy 0.000 description 1
- 238000012820 exploratory laparotomy Methods 0.000 description 1
- 210000001508 eye Anatomy 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 210000000609 ganglia Anatomy 0.000 description 1
- 210000005095 gastrointestinal system Anatomy 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- XLXSAKCOAKORKW-AQJXLSMYSA-N gonadorelin Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 XLXSAKCOAKORKW-AQJXLSMYSA-N 0.000 description 1
- 229940035638 gonadotropin-releasing hormone Drugs 0.000 description 1
- 210000000020 growth cone Anatomy 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 201000000079 gynecomastia Diseases 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000007914 intraventricular administration Methods 0.000 description 1
- 210000004153 islets of langerhan Anatomy 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 210000000867 larynx Anatomy 0.000 description 1
- 201000010901 lateral sclerosis Diseases 0.000 description 1
- 201000010260 leiomyoma Diseases 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000013332 literature search Methods 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000012775 microarray technology Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 210000004688 microtubule Anatomy 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 208000005264 motor neuron disease Diseases 0.000 description 1
- 210000000214 mouth Anatomy 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 208000010125 myocardial infarction Diseases 0.000 description 1
- 208000018389 neoplasm of cerebral hemisphere Diseases 0.000 description 1
- 210000005170 neoplastic cell Anatomy 0.000 description 1
- 208000004296 neuralgia Diseases 0.000 description 1
- 230000000955 neuroendocrine Effects 0.000 description 1
- 201000004931 neurofibromatosis Diseases 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 230000007658 neurological function Effects 0.000 description 1
- 230000007514 neuronal growth Effects 0.000 description 1
- 210000000607 neurosecretory system Anatomy 0.000 description 1
- 230000002474 noradrenergic effect Effects 0.000 description 1
- 210000001331 nose Anatomy 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000000624 ovulatory effect Effects 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 210000003899 penis Anatomy 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 210000001428 peripheral nervous system Anatomy 0.000 description 1
- 208000033808 peripheral neuropathy Diseases 0.000 description 1
- 210000001539 phagocyte Anatomy 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 210000003800 pharynx Anatomy 0.000 description 1
- 229960005190 phenylalanine Drugs 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 201000010065 polycystic ovary syndrome Diseases 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000036584 pressor response Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 229940097325 prolactin Drugs 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 201000001514 prostate carcinoma Diseases 0.000 description 1
- 201000007094 prostatitis Diseases 0.000 description 1
- 230000004224 protection Effects 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000018883 protein targeting Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000008960 regulation of mRNA stability Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 125000006853 reporter group Chemical group 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 210000001625 seminal vesicle Anatomy 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 210000002356 skeleton Anatomy 0.000 description 1
- IFGCUJZIWBUILZ-UHFFFAOYSA-N sodium 2-[[2-[[hydroxy-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyphosphoryl]amino]-4-methylpentanoyl]amino]-3-(1H-indol-3-yl)propanoic acid Chemical compound [Na+].C=1NC2=CC=CC=C2C=1CC(C(O)=O)NC(=O)C(CC(C)C)NP(O)(=O)OC1OC(C)C(O)C(O)C1O IFGCUJZIWBUILZ-UHFFFAOYSA-N 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 231100000527 sperm abnormality Toxicity 0.000 description 1
- 230000021595 spermatogenesis Effects 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 125000000446 sulfanediyl group Chemical group *S* 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 1
- 230000009452 underexpressoin Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 210000000626 ureter Anatomy 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000002550 vasoactive agent Substances 0.000 description 1
- 201000010653 vesiculitis Diseases 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- QAOHCFGKCWTBGC-QHOAOGIMSA-N wybutosine Chemical compound C1=NC=2C(=O)N3C(CC[C@H](NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O QAOHCFGKCWTBGC-QHOAOGIMSA-N 0.000 description 1
- QAOHCFGKCWTBGC-UHFFFAOYSA-N wybutosine Natural products C1=NC=2C(=O)N3C(CCC(NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1C1OC(CO)C(O)C1O QAOHCFGKCWTBGC-UHFFFAOYSA-N 0.000 description 1
- 238000001086 yeast two-hybrid system Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/08—Antiepileptics; Anticonvulsants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/14—Drugs for disorders of the nervous system for treating abnormal movements, e.g. chorea, dyskinesia
- A61P25/16—Anti-Parkinson drugs
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/18—Antipsychotics, i.e. neuroleptics; Drugs for mania or schizophrenia
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P5/00—Drugs for disorders of the endocrine system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2537/00—Reactions characterised by the reaction format or use of a specific feature
- C12Q2537/10—Reactions characterised by the reaction format or use of a specific feature the purpose or use of
- C12Q2537/143—Multiplexing, i.e. use of multiple primers or probes in a single reaction, usually for simultaneously analyse of multiple analysis
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Neurology (AREA)
- General Chemical & Material Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Animal Behavior & Ethology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Neurosurgery (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Pain & Pain Management (AREA)
- Psychology (AREA)
- Toxicology (AREA)
- Endocrinology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Diabetes (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Psychiatry (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Abstract
The invention provides five genes associated with neurotransmitter processin g and polypeptides encoded by those genes. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also provides methods for diagnosing, treating or preventing diseases.
Description
GENES ASSOCIATED WITH NEUROTRANSMITTER PROCESSING
A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent f le or records, but otherwise reserves all copyright rights whatsoever.
TECHNICAL FIELD
The invention relates to five genes associated with neurotransmitter processing and their corresponding gene products. The invention also relates to the use of these biomolecules in diagnosis, treatment, prevention, and evaluation oftherapies for neurological and hormone-related diseases, more particularly Parkinson's disease, schizophrenia, and epilepsy.
BACKGROUND OF THE INVENTION
Normal neurological function requires a careful balance of neurotransmitters.
An upset in the balance can lead to disease. For example, Parkinson's disease is triggered by an as yet unknown event that increases the susceptibility of dopaminergic neurons to oxidative processes which in turn leads to cell damage and death (Jenner, P. ( 1998) Mov. Disord.
13:24-34). The resulting lack of dopamine leads to tremors and rigidity in the patient, the main clinical manifestations of Parkinson's disease (Birkmayer, W. and P. Riederer ( 1986) Understanding_the Neurotranmitters: Kevs to the Workin~~s of the Brain, Springer-Verlag, New York NY).
Many other disorders result from an improper balance among neurotransmitters.
The central dopaminergic neurons have long been thought to play a significant role in schizophrenia (Farde, L. (1997) Schizophr. Res. 28:157-162), but a more complex picture is emerging where interactions among dopamine, norepinephrine and other neurotransmitters appears to be the basis of the disorder (Carlsson, A. et al. ( I 997) Life Sci. 61:75-94). Both dopamine and norepinpehrine are also implicated in attention deficit hyperactive disorder (Hechtman, L.
(1994) J. Psychiary Neurosci. 19: 193-201 ). In animal models norepinephrine has been implicated in epilepsy (Szot, P. et al. ( 1996) Brain Res. Mol. Brain Res. 43:233-245; Clough, R. W, et al.
( 1998) Epilepsy Res.
29:135-146); Janumpalli, S. et al. (1998) J. Neurosci. 18:2004-2008).
Norepinephrine directly regulates gonadotropin-releasing hormone and may therefore be involved in reproductive disorders (Becu-Villalobos, D. et al. ( 1997) Cell Mol. Neurobiol. 17:699-715;
Pau, K.Y. and H.G.
Spies (1997) Chin. J. Physiol. 40:181-196). Norepinephrine also regulates growth hormone and inhibition of dopamine /3 hydroxylase {DBH). DBH retards growth in animal models (Malozowski, S. et al. (1993) Acta Endocrinol. 129:554-558).
In general, neurological disorders are treated with drugs that modulate neurotransmitter level in an attempt to maintain the correct level. Consequently, any of the substrates, enzymes, receptors or neurotransmitter transporters as well as genes involved in their regulation are potentially of therapeutic use. We have discovered five new genes that exhibit significant coexpression with genes known to be involved in the synthesis and release of the neurotransmitters dopamine, norepinephrine and epinephrine. These new genes are useful in the diagnosis, prevention, treatment, and evaluation of therapies for diseases, particutarly neurological and hormone-related diseases, more particularly Parkinson's disease, schizophrenia, and epilepsy.
SUMMARY OF THE INVENTION
In one aspect, the invention provides for a substantially purified polynucleotide comprising a gene that is coexpressed with. one or more neurotransmitter-processing-specific genes in a plurality of biological samples. Preferably, each neurotransmitter-processing-specific gene is selected from(a) a polynucleotide sequence selected from the group consisting of SEQ ID
NOs: 1-5; (b) a polynucleotide sequence which encodes the polypeptide sequence of SEQ ID NO:
6; (c) a polynucleotide sequence which is complementary to the poiynucleotide sequence of (a) or (b); and (d) a probe which hybridizes to the polynucleotide of (a), (b), or (c). Furthermore, the invention provides an expression vector comprising any of the above described polynucleotides and host cells comprising the expression vector.
In a second aspect, the invention provides a substantially purified polypeptide comprising the gene product of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a plurality of biological samples. The neurotransmitter-processing-specific gene may be selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine (i-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI ), and ARIX
homeodomain protein. Preferred embodiments are (a) the polypeptide sequence of SEQ ID NO: 6, and (b) a polypeptide sequence comprising at least 6 sequential amino acids of the polypeptide sequence of (a) . Additionally, the invention provides antibodies that bind specifically to any of the above described polypeptides.
In another aspect, the invention provides a pharmaceutical composition comprising the polynucleotide or the polypeptide of the inventions in conjunction with a suitable pharmaceutical carrier and a method for treating or preventing disease comprising administering to a subject in need such a composition in an amount effective for treating or preventing said disease.
In yet a further aspect, the invention provides a method for diagnosing a disease or condition associated with the altered expression of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a sample, wherein each neurotransmitter-processing-specific gene is selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine (3-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin 1 and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI ), and ARIX homeodomain protein. The method comprises providing the sample comprising one of more of said coexpressed genes; hybridizing the polynucleotide to said coexpressed genes under conditions effective to form one or more hybridization complexes; and detecting the hybridization complexes, wherein the presence of one or more of the hybridization complexes correlates with the presence of the disease or condition.
BRIEF DESCRIPTION OF THE SEQUENCE LISTING
The Sequence Listing provides exemplary gene sequences associated with neurotransmitter processing, SEQ ID NOs: 1-5, and a polypeptide sequence, SEQ
ID NO: 6, derived from the gene sequence of SEQ ID N0:4. Each sequence is identified by a numerical sequence identification number and by the Incyte clone number from which the sequence was first identified.
DESCRIPTION OF THE INVENTION
It must be noted that as used herein and in the appended claims, the singular forms "a,"
"an," and "the" include the plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in the art; and so forth.
"NSEQ" refers generally to a polynucleotide sequence of the present invention, including SEQ ID NOs: 1-S. "PSEQ" refers generally to a polypeptide sequence of the present invention, including SEQ 1D NO: 6.
A " variant" refers to either a polynucleotide or a polypeptide whose sequence diverges from SEQ ID NOs: 1-5 or SEQ ID NO: 6, respectively. Polynucleotide sequence divergence may result from mutational changes such as deletions, additions, and substitutions of one or more nucleotides; it may also occur because of differences in codon usage. Each of these types of changes may occur alone, or in combination, one or more times in a given sequence. Polypeptide variants include sequences that possess at least one structural or functional characteristic of SEQ
1D NO: 6.
A "fragment" can refer to a nucleic acid sequence that is preferably at least 20 nucleic acids in iength, more preferably 40 nucleic acids, and most preferably 60 nucleic acids in length, and encompasses, for example, fragments consisting of nucleic acids 1-SO of SEQ ID NOs: l-5. A
"fragment" can also refer to polypeptide sequences which are preferably at least 5 to about 15 amino acids in length, most preferably at least 10 amino acids long, and which retain some biological activity or immunological activity of, for example, SEQ ID N0:6.
"Neurotransmitter-processing-specific genes" refers to genes which are involved in the processing, such as the synthesis or release, of neurotransmitters. For purposes of this invention, the neurotransmitters are preferably dopamine, norepinephrine and epinephrine.
"Gene associated with neurotransmitter processing" refers to a new gene sequence whose expression pattern is similar to that of the neurotransmitter-processing-specific genes and which is useful in the~diagnosis, treatment, or prevention of diseases associated with the altered expression of neurotransmitter processing genes. The gene sequences can also be used in the evaluation of therapies for these diseases.
"Substantially purified" refers to a nucleic acid or an amino acid sequence that is removed from its natural environment and is isolated or separated, and is at least about 60% free, preferably about 75% free, and most preferably about 90% free from other components with which it is naturally present.
THE INVENTION
The present invention encompasses five genes useful in the diagnosis, treatment, prevention, and evaluation of therapies for diseases related to improper neurotransmitter processing, particularly neurological and growth-related diseases, more particularly Parkinson's disease, schizophrenia, and epilepsy. These genes were identified using the method disclosed in the US patent application entitled "Prostate Cancer-Associated Genes", Walker et al., Serial No:09/102,615 filed June 22, 1998, herein incorporated by reference, and described briefly below.
The method provides first identifying polynucleotides that are expressed in a plurality of cDNA libraries. The polynucleotides may include genes of known function, genes known to be specifically expressed in a specific disease process, subcellular compartment, cell type, tissue type, or species. Additionally, the polynucleotides include genes of unknown function. The expression patterns of the known genes are then compared with those of the genes of unknown function to determine whether a specified coexpression probability threshold is met.
Through this comparison, a subset of the polynucleotides having a high coexpression probability with the known genes can be identified. The high coexpression probability correlates with a particular coexpression probability threshold which is less than 0.001, and more preferably less than 0.00001.
A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent f le or records, but otherwise reserves all copyright rights whatsoever.
TECHNICAL FIELD
The invention relates to five genes associated with neurotransmitter processing and their corresponding gene products. The invention also relates to the use of these biomolecules in diagnosis, treatment, prevention, and evaluation oftherapies for neurological and hormone-related diseases, more particularly Parkinson's disease, schizophrenia, and epilepsy.
BACKGROUND OF THE INVENTION
Normal neurological function requires a careful balance of neurotransmitters.
An upset in the balance can lead to disease. For example, Parkinson's disease is triggered by an as yet unknown event that increases the susceptibility of dopaminergic neurons to oxidative processes which in turn leads to cell damage and death (Jenner, P. ( 1998) Mov. Disord.
13:24-34). The resulting lack of dopamine leads to tremors and rigidity in the patient, the main clinical manifestations of Parkinson's disease (Birkmayer, W. and P. Riederer ( 1986) Understanding_the Neurotranmitters: Kevs to the Workin~~s of the Brain, Springer-Verlag, New York NY).
Many other disorders result from an improper balance among neurotransmitters.
The central dopaminergic neurons have long been thought to play a significant role in schizophrenia (Farde, L. (1997) Schizophr. Res. 28:157-162), but a more complex picture is emerging where interactions among dopamine, norepinephrine and other neurotransmitters appears to be the basis of the disorder (Carlsson, A. et al. ( I 997) Life Sci. 61:75-94). Both dopamine and norepinpehrine are also implicated in attention deficit hyperactive disorder (Hechtman, L.
(1994) J. Psychiary Neurosci. 19: 193-201 ). In animal models norepinephrine has been implicated in epilepsy (Szot, P. et al. ( 1996) Brain Res. Mol. Brain Res. 43:233-245; Clough, R. W, et al.
( 1998) Epilepsy Res.
29:135-146); Janumpalli, S. et al. (1998) J. Neurosci. 18:2004-2008).
Norepinephrine directly regulates gonadotropin-releasing hormone and may therefore be involved in reproductive disorders (Becu-Villalobos, D. et al. ( 1997) Cell Mol. Neurobiol. 17:699-715;
Pau, K.Y. and H.G.
Spies (1997) Chin. J. Physiol. 40:181-196). Norepinephrine also regulates growth hormone and inhibition of dopamine /3 hydroxylase {DBH). DBH retards growth in animal models (Malozowski, S. et al. (1993) Acta Endocrinol. 129:554-558).
In general, neurological disorders are treated with drugs that modulate neurotransmitter level in an attempt to maintain the correct level. Consequently, any of the substrates, enzymes, receptors or neurotransmitter transporters as well as genes involved in their regulation are potentially of therapeutic use. We have discovered five new genes that exhibit significant coexpression with genes known to be involved in the synthesis and release of the neurotransmitters dopamine, norepinephrine and epinephrine. These new genes are useful in the diagnosis, prevention, treatment, and evaluation of therapies for diseases, particutarly neurological and hormone-related diseases, more particularly Parkinson's disease, schizophrenia, and epilepsy.
SUMMARY OF THE INVENTION
In one aspect, the invention provides for a substantially purified polynucleotide comprising a gene that is coexpressed with. one or more neurotransmitter-processing-specific genes in a plurality of biological samples. Preferably, each neurotransmitter-processing-specific gene is selected from(a) a polynucleotide sequence selected from the group consisting of SEQ ID
NOs: 1-5; (b) a polynucleotide sequence which encodes the polypeptide sequence of SEQ ID NO:
6; (c) a polynucleotide sequence which is complementary to the poiynucleotide sequence of (a) or (b); and (d) a probe which hybridizes to the polynucleotide of (a), (b), or (c). Furthermore, the invention provides an expression vector comprising any of the above described polynucleotides and host cells comprising the expression vector.
In a second aspect, the invention provides a substantially purified polypeptide comprising the gene product of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a plurality of biological samples. The neurotransmitter-processing-specific gene may be selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine (i-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI ), and ARIX
homeodomain protein. Preferred embodiments are (a) the polypeptide sequence of SEQ ID NO: 6, and (b) a polypeptide sequence comprising at least 6 sequential amino acids of the polypeptide sequence of (a) . Additionally, the invention provides antibodies that bind specifically to any of the above described polypeptides.
In another aspect, the invention provides a pharmaceutical composition comprising the polynucleotide or the polypeptide of the inventions in conjunction with a suitable pharmaceutical carrier and a method for treating or preventing disease comprising administering to a subject in need such a composition in an amount effective for treating or preventing said disease.
In yet a further aspect, the invention provides a method for diagnosing a disease or condition associated with the altered expression of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a sample, wherein each neurotransmitter-processing-specific gene is selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine (3-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin 1 and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI ), and ARIX homeodomain protein. The method comprises providing the sample comprising one of more of said coexpressed genes; hybridizing the polynucleotide to said coexpressed genes under conditions effective to form one or more hybridization complexes; and detecting the hybridization complexes, wherein the presence of one or more of the hybridization complexes correlates with the presence of the disease or condition.
BRIEF DESCRIPTION OF THE SEQUENCE LISTING
The Sequence Listing provides exemplary gene sequences associated with neurotransmitter processing, SEQ ID NOs: 1-5, and a polypeptide sequence, SEQ
ID NO: 6, derived from the gene sequence of SEQ ID N0:4. Each sequence is identified by a numerical sequence identification number and by the Incyte clone number from which the sequence was first identified.
DESCRIPTION OF THE INVENTION
It must be noted that as used herein and in the appended claims, the singular forms "a,"
"an," and "the" include the plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in the art; and so forth.
"NSEQ" refers generally to a polynucleotide sequence of the present invention, including SEQ ID NOs: 1-S. "PSEQ" refers generally to a polypeptide sequence of the present invention, including SEQ 1D NO: 6.
A " variant" refers to either a polynucleotide or a polypeptide whose sequence diverges from SEQ ID NOs: 1-5 or SEQ ID NO: 6, respectively. Polynucleotide sequence divergence may result from mutational changes such as deletions, additions, and substitutions of one or more nucleotides; it may also occur because of differences in codon usage. Each of these types of changes may occur alone, or in combination, one or more times in a given sequence. Polypeptide variants include sequences that possess at least one structural or functional characteristic of SEQ
1D NO: 6.
A "fragment" can refer to a nucleic acid sequence that is preferably at least 20 nucleic acids in iength, more preferably 40 nucleic acids, and most preferably 60 nucleic acids in length, and encompasses, for example, fragments consisting of nucleic acids 1-SO of SEQ ID NOs: l-5. A
"fragment" can also refer to polypeptide sequences which are preferably at least 5 to about 15 amino acids in length, most preferably at least 10 amino acids long, and which retain some biological activity or immunological activity of, for example, SEQ ID N0:6.
"Neurotransmitter-processing-specific genes" refers to genes which are involved in the processing, such as the synthesis or release, of neurotransmitters. For purposes of this invention, the neurotransmitters are preferably dopamine, norepinephrine and epinephrine.
"Gene associated with neurotransmitter processing" refers to a new gene sequence whose expression pattern is similar to that of the neurotransmitter-processing-specific genes and which is useful in the~diagnosis, treatment, or prevention of diseases associated with the altered expression of neurotransmitter processing genes. The gene sequences can also be used in the evaluation of therapies for these diseases.
"Substantially purified" refers to a nucleic acid or an amino acid sequence that is removed from its natural environment and is isolated or separated, and is at least about 60% free, preferably about 75% free, and most preferably about 90% free from other components with which it is naturally present.
THE INVENTION
The present invention encompasses five genes useful in the diagnosis, treatment, prevention, and evaluation of therapies for diseases related to improper neurotransmitter processing, particularly neurological and growth-related diseases, more particularly Parkinson's disease, schizophrenia, and epilepsy. These genes were identified using the method disclosed in the US patent application entitled "Prostate Cancer-Associated Genes", Walker et al., Serial No:09/102,615 filed June 22, 1998, herein incorporated by reference, and described briefly below.
The method provides first identifying polynucleotides that are expressed in a plurality of cDNA libraries. The polynucleotides may include genes of known function, genes known to be specifically expressed in a specific disease process, subcellular compartment, cell type, tissue type, or species. Additionally, the polynucleotides include genes of unknown function. The expression patterns of the known genes are then compared with those of the genes of unknown function to determine whether a specified coexpression probability threshold is met.
Through this comparison, a subset of the polynucleotides having a high coexpression probability with the known genes can be identified. The high coexpression probability correlates with a particular coexpression probability threshold which is less than 0.001, and more preferably less than 0.00001.
The polynucleotides may originate from cDNA libraries derived from a variety of sources including, but not limited to, eukaryotes such as human, mouse, rat, dog, monkey, plant, and yeast and prokaryotes such as bacteria and viruses. These polynucleotides can also be selected from a variety of sequence types including, but not limited to, expressed sequence tags (ESTs), assembled polynucleotide sequences, full length gene coding regions, introns, regulatory sequences, 5' untranslated regions, and 3' untranslated regions. To have statistically significant analytical results, the polynucleotides need to be expressed in at least three cDNA
libraries. In one preferred embodiment, the polynucleotides are obtained from sequence databases, such as the L1FESEQ
database (Incyte Pharmaceuticals, Palo Alto CA).
The cDNA libraries used in the coexpression analysis of the present invention may be obtained from blood vessels, heart, blood cells, cultured cells, connective tissue, epithelium, islets of Langerhans, neurons, phagocytes, biliary tract, esophagus, gastrointestinal system, liver, pancreas, fetus, placenta, chromaffin system, endocrine glands, ovary, uterus, penis, prostate, seminal vesicles, testis, bone marrow, immune system, cartilage, muscles, skeleton, central nervous system, ganglia, neuroglia, neurosecretory system, peripheral nervous system, bronchus, larynx, lung, nose, pleurus, ear, eye, mouth, pharynx, exocrine glands, bladder, kidney, ureter, and the like. The number of cDNA libraries selected can range from as few as 20 to greater than 10,000. Preferably, the number of the cDNA libraries is greater than 500.
In a preferred embodiment, gene sequences are assembled to reflect related sequences, such as assembled sequence fragments derived from a single transcript.
Assembly of the polynucleotide sequences can be performed using sequences of various types including, but not limited to, ESTs, extensions, or shotgun sequences. In a most preferred embodiment, the polynucleotide sequences are derived from human sequences that have been assembled using the algorithm disclosed in the US provisional patent application entitled "Database and System for Storing, Comparing and Displaying Related Biomolecular Sequence Information", Lincoln et al., Serial No:60/079,469, filed March 26, 1998, herein incorporated by reference.
Experimentally, differential expression of the polynucleotides can be evaluated by methods including, but not limited to, differential display by spatial immobilization or by gel electrophoresis, genome mismatch scanning, representational difference analysis, and transcript imaging. Additionally, differential expression can be assessed by microarray technology. These methods may be used alone or in combination.
Neurotransmitter-processing-specific genes can be selected based on the current use of these genes as diagnostic markers or as therapeutic targets for diseases related to incorrect neurotransmitter processing, such as the neurotransmitter processing. Genes related to neurotransmitter processing include L-tyrosine hydroxylase {TH), aromatic amino acid decarboxylase (AADC), dopamine ~i-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin I and 1I, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI), ARIX
homeodomain protein, and the like.
The procedure for identifying new genes that exhibit a statistically significant coexpression pattern with known genes related to neurotransmitter processing is as follows. First, the presence or absence of a gene sequence in a cDNA library is defined: a gene is present in a cDNA library when at least one cDNA fragment corresponding to that gene is detected in a cDNA
sample taken from the library, and a gene is absent from a library when no corresponding cDNA
fragment is detected in the sample.
Second, the significance of gene coexpression is evaluated using a probability method to measure a due-to-chance probability of the coexpression. The probability method can be the Fisher exact test, the chi-squared test, or the kappa test. These tests and examples of their applications are well known in the art and can be found in standard statistics texts {Agresti, A.
(1990 Categorical Data Analysis. Wiley, New York NY; Rice, J. A. {1988) Mathematical Statistics and Data Analysis. Wadsworth & Brooks/Cole, Pacific Grove CA). A Bonferroni correction (Rice, supra, page 384) can also be applied in combination with one of the probability methods for correcting statistical results of one gene versus multiple other genes. In a preferred embodiment, the due-to-chance probability is measured by a Fisher exact test, and the threshold of the due-to-chance probability is set to less than 0.001, more preferably less than 0.00001.
To determine whether two genes, A and B, have similar coexpression patterns, occurrence data vectors can be generated as illustrated in Table 1, wherein a gene's presence is indicated by a one and its absence by a zero. A zero indicates that the gene did not occur in the library, and a one indicates that it occurred at least once.
Table I . Occurrence data for genes A and B
Library Library Library ... Library gene 1 1 0 ... 0 A
gene 1 0 1 0 B
...
For a given pair of genes, the occurrence data in Table 1 can be summarized in a 2x2 contingency table.
Table 2. Contingency table for co-occurrences of genes A and B
libraries. In one preferred embodiment, the polynucleotides are obtained from sequence databases, such as the L1FESEQ
database (Incyte Pharmaceuticals, Palo Alto CA).
The cDNA libraries used in the coexpression analysis of the present invention may be obtained from blood vessels, heart, blood cells, cultured cells, connective tissue, epithelium, islets of Langerhans, neurons, phagocytes, biliary tract, esophagus, gastrointestinal system, liver, pancreas, fetus, placenta, chromaffin system, endocrine glands, ovary, uterus, penis, prostate, seminal vesicles, testis, bone marrow, immune system, cartilage, muscles, skeleton, central nervous system, ganglia, neuroglia, neurosecretory system, peripheral nervous system, bronchus, larynx, lung, nose, pleurus, ear, eye, mouth, pharynx, exocrine glands, bladder, kidney, ureter, and the like. The number of cDNA libraries selected can range from as few as 20 to greater than 10,000. Preferably, the number of the cDNA libraries is greater than 500.
In a preferred embodiment, gene sequences are assembled to reflect related sequences, such as assembled sequence fragments derived from a single transcript.
Assembly of the polynucleotide sequences can be performed using sequences of various types including, but not limited to, ESTs, extensions, or shotgun sequences. In a most preferred embodiment, the polynucleotide sequences are derived from human sequences that have been assembled using the algorithm disclosed in the US provisional patent application entitled "Database and System for Storing, Comparing and Displaying Related Biomolecular Sequence Information", Lincoln et al., Serial No:60/079,469, filed March 26, 1998, herein incorporated by reference.
Experimentally, differential expression of the polynucleotides can be evaluated by methods including, but not limited to, differential display by spatial immobilization or by gel electrophoresis, genome mismatch scanning, representational difference analysis, and transcript imaging. Additionally, differential expression can be assessed by microarray technology. These methods may be used alone or in combination.
Neurotransmitter-processing-specific genes can be selected based on the current use of these genes as diagnostic markers or as therapeutic targets for diseases related to incorrect neurotransmitter processing, such as the neurotransmitter processing. Genes related to neurotransmitter processing include L-tyrosine hydroxylase {TH), aromatic amino acid decarboxylase (AADC), dopamine ~i-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin I and 1I, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI), ARIX
homeodomain protein, and the like.
The procedure for identifying new genes that exhibit a statistically significant coexpression pattern with known genes related to neurotransmitter processing is as follows. First, the presence or absence of a gene sequence in a cDNA library is defined: a gene is present in a cDNA library when at least one cDNA fragment corresponding to that gene is detected in a cDNA
sample taken from the library, and a gene is absent from a library when no corresponding cDNA
fragment is detected in the sample.
Second, the significance of gene coexpression is evaluated using a probability method to measure a due-to-chance probability of the coexpression. The probability method can be the Fisher exact test, the chi-squared test, or the kappa test. These tests and examples of their applications are well known in the art and can be found in standard statistics texts {Agresti, A.
(1990 Categorical Data Analysis. Wiley, New York NY; Rice, J. A. {1988) Mathematical Statistics and Data Analysis. Wadsworth & Brooks/Cole, Pacific Grove CA). A Bonferroni correction (Rice, supra, page 384) can also be applied in combination with one of the probability methods for correcting statistical results of one gene versus multiple other genes. In a preferred embodiment, the due-to-chance probability is measured by a Fisher exact test, and the threshold of the due-to-chance probability is set to less than 0.001, more preferably less than 0.00001.
To determine whether two genes, A and B, have similar coexpression patterns, occurrence data vectors can be generated as illustrated in Table 1, wherein a gene's presence is indicated by a one and its absence by a zero. A zero indicates that the gene did not occur in the library, and a one indicates that it occurred at least once.
Table I . Occurrence data for genes A and B
Library Library Library ... Library gene 1 1 0 ... 0 A
gene 1 0 1 0 B
...
For a given pair of genes, the occurrence data in Table 1 can be summarized in a 2x2 contingency table.
Table 2. Contingency table for co-occurrences of genes A and B
Gene A present Gene A absent Total Gene B present 8 2 10 Gene B absent 2 18 20 Total 10 20 Table 2 presents coexpression data for gene A and gene B in a total of 30 libraries. Gene A and gene B each occurs 10 times in the libraries. Table 2 summarizes and presents 1 ) the number of times gene A and B are both present in a library, 2) the number of times gene A and B
are both absent in a library, 3) the number of times gene A is present while gene B is absent, and l0 4) the number of times gene B is present while gene A is absent. The upper left entry is the number of times the two genes co-occur in a library, and the middle right entry is the number of times neither gene occurs in a library. The off diagonal entries are the number of times one gene occurs while the other does not. Gene A and B are both present eight times and absent 18 times, gene A is present while gene B is absent two times, and gene B is present while gene A is absent two times. The probability ("p-value") that the above association occurs due to chance as calculated using a Fisher Exact test is 0.0003. Associations are generally considered significant if a p-value is less than 0.01 (Agresti, supra; Rice, supra).
This method of estimating the probability for coexpression of two genes makes several assumptions. The method assumes that the libraries are independent and are identically sampled.
However, in practical situations, the selected cDNA libraries are not entirely independent because more than one library may be obtained from a single patient or tissue, and they are not entirely identically sampled because different numbers of cDNA's may be sequenced from each library (typically ranging from 5,000 to 10,000 cDNAs per library). In addition, because a Fisher exact coexpression probability is calculated for each gene versus 41,419 other genes, the probability is corrected for the multiple statistical tests by requiring a more stringent p-value than the standard p-value of 0.01.
Using the coexpression analysis method, we have identified five new genes that exhibit strong association, or coexpression, with known genes related to neurotransmitter processing. The results presented in Tables 5 to 10 show that the expression of the five new genes have direct or indirect association with the expression of known genes related to neurotransmitter processing.
Therefore, the new genes can potentially be used in diagnosis, treatment, or prevention of diseases related to neurotransmitter processing, such as Parkinson's disease, schizophrenia, epilepsy, female reproductive disorders, attention deficit disorder, or in the evaluation of therapies for these diseases. Further, the gene products of the five new genes are potential therapeutic proteins and targets of therapeutics against these diseases.
Therefore, in one embodiment, the present invention encompasses a polynucleotide sequence comprising the sequence of SEQ ID NOs: 1-5. These five polynucleotides are shown by the method of the present invention to have strong coexpression association with known genes related to neurotransmitter processing. The invention also encompasses a variant of the polynucleotide sequence, its complement, or 18 consecutive nucleotides of the sequences provided in the above described sequences. Variant polynucteotide sequences typically have at least about 70%, more preferably at least about 85%, and most preferably at least about 9S% polynucleotide sequence identity to NSEQ.
One preferred method for identifying variants provides using NSEQ and/or PSEQ
sequences to search against the GenBank primate (pri), rodent (rod), and mammalian (mam), vertebrate (vrtp), and eukaryote (eukp) databases, SwissProt, BLOCKS (Bairoch, A. et al. (1997) Nucleic Acids Res. 25:217-221 ), PFAM, and other databases that contain previously identified and annotated motifs, sequences, and gene functions. Methods that search for primary sequence patterns with secondary structure gap penalties (Smith, T. et al. (1992) protein Engineering 5:35-51) as well as algorithms such as BLAST (Basic Local Alignment Search Tool;
Altschul, S.F.
(1993) J. Mol. Evol 36:290-300; and Altschul et al. (1990) J. Mol. Biol.
215:403-410), BLOCKS
(Henikoff S. and Henikoff G.J. (1991) Nucleic Acids Research 19:6565-6572), Hidden Markov Models (HMM; Eddy, S.R. (1996) Cur. Opin. Str. Biol. 6:361-365; and Sonnhammer, E.L.L. et al.
(1997) Proteins 28:405-420), and the like, can be used to manipulate and analyze nucleotide and amino acid sequences. These databases, algorithms and other methods are well known in the art and are described in Ausubel, F.M. et al. ( 1997; Short Protocols in Molecular Biology°y, John Wiley & Sons, New York NY ) and in Meyers, R.A. (1995; Molecular Bioloey and Biotechnolo~v, Wiley VCH, Inc, New York NY, p 856-853).
Also encompassed by the invention are polynucleotide sequences that are capable of hybridizing to SEQ ID NOs: 1-5, and fragments thereof under stringent conditions. Stringent conditions can be defined by salt concentration, temperature, and other chemicals and conditions well known in the art.
For example, stringent salt concentration will ordinarily be less than about 750 mM NaCI
and 75 mM trisodium citrate, preferably less than about 500 mM NaCI and 50 mM
trisodium citrate, and most preferably less than about 250 mM NaCI and 25 mM trisodium citrate. Stringent temperature conditions will ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and most preferably of at least about 42°C. Varying additional _g_ parameters, such as hybridization time, the concentration of detergent (sodium dodecyl sulfate, SDS) or solvent (formamide), and the inclusion or exclusion of carrier DNA, are well known to those skilled in the art. Additional variations on these conditions will be readily apparent to those skilled in the art (Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511; Ausubel, supra; and Sambrook, J. et al. {1989) Molecular Cloning A Laboratory Manual, Cold Spring Harbor Press, Plainview, NY).
NSEQ or the poiynucleotide sequences encoding PSEQ can be extended utilizing a partial nucleotide sequence and employing various PCR-based methods known in the art to obtain full length sequences or to detect upstream sequences, such as promoters and regulatory elements.
(See, e.g., Dieffenbach, C.W. and G.S. Dveksler (1995; PCR Primer a Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, pp. i-S; Sarkar, G. ( 1993; PCR Methods Applic. 2:318-322);
Trigiia, T. et al. {1988; Nucleic Acids Res. 16:8186); Lagerstrom, M. et al.
(1991; PCR Methods Applic. 1:111-119); and Parker, J.D. et al. (1991; Nucleic Acids Res. 19:3055-306). Additionally, one may use PCR and nested primers to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed using commercially available software, such as OLIGO
4.06 software (National Biosciences, Plymouth MN) or another appropriate program, to be about 18 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C.
In another aspect of the invention, NSEQ or the polynucleotide sequences encoding PSEQ
can be cloned in recombinant DNA molecules that direct expression of PSEQ or the polypeptides encoded by NSEQ, or structural or functional fragments thereof, in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be produced and used to express the polypeptides of PSEQ or the polypeptides encoded by NSEQ. The nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter the nucletide sequences for a variety of purposes including, but not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.
In order to express a biologically active polypeptide encoded by NSEQ, NSEQ or the polynucleotide sequences encoding PSEQ, or derivatives thereof, may be inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements for transcriptional and translational control of the inserted coding sequence in a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in NSEQ or polynucleotide sequences encoding PSEQ. Methods which are well known to those skilled in the art may be used to construct expression vectors containing NSEQ or polynucleotide sequences encoding PSEQ and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, supra, and Ausubel, su ra).
A variety of expression vector/host cell systems may be utilized to contain and express NSEQ or polynucleotide sequences encoding PSEQ. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors;
insect cell systems infected with viral expression vectors (baculovirus); plant cell systems transformed with viral expression vectors, cauliflower mosaic virus (CaMV) or tobacco mosaic virus (TMV), or with bacterial expression vectors (Ti or pBR322 plasmids); or animal cell systems.
The invention is not limited by the host cell employed. For long term production of recombinant proteins in mammalian systems, stable expression of a polypeptide encoded by NSEQ in cell lines is preferred. For example, NSEQ or sequences encoding PSEQ can be transformed into cell lines using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector.
In general, host cells that contain NSEQ and that express PSEQ may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.
Immunological methods for detecting and measuring the expression of PSEQ using either specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS).
Host cells transformed with NSEQ or polynucleotide sequences encoding PSEQ may be cultured under conditions suitable for the expression and recovery of the protein from cell culture.
The protein produced by a transformed cell may be secreted or retained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, - I O-expression vectors containing polynucleotides of NSEQ or polynucleotides encoding PSEQ may be designed to contain signal sequences which direct secretion of PSEQ or polypeptides encoded by NSEQ through a prokaryotic or eukaryotic cell membrane.
In addition, a host cell strain may be chosen for its ability to modulate expression of the S inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro"
form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the foreign protein.
In another embodiment of the invention, natural, modified, or recombinant NSEQ
or nucleic acid sequences encoding PSEQ are ligated to a heterologous sequence resulting in translation of a fusion protein containing heterologous protein moieties in any of the aforementioned host systems. Such heterologous protein moieties facilitate purification of fusion proteins using commercially available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, hemagglutinin (HA) and monoclonal antibody epitopes..
In another embodiment, NSEQ or sequences encoding PSEQ are synthesized, in whole or in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. ( 1980) Nucleic Acids Symp. Ser. (7):215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser.
are both absent in a library, 3) the number of times gene A is present while gene B is absent, and l0 4) the number of times gene B is present while gene A is absent. The upper left entry is the number of times the two genes co-occur in a library, and the middle right entry is the number of times neither gene occurs in a library. The off diagonal entries are the number of times one gene occurs while the other does not. Gene A and B are both present eight times and absent 18 times, gene A is present while gene B is absent two times, and gene B is present while gene A is absent two times. The probability ("p-value") that the above association occurs due to chance as calculated using a Fisher Exact test is 0.0003. Associations are generally considered significant if a p-value is less than 0.01 (Agresti, supra; Rice, supra).
This method of estimating the probability for coexpression of two genes makes several assumptions. The method assumes that the libraries are independent and are identically sampled.
However, in practical situations, the selected cDNA libraries are not entirely independent because more than one library may be obtained from a single patient or tissue, and they are not entirely identically sampled because different numbers of cDNA's may be sequenced from each library (typically ranging from 5,000 to 10,000 cDNAs per library). In addition, because a Fisher exact coexpression probability is calculated for each gene versus 41,419 other genes, the probability is corrected for the multiple statistical tests by requiring a more stringent p-value than the standard p-value of 0.01.
Using the coexpression analysis method, we have identified five new genes that exhibit strong association, or coexpression, with known genes related to neurotransmitter processing. The results presented in Tables 5 to 10 show that the expression of the five new genes have direct or indirect association with the expression of known genes related to neurotransmitter processing.
Therefore, the new genes can potentially be used in diagnosis, treatment, or prevention of diseases related to neurotransmitter processing, such as Parkinson's disease, schizophrenia, epilepsy, female reproductive disorders, attention deficit disorder, or in the evaluation of therapies for these diseases. Further, the gene products of the five new genes are potential therapeutic proteins and targets of therapeutics against these diseases.
Therefore, in one embodiment, the present invention encompasses a polynucleotide sequence comprising the sequence of SEQ ID NOs: 1-5. These five polynucleotides are shown by the method of the present invention to have strong coexpression association with known genes related to neurotransmitter processing. The invention also encompasses a variant of the polynucleotide sequence, its complement, or 18 consecutive nucleotides of the sequences provided in the above described sequences. Variant polynucteotide sequences typically have at least about 70%, more preferably at least about 85%, and most preferably at least about 9S% polynucleotide sequence identity to NSEQ.
One preferred method for identifying variants provides using NSEQ and/or PSEQ
sequences to search against the GenBank primate (pri), rodent (rod), and mammalian (mam), vertebrate (vrtp), and eukaryote (eukp) databases, SwissProt, BLOCKS (Bairoch, A. et al. (1997) Nucleic Acids Res. 25:217-221 ), PFAM, and other databases that contain previously identified and annotated motifs, sequences, and gene functions. Methods that search for primary sequence patterns with secondary structure gap penalties (Smith, T. et al. (1992) protein Engineering 5:35-51) as well as algorithms such as BLAST (Basic Local Alignment Search Tool;
Altschul, S.F.
(1993) J. Mol. Evol 36:290-300; and Altschul et al. (1990) J. Mol. Biol.
215:403-410), BLOCKS
(Henikoff S. and Henikoff G.J. (1991) Nucleic Acids Research 19:6565-6572), Hidden Markov Models (HMM; Eddy, S.R. (1996) Cur. Opin. Str. Biol. 6:361-365; and Sonnhammer, E.L.L. et al.
(1997) Proteins 28:405-420), and the like, can be used to manipulate and analyze nucleotide and amino acid sequences. These databases, algorithms and other methods are well known in the art and are described in Ausubel, F.M. et al. ( 1997; Short Protocols in Molecular Biology°y, John Wiley & Sons, New York NY ) and in Meyers, R.A. (1995; Molecular Bioloey and Biotechnolo~v, Wiley VCH, Inc, New York NY, p 856-853).
Also encompassed by the invention are polynucleotide sequences that are capable of hybridizing to SEQ ID NOs: 1-5, and fragments thereof under stringent conditions. Stringent conditions can be defined by salt concentration, temperature, and other chemicals and conditions well known in the art.
For example, stringent salt concentration will ordinarily be less than about 750 mM NaCI
and 75 mM trisodium citrate, preferably less than about 500 mM NaCI and 50 mM
trisodium citrate, and most preferably less than about 250 mM NaCI and 25 mM trisodium citrate. Stringent temperature conditions will ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and most preferably of at least about 42°C. Varying additional _g_ parameters, such as hybridization time, the concentration of detergent (sodium dodecyl sulfate, SDS) or solvent (formamide), and the inclusion or exclusion of carrier DNA, are well known to those skilled in the art. Additional variations on these conditions will be readily apparent to those skilled in the art (Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511; Ausubel, supra; and Sambrook, J. et al. {1989) Molecular Cloning A Laboratory Manual, Cold Spring Harbor Press, Plainview, NY).
NSEQ or the poiynucleotide sequences encoding PSEQ can be extended utilizing a partial nucleotide sequence and employing various PCR-based methods known in the art to obtain full length sequences or to detect upstream sequences, such as promoters and regulatory elements.
(See, e.g., Dieffenbach, C.W. and G.S. Dveksler (1995; PCR Primer a Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, pp. i-S; Sarkar, G. ( 1993; PCR Methods Applic. 2:318-322);
Trigiia, T. et al. {1988; Nucleic Acids Res. 16:8186); Lagerstrom, M. et al.
(1991; PCR Methods Applic. 1:111-119); and Parker, J.D. et al. (1991; Nucleic Acids Res. 19:3055-306). Additionally, one may use PCR and nested primers to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed using commercially available software, such as OLIGO
4.06 software (National Biosciences, Plymouth MN) or another appropriate program, to be about 18 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C.
In another aspect of the invention, NSEQ or the polynucleotide sequences encoding PSEQ
can be cloned in recombinant DNA molecules that direct expression of PSEQ or the polypeptides encoded by NSEQ, or structural or functional fragments thereof, in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be produced and used to express the polypeptides of PSEQ or the polypeptides encoded by NSEQ. The nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter the nucletide sequences for a variety of purposes including, but not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.
In order to express a biologically active polypeptide encoded by NSEQ, NSEQ or the polynucleotide sequences encoding PSEQ, or derivatives thereof, may be inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements for transcriptional and translational control of the inserted coding sequence in a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in NSEQ or polynucleotide sequences encoding PSEQ. Methods which are well known to those skilled in the art may be used to construct expression vectors containing NSEQ or polynucleotide sequences encoding PSEQ and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, supra, and Ausubel, su ra).
A variety of expression vector/host cell systems may be utilized to contain and express NSEQ or polynucleotide sequences encoding PSEQ. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors;
insect cell systems infected with viral expression vectors (baculovirus); plant cell systems transformed with viral expression vectors, cauliflower mosaic virus (CaMV) or tobacco mosaic virus (TMV), or with bacterial expression vectors (Ti or pBR322 plasmids); or animal cell systems.
The invention is not limited by the host cell employed. For long term production of recombinant proteins in mammalian systems, stable expression of a polypeptide encoded by NSEQ in cell lines is preferred. For example, NSEQ or sequences encoding PSEQ can be transformed into cell lines using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector.
In general, host cells that contain NSEQ and that express PSEQ may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.
Immunological methods for detecting and measuring the expression of PSEQ using either specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS).
Host cells transformed with NSEQ or polynucleotide sequences encoding PSEQ may be cultured under conditions suitable for the expression and recovery of the protein from cell culture.
The protein produced by a transformed cell may be secreted or retained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, - I O-expression vectors containing polynucleotides of NSEQ or polynucleotides encoding PSEQ may be designed to contain signal sequences which direct secretion of PSEQ or polypeptides encoded by NSEQ through a prokaryotic or eukaryotic cell membrane.
In addition, a host cell strain may be chosen for its ability to modulate expression of the S inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro"
form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the foreign protein.
In another embodiment of the invention, natural, modified, or recombinant NSEQ
or nucleic acid sequences encoding PSEQ are ligated to a heterologous sequence resulting in translation of a fusion protein containing heterologous protein moieties in any of the aforementioned host systems. Such heterologous protein moieties facilitate purification of fusion proteins using commercially available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, hemagglutinin (HA) and monoclonal antibody epitopes..
In another embodiment, NSEQ or sequences encoding PSEQ are synthesized, in whole or in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. ( 1980) Nucleic Acids Symp. Ser. (7):215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser.
(7):225-232; and Ausubel, supra). Alternatively, PSEQ or a polypeptide sequence encoded by 2S NSEQ itself, or a fragment thereof, may be synthesized using chemical methods. For example, peptide synthesis can be performed using various solid-phase techniques (Roberge, J.Y. et al.
(1995) Science 269:202-204). Automated synthesis may be achieved using the AB1431A Peptide synthesizer (PE Biosystems, Foster City CA). Additionally, PSEQ or the amino acid sequence encoded by NSEQ, or any part thereof, may be altered during direct synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a polypeptide variant.
In another embodiment, the invention provides a substantially purified polypeptide comprising the amino acid sequence selected from the group consisting of SEQ
ID NO: 6, or fragments thereof.
DIAGNOSTICS and THERAPEUTICS
The genes of the invention can be used in diagnosis, prevention, treatment, and evaluation of therapies for diseases related to neurotransmitter processing, particularly neurological and hormone-related diseases, more particularly Parkinson's disease, schizophrenia, epilepsy, female reproductive disorders, and attention deficit disorder. Further, the amino acid sequences encoded by the new genes are potential therapeutic proteins and targets of therapeutics against these diseases.
In one preferred embodiment, the NSEQ or the polynucleotides encoding PSEQ are used for diagnostic purposes to determine the absence, presence, and excess expression of PSEQ, and to monitor regulation of the levels of mRNA or the polypeptides encoded by NSEQ during therapeutic intervention. The polynucleotides may be at least 18 nucleotides long, complementary RNA and DNA molecules, branched nucleic acids, and peptide nucleic acids (PNAs).
Alternatively, the polynucleotides are used to detect and quantitate gene expression in samples in which expression of PSEQ or the polypeptides encoded by NSEQ are correlated with disease.
Additionally, NSEQ or the polynucleotides encoding PSEQ can be used to detect genetic polymorphisms associated with a disease. These polymorphisms may be detected at the transcript cDNA or genomic level.
The specificity of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or amplification (maximal, high, intermediate, or low), will determine whether the probe identifies only naturally occurring sequences encoding PSEQ, allelic variants, or related sequences.
Probes may also be used for the detection of related sequences, and should preferably have at least 50% sequence identity to any of the NSEQ or PSEQ-encoding sequences.
Means for producing specific hybridization probes for DNAs encoding PSEQ
include the cloning of NSEQ or polynucleotide sequences encoding PSEQ into vectors for the production of mRNA probes. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, for example, by radionuclides such as'ZP or'sS, or by enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, by fluorescent labels and the like. The polynucleotide sequences encoding PSEQ may be used in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies;and in microarrays utilizing fluids or tissues from patients to detect altered PSEQ expression. Such qualitative or quantitative methods are well known in the art.
NSEQ or the nucleotide sequences encoding PSEQ can be labeled by standard methods and added to a fluid or tissue sample from a patient under conditions suitable for the formation of hybridization complexes. After a suitable incubation period, the sample is washed and the signal is quantitated and compared with a standard value. If the amount of signal in the patient sample is significantly altered in comparison to the standard value then the presence of altered levels of nucleotide sequences of NSEQ and those encoding PSEQ in the sample indicates the presence of the associated disease. Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual patient.
Once the presence of a disease is established and a treatment protocol is initiated, hybridization or amplification assays can be repeated on a regular basis to determine if the level of expression in the patient begins to approximate that which is observed in the normal subject. The results obtained from successive assays may be used to show the efficacy of treatment over a period ranging from several days to months.
The polynucleotides may be used for the diagnosis of a variety of diseases associated with neurotransmitter processing such as neurological disorders including, but not limited to, akathesia, Alzheimer's disease, amnesia, amylotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, dementia, depression, diabetic neuropathy, Down's syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's disease, peripheral neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid psychoses, postherapeutic neuralgia, schizophrenia, and Tourette's disorder; and reproductive disorders including, but not limited to, disorders of prolactin production, tubal disease, ovulatory defects, endometriosis, disruptions of the estrous cycle, disruptions of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis, cancer of the breast, testis, and prostate, fibrocystic breast disease, galactorrhea disruptions of spermatogenesis, abnormal sperm physiology, prostatitis, Peyronie's disease, and gynecomastia.
Alternatively, the polynucleotides may be used as targets in a microarray. The microarray can be used to monitor the expression level of large numbers of genes simultaneously and to identify splice variants, mutations, and polymorphisms. This information may be used to determine gene function, to understand the genetic basis of a disease, to diagnose a disease, and to develop and monitor the activities of therapeutic agents.
1n yet another alternative, polynucleotides may be used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. ( 1995) in Meyers, supra, pp. 965-968).
In another embodiment, antibodies which specifically bind PSEQ may be used for the diagnosis of diseases characterized by the over- or under-expression of PSEQ
or polypeptides encoded by NSEQ. Alternatively, one may use competitive drug screening assays in which neutralizing antibodies capable of binding PSEQ or the polypeptides encoded by NSEQ
specifically compete with a test compound for binding the polypeptides. In this manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with PSEQ or the polypeptides encoded by NSEQ. Diagnostic assays for PSEQ or the polypeptides encoded by NSEQ include methods which utilize the antibody and a label to detect PSEQ or the polypeptided encoded by NSEQ in human body fluids or in extracts of cells or tissues. A variety of protocols for measuring PSEQ or the polypeptides encoded by NSEQ, including ELISAs, RIAs, and FRCS, are well known in the art and provide a basis for diagnosing altered or abnormal levels of the expression of PSEQ or the polypeptides encoded by NSEQ.
Normal or standard values for PSEQ expression are established by combining body fluids or cell extracts taken from normal subjects, preferably human, with antibody to PSEQ
or a polypeptide encoded by NSEQ under conditions suitable for complex formation The amount of standard complex formation may be quantitated by various methods, preferably by photometric means.
Quantities of PSEQ or the polypeptides encoded by NSEQ expressed in subject, control, and disease samples from biopsied tissues are compared with the standard values.
Deviation between standard and subject values establishes the parameters for diagnosing or monitoring disease.
In another aspect, the polynucleotides and polypeptides of the present invention can be employed for treatment or the monitoring of therapeutic treatments for the diseases specified above. The polynucleotides of NSEQ or those encoding PSEQ, or any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the complement of the polynucleotides ofNSEQ or those encoding PSEQ may be used in situations in which it would be desirable to block the transcription or translation of the mRNA.
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art can be used to construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding PSEQ. (See, e.g., Sambrook, supra; and Ausubel, supra.) Genes having polynucleotide sequences of NSEQ or those encoding PSEQ can be turned off by transforming a cell or tissue with expression vectors which express high levels of a polynucleotide, or fragment thereof, encoding PSEQ. Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 and +10 from the start site, are preferred. Similarly, inhibition can be achieved using triple helix base-pairing methodology.
Triple helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerises, transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been described in the literature.
(See, e.g., Gee, J.E.
et al. ( 1994) in Huber, B.E. and B.1. Carr, Molecular and Immunologic Aonroaches, Futura Publishing Co., Mt. Kisco NY, pp. 163-177.) Ribozymes, enzymatic RNA
molecules, may also be used to catalyze the specific cleavage of RNA.
RNA molecules may be modified to increase intracellular stability and half life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.
Many methods for introducing vectors into cells or tissues are available and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken from the patient and clonally propagated for autologous transplant back into that same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al.
( 1997) Nature Biotechnology I 5:462-466.) Further, an antagonist or antibody of a polypeptide of PSEQ or encoded by NSEQ
may be administered to a subject to treat or prevent a disease related to synthesis and release of dopamine and norepinephrine with increased expression or activity of PSEQ. An antibody which specifically binds the polypeptide may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express the polypeptide.
Antibodies to PSEQ or a polypeptide encoded by NSEQ may also be generated using methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonat, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are especially preferred for therapeutic use. Monoclonal antibodies to PSEQ may be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. In addition, techniques developed for the production of chimeric antibodies can be used. (See, e.g., Meyers, supra.) Alternatively, techniques described for the production of single chain antibodies may be employed. Antibody fragments which contain specific binding sites for PSEQ or the polypeptide sequences encoded by NSEQ may also be generated.
Various immunoassays may be used for screening to identify antibodies having the desired specificity. Numerous protocols for competitive binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art.
Yet further, an agonist of a polypeptide of PSEQ or that encoded by NSEQ may be administered to a subject to treat or prevent a cancer associated with decreased expression or activity of the polypeptide.
An additional aspect of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any ofthe therapeutic effects discussed above. Such pharmaceutical compositions may consist of polypeptides of PSEQ or those encoded by NSEQ, antibodies to the polypeptides, and mimetics, agonists, antagonists, or inhibitors ofthe polypeptides. The compositions may be administered alone or in combination with at least one other agent, such as a stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a patient alone, or in combination with other agents, drugs, or hormones.
The pharmaceutical compositions utilized in this invention may be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.
In addition to the active ingredients, these pharmaceutical compositions may contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. Further details on techniques for formulation and administration may be found in the latest edition of ReminQton's Pharmaceutical Sciences (Maack Publishing Co. Easton PA).
WO ~U/12685 For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
A therapeutically effective dose refers to that amount of active ingredient, for example, polypeptides of PSEQ or those encoded by NSEQ, or fragments thereof, antibodies of the polypeptides, and agonists, antagonists or inhibitors of the polypeptides, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating the EDS° (the dose therapeutically effective in 50% of the population) or LDs° (the dose lethal to SO%
of the population) statistics.
Any of the therapeutic methods described above may be applied to any subject in need of such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, I S monkeys, and most preferably, humans.
EXAMPLES
1t is understood that this invention is not limited to the particular methodology, protocols, and reagents described, as these may vary. It is also understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.
The examples below are provide to illustrate the subject invention and are not included for the purpose of limiting the invention.
I. PGANNOTOI cDNA Library Construction For purposes of example, the preparation of the PGANNOTOI library is described. The PGANNOTO1 cDNA library was constructed using polyA RNA isolated from paraganglionic tumor tissue removed from the intra-abdominal region of a 46-year-old Caucasian male during exploratory laparotomy. Pathology indicated a benign paraganglioma and was associated with a grade 2 renal cell carcinoma, clear cell type, which did not penetrate the capsule. Family history included cerebrovascular disease, atherosclerotic coronary artery disease, a myocardial infarction, and type II diabetes.
The frozen tissue was homogenized and lysed in guanidinium isothiocyanate solution using a Poiytron homogenizes (PT-3000; Brinkmann Instruments, Westbury NJ).
The lysate was centrifuged over a 5.7 M CsCI cushion using an SW28 rotor in a BL8-70M
ultracentrifuge (Beckman Coulter, Fullerton CA) for 18 hours at 25,000 rpm at ambient temperature. The RNA
was extracted twice with acid phenol, pH 4.0, following an RNA isolation protocol (Stratagene, La Jolla CA) , precipitated using 0.3 M sodium acetate and 2.5 volumes of ethanol, resuspended in DEPC-treated water and treated with DNase for I 5 min at 37°C. The reaction was stopped with an equal volume of acid phenol and the RNA was isolated using the OLIGOTEX kit (Qiagen , Chatsworth CA) and used to construct the cDNA library.
The RNA was handled according to the recommended protocols in the SUPERSCRIPT
Plasmid system (Life Technologies, Gaithersburg MD), and cDNAs were ligated into pSport I
plasmid (Life Technologies). The plasmid was subsequently transformed into DHSa competent cells (Life Technologies).
II Isolation and Sequencing of cDNA Clones Plasmid DNA was released from the cells and purified using the M1NIPREP kit (Edge Biosystems, Gaithersburg MD). This kit consists of a 96 well block with reagents for 960 purifications. The recommended protocol was employed except for the following changes: 1) the 96 wells were each filled with only 1 m) of sterile Terrific Broth (Life Technologies) with carbenicillin at 25 mg/L and glycerol at 0.4%; 2) the bacteria were cultured for 24 hours after the wells were inoculated and then lysed with 60 ~l of lysis buffer; 3) a centrifugation step employing the GS-6R (Beckman Coulter) at 2900 rpm for 5 min was performed before the contents of the block were added to the primary filter plate; and 4) the optional step of adding isopropanol to TRIS buffer was not routinely performed. After the last step in the protocol, samples were transferred to a 96-well block for storage.
Alternative methods of purifying plasmid DNA include the use of MAGIC
MINIPREPS
system (Promega, Madison WI) or QIAWELL-$ Plasmid, QIAWELL PLUS DNA and QIAWELL ULTRA DNA purification systems (Qiagen).
The cDNAs were prepared using a MICROLAB 2200 system (Hamilton, Reno NV) in combination with four DNA ENGINE thermal cyclers (PTC200; MJ Research, Watertown MA) and sequenced by the method of Sanger F and AR Coulson ( 1975; J Mol Biol 94:441 f) using ABI
PRISM 377 or 373 DNA sequencing systems (PE Biosystems).
III. Selection, Assembly, and Characterization of Seguences The sequences used for coexpression analysis were assembled from EST
sequences, 5' and 3' longread sequences, and full length coding sequences. Selected sequences were expressed in at least three cDNA libraries.
The assembly process is described as follows. EST sequence chromatograms were processed and verified. Quality scores were obtained using PHRED (Swing, B. et al. (1998) Genome Res. 8:175-185; Swing, B. and P. Green (1998) Genome Res. 8:186-194).
The edited WO 00/126$5 PCT/US99/I9615 sequences were loaded into a relational database management system (RDBMS).
The EST
sequences were clustered into an initial set of bins using BLAST with a product score of S0. All clusters of two or more sequences were created as bins. The overlapping sequences represented in a bin correspond to the sequence of a transcribed gene.
Assembly of the component sequences within each bin was performed using a modification of Phrap, a publicly available program for assembling DNA
fragments (Green, P.
University of Washington, Seattle WA). Bins that showed 82% identity from a local pair-wise alignment between any of the consensus sequences were merged.
Bins were annotated by screening the consensus sequence in each bin against public databases, such as GBPRI and GENPEPT from NCBI. The annotation process involved a FASTn screen against the GBPRI database in GENBANK. Those hits with a percent identity of greater than or equal to 70% and an alignment length of greater than or equal to 100 base pairs were recorded as homolog hits. The residual unannotated sequences were screened by FASTx against GENPEPT. Those hits with an E value of less than or equal to 10-e are recorded as homolog hits.
Sequences were then reclustered using BLASTn and Cross-Match, a program for rapid protein and nucleic acid sequence comparison and database search (Green, P.
University of Washington, Seattle WA), sequentially. Any BLAST alignment between a sequence and a consensus sequence with a score greater than I50 was realigned using cross-match. The sequence was added to the bin whose consensus sequence gave the highest Smith-Waterman score amongst local alignments with at least 82% identity. Non-matching sequences created new bins. The assembly and consensus generation processes were performed for the new bins.
IV. Known Genes Used for Identifying New Genes Associated with Neurotransmitter Processing Any of ten neurotransmitter-processing-specific genes were used to identify new Incyte genes, using the coexpression analysis method described above. The genes included L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine ~i hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI), ARIX homeodomain protein, and the like. Other known genes that have potential association with dopamine and norepinephrine pathways include endothelin converting enzymes (ECEs), rapt-interacting protein 8 (RPIPB), neuron-specific growth-associated protein (SCG10), and "Delta-like" putative homeotic protein (dlk).
L-tyrosine hydroxylase (TH} catalyzes the conversion of L-tyrosine to L-dopa, which is the first and rate-limiting step in the synthesis of the catecholamines dopamine, norepinephrine and Ptr'T/US99/19615 epinephrine (Nagatsu, T. (1995) Essays Biochem. 30:15-35). TH activity is regulated in several ways including transcriptionally, by alternative mRNA splicing, and by regulation of mRNA
stability. Aromatic amino acid decarboxylase (AADC) catalyzes the decarboxylation of L-tyorsine, L-tryptophan, L-phenylalanine and L-dopa. Decarboxylation of L-dopa yields dopamine (Webster and Jordan, supra), and increased levels of AADC enhanced dopamine synthesis in vivo (Kaddis et al., su ra). Dopamine ~i hydroxylase (DBH) converts dopamine to norepinephrine (Wu, H.J. et al.
(1990) J. Neurochem. 55:97-105). Nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3) is a subunit in the ligand-gated sodium ion channel triggered by nicotine and acetylcholine.
Prolonged stimulation of nAchR with nicotine or acetylcholine induces TH and DBH (Patrick, R.L.
and J.D. Barchas (i976) J. Pharmacol. Exp: Ther. 197:181-196). Secretogranins are vesicle associated proteins. Secretogranin I is found in norepinephrine-containing vesicles (Bressler, J.P.
et al. ( 1996) J. Neurosci. Res. 46:678-685). Secretogranin I1 is a precursor for secretoneurin, which stimulates dopamine release (Agneter, E. et al. (1995) J. Neurochem. 65:622-625; Fischer-Colbrie, R. et al. ( 1995) Prog. Neurobiol. 46:49-70). Secretogranin II occurs in dopamine- and norepinephrine-containing vesicles in neurons (Goodall, A.R. et al. ( 1997) J.
Neurochem. 68:1542-1552). Rab3a is a member of the Ras GTPase family. Rab3a is associated with large dense core vesicles and partially co-localizes with DBH (Johannes, L. et al. (1994) Embo.
J. 13:2029-2037;
Darchen, F. et al. (1995) J. Cell Sci. 108: 1639-1649), and is one of the factors controlling Ca2+-dependent exocytosis (Johannes et al., supra). Human cocaine and amphetamine regulated transcript (hCART) is a signal peptide protein and is specifically induced in neurons exposed to cocaine and amphetamine (Douglass, J. and S. Daoud (1996) Gene 169:241-245).
Cocaine blocks re-uptake ofdopamine from the synapse (Reith, M.E, et al. (1997) Eur. J.
Pharmacol. 324:1-10 as does amphetamine, which also induces dopamine release (Jones, S.R. et al.
(1998) J. Neurosci.
I 8:1979-1986). Vesicular monoamine transporter 1 (hVMATI ) packages the neurotransmitter monoamines into large dense core vesicles for exocytosis at the synapse (Liu, Y. et al. (1994) J.
Cell Biol. 127:1419-1433). ARIX is a human homeodomain protein which is specifically expressed in noradrenergic, DBH-positive tissues and in cell lines derived from those tissues (Zeilmer, E. et al. ( 1995) J. Neurosci. 15:8109-8120). ARIX regulates the transcriptional activities of the DBH promoter and the promoter of the tyrosine hydroxylase gene, two of the promoters which control the specific expression of catecholamine biosynthetic genes.
The endothelins, ET-1, ET-2, and ET-3, are a class of secreted peptides that are processed from inactive precursor peptides. The third and final processing step leading to the active ET is catalyzed by a member of the endothelin converting enzymes (ECEs). The ET's were known vasoactive agents, potential neurotransmitters, and potential growth factors (Barnes, K. and A.J.
Turner (1997) Neurochem. Res. 22:1033-1040). ET-I was shown to enhance norepinephrine pressor response (Kita, S. et al. (1998) J. Cardiovasc. Pharmacol. 31:5119-S121), and ET-1 and ET-3 to induce dopamine release (Horie, K. et al. (1995) Life Sci. 57:735-741;
Shibaguchi, H, et al. (1997) Cell Mol. Neurobiol. 17:471-481). ET-2, however, has not been shown in published literature to be associated with norepinephrine or dopamine processing. Rap2-interacting protein 8 (RPIPB) was discovered using a yeast two-hybrid screen of mouse brain cDNAs using the small GTP-binding protein Rap2 as bait (Janoueix-Lerosey, I. et al. ( 1998) Eur. J.
Biochem. 252(2):290-298). Rap2 is a member of the Ras family, another member of which (Rab3a) was known to be involved in exocytosis of neurotransmitters. SCG 10 is a neuron-specific growth-associated protein which belongs to the stathmin family. SCG10 functions in increasing the breakdown of microtubules in the neural growth cone (Riederer, B. M. et al. (1997) Proc.
Natl. Acad. Sci.
91:741-745). dlk is a "Delta-like" putative homeotic protein thought to be involved in neuroendocrine differentiation (Laborda, J. et al. ( 1993) J. Biol. Chem.
268:3817-3820). It belongs to the epidermal growth factor receptor family (Lee, Y.L. et al.
(1995) Biochim. Biophys.
Acta 1261:223-232) and is induced by growth hormone (Carlsson, C. et al.
(1997) Endocrinology 138:3940-3948) and regulated through ErbB3 (Edman, C.F. et al. (1997) Biochem.
J. 323:113-118).
Table 4 shows the five genes that have the strongest association with TH, a gene involved in the synthesis and release of dopamine, and norepinephrine. These coexpressed genes are presented with their p-values. The column headings have the following meanings:
P-value The probability that the observed number of co-occurrences is due to chance using the Fisher Exact method.
Associated gene A gene that shows significant co-expression with the target.
Occurs The number of libraries in which the associated gene occurs.
Both occur The number of libraries in which both the target gene and the co-expressed gene occur.
Target only The number of libraries in which only the target gene occurs.
Associated only The number of libraries in which only the associated gene occurs.
Neither The number of libraries in which neither the target gene nor the associated gene occur.
Table 4 Co-expression results for TH
P-value Associated gene Occurs Both Target Associated Neither occur only only 1.47E-13 DBH 12 9 6 3 503 6.IlE-I1 nAchR-a3 13 8 7 5 501 6.22E-10 Secretogranin I 45 I 1 4 34 472 3.51 E-09 Human dlk mRNA 52 11 4 41 465 1.01 E-08 Homolog to ECE I 3 7 8 6 500 As a target, TH occurred in 15 of 522 cDNA libraries studied, and showed strong coexpression with genes known to be related to neurotransmitter processing and other known genes, such as DBH, nAchR, secretogranin I, human dlk mRNA for putative homeotic protein, and a homolog of endothelin converting enzyme (ECE). These results are shown in Table 4 with association probability in the range of 1.58E-8 to 1.53E-13.
Similar results were observed when the other nine known genes, AADC, DBH, nAchR-a3, secretogranin I and II, Rab3a, hCART, hVMATI, and ARIX were taken as target genes. These results indicate that the coexpression method was successful in identifying association of the known genes among themselves and with the known genes specified above, and thus was effective in identifying genes that are associated with the synthesis and release of neurotransmitters such as dopamine and norepinephrine.
V. Identification of New Genes Associated with Neurotransmitter Processing We have identified five new genes by testing for the co-expression of their cDNA with that from genes known to be dopamine- and norepinephrine-related in the libraries comprising LIFESEQ database (Incyte Pharamaceuticals). The five genes show statistically significant associations with the known dopamine- and norepinephrine-related genes as measured by a Fisher exact test. These five new genes are potential therapeutic proteins and therapeutic targets for the treatment of diseases in which the levels of norepinephrine and/or dopamine are perturbed from their normal levels.
The new genes were identified from a total of41,419 assembled Incyte gene sequences.
The degree of association was measured by probability values and has a cutoff of p value less than 0.00001. This was followed by annotation and literature searches to insure that the genes that passed the probability test have strong association with known neurotransmitter-processing-specifc genes. This process was reiterated so that the initial 41419 genes were reduced to the final five genes associated with neurotransmitter processing. Details of identification for the five genes are presented in Tables 5 to 10. These tables show the five genes that were most closely associated for each target new gene as measured by coexpression using the Fisher Exact test.
The column headings have the same meanings as in Example IV.
Table 5 Coexpression results for Incyte gene 621850 P-value Associated gene Occurs Both Target Associated Neither occur only only 6.91E-09 DBH 12 6 4 6 SOS
(1995) Science 269:202-204). Automated synthesis may be achieved using the AB1431A Peptide synthesizer (PE Biosystems, Foster City CA). Additionally, PSEQ or the amino acid sequence encoded by NSEQ, or any part thereof, may be altered during direct synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a polypeptide variant.
In another embodiment, the invention provides a substantially purified polypeptide comprising the amino acid sequence selected from the group consisting of SEQ
ID NO: 6, or fragments thereof.
DIAGNOSTICS and THERAPEUTICS
The genes of the invention can be used in diagnosis, prevention, treatment, and evaluation of therapies for diseases related to neurotransmitter processing, particularly neurological and hormone-related diseases, more particularly Parkinson's disease, schizophrenia, epilepsy, female reproductive disorders, and attention deficit disorder. Further, the amino acid sequences encoded by the new genes are potential therapeutic proteins and targets of therapeutics against these diseases.
In one preferred embodiment, the NSEQ or the polynucleotides encoding PSEQ are used for diagnostic purposes to determine the absence, presence, and excess expression of PSEQ, and to monitor regulation of the levels of mRNA or the polypeptides encoded by NSEQ during therapeutic intervention. The polynucleotides may be at least 18 nucleotides long, complementary RNA and DNA molecules, branched nucleic acids, and peptide nucleic acids (PNAs).
Alternatively, the polynucleotides are used to detect and quantitate gene expression in samples in which expression of PSEQ or the polypeptides encoded by NSEQ are correlated with disease.
Additionally, NSEQ or the polynucleotides encoding PSEQ can be used to detect genetic polymorphisms associated with a disease. These polymorphisms may be detected at the transcript cDNA or genomic level.
The specificity of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or amplification (maximal, high, intermediate, or low), will determine whether the probe identifies only naturally occurring sequences encoding PSEQ, allelic variants, or related sequences.
Probes may also be used for the detection of related sequences, and should preferably have at least 50% sequence identity to any of the NSEQ or PSEQ-encoding sequences.
Means for producing specific hybridization probes for DNAs encoding PSEQ
include the cloning of NSEQ or polynucleotide sequences encoding PSEQ into vectors for the production of mRNA probes. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, for example, by radionuclides such as'ZP or'sS, or by enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, by fluorescent labels and the like. The polynucleotide sequences encoding PSEQ may be used in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies;and in microarrays utilizing fluids or tissues from patients to detect altered PSEQ expression. Such qualitative or quantitative methods are well known in the art.
NSEQ or the nucleotide sequences encoding PSEQ can be labeled by standard methods and added to a fluid or tissue sample from a patient under conditions suitable for the formation of hybridization complexes. After a suitable incubation period, the sample is washed and the signal is quantitated and compared with a standard value. If the amount of signal in the patient sample is significantly altered in comparison to the standard value then the presence of altered levels of nucleotide sequences of NSEQ and those encoding PSEQ in the sample indicates the presence of the associated disease. Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual patient.
Once the presence of a disease is established and a treatment protocol is initiated, hybridization or amplification assays can be repeated on a regular basis to determine if the level of expression in the patient begins to approximate that which is observed in the normal subject. The results obtained from successive assays may be used to show the efficacy of treatment over a period ranging from several days to months.
The polynucleotides may be used for the diagnosis of a variety of diseases associated with neurotransmitter processing such as neurological disorders including, but not limited to, akathesia, Alzheimer's disease, amnesia, amylotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, dementia, depression, diabetic neuropathy, Down's syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's disease, peripheral neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid psychoses, postherapeutic neuralgia, schizophrenia, and Tourette's disorder; and reproductive disorders including, but not limited to, disorders of prolactin production, tubal disease, ovulatory defects, endometriosis, disruptions of the estrous cycle, disruptions of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis, cancer of the breast, testis, and prostate, fibrocystic breast disease, galactorrhea disruptions of spermatogenesis, abnormal sperm physiology, prostatitis, Peyronie's disease, and gynecomastia.
Alternatively, the polynucleotides may be used as targets in a microarray. The microarray can be used to monitor the expression level of large numbers of genes simultaneously and to identify splice variants, mutations, and polymorphisms. This information may be used to determine gene function, to understand the genetic basis of a disease, to diagnose a disease, and to develop and monitor the activities of therapeutic agents.
1n yet another alternative, polynucleotides may be used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. ( 1995) in Meyers, supra, pp. 965-968).
In another embodiment, antibodies which specifically bind PSEQ may be used for the diagnosis of diseases characterized by the over- or under-expression of PSEQ
or polypeptides encoded by NSEQ. Alternatively, one may use competitive drug screening assays in which neutralizing antibodies capable of binding PSEQ or the polypeptides encoded by NSEQ
specifically compete with a test compound for binding the polypeptides. In this manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with PSEQ or the polypeptides encoded by NSEQ. Diagnostic assays for PSEQ or the polypeptides encoded by NSEQ include methods which utilize the antibody and a label to detect PSEQ or the polypeptided encoded by NSEQ in human body fluids or in extracts of cells or tissues. A variety of protocols for measuring PSEQ or the polypeptides encoded by NSEQ, including ELISAs, RIAs, and FRCS, are well known in the art and provide a basis for diagnosing altered or abnormal levels of the expression of PSEQ or the polypeptides encoded by NSEQ.
Normal or standard values for PSEQ expression are established by combining body fluids or cell extracts taken from normal subjects, preferably human, with antibody to PSEQ
or a polypeptide encoded by NSEQ under conditions suitable for complex formation The amount of standard complex formation may be quantitated by various methods, preferably by photometric means.
Quantities of PSEQ or the polypeptides encoded by NSEQ expressed in subject, control, and disease samples from biopsied tissues are compared with the standard values.
Deviation between standard and subject values establishes the parameters for diagnosing or monitoring disease.
In another aspect, the polynucleotides and polypeptides of the present invention can be employed for treatment or the monitoring of therapeutic treatments for the diseases specified above. The polynucleotides of NSEQ or those encoding PSEQ, or any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the complement of the polynucleotides ofNSEQ or those encoding PSEQ may be used in situations in which it would be desirable to block the transcription or translation of the mRNA.
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art can be used to construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding PSEQ. (See, e.g., Sambrook, supra; and Ausubel, supra.) Genes having polynucleotide sequences of NSEQ or those encoding PSEQ can be turned off by transforming a cell or tissue with expression vectors which express high levels of a polynucleotide, or fragment thereof, encoding PSEQ. Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 and +10 from the start site, are preferred. Similarly, inhibition can be achieved using triple helix base-pairing methodology.
Triple helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerises, transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been described in the literature.
(See, e.g., Gee, J.E.
et al. ( 1994) in Huber, B.E. and B.1. Carr, Molecular and Immunologic Aonroaches, Futura Publishing Co., Mt. Kisco NY, pp. 163-177.) Ribozymes, enzymatic RNA
molecules, may also be used to catalyze the specific cleavage of RNA.
RNA molecules may be modified to increase intracellular stability and half life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.
Many methods for introducing vectors into cells or tissues are available and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken from the patient and clonally propagated for autologous transplant back into that same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al.
( 1997) Nature Biotechnology I 5:462-466.) Further, an antagonist or antibody of a polypeptide of PSEQ or encoded by NSEQ
may be administered to a subject to treat or prevent a disease related to synthesis and release of dopamine and norepinephrine with increased expression or activity of PSEQ. An antibody which specifically binds the polypeptide may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express the polypeptide.
Antibodies to PSEQ or a polypeptide encoded by NSEQ may also be generated using methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonat, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are especially preferred for therapeutic use. Monoclonal antibodies to PSEQ may be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. In addition, techniques developed for the production of chimeric antibodies can be used. (See, e.g., Meyers, supra.) Alternatively, techniques described for the production of single chain antibodies may be employed. Antibody fragments which contain specific binding sites for PSEQ or the polypeptide sequences encoded by NSEQ may also be generated.
Various immunoassays may be used for screening to identify antibodies having the desired specificity. Numerous protocols for competitive binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art.
Yet further, an agonist of a polypeptide of PSEQ or that encoded by NSEQ may be administered to a subject to treat or prevent a cancer associated with decreased expression or activity of the polypeptide.
An additional aspect of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any ofthe therapeutic effects discussed above. Such pharmaceutical compositions may consist of polypeptides of PSEQ or those encoded by NSEQ, antibodies to the polypeptides, and mimetics, agonists, antagonists, or inhibitors ofthe polypeptides. The compositions may be administered alone or in combination with at least one other agent, such as a stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a patient alone, or in combination with other agents, drugs, or hormones.
The pharmaceutical compositions utilized in this invention may be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.
In addition to the active ingredients, these pharmaceutical compositions may contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. Further details on techniques for formulation and administration may be found in the latest edition of ReminQton's Pharmaceutical Sciences (Maack Publishing Co. Easton PA).
WO ~U/12685 For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
A therapeutically effective dose refers to that amount of active ingredient, for example, polypeptides of PSEQ or those encoded by NSEQ, or fragments thereof, antibodies of the polypeptides, and agonists, antagonists or inhibitors of the polypeptides, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating the EDS° (the dose therapeutically effective in 50% of the population) or LDs° (the dose lethal to SO%
of the population) statistics.
Any of the therapeutic methods described above may be applied to any subject in need of such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, I S monkeys, and most preferably, humans.
EXAMPLES
1t is understood that this invention is not limited to the particular methodology, protocols, and reagents described, as these may vary. It is also understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.
The examples below are provide to illustrate the subject invention and are not included for the purpose of limiting the invention.
I. PGANNOTOI cDNA Library Construction For purposes of example, the preparation of the PGANNOTOI library is described. The PGANNOTO1 cDNA library was constructed using polyA RNA isolated from paraganglionic tumor tissue removed from the intra-abdominal region of a 46-year-old Caucasian male during exploratory laparotomy. Pathology indicated a benign paraganglioma and was associated with a grade 2 renal cell carcinoma, clear cell type, which did not penetrate the capsule. Family history included cerebrovascular disease, atherosclerotic coronary artery disease, a myocardial infarction, and type II diabetes.
The frozen tissue was homogenized and lysed in guanidinium isothiocyanate solution using a Poiytron homogenizes (PT-3000; Brinkmann Instruments, Westbury NJ).
The lysate was centrifuged over a 5.7 M CsCI cushion using an SW28 rotor in a BL8-70M
ultracentrifuge (Beckman Coulter, Fullerton CA) for 18 hours at 25,000 rpm at ambient temperature. The RNA
was extracted twice with acid phenol, pH 4.0, following an RNA isolation protocol (Stratagene, La Jolla CA) , precipitated using 0.3 M sodium acetate and 2.5 volumes of ethanol, resuspended in DEPC-treated water and treated with DNase for I 5 min at 37°C. The reaction was stopped with an equal volume of acid phenol and the RNA was isolated using the OLIGOTEX kit (Qiagen , Chatsworth CA) and used to construct the cDNA library.
The RNA was handled according to the recommended protocols in the SUPERSCRIPT
Plasmid system (Life Technologies, Gaithersburg MD), and cDNAs were ligated into pSport I
plasmid (Life Technologies). The plasmid was subsequently transformed into DHSa competent cells (Life Technologies).
II Isolation and Sequencing of cDNA Clones Plasmid DNA was released from the cells and purified using the M1NIPREP kit (Edge Biosystems, Gaithersburg MD). This kit consists of a 96 well block with reagents for 960 purifications. The recommended protocol was employed except for the following changes: 1) the 96 wells were each filled with only 1 m) of sterile Terrific Broth (Life Technologies) with carbenicillin at 25 mg/L and glycerol at 0.4%; 2) the bacteria were cultured for 24 hours after the wells were inoculated and then lysed with 60 ~l of lysis buffer; 3) a centrifugation step employing the GS-6R (Beckman Coulter) at 2900 rpm for 5 min was performed before the contents of the block were added to the primary filter plate; and 4) the optional step of adding isopropanol to TRIS buffer was not routinely performed. After the last step in the protocol, samples were transferred to a 96-well block for storage.
Alternative methods of purifying plasmid DNA include the use of MAGIC
MINIPREPS
system (Promega, Madison WI) or QIAWELL-$ Plasmid, QIAWELL PLUS DNA and QIAWELL ULTRA DNA purification systems (Qiagen).
The cDNAs were prepared using a MICROLAB 2200 system (Hamilton, Reno NV) in combination with four DNA ENGINE thermal cyclers (PTC200; MJ Research, Watertown MA) and sequenced by the method of Sanger F and AR Coulson ( 1975; J Mol Biol 94:441 f) using ABI
PRISM 377 or 373 DNA sequencing systems (PE Biosystems).
III. Selection, Assembly, and Characterization of Seguences The sequences used for coexpression analysis were assembled from EST
sequences, 5' and 3' longread sequences, and full length coding sequences. Selected sequences were expressed in at least three cDNA libraries.
The assembly process is described as follows. EST sequence chromatograms were processed and verified. Quality scores were obtained using PHRED (Swing, B. et al. (1998) Genome Res. 8:175-185; Swing, B. and P. Green (1998) Genome Res. 8:186-194).
The edited WO 00/126$5 PCT/US99/I9615 sequences were loaded into a relational database management system (RDBMS).
The EST
sequences were clustered into an initial set of bins using BLAST with a product score of S0. All clusters of two or more sequences were created as bins. The overlapping sequences represented in a bin correspond to the sequence of a transcribed gene.
Assembly of the component sequences within each bin was performed using a modification of Phrap, a publicly available program for assembling DNA
fragments (Green, P.
University of Washington, Seattle WA). Bins that showed 82% identity from a local pair-wise alignment between any of the consensus sequences were merged.
Bins were annotated by screening the consensus sequence in each bin against public databases, such as GBPRI and GENPEPT from NCBI. The annotation process involved a FASTn screen against the GBPRI database in GENBANK. Those hits with a percent identity of greater than or equal to 70% and an alignment length of greater than or equal to 100 base pairs were recorded as homolog hits. The residual unannotated sequences were screened by FASTx against GENPEPT. Those hits with an E value of less than or equal to 10-e are recorded as homolog hits.
Sequences were then reclustered using BLASTn and Cross-Match, a program for rapid protein and nucleic acid sequence comparison and database search (Green, P.
University of Washington, Seattle WA), sequentially. Any BLAST alignment between a sequence and a consensus sequence with a score greater than I50 was realigned using cross-match. The sequence was added to the bin whose consensus sequence gave the highest Smith-Waterman score amongst local alignments with at least 82% identity. Non-matching sequences created new bins. The assembly and consensus generation processes were performed for the new bins.
IV. Known Genes Used for Identifying New Genes Associated with Neurotransmitter Processing Any of ten neurotransmitter-processing-specific genes were used to identify new Incyte genes, using the coexpression analysis method described above. The genes included L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine ~i hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMATI), ARIX homeodomain protein, and the like. Other known genes that have potential association with dopamine and norepinephrine pathways include endothelin converting enzymes (ECEs), rapt-interacting protein 8 (RPIPB), neuron-specific growth-associated protein (SCG10), and "Delta-like" putative homeotic protein (dlk).
L-tyrosine hydroxylase (TH} catalyzes the conversion of L-tyrosine to L-dopa, which is the first and rate-limiting step in the synthesis of the catecholamines dopamine, norepinephrine and Ptr'T/US99/19615 epinephrine (Nagatsu, T. (1995) Essays Biochem. 30:15-35). TH activity is regulated in several ways including transcriptionally, by alternative mRNA splicing, and by regulation of mRNA
stability. Aromatic amino acid decarboxylase (AADC) catalyzes the decarboxylation of L-tyorsine, L-tryptophan, L-phenylalanine and L-dopa. Decarboxylation of L-dopa yields dopamine (Webster and Jordan, supra), and increased levels of AADC enhanced dopamine synthesis in vivo (Kaddis et al., su ra). Dopamine ~i hydroxylase (DBH) converts dopamine to norepinephrine (Wu, H.J. et al.
(1990) J. Neurochem. 55:97-105). Nicotinic acetylcholine receptor a3 subunit precursor (nAchR-a3) is a subunit in the ligand-gated sodium ion channel triggered by nicotine and acetylcholine.
Prolonged stimulation of nAchR with nicotine or acetylcholine induces TH and DBH (Patrick, R.L.
and J.D. Barchas (i976) J. Pharmacol. Exp: Ther. 197:181-196). Secretogranins are vesicle associated proteins. Secretogranin I is found in norepinephrine-containing vesicles (Bressler, J.P.
et al. ( 1996) J. Neurosci. Res. 46:678-685). Secretogranin I1 is a precursor for secretoneurin, which stimulates dopamine release (Agneter, E. et al. (1995) J. Neurochem. 65:622-625; Fischer-Colbrie, R. et al. ( 1995) Prog. Neurobiol. 46:49-70). Secretogranin II occurs in dopamine- and norepinephrine-containing vesicles in neurons (Goodall, A.R. et al. ( 1997) J.
Neurochem. 68:1542-1552). Rab3a is a member of the Ras GTPase family. Rab3a is associated with large dense core vesicles and partially co-localizes with DBH (Johannes, L. et al. (1994) Embo.
J. 13:2029-2037;
Darchen, F. et al. (1995) J. Cell Sci. 108: 1639-1649), and is one of the factors controlling Ca2+-dependent exocytosis (Johannes et al., supra). Human cocaine and amphetamine regulated transcript (hCART) is a signal peptide protein and is specifically induced in neurons exposed to cocaine and amphetamine (Douglass, J. and S. Daoud (1996) Gene 169:241-245).
Cocaine blocks re-uptake ofdopamine from the synapse (Reith, M.E, et al. (1997) Eur. J.
Pharmacol. 324:1-10 as does amphetamine, which also induces dopamine release (Jones, S.R. et al.
(1998) J. Neurosci.
I 8:1979-1986). Vesicular monoamine transporter 1 (hVMATI ) packages the neurotransmitter monoamines into large dense core vesicles for exocytosis at the synapse (Liu, Y. et al. (1994) J.
Cell Biol. 127:1419-1433). ARIX is a human homeodomain protein which is specifically expressed in noradrenergic, DBH-positive tissues and in cell lines derived from those tissues (Zeilmer, E. et al. ( 1995) J. Neurosci. 15:8109-8120). ARIX regulates the transcriptional activities of the DBH promoter and the promoter of the tyrosine hydroxylase gene, two of the promoters which control the specific expression of catecholamine biosynthetic genes.
The endothelins, ET-1, ET-2, and ET-3, are a class of secreted peptides that are processed from inactive precursor peptides. The third and final processing step leading to the active ET is catalyzed by a member of the endothelin converting enzymes (ECEs). The ET's were known vasoactive agents, potential neurotransmitters, and potential growth factors (Barnes, K. and A.J.
Turner (1997) Neurochem. Res. 22:1033-1040). ET-I was shown to enhance norepinephrine pressor response (Kita, S. et al. (1998) J. Cardiovasc. Pharmacol. 31:5119-S121), and ET-1 and ET-3 to induce dopamine release (Horie, K. et al. (1995) Life Sci. 57:735-741;
Shibaguchi, H, et al. (1997) Cell Mol. Neurobiol. 17:471-481). ET-2, however, has not been shown in published literature to be associated with norepinephrine or dopamine processing. Rap2-interacting protein 8 (RPIPB) was discovered using a yeast two-hybrid screen of mouse brain cDNAs using the small GTP-binding protein Rap2 as bait (Janoueix-Lerosey, I. et al. ( 1998) Eur. J.
Biochem. 252(2):290-298). Rap2 is a member of the Ras family, another member of which (Rab3a) was known to be involved in exocytosis of neurotransmitters. SCG 10 is a neuron-specific growth-associated protein which belongs to the stathmin family. SCG10 functions in increasing the breakdown of microtubules in the neural growth cone (Riederer, B. M. et al. (1997) Proc.
Natl. Acad. Sci.
91:741-745). dlk is a "Delta-like" putative homeotic protein thought to be involved in neuroendocrine differentiation (Laborda, J. et al. ( 1993) J. Biol. Chem.
268:3817-3820). It belongs to the epidermal growth factor receptor family (Lee, Y.L. et al.
(1995) Biochim. Biophys.
Acta 1261:223-232) and is induced by growth hormone (Carlsson, C. et al.
(1997) Endocrinology 138:3940-3948) and regulated through ErbB3 (Edman, C.F. et al. (1997) Biochem.
J. 323:113-118).
Table 4 shows the five genes that have the strongest association with TH, a gene involved in the synthesis and release of dopamine, and norepinephrine. These coexpressed genes are presented with their p-values. The column headings have the following meanings:
P-value The probability that the observed number of co-occurrences is due to chance using the Fisher Exact method.
Associated gene A gene that shows significant co-expression with the target.
Occurs The number of libraries in which the associated gene occurs.
Both occur The number of libraries in which both the target gene and the co-expressed gene occur.
Target only The number of libraries in which only the target gene occurs.
Associated only The number of libraries in which only the associated gene occurs.
Neither The number of libraries in which neither the target gene nor the associated gene occur.
Table 4 Co-expression results for TH
P-value Associated gene Occurs Both Target Associated Neither occur only only 1.47E-13 DBH 12 9 6 3 503 6.IlE-I1 nAchR-a3 13 8 7 5 501 6.22E-10 Secretogranin I 45 I 1 4 34 472 3.51 E-09 Human dlk mRNA 52 11 4 41 465 1.01 E-08 Homolog to ECE I 3 7 8 6 500 As a target, TH occurred in 15 of 522 cDNA libraries studied, and showed strong coexpression with genes known to be related to neurotransmitter processing and other known genes, such as DBH, nAchR, secretogranin I, human dlk mRNA for putative homeotic protein, and a homolog of endothelin converting enzyme (ECE). These results are shown in Table 4 with association probability in the range of 1.58E-8 to 1.53E-13.
Similar results were observed when the other nine known genes, AADC, DBH, nAchR-a3, secretogranin I and II, Rab3a, hCART, hVMATI, and ARIX were taken as target genes. These results indicate that the coexpression method was successful in identifying association of the known genes among themselves and with the known genes specified above, and thus was effective in identifying genes that are associated with the synthesis and release of neurotransmitters such as dopamine and norepinephrine.
V. Identification of New Genes Associated with Neurotransmitter Processing We have identified five new genes by testing for the co-expression of their cDNA with that from genes known to be dopamine- and norepinephrine-related in the libraries comprising LIFESEQ database (Incyte Pharamaceuticals). The five genes show statistically significant associations with the known dopamine- and norepinephrine-related genes as measured by a Fisher exact test. These five new genes are potential therapeutic proteins and therapeutic targets for the treatment of diseases in which the levels of norepinephrine and/or dopamine are perturbed from their normal levels.
The new genes were identified from a total of41,419 assembled Incyte gene sequences.
The degree of association was measured by probability values and has a cutoff of p value less than 0.00001. This was followed by annotation and literature searches to insure that the genes that passed the probability test have strong association with known neurotransmitter-processing-specifc genes. This process was reiterated so that the initial 41419 genes were reduced to the final five genes associated with neurotransmitter processing. Details of identification for the five genes are presented in Tables 5 to 10. These tables show the five genes that were most closely associated for each target new gene as measured by coexpression using the Fisher Exact test.
The column headings have the same meanings as in Example IV.
Table 5 Coexpression results for Incyte gene 621850 P-value Associated gene Occurs Both Target Associated Neither occur only only 6.91E-09 DBH 12 6 4 6 SOS
8.41 E-08 Somatostatin I 29 7 3 22 489 I .37E-07 SCG 10 49 8 2 41 470 2.25E-07 Secretogranin II 33 7 3 26 485 Incyte gene 621850 occurred in 10 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to neurotransmitter processing, including DBH, somatostatin I and II, as shown in Table S. 621850 also showed strong association with a SCG10.
SCG10 was shown in Example IV to be strongly associated with the known genes related to synthesis and release of dopamine. These results are consistent with the notion that 621850 is associated with neurotransmitter processing. .
Table 6 Coexpression results for Incyte gene P-value Associated Occurs Both TargetAssociatedNeither gene occuronly only 0 nAchR 13 7 7 6 501 0 TH type 4 15 7 7 8 499 0 Secretogranin 33 8 6 25 482 II
Incyte gene 625839 occurred in 14 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to synthesis and release of dopamine, and norepinephrine, including DBH, nAchR, TH type 4, and secretogranin II, as shown in Table 6.
625839 also showed strong association with PARP, a human poly(ADB-ribose) polymerase which is expressed during development in Drosophila (Uchida, K. et al. (1993) 90:3481-3485). These results are consistent with the notion that 625839 is associated with neurotransmitter processing.
Table 7 Coexpression results for Incyte gene 2405140 P-value Associated geneOccursBoth Target AssociatedNeither occur only only 9.39E-06 DBH 12 3 0 9 509 1.22E-OS nAchR 13 3 0 10 508 1.94E-OSTH type 4 15 3 0 12 506 3.48E-OS AADC 18 3 0 15 503 3.48E-OS Homolog to predicted18 3 0 1 S 503 c. elegans protein F25H2.8 Incyte gene 2405140 occurred in 3 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to neurotransmitter processing, including DBH, nAchR, TH type 4, and AADC, as shown in Table 7. 2405140 also showed strong association with a homolog to a predicted C. elegans protein. F25H2.8. These results are consistent with the notion that 2405140 is associated with neurotransmitter processing.
Table 8 Coexpression results for Incyte gene 2823339 P-value Associated gene Occurs Both Target Associated Neither occur only only 2.20E-08 Secretogranin 1 45 9 3 36 473 4.51 E-08 Secretogranin ll 33 8 4 25 484 1.37E-07 hVMATI 8 5 7 3 506 1.56E-07 TH type 4 15 6 6 9 500 lncyte gene 2823339 occurred in 12 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to neurotransmitter processing, including secretogranin I and ll, hVMATI and TH type 4, as shown in Tabie 8. These results are consistent with the notion that 2823339 is associated with neurotransmitter processing.
Table 9 Coexpression results for Incyte gene 2825861 P-value Associated gene Occurs Both Target Associated Neither occur only only 0 TH type 4 15 3 0 12 506 0 Human factor X 29 3 0 26 492 Incyte gene 2825861 occurred in 3 of 522 cDNA libraries studied and showed strong co-expression with two of the known genes related to neurotransmitter processing, including DBH and TH type 4, as shown in Table 9. 2825861 also shows association with a blood coagulation factor, human factor X. These results are consistent with the notion that 2825861 is associated with neurotransmitter processing..
VI. New Genes Associated with Neurotransmitter Processing Nucleic acids comprising nucleic acid sequences of SEQ ID NOs: I-5 of the present invention were identified from lncyte Clones 621850, 625839, 2405140, 2823339, and 2825861, respectively, and assembled as described in Example III. The nucleic acid sequences were translated and reading frames determined when possible. A full length protein sequence, SEQ ID
NO: 6, was derived from SEQ ID NO: 4 after determining its coding frame. SEQ
ID NO: I-6 were then annotated according to Example VII using BLAST and other motif search tools against databases of known molecules.
SEQ ID NO: 1 showed 99% sequence identity with a human brain-specific mltNA, KIAA0604 (g3043731 ). In addition, a polypeptide sequence translated from SEQ
ID NO:1 showed about 77.3% sequence homology with the gene that encodes endothelin-converting enzyme 2 of Bos taurus, ECE-2. As described in Example IV, ECEs are known to be involved in the pathway of neurotransmitter processing.
SEQ ID NO: 2 showed 60% sequence identity from about nucleotide 80 to about nucleotide 1818 with a human brain-specific gene, KIAA0604 (g3043731 ), and 61 % sequence identity from about necleotide 144 to about nucleotide 1760 with a human mRNA
for endothelin-converting enzyme 1, ECE-1 (gl 197803). ECE-1 is a metalloprotease known to be associated with the neurotransmitter processing.
SEQ ID N0:4 showed about 58% sequence identity from nucleotide 332 to nucleotide 1393 with the gene which encodes the human heavy neurofilament subunit, NF-H
(g35028), a homolog to intermediate filament (IF) proteins. The corresponding amino acid sequence, SEQ ID
NO: 6, is 210 amino acids in length and has one potential N-glycosylation site at N16; three potential casein kinase II phosphorylation sites at S21, S 100, and S 114; one potential glycosaminoglycan attachment site at 5110; and two potential protein kinase C
phosphorylation sites at S27 and S110.
VII. Homology Searching for Genes Associated with Neurotransmitter Processing Polynucleotide sequences, SEQ ID NOs: 1-5, and polypeptide sequences, SEQ ID
NO: 6, were queried against databases derived from sources such as GenBank and SwissProt. These databases, which contain previously identified and annotated sequences, were searched for regions of similarity using Basic Local Alignment Search Tool (BLAST; Altschul, S.F.
et al. (1990) J. Mol.
Biol. 215:403-410) and Smith-Waterman alignment (Smith, T. et al. ( 1992) Protein Engineering 5:35-51 ). BLAST searched for matches and reported only those that satisfied the probability thresholds of 10''-5 or less for nucleotide sequences and 10'8 or less for polypeptide sequences.
The polypeptide sequences were also analyzed for known motif patterns using MOTIFS, SPSCAN, BLIMPS, and Hidden Markov Mode) (HMM)-based protocols. MOTIFS
(Genetics Computer Group, Madison WI) searches polypeptide sequences for patterns that match those defined in the Prosite Dictionary of Protein Sites and Patterns (Bairoch, A.
et al. ( 1997) Nucleic Acids Res. 25:217-221 ), and displays the patterns found and their corresponding literature abstracts. SPSCAN (Genetics Computer Group, Madison, WI) searches for potential signal peptide sequences using a weighted matrix method (Nielsen, H. et al. (1997) Prot. Eng. 10:1-6).
Hits with a score of 5 or greater were considered. BLIMPS uses a weighted matrix analysis algorithm to search for sequence similarity between the polypeptide sequences and those contained in BLOCKS, a database consisting of short amino acid segments, or blocks, of 3-60 amino acids in length, compiled from the PROSITE database (Henikoff, S. and G.
J. Henikoff (1991) Nucleic Acids Res. 19:6565-6572; Bairoch et al., supra), and those in PRINTS, a protein fingerprint database based on non-redundant sequences obtained from sources such as SwissProt, GenBank, PIR, and NRL-3D (Attwood, T. K. et al. (1997) J. Chem. Inf. Comput.
Sci.
37:417-424). For the purposes of the present invention, the BLIMPS searches reported matches with a cutoff score of 1000 or greater and a cutoff probability value of 1.0 x 10'3. HMM-based protocols were based on a probabilistic approach and searched for consensus primary structures of gene families in the protein sequences (Eddy, S.R. (1996) Cur. Opin. Str.
Biol. 6:361-365;
Sonnhammer, E.L.L. et al. (1997) Proteins 28:405-420). More than 500 known protein families with cutoff scores ranging from 10 to 50 bits were selected for use in this invention.
VIII. Labeling and Use of Individual Hybridization Probes Oligonucleotides are designed using state-of the-art software such as OLIGO
4.06 (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ,uCi of [y-'zp) adenosine triphosphate (Amersham Pharmacia Biotech, Piscataway NJ), and T4 polynucleotide kinase (NEN Life Science Products, Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine resin column (Amersham Pharmacia Biotech). An aliquot containing 10'counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of human genomic DNA digested with one of the following endonucleases:
Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu II (NEN Life Science Products).
The DNA from each digest is fractionated on a 0.7 percent agarose gel and transferred to nylon membranes (NYTRAN PLUS, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5%
sodium dodecyl sulfate. After XOMAT AR film (Eastman Kodak, Rochester NY) is exposed to the blots for several hours, hybridization patterns are compared.
IX. Production of Specific Antibodies SEQ ID NO: 6 substantially purified using polyacrylamide gel electrophoresis (PAGE;
see, e.g., Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to immunize rabbits and to produce antibodies using standard protocols.
Alternatively, the amino acid sequence is analyzed using LASERGENE software (DNASTAR Madison W1) to determine regions of high immunogenicity, and a corresponding oligopeptide is synthesized and used to raise antibodies by means known to those of skill in the art. Methods for selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in the art. Typically, oligopeptides 15 residues in length are synthesized using an ABI 431A Peptide synthesizer (PE Biosystems) using Fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester to increase immunogenicity. Rabbits are immunized with the oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to plastic, blocking with 1 % BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
SEQUENCE LISTING
<110> INCYTE PHARMACEUTICALS, INC.
WALKER, Michael G.
VOLKMUTH, Wayne KLINGLER, Tod M.
<120> GENES ASSOCIATED WITH NEUROTRANSMITTER PROCESSING
<130> PB-0003 PCT
<140> To Be Assigned <141> Herewith <150> 09/149,952 <151> 1998-09-O1 <160> 6 <170> PERL Program <210> 1 <211> 3000 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte template ID No.: 621850CB1 <900> 1 gctgggagct ggcagcaacg tgggattcca gaaggggaca agacagctgt taggctcacg 60 cacgcagctg gagctggtct tagcaggtgc ctetctactg ctggctgcac tgcttctggg 120 ctgccttgtg gccctagggg tccagtacca cagagaccca tcccacagca cctgccttac 180 agaggcctgc attcgagtgg ctggaaaaat cctggagtcc ctggaccgag gggtgagccc 290 ctgtgaggac ttttaccagt tctcctgtgg gggctggatt cggaggaacc ccctgcccga 300 tgggcgttct cgctggaaca ccttcaacag cctctgggac caaaaccagg ccatactgaa 360 gcacctgctt gaaaacacca ccttcaactc cagcagtgaa gctgagcaga agacacagcg 920 cttctaccta tcttgcctac aggtggagcg cattgaggag ctgggagccc agccactgag 980 ggacctcatt gagaagattg gtggttggaa cattacgggg ccctgggacc aggacaactt 540 tatggaggtg ttgaaggcag tagcagggac ctacagggcc accccattct tcaccgtcta 600 catcagtgct gactctaaga gttccaacag caatgttatc caggtggacc agtctgggct 660 ctttctgccc tctcgggatt actacttaaa cagaactgcc aatgagaaag tgctcactgc 720 ctatctggat tacatggagg aactggggat gctgctgggt gggcggccca cctccacgag 780 ggagcagatg cagcaggtgc tggagttgga gatacagctg gccaacatca cagtgcccca 840 ggaccagcgg cgcgacgagg agaagatcta ccacaagatg agcatttcgg agctgcaggc 900 tctggcgccc tccatggact ggcttgagtc cctgtctttc ttgctgtcac cattggagtt 960 gagtgactct gagcctgtgg tggtgtatgg gatggattat ttgcagcagg tgtcagagct 1020 catcaaccgc acggaaccaa gcatcctgaa caattacctg atctggaacc tggtgcaaaa 1080 gacaacctca agcctggacc gacgctttga gtctgcacaa gagaagctgc tggagaccct 1140 ctatggcact aagaagtcct gtgtgccgag gtggcagacc tgcatctcca acacggatga 1200 cgcccttggc tttgctttgg ggtccctctt cgtgaaggcc acgtttgacc ggcaaagcaa 1260 agaaattgca gaggggatga tcagcgaaat ccggaccgca tttgaggagg ccctgggcaa 1320 gctggtttgg atggatgaga agacccgcca ggcagccaag gagaaagcag atgccatcta 1380 tgatatgatt ggtttcccag actttatcct ggagcccaaa gagctggatg atgtttatga 1440 cgggtacgaa atttctgaag attctttctt ccaaaacatg ttgaatttgt acaacttctc 1500 tgccaaggtt atggctgacc agctccgcaa gcctcccagc cgagaccagt ggagcatgac 1560 cccccagaca gtgaatgcct actaccttcc aactaagaat gagatcgtct tccccgctgg 1620 catcctgcag gcccccttct atgcccgcaa ccaccccaag gccctgaact tcggtggcat 1680 cggtgtggtc atgggccatg agttgacgca tgcctttgat gaccaagggc gcgagtatga 1790 caaagaaggg aacctgcggc cctggtggca gaatgagtcc ctggcagcct tccggaacca 1800 WO 00/12685 PC'T/US99/19615 cacggcctgc atggaggaac agtacaatca ataccaggtc aatggggaga ggctcaacgg 1860 ccgccagacg ctgggggaga acattgcgga caacgggggg ctgaaggctg cctacaatgc 1920 ttacaaagca tggctgagaa agcatgggga ggagcagcaa ctgccagccg tggggctcac 1980 caaccaccag ctcttcttcg tgggatttgc ccaggtgtgg tgctcggtcc gcacaccaga 2040 gagttctcac gaggggctgg tgaccgaccc ccacagccct gcccgcttcc gcgtgctggg 2100 cactctctcc aactcccgtg acttcctgcg gcacttcggc tgccctgtcg gctcccccat 2160 gaacccaggg cagctgtgtg aggtgtggta gacctggatc aggggagaaa tgcccagctg 2220 tcaccagacc tggggcagct ctcctgacaa agctgtttgc tcttgggttg ggaggaagca 2280 aatgcaagct gggctgggtc tagtccctcc cccccacagg tgacatgagt acagaccctc 2340 ctcaatcacc acattgtgcc tctgctttgg gggtgcccct gcctccagca gagcccccac 2400 cattcactgt gacatctttc cgtgtcaccc tgcctggaag aggtctgggt ggggaggcca 2460 gttcccatag gaaggagtct gcctcttctg tccccaggct cactcagcct ggcggccatg 2520 gggcctgccg tgcctgcccc actgtgaccc acaggcctgg gtggtgtacc tcctggactt 2580 ctccccaggc tcactcagtg cgcacttagg ggtggactca gctctgtctg gctcaccctc 2640 acgggctacc cccacctcac cctgtgctcc ttgtgccact gctcccagtg ctgctgctga 2700 ccttcactga cagctcctag tggaagccca agggcctctg aaagcctcct gctgcccact 2760 gtttccctgg gctgagaggg gaagtgcata tgtgtagcgg gtactggttc ctgtgtctta 2820 gggcacaagc ctttagcaaa tgattgattc tccctggaca aagcaggaaa gcagatagag 2880 cagggaaaag gaagaacaga gtttattttt acagaaaaga gggtgggagg gtgtggtctt 2990 ggcccttata ggaccctgtg ccaataaaca gacatgcatc cgtgaaaaaa aaaaaaaaaa 3000 <210> 2 <211> 2150 <212> DNA
<213> Homo Sapiens <220>
<221> unsure <222> 2107, 2116, 2124, 2133, 2137, 2190 <223> a or g or c or t, unknown, or other <220>
<221> misc_feature <223> Incyte template ID No.: 625839CT1 <400> 2 gcgagatcga gcgactgggc ccgcgaccca tgctagaggt catcgaggac tgcgggggct 60 gggacctggg cggcgcggag gagcgtccgg gggtcgcggc gcgatgggac ctcaaccggc 120 tgctgtacaa ggcgcagggc gtgtacagcg ccgccgcgct cttctcgctc acggtcagcc 180 tggacgacag gaactcctcg cgctacgtca tccgcattga ccaggatggg ctcaccctgc 240 cagagaggac cctgtacctc gctcaggatg aggacagtga gaagatcctg gcagcataca 300 gggtgttcat ggagcgagtg ctcagcctcc tgggtgcaga cgctgtggaa cagaaggccc 360 aagagatcct gcaagtggag cagcagctgg ccaacatcac tgtgtcagag tatgacgacc 420 tacggcgaga tgtcagctcc atgtacaaca aggtgacgct ggggcagctg cagaagatca 480 ccccccactt gcggtggaag tggctgctag accagatctt ccaggaggac ttctcagagg 540 aagaggaggt ggtgctgctg gcgacagact acatgcagca ggtgtcgcag ctcatccgct 600 ccacacccca ccgggtcctg cacaactacc tggtgtggcg cgtggtggtg gtcctgagtg 660 aacacctgtc cccgccattc cgtgaggcac tgcacgagct ggcacaggag atggagggca 720 gcgacaagcc acaggagctg gcccgggtct gcttgggcca ggcccaatcg ccactttggc 780 atggcgcttg gcgccctctt tgtacatgag cattctcagc tgccagcaaa gccaaggtgc 840 agcagctagt ggaagacatc aagtacatcc tgggccagcg cctggaggag ctggactgga 900 tggacgccga gaccagggct gctgctcggg gccaagctcc agtacatgat ggtgatggtc 960 ggctacccgg acttcctgct gaaacccgat gctgtggaca aggagtatga gtttgaggtc 1020 catgagaaga cctacttcaa gaacatcttg aacagcatcc gcttcagcat ccagctctgc 1080 agttaagaag attcggcagg aggtggacaa gtccacgtgg ctgctccccc cacaggcgct 1140 caatgcctac tatctaccca acaagaacca gatggtgttc cccgcgggca tcctgcagcc 1200 caccctgtac gaccctgact tcccacagtc tctcaactac gggggcatcg gcaccatcat 1260 tggacatgag ctgacccacg gctacgacga ctgggggggc cagtatgacc gctcagggaa 1320 cctgctgcac tggtggacgg aggcctccta cagccgcttc ctgcgaaagg ctgagtgcat 1380 cgtccgtctc tatgacaact tcactgtcta caaccagcgg gtgaacggga aacacacgct 1440 tggggagaac atcgcagata tgggcggcct caagctggcc taccacgcct atcagaagtg 1500 ggtgcgggag cacggcccag agcacccact tccccggctc aagtacacac atgaccagct 1560 cttcttcatt gcctttgccc agaactggtg catcaagcgg cggtcgcagt ccatctacct 1620 gcaggtgctg actgacaagc atgcccctga gcactacagg gtgctgggca gtgtgtccca 1680 gtttgaggag tttggccggg ctttccactg tcccaaggac tcacccatga accctgccca 1740 caagtgttcc gtgtggtgag cctggctgcc cgcctgcacg cccccactgc ccccgcacga 1800 atcacctcct gctggctacc ggggcaggca tgcacccggt gccagccccg ctctgggcac 1860 cacctgcctt ccagcccctc caggacccgg tccgcctgct gcccctcact tcaggagggg 1920 cctggagcag ggtgaggctg gactttgggg ggctgtgagg gaaatatact ggggtcccca 1980 gattctgctc taagggggcc agaccctctg ccaggctgga ttgtacgggc cccaccttcg 2040 ctgtgttctt gctgcaaagt ctggtcaata aatcactgca ctgttaaaaa aaaaaattag 2100 gggtgtntaa agtggntgga cacngggtca acntatnttn gaaccttttg 2150 <210> 3 <211> 512 <212> DNA
<213> Homo sapiens <220>
<221> misc feature <223> Incyte template ID No.: 2905140CB1 <400> 3 gagaaatgta taaaataatc gagaccggcg gaggcaggtc agagccgtct ggaatgcgcg 60 caccttcaac gaacaatgcc aacattgaag tcctcggttg gagtctgcac agttggagat 120 ctttggtgcc attttagaca tctttggatt tcatcaatca aactgactgc aattttccat 180 aaaaaccctg aatttgggtc agaaagtggg caaagtagat aaagatcatt cgagctgtct 290 tataagatga taaatagata tcctttcagg ccaacaatgc caaagtgcag ttttgtgatt 300 cccttccatg ggttctgaat gcagtgagtc gaaacgattt ctacatgttt tcccatggtt 360 taggaggtgt ctttacatac ttgtcaatag tagcctgacc tttttcccca tggagttgct 420 aagtgtgttt tgtttgttgc tttgagtact tttttcttgt tgtttgtgtg tgtgttgcac 480 aaaatacaca agaaaataaa aaaaaaaaaa as 512 <210> 4 <211> 1511 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte template ID No.: 2823339CB1 <400> 9 tccagagtgc tgaatttctg ggcagccagg gggctcttgc tctgctatgg gttgaagatt 60 cagcctgatc acttcttgtc aaagttgggg agtctgagct ggttctgggc ctagcaaggc 120 aatctactct gttaaagtcg atagagggag aagctgggga gaacactgca attttttcta 180 tgagccctct gtagagggtg ggtggtgggg gctccacaga ctgctccttg gttccacagg 240 ccatgcccgg aggtgctggc ttcgagccct tcagacaagt aaggtcagct ctgccccctc 300 ctgcttcact tgccctgtag ctgtggatgc tactactagg gaggtcctgc ctggggccct 360 gcaggtgtgc agctcagagc ccgctgagcc cccaggaacc cctcctgcct ctcacagcca 920 tctagatgca gctcctctgc ccactgttgt ctactctaaa ggacttcaga gaggctctcc 480 agcaggcgcc tgggactcgg accaaaatgg caactccaag cgtgctttgg gggaccctgc 540 cactcccacg gaaggtcctc gccgcccacc tccccgtcct ccctgtcggc tcagcatggg 600 ccgccgtcac aaactctgta gccctgaccc gggccaggcc aacaacagtg aaggcagcga 660 ccatgactac ctgcccttgg tgcggctgca ggaggcacca ggctccttcc gcctggacgc 720 gcccttctgc gccgctgtgc gcatctcgca ggagcgcctc tgccgtgcct cgccctttgc 780 cgtgcaccgc gccagcctca gccccacctc ggcctcattg ccctgggcac ttctgggccc 840 tggtgttggc cagggtgaca gtgccacggc ctcctgcagc ccgtccccca gctcgggctc 900 tgaggggcca ggccaggtgg acagtgggcg gggctcagac accgaggcct ccgatggggc 960 ggaagggctg ggcggcaccg acctgcgggg ccggacctgg gccactgccg tagcactcgc 1020 ctggctggag caccgatgcg ccgctgcctt cgacgagtgg gaactgacag cggccaaggc 1080 tgattgctgg ctgcgggccc agcacttgcc tgacggcctt gacctggccg ccctcaaggc 1140 cgcagcccga gggctcttcc tgctactgcg ccactgggac caaaacctgc agctacacct 1200 gctgtgctac agcccagcga acgtgtgaag gctgccccct gctgcttggg ctggcgcccc 1260 acccaacaca ctcaagtcac tgccgcccag ggctggcctc ttggtgctgg gaaagtgtag 1320 gctggtgcca gcctgtcccc cactgcttct tactccctcc ctggagccct cttgccccca 1380 caaaaagtgc ctgcctgtgc tctctccctc tcctcccacc ccactcacac tcccctccat 1990 cctctgagct ccctgcaaca cagtggaagg gtagagagcc acagtcccca aatcctatgc 1500 aataaagtgc a 1511 <210> 5 <211> 697 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte template ID No.: 2825861CT1 <900> 5 cggctcgagc ttccaaaaaa ctttcaatct tatgctgtgt caagatatca ctggctgttg 60 gtaaagtgct aatagcagaa tcagagagtg ctcatgtact tgctgattga aaatacatga 120 ttgacatgat tggaagtaca ctccagcttg ggtacagagt gagactgtct caaaaaaaca 180 acaacaaaaa agagacaaga ctatcttaca cttttttgtg gtttttccct tgcccttata 240 gcttccctac agtagtgggt aaccagtatc caactgaagg ctctgaaaag ggcacaaaga 300 cctcctagaa agagtacttc atctgatcaa cactaagaag ttcatctgtc tgctgctgct 360 gctgcggaca agctttatca ccagaaagtc atttataacg tttgcgtcaa taatgtctga 420 atggcttaaa gtaccatgca acattcattt gaatgttctg agtgatcaaa gtgaaatggc 980 tacaaccgga acacaatgaa gaggtacaga aaacaaaact caaacatccc aaggaaatgc 590 gtgttttgga acttttgggg agacagcatg gtgcagccac attccatgat tggctccagt 600 ggtgatccag gaagtgccta tgagtaagac ttgtatattt ctcatcactg ctctttcaac 660 ttctagcaca gcacctgaca tgtaataaat attcgca 697 <210> 6 <211> 210 <212> PRT
<213> Homo Sapiens <220>
<221> misc feature <223> Incyte template ID No.: 2823339CD1 <400> 6 Met Gly Arg Arg His Lys Leu Cys Ser Pro Asp Pro Gly Gln Ala Asn Asn Ser Glu Gly Ser Asp His Asp Tyr Leu Pro Leu Val Arg Leu Gln Glu Ala Pro Gly Ser Phe Arg Leu Asp Ala Pro Phe Cys Ala Ala Val Arg Ile Ser Gln Glu Arg Leu Cys Arg Ala Ser Pro Phe Ala Val His Arg Ala Ser Leu Ser Pro Thr Ser Ala Ser Leu Pro Trp Ala Leu Leu Gly Pro Gly Val Gly Gln Gly Asp Ser Ala Thr Ala Ser Cys Ser Pro Ser Pro Ser Ser Gly Ser Glu Gly Pro Gly Gln Val Asp Ser Gly Arg Gly Ser Asp Thr Glu Ala Ser Asp Gly Ala Glu Gly Leu Gly Gly Thr Asp Leu Arg Gly Arg Thr Trp Ala Thr Ala Val Ala Leu Ala Trp Leu Glu His Arg Cys Ala Ala Ala Phe Asp Glu Trp Glu Leu Thr Ala Ala Lys Ala Asp Cys Trp Leu Arg Ala Gln His Leu Pro Asp Gly Leu Asp Leu Ala Ala Leu Lys Ala Ala Ala Arg Gly Leu Phe Leu Leu Leu Arg His Trp Asp Gln Asn Leu Gln Leu His Leu Leu Cys Tyr Ser Pro Ala Asn Val
SCG10 was shown in Example IV to be strongly associated with the known genes related to synthesis and release of dopamine. These results are consistent with the notion that 621850 is associated with neurotransmitter processing. .
Table 6 Coexpression results for Incyte gene P-value Associated Occurs Both TargetAssociatedNeither gene occuronly only 0 nAchR 13 7 7 6 501 0 TH type 4 15 7 7 8 499 0 Secretogranin 33 8 6 25 482 II
Incyte gene 625839 occurred in 14 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to synthesis and release of dopamine, and norepinephrine, including DBH, nAchR, TH type 4, and secretogranin II, as shown in Table 6.
625839 also showed strong association with PARP, a human poly(ADB-ribose) polymerase which is expressed during development in Drosophila (Uchida, K. et al. (1993) 90:3481-3485). These results are consistent with the notion that 625839 is associated with neurotransmitter processing.
Table 7 Coexpression results for Incyte gene 2405140 P-value Associated geneOccursBoth Target AssociatedNeither occur only only 9.39E-06 DBH 12 3 0 9 509 1.22E-OS nAchR 13 3 0 10 508 1.94E-OSTH type 4 15 3 0 12 506 3.48E-OS AADC 18 3 0 15 503 3.48E-OS Homolog to predicted18 3 0 1 S 503 c. elegans protein F25H2.8 Incyte gene 2405140 occurred in 3 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to neurotransmitter processing, including DBH, nAchR, TH type 4, and AADC, as shown in Table 7. 2405140 also showed strong association with a homolog to a predicted C. elegans protein. F25H2.8. These results are consistent with the notion that 2405140 is associated with neurotransmitter processing.
Table 8 Coexpression results for Incyte gene 2823339 P-value Associated gene Occurs Both Target Associated Neither occur only only 2.20E-08 Secretogranin 1 45 9 3 36 473 4.51 E-08 Secretogranin ll 33 8 4 25 484 1.37E-07 hVMATI 8 5 7 3 506 1.56E-07 TH type 4 15 6 6 9 500 lncyte gene 2823339 occurred in 12 of 522 cDNA libraries studied and showed strong co-expression with several of the known genes related to neurotransmitter processing, including secretogranin I and ll, hVMATI and TH type 4, as shown in Tabie 8. These results are consistent with the notion that 2823339 is associated with neurotransmitter processing.
Table 9 Coexpression results for Incyte gene 2825861 P-value Associated gene Occurs Both Target Associated Neither occur only only 0 TH type 4 15 3 0 12 506 0 Human factor X 29 3 0 26 492 Incyte gene 2825861 occurred in 3 of 522 cDNA libraries studied and showed strong co-expression with two of the known genes related to neurotransmitter processing, including DBH and TH type 4, as shown in Table 9. 2825861 also shows association with a blood coagulation factor, human factor X. These results are consistent with the notion that 2825861 is associated with neurotransmitter processing..
VI. New Genes Associated with Neurotransmitter Processing Nucleic acids comprising nucleic acid sequences of SEQ ID NOs: I-5 of the present invention were identified from lncyte Clones 621850, 625839, 2405140, 2823339, and 2825861, respectively, and assembled as described in Example III. The nucleic acid sequences were translated and reading frames determined when possible. A full length protein sequence, SEQ ID
NO: 6, was derived from SEQ ID NO: 4 after determining its coding frame. SEQ
ID NO: I-6 were then annotated according to Example VII using BLAST and other motif search tools against databases of known molecules.
SEQ ID NO: 1 showed 99% sequence identity with a human brain-specific mltNA, KIAA0604 (g3043731 ). In addition, a polypeptide sequence translated from SEQ
ID NO:1 showed about 77.3% sequence homology with the gene that encodes endothelin-converting enzyme 2 of Bos taurus, ECE-2. As described in Example IV, ECEs are known to be involved in the pathway of neurotransmitter processing.
SEQ ID NO: 2 showed 60% sequence identity from about nucleotide 80 to about nucleotide 1818 with a human brain-specific gene, KIAA0604 (g3043731 ), and 61 % sequence identity from about necleotide 144 to about nucleotide 1760 with a human mRNA
for endothelin-converting enzyme 1, ECE-1 (gl 197803). ECE-1 is a metalloprotease known to be associated with the neurotransmitter processing.
SEQ ID N0:4 showed about 58% sequence identity from nucleotide 332 to nucleotide 1393 with the gene which encodes the human heavy neurofilament subunit, NF-H
(g35028), a homolog to intermediate filament (IF) proteins. The corresponding amino acid sequence, SEQ ID
NO: 6, is 210 amino acids in length and has one potential N-glycosylation site at N16; three potential casein kinase II phosphorylation sites at S21, S 100, and S 114; one potential glycosaminoglycan attachment site at 5110; and two potential protein kinase C
phosphorylation sites at S27 and S110.
VII. Homology Searching for Genes Associated with Neurotransmitter Processing Polynucleotide sequences, SEQ ID NOs: 1-5, and polypeptide sequences, SEQ ID
NO: 6, were queried against databases derived from sources such as GenBank and SwissProt. These databases, which contain previously identified and annotated sequences, were searched for regions of similarity using Basic Local Alignment Search Tool (BLAST; Altschul, S.F.
et al. (1990) J. Mol.
Biol. 215:403-410) and Smith-Waterman alignment (Smith, T. et al. ( 1992) Protein Engineering 5:35-51 ). BLAST searched for matches and reported only those that satisfied the probability thresholds of 10''-5 or less for nucleotide sequences and 10'8 or less for polypeptide sequences.
The polypeptide sequences were also analyzed for known motif patterns using MOTIFS, SPSCAN, BLIMPS, and Hidden Markov Mode) (HMM)-based protocols. MOTIFS
(Genetics Computer Group, Madison WI) searches polypeptide sequences for patterns that match those defined in the Prosite Dictionary of Protein Sites and Patterns (Bairoch, A.
et al. ( 1997) Nucleic Acids Res. 25:217-221 ), and displays the patterns found and their corresponding literature abstracts. SPSCAN (Genetics Computer Group, Madison, WI) searches for potential signal peptide sequences using a weighted matrix method (Nielsen, H. et al. (1997) Prot. Eng. 10:1-6).
Hits with a score of 5 or greater were considered. BLIMPS uses a weighted matrix analysis algorithm to search for sequence similarity between the polypeptide sequences and those contained in BLOCKS, a database consisting of short amino acid segments, or blocks, of 3-60 amino acids in length, compiled from the PROSITE database (Henikoff, S. and G.
J. Henikoff (1991) Nucleic Acids Res. 19:6565-6572; Bairoch et al., supra), and those in PRINTS, a protein fingerprint database based on non-redundant sequences obtained from sources such as SwissProt, GenBank, PIR, and NRL-3D (Attwood, T. K. et al. (1997) J. Chem. Inf. Comput.
Sci.
37:417-424). For the purposes of the present invention, the BLIMPS searches reported matches with a cutoff score of 1000 or greater and a cutoff probability value of 1.0 x 10'3. HMM-based protocols were based on a probabilistic approach and searched for consensus primary structures of gene families in the protein sequences (Eddy, S.R. (1996) Cur. Opin. Str.
Biol. 6:361-365;
Sonnhammer, E.L.L. et al. (1997) Proteins 28:405-420). More than 500 known protein families with cutoff scores ranging from 10 to 50 bits were selected for use in this invention.
VIII. Labeling and Use of Individual Hybridization Probes Oligonucleotides are designed using state-of the-art software such as OLIGO
4.06 (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ,uCi of [y-'zp) adenosine triphosphate (Amersham Pharmacia Biotech, Piscataway NJ), and T4 polynucleotide kinase (NEN Life Science Products, Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine resin column (Amersham Pharmacia Biotech). An aliquot containing 10'counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of human genomic DNA digested with one of the following endonucleases:
Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu II (NEN Life Science Products).
The DNA from each digest is fractionated on a 0.7 percent agarose gel and transferred to nylon membranes (NYTRAN PLUS, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5%
sodium dodecyl sulfate. After XOMAT AR film (Eastman Kodak, Rochester NY) is exposed to the blots for several hours, hybridization patterns are compared.
IX. Production of Specific Antibodies SEQ ID NO: 6 substantially purified using polyacrylamide gel electrophoresis (PAGE;
see, e.g., Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to immunize rabbits and to produce antibodies using standard protocols.
Alternatively, the amino acid sequence is analyzed using LASERGENE software (DNASTAR Madison W1) to determine regions of high immunogenicity, and a corresponding oligopeptide is synthesized and used to raise antibodies by means known to those of skill in the art. Methods for selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in the art. Typically, oligopeptides 15 residues in length are synthesized using an ABI 431A Peptide synthesizer (PE Biosystems) using Fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester to increase immunogenicity. Rabbits are immunized with the oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to plastic, blocking with 1 % BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
SEQUENCE LISTING
<110> INCYTE PHARMACEUTICALS, INC.
WALKER, Michael G.
VOLKMUTH, Wayne KLINGLER, Tod M.
<120> GENES ASSOCIATED WITH NEUROTRANSMITTER PROCESSING
<130> PB-0003 PCT
<140> To Be Assigned <141> Herewith <150> 09/149,952 <151> 1998-09-O1 <160> 6 <170> PERL Program <210> 1 <211> 3000 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte template ID No.: 621850CB1 <900> 1 gctgggagct ggcagcaacg tgggattcca gaaggggaca agacagctgt taggctcacg 60 cacgcagctg gagctggtct tagcaggtgc ctetctactg ctggctgcac tgcttctggg 120 ctgccttgtg gccctagggg tccagtacca cagagaccca tcccacagca cctgccttac 180 agaggcctgc attcgagtgg ctggaaaaat cctggagtcc ctggaccgag gggtgagccc 290 ctgtgaggac ttttaccagt tctcctgtgg gggctggatt cggaggaacc ccctgcccga 300 tgggcgttct cgctggaaca ccttcaacag cctctgggac caaaaccagg ccatactgaa 360 gcacctgctt gaaaacacca ccttcaactc cagcagtgaa gctgagcaga agacacagcg 920 cttctaccta tcttgcctac aggtggagcg cattgaggag ctgggagccc agccactgag 980 ggacctcatt gagaagattg gtggttggaa cattacgggg ccctgggacc aggacaactt 540 tatggaggtg ttgaaggcag tagcagggac ctacagggcc accccattct tcaccgtcta 600 catcagtgct gactctaaga gttccaacag caatgttatc caggtggacc agtctgggct 660 ctttctgccc tctcgggatt actacttaaa cagaactgcc aatgagaaag tgctcactgc 720 ctatctggat tacatggagg aactggggat gctgctgggt gggcggccca cctccacgag 780 ggagcagatg cagcaggtgc tggagttgga gatacagctg gccaacatca cagtgcccca 840 ggaccagcgg cgcgacgagg agaagatcta ccacaagatg agcatttcgg agctgcaggc 900 tctggcgccc tccatggact ggcttgagtc cctgtctttc ttgctgtcac cattggagtt 960 gagtgactct gagcctgtgg tggtgtatgg gatggattat ttgcagcagg tgtcagagct 1020 catcaaccgc acggaaccaa gcatcctgaa caattacctg atctggaacc tggtgcaaaa 1080 gacaacctca agcctggacc gacgctttga gtctgcacaa gagaagctgc tggagaccct 1140 ctatggcact aagaagtcct gtgtgccgag gtggcagacc tgcatctcca acacggatga 1200 cgcccttggc tttgctttgg ggtccctctt cgtgaaggcc acgtttgacc ggcaaagcaa 1260 agaaattgca gaggggatga tcagcgaaat ccggaccgca tttgaggagg ccctgggcaa 1320 gctggtttgg atggatgaga agacccgcca ggcagccaag gagaaagcag atgccatcta 1380 tgatatgatt ggtttcccag actttatcct ggagcccaaa gagctggatg atgtttatga 1440 cgggtacgaa atttctgaag attctttctt ccaaaacatg ttgaatttgt acaacttctc 1500 tgccaaggtt atggctgacc agctccgcaa gcctcccagc cgagaccagt ggagcatgac 1560 cccccagaca gtgaatgcct actaccttcc aactaagaat gagatcgtct tccccgctgg 1620 catcctgcag gcccccttct atgcccgcaa ccaccccaag gccctgaact tcggtggcat 1680 cggtgtggtc atgggccatg agttgacgca tgcctttgat gaccaagggc gcgagtatga 1790 caaagaaggg aacctgcggc cctggtggca gaatgagtcc ctggcagcct tccggaacca 1800 WO 00/12685 PC'T/US99/19615 cacggcctgc atggaggaac agtacaatca ataccaggtc aatggggaga ggctcaacgg 1860 ccgccagacg ctgggggaga acattgcgga caacgggggg ctgaaggctg cctacaatgc 1920 ttacaaagca tggctgagaa agcatgggga ggagcagcaa ctgccagccg tggggctcac 1980 caaccaccag ctcttcttcg tgggatttgc ccaggtgtgg tgctcggtcc gcacaccaga 2040 gagttctcac gaggggctgg tgaccgaccc ccacagccct gcccgcttcc gcgtgctggg 2100 cactctctcc aactcccgtg acttcctgcg gcacttcggc tgccctgtcg gctcccccat 2160 gaacccaggg cagctgtgtg aggtgtggta gacctggatc aggggagaaa tgcccagctg 2220 tcaccagacc tggggcagct ctcctgacaa agctgtttgc tcttgggttg ggaggaagca 2280 aatgcaagct gggctgggtc tagtccctcc cccccacagg tgacatgagt acagaccctc 2340 ctcaatcacc acattgtgcc tctgctttgg gggtgcccct gcctccagca gagcccccac 2400 cattcactgt gacatctttc cgtgtcaccc tgcctggaag aggtctgggt ggggaggcca 2460 gttcccatag gaaggagtct gcctcttctg tccccaggct cactcagcct ggcggccatg 2520 gggcctgccg tgcctgcccc actgtgaccc acaggcctgg gtggtgtacc tcctggactt 2580 ctccccaggc tcactcagtg cgcacttagg ggtggactca gctctgtctg gctcaccctc 2640 acgggctacc cccacctcac cctgtgctcc ttgtgccact gctcccagtg ctgctgctga 2700 ccttcactga cagctcctag tggaagccca agggcctctg aaagcctcct gctgcccact 2760 gtttccctgg gctgagaggg gaagtgcata tgtgtagcgg gtactggttc ctgtgtctta 2820 gggcacaagc ctttagcaaa tgattgattc tccctggaca aagcaggaaa gcagatagag 2880 cagggaaaag gaagaacaga gtttattttt acagaaaaga gggtgggagg gtgtggtctt 2990 ggcccttata ggaccctgtg ccaataaaca gacatgcatc cgtgaaaaaa aaaaaaaaaa 3000 <210> 2 <211> 2150 <212> DNA
<213> Homo Sapiens <220>
<221> unsure <222> 2107, 2116, 2124, 2133, 2137, 2190 <223> a or g or c or t, unknown, or other <220>
<221> misc_feature <223> Incyte template ID No.: 625839CT1 <400> 2 gcgagatcga gcgactgggc ccgcgaccca tgctagaggt catcgaggac tgcgggggct 60 gggacctggg cggcgcggag gagcgtccgg gggtcgcggc gcgatgggac ctcaaccggc 120 tgctgtacaa ggcgcagggc gtgtacagcg ccgccgcgct cttctcgctc acggtcagcc 180 tggacgacag gaactcctcg cgctacgtca tccgcattga ccaggatggg ctcaccctgc 240 cagagaggac cctgtacctc gctcaggatg aggacagtga gaagatcctg gcagcataca 300 gggtgttcat ggagcgagtg ctcagcctcc tgggtgcaga cgctgtggaa cagaaggccc 360 aagagatcct gcaagtggag cagcagctgg ccaacatcac tgtgtcagag tatgacgacc 420 tacggcgaga tgtcagctcc atgtacaaca aggtgacgct ggggcagctg cagaagatca 480 ccccccactt gcggtggaag tggctgctag accagatctt ccaggaggac ttctcagagg 540 aagaggaggt ggtgctgctg gcgacagact acatgcagca ggtgtcgcag ctcatccgct 600 ccacacccca ccgggtcctg cacaactacc tggtgtggcg cgtggtggtg gtcctgagtg 660 aacacctgtc cccgccattc cgtgaggcac tgcacgagct ggcacaggag atggagggca 720 gcgacaagcc acaggagctg gcccgggtct gcttgggcca ggcccaatcg ccactttggc 780 atggcgcttg gcgccctctt tgtacatgag cattctcagc tgccagcaaa gccaaggtgc 840 agcagctagt ggaagacatc aagtacatcc tgggccagcg cctggaggag ctggactgga 900 tggacgccga gaccagggct gctgctcggg gccaagctcc agtacatgat ggtgatggtc 960 ggctacccgg acttcctgct gaaacccgat gctgtggaca aggagtatga gtttgaggtc 1020 catgagaaga cctacttcaa gaacatcttg aacagcatcc gcttcagcat ccagctctgc 1080 agttaagaag attcggcagg aggtggacaa gtccacgtgg ctgctccccc cacaggcgct 1140 caatgcctac tatctaccca acaagaacca gatggtgttc cccgcgggca tcctgcagcc 1200 caccctgtac gaccctgact tcccacagtc tctcaactac gggggcatcg gcaccatcat 1260 tggacatgag ctgacccacg gctacgacga ctgggggggc cagtatgacc gctcagggaa 1320 cctgctgcac tggtggacgg aggcctccta cagccgcttc ctgcgaaagg ctgagtgcat 1380 cgtccgtctc tatgacaact tcactgtcta caaccagcgg gtgaacggga aacacacgct 1440 tggggagaac atcgcagata tgggcggcct caagctggcc taccacgcct atcagaagtg 1500 ggtgcgggag cacggcccag agcacccact tccccggctc aagtacacac atgaccagct 1560 cttcttcatt gcctttgccc agaactggtg catcaagcgg cggtcgcagt ccatctacct 1620 gcaggtgctg actgacaagc atgcccctga gcactacagg gtgctgggca gtgtgtccca 1680 gtttgaggag tttggccggg ctttccactg tcccaaggac tcacccatga accctgccca 1740 caagtgttcc gtgtggtgag cctggctgcc cgcctgcacg cccccactgc ccccgcacga 1800 atcacctcct gctggctacc ggggcaggca tgcacccggt gccagccccg ctctgggcac 1860 cacctgcctt ccagcccctc caggacccgg tccgcctgct gcccctcact tcaggagggg 1920 cctggagcag ggtgaggctg gactttgggg ggctgtgagg gaaatatact ggggtcccca 1980 gattctgctc taagggggcc agaccctctg ccaggctgga ttgtacgggc cccaccttcg 2040 ctgtgttctt gctgcaaagt ctggtcaata aatcactgca ctgttaaaaa aaaaaattag 2100 gggtgtntaa agtggntgga cacngggtca acntatnttn gaaccttttg 2150 <210> 3 <211> 512 <212> DNA
<213> Homo sapiens <220>
<221> misc feature <223> Incyte template ID No.: 2905140CB1 <400> 3 gagaaatgta taaaataatc gagaccggcg gaggcaggtc agagccgtct ggaatgcgcg 60 caccttcaac gaacaatgcc aacattgaag tcctcggttg gagtctgcac agttggagat 120 ctttggtgcc attttagaca tctttggatt tcatcaatca aactgactgc aattttccat 180 aaaaaccctg aatttgggtc agaaagtggg caaagtagat aaagatcatt cgagctgtct 290 tataagatga taaatagata tcctttcagg ccaacaatgc caaagtgcag ttttgtgatt 300 cccttccatg ggttctgaat gcagtgagtc gaaacgattt ctacatgttt tcccatggtt 360 taggaggtgt ctttacatac ttgtcaatag tagcctgacc tttttcccca tggagttgct 420 aagtgtgttt tgtttgttgc tttgagtact tttttcttgt tgtttgtgtg tgtgttgcac 480 aaaatacaca agaaaataaa aaaaaaaaaa as 512 <210> 4 <211> 1511 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte template ID No.: 2823339CB1 <400> 9 tccagagtgc tgaatttctg ggcagccagg gggctcttgc tctgctatgg gttgaagatt 60 cagcctgatc acttcttgtc aaagttgggg agtctgagct ggttctgggc ctagcaaggc 120 aatctactct gttaaagtcg atagagggag aagctgggga gaacactgca attttttcta 180 tgagccctct gtagagggtg ggtggtgggg gctccacaga ctgctccttg gttccacagg 240 ccatgcccgg aggtgctggc ttcgagccct tcagacaagt aaggtcagct ctgccccctc 300 ctgcttcact tgccctgtag ctgtggatgc tactactagg gaggtcctgc ctggggccct 360 gcaggtgtgc agctcagagc ccgctgagcc cccaggaacc cctcctgcct ctcacagcca 920 tctagatgca gctcctctgc ccactgttgt ctactctaaa ggacttcaga gaggctctcc 480 agcaggcgcc tgggactcgg accaaaatgg caactccaag cgtgctttgg gggaccctgc 540 cactcccacg gaaggtcctc gccgcccacc tccccgtcct ccctgtcggc tcagcatggg 600 ccgccgtcac aaactctgta gccctgaccc gggccaggcc aacaacagtg aaggcagcga 660 ccatgactac ctgcccttgg tgcggctgca ggaggcacca ggctccttcc gcctggacgc 720 gcccttctgc gccgctgtgc gcatctcgca ggagcgcctc tgccgtgcct cgccctttgc 780 cgtgcaccgc gccagcctca gccccacctc ggcctcattg ccctgggcac ttctgggccc 840 tggtgttggc cagggtgaca gtgccacggc ctcctgcagc ccgtccccca gctcgggctc 900 tgaggggcca ggccaggtgg acagtgggcg gggctcagac accgaggcct ccgatggggc 960 ggaagggctg ggcggcaccg acctgcgggg ccggacctgg gccactgccg tagcactcgc 1020 ctggctggag caccgatgcg ccgctgcctt cgacgagtgg gaactgacag cggccaaggc 1080 tgattgctgg ctgcgggccc agcacttgcc tgacggcctt gacctggccg ccctcaaggc 1140 cgcagcccga gggctcttcc tgctactgcg ccactgggac caaaacctgc agctacacct 1200 gctgtgctac agcccagcga acgtgtgaag gctgccccct gctgcttggg ctggcgcccc 1260 acccaacaca ctcaagtcac tgccgcccag ggctggcctc ttggtgctgg gaaagtgtag 1320 gctggtgcca gcctgtcccc cactgcttct tactccctcc ctggagccct cttgccccca 1380 caaaaagtgc ctgcctgtgc tctctccctc tcctcccacc ccactcacac tcccctccat 1990 cctctgagct ccctgcaaca cagtggaagg gtagagagcc acagtcccca aatcctatgc 1500 aataaagtgc a 1511 <210> 5 <211> 697 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte template ID No.: 2825861CT1 <900> 5 cggctcgagc ttccaaaaaa ctttcaatct tatgctgtgt caagatatca ctggctgttg 60 gtaaagtgct aatagcagaa tcagagagtg ctcatgtact tgctgattga aaatacatga 120 ttgacatgat tggaagtaca ctccagcttg ggtacagagt gagactgtct caaaaaaaca 180 acaacaaaaa agagacaaga ctatcttaca cttttttgtg gtttttccct tgcccttata 240 gcttccctac agtagtgggt aaccagtatc caactgaagg ctctgaaaag ggcacaaaga 300 cctcctagaa agagtacttc atctgatcaa cactaagaag ttcatctgtc tgctgctgct 360 gctgcggaca agctttatca ccagaaagtc atttataacg tttgcgtcaa taatgtctga 420 atggcttaaa gtaccatgca acattcattt gaatgttctg agtgatcaaa gtgaaatggc 980 tacaaccgga acacaatgaa gaggtacaga aaacaaaact caaacatccc aaggaaatgc 590 gtgttttgga acttttgggg agacagcatg gtgcagccac attccatgat tggctccagt 600 ggtgatccag gaagtgccta tgagtaagac ttgtatattt ctcatcactg ctctttcaac 660 ttctagcaca gcacctgaca tgtaataaat attcgca 697 <210> 6 <211> 210 <212> PRT
<213> Homo Sapiens <220>
<221> misc feature <223> Incyte template ID No.: 2823339CD1 <400> 6 Met Gly Arg Arg His Lys Leu Cys Ser Pro Asp Pro Gly Gln Ala Asn Asn Ser Glu Gly Ser Asp His Asp Tyr Leu Pro Leu Val Arg Leu Gln Glu Ala Pro Gly Ser Phe Arg Leu Asp Ala Pro Phe Cys Ala Ala Val Arg Ile Ser Gln Glu Arg Leu Cys Arg Ala Ser Pro Phe Ala Val His Arg Ala Ser Leu Ser Pro Thr Ser Ala Ser Leu Pro Trp Ala Leu Leu Gly Pro Gly Val Gly Gln Gly Asp Ser Ala Thr Ala Ser Cys Ser Pro Ser Pro Ser Ser Gly Ser Glu Gly Pro Gly Gln Val Asp Ser Gly Arg Gly Ser Asp Thr Glu Ala Ser Asp Gly Ala Glu Gly Leu Gly Gly Thr Asp Leu Arg Gly Arg Thr Trp Ala Thr Ala Val Ala Leu Ala Trp Leu Glu His Arg Cys Ala Ala Ala Phe Asp Glu Trp Glu Leu Thr Ala Ala Lys Ala Asp Cys Trp Leu Arg Ala Gln His Leu Pro Asp Gly Leu Asp Leu Ala Ala Leu Lys Ala Ala Ala Arg Gly Leu Phe Leu Leu Leu Arg His Trp Asp Gln Asn Leu Gln Leu His Leu Leu Cys Tyr Ser Pro Ala Asn Val
Claims (11)
1. A substantially purified polynucleotide comprising a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a plurality of biological samples, wherein each neurotransmitter-processing-specific gene is selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine .beta.-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-.alpha.3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMAT1), and ARIX homeodomain protein.
2. The polynucleotide of claim 1, comprising a polynucleotide sequence selected from:
(a) a polynucleotide sequence selected from the group consisting of SEQ ID
NOs: 1-5;
(b) a polynucleotide sequence which encodes the polypeptide sequence of SEQ ID
NO: 6;
(c) a polynucleotide sequence which is complementary to the polynucleotide sequence of (a) or (b);
(d) a probe which hybridizes to the polynucleotide of (a), (b), or (c).
(a) a polynucleotide sequence selected from the group consisting of SEQ ID
NOs: 1-5;
(b) a polynucleotide sequence which encodes the polypeptide sequence of SEQ ID
NO: 6;
(c) a polynucleotide sequence which is complementary to the polynucleotide sequence of (a) or (b);
(d) a probe which hybridizes to the polynucleotide of (a), (b), or (c).
3. A substantially purified polypeptide comprising the gene product of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a plurality of biological samples, wherein each neurotransmitter-processing-specific gene is selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine .beta.-hydroxylase (DBH), nicotinic acetylcholine receptor a3 subunit precursor (nAchR-.alpha.3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter I (hVMAT1), and ARIX homeodomain protein.
4. The polypeptide of claim 3, comprising a polypeptide sequence selected from:
(a) the polypeptide sequence of SEQ ID NO: 6;
(b) a polypeptide sequence comprising at least 6 sequential amino acids of the polypeptide sequence of (a).
(a) the polypeptide sequence of SEQ ID NO: 6;
(b) a polypeptide sequence comprising at least 6 sequential amino acids of the polypeptide sequence of (a).
5. An expression vector comprising the polynucleotide of claim 2.
6. A host cell comprising the expression vector of claim 5.
7. A pharmaceutical composition comprising the polynucleotide of claim 2 in conjunction with a suitable pharmaceutical carrier.
8. A pharmaceutical composition comprising the polypeptide of claim 4 in conjunction with a suitable pharmaceutical carrier.
9. An antibody which specifically binds to the polypeptide of claim 4.
10. A method for diagnosing a disease or condition associated with the altered expression of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes, wherein each neurotransmitter-processing-specific gene is selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine .beta.-hydroxylase (DBH), nicotinic acetylcholine receptor .alpha.3 subunit precursor (nAchR-.alpha.3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMAT1), and ARIX homeodomain protein, the method comprising the steps of:
(a) providing a sample comprising one of more of said coexpressed genes;
(b) hybridizing the polynucleotide of claim 2 to said coexpressed genes under conditions effective to form one or more hybridization complexes; and (c) detecting the hybridization complexes, wherein the presence of the hybridization complexes correlates with the presence of the disease or condition.
(a) providing a sample comprising one of more of said coexpressed genes;
(b) hybridizing the polynucleotide of claim 2 to said coexpressed genes under conditions effective to form one or more hybridization complexes; and (c) detecting the hybridization complexes, wherein the presence of the hybridization complexes correlates with the presence of the disease or condition.
11. A method for treating or preventing a disease associated with the altered expression of a gene that is coexpressed with one or more neurotransmitter-processing-specific genes in a subject in need, wherein each neurotransmitter-processing-specific gene is selected from the group consisting of L-tyrosine hydroxylase (TH), aromatic amino acid decarboxylase (AADC), dopamine .beta.-hydroxylase (DBH), nicotinic acetylcholine receptor .alpha.3 subunit precursor (nAchR-.alpha.3), secretogranin I and II, Rab3a, human cocaine and amphetamine regulated transcript (hCART), vesicular monoamine transporter 1 (hVMAT1), and ARIX homeodomain protein, the method comprising the step of administering to said subject in need the pharmaceutical composition of claim 7 in an amount effective for treating or preventing said disease.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14495298A | 1998-09-01 | 1998-09-01 | |
US15519498P | 1998-09-01 | 1998-09-01 | |
US09/144,952 | 1998-09-01 | ||
US60/155,194 | 1998-09-01 | ||
PCT/US1999/019615 WO2000012685A2 (en) | 1998-09-01 | 1999-08-26 | Genes associated with neurotransmitter processing |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2340617A1 true CA2340617A1 (en) | 2000-03-09 |
Family
ID=26842524
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002340617A Abandoned CA2340617A1 (en) | 1998-09-01 | 1999-08-26 | Genes associated with neurotransmitter processing |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1108013A2 (en) |
JP (1) | JP2002525039A (en) |
AU (1) | AU5587699A (en) |
CA (1) | CA2340617A1 (en) |
WO (1) | WO2000012685A2 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6482629B1 (en) * | 2001-03-29 | 2002-11-19 | Applera Corporation | Isolated human zinc metalloproteases, nucleic acids molecules encoding said enzymes, and uses thereof |
GB0616230D0 (en) * | 2006-08-16 | 2006-09-27 | Univ Cambridge Tech | Biomarkers and uses thereof |
CN118389675A (en) * | 2024-05-27 | 2024-07-26 | 南方医科大学珠江医院 | Application of aromatic amino acid decarboxylase as a marker in detection products for hepatic encephalopathy |
-
1999
- 1999-08-26 EP EP99942517A patent/EP1108013A2/en not_active Withdrawn
- 1999-08-26 AU AU55876/99A patent/AU5587699A/en not_active Abandoned
- 1999-08-26 WO PCT/US1999/019615 patent/WO2000012685A2/en not_active Application Discontinuation
- 1999-08-26 CA CA002340617A patent/CA2340617A1/en not_active Abandoned
- 1999-08-26 JP JP2000567672A patent/JP2002525039A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
JP2002525039A (en) | 2002-08-13 |
AU5587699A (en) | 2000-03-21 |
WO2000012685A2 (en) | 2000-03-09 |
WO2000012685A3 (en) | 2000-08-24 |
EP1108013A2 (en) | 2001-06-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2350415A1 (en) | Inflammation-associated genes | |
CA2364541A1 (en) | Genes associated with diseases of the kidney | |
JPH11285392A (en) | Mbgp1 polypeptide and polynucleotide | |
EP1088072A2 (en) | Prostate cancer-associated genes | |
JP2002502242A (en) | Ubiquitin binding protein | |
JP2001517943A (en) | Human pancreatitis-related protein, PAP-2 | |
JP2001514657A (en) | Growth factor receptor binding protein | |
JP2001509018A (en) | Human apoptosis-related protein encoded by DNA similar to P53-responsive mouse gene EI124 | |
WO2000021986A2 (en) | Matrix-remodeling genes | |
JP2001506850A (en) | Adipocyte-specific differentiation-related protein | |
CA2327355A1 (en) | Human receptor molecules | |
WO2000040722A2 (en) | Insulin-synthesis genes | |
JP2003210183A (en) | HUMAN IkappaB-beta | |
CA2340617A1 (en) | Genes associated with neurotransmitter processing | |
US6235477B1 (en) | Human reticulocalbin isoforms | |
JP2001513640A (en) | Novel human transmembrane 4 superfamily proteins | |
US5908831A (en) | Histone-like protein | |
JP2002527028A (en) | Human apoptosis regulatory protein | |
JP2001509015A (en) | Human chloride channel protein (HCCP) | |
JP2001521385A (en) | Human tumor-associated membrane protein | |
CA2349761A1 (en) | Corticosteroid synthesis-associated genes | |
JP2000083683A (en) | Frzb family member, frazzled | |
JP2001518794A (en) | Human GTP binding protein | |
JP2002514922A (en) | Human type 1 multiple endocrine oncoprotein | |
JP2001503629A (en) | Tumor-related KAZAL-type inhibitors having homology with PEC-60 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |