CA2331254A1 - Methods for the detection of nucleic acids - Google Patents
Methods for the detection of nucleic acids Download PDFInfo
- Publication number
- CA2331254A1 CA2331254A1 CA002331254A CA2331254A CA2331254A1 CA 2331254 A1 CA2331254 A1 CA 2331254A1 CA 002331254 A CA002331254 A CA 002331254A CA 2331254 A CA2331254 A CA 2331254A CA 2331254 A1 CA2331254 A1 CA 2331254A1
- Authority
- CA
- Canada
- Prior art keywords
- sample
- disease
- locus
- nucleic acid
- members
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 124
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 103
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 99
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 99
- 238000001514 detection method Methods 0.000 title description 35
- 201000010099 disease Diseases 0.000 claims abstract description 87
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 87
- 239000002773 nucleotide Substances 0.000 claims abstract description 87
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 83
- 230000035772 mutation Effects 0.000 claims description 33
- 206010028980 Neoplasm Diseases 0.000 claims description 29
- 239000003550 marker Substances 0.000 claims description 23
- 201000011510 cancer Diseases 0.000 claims description 21
- 206010009944 Colon cancer Diseases 0.000 claims description 8
- 208000001333 Colorectal Neoplasms Diseases 0.000 claims description 4
- 230000002596 correlated effect Effects 0.000 claims description 3
- 150000001875 compounds Chemical class 0.000 claims 4
- 231100000331 toxic Toxicity 0.000 claims 2
- 230000002588 toxic effect Effects 0.000 claims 2
- 231100000419 toxicity Toxicity 0.000 claims 2
- 230000001988 toxicity Effects 0.000 claims 2
- 208000008051 Hereditary Nonpolyposis Colorectal Neoplasms Diseases 0.000 claims 1
- 208000017095 Hereditary nonpolyposis colon cancer Diseases 0.000 claims 1
- 201000005027 Lynch syndrome Diseases 0.000 claims 1
- 102000054765 polymorphisms of proteins Human genes 0.000 abstract description 12
- 239000000523 sample Substances 0.000 description 127
- 108700028369 Alleles Proteins 0.000 description 52
- 108091034117 Oligonucleotide Proteins 0.000 description 44
- 230000000295 complement effect Effects 0.000 description 33
- 108020004414 DNA Proteins 0.000 description 32
- 210000004027 cell Anatomy 0.000 description 30
- 238000012217 deletion Methods 0.000 description 23
- 230000037430 deletion Effects 0.000 description 23
- 238000009396 hybridization Methods 0.000 description 20
- 238000000926 separation method Methods 0.000 description 19
- 230000002068 genetic effect Effects 0.000 description 18
- 108090000623 proteins and genes Proteins 0.000 description 16
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 15
- 230000005258 radioactive decay Effects 0.000 description 15
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 238000003745 diagnosis Methods 0.000 description 10
- 238000009826 distribution Methods 0.000 description 10
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 9
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 9
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 9
- 239000002751 oligonucleotide probe Substances 0.000 description 9
- 238000002955 isolation Methods 0.000 description 8
- 108091033319 polynucleotide Proteins 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 239000002157 polynucleotide Substances 0.000 description 7
- 239000013610 patient sample Substances 0.000 description 6
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- 239000011324 bead Substances 0.000 description 5
- 239000012472 biological sample Substances 0.000 description 5
- 238000005259 measurement Methods 0.000 description 5
- 201000001441 melanoma Diseases 0.000 description 5
- 108700025694 p53 Genes Proteins 0.000 description 5
- 230000035945 sensitivity Effects 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 108091092878 Microsatellite Proteins 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 238000004587 chromatography analysis Methods 0.000 description 4
- 208000029742 colonic neoplasm Diseases 0.000 description 4
- 239000005546 dideoxynucleotide Substances 0.000 description 4
- 238000001502 gel electrophoresis Methods 0.000 description 4
- 238000004949 mass spectrometry Methods 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 208000005623 Carcinogenesis Diseases 0.000 description 3
- 108091033380 Coding strand Proteins 0.000 description 3
- 208000031448 Genomic Instability Diseases 0.000 description 3
- 235000001014 amino acid Nutrition 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 230000036952 cancer formation Effects 0.000 description 3
- 231100000504 carcinogenesis Toxicity 0.000 description 3
- 210000001072 colon Anatomy 0.000 description 3
- 238000000295 emission spectrum Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 238000000265 homogenisation Methods 0.000 description 3
- 230000002285 radioactive effect Effects 0.000 description 3
- 238000007619 statistical method Methods 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 108090001008 Avidin Proteins 0.000 description 2
- 108010009392 Cyclin-Dependent Kinase Inhibitor p16 Proteins 0.000 description 2
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 101000980919 Homo sapiens Cyclin-dependent kinase 4 inhibitor B Proteins 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 108010090804 Streptavidin Proteins 0.000 description 2
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 2
- 102100033254 Tumor suppressor ARF Human genes 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 229940024606 amino acid Drugs 0.000 description 2
- 230000002391 anti-complement effect Effects 0.000 description 2
- 108010008730 anticomplement Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 210000001124 body fluid Anatomy 0.000 description 2
- 239000010839 body fluid Substances 0.000 description 2
- 101150087654 chrnd gene Proteins 0.000 description 2
- 230000000112 colonic effect Effects 0.000 description 2
- 238000004440 column chromatography Methods 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 238000000151 deposition Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 2
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 2
- 210000000981 epithelium Anatomy 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 101150063195 mts gene Proteins 0.000 description 2
- 238000011330 nucleic acid test Methods 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 239000013074 reference sample Substances 0.000 description 2
- 230000022983 regulation of cell cycle Effects 0.000 description 2
- 238000007789 sealing Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 101710095339 Apolipoprotein E Proteins 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102100024462 Cyclin-dependent kinase 4 inhibitor B Human genes 0.000 description 1
- 108010014066 DCC Receptor Proteins 0.000 description 1
- 101150013191 E gene Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 208000009292 Hemophilia A Diseases 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 201000010252 Hyperlipoproteinemia Type III Diseases 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 208000032818 Microsatellite Instability Diseases 0.000 description 1
- 208000005927 Myosarcoma Diseases 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 208000037062 Polyps Diseases 0.000 description 1
- 208000032236 Predisposition to disease Diseases 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 102000015098 Tumor Suppressor Protein p53 Human genes 0.000 description 1
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 1
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 1
- 206010060751 Type III hyperlipidaemia Diseases 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000004534 cecum Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 208000006990 cholangiocarcinoma Diseases 0.000 description 1
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- -1 dNTP Chemical compound 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001839 endoscopy Methods 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 239000011539 homogenization buffer Substances 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 208000020887 hyperlipoproteinemia type 3 Diseases 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 201000002077 muscle cancer Diseases 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 210000005170 neoplastic cell Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 235000019645 odor Nutrition 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000008855 peristalsis Effects 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 102000023888 sequence-specific DNA binding proteins Human genes 0.000 description 1
- 108091008420 sequence-specific DNA binding proteins Proteins 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 210000001685 thyroid gland Anatomy 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Methods are provided for identifying nucleic acids. Methods of the invention are useful for identifying and analyzing nucleic acids, especially variants of single nucleotide polymorphisms, that are indicative of disease or the predisposition for disease.
Description
METHODS FOR THE DETECTION OF NUCLEIC ACIDS
This application is a continuation-in-part of U.S. patent application, serial number 08/876,857 (pending), which is a continuation-in-pan of U.S. patent application, serial number 08/700,583 (now U.S. Patent No. 5,670,325), the disclosure of which is incorporated by reference herein.
FIELD OF TIC INVENTION
This invention relates to methods useful for disease diagnosis by detecting changes in nucleic acids, and by detecting the presence of one or more polymorphisms that are indicative of disease BACKGROUND OF THE INVENTION
Io The capacity to diagnose disease is of central concern to human, animal and plant genetic studies, and particularly to inherited disease diagnostics. Genetic disease diagnosis typically is pursued by analyzing variations in DNA sequences that distinguish genomic DNA
among members of a population. If such variations alter the lengths of the fragments that are generated by restriction endonuclease cleavage, the variations are referred to as restriction fragment length polymorphisms (RFLPs). Where a heritable trait is linked to a particular RFLP, the presence of the RFLP can be used to predict the likelihood that the trait will be expressed phenotypically. Statistical methods have been developed to permit the multilocus analysis of RFLPs such that complex traits that are dependent upon multiple alleles can be mapped. See S. Lander et al., 83 Pltoc. NAT'L ACRD. ScI. (U.S.A.) 7353-57 (1986); S.
2o Lander et al., 84 Pltoc. NAT'L ACRD. ScI. (U.S.A.) 2363-67 (1986); H. Donis-Keiler et al., 51 CELL 319-37 (1987); S. Lander et al., 121 GENETICS 185-99 (1989).
In some cases, DNA sequence variations are in regions of the genome that are characterized by short tandem repeats (STRs) that include tandem di- or tri-nucleotide motifs.
These tandem repeats are also referred to as variable number tandem repeat (VNTR) polymorphisms. These polymorphisms are used in a large number of genetic mapping studies.
A third class of DNA sequence variations results from single nucleotide polymorphisms (SNPs), also referred to as single base polymorphisms, that exist between individuals of the same species. Such polymorphisms are far more frequent, at least in the human genome, than RFLPs or STRs'and VNTRs. In some cases, such polymorphisms comprise mutations that are a determinative characteristic in a genetic disease. Indeed, such mutations may affect a single nucleotide present in a coding sequence sufficiently to cause the disease (e.g., hemophilia, sickle-cell anemia). An example of a single nucleotide polymorphism which predisposes a disease is the three-allelic polymorphism of the apolipoprotein E gene. This polymorphism is due to single base substitutions at two DNA loci on the Apo E
gene (Mahley, 240 Scl. 622-30 (1988)). It may explain as much as 10% of the phenotypic variation observed in serum cholesterol levels. More that 90% of patients with type III
hyperlipoproteinemia are homozygous for one of the APO E alleles.
In many cases, however, single nucleotide polyrnorphisms occur in non-coding regions.
Single nucleotide polymorphisms in non-coding regions are often still useful as markers for predisposition to disease if a proximal relationship exists between the single nucleotide polymorphic locus and a disease-related gene. A disease-related gene is any gene that, in one or more variant is associated with, or causative of, disease. Despite the central importance of polymorphisms in modern genetics, no practical method has been developed which permits enumerative analysis of disease-associated polymorphic sites.
There is particular interest in molecular mechanisms for the diagnosis of cancer.
Cancer is a disease characterized by genomic instability. The acquisition of genomic instability is thought to arise from a coincident disruption of genomic integrity and a loss of cell cycle control mechanisms. Generally, a disruption of genomic integrity is thought merely to increase the probability that a cell will engage in the multistep pathway leading to cancer.
However, coupled with a loss of cell cycle control mechanisms, a disruption in genomic integrity may be sufficient to generate a population of genomically unstable neoplastic cells. A
common genetic change characteristic of the early stages of transformation is a loss of heterozygosity. Loss of heterozygosity at a number of tumor suppressor genes has been implicated in tumorigenesis. For example, loss of heterozygosity at the P53 tumor suppressor locus has been correlated with various types of cancer. Ridanpaa et al., 191 PATH. RES.
Pttnc~r. 399-402 (1995). The loss of the apc and dcc tumor suppressor genes has also been associated with tumor development. Blum, 31A EUItOP. J. CANCEr 1369-72 (1995).
Loss of heterozygosity is therefore a potentially useful marker for detecting the early stages of cancer. However, in the early stages of cancer only a small number of cells within a tissue have undergone transformation. Genetic changes characteristic of genomic instability theoretically can serve as markers for the early stages of, for example, colon cancer, and can be detected in DNA isolated from biopsied colonic epithelium and in some cases from transformed cells shed into fecal material. Sidransky et al., 256 Sc~., 102-105 (1992).
Detection methods proposed in the art are time-consuming and expensive.
Moreover, methods according to the art cannot be used to identify a loss of heterozygosity or microsatellite instability in small subpopulation of cells when the cells exist in a heterogeneous (i.e., clonally impure) sample. For example, in U.S. Patent No. 5,527,676, it is stated that tissue samples in which a mutation is to be detected should be enriched for tumor cells in order to detect the loss of heterozygosity in a p53 gene.
The present invention provides molecular assays for the detection of nucleic acids, especially nucleic acids that are indicative of disease SUMMARY OF THE INVENTION
The present invention provides methods for identifying nucleic acids, particularly 15 single nucleotide loci, and specific single base polymorphic variants, that are diagnostic markers. Methods of the invention are useful for identifying single base loci that are indicative of disease or the predisposition for disease. Alternatively, Methods of the invention are useful for analyzing and identifying variants at known disease-associated loci, such as those available on the Genbank database and other databases.
2o In general, the invention comprises methods for enumerating (i. e., counting) the number of molecules of one or more nucleic acid variant present in a sample.
According to methods of the invention, a disease-associated variant at, for example, a single nucleotide polymorphic locus is determined by enumerating the number of a nucleic acid in a first sample and determining if there is a statistically-significant difference between that number and the 25 number of the same nucleotide in a second sample. Preferably, one sample represents the number of the nucleic acid expected to occur in a sample obtained from a healthy individual, or from a healthy population if pooled samples are used. A statistically-significant difference between the number of a nucleic acid expected to be at a single-base locus in a healthy individual and the number determined to be in a sample obtained from a patient is clinically 3o indicative.
This application is a continuation-in-part of U.S. patent application, serial number 08/876,857 (pending), which is a continuation-in-pan of U.S. patent application, serial number 08/700,583 (now U.S. Patent No. 5,670,325), the disclosure of which is incorporated by reference herein.
FIELD OF TIC INVENTION
This invention relates to methods useful for disease diagnosis by detecting changes in nucleic acids, and by detecting the presence of one or more polymorphisms that are indicative of disease BACKGROUND OF THE INVENTION
Io The capacity to diagnose disease is of central concern to human, animal and plant genetic studies, and particularly to inherited disease diagnostics. Genetic disease diagnosis typically is pursued by analyzing variations in DNA sequences that distinguish genomic DNA
among members of a population. If such variations alter the lengths of the fragments that are generated by restriction endonuclease cleavage, the variations are referred to as restriction fragment length polymorphisms (RFLPs). Where a heritable trait is linked to a particular RFLP, the presence of the RFLP can be used to predict the likelihood that the trait will be expressed phenotypically. Statistical methods have been developed to permit the multilocus analysis of RFLPs such that complex traits that are dependent upon multiple alleles can be mapped. See S. Lander et al., 83 Pltoc. NAT'L ACRD. ScI. (U.S.A.) 7353-57 (1986); S.
2o Lander et al., 84 Pltoc. NAT'L ACRD. ScI. (U.S.A.) 2363-67 (1986); H. Donis-Keiler et al., 51 CELL 319-37 (1987); S. Lander et al., 121 GENETICS 185-99 (1989).
In some cases, DNA sequence variations are in regions of the genome that are characterized by short tandem repeats (STRs) that include tandem di- or tri-nucleotide motifs.
These tandem repeats are also referred to as variable number tandem repeat (VNTR) polymorphisms. These polymorphisms are used in a large number of genetic mapping studies.
A third class of DNA sequence variations results from single nucleotide polymorphisms (SNPs), also referred to as single base polymorphisms, that exist between individuals of the same species. Such polymorphisms are far more frequent, at least in the human genome, than RFLPs or STRs'and VNTRs. In some cases, such polymorphisms comprise mutations that are a determinative characteristic in a genetic disease. Indeed, such mutations may affect a single nucleotide present in a coding sequence sufficiently to cause the disease (e.g., hemophilia, sickle-cell anemia). An example of a single nucleotide polymorphism which predisposes a disease is the three-allelic polymorphism of the apolipoprotein E gene. This polymorphism is due to single base substitutions at two DNA loci on the Apo E
gene (Mahley, 240 Scl. 622-30 (1988)). It may explain as much as 10% of the phenotypic variation observed in serum cholesterol levels. More that 90% of patients with type III
hyperlipoproteinemia are homozygous for one of the APO E alleles.
In many cases, however, single nucleotide polyrnorphisms occur in non-coding regions.
Single nucleotide polymorphisms in non-coding regions are often still useful as markers for predisposition to disease if a proximal relationship exists between the single nucleotide polymorphic locus and a disease-related gene. A disease-related gene is any gene that, in one or more variant is associated with, or causative of, disease. Despite the central importance of polymorphisms in modern genetics, no practical method has been developed which permits enumerative analysis of disease-associated polymorphic sites.
There is particular interest in molecular mechanisms for the diagnosis of cancer.
Cancer is a disease characterized by genomic instability. The acquisition of genomic instability is thought to arise from a coincident disruption of genomic integrity and a loss of cell cycle control mechanisms. Generally, a disruption of genomic integrity is thought merely to increase the probability that a cell will engage in the multistep pathway leading to cancer.
However, coupled with a loss of cell cycle control mechanisms, a disruption in genomic integrity may be sufficient to generate a population of genomically unstable neoplastic cells. A
common genetic change characteristic of the early stages of transformation is a loss of heterozygosity. Loss of heterozygosity at a number of tumor suppressor genes has been implicated in tumorigenesis. For example, loss of heterozygosity at the P53 tumor suppressor locus has been correlated with various types of cancer. Ridanpaa et al., 191 PATH. RES.
Pttnc~r. 399-402 (1995). The loss of the apc and dcc tumor suppressor genes has also been associated with tumor development. Blum, 31A EUItOP. J. CANCEr 1369-72 (1995).
Loss of heterozygosity is therefore a potentially useful marker for detecting the early stages of cancer. However, in the early stages of cancer only a small number of cells within a tissue have undergone transformation. Genetic changes characteristic of genomic instability theoretically can serve as markers for the early stages of, for example, colon cancer, and can be detected in DNA isolated from biopsied colonic epithelium and in some cases from transformed cells shed into fecal material. Sidransky et al., 256 Sc~., 102-105 (1992).
Detection methods proposed in the art are time-consuming and expensive.
Moreover, methods according to the art cannot be used to identify a loss of heterozygosity or microsatellite instability in small subpopulation of cells when the cells exist in a heterogeneous (i.e., clonally impure) sample. For example, in U.S. Patent No. 5,527,676, it is stated that tissue samples in which a mutation is to be detected should be enriched for tumor cells in order to detect the loss of heterozygosity in a p53 gene.
The present invention provides molecular assays for the detection of nucleic acids, especially nucleic acids that are indicative of disease SUMMARY OF THE INVENTION
The present invention provides methods for identifying nucleic acids, particularly 15 single nucleotide loci, and specific single base polymorphic variants, that are diagnostic markers. Methods of the invention are useful for identifying single base loci that are indicative of disease or the predisposition for disease. Alternatively, Methods of the invention are useful for analyzing and identifying variants at known disease-associated loci, such as those available on the Genbank database and other databases.
2o In general, the invention comprises methods for enumerating (i. e., counting) the number of molecules of one or more nucleic acid variant present in a sample.
According to methods of the invention, a disease-associated variant at, for example, a single nucleotide polymorphic locus is determined by enumerating the number of a nucleic acid in a first sample and determining if there is a statistically-significant difference between that number and the 25 number of the same nucleotide in a second sample. Preferably, one sample represents the number of the nucleic acid expected to occur in a sample obtained from a healthy individual, or from a healthy population if pooled samples are used. A statistically-significant difference between the number of a nucleic acid expected to be at a single-base locus in a healthy individual and the number determined to be in a sample obtained from a patient is clinically 3o indicative.
The invention further comprises methods for comparing the number of one or more specific single-base polymorphic variants contained in a sample of pooled genomic DNA
obtained from healthy members of an organism population (referred to as the reference number) and an enumerated number of one or more variants contained in a sample of pooled genomic DNA obtained from diseased members of the population (referred to as the target number) to determine whether any difference between the two numbers is statistically significant. The presence of a statistically-significant difference between the reference number and the target number is indicative that the loci (or one or more of the variants) is a diagnostic marker for the disease. An individual patient is screened for the disease by first identifying a 1o variant which is a diagnostic marker for the disease and then screening a sample of the patient's genomic DNA for the presence of the variant. In a patient having a specific variant which is indicative of the presence of a disease-related gene, the severity of the disease can be assessed by determining the number of molecules of the variant present in a standardized DNA
sample and applying a statistical relationship to the number. The statistical relationship is determined by correlating the number of a disease-associated polymorphic variant with the number of the variant expected to occur at a given severity level (using, for example, statistical methods described herein).
In a preferred embodiment, enumerative analysis of pooled genomic DNA samples is used to determine the presence or likelihood of disease. Pooled genomic DNA
from healthy 2o members of a population and pooled genomic DNA from diseased members of a population are obtained. The number of each variant at a single-nucleotide polymorphic site is determined in each sample. The numbers are analyzed to determine if there is a statistically-significant difference between the variants) present in the sample obtained from the healthy population and those present in the sample obtained from the diseased population. A
statistically-significant difference indicates that the polymorphic locus is a marker for disease.
Also in a preferred embodiment, methods of the invention are used to identify a nucleic acid (e.g., a polymorphic variant) associated with a disease. Such methods comprise counting the number of a nucleic acid, preferably a single base, in members of a diseased population, and counting numbers of the same nucleic acid in members of a healthy population. A
3o statistically-significant difference in the numbers of the nucleic acid between the two populations is indicative that the interrogated locus is associated with a disease.
obtained from healthy members of an organism population (referred to as the reference number) and an enumerated number of one or more variants contained in a sample of pooled genomic DNA obtained from diseased members of the population (referred to as the target number) to determine whether any difference between the two numbers is statistically significant. The presence of a statistically-significant difference between the reference number and the target number is indicative that the loci (or one or more of the variants) is a diagnostic marker for the disease. An individual patient is screened for the disease by first identifying a 1o variant which is a diagnostic marker for the disease and then screening a sample of the patient's genomic DNA for the presence of the variant. In a patient having a specific variant which is indicative of the presence of a disease-related gene, the severity of the disease can be assessed by determining the number of molecules of the variant present in a standardized DNA
sample and applying a statistical relationship to the number. The statistical relationship is determined by correlating the number of a disease-associated polymorphic variant with the number of the variant expected to occur at a given severity level (using, for example, statistical methods described herein).
In a preferred embodiment, enumerative analysis of pooled genomic DNA samples is used to determine the presence or likelihood of disease. Pooled genomic DNA
from healthy 2o members of a population and pooled genomic DNA from diseased members of a population are obtained. The number of each variant at a single-nucleotide polymorphic site is determined in each sample. The numbers are analyzed to determine if there is a statistically-significant difference between the variants) present in the sample obtained from the healthy population and those present in the sample obtained from the diseased population. A
statistically-significant difference indicates that the polymorphic locus is a marker for disease.
Also in a preferred embodiment, methods of the invention are used to identify a nucleic acid (e.g., a polymorphic variant) associated with a disease. Such methods comprise counting the number of a nucleic acid, preferably a single base, in members of a diseased population, and counting numbers of the same nucleic acid in members of a healthy population. A
3o statistically-significant difference in the numbers of the nucleic acid between the two populations is indicative that the interrogated locus is associated with a disease.
Once the polymorphic locus is identifited, either by methods of the invention or by consulting an appropriate database, methods of the invention are useful to determine which variant at the polymorphic locus is associated with a disease. In this case, enumerative methods are used to determine whether there is a statistically-significant difference between the number of a first variant in members of a diseased population, and the number of a second variant at the same locus in members of a healthy population. A statistically-significant difference is indicative that the variant in members of the diseased population is useful as a marker for disease. Using this information, patients are screened for the presence of the variant that is thought to be associated with disease, the presence such a variant being indicative of the presence of disease, or a predisposition for a disease.
Methods of the invention are especially useful for the detection of the presence of, or the predisposition for, colorectal cancer in humans. In a preferred embodiment, methods comprise enumerating a number of a polymorphic variant in a patient, and comparing that number to the number of the variant that would be present in a sample obtained from a healthy member of the population. A statistically-significant difference being indicative of the presence of , or a predisposition for, disease in the patient being tested.
Methods of the invention also take advantage of several important insights which permit, for example, reliable detection of a DNA deletion at a known genomic site characteristic of a known cancer cell type. Methods of the invention are useful for the 2o detection and diagnosis of a genetic abnormality, such as a loss of heterozygosity or, more generally, a mutation, which can be correlated with a disease, such as cancer.
In a preferred embodiment, the invention comprises methods for enumerating, in a sample, the number of a nucleic acid indicative of a disease. The invention further comprises comparing the number of molecules with a reference number to determine whether any difference between the two numbers is statistically significant, a statistically significant difference being indicative of a genomic disruption (i.e., loss of heterozygosity or another type of mutation, such as a deletion, addition, substitution or rearrangement).
In a preferred embodiment, enumerative detection of a nucleic acid mutation is accomplished by exposing a nucleic acid sample to first and second radionucleotides. The 3o radionucleotides may be single nucleotides or oligonucleotide probes. The first radionucleotide is capable of hybridizing to a genetic region suspected to be mutated in cancer WO 99/66077 PCT'/US99/13630 or precancer cells. The second radionucleotide is capable of hybridizing to a region known not to be mutated in cancer or precancer cells. After washing to remove unhybridized radionucleotides, the number of each of first and second radionucleotides is counted. A
statistically-significant difference between the number of first and second radionucleotides is indicative of a mutation in a subpopulation of nucleic acids in the sample.
In preferred methods of the invention, first and second radionucleotides are isolated from other sample components by, for example, gel electrophoresis, chromatography, and mass spectrometry. Also in a preferred embodiment, either or both of the first and second radionucleotides is a chain terminator nucleotide, such as a dideoxy nucleotide. A preferred 1o radionucleotide for use in methods of the invention is selected from the group consisting of 32P~ 33P~ 3sS, sH, izsl~ and'4C. The number of first and second radionucleotides may be determined by counting. Methods of the invention are especially useful for the detection of massive nucleotide deletions, such as those that occur in loss of heterozygosity.
In a preferred embodiment the first and second radiolabeled oligonucleotides are separable from each other. For example, the first and second oligonucleotides are of different sizes and can be separated by gel electrophoresis, chromatography or mass spectrometry. In one embodiment the first and second oligonucleotides are of different lengths.
In a preferred embodiment the size difference is imparted by a size marker which is specifically attached to one of the two oligonucleotides. Alternatively a different size marker is attached to each oligonucleotide. After separation, the number of radioactive decay events is measured for each oligonucleotide, and the number of molecules is calculated as described herein.
In a more preferred embodiment, the first and second oligonucleotides are of the same size but are labeled with different radioisotopes selected from, for example, ass, 32p, 33P~ 3H~
1~I and 14C. The first and second oligonucleotides are then distinguished by different characteristic emission spectra. The number of radioactive decay events is measured for each oligonucleotide without separating the two oligonucleotides from each other.
The preferred methods and examples that will now be described are illustrative only and are not intended to be limiting. Other features and advantages of the invention will be apparent from the following detailed description and claims.
3o DESCRIPTION OF THE DRAWINGS
Figure 1 depicts differential primer extension as exemplified below.
_ '7 _ Figures 2A and 2B are model Gaussian distributions showing regions of low statistical probability .
Figure 3 is graph showing the probable values of N for a heterogeneous population of cells in which 1 % of the cells are mutated.
DETAILED DESCRIPTION OF THE INVENTION
The present invention comprises methods for detecting nucleic acids. In preferred embodiments, the invention is directed to the identification, detection, and analysis of informative polymorphisms or polymorphic variants, especially single-nucleotide polymorphisms and variants. According to methods of the invention, enumerative analysis is i0 used to determine whether one or more nucleic acids in a patient sample is a variant that is associated with disease or with the predisposition for disease.
Methods of the invention are especially useful for the detection and diagnosis of a predisposition for a genetic abnormality, such as a loss of heterozygosity, or more generally, a mutation, such as a point mutation, which is indicative of disease. For example, enumerated amounts of a single nucletide variant known to be associated with, for example, cancer, are compared to the amount of the variant known or expected to be present in a separate, non-cancerous sample. A statistically-significant difference between the two numbers is indicative that the variant known to be associated with, for example, cancer is present in the sample, thereby allowing diagnosis of the disease or a predisposition therefor.
Accordingly, diagnosis and detection is accomplished by comparing the number of a nucleic acid in a patient sample (e.g., patient tissue or body fluid) with the number of the same nucleic acid that is detected, or would be expected io occur, in a sample from a healthy patient, or pool of healthy patients. A
statistically-significant difference between the number of a nucleic acid in a patient sample, and the number expected to be in a healthy patient sample, indicates that the patient sample may contain a nucleic acid variant that is indicative of disease, or a predisposition therefor. A
statistically-significant difference can be diagnostic of disease (e.g., when a variant nucleic acid is known to be causative of the disease), diagnostic of a predisposition for a disease (e.g., when a variant nucleic acid is known to be predisposing but not causative), or can indicate the need for further, more invasive diagnostic measures to detect the presence of disease or a predisposing state.
Methods of the invention are especially useful for the detection of the presence of, or the predisposition for, colorectal cancer in humans. In a preferred embodiment, methods comprise enumerating a number of a polymorphic variant in a patient, and comparing that number to the number of the variant that would be present in a sample obtained from a healthy member of the population. A statistically-significant difference being indicative of the presence of , or a predisposition for, disease in the patient being tested.
Methods of the invention also take advantage of several important insights which permit, for example, reliable detection of a DNA deletion at a known genomic site characteristic of a known cancer cell type. Methods of the invention are useful for the 2o detection and diagnosis of a genetic abnormality, such as a loss of heterozygosity or, more generally, a mutation, which can be correlated with a disease, such as cancer.
In a preferred embodiment, the invention comprises methods for enumerating, in a sample, the number of a nucleic acid indicative of a disease. The invention further comprises comparing the number of molecules with a reference number to determine whether any difference between the two numbers is statistically significant, a statistically significant difference being indicative of a genomic disruption (i.e., loss of heterozygosity or another type of mutation, such as a deletion, addition, substitution or rearrangement).
In a preferred embodiment, enumerative detection of a nucleic acid mutation is accomplished by exposing a nucleic acid sample to first and second radionucleotides. The 3o radionucleotides may be single nucleotides or oligonucleotide probes. The first radionucleotide is capable of hybridizing to a genetic region suspected to be mutated in cancer WO 99/66077 PCT'/US99/13630 or precancer cells. The second radionucleotide is capable of hybridizing to a region known not to be mutated in cancer or precancer cells. After washing to remove unhybridized radionucleotides, the number of each of first and second radionucleotides is counted. A
statistically-significant difference between the number of first and second radionucleotides is indicative of a mutation in a subpopulation of nucleic acids in the sample.
In preferred methods of the invention, first and second radionucleotides are isolated from other sample components by, for example, gel electrophoresis, chromatography, and mass spectrometry. Also in a preferred embodiment, either or both of the first and second radionucleotides is a chain terminator nucleotide, such as a dideoxy nucleotide. A preferred 1o radionucleotide for use in methods of the invention is selected from the group consisting of 32P~ 33P~ 3sS, sH, izsl~ and'4C. The number of first and second radionucleotides may be determined by counting. Methods of the invention are especially useful for the detection of massive nucleotide deletions, such as those that occur in loss of heterozygosity.
In a preferred embodiment the first and second radiolabeled oligonucleotides are separable from each other. For example, the first and second oligonucleotides are of different sizes and can be separated by gel electrophoresis, chromatography or mass spectrometry. In one embodiment the first and second oligonucleotides are of different lengths.
In a preferred embodiment the size difference is imparted by a size marker which is specifically attached to one of the two oligonucleotides. Alternatively a different size marker is attached to each oligonucleotide. After separation, the number of radioactive decay events is measured for each oligonucleotide, and the number of molecules is calculated as described herein.
In a more preferred embodiment, the first and second oligonucleotides are of the same size but are labeled with different radioisotopes selected from, for example, ass, 32p, 33P~ 3H~
1~I and 14C. The first and second oligonucleotides are then distinguished by different characteristic emission spectra. The number of radioactive decay events is measured for each oligonucleotide without separating the two oligonucleotides from each other.
The preferred methods and examples that will now be described are illustrative only and are not intended to be limiting. Other features and advantages of the invention will be apparent from the following detailed description and claims.
3o DESCRIPTION OF THE DRAWINGS
Figure 1 depicts differential primer extension as exemplified below.
_ '7 _ Figures 2A and 2B are model Gaussian distributions showing regions of low statistical probability .
Figure 3 is graph showing the probable values of N for a heterogeneous population of cells in which 1 % of the cells are mutated.
DETAILED DESCRIPTION OF THE INVENTION
The present invention comprises methods for detecting nucleic acids. In preferred embodiments, the invention is directed to the identification, detection, and analysis of informative polymorphisms or polymorphic variants, especially single-nucleotide polymorphisms and variants. According to methods of the invention, enumerative analysis is i0 used to determine whether one or more nucleic acids in a patient sample is a variant that is associated with disease or with the predisposition for disease.
Methods of the invention are especially useful for the detection and diagnosis of a predisposition for a genetic abnormality, such as a loss of heterozygosity, or more generally, a mutation, such as a point mutation, which is indicative of disease. For example, enumerated amounts of a single nucletide variant known to be associated with, for example, cancer, are compared to the amount of the variant known or expected to be present in a separate, non-cancerous sample. A statistically-significant difference between the two numbers is indicative that the variant known to be associated with, for example, cancer is present in the sample, thereby allowing diagnosis of the disease or a predisposition therefor.
Accordingly, diagnosis and detection is accomplished by comparing the number of a nucleic acid in a patient sample (e.g., patient tissue or body fluid) with the number of the same nucleic acid that is detected, or would be expected io occur, in a sample from a healthy patient, or pool of healthy patients. A
statistically-significant difference between the number of a nucleic acid in a patient sample, and the number expected to be in a healthy patient sample, indicates that the patient sample may contain a nucleic acid variant that is indicative of disease, or a predisposition therefor. A
statistically-significant difference can be diagnostic of disease (e.g., when a variant nucleic acid is known to be causative of the disease), diagnostic of a predisposition for a disease (e.g., when a variant nucleic acid is known to be predisposing but not causative), or can indicate the need for further, more invasive diagnostic measures to detect the presence of disease or a predisposing state.
_$_ For purposes of exemplification, the following provides details of the use of methods according to the present invention for determining predisposition to certain cancers using variants related to the Multiple Tumor Suppressor gene. Inventive methods are also useful in the diagnosis and analysis of a mutation (and especially a large deletion typical of loss of s heterozygosity) in such a tumor suppressor gene. While the following example uses radiolabeled nucleotides and an imager that detects the radioactive decay events, other methods of enumerating may be used, such as hybridization beads used in conjunction with a multi-orfice impedance counter. While exemplified in the following manner, the invention is not so limited and the skilled artisan will appreciate its wide range of applicability upon consideration to thereof Example 1 Human Multiple Tumor Suppressor Gene The Multiple Tumor Suppressor (MTS) gene is involved in the progression of multiple tumor types, such as melanoma, leukemia, astrocytoma, glioblastoma, lymphoma, gliorna, 15 sarcoma, myosarcoma, cholangiocarcinoma, and cancers of the pancreas, breast, brain, prostate, bladder, thyroid, ovary, uterus, testis, kidney, stomach, colon and rectum. Analysis of the MTS gene is useful in predicting predisposition to cancer and the clinical severity and prognosis of patients with MTS-related cancers.
The MTS locus was identified in linkage studies. See Skolnick et al., International 2o Publication No. WO 95/25$13. The MTS locus encompasses the MTSI and MTS2 gene sequences. Mutations in the MTS locus in the germline are indicative of predisposition to melanoma and other cancers. The mutational events of the MTS locus can involve deletions, insertions and point mutations within the coding sequence and the non-coding sequence.
A locus in the MTS gene was identified by Skolnick, et al. as predisposing for 25 melamona. They tested MTS 1 and MTS2 genomic DNA from individuals presumed to carry MTS alleles predisposing to melanoma and from individuals presumed not to carry MTS
alleles predisposing to melanoma . A single nucleotide polymorphic locus was identified in exon 2 in the MTS1 sequence. The polymorphism results in an amino acid substitution, and was found to segregate with the MTS predisposing allele. The substitutions resulted in either 3o the substitution of a large hydrophobic residue for a small hydrophilic residue, or the substitution of a charged amino acid for a neutral amino acid {specifically, either a substitution WO 99/bb077 PCT/US99/13630 of a glycine with a tryptophan, or a valine with a asparagine). This single-nucleotide polymorphic locus is useful as a marker in the methods of the invention.
Using methods of the invention, predisposition to cancers, such as melanoma and the other cancers related to MTS, is ascertained by testing any tissue or body fluid for the presence of disease-associated variants at the MTS locus. The variants to be screened may be alleles on or near the MTS locus, including Exon 2 of the MTS 1 sequence. A
sample comprising pooled genomic DNA from healthy members of a population presumed not to have the MTS predisposing allele (referred to as the reference sample), and a sample comprising pooled genomic DNA from diseased members of a population presumed to carry the MTS
1o predisposing allele (referred to a the target sample) are prepared. Nucleic acids are sheared or cut into small fragments by, for example, restriction digestion. The size of nucleic acid fragments produced is not critical, subject to the limitations described below. Single-stranded nucleic acid fragments may be prepared using well-known methods. See, e.g., SAMBROOK ET
AL., MOLECULAR CLONING, A LABORATORY MANUAL (1989) incorporated by reference herein.
Either portions of a coding strand or its complement may be detected in methods according to the invention. In a preferred embodiment, both first and second strands of an allele are present in a sample during hybridization to an oligonucleotide probe. The sample is exposed to an excess of probe that is complementary to a portion of the first strand, under conditions that promote specific hybridization of the probe to the portion of the first strand. In a most preferred embodiment, the probe is in sufficient excess to bind all the portion of the first strand, and to prevent reannealing of the first strand to the second strand of the allele.
Also in a preferred embodiment, the second strand of an allele is removed from a sample prior to hybridization to an oligonucleotide probe that is complementary to a portion of the first strand of the allele. Complement to exons are removed by hybridization to anti-complement oligonucleotide probes (isolation probes) and subsequent removal of duplex formed thereby.
Methods for removal of complement strands from a mixture of single-stranded oligonucleotides are known in the art and include techniques such as affinity chromatography.
Upon converting double-stranded DNA to single-stranded DNA, sample is passed through an 3o affinity column comprising bound isolation probe that is complementary to the sequence to be isolated away from the sample. Conventional column chromatography is appropriate for WO 99/66077 PC'f/US99/13630 isolation of complement. An affinity column packed with sepharose or any other appropriate materials with attached complementary nucleotides may be used to isolate complement DNA in the column, while allowing DNA to be analyzed to pass through the column. See SAMBROOK, supra. As an alternative, isolation beads may be used to exclude complement.
After removal of complement, DNA samples are exposed to radiolabeled nucleotides under conditions which promote specific hybridization. Probes are preferably designed to hybridize specifically (i.e., without mismatches) to a portion of target genomic DNA that contains the polymorphic variant. In a particularly preferred embodiment, four different types of probes are used, each having a different radiolabeled nucleotide in a position to hybridize 1o with the variant nucleotide. The nucleotides in position to hybridize with the variant nucleotide are selected from dATP, dNTP, dCTP, and dGTP, and each is differentially labeled (i.e., with a different isotope or with isotopes of detectably distinct energy levels). Probes are hybridized under conditions that require an exact match of nucleotides in the probe to nucleotides on the target. Upon washing, the only probes that remain bound are those having 15 a labeled nucleotide that is an extact match for the nucleotide at the variant position. If more than one variant is present in a sample, each variant is detected because the nucleotides that have specifically bound to the variant are differentially labeled. The number of molecules of each particular variant is counted by measuring the number of radioactive decay events (e.g., by measuring the total number of counts during a defined interval or by measuring the time it 2o takes to obtain a predetermined number of counts) specifically associated with the particular variant. That number is used to calculate the number of radionucleotides which specifically hybridize with a particular variant in the target sample. The number of each variant present in a healthy sample (preferably pooled healthy samples) is determined in the same manner.
In another preferred embodiment, a single base extension reaction is used in which a 25 sequence-specific probe is hybridized immediately adjacent and usptream to the variant nucleotide to be detected. Each of four differentially-labeled dideoxy nucleotides is then added along with a polymerase under conditions that allow extension of the probe by one base. The number of each dideoxy nucleotides that hybridize at the variant nucleotide position is then determined as described above. Those numbers are compared to numbers obtained from 3o members of a healthy population to determine if there is a statistically-significant difference, the presence of such a difference being indicative of disease or the propensity therefor.
In a preferred embodiment, radioactive decays are used to count the number of a targeted nucleic acid. Preferred isotopes for use in the invention are selected from 355, szP, 33P~ i2sl~ 3H, and y4C. In a preferred embodiment, radionucleotides labeled with different isotopes are detected without separating the radionucleotide associated with a first variant from a radionucleotide associated with a second variant. Isotopes useful in the invention have different characteristic emission spectra. The presence of a first isotope does not prevent the measurement of radioactive decay events of a second isotope. In a more preferred embodiment, two different labeled nucleotides of the same molecular weight are used. The two differentially labeled oligonucleotides are electrophoresed on a gel, preferably a 1o denaturing gel, and the gel is exposed to an imager that detects the radioactive decay events of both isotopes. In this embodiment the two isotopes are detected at the same position on the imager, because both oligonucleotides migrate to the same position on the gel.
Detection at the same position on the imager reduces variation due to different detection efficiencies at different positions on the imager.
Also in a preferred embodiment, the radionucleotide associated with the particular variant is separated from the radionucleotide associated with another particular variant prior to measuring radioactive decay events. In a preferred embodiment, the separated radionucleotides are labeled with the same isotope. Preferred separation methods comprise conferring different molecular weights to the radionucleotides specifically associated with the 2o particular variant in the target and reference samples.
In a preferred embodiment, first probes comprise a "separation moiety." Such separation moiety is, for example, hapten, biotin, or digoxigenin. The separation moiety in first probes does not interfere with the first probe's ability to hybridize with template or be extended. In an alternative embodiment, the labeled ddNTPs comprise a separation moiety.
In yet another alternative embodiment, both the first probes and the labeled ddNTPs comprise a separation moiety. Following the extension reaction, a high molecular weight molecule having affinity for the separation moiety (e.g., avidin, streptavidin, or anti-digoxigenin) is added to the reaction mixture under conditions which permit the high molecular weight molecule to bind to the separation moiety. The reaction components are then separated on the 3o basis of molecular weight using techniques known in the art such as gel electrophoresis, chromatography, or mass spectroscopy. See AUSUBEL ET AL., SHORT PROTOCOLS IN
MOLECULAR BIOLOGY (3rd ed., dohn Wiley & Sons, Inc., 1995); WU, RECOMBINANT
DNA
METHODOLOGY II (Academic Press, 1995).
Also in a preferred embodiment, the radionucleotide associated with a first variant is separated from the radionucleotide associated with a second variant by differential primer extension, wherein the extension products of a given oligonucleotide primer are of a different length for each of the two variants. In differential primer extension (exemplified in Figure 1) an oligonucleotide is hybridized such that the 3' nucleotide of the oligonucleotide base pairs with the nucleotide that is immediately 5' of the polymorphic site. The extension reaction is performed in the presence of a radiolabeled terminator nucleotide complementary to the 1o nucleotide at the polymorphic site of the first variant. The reaction may also comprise non-labeled nucleotides complementary to the other 3 nucleotides. Extension of a primer hybridized to a first allele results in a product having only the terminator nucleotide incorporated (exemplified in Figure lA, T* is the labeled terminator nucleotide). Extension of a primer hybridized to the second variant results in a product that incorporates several non-IS labeled nucleotides immediately 5' to the terminator nucleotide (exemplified in Figure 1B).
The number of non-labeled nucleotides that are incorporated is determined by the position, on the template nucleic acid, of the closest 5' nucleotide complementary to the terminator nucleotide. In an alternative embodiment, differential primer extension comprises a labeled oligonucleotide and a non-labeled terminator nucleotide.
2o Labeled probes are exposed to sample under hybridization conditions. Such conditions are well-known in the art. See, e.g., Wallace et al., 6 NUCLEIC ACIDS RES.
3543-57 (1979), incorporated by reference herein. First and second oligonucleotide probes that are distinctly labeled (i.e. with different radioactive isotopes, fluorescent means, or with beads of different size) are applied to a single aliquot of sample. After exposure of the probes to sample under 25 hybridization conditions, sample is washed to remove any unhybridized probe. Thereafter, hybridized probes are detected separately for each variant. Standards may be used to establish background and to equilibrate results. Also, if differential fluorescent labels are used, the number of probes may be determined by counting differential fluorescent events in a sample that has been diluted sufficiently to enable detection of single fluorescent events in the sample.
3o Duplicate samples may be analyzed in order to confirm the accuracy of results obtained.
If there is'a difference between the amount of a particular variant determined in the target sample and the reference sample greater than a 0.5 % difference with at least 550,000 events (see below), it is assumed that the particular variant is indicative of a diagnostic disease marker. Statistical significance may be determined by any known method. A
preferred method is outlined below.
Enumerative sampling of a nucleotide sequence that is uniformly distributed in a biological sample typically follows a Poisson distribution. For large populations, such as the typical number of genomic polynucleotide segments in a biological sample, the Poisson distribution is similar to a normal (Gaussian) curve with a mean, N, and a standard deviation to that may be approximated as the square root of N.
Statistically-significance between numbers of target and reference genes obtained from a biological sample may be determined by any appropriate method. See, e.g:, STEEL ET AL., PRINCIPLES & PROC. STATS., A BIOMETRICAL APPROACH (McGraw-HiII, 1980), the disclosure of which is incorporated by reference herein. An exemplary method is to determine, based upon a desired level of specificity (tolerance of false positives) and sensitivity (tolerance of false negatives) and within a selected level of confidence, the difference between numbers of target and reference genes that must be obtained in order to reach a chosen level of statistical significance. A threshold issue in such a determination is the minimum number, N, of genes (for each of target and reference) that must be available in a population in order to allow a 2o determination of statistical significance. The number N will depend upon the assumption of a minimum number of mutant alleles in a sample containing mutant alleles (assumed herein to be at least 1 % ) and the further assumption that normal samples contain no mutant alleles. It is also assumed that a threshold differences between the numbers of reference and target genes must be at least 0.5 % for a diagnosis that there is a mutation present in a subpopulation of cells in the sample. Based upon the foregoing assumptions, it is possible to determine how large N must be so that a detected difference between numbers of mutant and reference alleles of less than 0.5% is truly a negative (i.e. no mutant subpopulation in the sample) result 99.9%
of the time.
The calculation of N for specificity, then, is based upon the probability of one sample 3o measurement being in the portion of the Gaussian distribution covering the lowest 3.16 % of the population (the area marked " A" in figure 2A) and the probability that the other sample measurement is in the portion of the Gaussian distribution covering the highest 3.16 % of the population (the area marked "B" in figure 2B). Since the two sample measurements are independent events, the probability of both events occurring simultaneously in a single sample is approximately 0.001 or 0.1 % . Thus, 93.68 % of the Gaussian distribution (100% -2x3.16%) lies between the areas marked A and B in figure 3. Statistical tables indicate that such area is equivalent to 3 .72 standard deviations . Accordingly, 0.5 % N is set equal to 3 .72 sigma. Since sigma (the standard deviation) is equal to ,rN , the equation may be solved for N as 553,536. This means that if the lower of the two numbers representing reference and target is at least 553,536 and if the patient is truly normal, the difference between the numbers 1o will be less than 0.5% about 99.9% of the time.
To determine the minimum N required for 99 % sensitivity a similar analysis is performed. This time, one-tailed Gaussian distribution tables show that 1.28 standard deviations (sigma) from the mean cover 90% of the Gaussian distribution.
Moreover, there is a 10 % (the square root of 1 % ) probability of one of the numbers (reference or target) being in either the area marked "A" in Figure 3 or in the area marked "B" in Figure 3.
If the two population means are a total of 1 % different and if there must be a 0.5 %
difference between the number of target and reference genes, then the distance from either mean to the threshold for statistical significance is equivalent to 0.25 % N (See Figure 3) for 99 %
sensitivity. As shown in Figure 3, 0.25%N corresponds to about 40% of one side of the Gaussian 2o distribution. Statistical tables reveal that 40% of the Gaussian distribution corresponds to 1.28 standard deviations from the mean. Therefore, 1.28 sigma is equal to 0.0025N, and N equals 262,144. Thus, for abnormal samples, the difference will exceed 0.5 % at least 99 % of the time if the lower of the two numbers is at least 262,144. Conversely, an erroneous negative diagnosis will be made only 1 % of the time under these conditions.
In order to have both 99.9 % specificity (avoidance of false positives) and 99 sensitivity (avoidance of false negatives), a sample with DNA derived from at least 553,536 (or roughly greater than 550,000) cells should be counted. A difference of at least 0.5 %
between the numbers obtained is significant at a confidence level of 99.0% for sensitivity and a difference of less than 0.5 % between the numbers is significant at a confidence level of 99.9% for specificity. As noted above, other standard statistical tests may be used in order to determine statistical significance and the foregoing represents one such test.
Using the above-described methods, a particular variant is identified in Exon 2 of the MTS1 sequence which is indicative of the presence of the MTS predisposing allele. The variant is determined by identifying a statistically-significant difference between a reference number of a particular variant present in a patient sample and a number of the variant present in a separate sample known to be normal (preferably this is the result of pooled samples from normal individuals). An individual patient can be assessed for a predisposition for various cancers by determining the presence or absence of the particular variant in the patient's genomic DNA. The severity of the disease is then assessed by determining a number of molecules of the variant in a standardized sample of the patient's genomic DNA, and applying 1o a predetermined statistical relationship to the number correlating the number with the severity of the disease.
DETECTION OF THE LOSS OF HETEROZYGOSITY
Methods according to the present invention are useful for the detection of loss of ~5 heterozygosity in a heterogeneous cellular sample in which the loss of heterozygosity occurs in only a small subpopulation of cells in the sample. Using traditional detection methods, such a subpopulation would be difficult, if not impossible, to detect especially if the deletion end points are unknown at the time of detection or a clonally-impure cellular population is used.
See, e.g., U.S. Patent No. 5,527,676 (reporting that a clonal population of cells should be 2o used in order to detect a deletion in a p53 gene). Traditional methods for detection of mutations involved in carcinogenesis rely upon the use of a clonally-pure population of cells and such methods are best at detecting mutations that occur at known "hot spots" in oncogenes, such as k-ras. See, Sidransky, supra.
Methods of the present invention are useful for detecting loss of heterozygosity in a 25 small number of cells in an impure cellular population because such methods do not rely upon knowing the precise deletion end-points and such methods are not affected by the presence in the sample of heterogeneous DNA. For example, in loss of heterozygosity, deletions occur over large portions of the genome and entire chromosome arms may be missing.
Methods of the invention comprise counting a number of molecules of a target nucleic acid suspected of 3o being deleted and comparing it to a reference number. In a preferred embodiment the reference number is the number of molecules of a nucleic acid suspected of not being deleted in the same sample. All that one needs to know is at least a portion of the sequence of a target nucleic acid suspected of being deleted and at least a portion of the sequence of a reference nucleic acid suspected of not being deleted. Methods of the invention, while amenable to multiple mutation detection, do not require multiple mutation detection in order to detect indicia of cancer in a heterogeneous sample.
Accordingly, methods of the present invention are useful for the detection of loss of heterozygosity in a subpopulation of cells or debris therefrom in a sample.
Loss of heterozygosity generally occurs as a deletion of at least one wild-type allelic sequence in a subpopulation of cells. In the case of a tumor suppressor gene, the deletion typically takes the 1o form of a massive deletion characteristic of loss of heterozygosity. Often, as in the case of certain forms of cancer, disease-causing deletions initially occur in a single cell which then produces a small subpopulation of mutant cells. By the time clinical manifestations of the mutation are detected, the disease may have progressed to an incurable stage.
Methods of the invention allow detection of a deletion when it exists as only a small percentage of the total is cells or cellular debris in a sample.
Methods of the invention comprise a comparison of the number of molecules of two nucleic acids that are expected to be present in the sample in equal numbers in normal (non-mutated) cells. In a preferred embodiment, the comparison is between (1) an amount of a genomic polynucleotide segment that is known or suspected not to be mutated in cells of the 20 sample (the "reference" ) and (2) an amount of a wild-type (non-mutated) genomic polynucleotide segment suspected of being mutated in a subpopulation of cells in the sample (the "target"). A statistically-significant difference between the amounts of the two genomic polynucleotide segments indicates that a mutation has occurred.
In a preferred embodiment, the reference and target nucleic acids are alleles of the 25 same genetic locus. Alleles are useful in methods of the invention if there is a sequence difference which distinguishes one allele from the other. In a preferred embodiment, the genetic locus is on or near a tumor suppressor gene. Loss of heterozygosity can result in loss of either allele, therefore either allele can serve as the reference allele.
The important information is the presence or absence of a statistically significant difference between the 30 number of molecules of each allele in the sample. Also in a preferred embodiment, the reference and target nucleic acids are different genetic loci, for example different genes. In a WO 99!66077 PCT/US99/13630 preferred embodiment, the reference nucleic acid comprises both alleles of a reference genetic locus and the target nucleic acid comprises both alleles of a target genetic locus, for example a tumor suppressor gene. Specifically, in the case of a deletion in a tumor suppressor gene, the detected amount of the reference gene is significantly greater than the detected amount of the target gene. If a target sequence is amplified, as in the case of certain oncogene mutations, the detected amount of target is greater than the detected amount of the reference gene by a statistically-significant margin.
Methods according to the art generally require the use of numerous probes, usually in the form of PCR primers and/or hybridization probes, in order to detect a deletion or a point mutation. However, because methods of the present invention involve enumerative detection of nucleotide sequences and enumerative comparisons between sequences that are known to be stable and those that are suspected of being unstable, only a few probes must be used in order to accurately assess cancer risk. In fact, a single set (pair) of probes is all that is necessary to detect a single large deletion. The risk of cancer is indicated by the presence of a mutation in a genetic region known or suspected to be involved in oncogenesis. Patients who are identified as being at risk based upon tests conducted according to methods of the invention are then directed to other, typically invasive, procedures for confirmation and/or treatment of the disease.
Based upon the foregoing explanation, the skilled artisan appreciates that methods of the invention are useful to detect mutations in a subpopulation of a polynucleotides in any biological sample. For example, methods disclosed herein may be used to detect allelic loss (the loss of heterozygosity) associated with diseases such as cancer.
Additionally, methods of the invention may be used to detect a deletion or a base substitution mutation causative of a metabolic error, such as complete or partial loss of enzyme activity. For purposes of exemplification, the following provides details of the use of methods according to the present invention in colon cancer detection. Inventive methods are especially useful in the early detection of a mutation (and especially a large deletion typical of loss of heterozygosity) in a tumor suppressor gene. Accordingly, while exemplified in the following manner, the invention is not so limited and the skilled artisan will appreciate its wide range of applicability upon consideration thereof.
Methods according to the invention preferably comprise comparing a number of a target polynucleotide known or suspected to be mutated to a number of a reference polynucleotide known or suspected not to be mutated. In addition to the alternative embodiments using either alleles or genetic loci as reference and target nucleic acids, the invention comprises a comparison of a microsatellite repeat region in a normal allele with the corresponding microsateilite region in an allele known or suspected to be mutated. Exemplary detection means of the invention comprise determining whether a difference exists between the number of counts of each nucleic acid being measured. The presence of a statistically-significant difference is indicative that a mutation has occurred in one of the nucleic acids to being measured A. Preparation of a Stool Sample A sample prepared from stool voided by a patient should comprise at least a cross-section of the voided stool. As noted above, stool is not homogenous with respect to sloughed cells. As stool passes through the colon, it absorbs sloughed cells from regions of the colonic i5 epithelium with which it makes contacts. Thus, sloughed cells from a polyp are absorbed on only one surface of the forming stool (except near the cecum where stool is still liquid and is homogenized by Intestinal Peristalsis). Taking a representative sample of stool (i.e., at least a cross-section) and homogenizing it ensures that sloughed cells from all epithelial surfaces of the colon will be present for analysis in the processed stool sample. Stool is voided into a 2o receptacle that is preferably small enough to be transported to a testing facility. The receptacle may be fitted to a conventional toilet such that the receptacle accepts stool voided in a conventional manner. The receptacle may comprise a mesh or a screen of sufficient size and placement such that stool is retained while urine is allowed to pass through the mesh or screen and into the toilet. The receptacle may additionally comprise means for homogenizing voided 25 stool. Moreover, the receptacle may comprise means for introducing homogenization buffer or one or more preservatives, such as alcohol or a high salt concentration solution, in order to neutralize bacteria present in the stool sample and to inhibit degradation of DNA.
The receptacle, whether adapted to fit a toilet or simply adapted for receiving the voided stool sample, preferably has sealing means sufficient to contain the voided stool sample 3o and any solution added thereto and to prevent the emanation of odors. The receptacle may have a support frame which is placed directly over a toilet bowl. The support frame has attached thereto an articulating cover which may be placed in a raised position, for depositing of sample or a closed position {not shown) for sealing voided stool within the receptacle. The support frame additionally has a central opening traversing from a top surface through to a bottom surface of the support frame. The bottom surface directly communicates with a top surface of the toilet. Extending from the bottom surface of the support frame and encompassing the entire circumference of the central opening is a means for capturing voided stool. The means for capturing voided stool may be fixedly attached to the support frame or may be removably attached for removal subsequent to deposition of stool.
Once obtained, the stool sample is homogenized in an appropriate buffer, such as o phosphate buffered saline or a chaotropic salt solution. Homogenization means and materials for homogenization are generally known in the art. See, e.g., U.S. Patent No.
4,101,279.
Thus, particular homogenization methods may be selected by the skilled artisan. Methods for further processing and analysis of a biological sample, such as a stool sample are presented below.
B, Methods for Detection of Colon Cancer or Precancer For exemplification, methods of the invention are used to detect a deletion or other mutation in or near the p53 tumor suppressor gene in cells obtained from a representative stool sample. The p53 gene is a good choice because the loss of heterozygosity in p53 is often associated with colorectal cancer. An mRNA sequence corresponding to the DNA
coding 2o region for p53 is reported as GenBank Accession No. M92424. The skilled artisan understands that methods described herein may be used to detect mutations in any gene and that detection of a p53 deletion is exemplary of such methods. In the detection of loss of heterozygosity, it is not necessary to target any particular gene due to the massive deletions associated with this event. Accordingly, an LOH-type deletion involving, for example, p53 may be detected by probing a region outside, but near, p53 because that region is also likely to be deleted. At least a cross-section of a voided stool sample is obtained and prepared as described immediately above. DNA or RNA may optionally be isolated from the sample according to methods known in the art. See, Smith-Ravin et al., 36 GuT, 81-86 (1995), incorporated by reference herein. Methods of the invention may also comprise the step of 3o amplifying DNA or RNA sequences using the polymerase chain reaction.
However, methods of the invention may be performed on unprocessed stool.
Nucleic acids may be sheared or cut into small fragments by, for example, restriction digestion. The size of nucleic acid fragments produced is not critical, subject to the limitations described below. A target nucleic acid that is suspected of being mutated (p53 in this example) and a reference nucleic acid are chosen. The target and reference nucleic acids may be alleles on or near the p53 gene. Alternatively, the target nucleic acid comprises both alleles on or near the p53 gene and the reference nucleic acid comprises both alleles on or near a genetic locus suspected not to be deleted. Single-stranded nucleic acid fragments may be prepared using well-known methods. See, e.g., SAMBROOK ET AL., MOLECULAR
CLONING, LABORATORY MANUAL (1989) incorporated by reference herein.
Either portions of a coding strand or its complement may be detected in methods according to the invention. In a preferred embodiment, both first and second strands of an allele are present in a sample during hybridization to an oligonucleotide probe. The sample is exposed to an excess of probe that is complementary to a portion of the first strand, under conditions to promote specific hybridization of the probe to the portion of the first strand. In a most preferred embodiment, the probe is in sufficient excess to bind all the portion of the first strand, and to prevent reannealing of the first strand to the second strand of the allele. Also in a preferred embodiment, the second strand of an allele is removed from a sample prior to hybridization to an oligonucleotide probe that is complementary to a portion of the first strand of the allele. For exemplification, detection of the coding strand of p53 and reference allele are described. Complement to both p53 and reference allele are removed by hybridization to anti-complement oligonucleotide probes (isolation probes) and subsequent removal of duplex formed thereby. Methods for removal of complement strands from a mixture of single-stranded oligonucleotides are known in the art and include techniques such as affinity chromatography. Upon converting double-stranded DNA to single-stranded DNA, sample is passed through an affinity column comprising bound isolation probe that is complementary to the sequence to be isolated away from the sample. Conventional column chromatography is appropriate for isolation of complement. An affinity column packed with sepharose or any other appropriate materials with attached complementary nucleotides may be used to isolate complement DNA in the column, while allowing DNA to be analyzed to pass through the column. See Sambrook, supra. As an alternative, isolation beads may be used to exclude complement as discussed in detail below.
After removal of complement, the target and reference nucleic acids are exposed to radio-labeled nucleotides under conditions which promote specific association of the radio-labeled nucleotides with the target and reference nucleic acids in a sample.
In order to count the number of molecules of the target and reference nucleic acids, the radionucleotides associated with the target nucleic acid must be distinguished from the radionucleotides associated with the reference nucleic acid. In addition, the radionucleotides that are specifically associated with either target or reference nucleic acid must be distinguished from radionucleotides that are not associated with either nucleic acid. The number of molecules of target nucleic acid is counted by measuring a number X of radioactive decay events (e.g. by to measuring the total number of counts during a defined interval or by measuring the time it takes to obtain a predetermined number of counts) specifically associated with the target nucleic acid. The number X is used to calculate the number X1 of radionucleotides which are specifically associated with the target nucleic acid. The number X1 is used to calculate the number X2 of target nucleic acid molecules, knowing the ratio of radionucleotide molecules to target nucleic acid molecules in the assay.
According to methods of the invention, it is important to count the number of molecules in order to provide a statistical analysis of the likelihood of loss of heterozygosity.
Comparison of the numbers of radioactive decays without knowing the numbers of molecules associated with the radioactive decays does not provide statistical data on the significance of any observed difference.
In a preferred embodiment, a radionucleotide is incorporated into a specific oligonucleotide prior to exposure to the sample. In a most preferred embodiment, a radiolabeled oligonucleotide is used which comprises a single radionucleotide molecule per oligonucleotide molecule. A radio-labeled oligonucleotide is designed to hybridize specifically to a target nucleic acid. in one embodiment the target nucleic acid is a specific allele of a polymorphic genetic locus, and the oligonucleotide is designed to be complementary to the allele at the site of polymorphism. One skilled in the art can perform hybridizations under conditions which promote specific hybridization of the oligonucleotide to the allele, without cross hybridizing to other alleles. Similarly, radiolabeled oligonucleotides are designed to specifically hybridize with the reference nucleic acid.
Also in a preferred embodiment, a radionucleotide is specifically incorporated into an oligonucleotide by primer extension, after exposing the oligonucleotide to the sample under conditions to promote specific hybridization of the oligonucleotide with the target nucleic acid.
In a preferred embodiment the oligonucleotide is unlabeled, and the radionucleotide is a radiolabeled chain terminating nucleotide (e.g. a dideoxynucleotide). In a most preferred embodiment, the radionucleotide is the chain terminating nucleotide complementary to the nucleotide immediately S' to the nucleotide that base pairs to the 3' nucleotide of the oligonucleotide when it is specifically hybridized to the target nucleic acid.
In the embodiment where the target nucleic acid is an allele of a polymorphic genetic locus, the oligonucleotide is to preferably designed such that the 3' nucleotide of the oligonucleotide base pairs with the nucleotide immediately 3' to the polymorphic residue. In a preferred embodiment, a radiolabeled terminating nucleotide that is complementary to the residue at the poiymorphic site is incorporated on the 3'end of the specifically hybridized oligonucleotide by a primer extension reaction. Similarly, in a preferred embodiment, a radionucleotide is specifically associated with a reference nucleic acid by primer extension. Other methods for specifically associating a radioactive isotope with a target or reference nucleic acid (for example a radiolabeled sequence specific DNA binding protein) are also useful for the methods of the invention.
In a preferred embodiment, prior to counting the radioactive decay events, the 2o radionucleotides specifically associated with target and reference nucleic acids are separated from the radivnucleotides that are not specifically associated with either nucleic acid.
Separation is performed as described herein, or using techniques known in the art. Other separation techniques are also useful for practice of the invention. Methods of the invention also comprise distinguishing the radio-label specifically associated with a target nucleic acid from the radio-label specifically associated with a reference nucleic acid. In a preferred embodiment the isotope associated with the target is different from the isotope associated with the receptor. Different isotopes useful to radio-label nucleotides include 3sS, 3zP, 33P~ i2sl~ sH~
and ~4C. In one embodiment, an oligonucleotide complementary to a target nucleic acid is labeled with a different isotope from an oligonucleotide complementary to a reference nucleic acid. In another embodiment, the chain terminating nucleotide associated with the target nucleic acid is different from the chain terminating nucleotide associated with the reference nucleic acid, and the two chain terminating nucleotides are labeled with different isotopes.
In a preferred embodiment, radionucleotides labeled with different isotopes are detected without separating the radionucleotide associated with the target nucleic acid from the radionucleotide associated with the reference nucleic acid. The different isotopes useful to the invention have different characteristic emission spectra. The presence of a first isotope does not prevent the measurement of radioactive decay events of a second isotope.
In a more preferred embodiment, the labeled oligonucleotide associated with the target nucleic acid is the same size as the labeled oligonucleotide associated with the reference nucleic acid (the labeled oligonucleotides can be labeled prior to hybridization or by primer extension). The two differentially labeled oligonucleotides are electrophoresed on a gel, preferably a denaturing gel, and the gel is exposed to an imager that detects the radioactive decay events of both isotopes. in this embodiment the two isotopes are detected at the same position on the imager, because both oligonucleotides migrate to the same position on the gel.
Detection at the same position on the imager reduces variation due to different detection efficiencies at different positions on the imager.
Also in a preferred embodiment, the radionucleotide associated with the target nucleic acid is separated from the radionucleotide associated with the reference nucleic acid prior to measuring radioactive decay events. In a preferred embodiment the separated radionucleotides 2o are labeled with the same isotope.
Preferred separation methods comprise conferring different molecular weights to the radionucleotides specifically associated with the target and reference nucleic acids.
In a preferred embodiment, first probes comprise a "separation moiety." Such separation moiety is, for example, hapten, biotin, or digoxigenin. The separation moiety in first probes does not interfere with the first probe's ability to hybridize with template or be extended. In an alternative embodiment, the labeled ddNTPs comprise a separation moiety.
In yet another alternative embodiment, both the first probes and the labeled ddNTPs comprise a separation moiety. Following the extension reaction, a high molecular weight molecule having affinity for the separation moiety (e.g., avidin, streptavidin, or anti-digoxigenin) is 3o added to the reaction mixture under conditions which permit the high molecular weight molecule to bind to the separation moiety. The reaction components are then separated on the basis of molecular weight using techniques known in the art such as gel electrophoresis, chromatography, or mass spectroscopy. See, AUSUBEL ET AL., SHORT PROTOCOLS IN
MOLECULAR BIOLOGY (3rd ed. lohn Wiley & Sons, Inc., 1995); WU, RECOMBINANT DNA
METHODOLOGY II, (Academic Press, 1995).
Also in a preferred embodiment the radionucleotide associated with a first allele of a polymorphic genetic locus is separated from the radionucleotide associated with a second allele of the polymorphic locus by differential primer extension, wherein the extension products of a given oligonucleotide primer are of a different length for each of the two alleles. In differential primer extension (exemplified in Figure 1) an oligonucleotide is hybridized such that the 3' nucleotide of the oligonucleotide base pairs with the nucleotide that is immediately 5' of the polymorphic site. The extension reaction is performed in the presence of a radiolabeled terminator nucleotide complementary to the nucleotide at the polymorphic site of the first allele. The reaction also comprises non-labeled nucleotides complementary to the other 3 nucleotides. Extension of a primer hybridized to the first allele results in a product having only the terminator nucleotide incorporated (exemplified in Figure lA, T* is the labeled terminator nucleotide). Extension of a primer hybridized to the second allele results in a product that incorporates several non-labeled nucleotides immediately 5' to the terminator nucleotide (exemplified in Figure 1B). The number of non-labeled nucleotides that are incorporated is determined by the position, on the template nucleic acid, of the closest 5' nucleotide complementary to the terminator nucleotide. In an alternative embodiment, differential primer extension comprises a labeled oligonucleotide and a non-labeled terminator nucleotide.
Labeled probes are exposed to sample under hybridization conditions. Such conditions are well-known in the art. See, e.g., Wallace et al., 6 NUCLEIC ACIDS RES.
3543-57 (1979), incorporated by reference herein. First and Second oligonucleotide probes that are distinctly labeled (i.e. with different radioactive isotopes, fluorescent means, or with beads of different size) are applied to a single aliquot of sample. After exposure of the probes to sample under hybridization conditions, sample is washed to remove any unhybridized probe.
Thereafter, hybridized probes are detected separately for p53 hybrids and reference allele hybrids.
3o Standards may be used to establish background and to equilibrate results.
Also, if differential fluorescent labels are used, the number of probes may be determined by counting differential fluorescent events in a sample that has been diluted sufficiently to enable detection of single fluorescent events in the sample. Duplicate samples may be analyzed in order to confirm the accuracy of results obtained.
If there is a difference between the amount of p53 detected and the amount of the reference allele detected greater than a 0.5 % difference with at least 550,000 events (earlier shown to be the threshold of significance), it may be assumed that a mutation has occurred in the region involving p53 and the patient is at risk for developing or has developed colon cancer. Statistical significance may be determined by any known method. A
preferred method is outlined above.
1o The determination of a p53 mutation allows a clinician to recommend further treatment, such as endoscopy procedures, in order to further diagnose and, if necessary, treat the patient's condition. The following examples illustrate methods of the invention that allow direct quantification of hybridization events.
The MTS locus was identified in linkage studies. See Skolnick et al., International 2o Publication No. WO 95/25$13. The MTS locus encompasses the MTSI and MTS2 gene sequences. Mutations in the MTS locus in the germline are indicative of predisposition to melanoma and other cancers. The mutational events of the MTS locus can involve deletions, insertions and point mutations within the coding sequence and the non-coding sequence.
A locus in the MTS gene was identified by Skolnick, et al. as predisposing for 25 melamona. They tested MTS 1 and MTS2 genomic DNA from individuals presumed to carry MTS alleles predisposing to melanoma and from individuals presumed not to carry MTS
alleles predisposing to melanoma . A single nucleotide polymorphic locus was identified in exon 2 in the MTS1 sequence. The polymorphism results in an amino acid substitution, and was found to segregate with the MTS predisposing allele. The substitutions resulted in either 3o the substitution of a large hydrophobic residue for a small hydrophilic residue, or the substitution of a charged amino acid for a neutral amino acid {specifically, either a substitution WO 99/bb077 PCT/US99/13630 of a glycine with a tryptophan, or a valine with a asparagine). This single-nucleotide polymorphic locus is useful as a marker in the methods of the invention.
Using methods of the invention, predisposition to cancers, such as melanoma and the other cancers related to MTS, is ascertained by testing any tissue or body fluid for the presence of disease-associated variants at the MTS locus. The variants to be screened may be alleles on or near the MTS locus, including Exon 2 of the MTS 1 sequence. A
sample comprising pooled genomic DNA from healthy members of a population presumed not to have the MTS predisposing allele (referred to as the reference sample), and a sample comprising pooled genomic DNA from diseased members of a population presumed to carry the MTS
1o predisposing allele (referred to a the target sample) are prepared. Nucleic acids are sheared or cut into small fragments by, for example, restriction digestion. The size of nucleic acid fragments produced is not critical, subject to the limitations described below. Single-stranded nucleic acid fragments may be prepared using well-known methods. See, e.g., SAMBROOK ET
AL., MOLECULAR CLONING, A LABORATORY MANUAL (1989) incorporated by reference herein.
Either portions of a coding strand or its complement may be detected in methods according to the invention. In a preferred embodiment, both first and second strands of an allele are present in a sample during hybridization to an oligonucleotide probe. The sample is exposed to an excess of probe that is complementary to a portion of the first strand, under conditions that promote specific hybridization of the probe to the portion of the first strand. In a most preferred embodiment, the probe is in sufficient excess to bind all the portion of the first strand, and to prevent reannealing of the first strand to the second strand of the allele.
Also in a preferred embodiment, the second strand of an allele is removed from a sample prior to hybridization to an oligonucleotide probe that is complementary to a portion of the first strand of the allele. Complement to exons are removed by hybridization to anti-complement oligonucleotide probes (isolation probes) and subsequent removal of duplex formed thereby.
Methods for removal of complement strands from a mixture of single-stranded oligonucleotides are known in the art and include techniques such as affinity chromatography.
Upon converting double-stranded DNA to single-stranded DNA, sample is passed through an 3o affinity column comprising bound isolation probe that is complementary to the sequence to be isolated away from the sample. Conventional column chromatography is appropriate for WO 99/66077 PC'f/US99/13630 isolation of complement. An affinity column packed with sepharose or any other appropriate materials with attached complementary nucleotides may be used to isolate complement DNA in the column, while allowing DNA to be analyzed to pass through the column. See SAMBROOK, supra. As an alternative, isolation beads may be used to exclude complement.
After removal of complement, DNA samples are exposed to radiolabeled nucleotides under conditions which promote specific hybridization. Probes are preferably designed to hybridize specifically (i.e., without mismatches) to a portion of target genomic DNA that contains the polymorphic variant. In a particularly preferred embodiment, four different types of probes are used, each having a different radiolabeled nucleotide in a position to hybridize 1o with the variant nucleotide. The nucleotides in position to hybridize with the variant nucleotide are selected from dATP, dNTP, dCTP, and dGTP, and each is differentially labeled (i.e., with a different isotope or with isotopes of detectably distinct energy levels). Probes are hybridized under conditions that require an exact match of nucleotides in the probe to nucleotides on the target. Upon washing, the only probes that remain bound are those having 15 a labeled nucleotide that is an extact match for the nucleotide at the variant position. If more than one variant is present in a sample, each variant is detected because the nucleotides that have specifically bound to the variant are differentially labeled. The number of molecules of each particular variant is counted by measuring the number of radioactive decay events (e.g., by measuring the total number of counts during a defined interval or by measuring the time it 2o takes to obtain a predetermined number of counts) specifically associated with the particular variant. That number is used to calculate the number of radionucleotides which specifically hybridize with a particular variant in the target sample. The number of each variant present in a healthy sample (preferably pooled healthy samples) is determined in the same manner.
In another preferred embodiment, a single base extension reaction is used in which a 25 sequence-specific probe is hybridized immediately adjacent and usptream to the variant nucleotide to be detected. Each of four differentially-labeled dideoxy nucleotides is then added along with a polymerase under conditions that allow extension of the probe by one base. The number of each dideoxy nucleotides that hybridize at the variant nucleotide position is then determined as described above. Those numbers are compared to numbers obtained from 3o members of a healthy population to determine if there is a statistically-significant difference, the presence of such a difference being indicative of disease or the propensity therefor.
In a preferred embodiment, radioactive decays are used to count the number of a targeted nucleic acid. Preferred isotopes for use in the invention are selected from 355, szP, 33P~ i2sl~ 3H, and y4C. In a preferred embodiment, radionucleotides labeled with different isotopes are detected without separating the radionucleotide associated with a first variant from a radionucleotide associated with a second variant. Isotopes useful in the invention have different characteristic emission spectra. The presence of a first isotope does not prevent the measurement of radioactive decay events of a second isotope. In a more preferred embodiment, two different labeled nucleotides of the same molecular weight are used. The two differentially labeled oligonucleotides are electrophoresed on a gel, preferably a 1o denaturing gel, and the gel is exposed to an imager that detects the radioactive decay events of both isotopes. In this embodiment the two isotopes are detected at the same position on the imager, because both oligonucleotides migrate to the same position on the gel.
Detection at the same position on the imager reduces variation due to different detection efficiencies at different positions on the imager.
Also in a preferred embodiment, the radionucleotide associated with the particular variant is separated from the radionucleotide associated with another particular variant prior to measuring radioactive decay events. In a preferred embodiment, the separated radionucleotides are labeled with the same isotope. Preferred separation methods comprise conferring different molecular weights to the radionucleotides specifically associated with the 2o particular variant in the target and reference samples.
In a preferred embodiment, first probes comprise a "separation moiety." Such separation moiety is, for example, hapten, biotin, or digoxigenin. The separation moiety in first probes does not interfere with the first probe's ability to hybridize with template or be extended. In an alternative embodiment, the labeled ddNTPs comprise a separation moiety.
In yet another alternative embodiment, both the first probes and the labeled ddNTPs comprise a separation moiety. Following the extension reaction, a high molecular weight molecule having affinity for the separation moiety (e.g., avidin, streptavidin, or anti-digoxigenin) is added to the reaction mixture under conditions which permit the high molecular weight molecule to bind to the separation moiety. The reaction components are then separated on the 3o basis of molecular weight using techniques known in the art such as gel electrophoresis, chromatography, or mass spectroscopy. See AUSUBEL ET AL., SHORT PROTOCOLS IN
MOLECULAR BIOLOGY (3rd ed., dohn Wiley & Sons, Inc., 1995); WU, RECOMBINANT
DNA
METHODOLOGY II (Academic Press, 1995).
Also in a preferred embodiment, the radionucleotide associated with a first variant is separated from the radionucleotide associated with a second variant by differential primer extension, wherein the extension products of a given oligonucleotide primer are of a different length for each of the two variants. In differential primer extension (exemplified in Figure 1) an oligonucleotide is hybridized such that the 3' nucleotide of the oligonucleotide base pairs with the nucleotide that is immediately 5' of the polymorphic site. The extension reaction is performed in the presence of a radiolabeled terminator nucleotide complementary to the 1o nucleotide at the polymorphic site of the first variant. The reaction may also comprise non-labeled nucleotides complementary to the other 3 nucleotides. Extension of a primer hybridized to a first allele results in a product having only the terminator nucleotide incorporated (exemplified in Figure lA, T* is the labeled terminator nucleotide). Extension of a primer hybridized to the second variant results in a product that incorporates several non-IS labeled nucleotides immediately 5' to the terminator nucleotide (exemplified in Figure 1B).
The number of non-labeled nucleotides that are incorporated is determined by the position, on the template nucleic acid, of the closest 5' nucleotide complementary to the terminator nucleotide. In an alternative embodiment, differential primer extension comprises a labeled oligonucleotide and a non-labeled terminator nucleotide.
2o Labeled probes are exposed to sample under hybridization conditions. Such conditions are well-known in the art. See, e.g., Wallace et al., 6 NUCLEIC ACIDS RES.
3543-57 (1979), incorporated by reference herein. First and second oligonucleotide probes that are distinctly labeled (i.e. with different radioactive isotopes, fluorescent means, or with beads of different size) are applied to a single aliquot of sample. After exposure of the probes to sample under 25 hybridization conditions, sample is washed to remove any unhybridized probe. Thereafter, hybridized probes are detected separately for each variant. Standards may be used to establish background and to equilibrate results. Also, if differential fluorescent labels are used, the number of probes may be determined by counting differential fluorescent events in a sample that has been diluted sufficiently to enable detection of single fluorescent events in the sample.
3o Duplicate samples may be analyzed in order to confirm the accuracy of results obtained.
If there is'a difference between the amount of a particular variant determined in the target sample and the reference sample greater than a 0.5 % difference with at least 550,000 events (see below), it is assumed that the particular variant is indicative of a diagnostic disease marker. Statistical significance may be determined by any known method. A
preferred method is outlined below.
Enumerative sampling of a nucleotide sequence that is uniformly distributed in a biological sample typically follows a Poisson distribution. For large populations, such as the typical number of genomic polynucleotide segments in a biological sample, the Poisson distribution is similar to a normal (Gaussian) curve with a mean, N, and a standard deviation to that may be approximated as the square root of N.
Statistically-significance between numbers of target and reference genes obtained from a biological sample may be determined by any appropriate method. See, e.g:, STEEL ET AL., PRINCIPLES & PROC. STATS., A BIOMETRICAL APPROACH (McGraw-HiII, 1980), the disclosure of which is incorporated by reference herein. An exemplary method is to determine, based upon a desired level of specificity (tolerance of false positives) and sensitivity (tolerance of false negatives) and within a selected level of confidence, the difference between numbers of target and reference genes that must be obtained in order to reach a chosen level of statistical significance. A threshold issue in such a determination is the minimum number, N, of genes (for each of target and reference) that must be available in a population in order to allow a 2o determination of statistical significance. The number N will depend upon the assumption of a minimum number of mutant alleles in a sample containing mutant alleles (assumed herein to be at least 1 % ) and the further assumption that normal samples contain no mutant alleles. It is also assumed that a threshold differences between the numbers of reference and target genes must be at least 0.5 % for a diagnosis that there is a mutation present in a subpopulation of cells in the sample. Based upon the foregoing assumptions, it is possible to determine how large N must be so that a detected difference between numbers of mutant and reference alleles of less than 0.5% is truly a negative (i.e. no mutant subpopulation in the sample) result 99.9%
of the time.
The calculation of N for specificity, then, is based upon the probability of one sample 3o measurement being in the portion of the Gaussian distribution covering the lowest 3.16 % of the population (the area marked " A" in figure 2A) and the probability that the other sample measurement is in the portion of the Gaussian distribution covering the highest 3.16 % of the population (the area marked "B" in figure 2B). Since the two sample measurements are independent events, the probability of both events occurring simultaneously in a single sample is approximately 0.001 or 0.1 % . Thus, 93.68 % of the Gaussian distribution (100% -2x3.16%) lies between the areas marked A and B in figure 3. Statistical tables indicate that such area is equivalent to 3 .72 standard deviations . Accordingly, 0.5 % N is set equal to 3 .72 sigma. Since sigma (the standard deviation) is equal to ,rN , the equation may be solved for N as 553,536. This means that if the lower of the two numbers representing reference and target is at least 553,536 and if the patient is truly normal, the difference between the numbers 1o will be less than 0.5% about 99.9% of the time.
To determine the minimum N required for 99 % sensitivity a similar analysis is performed. This time, one-tailed Gaussian distribution tables show that 1.28 standard deviations (sigma) from the mean cover 90% of the Gaussian distribution.
Moreover, there is a 10 % (the square root of 1 % ) probability of one of the numbers (reference or target) being in either the area marked "A" in Figure 3 or in the area marked "B" in Figure 3.
If the two population means are a total of 1 % different and if there must be a 0.5 %
difference between the number of target and reference genes, then the distance from either mean to the threshold for statistical significance is equivalent to 0.25 % N (See Figure 3) for 99 %
sensitivity. As shown in Figure 3, 0.25%N corresponds to about 40% of one side of the Gaussian 2o distribution. Statistical tables reveal that 40% of the Gaussian distribution corresponds to 1.28 standard deviations from the mean. Therefore, 1.28 sigma is equal to 0.0025N, and N equals 262,144. Thus, for abnormal samples, the difference will exceed 0.5 % at least 99 % of the time if the lower of the two numbers is at least 262,144. Conversely, an erroneous negative diagnosis will be made only 1 % of the time under these conditions.
In order to have both 99.9 % specificity (avoidance of false positives) and 99 sensitivity (avoidance of false negatives), a sample with DNA derived from at least 553,536 (or roughly greater than 550,000) cells should be counted. A difference of at least 0.5 %
between the numbers obtained is significant at a confidence level of 99.0% for sensitivity and a difference of less than 0.5 % between the numbers is significant at a confidence level of 99.9% for specificity. As noted above, other standard statistical tests may be used in order to determine statistical significance and the foregoing represents one such test.
Using the above-described methods, a particular variant is identified in Exon 2 of the MTS1 sequence which is indicative of the presence of the MTS predisposing allele. The variant is determined by identifying a statistically-significant difference between a reference number of a particular variant present in a patient sample and a number of the variant present in a separate sample known to be normal (preferably this is the result of pooled samples from normal individuals). An individual patient can be assessed for a predisposition for various cancers by determining the presence or absence of the particular variant in the patient's genomic DNA. The severity of the disease is then assessed by determining a number of molecules of the variant in a standardized sample of the patient's genomic DNA, and applying 1o a predetermined statistical relationship to the number correlating the number with the severity of the disease.
DETECTION OF THE LOSS OF HETEROZYGOSITY
Methods according to the present invention are useful for the detection of loss of ~5 heterozygosity in a heterogeneous cellular sample in which the loss of heterozygosity occurs in only a small subpopulation of cells in the sample. Using traditional detection methods, such a subpopulation would be difficult, if not impossible, to detect especially if the deletion end points are unknown at the time of detection or a clonally-impure cellular population is used.
See, e.g., U.S. Patent No. 5,527,676 (reporting that a clonal population of cells should be 2o used in order to detect a deletion in a p53 gene). Traditional methods for detection of mutations involved in carcinogenesis rely upon the use of a clonally-pure population of cells and such methods are best at detecting mutations that occur at known "hot spots" in oncogenes, such as k-ras. See, Sidransky, supra.
Methods of the present invention are useful for detecting loss of heterozygosity in a 25 small number of cells in an impure cellular population because such methods do not rely upon knowing the precise deletion end-points and such methods are not affected by the presence in the sample of heterogeneous DNA. For example, in loss of heterozygosity, deletions occur over large portions of the genome and entire chromosome arms may be missing.
Methods of the invention comprise counting a number of molecules of a target nucleic acid suspected of 3o being deleted and comparing it to a reference number. In a preferred embodiment the reference number is the number of molecules of a nucleic acid suspected of not being deleted in the same sample. All that one needs to know is at least a portion of the sequence of a target nucleic acid suspected of being deleted and at least a portion of the sequence of a reference nucleic acid suspected of not being deleted. Methods of the invention, while amenable to multiple mutation detection, do not require multiple mutation detection in order to detect indicia of cancer in a heterogeneous sample.
Accordingly, methods of the present invention are useful for the detection of loss of heterozygosity in a subpopulation of cells or debris therefrom in a sample.
Loss of heterozygosity generally occurs as a deletion of at least one wild-type allelic sequence in a subpopulation of cells. In the case of a tumor suppressor gene, the deletion typically takes the 1o form of a massive deletion characteristic of loss of heterozygosity. Often, as in the case of certain forms of cancer, disease-causing deletions initially occur in a single cell which then produces a small subpopulation of mutant cells. By the time clinical manifestations of the mutation are detected, the disease may have progressed to an incurable stage.
Methods of the invention allow detection of a deletion when it exists as only a small percentage of the total is cells or cellular debris in a sample.
Methods of the invention comprise a comparison of the number of molecules of two nucleic acids that are expected to be present in the sample in equal numbers in normal (non-mutated) cells. In a preferred embodiment, the comparison is between (1) an amount of a genomic polynucleotide segment that is known or suspected not to be mutated in cells of the 20 sample (the "reference" ) and (2) an amount of a wild-type (non-mutated) genomic polynucleotide segment suspected of being mutated in a subpopulation of cells in the sample (the "target"). A statistically-significant difference between the amounts of the two genomic polynucleotide segments indicates that a mutation has occurred.
In a preferred embodiment, the reference and target nucleic acids are alleles of the 25 same genetic locus. Alleles are useful in methods of the invention if there is a sequence difference which distinguishes one allele from the other. In a preferred embodiment, the genetic locus is on or near a tumor suppressor gene. Loss of heterozygosity can result in loss of either allele, therefore either allele can serve as the reference allele.
The important information is the presence or absence of a statistically significant difference between the 30 number of molecules of each allele in the sample. Also in a preferred embodiment, the reference and target nucleic acids are different genetic loci, for example different genes. In a WO 99!66077 PCT/US99/13630 preferred embodiment, the reference nucleic acid comprises both alleles of a reference genetic locus and the target nucleic acid comprises both alleles of a target genetic locus, for example a tumor suppressor gene. Specifically, in the case of a deletion in a tumor suppressor gene, the detected amount of the reference gene is significantly greater than the detected amount of the target gene. If a target sequence is amplified, as in the case of certain oncogene mutations, the detected amount of target is greater than the detected amount of the reference gene by a statistically-significant margin.
Methods according to the art generally require the use of numerous probes, usually in the form of PCR primers and/or hybridization probes, in order to detect a deletion or a point mutation. However, because methods of the present invention involve enumerative detection of nucleotide sequences and enumerative comparisons between sequences that are known to be stable and those that are suspected of being unstable, only a few probes must be used in order to accurately assess cancer risk. In fact, a single set (pair) of probes is all that is necessary to detect a single large deletion. The risk of cancer is indicated by the presence of a mutation in a genetic region known or suspected to be involved in oncogenesis. Patients who are identified as being at risk based upon tests conducted according to methods of the invention are then directed to other, typically invasive, procedures for confirmation and/or treatment of the disease.
Based upon the foregoing explanation, the skilled artisan appreciates that methods of the invention are useful to detect mutations in a subpopulation of a polynucleotides in any biological sample. For example, methods disclosed herein may be used to detect allelic loss (the loss of heterozygosity) associated with diseases such as cancer.
Additionally, methods of the invention may be used to detect a deletion or a base substitution mutation causative of a metabolic error, such as complete or partial loss of enzyme activity. For purposes of exemplification, the following provides details of the use of methods according to the present invention in colon cancer detection. Inventive methods are especially useful in the early detection of a mutation (and especially a large deletion typical of loss of heterozygosity) in a tumor suppressor gene. Accordingly, while exemplified in the following manner, the invention is not so limited and the skilled artisan will appreciate its wide range of applicability upon consideration thereof.
Methods according to the invention preferably comprise comparing a number of a target polynucleotide known or suspected to be mutated to a number of a reference polynucleotide known or suspected not to be mutated. In addition to the alternative embodiments using either alleles or genetic loci as reference and target nucleic acids, the invention comprises a comparison of a microsatellite repeat region in a normal allele with the corresponding microsateilite region in an allele known or suspected to be mutated. Exemplary detection means of the invention comprise determining whether a difference exists between the number of counts of each nucleic acid being measured. The presence of a statistically-significant difference is indicative that a mutation has occurred in one of the nucleic acids to being measured A. Preparation of a Stool Sample A sample prepared from stool voided by a patient should comprise at least a cross-section of the voided stool. As noted above, stool is not homogenous with respect to sloughed cells. As stool passes through the colon, it absorbs sloughed cells from regions of the colonic i5 epithelium with which it makes contacts. Thus, sloughed cells from a polyp are absorbed on only one surface of the forming stool (except near the cecum where stool is still liquid and is homogenized by Intestinal Peristalsis). Taking a representative sample of stool (i.e., at least a cross-section) and homogenizing it ensures that sloughed cells from all epithelial surfaces of the colon will be present for analysis in the processed stool sample. Stool is voided into a 2o receptacle that is preferably small enough to be transported to a testing facility. The receptacle may be fitted to a conventional toilet such that the receptacle accepts stool voided in a conventional manner. The receptacle may comprise a mesh or a screen of sufficient size and placement such that stool is retained while urine is allowed to pass through the mesh or screen and into the toilet. The receptacle may additionally comprise means for homogenizing voided 25 stool. Moreover, the receptacle may comprise means for introducing homogenization buffer or one or more preservatives, such as alcohol or a high salt concentration solution, in order to neutralize bacteria present in the stool sample and to inhibit degradation of DNA.
The receptacle, whether adapted to fit a toilet or simply adapted for receiving the voided stool sample, preferably has sealing means sufficient to contain the voided stool sample 3o and any solution added thereto and to prevent the emanation of odors. The receptacle may have a support frame which is placed directly over a toilet bowl. The support frame has attached thereto an articulating cover which may be placed in a raised position, for depositing of sample or a closed position {not shown) for sealing voided stool within the receptacle. The support frame additionally has a central opening traversing from a top surface through to a bottom surface of the support frame. The bottom surface directly communicates with a top surface of the toilet. Extending from the bottom surface of the support frame and encompassing the entire circumference of the central opening is a means for capturing voided stool. The means for capturing voided stool may be fixedly attached to the support frame or may be removably attached for removal subsequent to deposition of stool.
Once obtained, the stool sample is homogenized in an appropriate buffer, such as o phosphate buffered saline or a chaotropic salt solution. Homogenization means and materials for homogenization are generally known in the art. See, e.g., U.S. Patent No.
4,101,279.
Thus, particular homogenization methods may be selected by the skilled artisan. Methods for further processing and analysis of a biological sample, such as a stool sample are presented below.
B, Methods for Detection of Colon Cancer or Precancer For exemplification, methods of the invention are used to detect a deletion or other mutation in or near the p53 tumor suppressor gene in cells obtained from a representative stool sample. The p53 gene is a good choice because the loss of heterozygosity in p53 is often associated with colorectal cancer. An mRNA sequence corresponding to the DNA
coding 2o region for p53 is reported as GenBank Accession No. M92424. The skilled artisan understands that methods described herein may be used to detect mutations in any gene and that detection of a p53 deletion is exemplary of such methods. In the detection of loss of heterozygosity, it is not necessary to target any particular gene due to the massive deletions associated with this event. Accordingly, an LOH-type deletion involving, for example, p53 may be detected by probing a region outside, but near, p53 because that region is also likely to be deleted. At least a cross-section of a voided stool sample is obtained and prepared as described immediately above. DNA or RNA may optionally be isolated from the sample according to methods known in the art. See, Smith-Ravin et al., 36 GuT, 81-86 (1995), incorporated by reference herein. Methods of the invention may also comprise the step of 3o amplifying DNA or RNA sequences using the polymerase chain reaction.
However, methods of the invention may be performed on unprocessed stool.
Nucleic acids may be sheared or cut into small fragments by, for example, restriction digestion. The size of nucleic acid fragments produced is not critical, subject to the limitations described below. A target nucleic acid that is suspected of being mutated (p53 in this example) and a reference nucleic acid are chosen. The target and reference nucleic acids may be alleles on or near the p53 gene. Alternatively, the target nucleic acid comprises both alleles on or near the p53 gene and the reference nucleic acid comprises both alleles on or near a genetic locus suspected not to be deleted. Single-stranded nucleic acid fragments may be prepared using well-known methods. See, e.g., SAMBROOK ET AL., MOLECULAR
CLONING, LABORATORY MANUAL (1989) incorporated by reference herein.
Either portions of a coding strand or its complement may be detected in methods according to the invention. In a preferred embodiment, both first and second strands of an allele are present in a sample during hybridization to an oligonucleotide probe. The sample is exposed to an excess of probe that is complementary to a portion of the first strand, under conditions to promote specific hybridization of the probe to the portion of the first strand. In a most preferred embodiment, the probe is in sufficient excess to bind all the portion of the first strand, and to prevent reannealing of the first strand to the second strand of the allele. Also in a preferred embodiment, the second strand of an allele is removed from a sample prior to hybridization to an oligonucleotide probe that is complementary to a portion of the first strand of the allele. For exemplification, detection of the coding strand of p53 and reference allele are described. Complement to both p53 and reference allele are removed by hybridization to anti-complement oligonucleotide probes (isolation probes) and subsequent removal of duplex formed thereby. Methods for removal of complement strands from a mixture of single-stranded oligonucleotides are known in the art and include techniques such as affinity chromatography. Upon converting double-stranded DNA to single-stranded DNA, sample is passed through an affinity column comprising bound isolation probe that is complementary to the sequence to be isolated away from the sample. Conventional column chromatography is appropriate for isolation of complement. An affinity column packed with sepharose or any other appropriate materials with attached complementary nucleotides may be used to isolate complement DNA in the column, while allowing DNA to be analyzed to pass through the column. See Sambrook, supra. As an alternative, isolation beads may be used to exclude complement as discussed in detail below.
After removal of complement, the target and reference nucleic acids are exposed to radio-labeled nucleotides under conditions which promote specific association of the radio-labeled nucleotides with the target and reference nucleic acids in a sample.
In order to count the number of molecules of the target and reference nucleic acids, the radionucleotides associated with the target nucleic acid must be distinguished from the radionucleotides associated with the reference nucleic acid. In addition, the radionucleotides that are specifically associated with either target or reference nucleic acid must be distinguished from radionucleotides that are not associated with either nucleic acid. The number of molecules of target nucleic acid is counted by measuring a number X of radioactive decay events (e.g. by to measuring the total number of counts during a defined interval or by measuring the time it takes to obtain a predetermined number of counts) specifically associated with the target nucleic acid. The number X is used to calculate the number X1 of radionucleotides which are specifically associated with the target nucleic acid. The number X1 is used to calculate the number X2 of target nucleic acid molecules, knowing the ratio of radionucleotide molecules to target nucleic acid molecules in the assay.
According to methods of the invention, it is important to count the number of molecules in order to provide a statistical analysis of the likelihood of loss of heterozygosity.
Comparison of the numbers of radioactive decays without knowing the numbers of molecules associated with the radioactive decays does not provide statistical data on the significance of any observed difference.
In a preferred embodiment, a radionucleotide is incorporated into a specific oligonucleotide prior to exposure to the sample. In a most preferred embodiment, a radiolabeled oligonucleotide is used which comprises a single radionucleotide molecule per oligonucleotide molecule. A radio-labeled oligonucleotide is designed to hybridize specifically to a target nucleic acid. in one embodiment the target nucleic acid is a specific allele of a polymorphic genetic locus, and the oligonucleotide is designed to be complementary to the allele at the site of polymorphism. One skilled in the art can perform hybridizations under conditions which promote specific hybridization of the oligonucleotide to the allele, without cross hybridizing to other alleles. Similarly, radiolabeled oligonucleotides are designed to specifically hybridize with the reference nucleic acid.
Also in a preferred embodiment, a radionucleotide is specifically incorporated into an oligonucleotide by primer extension, after exposing the oligonucleotide to the sample under conditions to promote specific hybridization of the oligonucleotide with the target nucleic acid.
In a preferred embodiment the oligonucleotide is unlabeled, and the radionucleotide is a radiolabeled chain terminating nucleotide (e.g. a dideoxynucleotide). In a most preferred embodiment, the radionucleotide is the chain terminating nucleotide complementary to the nucleotide immediately S' to the nucleotide that base pairs to the 3' nucleotide of the oligonucleotide when it is specifically hybridized to the target nucleic acid.
In the embodiment where the target nucleic acid is an allele of a polymorphic genetic locus, the oligonucleotide is to preferably designed such that the 3' nucleotide of the oligonucleotide base pairs with the nucleotide immediately 3' to the polymorphic residue. In a preferred embodiment, a radiolabeled terminating nucleotide that is complementary to the residue at the poiymorphic site is incorporated on the 3'end of the specifically hybridized oligonucleotide by a primer extension reaction. Similarly, in a preferred embodiment, a radionucleotide is specifically associated with a reference nucleic acid by primer extension. Other methods for specifically associating a radioactive isotope with a target or reference nucleic acid (for example a radiolabeled sequence specific DNA binding protein) are also useful for the methods of the invention.
In a preferred embodiment, prior to counting the radioactive decay events, the 2o radionucleotides specifically associated with target and reference nucleic acids are separated from the radivnucleotides that are not specifically associated with either nucleic acid.
Separation is performed as described herein, or using techniques known in the art. Other separation techniques are also useful for practice of the invention. Methods of the invention also comprise distinguishing the radio-label specifically associated with a target nucleic acid from the radio-label specifically associated with a reference nucleic acid. In a preferred embodiment the isotope associated with the target is different from the isotope associated with the receptor. Different isotopes useful to radio-label nucleotides include 3sS, 3zP, 33P~ i2sl~ sH~
and ~4C. In one embodiment, an oligonucleotide complementary to a target nucleic acid is labeled with a different isotope from an oligonucleotide complementary to a reference nucleic acid. In another embodiment, the chain terminating nucleotide associated with the target nucleic acid is different from the chain terminating nucleotide associated with the reference nucleic acid, and the two chain terminating nucleotides are labeled with different isotopes.
In a preferred embodiment, radionucleotides labeled with different isotopes are detected without separating the radionucleotide associated with the target nucleic acid from the radionucleotide associated with the reference nucleic acid. The different isotopes useful to the invention have different characteristic emission spectra. The presence of a first isotope does not prevent the measurement of radioactive decay events of a second isotope.
In a more preferred embodiment, the labeled oligonucleotide associated with the target nucleic acid is the same size as the labeled oligonucleotide associated with the reference nucleic acid (the labeled oligonucleotides can be labeled prior to hybridization or by primer extension). The two differentially labeled oligonucleotides are electrophoresed on a gel, preferably a denaturing gel, and the gel is exposed to an imager that detects the radioactive decay events of both isotopes. in this embodiment the two isotopes are detected at the same position on the imager, because both oligonucleotides migrate to the same position on the gel.
Detection at the same position on the imager reduces variation due to different detection efficiencies at different positions on the imager.
Also in a preferred embodiment, the radionucleotide associated with the target nucleic acid is separated from the radionucleotide associated with the reference nucleic acid prior to measuring radioactive decay events. In a preferred embodiment the separated radionucleotides 2o are labeled with the same isotope.
Preferred separation methods comprise conferring different molecular weights to the radionucleotides specifically associated with the target and reference nucleic acids.
In a preferred embodiment, first probes comprise a "separation moiety." Such separation moiety is, for example, hapten, biotin, or digoxigenin. The separation moiety in first probes does not interfere with the first probe's ability to hybridize with template or be extended. In an alternative embodiment, the labeled ddNTPs comprise a separation moiety.
In yet another alternative embodiment, both the first probes and the labeled ddNTPs comprise a separation moiety. Following the extension reaction, a high molecular weight molecule having affinity for the separation moiety (e.g., avidin, streptavidin, or anti-digoxigenin) is 3o added to the reaction mixture under conditions which permit the high molecular weight molecule to bind to the separation moiety. The reaction components are then separated on the basis of molecular weight using techniques known in the art such as gel electrophoresis, chromatography, or mass spectroscopy. See, AUSUBEL ET AL., SHORT PROTOCOLS IN
MOLECULAR BIOLOGY (3rd ed. lohn Wiley & Sons, Inc., 1995); WU, RECOMBINANT DNA
METHODOLOGY II, (Academic Press, 1995).
Also in a preferred embodiment the radionucleotide associated with a first allele of a polymorphic genetic locus is separated from the radionucleotide associated with a second allele of the polymorphic locus by differential primer extension, wherein the extension products of a given oligonucleotide primer are of a different length for each of the two alleles. In differential primer extension (exemplified in Figure 1) an oligonucleotide is hybridized such that the 3' nucleotide of the oligonucleotide base pairs with the nucleotide that is immediately 5' of the polymorphic site. The extension reaction is performed in the presence of a radiolabeled terminator nucleotide complementary to the nucleotide at the polymorphic site of the first allele. The reaction also comprises non-labeled nucleotides complementary to the other 3 nucleotides. Extension of a primer hybridized to the first allele results in a product having only the terminator nucleotide incorporated (exemplified in Figure lA, T* is the labeled terminator nucleotide). Extension of a primer hybridized to the second allele results in a product that incorporates several non-labeled nucleotides immediately 5' to the terminator nucleotide (exemplified in Figure 1B). The number of non-labeled nucleotides that are incorporated is determined by the position, on the template nucleic acid, of the closest 5' nucleotide complementary to the terminator nucleotide. In an alternative embodiment, differential primer extension comprises a labeled oligonucleotide and a non-labeled terminator nucleotide.
Labeled probes are exposed to sample under hybridization conditions. Such conditions are well-known in the art. See, e.g., Wallace et al., 6 NUCLEIC ACIDS RES.
3543-57 (1979), incorporated by reference herein. First and Second oligonucleotide probes that are distinctly labeled (i.e. with different radioactive isotopes, fluorescent means, or with beads of different size) are applied to a single aliquot of sample. After exposure of the probes to sample under hybridization conditions, sample is washed to remove any unhybridized probe.
Thereafter, hybridized probes are detected separately for p53 hybrids and reference allele hybrids.
3o Standards may be used to establish background and to equilibrate results.
Also, if differential fluorescent labels are used, the number of probes may be determined by counting differential fluorescent events in a sample that has been diluted sufficiently to enable detection of single fluorescent events in the sample. Duplicate samples may be analyzed in order to confirm the accuracy of results obtained.
If there is a difference between the amount of p53 detected and the amount of the reference allele detected greater than a 0.5 % difference with at least 550,000 events (earlier shown to be the threshold of significance), it may be assumed that a mutation has occurred in the region involving p53 and the patient is at risk for developing or has developed colon cancer. Statistical significance may be determined by any known method. A
preferred method is outlined above.
1o The determination of a p53 mutation allows a clinician to recommend further treatment, such as endoscopy procedures, in order to further diagnose and, if necessary, treat the patient's condition. The following examples illustrate methods of the invention that allow direct quantification of hybridization events.
Claims (22)
1. A method for identifying a variation in a nucleic acid in two or more samples, the method comprising the steps of:
(a) enumerating a number of a nucleic acid in a first sample;
(b) enumerating a number of said nucleic acid in a second sample; and (c) determining whether a statistically-significant difference exists between enumerated numbers of said nucleic acid between said first sample and said second sample;
a statistically-significant difference being indicative of a variation in said nucleic acid between said first sample and said second sample.
(a) enumerating a number of a nucleic acid in a first sample;
(b) enumerating a number of said nucleic acid in a second sample; and (c) determining whether a statistically-significant difference exists between enumerated numbers of said nucleic acid between said first sample and said second sample;
a statistically-significant difference being indicative of a variation in said nucleic acid between said first sample and said second sample.
2. A method for identifying a nucleic acid variation, the presence of which is indicative of a disease, the method comprising the steps of:
(a) enumerating a first number of a first nucleic acid in a sample obtained from a healthy member of a population;
(b) enumerating a second number of a second nucleic acid in a sample obtained from a member of said population having a disease; and (c) determining whether there is a statistically-significant difference between said first number and said second number, the presence of said difference being indicative that said nucleic acid variation is indicative of said disease.
(a) enumerating a first number of a first nucleic acid in a sample obtained from a healthy member of a population;
(b) enumerating a second number of a second nucleic acid in a sample obtained from a member of said population having a disease; and (c) determining whether there is a statistically-significant difference between said first number and said second number, the presence of said difference being indicative that said nucleic acid variation is indicative of said disease.
3. The method of claim 1, wherein said nucleic acid is a single deoxynucleotide.
4. The method of claim 3, wherein said single deoxynucleotide is a polymorphic locus.
5. The method of claim 2, wherein said disease is hereditary
6. The method of claim 2, wherein said disease is cancer.
7. The method of claim 6, wherein said disease is colorectal cancer.
8. A method for identifying a single base polymorphic locus as a diagnostic disease marker, the method comprising the steps of:
(a) obtaining a first sample comprising pooled genomic DNA from healthy members of an organism population;
(b) obtaining a second sample comprising pooled genomic DNA from diseased members of said population;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single nucleotide variant at a single base polymorphic locus in said first sample and an enumerated number of a single nucleotide variant at said locus in said second sample, said difference being indicative that said locus is a diagnostic marker of said disease.
(a) obtaining a first sample comprising pooled genomic DNA from healthy members of an organism population;
(b) obtaining a second sample comprising pooled genomic DNA from diseased members of said population;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single nucleotide variant at a single base polymorphic locus in said first sample and an enumerated number of a single nucleotide variant at said locus in said second sample, said difference being indicative that said locus is a diagnostic marker of said disease.
9. A method for identifying a genomic polymorphic variant, the presence of which is a diagnostic marker for a disease, the method comprising the steps of:
(a) determining a number of each of two or more variants at a single base polymorphic locus in pooled genomic DNA samples obtained from a statistically-significant number of members of a population; and (b) correlating each said number to the disease state of said member, a statistically-significant positive correlation between any of said variants and said disease state being indicative of a diagnostic marker for said disease.
(a) determining a number of each of two or more variants at a single base polymorphic locus in pooled genomic DNA samples obtained from a statistically-significant number of members of a population; and (b) correlating each said number to the disease state of said member, a statistically-significant positive correlation between any of said variants and said disease state being indicative of a diagnostic marker for said disease.
10. A method for determining the presence of disease in a patient comprising the steps of:
(a) identifying a genomic polymorphic variant correlated with a disease according to claim 9;
(b) determining whether the genomic polymorphic variant is present in a genomic DNA sample obtained from the patient, the presence of said polymorphic variant being indicative of the presence of said disease.
(a) identifying a genomic polymorphic variant correlated with a disease according to claim 9;
(b) determining whether the genomic polymorphic variant is present in a genomic DNA sample obtained from the patient, the presence of said polymorphic variant being indicative of the presence of said disease.
11. A method for identifying a single base polymorphic locus as a diagnostic marker of a loss of heterozygosity, the method comprising the steps of:
(a) obtaining a first sample comprising pooled genomic DNA from healthy members of an organism population;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population having a disease caused by a loss of heterozygosity in genomic DNA;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single nucleotide variant at a single base polymorphic locus in said first sample and an enumerated number of a single nucleotide variant at said locus in said second sample, said difference being indicative that said locus is a diagnostic marker of a loss of heterozygosity.
(a) obtaining a first sample comprising pooled genomic DNA from healthy members of an organism population;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population having a disease caused by a loss of heterozygosity in genomic DNA;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single nucleotide variant at a single base polymorphic locus in said first sample and an enumerated number of a single nucleotide variant at said locus in said second sample, said difference being indicative that said locus is a diagnostic marker of a loss of heterozygosity.
12. A method for identifying a single base polymorphic locus as a diagnostic marker for a mutation in genomic DNA, the method comprising the steps of:
(a) obtaining a first sample comprising pooled genomic DNA from healthy members of an organism population;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population having a disease caused by a mutation in genomic DNA;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single nucleotide variant at a single base polymorphic locus in said first sample and an enumerated number of a single nucleotide variant at said locus in said second sample, said difference being indicative that said locus is a diagnostic marker of a mutation in genomic DNA.
(a) obtaining a first sample comprising pooled genomic DNA from healthy members of an organism population;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population having a disease caused by a mutation in genomic DNA;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single nucleotide variant at a single base polymorphic locus in said first sample and an enumerated number of a single nucleotide variant at said locus in said second sample, said difference being indicative that said locus is a diagnostic marker of a mutation in genomic DNA.
13. The method of claim 10, wherein said disease is hereditary.
14. The method of claim 10, wherein said disease is cancer.
15. The method of claim 10, wherein said disease is colorectal cancer.
16. The method of claim 10, wherein said disease is hereditary non-polyposis colorectal cancer.
17. The method of claim 11, wherein the single nucleotide variant in said first sample and the single nucleotide variant in said second sample are the same.
18. The method of claim 11, wherein the single nucleotide variant in said first sample and the single nucleotide variant in said second sample are different
19. A method for determining the severity of a disease of a patient, the method comprising the steps of:
(a) determining a number of a genomic polymorphic variant, the presence of which is a diagnostic disease marker, at a single base polymorphic locus in a genomic DNA sample obtained from the patient;
(b) applying to said number a predetermined statistical relationship, said statistical relationship correlating numbers of said genomic polymorphic variants in a sample comprising pooled genomic DNA obtained from members of a population of having said disease, with the clinical severity of said disease; and (c). determining the clinical severity of said disease of the patient.
(a) determining a number of a genomic polymorphic variant, the presence of which is a diagnostic disease marker, at a single base polymorphic locus in a genomic DNA sample obtained from the patient;
(b) applying to said number a predetermined statistical relationship, said statistical relationship correlating numbers of said genomic polymorphic variants in a sample comprising pooled genomic DNA obtained from members of a population of having said disease, with the clinical severity of said disease; and (c). determining the clinical severity of said disease of the patient.
20. A method for identifying a single base polymorphic locus as a marker for a disease treatment, the method comprising the steps of:
(a) obtaining a first sample comprising pooled genomic DNA from members of an organism population, said members having undergone a successful treatment of said disease;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population, said members having undergone an unsuccessful treatment of said disease;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single base polymorphic variant at a single base polymorphic locus in said first sample and an enumerated number of a single base polymorphic variant at said locus in said second sample, said difference being indicative that said locus is a marker for a disease treatment.
(a) obtaining a first sample comprising pooled genomic DNA from members of an organism population, said members having undergone a successful treatment of said disease;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population, said members having undergone an unsuccessful treatment of said disease;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single base polymorphic variant at a single base polymorphic locus in said first sample and an enumerated number of a single base polymorphic variant at said locus in said second sample, said difference being indicative that said locus is a marker for a disease treatment.
21. A method for identifying a combination of nucleic acid variants, the presence of said combination being indicative of a marker for a disease treatment, said combination comprising one or more of said variants, the method comprising the steps of:
(a) determining a first set of numbers, said first set comprising an enumerated number for each of said one or more of said variants in a sample obtained from members of a population having undergone a successful treatment for a disease;
(b) determining a second set of numbers, said second set comprising an enumerated number for each of said one or more of said variants in a sample obtained from members of said population having undergone an unsuccessful treatment for said disease;
and (c) determining whether there is a statistically-significant difference between said first set of numbers and said second set of numbers, the presence of said difference being indicative that said combination of nucleic acid variants is indicative of a disease treatment.
(a) determining a first set of numbers, said first set comprising an enumerated number for each of said one or more of said variants in a sample obtained from members of a population having undergone a successful treatment for a disease;
(b) determining a second set of numbers, said second set comprising an enumerated number for each of said one or more of said variants in a sample obtained from members of said population having undergone an unsuccessful treatment for said disease;
and (c) determining whether there is a statistically-significant difference between said first set of numbers and said second set of numbers, the presence of said difference being indicative that said combination of nucleic acid variants is indicative of a disease treatment.
22. A method for identifying a single base polymorphic locus as a marker for toxicity of a pharmaceutical compound, the method comprising the steps of:
(a) obtaining a first sample comprising pooled genomic DNA from members of an organism population, said members having been administered a pharmaceutical compound without displaying an indication of a toxic effect;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population, said members having been administered a pharmaceutical compound and having displayed an indication of a toxic effect;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single base polymorphic variant at a single base polymorphic locus in said first sample and an enumerated number of a single base polymorphic variant at said locus in said second sample, said difference being indicative that said locus is a marker for toxicity of a pharmaceutical compound.
(a) obtaining a first sample comprising pooled genomic DNA from members of an organism population, said members having been administered a pharmaceutical compound without displaying an indication of a toxic effect;
(b) obtaining a second sample comprising pooled genomic DNA from members of said population, said members having been administered a pharmaceutical compound and having displayed an indication of a toxic effect;
(c) determining whether a statistically-significant difference exists between an enumerated number of a single base polymorphic variant at a single base polymorphic locus in said first sample and an enumerated number of a single base polymorphic variant at said locus in said second sample, said difference being indicative that said locus is a marker for toxicity of a pharmaceutical compound.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9818098A | 1998-06-16 | 1998-06-16 | |
US09/098,180 | 1998-06-16 | ||
PCT/US1999/013630 WO1999066077A2 (en) | 1998-06-16 | 1999-06-16 | Methods for the detection of nucleic acids |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2331254A1 true CA2331254A1 (en) | 1999-12-23 |
Family
ID=22267796
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002331254A Abandoned CA2331254A1 (en) | 1998-06-16 | 1999-06-16 | Methods for the detection of nucleic acids |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP1086247A2 (en) |
CA (1) | CA2331254A1 (en) |
WO (1) | WO1999066077A2 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000061808A2 (en) | 1999-04-09 | 2000-10-19 | Exact Laboratories, Inc. | Methods for detecting nucleic acids indicative of cancer |
US6849403B1 (en) | 1999-09-08 | 2005-02-01 | Exact Sciences Corporation | Apparatus and method for drug screening |
US6586177B1 (en) | 1999-09-08 | 2003-07-01 | Exact Sciences Corporation | Methods for disease detection |
US6919174B1 (en) | 1999-12-07 | 2005-07-19 | Exact Sciences Corporation | Methods for disease detection |
WO2001042781A2 (en) | 1999-12-07 | 2001-06-14 | Exact Sciences Corporation | Supracolonic aerodigestive neoplasm detection |
US9109256B2 (en) | 2004-10-27 | 2015-08-18 | Esoterix Genetic Laboratories, Llc | Method for monitoring disease progression or recurrence |
US9777314B2 (en) | 2005-04-21 | 2017-10-03 | Esoterix Genetic Laboratories, Llc | Analysis of heterogeneous nucleic acid samples |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2282446B (en) * | 1992-05-29 | 1996-10-02 | Gen Hospital Corp | Use of genetic markers to diagnose familial dysautonomia |
US5858659A (en) * | 1995-11-29 | 1999-01-12 | Affymetrix, Inc. | Polymorphism detection |
ES2220997T3 (en) * | 1995-12-22 | 2004-12-16 | Exact Sciences Corporation | METHODS FOR THE DETECTION OF CLONE POPULATIONS OF TRANSFORMED CELLS IN A GENOMICALLY HETEROGENOUS CELL SAMPLE. |
US5698399A (en) * | 1996-04-05 | 1997-12-16 | Duff; Gordon W. | Detecting genetic predisposition for osteoporosis |
-
1999
- 1999-06-16 WO PCT/US1999/013630 patent/WO1999066077A2/en not_active Application Discontinuation
- 1999-06-16 EP EP99930331A patent/EP1086247A2/en not_active Withdrawn
- 1999-06-16 CA CA002331254A patent/CA2331254A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP1086247A2 (en) | 2001-03-28 |
WO1999066077A3 (en) | 2000-02-24 |
WO1999066077A2 (en) | 1999-12-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6203993B1 (en) | Methods for the detection of nucleic acids | |
US6300077B1 (en) | Methods for the detection of nucleic acids | |
US6020137A (en) | Methods for the detection of loss of heterozygosity | |
US5928870A (en) | Methods for the detection of loss of heterozygosity | |
US5670325A (en) | Method for the detection of clonal populations of transformed cells in a genomically heterogeneous cellular sample | |
US5741650A (en) | Methods for detecting colon cancer from stool samples | |
AU711754B2 (en) | Methods for the detection of clonal populations of transformed cells in a genomically heterogeneous cellular sample | |
US6214558B1 (en) | Methods for the detection of chromosomal aberrations | |
US5952178A (en) | Methods for disease diagnosis from stool samples | |
WO2000009751A1 (en) | Diagnostic methods using serial testing of polymorphic loci | |
US20020004201A1 (en) | Methods for the detection of loss of heterozygosity | |
EP1566449A1 (en) | Use of haplotypes and SNPs in lipid-relevant genes for the analyses and diagnosis of cardiovascular diseases | |
CA2331254A1 (en) | Methods for the detection of nucleic acids | |
Prior et al. | Molecular probe protocol for determining carrier status in Duchenne and Becker muscular dystrophies | |
AU720489B2 (en) | Methods for detecting colon cancer from stool samples | |
Prior et al. | Rapid DNA haplotyping using a multiplex heteroduplex approach: application to Duchenne muscular dystrophy carrier testing | |
Cotton | Detection of mutations in DNA | |
EP1291657A2 (en) | Methods for detecting colon cancer from stool samples |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Dead |