EP1737978A1 - Nucleic acid sequencing - Google Patents
Info
- Publication number
- EP1737978A1 (application EP05718267A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- nucleic acid
- target nucleic
- sequence
- region
- acid sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6858—Allele-specific amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Definitions
- the present invention relates to a method of sequencing a nucleic acid. In particular, the invention relates to a method for determining a target nucleic acid sequence where the target nucleic acid sequence is comprised in a preparation comprising a non-target nucleic acid sequence.
- the invention also relates to a method for determining the haplotype of a subject.
- Determining the sequence of nucleic acids is of fundamental importance in many areas of biological research, clinical diagnosis and treatment. Sequencing of DNA is typically carried out by a method based on the Sanger dideoxy chain-termination method (Sanger, F., Nicklen, S., and Coulson, A. R. (1977) "DNA Sequencing with chain-terminating inhibitors" PNAS USA 74:5463-5467). In this method, a labelled oligonucleotide primer complementary to a known sequence adjacent to the target sequence is used to initiate DNA polymerase-catalysed elongation into the target sequence. Typically, four polymerase reactions are carried out for each round of sequencing.
- Each reaction contains all four deoxynucleotides (dNTPs - dCTP, dTTP, dGTP and dATP) plus a small amount of one dideoxynucleotide (ddNTP - ddCTP, ddTTP, ddGTP or ddATP). Because ddNTPs have no 3' hydroxyl group, elongation of the nascent strand is occasionally terminated by incorporation of a ddNTP. Thus the sequencing reaction produces a series of labelled strands whose lengths are indicative of the location of a particular base in the sequence.
- the resultant labelled strands are typically separated according to size by polyacrylamide gel electrophoresis and visualised by detecting the label, for example by autoradiography where the primer was radiolabelled. More recently, the Sanger sequencing method has been adapted in various ways, in particular for large-scale automated sequencing using multiple fluorescent labels and capillary gel electrophoresis.
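The chain-termination principle described above can be illustrated with a small, idealised sketch. It is not taken from the patent: the function names, the example template and the primer length are assumptions chosen for illustration, and the model assumes that every possible termination product is formed exactly once in the appropriate ddNTP reaction.

```python
# Illustrative sketch only: an idealised model of dideoxy chain termination.
# All names and the example template are hypothetical, not from the patent.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def sanger_fragment_lengths(template, primer_len):
    """Return, for each ddNTP reaction, the set of terminated strand lengths.

    `template` lists the template bases in the order the polymerase meets
    them downstream of the primer, so the nascent strand is the base-by-base
    complement of `template`.
    """
    lanes = {base: set() for base in "ACGT"}
    for offset, template_base in enumerate(template, start=1):
        incorporated = COMPLEMENT[template_base]
        # A strand terminated by this ddNTP has length primer + offset.
        lanes[incorporated].add(primer_len + offset)
    return lanes


def read_sequence(lanes):
    """Read the nascent strand by ordering terminated strands by length."""
    by_length = {
        length: base for base, lengths in lanes.items() for length in lengths
    }
    return "".join(by_length[length] for length in sorted(by_length))


lanes = sanger_fragment_lengths("TACGGT", primer_len=20)
print(read_sequence(lanes))  # ATGCCA
```

Ordering the terminated strands by length, as size separation on a gel does, recovers the base at each position of the newly synthesised strand.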
- One problem with sequencing methods based on the Sanger method occurs when the target nucleic acid to be sequenced is provided in a preparation comprising one or more different nucleic acids or sequences which show some sequence identity to the target sequence.
- the sequencing reaction will lead to products which are derived from primer binding to the second or further sequences, as well as the target sequence.
- the resultant gel or chromatograph will reveal two or more bases as being present at a particular location. Because the method does not allow discrimination between the products of the target sequence and the second or further sequences, the target sequence cannot be determined unambiguously.
- Where a polymorphism extends for two or more nucleotides, or where there are two or more polymorphic sites (alleles) separated by regions of common sequence, it is not possible to discern the sequence of the two alleles.
- standard sequencing methods are not able to determine the combination of alleles existing on a particular chromosome (the haplotype).
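The ambiguity described above can be made concrete with a short sketch. It is an illustration under simplifying assumptions (two equal-length sequences, no insertions or deletions, every base detected), and the function names and example sequences are hypothetical rather than taken from the patent.

```python
# Illustrative sketch only: why mixed base calls cannot be phased.
from itertools import product


def mixed_base_calls(seq_a, seq_b):
    """Per-position set of bases seen when two templates are sequenced together."""
    return [frozenset((a, b)) for a, b in zip(seq_a, seq_b)]


def consistent_haplotype_pairs(calls):
    """Enumerate every unordered pair of sequences giving the same mixed calls."""
    pairs = set()
    for choice in product(*[sorted(c) for c in calls]):
        first = "".join(choice)
        # The partner sequence carries the other base at each ambiguous site.
        second = "".join(
            next(iter(c - {b})) if len(c) == 2 else b
            for c, b in zip(calls, choice)
        )
        pairs.add(frozenset((first, second)))
    return pairs


calls = mixed_base_calls("ACGTA", "ATGCA")
for pair in sorted(sorted(p) for p in consistent_haplotype_pairs(calls)):
    print(pair)
```

With two heterozygous sites, two different pairs of sequences explain the same mixed signal, so the combination of alleles on each chromosome cannot be read from the mixed result alone.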
- SNPs (single nucleotide polymorphisms)
- HLA (human leukocyte antigen or human leukocyte associated antigen A) genotyping is one area where haplotyping is important. Determination of the two haplotype sequences of the HLA genes is crucial to the success of organ transplantation. The individual haplotypes of the donor must be matched with the recipient before transplantation to avoid rejection of the transplant. Methods for evaluating HLA allele types have been described in the past. One such method relies on performing family studies, which is very time-consuming. An alternative method based on DNA sequencing is disclosed in WO 97/23650. However, where heterozygous alleles exist, this method relies on prior knowledge of existing haplotype sequences, so that ambiguous bases can be ascribed to one allele or another.
- Methods of haplotyping used in the past rely on preparing a composition comprising only a single haplotype sequence before sequencing.
- One way of doing this is by converting a diploid cell into a haploid cell. This requires a high investment, is labour intensive and slow but gives complete haplotype separation.
- human chromosomes can be cloned into yeast in order to get a haploid for that particular chromosome. This suffers from the same drawbacks in terms of time and cost.
- One way of obtaining a preparation comprising only a single haplotype sequence is to amplify DNA by PCR using allele-specific primers.
- This type of approach for sequencing both alleles of a deletion polymorphism in intron 6 of the human dopamine 2 receptor gene (DRD2) is described in DNA Sequence Vol 6 (2), pp 87-94 (1996), Finck et al.
- allele-specific primers are used to amplify individual allele sequences by polymerase chain reaction (PCR).
- the primers are designed so that they produce amplicons of differing lengths, so that the products of each allele can be discriminated by agarose gel electrophoresis when both alleles are simultaneously amplified in the same reaction tube.
- the amplicons from each allele are then extracted from the gel and sequenced using conserved primers.
- the disadvantage of this approach is that it requires the prior knowledge of at least two, sufficiently separated regions of dissimilarity between the alleles so that appropriate allele-specific primers producing different-sized products can be designed.
- it requires a time-consuming gel separation and extraction step prior to sequencing.
- Biotinylated allele-specific oligonucleotide primers coupled to streptavidin-coated magnetic beads are used to amplify DNA from one haplotype by PCR, and then a conserved primer is used for solid-phase direct DNA sequencing.
- WO 92/15711 discloses a method for determining a major histocompatibility complex genotype of a subject in a sample containing nucleic acid.
- the method involves PCR amplification of the gene locus of interest, with all alleles for the gene locus to be sequenced being amplified with one conserved oligonucleotide primer pair and at least one allele for the gene locus being amplified with one conserved oligonucleotide primer and one non-conserved oligonucleotide primer.
- the amplicons for each allele are then sequenced with a conserved primer.
- a different method for determining haplotype sequences involves analysis of PCR amplified sequences covering a polymorphic region by hybridisation rather than sequencing.
- PCR amplicons are contacted with oligonucleotide probes complementary to the sequence of either the maternal or paternal chromosome in a region comprising an SNP.
- Probes complementary to the maternal or paternal chromosomes are immobilised in different areas of a solid phase.
- a second set of oligonucleotide probes, labelled in a different way and complementary to the sequence of either the maternal or paternal chromosome in a region comprising a second SNP, is then used to identify which sequence at the first SNP is on the same chromosome as a particular sequence at the second SNP.
- WO 00/20628 describes a method by which multiple genomic loci can be sequenced in the same reaction mixture. This method allows the sequencing of a second locus in the mixture by using primers which are longer than the longest product formed from the sequencing reaction in relation to a first locus. Different primers are used for each locus. However, this document does not disclose a method for haplotyping for particular alleles of a single locus.
- the present invention aims to overcome the disadvantages of the prior art.
- the present invention aims to provide an improved method of determining a target nucleic acid sequence, where the target nucleic acid is comprised in a preparation comprising a non-target nucleic acid which has regions of common and dissimilar sequence to the target nucleic acid.
- the present invention also aims to provide an improved method for determining the haplotype of a subject.
- the present invention provides a method for determining a target nucleic acid sequence, wherein the target nucleic acid sequence is comprised in a preparation comprising a non-target nucleic acid sequence, the target nucleic acid sequence and the non-target nucleic acid sequence each having a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the method comprising:
- the present invention provides a method for determining the haplotype of a subject from a sample comprising DNA from the subject, comprising a method as defined above, wherein the preparation comprises the sample, the target nucleic acid sequence comprises a locus on a first chromosome of a pair of chromosomes, the non-target nucleic acid sequence comprises the corresponding locus on the second chromosome of the pair, the locus comprising two or more single nucleotide polymorphisms for which the subject is heterozygous, wherein the sequencing reaction is conducted to determine the sequence of the polymorphic genetic locus on the first chromosome of the pair thereby determining the haplotype of the subject.
- the present invention provides use of pyrosequencing for determining the haplotype of a subject from a sample comprising DNA from the subject, wherein pyrosequencing is used to sequence a target locus on a first chromosome of a pair, the target locus comprising two or more single nucleotide polymorphisms, the corresponding locus on the second chromosome of the pair being blocked from sequencing by a blocking oligonucleotide hybridised to the second chromosome.
- the present invention provides an improved method of sequencing a target nucleic acid sequence comprised in a preparation comprising a different but related nucleic acid sequence.
- the method advantageously allows the sequencing reaction to proceed in relation to the target nucleic acid sequence, while the sequencing reaction between the primer and the other nucleic acid sequence is blocked by the blocking oligonucleotide.
- the sequence data which is obtained is therefore derived only from the target nucleic acid sequence, as interference from the other nucleic acid sequence is removed.
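The effect of the blocking oligonucleotide can be sketched as a simple filter over the templates in the preparation. This is a toy model under stated assumptions, not the claimed method: it assumes the blocker acts at the first region of dissimilar sequence, writes all sequences on one strand so that hybridisation is modelled as exact string equality, and uses hypothetical names and example sequences throughout.

```python
# Illustrative sketch only: blocking modelled as a filter on templates.
# Assumptions (not from the patent): the blocker matches the first region of
# dissimilar sequence, and hybridisation is modelled as exact string equality.
from dataclasses import dataclass


@dataclass(frozen=True)
class Template:
    name: str
    common: str        # first region of common sequence (primer site)
    dissimilar_1: str  # first region of dissimilar sequence
    dissimilar_2: str  # second region of dissimilar sequence


def is_blocked(template, blocker):
    """Assume the blocker hybridises only on an exact match to dissimilar_1."""
    return template.dissimilar_1 == blocker


def sequencing_products(preparation, primer, blocker):
    """Return the downstream read from every primed, unblocked template."""
    reads = {}
    for t in preparation:
        if t.common.endswith(primer) and not is_blocked(t, blocker):
            reads[t.name] = t.dissimilar_1 + t.dissimilar_2
    return reads


preparation = [
    Template("target", "GGATCCAT", "ACCTG", "TTAGC"),
    Template("non_target", "GGATCCAT", "ACGTG", "TTCGC"),
]
print(sequencing_products(preparation, primer="ATCCAT", blocker="ACGTG"))
```

The same primer anneals to both templates, but only the unblocked template contributes a read, so both dissimilar regions are read from a single molecule.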
- the method is a fast and efficient way of discriminating between the two sequences. In particular, the method is advantageous because a sequence-specific sequencing primer does not have to be constructed for each target nucleic acid sequence.
- the method also does not suffer from problems relating to lack of discrimination in primer hybridisation to closely-related sequences.
- the method also provides an enhanced method for haplotyping.
- the method enables the rapid determination of allele associations to identify individually the two haplotype sequences present at a particular locus in a subject.
- the method is particularly advantageous in identifying associations of SNPs and in HLA genotyping. In particular, the method avoids the need for time-consuming family studies or prior knowledge of allele associations.
- the target nucleic acid sequence of the present invention is not particularly limited. Suitable target nucleic acid sequences include a deoxyribonucleic acid (DNA) sequence, a ribonucleic acid (RNA) sequence, or a DNA or RNA sequence comprising one or more modified nucleotides or bases, or one or more artificial nucleotides or bases.
- the second nucleic acid sequence is likewise not particularly limited, and may be a DNA or RNA sequence, optionally comprising one or more modified nucleotides or bases.
- the target nucleic acid sequence and/or the non-target nucleic acid sequence is a DNA sequence.
- the DNA sequence may be a genomic DNA or cDNA sequence. Each sequence is preferably a human DNA sequence.
- the target nucleic acid sequence may be comprised in the same nucleic acid polymer as the non-target nucleic acid.
- the two nucleic acid sequences are preferably on separate DNA molecules. More preferably the target nucleic acid sequence and the non-target nucleic acid sequence each comprise a different allele at a polymorphic genetic locus in a subject.
- the target nucleic acid sequence comprises the locus on one chromosome of a pair (maternal or paternal) and the non-target nucleic acid sequence comprises the locus on the other chromosome of the pair.
- the preparation comprises a target nucleic acid sequence and a non-target nucleic acid sequence.
- Suitable preparations include any preparation comprising two or more nucleic acid sequences, provided that at least two of the nucleic acid sequences share a region of common sequence but differ in a region of dissimilar sequence.
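The suitability condition above can be expressed as a small check. This is a sketch under assumptions that go beyond the text: the sequences are already aligned from the same start position, the region of common sequence is taken to be a shared prefix, and `min_common` is a hypothetical parameter for the minimum length of that shared region.

```python
# Illustrative sketch only: does a preparation contain at least two sequences
# that share a common region but differ in a dissimilar region?
# Assumption: sequences are aligned and the common region is a shared prefix.
from itertools import combinations


def common_prefix_length(seq_a, seq_b):
    """Number of leading positions at which the two sequences are identical."""
    n = 0
    for a, b in zip(seq_a, seq_b):
        if a != b:
            break
        n += 1
    return n


def is_suitable_preparation(sequences, min_common=1):
    """True if some pair shares a prefix and then shows a real mismatch."""
    for seq_a, seq_b in combinations(sequences, 2):
        shared = common_prefix_length(seq_a, seq_b)
        has_mismatch = shared < min(len(seq_a), len(seq_b))
        if shared >= min_common and has_mismatch:
            return True
    return False


print(is_suitable_preparation(["GGATCCACCT", "GGATCCACGT"]))  # True
print(is_suitable_preparation(["GGAT", "GGAT"]))              # False
```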
- the preparation comprises a purified DNA preparation.
- the preparation is preferably prepared from a sample derived from a single human subject.
- the preparation may be a sample of human saliva, blood, urine or other tissue, or a DNA preparation comprising genomic DNA which has been prepared from such a sample.
- the preparation comprises one or more further nucleic acid sequences, wherein each further nucleic acid sequence has a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence.
- common sequence means that the sequence of the further nucleic acid sequence is identical to the target and non-target nucleic acid sequences in this region.
- dissimilar sequence means that the sequence of the further nucleic acid is different from the target and/or non-target nucleic acid sequences in this region.
- the method may include a step of blocking the sequencing reaction between the primer and one or more of the further nucleic acid sequences.
- the sequencing reaction between the primer and the further nucleic acid sequences may be blocked in the same way as for the sequencing reaction between the primer and the non-target nucleic acid sequence. If it is desired to obtain sequencing reaction products derived only from the target nucleic acid sequence, the sequencing reaction between the primer and each of the further nucleic acid sequences may be blocked.
- the sequencing reaction between the primer and only some of the further nucleic acid sequences may be blocked.
- sequencing from particular further nucleic acid sequences may be selectively blocked or allowed to proceed.
- This type of analysis may be termed "multiplexing". Multiplexing permits the analysis of multiple sites in an individual sample or a number of samples from different individuals.
- the preparation comprises DNA derived from samples taken from two or more individuals. For instance, a number of DNA preparations derived from different individuals in a group may be combined and the method described herein carried out on the combined preparation. This method may be used to assess whether or not a particular combination of SNPs is found together on a single chromosome in all individuals within the group.
- the sequencing reaction will yield a single sequence. If not, the sequencing reaction will indicate alternative bases at the position of one or more SNPs in the sequence. If it is then desired to determine which combination of SNPs was present in which individual, it would be necessary to repeat the method on separate DNA preparations from each individual.
- more than one target nucleic acid sequence may be determined using a single sequencing reaction.
- the present method is performed in parallel using two or more oligonucleotide primers, each of which is complementary to a different sequence. In this way, two or more polymorphic sites may be analysed simultaneously.
- Each target nucleic acid sequence shares a first region of common sequence with a corresponding non-target nucleic acid sequence.
- the sequencing reaction between each primer and the non-target nucleic acid sequence to which it is complementary is blocked, so that the sequencing reaction proceeds fully only in respect of the target nucleic acid sequences.
- the invention relates to a method for determining a plurality of target nucleic acid sequences, wherein the plurality of target nucleic acid sequences is comprised in a preparation further comprising a plurality of corresponding non-target nucleic acid sequences, each target nucleic acid sequence in the preparation corresponds to one or more corresponding non-target nucleic acid sequences in the preparation, each target nucleic acid sequence and each corresponding non-target nucleic acid sequence has a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the first region of common sequence of each target nucleic acid sequence is the same as the first region of common sequence of its corresponding non-target nucleic acid sequences, the first region of dissimilar sequence of each target nucleic acid sequence is different to the first region of dissimilar sequence of its corresponding non-target nucleic acid sequences, the second region of dissimilar sequence of each target nucleic acid sequence is different to the second region of dissimilar sequence of its
- each blocking oligonucleotide is complementary to at least a portion of the first region of dissimilar sequence of a non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto;
- each primer is complementary to at least a portion of the first region of common sequence of a target nucleic acid sequence and its corresponding non-target nucleic acid sequence, under conditions to hybridise the primer thereto;
- blocking oligonucleotides block the sequencing reaction at least from proceeding into the second region of dissimilar sequence of each corresponding non-target nucleic acid sequence.
- sequencing reaction products are obtained which are derived from more than one target nucleic acid sequence.
- a method is therefore required in order to discriminate between sequencing reaction products derived from each target nucleic acid sequence. This may be done by labelling the sequencing reaction products derived from each nucleic acid sequence in which the sequencing reaction is allowed to proceed with a distinct label.
- the sequencing reaction products derived from each target nucleic acid sequence may be distinguished by differentially labelling each oligonucleotide primer.
- each primer is labelled with a fluorescent label which fluoresces at a different wavelength. Sequencing products derived from each target nucleic acid sequence may then be distinguished for example using an automated sequencer following gel electrophoresis.
- one or more primers is labelled with one part of a ligand-affinant pair.
- a preferred ligand-affinant pair is biotin-streptavidin. The ligand-affinant interaction may be used in order to bind sequencing products derived from one target nucleic acid sequence to a solid phase (such as magnetic beads), thereby separating the labelled sequencing products from non-labelled sequencing products.
- the labelled and non-labelled sequencing products may then be separately subjected to gel electrophoresis.
- In embodiments where two primers are used (in order to sequence two different target nucleic acid sequences), only one primer need be labelled in order to separate the sequencing products derived from each of the target nucleic acid sequences. In embodiments where 3 or more primers are used, 2 or more of the primers need to be labelled. In this case, a different ligand-affinant pair needs to be selected for each primer to be labelled, so that the sequencing products derived from each target nucleic acid sequence can be bound to a different solid phase and thereby separated. In general, where n primers are used, n-1 primers need to be labelled.
- Each of the two nucleic acid sequences includes a first region of common sequence.
- the target nucleic acid sequence is identical to the non-target nucleic acid sequence in this region.
- the method advantageously allows the sequencing of only the target nucleic acid sequence, despite the fact that a generic primer which is complementary to the region of common sequence (and which would hybridise to both nucleic acid sequences in the absence of the blocking oligonucleotide) is used.
- the first region of common sequence preferably comprises a length of at least 10 nucleotides, more preferably at least 20 nucleotides.
- the first region of common sequence is upstream of a first region of dissimilar sequence.
- the first region of dissimilar sequence is upstream of a second region of dissimilar sequence.
- upstream it is meant upstream in terms of the direction of sequencing.
- the sequencing primer first hybridises to a region comprising at least a portion of the first region of common sequence.
- the first region of dissimilar sequence acts as a template for primer extension before the second region of dissimilar sequence. Because primer extension typically proceeds in the 5' to 3' direction (nucleotides are added at the 3' end of the primer), the first region of common sequence typically lies 3' to the first region of dissimilar sequence, and the first region of dissimilar sequence typically lies 3' to the second region of dissimilar sequence.
- region of dissimilar sequence it is meant that the target nucleic acid sequence is different from the non-target nucleic acid sequence in this region. In one embodiment the first and second regions of dissimilar sequence are contiguous, that is the second region of dissimilar sequence immediately follows the first region of dissimilar sequence with no intervening region of common sequence. In an alternative embodiment, the first and second dissimilar sequences are separated by a second region of common sequence.
- the target nucleic acid sequence and the non-target nucleic acid sequence comprise one or more further regions of dissimilar sequence. For instance, there may be a third, fourth, fifth or subsequent regions of dissimilar sequence downstream of the second region of dissimilar sequence. However, there must be at least two regions of dissimilar sequence. Each region of dissimilar sequence is separated by a further region of common sequence. The method permits the determination of the sequence of the target nucleic acid sequence downstream of the second region of dissimilar sequence as far as the sequencing reaction is capable of proceeding.
- the length of the first and second regions of dissimilar sequence is not particularly limited. Any length of dissimilar sequence may be used from a single nucleotide upwards. In a preferred embodiment, either or both regions of dissimilar sequence comprises an SNP.
- the present method comprises a step of contacting the preparation with a blocking oligonucleotide complementary to a sequence comprising the first region of dissimilar sequence of the non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto.
- the blocking oligonucleotide is typically a single-stranded DNA 5 to 50 nucleotides in length, preferably 10 to 50 nucleotides, preferably 10 to 40 nucleotides in length, more preferably 15 to 35 nucleotides in length and most preferably 15 to 25 nucleotides in length.
- the blocking oligonucleotide therefore contains at least one base which is non-complementary to the target nucleic acid sequence. It is important that hybridisation conditions are selected, at least in step (a), so that the blocking oligonucleotide hybridises to the non-target nucleic acid sequence but not to the target nucleic acid sequence. Where there is only a single base difference between the target and non-target nucleic acid sequence within the region to which the blocking oligonucleotide binds, the hybridisation conditions, and in particular the hybridisation temperature, must be selected particularly carefully. If the temperature selected is too high, insufficient blocking of the non-target nucleic acid sequence may occur. If the temperature selected is too low, the blocking oligonucleotide may also hybridise to the target nucleic acid sequence and prevent the sequencing reaction proceeding in respect of the target.
- Hybridisation conditions for step (a) may be selected according to criteria well known to those skilled in the art.
- An appropriate temperature and salt content for hybridisation needs to be selected according to the length of the blocking oligonucleotide and its G-C content, amongst other things (Old & Primrose (1994), Principles of Gene Manipulation, Blackwell Science and Maniatis et al. (1992), Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, New York).
- the hybridisation temperature should be close to the melting temperature (T m ) of the oligonucleotide.
- T m is defined as the temperature at which the oligonucleotide and its target are 50% dissociated, and may be calculated according to the "Wallace rule" by the following formula:
- T m = 4 × (number of G:C base-pairs) + 2 × (number of A:T base-pairs)
- the hybridisation temperature should be within 2°C of T m .
- the T m is about 60°C and a suitable hybridisation temperature would be 58°C.
- the blocking oligonucleotide inhibits the sequencing of the non-target nucleic acid sequence by the sequencing primer.
- the blocking oligonucleotide must therefore not act as a primer itself for sequencing of the non-target nucleic acid sequence.
- One way of preventing this is to use a blocking oligonucleotide having no 3' hydroxy group, for instance by adding a dideoxynucleotide at the 3' position during synthesis of the oligonucleotide.
- step (a) further comprises a step of contacting the preparation with a terminator nucleotide.
- a particular terminator nucleotide such as ddATP, ddCTP, ddGTP or ddTTP
- the terminator nucleotide becomes incorporated into the blocking oligonucleotide only when hybridised to the non-target nucleic acid sequence.
- the blocking oligonucleotide is chosen such that the base at its 3' terminus is complementary to a base within the first region of dissimilar sequence of the non-target nucleic acid sequence. This helps to ensure that the terminator does not become incorporated into any blocking oligonucleotide which might be hybridised to the target nucleic acid sequence.
- the blocking oligonucleotide may block sequencing of the non-target nucleic acid sequence in one of two ways. Firstly, if the blocking oligonucleotide is selected such that it binds to a region overlapping the first region of dissimilar sequence and the first region of common sequence (to which the sequencing primer is complementary), it will inhibit sequencing primer binding to the non-target nucleic acid sequence. Alternatively, the blocking oligonucleotide may be selected such that it binds to a region which is downstream (in terms of the direction of sequencing) from the sequencing primer binding site. In this case, the sequencing primer will bind to both nucleic acid sequences, but extension of the primer bound to the non-target nucleic acid sequence will be inhibited.
- the terminator nucleotide is capable of covalently cross-linking the primer to the non-target nucleic acid sequence.
- a terminator nucleotide comprising Peptide Nucleic Acid (PNA) and (L-ribo-)Locked Nucleic Acid (LNA) nucleotides, described in WO 95/15974 and WO 00/66604 respectively, can be used to block sequencing of the non-target nucleic acid.
- step (a) further comprises contacting the preparation with a cleavage agent, under conditions to cleave the non-target nucleic acid sequence within the sequence hybridised to the blocking oligonucleotide.
- a cleavage agent is selected that introduces strand breaks only into double-stranded DNA. If a single-stranded DNA preparation is used, only the non-target nucleic acid sequence will be cleaved provided that the blocking oligonucleotide does not hybridise to the target nucleic acid sequence.
- the cleavage agent is preferably a restriction endonuclease.
- a restriction endonuclease which recognises a sequence comprising the first region of dissimilar sequence of the non-target nucleic acid sequence.
- a restriction endonuclease may be used which recognises a sequence common to both the target and non-target nucleic acid sequence, provided that the recognition sequence is within the binding site of the blocking oligonucleotide. Accordingly, the restriction endonuclease may recognise a site within the first or second regions of common sequence.
- the blocking oligonucleotide is extended by polymerisation far enough in order to allow cleavage of the non-target nucleic acid sequence at a recognition site downstream of the blocking oligonucleotide binding site. In this case, the blocking oligonucleotide is preferably extended stepwise by the addition of individual nucleotides so that the degree of extension can be controlled.
- the restriction endonuclease is not particularly limited provided that it recognises a defined DNA sequence, and a suitable endonuclease may be selected according to the presence of known recognition sites at an appropriate location in the non-target nucleic acid sequence.
- the restriction endonuclease is preferably a type II restriction endonuclease.
- the cleavage agent comprises a chemical cleavage agent.
- a standard sequencing reaction can be performed in step (c).
- a sequencing reaction utilises an electronic thermocycler, in order to allow a number of cycles of primer hybridisation to the target nucleic acid sequence, elongation by a polymerase and separation of extended products from the template.
- Four separate sequencing reactions may be performed, each containing one dideoxy terminator (ddATP, ddCTP, ddGTP or ddTTP) and the products visualised in separate lanes by polyacrylamide gel electrophoresis and autoradiography.
- dye terminators comprising fluorescent labels are employed, wherein the labels fluoresce at different wavelengths to indicate each particular terminator nucleotide, a single sequencing reaction can be used.
- the blocking oligonucleotide is not covalently crosslinked to the non-target nucleic acid sequence, it is important to ensure that the blocking oligonucleotide does not separate from the non-target nucleic acid sequence during the sequencing reaction, as this would allow sequencing of the non-target nucleic acid sequence. Accordingly, in this embodiment, it is preferable to maintain the temperature of the sequencing reaction below the denaturation temperature of the blocking oligonucleotide/non-target nucleic acid complex.
- the preparation can first be heated to an elevated temperature, such as 95°C in order to separate the DNA strands.
- the preparation is then typically cooled to a suitable hybridisation temperature for the blocking oligonucleotide (such as 60°C for a 20-mer oligonucleotide with 50% G-C content).
- the sequencing reaction is then performed at a constant temperature (such as) without thermocycling.
- the method comprises a step of contacting a preparation with a sequencing primer complementary to at least a portion of the first region of common sequence.
- the primer is capable under suitable conditions (and in the absence of any blocking agent) of hybridising to both the target nucleic acid sequence and the non-target nucleic acid sequence.
- the primer is complementary to a sequence which is found entirely within the first region of common sequence. This means that the hybridisation site of the primer has an identical sequence in both the target and non-target nucleic acid sequence.
- a primer may be used which is capable of hybridising to a sequence a part of which differs between the target nucleic acid sequence and the non-target nucleic acid sequence. In this embodiment, the primer may be fully complementary to a sequence found in either the target or non-target nucleic acid sequence, but a part of the primer may not be complementary to the other nucleic acid sequence. Thus, only a part of the primer is capable of hybridising to one of the nucleic acid sequences.
- a mixed primer may be used such that the primer contains two species, a first species complementary to the target nucleic acid sequence and a second species complementary to the non-target nucleic acid sequence.
- the difference in sequence between the target and non-target nucleic acid sequence in the region to which the primer hybridises preferably should be limited to one or two nucleotides, more preferably one nucleotide. The differences should also be located in a region of the nucleic sequences towards which the 5' end of the primer hybridises. If mismatches are located near the 3' end of the primer, it is more likely that polymerisation will be inhibited.
- the primer is not capable of selectively hybridising only to one of the two nucleic acid sequences. If that were the case, it would be unnecessary to perform a blocking step, because sequencing would proceed only from one of the two nucleic acid sequences.
- the nature of the primer is not particularly limited, provided that it is capable of initiating a sequencing reaction when hybridised to the target nucleic acid.
- the primer is a single-stranded DNA.
- the length of the primer is preferably 10 to 50 nucleotides, more preferably 10 to 40 nucleotides and most preferably 15 to 30 nucleotides.
- Suitable primers may be designed according to standard techniques known to those skilled in the art for selecting primers for polymerase reactions, such as for sequencing and for amplification of DNA by the polymerase chain reaction (PCR).
- the preparation is contacted with the sequencing primer, typically by adding an aqueous solution of the primer to a preparation containing a suitable amount of DNA.
- Hybridisation conditions are then selected so that the primer hybridises to the first region of common sequence of the DNA according to criteria well known to those skilled in the art, and as discussed above in relation to the blocking oligonucleotide. It is important that if the blocking oligonucleotide is not cross-linked to the non-target nucleic acid sequence, the temperature is not raised sufficiently to separate the blocking oligonucleotide from the non-target nucleic acid sequence.
- a blocking oligonucleotide and sequencing primer are selected such that they have a similar T m .
- the sequencing reaction may be any type of nucleic acid sequencing reaction, provided that it involves extension or elongation of the primer when hybridised to a nucleic acid sequence.
- Primer extension is typically performed using a DNA polymerase, such as Thermus aquaticus or Pfu DNA polymerase for reactions involving a high-temperature step, or other suitable DNA polymerases where there is no high-temperature step.
- the sequencing reaction comprises real-time sequencing such as pyrosequencing.
- the sequencing reaction comprises Sanger sequencing using dideoxynucleotides.
- the sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequence.
- the blocking oligonucleotide prevents the production of sequencing products from non-target nucleic acid, so that in the second region of dissimilar sequence, the only product that is seen is derived from the target nucleic acid sequence. This allows the target nucleic acid sequence to be determined, because the interference from the non-target nucleic acid sequence is removed.
- the method also allows a particular sequence in the first region of dissimilar sequence to be determined as being associated with a particular sequence in the second region of dissimilar sequence, by intentionally blocking the sequencing reaction when a particular nucleotide is present at the first region of dissimilar sequence.
- Unincorporated terminator nucleotide is then removed, either by washing (especially if the nucleic acid is linked to a solid support) or by the use of a nucleotide-degrading enzyme, such as apyrase.
- the preparation is then subjected to a sequencing reaction, without allowing the blocking oligonucleotide to separate from the non-target nucleic acid. In this way, no sequencing reaction proceeds in respect of the non-target nucleic acid sequence.
- the target nucleic acid sequence is free to allow primer extension and the sequencing reaction proceeds only in respect of the target nucleic acid sequence.
- the sequencing reaction comprises a method of sequencing based on the detection of the release of pyrophosphate.
- Applicable methods are disclosed in WO 98/28440 and in Science (1998) Vol 281, pages 363 to 365, the contents of which are incorporated herewith by reference. Such methods have been termed "pyrosequencing".
- the nucleic acid to be sequenced is incubated with the primer, DNA polymerase, ATP sulfurylase, firefly luciferase and a nucleotide-degrading enzyme such as apyrase.
- nucleotides are added stepwise, wherein a nucleotide will only become incorporated into the growing DNA strand and release pyrophosphate (PPi) if it is complementary to the base in the template strand. Any release of PPi is detected enzymically, for example by an enzyme cascade resulting in the production of light which is detected in a suitable light-sensitive device such as a luminometer or a charge-coupled device camera. Unincorporated nucleotides are degraded between each cycle by the nucleotide-degrading enzyme, so that after the first nucleotide has been degraded, the next nucleotide can be added. As this procedure is repeated, longer stretches of the template sequence are deduced.
- PPi pyrophosphate
- a method based on the detection of the release of pyrophosphate, involving the stepwise addition of nucleotides and real-time detection of their incorporation, is preferred for performing the sequencing reaction according to the present invention, because it does not require a step of heating which would separate the blocking oligonucleotide from the non-target nucleic acid sequence.
- Pyrosequencing is preferably performed using a single-stranded template, which may be suitably prepared by biotin capture of one strand on magnetic beads.
- the single-stranded template may be free in solution or immobilised on a solid support.
- a double-stranded DNA template may be employed if the enzymes used in the method are thermostable.
- a single heating step is used to denature the double-stranded DNA, followed by a step in which the primer is allowed to anneal. Following the blocking step the extending primer is not separated from its template.
- the method involves determining the combination of individual SNPs which exist in a particular region on one chromosome of a pair in a subject. Determining the association of alleles such as SNPs is termed haplotyping.
- each of the first and second regions of dissimilar sequence comprise a single nucleotide.
- the target nucleic acid sequence comprises a particular locus (such as a particular gene, part of a gene or regulatory element) on one chromosome of a pair in the individual subject, and the non-target nucleic acid sequence comprises the corresponding sequence on the other chromosome in the pair.
- the locus comprises two or more SNPs.
- the first and second regions of common sequence comprise parts of the locus which are non-polymorphic between the two chromosomes.
- haplotype for chromosome A and its pair chromosome A'
- haplotype dideoxyguanosine triphosphate is added to the preparation so that it becomes incorporated into the chromosome A' which bears a C at SNP-1. Sequencing then proceeds only on chromosome A. If the sequencing results indicate a G at SNP-
- HLA genotyping is one area where haplotyping is particularly useful. Genotyping of the two haplotypes of the HLA genes is crucial to the success of the transplantation of organs and bone marrow.
- the locus comprises a human Class I or Class II HLA gene.
- Fig. 1 shows a target nucleic acid 1 and a non-target nucleic acid 2.
- the target nucleic acid and non-target nucleic acid each have a first region of common sequence
- Figure 2 shows a blocking oligonucleotide (B) which is complementary to at least a portion of the first region of dissimilar sequence of the non-target nucleic acid sequence and which hybridises thereto.
- Figure 3 shows a sequencing primer (12) which is complementary to the first region of common sequence and which hybridises thereto.
- a sequencing reaction proceeds in the direction of the arrow 13, such that the primer 12 is extended in the direction of the arrow using the target nucleic acid sequence as a template.
- the blocking oligonucleotide (B) blocks the sequencing reaction at least from proceeding into the second region of dissimilar sequence of the non-target nucleic acid sequence.
- Figure 4 shows sequencing reaction products (14 to 18) resulting from extension of the primer using the target nucleic acid as a template.
- the sequencing reaction proceeds at least as far as the second region of dissimilar sequence.
- Figure 5 shows a sequencing reaction product (19) resulting from extension of the primer using the non-target nucleic acid as a template.
- the sequencing reaction does not proceed as far as the second region of dissimilar sequence.
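
The "Wallace rule" quoted above for estimating the melting temperature of a blocking oligonucleotide or primer can be checked with a few lines of code. The following is a minimal sketch only: the function name `wallace_tm` and the example 20-mer are illustrative assumptions and do not come from the patent text.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate T m in degrees C using the Wallace rule: 4 x (G+C) + 2 x (A+T)."""
    seq = oligo.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("oligo must be a non-empty string of A, C, G and T")
    gc = seq.count("G") + seq.count("C")  # number of G:C base-pairs
    at = len(seq) - gc                    # number of A:T base-pairs
    return 4 * gc + 2 * at


# Hypothetical 20-mer with 50% G-C content (10 G/C and 10 A/T bases).
example_oligo = "ATGCATGCATGCATGCATGC"
print(wallace_tm(example_oligo))  # 60
```

For this assumed 20-mer the estimate is 60°C, consistent with the example given above of a T m of about 60°C and a hybridisation temperature of 58°C. The rule is a rough approximation for short oligonucleotides and ignores salt concentration.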
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
A method for determining a target nucleic acid sequence, wherein the target nucleic acid sequence is comprised in a preparation comprising a non-target nucleic acid sequence, the target nucleic acid sequence and the non-target nucleic acid sequence each having a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the method comprising: (a) contacting the preparation with a blocking oligonucleotide complementary to at least a portion of the first region of dissimilar sequence of the non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto; (b) contacting the preparation with a sequencing primer complementary to at least a portion of the first region of common sequence, under conditions to hybridise the primer to the target nucleic acid sequence; and (c) subjecting the preparation to a sequencing reaction, such that the sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequence, thereby determining at least the second region of dissimilar sequence of the target nucleic acid sequence; and wherein the blocking oligonucleotide blocks the sequencing reaction at least from proceeding into the second region of dissimilar sequence of the non-target nucleic acid sequence.
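
The logic of steps (a) to (c) can be illustrated with a toy simulation. This is a sketch under strong simplifying assumptions, not the claimed method itself: sequences are written in the direction of sequencing, hybridisation is modelled as an exact substring match, and the names `simulate_reads`, `COMMON` and the example sequences are hypothetical.

```python
from typing import Dict, Optional


def simulate_reads(templates: Dict[str, str], primer_site: str,
                   blocker_site: Optional[str] = None) -> Dict[str, str]:
    """Return the sequence read downstream of the primer site for each template.

    Extension stops where the blocking oligonucleotide site begins, if that
    site is present downstream of the primer on a given template.
    """
    reads = {}
    for name, seq in templates.items():
        start = seq.find(primer_site)
        if start < 0:
            continue  # primer does not hybridise to this template
        downstream = seq[start + len(primer_site):]
        stop = downstream.find(blocker_site) if blocker_site else -1
        reads[name] = downstream if stop < 0 else downstream[:stop]
    return reads


# Assumed layout: common region, first dissimilar base, second common region,
# second dissimilar base, trailing common sequence.
COMMON = "ACGTACGTAC"
templates = {
    "target": COMMON + "G" + "TTAGC" + "A" + "GGA",
    "non_target": COMMON + "C" + "TTAGC" + "T" + "GGA",
}

# The blocker site spans the first dissimilar base of the non-target only.
print(simulate_reads(templates, primer_site="ACGTAC", blocker_site="ACCTT"))
```

In this sketch the read from the target extends through the second dissimilar base, while the read from the non-target stops before its first dissimilar base, so only the target contributes signal at the second dissimilar position.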
Description
NUCLEIC ACID SEQUENCING
The present invention relates to a method of sequencing a nucleic acid. In particular, the invention relates to a method for determining a target nucleic acid sequence where the target nucleic acid sequence is comprised in a preparation comprising a non-target nucleic acid sequence. The invention also relates to a method for determining the haplotype of a subject.
The sequencing of nucleic acids, in particular DNA, is of fundamental importance in many areas of biological research, clinical diagnosis and treatment. Sequencing of DNA is typically carried out by a method based on the Sanger dideoxy chain-termination method (Sanger, F., Nicklen, S., and Coulson, A. R. (1977) "DNA Sequencing with chain-terminating inhibitors" PNAS USA 74:5463-5467). In this method, a labelled oligonucleotide primer complementary to a known sequence adjacent to the target sequence is used to initiate DNA polymerase-catalysed elongation into the target sequence. Typically, four polymerase reactions are carried out for each round of sequencing. Each reaction contains all four deoxynucleotides (dNTPs - dCTP, dTTP, dGTP and dATP) plus a small amount of one dideoxynucleotide (ddNTP - ddCTP, ddTTP, ddGTP or ddATP). Because ddNTPs have no 3' hydroxyl group, elongation of the nascent strand is occasionally terminated by incorporation of a ddNTP. Thus the sequencing reaction produces a series of labelled strands whose lengths are indicative of the location of a particular base in the sequence. The resultant labelled strands are typically separated according to size by polyacrylamide gel electrophoresis and visualised by detecting the label, for example by autoradiography where the primer was radiolabelled. More recently, the Sanger sequencing method has been adapted in various ways, in particular for large-scale automated sequencing using multiple fluorescent labels and capillary gel electrophoresis.
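
The way strand lengths encode base positions in the four-reaction format can be sketched as follows. This is an idealised illustration that assumes every possible termination product is formed and detected; the function names `sanger_ladders` and `read_gel` and the example sequence are hypothetical.

```python
from typing import Dict, List


def sanger_ladders(extension: str) -> Dict[str, List[int]]:
    """For each ddNTP reaction, list the extension lengths at which chains terminate."""
    ladders: Dict[str, List[int]] = {base: [] for base in "ACGT"}
    for length, base in enumerate(extension, start=1):
        ladders[base].append(length)  # a chain of this length ends in this ddNTP
    return ladders


def read_gel(ladders: Dict[str, List[int]]) -> str:
    """Read the sequence by ordering all bands from shortest to longest."""
    by_length = {n: base for base, lengths in ladders.items() for n in lengths}
    return "".join(by_length[n] for n in sorted(by_length))


ladders = sanger_ladders("GATTACA")  # newly synthesised strand, 5' to 3'
print(ladders)
print(read_gel(ladders))
```

Each reaction (lane) contributes the band lengths for one base, and reading all four lanes from the shortest band to the longest recovers the sequence of the extended strand.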
One problem with sequencing methods based on the Sanger method occurs when the target nucleic acid to be sequenced is provided in a preparation comprising one or more different nucleic acids or sequences which show some sequence identity to the target sequence. In particular, if a primer-binding sequence is found in both the target sequence and a second or further sequences, the sequencing reaction will lead to products which are derived from primer binding to the second or further sequences, as well as the target sequence. Where the target sequence diverges from the second or further sequences, the resultant gel or chromatograph will reveal two or more bases as being present at a particular location. Because the method does not allow discrimination between the products of the target sequence and the second or further sequences, the target sequence cannot be determined unambiguously.
This problem is particularly significant when it is desired to determine the sequence of one allele of a heterozygote pair at a polymorphic location in a single individual. Many eukaryotic cells are diploid, having two copies of most chromosomes, and sequence differences usually exist between each copy of a particular chromosome. Because DNA prepared from one individual will normally contain copies of both chromosomes, standard sequencing methods are unable to differentiate between sequences derived from each copy. Where there is a single nucleotide difference between each allele, the DNA sequence of each chromosome will nevertheless be clear (although it would not be possible to ascribe each sequence to a particular paternal or maternal chromosome). Where the polymorphism extends for two or more nucleotides, or where there are two or more polymorphic sites (alleles) separated by regions of common sequence, it is not possible to discern the sequence of the two alleles. In particular, standard sequencing methods are not able to determine the combination of alleles existing on a particular chromosome (the haplotype).
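By way of illustration only, the loss of haplotype information described above can be sketched in a few lines of Python. The sequences are invented toy examples, and the function name `mixed_read` is hypothetical; the sketch merely assumes that a conventional sequencing read reports, at each position, the set of bases present across both chromosome copies.

```python
# Illustrative only: toy sequences, not taken from any real locus.

def mixed_read(allele_a: str, allele_b: str) -> list:
    """Bases observed at each position when both alleles are sequenced together."""
    if len(allele_a) != len(allele_b):
        raise ValueError("alleles must be aligned and of equal length")
    return [frozenset((a, b)) for a, b in zip(allele_a, allele_b)]

# Two heterozygous sites (positions 2 and 6) separated by common sequence.
individual_1 = ("ACGTTCAG", "ACATTCGG")   # haplotypes G...A and A...G
individual_2 = ("ACGTTCGG", "ACATTCAG")   # haplotypes G...G and A...A

read_1 = mixed_read(*individual_1)
read_2 = mixed_read(*individual_2)

# Both individuals give the same mixed signal, so the combination of
# alleles on each chromosome cannot be recovered from the read alone.
print(read_1 == read_2)                   # True
ambiguous = [i for i, bases in enumerate(read_1) if len(bases) > 1]
print(ambiguous)                          # [2, 6]
```

The two individuals carry different haplotype pairs, yet the simulated reads are identical, which is the ambiguity that standard sequencing cannot resolve.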
In the wave of interest spawned by the mapping of the human genome, attention has turned to the use of single nucleotide polymorphisms (SNPs) to identify target genes associated with disease or drug response. In some instances, the presence of a particular SNP alone may be sufficient to cause a particular disease or to explain the individual variability in sensitivity to drugs.
However, it is not clear how often knowledge of an individual SNP will have utility in the clinic or in drug development. Research has shown that in asthma, at least, the association of individual SNPs to form a complete haplotype may be more relevant in predicting drug response than knowledge of isolated individual SNPs. In many cases it may be necessary to obtain a haplotype sequence involving the characterisation of two or more SNPs on each chromosome. It is therefore highly desirable to determine the combination of SNPs that co-exist on a single chromosome.
HLA (human leukocyte antigen or human leukocyte associated antigen A) genotyping is one area where haplotyping is important. Determination of the two haplotype sequences of the HLA genes is crucial to the success of organ transplantation. The individual haplotypes of the donor must be matched with the recipient before transplantation to avoid rejection of the transplant. Methods for evaluating HLA allele types have been described in the past. One such method relies on performing family studies, which is very time-consuming. An alternative method based on DNA sequencing is disclosed in WO 97/23650. However, where heterozygous alleles exist, this method relies on prior knowledge of existing haplotype sequences, so that ambiguous bases can be ascribed to one allele or another.
Many of the haplotyping methods used in the past rely on preparing a composition comprising only a single haplotype sequence before sequencing. One way of doing this is by converting a diploid cell into a haploid cell. This requires a high investment and is labour intensive and slow, but gives complete haplotype separation. Alternatively, human chromosomes can be cloned into yeast in order to get a haploid for that particular chromosome. This suffers from the same drawbacks in terms of time and cost.
One way of obtaining a preparation comprising only a single haplotype sequence is to amplify DNA by PCR using allele-specific primers. This type of approach for sequencing both alleles of a deletion polymorphism in intron 6 of the human dopamine 2 receptor gene (DRD2) is described in DNA Sequence Vol 6 (2), pp 87-94 (1996), Finck et al. In this method, allele-specific primers are used to amplify individual allele sequences by polymerase chain reaction (PCR). The primers are designed so that they produce amplicons of differing lengths, so that the products of each allele can be discriminated by agarose gel electrophoresis when both alleles are simultaneously amplified in the same reaction tube. The amplicons from each allele are then extracted from the gel and sequenced using conserved primers. The disadvantage of this approach is that it requires the prior knowledge of at least two sufficiently separated regions of dissimilarity between the alleles so that appropriate allele-specific primers producing different-sized products can be designed. In addition, it requires a time-consuming gel separation and extraction step prior to sequencing.
A related approach is described in Biotechniques Vol 10 (1), pp 30, 32 and 34 (1991), Kaneoka et al. Biotinylated allele-specific oligonucleotide primers coupled to streptavidin-coated magnetic beads are used to amplify DNA from one haplotype by PCR, and then a conserved primer is used for solid-phase direct DNA sequencing.
WO 92/15711 discloses a method for determining a major histocompatibility complex genotype of a subject in a sample containing nucleic acid. The method involves PCR amplification of the gene locus of interest, with all alleles for the gene locus to be sequenced being amplified with one conserved oligonucleotide primer pair and at least one allele for the gene locus being amplified with one conserved oligonucleotide primer and one non-conserved oligonucleotide primer. The amplicons for each allele are then sequenced with a conserved primer.
A different method for determining haplotype sequences involves analysis of PCR amplified sequences covering a polymorphic region by hybridisation rather than sequencing. PCR amplicons are contacted with oligonucleotide probes complementary to the sequence of either the maternal or paternal chromosome in a region comprising an SNP. Probes complementary to the maternal or paternal chromosomes are immobilised in different areas of a solid phase. A second set of oligonucleotide probes, labelled in a different way and complementary to the sequence of either the maternal or paternal chromosome in a region comprising a second SNP, is then used to identify which sequence at the first SNP is on the same chromosome as a particular sequence at the second SNP.
Other approaches have been adopted in the past for determining a target nucleic acid sequence when the target sequence is contained in a preparation comprising a non-target nucleic acid sequence. In one method described in WO 97/46711, a primer is selected that complements one strand but not the other, and an artificial mismatch is introduced into the primer. By selecting suitable hybridisation conditions so that stable duplexes form between the primer and one allele but not between the primer and the other allele, chain-extension sequencing of a single allele is achieved. A disadvantage of this method is that the selection of appropriate hybridisation conditions is time-consuming and not necessarily straightforward.
WO 00/20628 describes a method by which multiple genomic loci can be sequenced in the same reaction mixture. This method allows the sequencing of a second locus in the mixture by using primers which are longer than the longest product formed from the sequencing reaction in relation to a first locus. Different primers are used for each locus. However, this document does not disclose a method for haplotyping for particular alleles of a single locus.
Accordingly, the present invention aims to overcome the disadvantages of the prior art. In particular, the present invention aims to provide an improved method of determining a target nucleic acid sequence, where the target nucleic acid is comprised in a preparation comprising a non-target nucleic acid which has regions of common and dissimilar sequence to the target nucleic acid. The present invention also aims to provide an improved method for determining the haplotype of a subject.
Accordingly, the present invention provides a method for determining a target nucleic acid sequence, wherein the target nucleic acid sequence is comprised in a preparation comprising a non-target nucleic acid sequence, the target nucleic acid sequence and the non-target nucleic acid sequence each having a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the method comprising:
(a) contacting the preparation with a blocking oligonucleotide complementary to at least a portion of the first region of dissimilar sequence of the non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto;
(b) contacting the preparation with a sequencing primer complementary to at least a portion of the first region of common sequence, under conditions to hybridise the primer to the target nucleic acid sequence; and
(c) subjecting the preparation to a sequencing reaction, such that the sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequence, thereby determining at least the second region of dissimilar sequence of the target nucleic acid sequence; and wherein the blocking oligonucleotide blocks the sequencing reaction at least from proceeding into the second region of dissimilar sequence of the non-target nucleic acid sequence.
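Steps (a) to (c) above can be sketched, purely as a toy model, as follows. The sketch assumes that every sequence is written in the direction of sequencing and that hybridisation can be approximated by exact string matching; the function name `sequence_with_blocker` and all sequences are hypothetical and are not part of the claimed method.

```python
# Toy model: each sequence is written in the direction of sequencing, and
# "hybridisation" is approximated by exact string matching (an assumption).

def sequence_with_blocker(templates: dict, primer: str, blocker: str) -> dict:
    """Return the read obtained from each template after the blocking step."""
    reads = {}
    for name, seq in templates.items():
        start = seq.find(primer)
        if start < 0:
            continue                      # primer does not bind this template
        extend_from = start + len(primer)
        block_at = seq.find(blocker, extend_from)
        if block_at >= 0:
            # Step (a): the blocker is bound, so extension stops where it begins.
            reads[name] = seq[extend_from:block_at]
        else:
            # Steps (b)-(c): the primer extends through both dissimilar regions.
            reads[name] = seq[extend_from:]
    return reads

templates = {
    #             common      d1  common  d2  common
    "target":     "GATTACAGG" "A" "CCTGA" "T" "GG",
    "non_target": "GATTACAGG" "C" "CCTGA" "G" "GG",
}
primer = "GATTACA"     # hypothetical primer within the first common region
blocker = "CCCTGA"     # hypothetical blocker covering d1 of the non-target

reads = sequence_with_blocker(templates, primer, blocker)
print(reads)   # {'target': 'GGACCTGATGG', 'non_target': 'GG'}
```

In this sketch only the target read extends through the second region of dissimilar sequence (the `T`), while the non-target read stops in common sequence before the blocker.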
In a further aspect, the present invention provides a method for determining the haplotype of a subject from a sample comprising DNA from the subject, comprising a method as defined above, wherein the preparation comprises the sample, the target nucleic acid sequence comprises a locus on a first chromosome of a pair of chromosomes, the non-target nucleic acid sequence comprises the corresponding locus on the second chromosome of the pair, the locus comprising two or more single nucleotide polymorphisms for which the subject is heterozygous, wherein the sequencing reaction is conducted to determine the sequence of the polymorphic genetic locus on the first chromosome of the pair thereby determining the haplotype of the subject.
In a further aspect, the present invention provides use of pyrosequencing for determining the haplotype of a subject from a sample comprising DNA from the subject, wherein pyrosequencing is used to sequence a target locus on a first chromosome of a pair, the target locus comprising two or more single nucleotide polymorphisms, the corresponding locus on the second chromosome of the pair being blocked from sequencing by a blocking oligonucleotide hybridised to the second chromosome.
The present invention provides an improved method of sequencing a target nucleic acid sequence comprised in a preparation comprising a different but related nucleic acid sequence. The method advantageously allows the sequencing reaction to proceed in relation to the target nucleic acid sequence, while the sequencing reaction between the primer and the other nucleic acid sequence is blocked by the blocking oligonucleotide. The sequence data which is obtained is therefore derived only from the target nucleic acid sequence, as interference from the other nucleic acid sequence is removed. The method is a fast and efficient way of discriminating between the two sequences. In particular, the method is advantageous because a sequence-specific sequencing primer does not have to be constructed for each target nucleic acid sequence. The method also does not suffer from problems relating to lack of discrimination in primer hybridisation to closely-related sequences.
The method also provides an enhanced method for haplotyping. The method enables the rapid determination of allele associations to identify individually the two haplotype sequences present at a particular locus in a subject. The method is particularly advantageous in identifying associations of SNPs and in HLA genotyping. In particular, the method avoids the need for time-consuming family studies or prior knowledge of allele associations.
The target nucleic acid sequence of the present invention is not particularly limited. Suitable target nucleic acid sequences include a deoxyribonucleic acid (DNA) sequence, a ribonucleic acid (RNA) sequence, or a DNA or RNA sequence comprising one or more modified nucleotides or bases, or one or more artificial nucleotides or bases. The second nucleic acid sequence is likewise not particularly limited, and may be a DNA or RNA sequence, optionally comprising one or more modified nucleotides or bases.
Preferably the target nucleic acid sequence and/or the non-target nucleic acid sequence is a DNA sequence. The DNA sequence may be a genomic DNA or cDNA sequence. Each sequence is preferably a human DNA sequence.
The target nucleic acid sequence may be comprised in the same nucleic acid polymer as the non-target nucleic acid. However, the two nucleic acid sequences are preferably on separate DNA molecules. More preferably the target nucleic acid sequence and the non-target nucleic acid sequence each comprise a different allele at a polymorphic genetic locus in a subject. In this embodiment, the target nucleic acid sequence comprises the locus on one chromosome of a pair (maternal or paternal) and the non-target nucleic acid sequence comprises the locus on the other chromosome of the pair.
In the present invention the preparation comprises a target nucleic acid sequence and a non-target nucleic acid sequence. Suitable preparations include any preparation comprising two or more nucleic acid sequences, provided that at least two of the nucleic acid sequences share a region of common sequence but differ in a region of dissimilar sequence. Preferably the preparation comprises a purified DNA preparation. The preparation is preferably prepared from a sample derived from a single human subject. Thus the preparation may be a sample of human saliva, blood, urine or other tissue, or a DNA preparation comprising genomic DNA which has been prepared from such a sample.
In one embodiment, the preparation comprises one or more further nucleic acid sequences, wherein each further nucleic acid sequence has a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence. Here "common sequence" means that the sequence of the further nucleic acid sequence is identical to the target and non-target nucleic acid sequences in this region. "Dissimilar sequence" means that the sequence of the further nucleic acid is different from the target and/or non-target nucleic acid sequences in this region.
In this embodiment, the method may include a step of blocking the sequencing reaction between the primer and one or more of the further nucleic acid sequences. The sequencing reaction between the primer and the further nucleic acid sequences may be blocked in the same way as for the sequencing reaction between the primer and the non-target nucleic acid sequence. If it is desired to obtain sequencing reaction products derived only from the target nucleic acid sequence, the sequencing reaction between the primer and each of the further nucleic acid sequences may be blocked.
Alternatively, the sequencing reaction between the primer and only some of the further nucleic acid sequences may be blocked. By using the methods described below, sequencing from particular further nucleic acid sequences may be selectively blocked or allowed to proceed. This type of analysis may be termed "multiplexing". Multiplexing permits the analysis of multiple sites in an individual sample or a number of samples from different individuals.
In one embodiment using multiplexing, the preparation comprises DNA derived from samples taken from two or more individuals. For instance, a number of DNA preparations derived from different individuals in a group may be combined and the method described herein carried out on the combined preparation. This method may be used to assess whether or not a particular combination of SNPs is found together on a single chromosome in all individuals within the group. If so, the sequencing reaction will yield a single sequence. If not, the sequencing reaction will indicate alternative bases at the position of one or more SNPs in the sequence. If it is then desired to determine which combination of SNPs was present in which individual, it would be necessary to repeat the method on separate DNA preparations from each individual.
In another embodiment involving multiplexing, more than one target nucleic acid sequence may be determined using a single sequencing reaction. In this embodiment, the present method is performed in parallel using two or more oligonucleotide primers, each of which is complementary to a different sequence. In this way, two or more polymorphic sites may be analysed simultaneously. Each target nucleic acid sequence shares a first region of common sequence with a corresponding non-target nucleic acid sequence. The sequencing reaction between each primer and the non-target nucleic acid sequence to which it is complementary is blocked, so that the sequencing reaction proceeds fully only in respect of the target nucleic acid sequences.
In one such embodiment, the invention relates to a method for determining a plurality of target nucleic acid sequences, wherein the plurality of target nucleic acid sequences is comprised in a preparation further comprising a plurality of corresponding non-target nucleic acid sequences, each target nucleic acid sequence in the preparation corresponds to one or more corresponding non-target nucleic acid sequences in the preparation, each target nucleic acid sequence and each corresponding non-target nucleic acid sequence has a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the first region of common sequence of each target nucleic acid sequence is the same as the first region of common sequence of its corresponding non-target nucleic acid sequences, the first region of dissimilar sequence of each target nucleic acid sequence is different to the first region of dissimilar sequence of its corresponding non-target nucleic acid sequences, the second region of dissimilar sequence of each target nucleic acid sequence is different to the second region of dissimilar sequence of its corresponding non-target nucleic acid sequences, which method comprises:
(a) contacting the preparation with a plurality of blocking oligonucleotides wherein each blocking oligonucleotide is complementary to at least a portion of the first region of dissimilar sequence of a non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto;
(b) contacting the preparation with a plurality of sequencing primers, wherein each primer is complementary to at least a portion of the first region of common sequence of a target nucleic acid sequence and its corresponding non-target nucleic acid sequence, under conditions to hybridise the primer thereto; and
(c) subjecting the preparation to a sequencing reaction, such that the sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequences, thereby determining at least the second region of dissimilar sequence of each target nucleic acid sequence;
and wherein the blocking oligonucleotides block the sequencing reaction at least from proceeding into the second region of dissimilar sequence of each corresponding non-target nucleic acid sequence.
In this embodiment, sequencing reaction products are obtained which are derived from more than one target nucleic acid sequence. A method is therefore required in order to discriminate between sequencing reaction products derived from each target nucleic acid sequence. This may be done by labelling the sequencing reaction products derived from each nucleic acid sequence in which the sequencing reaction is allowed to proceed with a distinct label.
The sequencing reaction products derived from each target nucleic acid sequence may be distinguished by differentially labelling each oligonucleotide primer. In one embodiment, each primer is labelled with a fluorescent label which fluoresces at a different wavelength. Sequencing products derived from each target nucleic acid sequence may then be distinguished for example using an automated sequencer following gel electrophoresis. In another embodiment, one or more primers are labelled with one part of a ligand-affinant pair. A preferred ligand-affinant pair is biotin-streptavidin. The ligand-affinant interaction may be used in order to bind sequencing products derived from one target nucleic acid sequence to a solid phase (such as magnetic beads), thereby separating the labelled sequencing products from non-labelled sequencing products. The labelled and non-labelled sequencing products may then be separately subjected to gel electrophoresis. In embodiments where two primers are used (in order to sequence two different target nucleic acid sequences), only one primer need be labelled in order to separate the sequencing products derived from each of the target nucleic acid sequences. In embodiments where 3 or more primers are used, 2 or more of the primers need to be labelled. In this case, a different ligand-affinant pair needs to be selected for each primer to be labelled, so that the sequencing products derived from each target nucleic acid sequence can be bound to a different solid phase and thereby separated. In general, where n primers are used, n-1 primers need to be labelled.
Each of the two nucleic acid sequences includes a first region of common sequence. This means that the target nucleic acid sequence is identical to the non-target nucleic acid sequence in this region. The method advantageously allows the sequencing of only the target nucleic acid sequence, despite the fact that a generic primer which is complementary to the region of common sequence (and which would hybridise to both nucleic acid sequences in the absence of the blocking oligonucleotide) is used.
The first region of common sequence preferably comprises a length of at least 10 nucleotides, more preferably at least 20 nucleotides.
The first region of common sequence is upstream of a first region of dissimilar sequence. The first region of dissimilar sequence is upstream of a second region of dissimilar sequence. By "upstream" it is meant upstream in terms of the direction of sequencing. The sequencing primer first hybridises to a region comprising at least a portion of the first region of common sequence. As the primer is extended (in the downstream direction) the first region of dissimilar sequence acts as a template for primer extension before the second region of dissimilar sequence. Because primer extension typically proceeds in the 5' to 3' direction (nucleotides are added at the 3' end of the primer), the first region of common sequence typically lies 3' to the first region of dissimilar sequence, and the first region of dissimilar sequence typically lies 3' to the second region of dissimilar sequence.
By "region of dissimilar sequence" it is meant that the target nucleic acid sequence is different from the non-target nucleic acid sequence in this region. In one embodiment the first and second regions of dissimilar sequence are contiguous, that is the second region of dissimilar sequence immediately follows the first region of dissimilar sequence with no intervening region of common sequence. In an alternative embodiment, the first and second dissimilar sequences are separated by a second region of common sequence.
In one embodiment the target nucleic acid sequence and the non-target nucleic acid sequence comprise one or more further regions of dissimilar sequence. For instance, there may be a third, fourth, fifth or subsequent region of dissimilar sequence downstream of the second region of dissimilar sequence. However, there must be at least two regions of dissimilar sequence. Each region of dissimilar sequence is separated by a further region of common sequence. The method permits the determination of the sequence of the target nucleic acid sequence downstream of the second region of dissimilar sequence as far as the sequencing reaction is capable of proceeding.
The length of the first and second regions of dissimilar sequence is not particularly limited. Any length of dissimilar sequence may be used from a single nucleotide upwards. In a preferred embodiment, either or both regions of dissimilar sequence comprises an SNP.
The present method comprises a step of contacting the preparation with a blocking oligonucleotide complementary to a sequence comprising the first region of dissimilar sequence of the non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto. The blocking oligonucleotide is typically a single-stranded DNA 5 to 50 nucleotides in length, preferably 10 to 50 nucleotides, preferably 10 to 40 nucleotides in length, more preferably 15 to 35 nucleotides in length and most preferably 15 to 25 nucleotides in length.
The blocking oligonucleotide therefore contains at least one base which is non-complementary to the target nucleic acid sequence. It is important that hybridisation conditions are selected, at least in step (a), so that the blocking oligonucleotide hybridises to the non-target nucleic acid sequence but not to the target nucleic acid sequence. Where there is only a single base difference between the target and non-target nucleic acid sequence within the region to which the blocking oligonucleotide binds, the hybridisation conditions, and in particular the hybridisation temperature, must be selected particularly carefully. If the temperature selected is too high, insufficient blocking of the non-target nucleic acid sequence may occur. If the temperature selected is too low, the blocking oligonucleotide may also hybridise to the target nucleic acid sequence and prevent the sequencing reaction proceeding in respect of the target.
Hybridisation conditions for step (a) may be selected according to criteria well known to those skilled in the art. An appropriate temperature and salt content for hybridisation needs to be selected according to the length of the blocking oligonucleotide and its G-C content, amongst other things (Old & Primrose (1994), Principles of Gene Manipulation, Blackwell Science and Maniatis et al. (1992), Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, New York). Typically the hybridisation temperature should be close to the melting temperature (Tm) of the oligonucleotide. Tm is defined as the temperature at which the oligonucleotide and its target are 50% dissociated, and may be calculated in °C according to the "Wallace rule" by the following formula:
Tm = 4 × (number of G:C base-pairs) + 2 × (number of A:T base-pairs)
Preferably the hybridisation temperature should be within 2°C of Tm. Accordingly, for a 20-mer blocking oligonucleotide with 50% G-C content, the Tm is about 60°C and a suitable hybridisation temperature would be 58°C.
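As a worked illustration of the Wallace rule given above, the following minimal Python sketch computes Tm for an oligonucleotide. The function name `wallace_tm` and the example 20-mer are hypothetical, chosen only to reproduce the 50% G-C case mentioned in the text.

```python
def wallace_tm(oligo: str) -> int:
    """Melting temperature in degrees C by the Wallace rule: 4(G+C) + 2(A+T)."""
    seq = oligo.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at != len(seq):
        raise ValueError("oligo must contain only A, C, G and T")
    return 4 * gc + 2 * at

blocker = "ACGTACGTACGTACGTACGT"   # hypothetical 20-mer with 50% G-C content
tm = wallace_tm(blocker)
print(tm)        # 60
print(tm - 2)    # 58, a hybridisation temperature within 2 degrees C of Tm
```

The rule is a rough approximation for short oligonucleotides; in practice the chosen temperature would still need to be confirmed empirically.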
According to the present invention the blocking oligonucleotide inhibits the sequencing of the non-target nucleic acid sequence by the sequencing primer. The blocking oligonucleotide must therefore not act as a primer itself for sequencing of the non-target nucleic acid sequence. One way of preventing this is to use a blocking oligonucleotide having no 3' hydroxy group, for instance by adding a dideoxynucleotide at the 3' position during synthesis of the oligonucleotide.
Alternatively, in a preferred embodiment, step (a) further comprises a step of contacting the preparation with a terminator nucleotide. A particular terminator nucleotide (such as ddATP, ddCTP, ddGTP or ddTTP) may be chosen so that it is complementary to a base in the non-target nucleic acid sequence immediately adjacent to the 3' end of the blocking oligonucleotide. In the presence of a DNA polymerase the terminator nucleotide becomes incorporated into the blocking oligonucleotide only when hybridised to the non-target nucleic acid sequence. If the blocking oligonucleotide is chosen such that the base at its 3' terminus is complementary to a base within the first region of dissimilar sequence of the non-target nucleic acid sequence, this helps to ensure that the terminator does not become incorporated into any blocking oligonucleotide which might be hybridised to the target nucleic acid sequence.
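The choice of terminator nucleotide described above may be sketched as follows. This is an illustration only: both sequences are assumed to be written 5' to 3', hybridisation is approximated by an exact match to the reverse complement of the blocking oligonucleotide, and the names `choose_terminator` and `reverse_complement` and the sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def choose_terminator(template: str, blocker: str) -> str:
    """Pick the ddNTP that would extend the bound blocker by one base.

    The blocker's 3' end pairs with the 5'-most base of its binding site, so
    the next template base is the one immediately 5' of that site.
    """
    site = template.find(reverse_complement(blocker))
    if site < 0:
        raise ValueError("blocker does not bind this template")
    if site == 0:
        raise ValueError("no template base beyond the blocker 3' end")
    next_template_base = template[site - 1]
    return "dd" + next_template_base.translate(COMPLEMENT) + "TP"

non_target = "TTGACCGGATCA"                 # hypothetical template, 5'->3'
blocker = reverse_complement("CCGGATC")     # binds non_target[4:11]

print(blocker)                                  # GATCCGG
print(choose_terminator(non_target, blocker))   # ddTTP
```

Here the template base adjacent to the 3' end of the bound blocker is `A`, so the complementary terminator is ddTTP.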
The blocking oligonucleotide may block sequencing of the non-target nucleic acid sequence in one of two ways. Firstly, if the blocking oligonucleotide is selected such that it binds to a region overlapping the first region of dissimilar sequence and the first region of common sequence (to which the sequencing primer is complementary), it will inhibit sequencing primer binding to the non-target nucleic acid sequence. Alternatively, the blocking oligonucleotide may be selected such that it binds to a region which is downstream (in terms of the direction of sequencing) from the sequencing primer binding site. In this case, the sequencing primer will bind to both nucleic acid sequences, but extension of the primer bound to the non-target nucleic acid sequence will be inhibited.
In a preferred embodiment, the terminator nucleotide is capable of covalently cross-linking the primer to the non-target nucleic acid sequence. Alternatively a terminator nucleotide comprising Peptide Nucleic Acid (PNA) and (L-ribo-)Locked Nucleic Acid (LNA) nucleotides, described in WO 95/15974 and WO 00/66604 respectively, can be used to block sequencing of the non-target nucleic acid.
In a preferred embodiment, step (a) further comprises contacting the preparation with a cleavage agent, under conditions to cleave the non-target nucleic acid sequence within the sequence hybridised to the blocking oligonucleotide. In this embodiment, a cleavage agent is selected that introduces strand breaks only into double-stranded DNA. If a single-stranded DNA preparation is used, only the non-target nucleic acid sequence will be cleaved provided that the blocking oligonucleotide does not hybridise to the target nucleic acid sequence.
The cleavage agent is preferably a restriction endonuclease. One way of ensuring that only the non-target nucleic acid sequence is cleaved is to use a restriction endonuclease which recognises a sequence comprising the first region of dissimilar sequence of the non-target nucleic acid sequence.
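A simple way of checking whether a candidate recognition sequence is present in the non-target sequence but absent from the target can be sketched as below. EcoRI (recognition sequence GAATTC) is used only as a familiar example of a type II enzyme; the template sequences and the helper name `site_positions` are hypothetical.

```python
ECORI_SITE = "GAATTC"   # EcoRI recognition sequence (palindromic)

def site_positions(seq: str, site: str) -> list:
    """Return every 0-based start position of `site` in `seq`."""
    width = len(site)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == site]

# Toy alleles differing at a single position inside the candidate site.
target     = "ATGCCGAGTTCTTAGC"
non_target = "ATGCCGAATTCTTAGC"

print(site_positions(non_target, ECORI_SITE))   # [5]
print(site_positions(target, ECORI_SITE))       # []
```

In this toy case the site spans the dissimilar position, so only the non-target sequence carries it; cleavage would additionally require the site to be double-stranded, that is, covered by the hybridised blocking oligonucleotide.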
Alternatively, a restriction endonuclease may be used which recognises a sequence common to both the target and non-target nucleic acid sequence, provided that the recognition sequence is within the binding site of the blocking oligonucleotide. Accordingly, the restriction endonuclease may recognise a site within the first or second regions of common sequence. In one embodiment, the blocking oligonucleotide is extended by polymerisation far enough in order to allow cleavage of the non-target nucleic acid sequence at a recognition site downstream of the blocking oligonucleotide binding site. In this case, the blocking oligonucleotide is preferably extended stepwise by the addition of individual nucleotides so that the degree of extension can be controlled.
The restriction endonuclease is not particularly limited provided that it recognises a defined DNA sequence, and a suitable endonuclease may be selected according to the presence of known recognition sites at an appropriate location in the non-target nucleic acid sequence. The restriction endonuclease is preferably a type II restriction endonuclease.
In an alternative embodiment, the cleavage agent comprises a chemical cleavage agent.
If the blocking oligonucleotide is covalently linked to the non-target nucleic acid sequence or if the non-target nucleic acid sequence is cleaved, a standard sequencing reaction can be performed in step (c). Typically such a sequencing reaction utilises an electronic thermocycler, in order to allow a number of cycles of primer hybridisation to the target nucleic acid sequence, elongation by a polymerase and separation of extended products from the template. Four separate sequencing reactions may be performed, each containing one dideoxy terminator (ddATP, ddCTP, ddGTP or ddTTP) and the products visualised in separate lanes by polyacrylamide gel electrophoresis and autoradiography. If dye terminators comprising fluorescent labels are employed, wherein the labels fluoresce at different wavelengths to indicate each particular terminator nucleotide, a single sequencing reaction can be used.
Alternatively, if the blocking oligonucleotide is not covalently crosslinked to the non-target nucleic acid sequence, it is important to ensure that the blocking oligonucleotide does not separate from the non-target nucleic acid sequence during the sequencing reaction, as this would allow sequencing of the non-target nucleic acid sequence. Accordingly, in this embodiment, it is preferable to maintain the temperature of the sequencing reaction below the denaturation temperature of the blocking oligonucleotide/non-target nucleic acid complex. For double-stranded nucleic acids, such as double-stranded DNA, the preparation can first be heated to an elevated temperature, such as 95°C, in order to separate the DNA strands. The preparation is then typically cooled to a suitable hybridisation temperature for the blocking oligonucleotide (such as 60°C for a 20-mer oligonucleotide with 50% G-C content). Following addition of the sequencing primer and the removal of unincorporated terminator, the sequencing reaction is then performed at a constant temperature (such as) without thermocycling.
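The figure of 60°C for a 20-mer with 50% G-C content is consistent with the Wallace rule of thumb, Tm ≈ 2°C × (A+T) + 4°C × (G+C). The specification does not state which estimate was used, so the sketch below is an assumption offered for orientation only; the rule is a rough guide for short oligonucleotides, and the function name and example sequence are hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate the melting temperature (in degrees C) of a short oligonucleotide.

    Uses the Wallace rule: 2 degrees per A or T plus 4 degrees per G or C.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical 20-mer with 10 G/C and 10 A/T bases (50% G-C content).
blocker = "ATGCATGCATGCATGCATGC"
print(wallace_tm(blocker))  # 2*10 + 4*10 = 60
```

An estimate of this kind gives an upper bound to keep in mind when choosing the constant temperature of the sequencing reaction, since the blocking oligonucleotide must remain hybridised throughout.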
The method comprises a step of contacting a preparation with a sequencing primer complementary to at least a portion of the first region of common sequence. This means that at least a portion of the primer is complementary to a sequence which is present in both the target nucleic acid sequence and the non-target nucleic acid sequence. Thus the primer is capable under suitable conditions (and in the absence of any blocking agent) of hybridising to both the target nucleic acid sequence and the non-target nucleic acid sequence.
In a preferred embodiment, the primer is complementary to a sequence which is found entirely within the first region of common sequence. This means that the hybridisation site of the primer has an identical sequence in both the target and non-target nucleic acid sequence. However, in an alternative embodiment a primer may be used which is capable of hybridising to a sequence a part of which differs between the target nucleic acid sequence and the non-target nucleic acid sequence. In this embodiment, the primer may be fully complementary to a sequence found in either the target or non-target nucleic acid sequence, but a part of the primer may not be complementary to the other nucleic acid sequence. Thus, only a part of the primer is capable of hybridising to one of the nucleic acid sequences. Alternatively, a mixed primer may be used such that the primer contains two species, a first species complementary to the target nucleic acid sequence and a second species complementary to the non-target nucleic acid sequence. The difference in sequence between the target and non-target nucleic acid sequence in the region to which the primer hybridises preferably should be limited to one or two nucleotides, more preferably one nucleotide. The differences should also be located in a region of the nucleic acid sequences towards which the 5' end of the primer hybridises. If mismatches are located near the 3' end of the primer, it is more likely that polymerisation will be inhibited. These embodiments fall within the scope of the invention provided that, under the hybridisation conditions employed, the primer is not capable of selectively hybridising only to one of the two nucleic acid sequences. If that were the case, it would be unnecessary to perform a blocking step, because sequencing would proceed only from one of the two nucleic acid sequences.
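The mismatch preferences stated above (at most one or two differences, located towards the 5' end of the primer) can be expressed as a simple check. This is a sketch under stated assumptions: the function names are hypothetical, the comparison is against the sequence that would pair perfectly with each template, and the size of the protected 3' window (`three_prime_window`, five bases here) is an illustrative value that the specification does not give.

```python
def mismatch_positions(primer: str, perfect_match: str) -> list[int]:
    """Return 0-based positions (from the primer's 5' end) where the primer differs
    from the sequence that would pair perfectly with the template."""
    if len(primer) != len(perfect_match):
        raise ValueError("sequences must have equal length")
    return [i for i, (p, q) in enumerate(zip(primer, perfect_match)) if p != q]


def primer_tolerates(primer: str, perfect_match: str,
                     max_mismatches: int = 2, three_prime_window: int = 5) -> bool:
    """True if mismatches are few and none falls within the 3'-terminal window."""
    mismatches = mismatch_positions(primer, perfect_match)
    cutoff = len(primer) - three_prime_window
    return len(mismatches) <= max_mismatches and all(i < cutoff for i in mismatches)


primer = "ACGTACGTACGTACGTAC"               # written 5' to 3'
near_5_prime = "ATGTACGTACGTACGTAC"         # differs at position 1
near_3_prime = "ACGTACGTACGTACGTAT"         # differs at the 3'-terminal base
print(primer_tolerates(primer, near_5_prime))  # True
print(primer_tolerates(primer, near_3_prime))  # False
```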
The nature of the primer is not particularly limited, provided that it is capable of initiating a sequencing reaction when hybridised to the target nucleic acid. Preferably the primer is a single-stranded DNA. The length of the primer is preferably 10 to 50 nucleotides, more preferably 10 to 40 nucleotides and most preferably 15 to 30 nucleotides. Suitable primers may be designed according to standard techniques known to those skilled in the art for selecting primers for polymerase reactions, such as for sequencing and for amplification of DNA by the polymerase chain reaction (PCR).
The preparation is contacted with the sequencing primer, typically by adding an aqueous solution of the primer to a preparation containing a suitable amount of DNA. Hybridisation conditions are then selected so that the primer hybridises to the first region of common sequence of the DNA, according to criteria well known to those skilled in the art, and as discussed above in relation to the blocking oligonucleotide. It is important that if the blocking oligonucleotide is not cross-linked to the non-target nucleic acid sequence, the temperature is not raised sufficiently to separate the blocking oligonucleotide from the non-target nucleic acid sequence. Preferably a blocking oligonucleotide and sequencing primer are selected such that they have a similar Tm.
Once the sequencing primer is hybridised to the target nucleic acid sequence, the preparation is subjected to a sequencing reaction. The sequencing reaction may be any type of nucleic acid sequencing reaction, provided that it involves extension or elongation of the primer when hybridised to a nucleic acid sequence. Primer extension is typically performed using a DNA polymerase, such as Thermus aquaticus (Taq) or Pfu DNA polymerase for reactions involving a high-temperature step, or other suitable DNA polymerases where there is no high-temperature step. Preferably the sequencing reaction comprises real-time sequencing such as pyrosequencing. In another embodiment, the sequencing reaction comprises Sanger sequencing using dideoxynucleotides.
The sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequence. Typically this means that at least some of the primer hybridised to the target nucleic acid sequence is extended so that the extended primer contains incorporated nucleotides complementary to one or more nucleotides in the second region of dissimilar sequence of the target nucleic acid. In certain embodiments involving the use of dideoxynucleotide terminator sequencing, only a fraction of the primer may be extended into the second region of dissimilar sequence, as some of the extending primer is terminated at each position in order to determine the sequence.
The blocking oligonucleotide prevents the production of sequencing products from non-target nucleic acid, so that in the second region of dissimilar sequence, the only product that is seen is derived from the target nucleic acid sequence. This allows the target nucleic acid sequence to be determined, because the interference from the non-target nucleic acid sequence is removed. The method also allows a particular sequence in the first region of dissimilar sequence to be determined as being associated with a particular sequence in the second region of dissimilar sequence, by intentionally blocking the sequencing reaction when a particular nucleotide is present at the first region of dissimilar sequence.
Unincorporated terminator nucleotide is then removed, either by washing (especially if the nucleic acid is linked to a solid support) or by the use of a nucleotide-degrading enzyme, such as apyrase. The preparation is then subjected to a sequencing reaction, without allowing the blocking oligonucleotide to separate from the non-target nucleic acid. In this way, no sequencing reaction proceeds in respect of the non-target nucleic acid sequence. The target nucleic acid sequence is free to allow primer extension and the sequencing reaction proceeds only in respect of the target nucleic acid sequence.
In a preferred embodiment, the sequencing reaction comprises a method of sequencing based on the detection of the release of pyrophosphate. Applicable methods are disclosed in WO 98/28440 and in Science (1998) Vol. 281, pages 363 to 365, the contents of which are incorporated herein by reference. Such methods have been termed "pyrosequencing". According to one suitable pyrosequencing method, the nucleic acid to be sequenced is incubated with the primer, DNA polymerase, ATP sulfurylase, firefly luciferase and a nucleotide-degrading enzyme such as apyrase. Four nucleotides are added stepwise, wherein a nucleotide will only become incorporated into the growing DNA strand and release pyrophosphate (PPi) if it is complementary to the base in the template strand. Any release of PPi is detected enzymically, for example by an enzyme cascade resulting in the production of light which is detected in a suitable light-sensitive device such as a luminometer or a charge-coupled device camera. Unincorporated nucleotides are degraded between each cycle by the nucleotide-degrading enzyme, so that after the first nucleotide has been degraded, the next nucleotide can be added. As this procedure is repeated, longer stretches of the template sequence are deduced.
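The stepwise dispensation and readout can be illustrated with a minimal simulation. It is a sketch under simplifying assumptions: the signal is taken to equal the number of bases incorporated per dispensation (so a run of identical bases gives a proportionally larger peak), incorporation and degradation are treated as complete, and the function name, dispensation order and example strand are hypothetical.

```python
def pyrosequence(strand_to_synthesise: str, dispensation_order: str = "ACGT",
                 max_cycles: int = 10) -> list[tuple[str, int]]:
    """Return one (nucleotide, signal) pair per dispensation.

    `strand_to_synthesise` is the complement of the template downstream of the primer.
    """
    pos = 0
    pyrogram = []
    for _ in range(max_cycles):
        for nucleotide in dispensation_order:
            incorporated = 0
            # A dispensed nucleotide is incorporated only while it matches the next base.
            while pos < len(strand_to_synthesise) and strand_to_synthesise[pos] == nucleotide:
                incorporated += 1
                pos += 1
            pyrogram.append((nucleotide, incorporated))
            if pos == len(strand_to_synthesise):
                return pyrogram
    return pyrogram


def decode(pyrogram: list[tuple[str, int]]) -> str:
    """Reconstruct the synthesised strand from the pyrogram."""
    return "".join(nucleotide * signal for nucleotide, signal in pyrogram)


gram = pyrosequence("GGAT")
print(gram)
print(decode(gram))
```

Dispensations that give no signal are as informative as those that do: they show that the dispensed nucleotide was not complementary to the next template base.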
A method based on the detection of the release of pyrophosphate, involving the stepwise addition of nucleotides and real-time detection of their incorporation, is preferred for performing the sequencing reaction according to the present invention, because it does not require a step of heating which would separate the blocking oligonucleotide from the non-target nucleic acid sequence. Pyrosequencing is preferably performed using a single-stranded template, which may be suitably prepared by biotin capture of one strand on magnetic beads. The single-stranded template may be free in solution or immobilised on a solid support. Alternatively, a double-stranded DNA template may be employed if the enzymes used in the method are thermostable. In such an embodiment a single heating step is used to denature the double-stranded DNA, followed by a step in which the primer is allowed to anneal. Following the blocking step the extending primer is not separated from its template.
Earlier methods based on the detection of the release of pyrophosphate such as those disclosed in WO 93/23562 and WO 98/13523 are also applicable in the present invention. These methods do not use a nucleotide-degrading enzyme, and therefore require immobilisation of DNA on a solid support and washing steps between each nucleotide addition.
In a preferred embodiment of the present invention, the method involves determining the combination of individual SNPs which exist in a particular region on one chromosome of a pair in a subject. Determining the association of alleles such as SNPs is termed haplotyping. In this embodiment, each of the first and second regions of dissimilar sequence comprises a single nucleotide. The target nucleic acid sequence comprises a particular locus (such as a particular gene, part of a gene or regulatory element) on one chromosome of a pair in the individual subject, and the non-target nucleic acid sequence comprises the corresponding sequence on the other chromosome in the pair. The locus comprises two or more SNPs. The first and second regions of common sequence comprise parts of the locus which are non-polymorphic between the two chromosomes.
Where the method is used to determine associations of previously identified SNPs in a subject sample, one of the known alleles for the first SNP is used to block further sequencing from that chromosome. In this way, further sequencing proceeds only from the other chromosome; the base present in the second SNP is determined for the other chromosome and the combination of SNPs present on each chromosome can be determined.
For example, two alleles A (on chromosome A) and C (on chromosome A') for SNP-1 and two alleles G and T for SNP-2 may be known to be present within a particular gene in a subject, but the combination of alleles on each chromosome (haplotype) is unknown. The possible haplotypes (for chromosome A and its pair chromosome A') for this individual are therefore either (1) A-G (on chromosome A) and C-T (on chromosome A'), or (2) A-T and C-G. In order to distinguish between these possibilities, dideoxyguanosine triphosphate is added to the preparation so that it becomes incorporated opposite the C at SNP-1 on chromosome A', blocking that chromosome. Sequencing then proceeds only on chromosome A. If the sequencing results indicate a G at SNP-2, then (1) is correct. Conversely, when dideoxythymidine triphosphate is added for incorporation at SNP-1, chromosome A is blocked and, if (1) is correct, a T would be expected at SNP-2.
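The reasoning in this example can be written out as a short deduction. The sketch assumes a subject heterozygous at both SNPs, complete blocking of one chromosome, and that the SNP-2 result is reported in the same strand convention as the listed alleles; the function name `deduce_haplotypes` is hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def deduce_haplotypes(snp1_alleles: tuple[str, str], snp2_alleles: tuple[str, str],
                      terminator: str, observed_snp2: str) -> set[tuple[str, str]]:
    """Return the two haplotypes as (SNP-1 allele, SNP-2 allele) pairs.

    `terminator` is the base of the dideoxynucleotide added at SNP-1; it blocks the
    chromosome whose SNP-1 allele is complementary to it.
    """
    blocked_snp1 = COMPLEMENT[terminator]
    if blocked_snp1 not in snp1_alleles:
        raise ValueError("terminator does not pair with either SNP-1 allele")
    if observed_snp2 not in snp2_alleles:
        raise ValueError("observed base is not a known SNP-2 allele")
    # The chromosome that was sequenced carries the other SNP-1 allele.
    (sequenced_snp1,) = [a for a in snp1_alleles if a != blocked_snp1]
    # The blocked chromosome must carry the SNP-2 allele that was not observed.
    (unobserved_snp2,) = [a for a in snp2_alleles if a != observed_snp2]
    return {(sequenced_snp1, observed_snp2), (blocked_snp1, unobserved_snp2)}


# ddGTP blocks the chromosome bearing C at SNP-1; a G is then read at SNP-2.
print(deduce_haplotypes(("A", "C"), ("G", "T"), terminator="G", observed_snp2="G"))
```

Running the complementary experiment with the other terminator gives an independent confirmation, since both experiments must yield the same pair of haplotypes.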
HLA genotyping is one area where haplotyping is particularly useful. Genotyping of the two haplotypes of the HLA genes is crucial to the success of the transplantation of organs and bone marrow. In a preferred embodiment, the locus comprises a human Class I or Class II HLA gene.
The invention will now be described further by way of example only, with reference to the following specific drawings.
Figure 1 shows a target nucleic acid 1 and a non-target nucleic acid 2. The target nucleic acid and non-target nucleic acid each have a first region of common sequence 3, a first region of dissimilar sequence 4 and a second region of dissimilar sequence 6. In the embodiment shown, a second region of common sequence 5 lies between the first and second regions of dissimilar sequence. Third and fourth regions of dissimilar sequence (8 and 10) and third, fourth and fifth regions of common sequence (7, 9 and 11) are also shown.
Figure 2 shows a blocking oligonucleotide (B) which is complementary to at least a portion of the first region of dissimilar sequence of the non-target nucleic acid sequence and which hybridises thereto.
Figure 3 shows a sequencing primer (12) which is complementary to the first region of common sequence and which hybridises thereto. A sequencing reaction proceeds in the direction of the arrow 13, such that the primer 12 is extended in the direction of the arrow using the target nucleic acid sequence as a template. The blocking oligonucleotide (B) blocks the sequencing reaction at least from proceeding into the second region of dissimilar sequence of the non-target nucleic acid sequence.
Figure 4 shows sequencing reaction products (14 to 18) resulting from extension of the primer using the target nucleic acid as a template. The sequencing reaction proceeds at least as far as the second region of dissimilar sequence.
Figure 5 shows a sequencing reaction product (19) resulting from extension of the primer using the non-target nucleic acid as a template. The sequencing reaction does not proceed as far as the second region of dissimilar sequence.
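The arrangement shown in Figures 1 to 5 can be mimicked with a toy model. This is a sketch under strong simplifying assumptions: hybridisation is modelled as an exact match, the blocking oligonucleotide is taken to halt extension at the position where it is bound, and all sequences, region boundaries and the function name `extend` are hypothetical. Templates are written in the order the polymerase reads them, and the primer and blocking oligonucleotide are written 5' to 3'.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def extend(template: str, primer: str, blocker: str) -> str:
    """Return the bases added to the primer; extension halts where the blocker is bound."""
    paired = template.translate(COMPLEMENT)  # strand that base-pairs with the template
    start = paired.find(primer)
    if start == -1:
        return ""  # primer does not hybridise
    first_free = start + len(primer)
    block_at = paired.find(blocker, first_free)
    stop = block_at if block_at != -1 else len(paired)
    return paired[first_free:stop]


# Regions: common 1 | dissimilar 1 | common 2 | dissimilar 2
target = "TTGACC" + "A" + "GGT" + "G"
non_target = "TTGACC" + "C" + "GGT" + "T"

primer = "AACTGG"   # complementary to the first region of common sequence
blocker = "GCCA"    # complementary to dissimilar region 1 (and common 2) of the non-target

print(extend(target, primer, blocker))      # extends through dissimilar region 2
print(extend(non_target, primer, blocker))  # no extension past the blocker
```

In this model the only product that reaches the second region of dissimilar sequence is derived from the target, which corresponds to products 14 to 18 of Figure 4 as opposed to product 19 of Figure 5.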
Claims
1. A method for determining a target nucleic acid sequence, wherein the target nucleic acid sequence is comprised in a preparation comprising a non-target nucleic acid sequence, the target nucleic acid sequence and the non-target nucleic acid sequence each having a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the method comprising:
(a) contacting the preparation with a blocking oligonucleotide complementary to at least a portion of the first region of dissimilar sequence of the non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto;
(b) contacting the preparation with a sequencing primer complementary to at least a portion of the first region of common sequence, under conditions to hybridise the primer to the target nucleic acid sequence; and
(c) subjecting the preparation to a sequencing reaction, such that the sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequence, thereby determining at least the second region of dissimilar sequence of the target nucleic acid sequence;
and wherein the blocking oligonucleotide blocks the sequencing reaction at least from proceeding into the second region of dissimilar sequence of the non-target nucleic acid sequence.
2. A method according to claim 1, wherein the target nucleic acid sequence and the non-target nucleic acid sequence each have a second region of common sequence which lies between the first and second regions of dissimilar sequence.
3. A method according to claim 1 or claim 2, wherein step (a) further comprises a step of contacting the preparation with a terminator nucleotide, under conditions to incorporate the terminator nucleotide into the blocking oligonucleotide hybridised to the non-target nucleic acid sequence.
4. A method according to claim 3, wherein the terminator nucleotide is a dideoxy nucleotide.
5. A method according to any preceding claim, wherein hybridisation of the blocking oligonucleotide to the non-target nucleic acid sequence is capable of inhibiting primer binding to the non-target nucleic acid sequence.
6. A method according to any of claims 1 to 4, wherein hybridisation of the blocking oligonucleotide to the non-target nucleic acid sequence is capable of inhibiting extension of the sequencing primer hybridised to the non-target nucleic acid sequence.
7. A method according to any preceding claim, wherein step (a) further comprises contacting the preparation with a cleavage agent which recognises a double-stranded recognition sequence comprising at least a part of the sequence of the blocking oligonucleotide, under conditions to cleave the non-target nucleic acid sequence.
8. A method according to claim 7, wherein step (a) comprises contacting the preparation with the blocking oligonucleotide, subjecting the preparation to a polymerisation reaction, under conditions to extend the blocking oligonucleotide hybridised to the non-target nucleic acid sequence, and contacting the preparation with the cleavage agent, under conditions to cleave the non-target nucleic acid sequence within the second region of common sequence.
9. A method according to claim 7 or claim 8, wherein the cleavage agent comprises a restriction endonuclease.
10. A method according to claim 9, wherein the restriction endonuclease recognises a recognition sequence comprising at least a part of the first region of dissimilar sequence of the non-target nucleic acid sequence.
11. A method according to claim 9 or claim 10, wherein the restriction endonuclease recognises a recognition sequence comprising at least a part of the second region of common sequence.
12. A method according to claim 7 or claim 8, wherein the cleavage agent comprises a chemical cleavage agent.
13. A method according to any of claims 3 to 12, wherein the terminator nucleotide is capable of covalently cross-linking the blocking oligonucleotide to the non-target nucleic acid.
14. A method according to any preceding claim, wherein the second region of dissimilar sequence comprises a single nucleotide.
15. A method according to any preceding claim, wherein the first region of dissimilar sequence comprises a single nucleotide.
16. A method according to any preceding claim, wherein the sequencing reaction comprises a method of sequencing based on the detection of the release of pyrophosphate.
17. A method according to claim 16, wherein the sequencing reaction comprises pyrosequencing.
18. A method according to any preceding claim, wherein the preparation comprises DNA derived from two or more individuals.
19. A method for determining a plurality of target nucleic acid sequences, wherein the plurality of target nucleic acid sequences is comprised in a preparation further comprising a plurality of corresponding non-target nucleic acid sequences, each target nucleic acid sequence in the preparation corresponds to one or more corresponding non-target nucleic acid sequences in the preparation, each target nucleic acid sequence and each corresponding non-target nucleic acid sequence has a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, the first region of common sequence of each target nucleic acid sequence is the same as the first region of common sequence of its corresponding non-target nucleic acid sequences, the first region of dissimilar sequence of each target nucleic acid sequence is different to the first region of dissimilar sequence of its corresponding non-target nucleic acid sequences, the second region of dissimilar sequence of each target nucleic acid sequence is different to the second region of dissimilar sequence of its corresponding non-target nucleic acid sequences, which method comprises:
(a) contacting the preparation with a plurality of blocking oligonucleotides wherein each blocking oligonucleotide is complementary to at least a portion of the first region of dissimilar sequence of a non-target nucleic acid sequence, under conditions to hybridise the blocking oligonucleotide thereto;
(b) contacting the preparation with a plurality of sequencing primers, wherein each primer is complementary to at least a portion of the first region of common sequence of a target nucleic acid sequence and its corresponding non-target nucleic acid sequence, under conditions to hybridise the primer thereto; and
(c) subjecting the preparation to a sequencing reaction, such that the sequencing reaction proceeds into the second region of dissimilar sequence of the target nucleic acid sequences, thereby determining at least the second region of dissimilar sequence of each target nucleic acid sequence;
and wherein the blocking oligonucleotides block the sequencing reaction at least from proceeding into the second region of dissimilar sequence of each corresponding non-target nucleic acid sequence.
20. A method according to any preceding claim, wherein the target nucleic acid sequence and the non-target nucleic acid sequence comprise one or more further regions of dissimilar sequence downstream of the second region of dissimilar sequence.
21. A method for determining the haplotype of a subject from a sample comprising DNA from the subject, comprising a method as defined in any preceding claim, wherein the preparation comprises the sample, the target nucleic acid sequence comprises a locus on a first chromosome of a pair of chromosomes, the non-target nucleic acid sequence comprises the corresponding locus on the second chromosome of the pair, the locus comprising two or more single nucleotide polymorphisms for which the subject is heterozygous, wherein the sequencing reaction is conducted to determine the sequence of the locus on the first chromosome of the pair thereby determining the haplotype of the subject.
22. A method according to claim 21, where the locus comprises a human Class I or Class II HLA gene.
23. Use of pyrosequencing for determining the haplotype of a subject from a sample comprising DNA from the subject, wherein pyrosequencing is used to sequence a target locus on a first chromosome of a pair, the target locus comprising two or more single nucleotide polymorphisms, the corresponding locus on the second chromosome of the pair being blocked from sequencing by a blocking oligonucleotide hybridised to the second chromosome.
24. Use according to claim 21, wherein the blocking oligonucleotide is hybridised to a region of the corresponding locus on the second chromosome which comprises a single nucleotide polymorphism.
25. A kit for determining one or more target nucleic acid sequences, wherein the one or more target nucleic acid sequences is comprised in a preparation comprising one or more non-target nucleic acid sequences, the one or more target nucleic acid sequences and the one or more non-target nucleic acid sequences each having a first region of common sequence upstream of a first region of dissimilar sequence upstream of a second region of dissimilar sequence, which kit comprises one or more blocking oligonucleotides complementary to at least a portion of the first region of dissimilar sequence of the one or more non-target nucleic acid sequences and one or more sequencing primers complementary to at least a portion of the first region of common sequence.
26. A kit according to claim 25, which further comprises one or more terminator nucleotides.
27. A kit according to claim 26, wherein the terminator nucleotide comprises a dideoxy nucleotide.
28. A kit according to claim 27, wherein the kit includes dideoxy-ATP, dideoxy-CTP, dideoxy-GTP and/or dideoxy-TTP.
29. A kit according to any of claims 25 to 28, further comprising deoxy-ATP, deoxy-CTP, deoxy-GTP, deoxy-TTP, a DNA polymerase, ATP sulfurylase, firefly luciferase and/or a nucleotide-degrading enzyme.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0406863A GB0406863D0 (en) | 2004-03-26 | 2004-03-26 | Nucleic acid sequencing |
PCT/IB2005/000771 WO2005093101A1 (en) | 2004-03-26 | 2005-03-24 | Nucleic acid sequencing |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1737978A1 (en) | 2007-01-03 |
Family
ID=32188784
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP05718267A Withdrawn EP1737978A1 (en) | 2004-03-26 | 2005-03-24 | Nucleic acid sequencing |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1737978A1 (en) |
JP (1) | JP2007530026A (en) |
GB (1) | GB0406863D0 (en) |
WO (1) | WO2005093101A1 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002537762A (en) | 1999-01-05 | 2002-11-12 | トラスティーズ オブ ボストン ユニバーシティ | Improved nucleic acid cloning |
US7435562B2 (en) | 2000-07-21 | 2008-10-14 | Modular Genetics, Inc. | Modular vector systems |
CA2590245A1 (en) | 2004-11-11 | 2006-05-18 | Modular Genetics, Inc. | Ladder assembly and system for generating diversity |
US7700287B2 (en) | 2005-01-28 | 2010-04-20 | Life Technologies Corporation | Compositions and methods for terminating a sequencing reaction at a specific location in a target DNA template |
US8043814B2 (en) | 2007-07-31 | 2011-10-25 | Eric Guilbeau | Thermoelectric method of sequencing nucleic acids |
US8071338B2 (en) | 2007-08-08 | 2011-12-06 | Roche Molecular Systems, Inc. | Suppression of amplification using an oligonucleotide and a polymerase significantly lacking 5′-3′ nuclease activity |
JP2010172323A (en) * | 2009-02-02 | 2010-08-12 | Nipro Corp | Method for detecting single nucleotide polymorphism |
JP5795341B2 (en) | 2010-03-08 | 2015-10-14 | デイナ ファーバー キャンサー インスティチュート,インコーポレイテッド | FullCOLD-PCR enrichment with reference block sequence |
WO2012118802A1 (en) * | 2011-02-28 | 2012-09-07 | Transgenomic, Inc. | Kit and method for sequencing a target dna in a mixed population |
US11130992B2 (en) | 2011-03-31 | 2021-09-28 | Dana-Farber Cancer Institute, Inc. | Methods and compositions to enable multiplex COLD-PCR |
US9133490B2 (en) | 2012-05-16 | 2015-09-15 | Transgenomic, Inc. | Step-up method for COLD-PCR enrichment |
CN105392897B (en) * | 2013-03-19 | 2020-03-20 | 定向基因组学公司 | Enrichment of target sequences |
WO2018111835A1 (en) | 2016-12-12 | 2018-06-21 | Dana-Farber Cancer Institute, Inc. | Compositions and methods for molecular barcoding of dna molecules prior to mutation enrichment and/or mutation detection |
US11174511B2 (en) | 2017-07-24 | 2021-11-16 | Dana-Farber Cancer Institute, Inc. | Methods and compositions for selecting and amplifying DNA targets in a single reaction mixture |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1293204C (en) * | 2000-08-30 | 2007-01-03 | 戴诺生物技术有限公司 | Method for determining alleles |
GB0022069D0 (en) * | 2000-09-08 | 2000-10-25 | Pyrosequencing Ab | Method |
AU2003241401B8 (en) * | 2002-05-10 | 2009-08-06 | City Of Hope | Pyrophosphorolysis activated polymerization (PAP) |
AU2003256298A1 (en) * | 2002-06-25 | 2004-01-06 | Pel-Freez Clinical Systems, Llc | Method for sequencing nucleic acids |
WO2004003173A2 (en) * | 2002-07-01 | 2004-01-08 | Cleveland State University | Method for detecting mutated polynucleotides within a large population of wild-type polynucleotides |
- 2004
  - 2004-03-26 GB GB0406863A patent/GB0406863D0/en not_active Ceased
- 2005
  - 2005-03-24 JP JP2007504507A patent/JP2007530026A/en active Pending
  - 2005-03-24 EP EP05718267A patent/EP1737978A1/en not_active Withdrawn
  - 2005-03-24 WO PCT/IB2005/000771 patent/WO2005093101A1/en active Application Filing
Non-Patent Citations (3)
Title |
---|
HANNA M M ET AL: "RNA-protein crosslinking with photoreactive nucleotide analogs", 1 January 1999, RNA-PROTEIN INTERACTION PROTOCOLS, METHODS IN MOLECULAR BIOLOGY, HUMANA PRESS, TOTOWA, NJ, PAGE(S) 21 - 33, ISBN: 978-0-89603-568-3, XP008105674 * |
See also references of WO2005093101A1 * |
STUMP W T ET AL: "Crosslinking of an iodo-uridine-RNA hairpin to a single site on the human U1A N-terminal RNA binding domain.", RNA (NEW YORK, N.Y.) MAR 1995 LNKD- PUBMED:7489489, vol. 1, no. 1, March 1995 (1995-03-01), pages 55 - 63, ISSN: 1355-8382 * |
Also Published As
Publication number | Publication date |
---|---|
WO2005093101A1 (en) | 2005-10-06 |
JP2007530026A (en) | 2007-11-01 |
GB0406863D0 (en) | 2004-04-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK1991698T3 (en) | "High-throughput" -sekvensbaseret detection of SNPs using ligeringsassays | |
US6506568B2 (en) | Method of analyzing single nucleotide polymorphisms using melting curve and restriction endonuclease digestion | |
CA2421078A1 (en) | Method for determining alleles | |
Nilsson et al. | Making ends meet in genetic analysis using padlock probes | |
CA2366374C (en) | Method for the detection and/or analysis, by means of primer extension techniques, of single nucleotide polymorphisms in restriction fragments, in particular in amplified restriction fragments generated using aflp | |
US20050100911A1 (en) | Methods for enriching populations of nucleic acid samples | |
WO2005093101A1 (en) | Nucleic acid sequencing | |
US20190185933A1 (en) | Detection and quantification of rare variants with low-depth sequencing via selective allele enrichment or depletion | |
EP2982762B1 (en) | Nucleic acid amplification method using allele-specific reactive primer | |
WO2002040126A2 (en) | Methods for identifying nucleotides at defined positions in target nucleic acids using fluorescence polarization | |
US20080305470A1 (en) | Nucleic Acid Sequencing | |
US20060008826A1 (en) | Method for determining alleles | |
Best et al. | Molecular pathology methods | |
US20030235827A1 (en) | Methods and compositions for monitoring primer extension and polymorphism detection reactions | |
US8008002B2 (en) | Nucleic acid sequencing | |
US20110257018A1 (en) | Nucleic acid sequencing | |
JP2000513202A (en) | Large-scale screening of nucleic acid sequencing or genetic replacement | |
KR100874378B1 (en) | Korean Beef Meat Discrimination Method Using Single Base Polymorphism | |
Smith-Zagone et al. | Molecular pathology methods | |
WO2003020950A2 (en) | Methods and compositions for bi-directional polymorphism detection | |
US20040038256A1 (en) | Methods for identifying nucleotides at defined positions in target nucleic acids using fluorescence polarization | |
JP2018000057A (en) | Methods for preparing dna probes and methods for genomic dna analysis using the dna probes | |
Park et al. | DNA Microarray‐Based Technologies to Genotype Single Nucleotide Polymorphisms | |
WO2009098998A1 (en) | Nucleic acid detection method, and nucleic acid detection kit | |
WO2003070977A2 (en) | Method for detecting single nucleotide polymorphisms |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20061026 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
DAX | Request for extension of the european patent (deleted) | ||
RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: LARSEN, FRANK |
|
17Q | First examination report despatched |
Effective date: 20080630 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20131001 |