EP4036248A1 - Method for library preparation in next generation sequencing by enzymatic DNA fragmentation
- Publication number
- EP4036248A1 (application EP21154220.4A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- nicks
- triphosphate
- polynucleotides
- nucleotides
- nucleic acid
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
Definitions
- dUTP: deoxyuridine triphosphate
- As A, T, G, C and U nucleotides, the known building blocks for oligonucleotide synthesis, such as dUTP nucleotides, can be used.
- the A, T, G, C and U nucleotides are provided as Adenosine 5'-Triphosphate (ATP), 2'-Deoxyadenosine 5'-Triphosphate (dATP), Thymidine 5'-Triphosphate (TTP), 2'-Deoxythymidine 5'-Triphosphate (dTTP), Guanosine 5'-Triphosphate (GTP), 2'-Deoxyguanosine 5'-Triphosphate (dGTP), Cytidine 5'-Triphosphate (CTP), 2'-Deoxycytidine 5'-Triphosphate (dCTP), 2'-Deoxyuridine 5'-Triphosphate (dUTP) or Uridine 5'-Triphosphate (UTP).
- ATP: Adenosine 5'-Triphosphate
- dATP: 2'-Deoxyadenosine 5'-Triphosphate
- TTP: Thymidine 5'-Triphosphate
- the target nucleic acid library obtained by the method of the invention may be sequenced.
- the method for sequencing is not particularly important, and any method for sequencing known in the art can be used for this purpose.
- the oligonucleotide sequence coupled to the nicks is preferably an adaptor or primer sequence, such as a PCR starter sequence which can be used for amplification purposes, or a sequencing primer binding sequence which can be used for sequencing the target nucleic acid library.
- the method of the invention provides a novel approach for the statistical fragmentation of polynucleotides that can be utilized for the generation of sequencing libraries derived from target nucleic acids.
- this method incorporates uracil nucleotides during a polymerisation step; these are subsequently converted into nicks (Fig. 1).
- a key step of the method is the initial polymerisation step which is already part of many nucleic acid library preparation methods. During this polymerisation step, dUTP or ddUTP nucleotides are incorporated into the polynucleotides being synthesized.
- the target nucleic acids may be derived from genomic DNA, RNA or a plurality of DNA molecules comprising 50 to 2000 nucleotides.
- Step a: multiplying the target nucleic acids
- the target nucleic acids are provided at the 3' and 5' ends with primer sequences for amplification.
- Fig. 2 depicts a method using messenger RNA as starting material.
- cDNA is synthesized using reverse transcriptase and an oligo(dT) primer (oligonucleotide with multiple T nucleotides); the oligo(dT) primer may contain one or more additional nucleotides at the 3' end; the oligo(dT) primer may also contain a specific nucleic acid sequence 5' to the oligo(dT) stretch (adaptor 1 containing a specific primer binding sequence 1; this adaptor is depicted with upward diagonal stripes).
- a second specific sequence is introduced (adaptor 2 containing specific primer binding sequence 2; this adaptor is depicted with a solid box) using the template switching approach (Chenchik et al., 1998).
- the two specific primers may also be introduced using random priming during reverse transcription and/or during a subsequent second strand cDNA synthesis step.
- This newly synthesized cDNA is then amplified in the presence of dUTP by a polymerase using primers specific to the primer binding sequences incorporated during the cDNA synthesis.
- UTP or dUTP nucleotides may already be added during the reverse transcription and/or second strand synthesis step; in this case, the amplification step may be omitted.
- step a) is conducted by polymerase chain reaction.
- Fig. 3 depicts a method using targeted enrichment (specific amplification) of one or multiple nucleic acid targets; in this example, the target enrichment is conducted by using one primer specific to a sequence already present in the template nucleic acid, and a second primer specific to the target or targets of interest.
- the targeted amplification is conducted using specific primers, a polymerase and nucleotides, including dUTPs.
- the amplification steps mentioned in the descriptions for Fig. 3 and Fig. 4 can be conducted by polymerase chain reaction using Taq polymerase (thermostable DNA polymerase I of Thermus aquaticus) or proof-reading polymerases capable of mediating polymerase chain reactions.
- the amplification can be achieved using loop-mediated isothermal amplification.
- Fig. 4 depicts a method using linear amplification for the amplification of nucleic acids in the presence of dUTP nucleotides, for example using Phi29 polymerase for the amplification of whole genomes (Silander et al., 2008).
- Step b: fragmentation of the polynucleotides
- the newly synthesized nucleic acids are subsequently treated with an enzyme mixture capable of removing uracil nucleotides thereby creating nicks.
- nicks are generated by providing one or more enzymes selected from the group consisting of DNA glycosylases (for example uracil-DNA glycosylase, UDG), endonucleases (for example Endonuclease III or Endonuclease VIII), or engineered recombinant proteins (for example USER enzyme and thermolabile USER II enzyme).
- commercial enzymes or enzyme mixes like the USER enzyme or the thermolabile USER enzyme from New England Biolabs may be used (Cat. No M5508 and M5507, New England Biolabs, Ipswich, MA, USA).
- the creation of nicks can also be performed by applying elevated temperatures or chemicals.
- the number of uracil bases in the newly synthesized nucleic acids can be tuned by adjusting the ratio between dUTP/ddUTP and dTTP/ddTTP nucleotides during the polymerisation step.
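As a worked illustration of this ratio adjustment, the following sketch splits a fixed pool of thymine-type nucleotides into dTTP and dUTP for a chosen T:U molar ratio. The function name `split_t_pool` and the 200 µM pool size are assumptions made for the example; they are not values taken from the source.

```python
def split_t_pool(total_um: float, t_to_u_ratio: float) -> tuple[float, float]:
    """Return (dTTP, dUTP) concentrations in µM for a T:U molar ratio of t_to_u_ratio:1."""
    if total_um <= 0 or t_to_u_ratio <= 0:
        raise ValueError("total_um and t_to_u_ratio must be positive")
    # One part dUTP per t_to_u_ratio parts dTTP.
    dutp = total_um / (t_to_u_ratio + 1.0)
    dttp = total_um - dutp
    return dttp, dutp


if __name__ == "__main__":
    # Hypothetical 200 µM combined dTTP + dUTP pool, evaluated at the
    # two ratio limits named in the abstract (25:1 and 150:1).
    for ratio in (25.0, 150.0):
        dttp, dutp = split_t_pool(200.0, ratio)
        print(f"T:U = {ratio:.0f}:1 -> dTTP {dttp:.2f} µM, dUTP {dutp:.2f} µM")
```

The same arithmetic applies to any pool size, since only the ratio between the two nucleotides determines how often uracil is offered to the polymerase.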
- the fragment length depends on the relative abundance of dUTP/ddUTP during the polymerization step: the higher the proportion of dUTP/ddUTP, the shorter the fragments. The fragment length can therefore be statistically tuned by adjusting the relative abundance of dUTP/ddUTP in the polymerization step.
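The statistical relationship can be sketched with a deliberately simple model. It assumes that the polymerase incorporates dUTP and dTTP with equal efficiency and that thymine positions make up a fixed share of the synthesized strand (25% here); both are assumptions for illustration, not statements from the source. Under these assumptions each base is a uracil with probability `dutp_fraction * t_content`, and the mean spacing between uracil sites on one strand is the reciprocal of that probability.

```python
def mean_uracil_spacing(dutp_fraction: float, t_content: float = 0.25) -> float:
    """Expected distance in nucleotides between uracil sites on one new strand.

    dutp_fraction: dUTP / (dUTP + dTTP) in the reaction (assumed to equal the
                   fraction of T positions that receive a uracil).
    t_content:     assumed fraction of positions in the strand that are T.
    """
    if not 0.0 < dutp_fraction <= 1.0:
        raise ValueError("dutp_fraction must be in (0, 1]")
    if not 0.0 < t_content <= 1.0:
        raise ValueError("t_content must be in (0, 1]")
    per_base_probability = dutp_fraction * t_content
    return 1.0 / per_base_probability


if __name__ == "__main__":
    # dUTP percentages as used in Example 1 of this document.
    for percent in (20.0, 4.0, 0.8, 0.16):
        spacing = mean_uracil_spacing(percent / 100.0)
        print(f"{percent:g}% dUTP -> roughly {spacing:.0f} nt between uracil sites")
```

In this model the spacing scales with the inverse of the dUTP fraction, so a fivefold dilution of dUTP gives fragments about five times longer. Real fragment sizes will also depend on polymerase preference for dUTP and on the sequence composition of the template.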
- Step c: coupling oligonucleotides to the nicks
- This section lists multiple preferred embodiments for creating nucleic acid libraries from nucleic acid fragments generated by incorporation of uracil nucleotides and subsequent excision of these uracil nucleotides.
- the oligonucleotide sequences coupled to the nicks are primer sequences.
- nucleic acid fragments generated using the method introduced in Fig. 2 are depicted.
- The embodiment depicted in Fig. 5 first creates blunt ends to which a specific oligonucleotide adaptor is subsequently ligated. This is achieved by separating the fragmented nucleic acids, followed by treatment of the fragmented nucleic acids with an enzyme or an enzyme mix exhibiting a 5'→3' polymerase activity and a 3'→5' exonuclease activity.
- a reverse complementary second strand is synthesized using the 5'→3' polymerase activity ("fill-in"); at fragments with 3' protruding ends, the protrusion is removed using the 3'→5' exonuclease activity. After this treatment, all fragments have blunt ends.
- one or more A nucleotides are added to the 3' end of the fragments ("A-tailing").
- A-tailing is achieved by either using an enzyme with A-tailing activity for the reaction above, or by an additional treatment with an enzyme exhibiting A-tailing activity.
- a double-stranded oligonucleotide (adaptor) is ligated to the fragments (either through blunt end ligation or with a double stranded oligonucleotide containing a T overhang in case the fragments were treated with an enzyme with A-tailing activity).
- the double-stranded adaptor used for ligation contains one or two specific primer binding sequences.
- the adaptor might be partially single-stranded.
- the nucleic acid library may be sequenced.
- the primer sequence or sequences added during adaptor ligation can be used for subsequent sequencing of the nucleic acid library.
- the sequencing library can be amplified before sequencing.
- Through the design of the adaptor, specific parts of the nucleic acid fragments can be amplified.
- Fig. 6 depicts multiple different adaptor designs.
- an adaptor with a single primer binding sequence (specific primer binding site 3; depicted with downward diagonal stripes) is ligated to the nucleic acid fragments.
- the library fragments containing the 5' end of the original fragment can be specifically amplified using primers specific to primer binding sequence 2 and 3.
- Library fragments containing the 3' end of the original fragment can be specifically amplified using primers specific to primer binding sequence 1 and 3.
- the intermediate fragments will not efficiently amplify, as fragments with the same primer binding sequences (primer binding sequence 3) will form intramolecular hairpins, which prevent the binding of primers to the primer binding sites.
- a Y-shaped adaptor with two different primer binding sequences is ligated to the nucleic acid fragments.
- the library fragments containing the 5' end of the original fragment can be specifically amplified using primers specific to primer binding sequence 2 and 3.
- Library fragments containing the 3' end of the original fragment can be specifically amplified using primers specific to primer binding sequence 1 and 4.
- the intermediate fragments can be amplified using primers specific to primer binding sequences 3 and 4.
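The primer pairings listed above for the single-sequence adaptor and the Y-shaped adaptor can be collected in a small lookup table. The labels `"single"`, `"y_shaped"`, `"5_prime_end"`, `"3_prime_end"` and `"intermediate"` are hypothetical names chosen for this sketch; the numbers are the primer binding sequences as numbered in the text.

```python
from typing import Optional, Tuple

# (adaptor design, fragment class) -> primer binding sequences used for amplification.
# None marks fragments that are not expected to amplify efficiently.
PRIMER_PAIRS = {
    ("single", "5_prime_end"): (2, 3),
    ("single", "3_prime_end"): (1, 3),
    # Both ends carry sequence 3, so intramolecular hairpins block primer binding.
    ("single", "intermediate"): None,
    ("y_shaped", "5_prime_end"): (2, 3),
    ("y_shaped", "3_prime_end"): (1, 4),
    ("y_shaped", "intermediate"): (3, 4),
}


def primer_pair(adaptor_design: str, fragment_class: str) -> Optional[Tuple[int, int]]:
    """Look up which primer binding sequences amplify a given fragment class."""
    key = (adaptor_design, fragment_class)
    if key not in PRIMER_PAIRS:
        raise ValueError(f"unknown combination: {key}")
    return PRIMER_PAIRS[key]


if __name__ == "__main__":
    for (design, fragment), pair in PRIMER_PAIRS.items():
        label = "not efficiently amplified" if pair is None else f"primers for sequences {pair}"
        print(f"{design:9s} {fragment:13s} {label}")
```

The table makes the practical difference between the two designs visible: only the Y-shaped adaptor gives the intermediate fragments two distinct primer binding sequences.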
- the nucleic acid fragments are first denatured (e.g. using heat or by increasing the pH): thereby, the nucleic acid fragments become single-stranded.
- a single-stranded oligonucleotide containing a specific primer binding site (adaptor 3 with primer sequence 3, depicted with downward diagonal stripes) is ligated to the 5' end of the single stranded nucleic acid fragments.
- the oligonucleotide carries an adenylation modification at its 5' end (5' App).
- the ligation reaction is catalysed using the Thermostable 5' App DNA/RNA Ligase from New England Biolabs (Cat. No M0319, New England Biolabs, Ipswich, MA, USA) or an equivalent enzyme.
- the resulting nucleic acid library can either be sequenced directly or amplified using specific primer sets.
- primer sequence 1: depicted with upward diagonal stripes
- primer sequence 2: depicted as a solid box
- primer sequence 3: depicted with downward diagonal stripes; used for the amplification of fragments containing the 5' end
- the nucleic acid fragments are first denatured (e.g. using heat or by increasing the pH): thereby, the nucleic acid fragments become single-stranded.
- the single-stranded nucleic acid fragments are incubated with terminal transferase and a single type of nucleotide, thereby creating a mononucleotide (homopolymer) tail at the 3' end of the nucleic acid fragments.
- the nucleotide is ATP, resulting in a poly-A tail at the 3' end of the nucleic acid fragments.
- the fragments containing the 5' end of the original fragment can be amplified by a specific primer with a poly-T stretch at the 3' end of the primer (which binds to the poly-A tail of the library) and a primer specific for sequence 2 (depicted in solid); the fragments containing the 3' end of the original fragment can be amplified using the same poly-T stretch containing primer and a primer specific for sequence 1 (upward diagonal stripes).
- Example 1: The fragment size can be adjusted by the ratio between dUTP and dTTP during amplification
- Condition 1: 20% dUTP, 80% dTTP
- Condition 2: 4% dUTP, 96% dTTP
- Condition 3: 0.8% dUTP, 99.2% dTTP
- Condition 4: 0.16% dUTP, 99.84% dTTP
- Condition 5: dTTP only
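To relate these percentages to the T:U molar ratio used in the abstract (between 150:1 and 25:1), the conversion can be written out as a short calculation. The helper names below are hypothetical, and the range limits are simply those quoted in the abstract.

```python
import math

# Percent dUTP in the dUTP + dTTP pool for each condition of Example 1.
DUTP_PERCENT = {1: 20.0, 2: 4.0, 3: 0.8, 4: 0.16, 5: 0.0}


def t_to_u_ratio(dutp_percent: float) -> float:
    """Convert a dUTP percentage into the T:U molar ratio (x in 'x:1')."""
    if not 0.0 <= dutp_percent < 100.0:
        raise ValueError("dutp_percent must be in [0, 100)")
    if dutp_percent == 0.0:
        return math.inf  # dTTP only: no uracil is offered at all
    return (100.0 - dutp_percent) / dutp_percent


def in_abstract_range(ratio: float, low: float = 25.0, high: float = 150.0) -> bool:
    """True if the T:U ratio lies between 25:1 and 150:1 inclusive."""
    return low <= ratio <= high


if __name__ == "__main__":
    for condition, percent in DUTP_PERCENT.items():
        ratio = t_to_u_ratio(percent)
        print(f"condition {condition}: T:U = {ratio:g}:1, "
              f"within 25:1 to 150:1: {in_abstract_range(ratio)}")
```

By this arithmetic the five conditions span T:U ratios from 4:1 up to a reaction without any dUTP, so the example brackets the range named in the abstract on both sides.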
- Example 2: The fragment size is independent of the template input amount.
- Figure 10 shows the results for the two different template concentrations after USER treatment: for all dUTP concentrations, the fragment distribution was very similar regardless of the template concentration. This shows that, with the proposed method, the statistical size of the nucleic acid fragments does not depend on the input amount. Instead, the fragment size can be fine-tuned by adjusting the relative abundance of dUTP in the amplification reaction. This property facilitates workflows that do not depend on accurate quantification of the starting material or intermediate products.
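The reasoning behind this input independence can be illustrated with a toy simulation. It is not a reproduction of Figure 10: it assumes that every base of a newly synthesized strand becomes a cut site with the same fixed probability, set only by the dUTP fraction, so the number of template molecules changes how many fragments are produced but not their length distribution. The strand length, cut probability and molecule counts are arbitrary values chosen for the sketch.

```python
import random
from statistics import mean


def fragment_lengths(n_molecules: int, length: int, cut_probability: float,
                     rng: random.Random) -> list[int]:
    """Fragment each of n_molecules strands of `length` bases at random uracil sites."""
    lengths = []
    for _ in range(n_molecules):
        run = 0
        for _ in range(length):
            if rng.random() < cut_probability:
                # A uracil is excised here: close the current fragment, if any.
                if run > 0:
                    lengths.append(run)
                run = 0
            else:
                run += 1
        if run > 0:
            lengths.append(run)
    return lengths


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same numbers
    for n_molecules in (200, 2000):  # tenfold difference in simulated input
        sizes = fragment_lengths(n_molecules, length=1000,
                                 cut_probability=0.01, rng=rng)
        print(f"{n_molecules} molecules: {len(sizes)} fragments, "
              f"mean length {mean(sizes):.1f} nt")
```

In this model the mean fragment length stays essentially the same for both inputs, while changing `cut_probability` shifts it, which mirrors the behaviour described for the dUTP-based method.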
- the remaining 16 µl of the amplification/Thermolabile USER II reaction were purified using 1x SPRIselect beads (Cat. No. B23317, Beckman Coulter, Brea, CA, USA).
- the samples were eluted in 25 µl elution buffer and subjected to the same procedure of end repair, A-tailing and adaptor ligation as described for the non-purified counterpart.
Abstract
The invention is directed to a method for obtaining a nucleic acid library of a sample comprising polynucleotides comprising the steps
a. multiplying the polynucleotides by a polymerase
b. fragmenting the multiplied polynucleotides by creating nicks
c. coupling an oligonucleotide sequence to the nicks to create the target library
characterized in that
step a) is performed by providing A, T, G, C and U nucleotides wherein the molar ratio of T and U is between 150:1 and 25:1 and step b) is performed by excision of the U nucleotides.
Description
- Next Generation Sequencing is an emerging technology extending to all areas of Biomedical Research and Clinical Diagnostics. One of the key steps in Next Generation Sequencing is the Library Preparation (Library Prep).
- During Library Prep, the DNA to be sequenced is provided with specific sequences on both ends (adaptor sequences), to which the sequencing primer or amplification primers bind. To these adaptor sequences further sequences providing other information may be added, like specific sequences (barcodes) for the assignment of a Next Generation Sequencing read to a particular sample or a cell or a molecule.
- Many Next Generation Sequencing assays require the DNA of interest to be fragmented. Fragmentation techniques are disclosed, for example, in:
- Hess JF, Kohl TA, Kotrová M, Rönsch K, Paprotka T, Mohr V, Hutzenlaub T, Brüggemann M, Zengerle R, Niemann S, Paust N. Library preparation for next generation sequencing: A review of automation strategies. Biotechnol Adv. 2020 Jul-Aug;41:107537. doi: 10.1016/j.biotechadv.2020.107537. Epub 2020 Mar 19. PMID: 32199980.
- Head SR, Komori HK, LaMere SA, et al. Library construction for next-generation sequencing: overviews and challenges. Biotechniques. 2014;56(2):61-passim. Published 2014 Feb 1. doi:10.2144/000114133
- The known fragmentation techniques include:
- Physical shearing of nucleic acids, for example using the Covaris ultra-sonicator.
- Enzymatic fragmentation of nucleic acids using nucleases.
- Fragmentation of nucleic acids with the use of transposase, which semi-randomly inserts adaptor sequences into DNA.
- Physical shearing requires the purchase of an expensive instrument, cannot be automated, and cannot be parallelized for multiple samples when using a single instrument.
- The enzymatic fragmentation and tagmentation procedures have a significant disadvantage: the degree of fragmentation and tagmentation is very sensitive to the incubation time and to the amount of input DNA. This step therefore has to be very tightly controlled (by accurate quantification of the input amount and of the incubation time), and reagents have to be prechilled to prevent the reaction from starting prematurely.
- The requirement that reactions have to be pipetted on ice imposes a significant usability constraint, since many laboratories lack equipment for chilling reagents (diagnostics labs in particular, which use multiple physically separated rooms for contamination prevention, refrain from using chilling equipment such as ice machines). Also, incubations at low temperature make automation of the pipetting steps challenging because not every automation solution is capable of cooling reagents.
- The time criticality of incubation steps also impacts the scalability of workflows as some of the reactions will start immediately after addition of the sample.
- The methods of the prior art are compared in the following table
Criterion | Shearing | Tagmentation | Enzymatic fragmentation using nucleases |
---|---|---|---|
Sequence bias | Low | Strong bias in low-GC regions for Illumina Nextera kits using mutated Tn5 transposase | Lower sequence bias compared to tagmentation |
Importance of accurate quantification | | High | High |
Importance of controlled time and temperature | | High | High |
Multistep single-tube workflows possible? | Yes: end repair and ligation can be conducted in single tubes | Yes | Very difficult: majority of workflows require multiple cleanup steps |
Automated workflows possible? | No: shearing has to be conducted manually | Difficult: requires accurate quantification of low conc. DNA; requires time- and temperature-controlled pipetting | Difficult: requires accurate quantification of low conc. DNA; requires time- and temperature-controlled pipetting |
- In order to avoid the downsides of the known methods, it is proposed to use dUTP (deoxyuridine triphosphate) and enzymes catalyzing the excision of uracil nucleotides for fragmenting DNA during the Library Prep workflow of a Next Generation Sequencing assay.
- It was therefore an object of the invention to provide a method for obtaining a nucleic acid library of a sample comprising polynucleotides comprising the steps
- a. multiplying the polynucleotides by a polymerase
- b. fragmenting the multiplied polynucleotides by creating nicks
- c. coupling an oligonucleotide sequence to the nicks to create the target library
- As A, T, G, C and U nucleotides, the known building blocks for oligonucleotide synthesis like dUTP nucleotides can be used.
- Preferably, the A, T, G, C and U nucleotides are provided as Adenosine 5'-Triphosphate (ATP), 2'-Deoxyadenosine 5'-Triphosphate (dATP), Thymidine 5'-Triphosphate (TTP), 2'-Deoxythymidine 5'-Triphosphate (dTTP), Guanosine 5'-Triphosphate (GTP), 2'-Deoxyguanosine 5'-Triphosphate (dGTP), Cytidine 5'-Triphosphate (CTP) and 2'-Deoxycytidine 5'-Triphosphate (dCTP), 2'-Deoxyuridine 5'-Triphosphate (dUTP) or Uridine-5'-triphosphate (UTP). The person skilled in the art is aware that these compounds are available in their naturally occurring form or as chemically modified derivatives. In the method of the invention, the naturally occurring form of the nucleotides and/or a derivative thereof (i.e. a chemically modified version) can be used.
- The target nucleic acid library obtained by the method of the invention may be sequenced. The method for sequencing is not particularly important and any method for sequencing known in the art can be used for this purpose.
- The oligonucleotide sequence coupled to the nicks is preferably an adaptor or primer sequence like a PCR starter sequence which can be used for amplification purposes, or a sequencing primer binding sequence which can be used for sequencing the target nucleic acid library.
- Fig. 1 shows the principle of the method of the invention in a generic process.
- Fig. 2 depicts a variant using messenger RNA as starting material.
- Fig. 3 depicts a variant using targeted enrichment (specific amplification) of one or multiple nucleic acid targets.
- Fig. 4 depicts a variant using linear amplification for the amplification of nucleic acids in the presence of dUTP nucleotides with Phi29 polymerase.
- Fig. 5 depicts a method for generating a nucleic acid library using nucleic acids fragmented with the method of the invention by generating blunt ends followed by ligation of a specific nucleotide adapter.
- Fig. 6 depicts multiple different adaptor designs that can be used for the method shown in Fig. 5.
- Fig. 7 shows a variant of the method wherein the nucleic acid fragments are first denatured to obtain single-stranded nucleic acid fragments which are then provided with a specific nucleotide adapter.
- Fig. 8 shows a variant of the method wherein the nucleic acid fragments are first denatured to obtain single-stranded nucleic acid fragments which are then ligated with poly-A tails at the 3' ends.
- Fig. 9 to 11 show experimental results.
- The method of the invention provides a novel approach for statistical fragmenting of polynucleotides that can be utilized for the generation of sequencing libraries derived from target nucleic acids.
- Instead of the prior-art methods, which use physical shearing, nucleases or transposases for statistical fragmentation of target nucleic acids, this method incorporates uracil nucleotides during a polymerisation step; these are subsequently converted into nicks (Fig. 1).
- A key step of the method is the initial polymerisation step, which is already part of many nucleic acid library preparation methods. During this polymerisation step, dUTP or ddUTP nucleotides are incorporated into the polynucleotides being synthesized.
- In the method of the invention, the target nucleic acids may be derived from genomic DNA, RNA or a plurality of DNA molecules comprising 50 to 2000 nucleotides.
- Preferably, before step a), the target nucleic acids are provided at the 3' and 5' ends with primer sequences for amplification.
- Multiple polymerase-based methods exist which can be used for incorporating uracil bases into nucleic acid. The following sections contain three different methods that already are part of many workflows for the generation of sequencing libraries:
- Incorporation of uracil nucleotides during cDNA synthesis
- Incorporation of uracil nucleotides during PCR amplification
- Incorporation of uracil nucleotides during linear amplification
- Fig. 2 depicts a method using messenger RNA as starting material. First, cDNA is synthesized using reverse transcriptase and an oligo(dT) primer (oligonucleotide with multiple T nucleotides); the oligo(dT) primer may contain one or more additional nucleotides at the 3' end; the oligo(dT) primer may also contain a specific nucleic acid sequence 5' to the oligo(dT) stretch (adaptor 1 containing specific primer binding sequence 1; this adaptor is depicted with upward diagonal stripes).
- Once the reverse transcriptase reaches the 5' end of the mRNA, a second specific sequence is introduced (adaptor 2 containing specific primer binding sequence 2; this adaptor is depicted with a solid box) using the template switching approach (Chenchik et al., 1998).
- The two specific primer binding sequences may also be introduced using random priming during reverse transcription and/or during a subsequent second strand cDNA synthesis step.
- This newly synthesized cDNA is then amplified in the presence of dUTP by a polymerase using primers specific to the primer binding sequences incorporated during the cDNA synthesis. Alternatively, UTP or dUTP nucleotides may already be added during the reverse transcription and/or second strand synthesis step; in this case, the amplification step may be omitted.
- Preferably, step a) is conducted by polymerase chain reaction.
- Fig. 3 depicts a method using targeted enrichment (specific amplification) of one or multiple nucleic acid targets; in this example, the target enrichment is conducted by using one primer specific to a sequence already present in the template nucleic acid, and a second primer specific to the target or targets of interest. The targeted amplification is conducted using specific primers, a polymerase and nucleotides, including dUTPs.
- The amplification steps mentioned in the descriptions for Fig. 3 and Fig. 4 can be conducted by polymerase chain reaction using Taq polymerase (thermostable DNA polymerase I of Thermus aquaticus) or proof-reading polymerases capable of mediating polymerase chain reactions. Alternatively, the amplification can be achieved using loop-mediated isothermal amplification.
- Fig. 4 depicts a method using linear amplification for the amplification of nucleic acids in the presence of dUTP nucleotides, for example using Phi29 polymerase for the amplification of whole genomes (Silander et al., 2008).
- The newly synthesized nucleic acids are subsequently treated with an enzyme mixture capable of removing uracil nucleotides, thereby creating nicks.
- Preferably, nicks are generated by providing one or more enzymes selected from the group consisting of DNA glycosylases (for example Uracil DNA Glycosylase), endonucleases (for example Endonuclease III or Endonuclease VIII), or engineered recombinant proteins (for example USER enzyme and thermolabile USER II enzyme).
- Examples for such enzyme mixtures are uracil-DNA glycosylase (UDG) and endonuclease III or UDG and endonuclease VIII (Melamede et al., 1994; Jiang et al., 1997). Alternatively, commercial enzymes or enzyme mixes like the USER enzyme or the thermolabile USER enzyme from New England Biolabs may be used (Cat. No M5508 and M5507, New England Biolabs, Ipswich, MA, USA).
- In addition to providing enzymes, nicks can be created by applying elevated temperatures or chemicals.
- The number of uracil bases in the newly synthesized nucleic acids can be tuned by adjusting the ratio between dUTP/ ddUTP and dTTP/ ddTTP nucleotides during the polymerisation step. The higher the relative abundance of dUTP/ ddUTP, the more uracil nucleotides will be incorporated (replacing thymidine nucleotides).
- Since nicks are specifically generated at the sites of uracil nucleotides, the fragment length is inversely related to the relative abundance of dUTP/ddUTP during the polymerization step: the higher the relative abundance, the shorter the fragments. Therefore, the fragment length can be statistically tuned by adjusting the relative abundance of dUTP/ddUTP in the polymerization step.
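As a rough illustration of this tuning, the following sketch estimates the expected number of uracils per strand and the mean spacing between nicks on a single strand. It rests on assumptions that are not statements from the description: the polymerase incorporates dUTP and dTTP without preference, thymine positions make up a fixed fraction `t_fraction` of the strand, and every incorporated uracil is converted into a nick. The function names are hypothetical.

```python
def expected_uracils_per_strand(strand_length: int, dutp_fraction: float,
                                t_fraction: float = 0.25) -> float:
    """Expected number of uracils in one newly synthesized strand.

    dutp_fraction: share of the dTTP pool replaced by dUTP (0..1).
    t_fraction: assumed share of strand positions that call for T or U.
    """
    return strand_length * t_fraction * dutp_fraction


def expected_nick_spacing(dutp_fraction: float,
                          t_fraction: float = 0.25) -> float:
    """Mean distance in bases between neighbouring nicks on one strand."""
    if not 0 < dutp_fraction <= 1:
        raise ValueError("dutp_fraction must be in (0, 1]")
    if not 0 < t_fraction <= 1:
        raise ValueError("t_fraction must be in (0, 1]")
    # Each position is a nick site with probability t_fraction * dutp_fraction,
    # so the spacing follows a geometric distribution with this mean.
    return 1.0 / (t_fraction * dutp_fraction)


spacing_at_4_percent = expected_nick_spacing(0.04)
spacing_at_2_percent = expected_nick_spacing(0.02)
```

Under these assumptions, halving the dUTP fraction doubles the expected spacing between nicks. Observed double-stranded fragment sizes also depend on how nicks on the two strands combine, which this single-strand estimate ignores.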
- This section lists multiple preferred embodiments for creating nucleic acid libraries from nucleic acid fragments generated by incorporation of uracil nucleotides and subsequent excision of these uracil nucleotides.
- Optionally, the oligonucleotide sequences coupled to the nicks are primer sequences.
- To exemplify these embodiments, nucleic acid fragments generated using the method introduced in Fig. 2 (mRNA converted to cDNA with 5' and 3' specific adaptors introduced using template switching and oligo(dT) priming, respectively) are depicted.
- The embodiment shown in Fig. 5 first creates blunt ends to which a specific oligonucleotide adaptor is subsequently ligated. This is achieved by separating the fragmented nucleic acids followed by the treatment of the fragmented nucleic acids with an enzyme or an enzyme mix exhibiting a 5' → 3' polymerase activity and a 3' → 5' exonuclease activity.
- At fragments with 5' protruding ends, a reverse complementary second strand is synthesized using the 5' → 3' polymerase activity ("fill-in"); at fragments with 3' protruding ends, the protrusion is removed using the 3' → 5' exonuclease activity. After this treatment, all fragments have blunt ends.
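The fill-in and trimming logic can be sketched on a coordinate model of a duplex. This is an illustrative abstraction, not the patent's procedure: each strand is represented by the half-open interval it covers on a shared axis, and the names `Duplex` and `blunt` are hypothetical. In this model, filling in 5' overhangs and removing 3' overhangs means that the two 5' ends define the blunt product.

```python
from typing import NamedTuple


class Duplex(NamedTuple):
    """Strand extents on a shared axis (half-open intervals).

    The top strand runs 5'->3' from top_start to top_end. The bottom strand
    is antiparallel: its 5' end is at bottom_end, its 3' end at bottom_start.
    """
    top_start: int
    top_end: int
    bottom_start: int
    bottom_end: int


def blunt(d: Duplex) -> Duplex:
    """Fill in 5' overhangs and trim 3' overhangs at both ends."""
    if max(d.top_start, d.bottom_start) >= min(d.top_end, d.bottom_end):
        raise ValueError("strands must overlap to form a duplex")
    # Left end: a protruding top 5' end is kept and the bottom 3' end is
    # extended to it; a protruding bottom 3' end is trimmed back to it.
    left = d.top_start
    # Right end: a protruding bottom 5' end is kept and the top 3' end is
    # extended to it; a protruding top 3' end is trimmed back to it.
    right = d.bottom_end
    return Duplex(left, right, left, right)


filled = blunt(Duplex(10, 50, 20, 60))   # 5' overhangs at both ends
trimmed = blunt(Duplex(20, 60, 10, 50))  # 3' overhangs at both ends
```

In the first call both ends carry 5' overhangs, so the duplex grows to cover positions 10 to 60; in the second both ends carry 3' overhangs, so it shrinks to positions 20 to 50.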
- In a modification of this embodiment, one or more A nucleotides are added to the 3' end of the fragments ("A-tailing"). This A-tailing is achieved by either using an enzyme with A-tailing activity for the reaction above, or by an additional treatment with an enzyme exhibiting A-tailing activity.
- Next, a double-stranded oligonucleotide (adaptor) is ligated to the fragments (either through blunt end ligation or with a double stranded oligonucleotide containing a T overhang in case the fragments were treated with an enzyme with A-tailing activity). The double-stranded adaptor used for ligation contains one or two specific primer binding sequences. In a modification of this embodiment, the adapter might be partially single-stranded.
- In a variant of the invention, the nucleic acid library may be sequenced. For this purpose, the primer sequence/ these primer sequences added during adapter ligation can be used for subsequent sequencing of the nucleic acid library.
- Optionally, the sequence library can be amplified before sequencing. Through the design of the adaptor, specific parts of the nucleic acid fragments can be amplified. Fig. 6 depicts multiple different adaptor designs.
- In one embodiment (option 1), an adaptor with a single primer binding sequence (specific primer binding site 3; depicted with downward diagonal stripes) is ligated to the nucleic acid fragments.
- After ligation, the library fragments containing the 5' end of the original fragment can be specifically amplified using primers specific to primer binding sequence
- Library fragments containing the 3' end of the original fragment can be specifically amplified using primers specific to primer binding sequence
- The intermediate fragments will not amplify efficiently, as fragments carrying the same primer binding sequence (primer binding sequence 3) at both ends will form intramolecular hairpins, which prevent the binding of primers to the primer binding sites.
- In another embodiment (option 2), a Y-shaped adaptor with two different primer binding sequences (specific primer binding site 3, depicted with downward diagonal stripes, and specific primer binding site 4, depicted with vertical stripes) is ligated to the nucleic acid fragments.
- After ligation, the library fragments containing the 5' end of the original fragment can be specifically amplified using primers specific to primer binding sequence
- Library fragments containing the 3' end of the original fragment can be specifically amplified using primers specific to primer binding sequence 1 and 4.
- The intermediate fragments can be amplified using primers specific to primer binding sequences 3 and 4.
- In another preferred embodiment depicted in Fig. 7, the nucleic acid fragments are first denatured (e.g. using heat or by increasing the pH); thereby, the nucleic acid fragments become single-stranded.
- Next, a single-stranded oligonucleotide containing a specific primer binding site (adaptor 3 with primer sequence 3, depicted with downward diagonal stripes) is ligated to the 5' end of the single stranded nucleic acid fragments.
- In the embodiment depicted in Fig. 7, the oligonucleotide has a 5' adenylation modification at the 5' end (5' App). The ligation reaction is catalysed using the Thermostable 5' App DNA/RNA Ligase from New England Biolabs (Cat. No M0319, New England Biolabs, Ipswich, MA, USA) or an equivalent enzyme.
- The resulting nucleic acid library can either be sequenced directly or amplified using specific primer sets.
- By the choice of the amplification primers, it is possible to amplify a subset of the nucleic acid library: primer sequence 1 (depicted with upward diagonal stripes) and primer sequence 3 (downward diagonal stripes) for the amplification of the fragments containing the 3' end of the original cDNA, and primer sequence 2 (solid) and primer sequence 3 for the amplification of fragments containing the 5' end, respectively.
- In a third preferred embodiment depicted in Fig. 8, the nucleic acid fragments are first denatured (e.g. using heat or by increasing the pH); thereby, the nucleic acid fragments become single-stranded.
- Next, the single-stranded nucleic acid fragments are incubated with terminal transferase and a single type of nucleotide, thereby creating a mononucleotide tail at the 3' end of the nucleic acid fragments.
- In the embodiment shown in Fig. 8, the nucleotide is ATP, resulting in a poly-A tail at the 3' end of the nucleic acid fragments.
- In the next step, the fragments containing the 5' end of the original fragment can be amplified by a specific primer with a poly-T stretch at the 3' end of the primer (which binds to the poly-A tail of the library) and a primer specific for sequence 2 [depicted in solid]; the fragments containing the 3' end of the original fragment can be amplified using the same poly-T stretch containing primer and a primer specific for sequence 1 [upward diagonal stripes].
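The tailing and priming idea can be illustrated on plain sequence strings. The sketch below is a simplification under stated assumptions: tails are modelled as runs of the letter A, annealing is reduced to exact complementarity over a fixed window, and the fragment sequence, the 5' primer portion and all function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def add_poly_a_tail(fragment: str, tail_length: int = 15) -> str:
    """Append a homopolymer A tail to the 3' end of a single strand."""
    return fragment + "A" * tail_length


def primer_anneals_at_3prime_end(template: str, primer: str,
                                 min_match: int = 10) -> bool:
    """True if the primer's 3' end is complementary to the template's 3' end.

    Only exact complementarity over `min_match` bases is checked; real
    annealing depends on temperature and buffer conditions.
    """
    site = reverse_complement(template[-min_match:])
    return primer.endswith(site)


fragment = "GATTACAGGCCGTAC"           # arbitrary example sequence
primer = "GGCCAAGGTTCC" + "T" * 12     # arbitrary 5' portion plus poly-T stretch
tailed = add_poly_a_tail(fragment)

binds_tailed = primer_anneals_at_3prime_end(tailed, primer)
binds_untailed = primer_anneals_at_3prime_end(fragment, primer)
```

Only the tailed fragment offers a binding site for the poly-T stretch, which is why the tail can serve as a universal priming site regardless of the fragment's own sequence.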
- Melamede RJ, Hatahet Z, Kow YW, Ide H, Wallace SS. Isolation and characterization of endonuclease VIII from Escherichia coli. Biochemistry. 1994 Feb 8;33(5):1255-64. doi: 10.1021/bi00171a028. PMID: 8110759.
- Jiang D, Hatahet Z, Melamede RJ, Kow YW, Wallace SS. Characterization of Escherichia coli endonuclease VIII. J Biol Chem. 1997 Dec 19;272(51):32230-9. doi: 10.1074/jbc.272.51.32230. PMID: 9405426.
- Chenchik A, Zhu YY, Diatchenko L, Li R, Hill J, Siebert PD. Generation and use of high-quality cDNA from small amounts of total RNA by SMART PCR. In: Siebert P, Larrick J (eds). Gene Cloning and Analysis by RT-PCR. Biotechniques Books, Natick, MA; 1998. pp. 305-319.
- Silander K, Saarela J. Whole genome amplification with Phi29 DNA polymerase to enable genetic or genomic analysis of samples of low DNA yield. Methods Mol Biol. 2008;439:1-18. doi: 10.1007/978-1-59745-188-8_1. PMID: 18370092.
- We first assessed whether the fragment size can be adjusted by the ratio between dUTP and dTTP in a polymerase chain reaction. As a model system, we chose amplified cDNA generated with the template switching approach shown in Figure 2 (generated using the Chromium Next GEM Single Cell V(D)J Reagent Kits v1.1, 10x Genomics, Pleasanton, CA, USA).
- In order to statistically incorporate dUTP nucleotides, we re-amplified the cDNA for 10 cycles using the Q5U Hot Start High-Fidelity DNA Polymerase (Cat. No. M0493, New England Biolabs, Ipswich, MA, USA) according to the manufacturer's protocol. Four different relative amounts of dUTP were added to the reaction together with a no dUTP control (the percentage of dUTP refers to the fraction of dTTP replaced by dUTP in the reaction setup): condition 1: 20% dUTP, 80% dTTP; condition 2: 4% dUTP, 96% dTTP; condition 3: 0.8% dUTP, 99.2% dTTP; condition 4: 0.16% dUTP, 99.84% dTTP; condition 5: dTTP only.
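To relate these percentages to the T:U molar ratio range of 150:1 to 25:1 stated in the abstract, the conversion can be sketched as follows. The function names are hypothetical, and the conversion assumes that the stated percentage is the dUTP share of the combined dTTP plus dUTP pool, as the parenthetical above describes.

```python
from fractions import Fraction


def t_to_u_ratio(dutp_percent) -> Fraction:
    """Convert a dUTP percentage of the dTTP+dUTP pool into a T:U ratio."""
    u = Fraction(str(dutp_percent))
    if not 0 < u < 100:
        raise ValueError("dutp_percent must be between 0 and 100 (exclusive)")
    return (100 - u) / u


def within_stated_range(ratio, low=25, high=150) -> bool:
    """Check a T:U ratio against the 150:1 to 25:1 range from the abstract."""
    return low <= ratio <= high


# dUTP percentages of conditions 1 to 4 (condition 5 contains no dUTP).
conditions = {1: 20, 2: 4, 3: "0.8", 4: "0.16"}
ratios = {name: t_to_u_ratio(pct) for name, pct in conditions.items()}
in_range = {name: within_stated_range(r) for name, r in ratios.items()}
```

With this conversion the four conditions correspond to T:U ratios of 4:1, 24:1, 124:1 and 624:1, so only condition 3 falls inside the stated range, with condition 2 lying just outside it. Exact fractions are used to avoid floating-point rounding at the range boundaries.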
- After the amplification, an aliquot of the samples was treated with the Thermolabile USER II Enzyme (Cat. No. M5508, New England Biolabs, Ipswich, MA, USA) at 37°C for 15 minutes ("USER treatment"), followed by a heat inactivation step at 65°C for 10 minutes. Samples were purified using 0.8x SPRIselect beads (Cat. No. B23317, Beckman Coulter, Brea, CA, USA) and analyzed on an Agilent 4200 TapeStation System using D5000 or High Sensitivity D5000 Screen Tapes (Cat. No. 5067-5588 and 5067-5592, Agilent, Santa Clara, CA, USA).
- As seen in Figure 9, the presence of dUTP at different relative fractions did not impact the size distribution of the sample after amplification (left column); after USER treatment, the amplification products were fragmented, and the fragment size was inversely proportional to the relative fraction of dUTP (right column): the larger the relative fraction of dUTP, the smaller the statistical fragment size.
- We next assessed whether the fragment size after USER treatment is dependent on the number of molecules used as input for the amplification reaction.
- Two different input amounts were used for the initial amplification (0.5x template: 1 pg/µl; and 2x template: 4 pg/µl). Amplification and USER II treatment were conducted as described in example 1 with the exception that different relative amounts of dUTP were used: condition 1: 20% dUTP, 80% dTTP; condition 2: 10% dUTP, 90% dTTP; condition 3: 5% dUTP, 95% dTTP; condition 4: 2.5% dUTP, 97.5% dTTP.
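One way to reason about this question is a toy simulation in which every strand position becomes a cut site with a probability set only by the nucleotide mix. All of this is an assumption-laden sketch rather than a model from the description: dUTP is assumed to be incorporated without preference at a fixed share of T positions, every uracil is assumed to be cut, and the function names, strand length and molecule counts are hypothetical.

```python
import random
from typing import List


def fragment_lengths(length: int, cut_prob: float,
                     rng: random.Random) -> List[int]:
    """Cut one strand; each base ends a fragment with probability cut_prob."""
    lengths, current = [], 0
    for _ in range(length):
        current += 1
        if rng.random() < cut_prob:
            lengths.append(current)
            current = 0
    if current:
        lengths.append(current)  # remainder after the last cut site
    return lengths


def mean_fragment_length(n_molecules: int, length: int, dutp_fraction: float,
                         t_fraction: float = 0.25, seed: int = 0) -> float:
    """Mean single-strand fragment length over n_molecules simulated strands."""
    rng = random.Random(seed)
    cut_prob = dutp_fraction * t_fraction
    all_lengths: List[int] = []
    for _ in range(n_molecules):
        all_lengths.extend(fragment_lengths(length, cut_prob, rng))
    return sum(all_lengths) / len(all_lengths)


few_molecules = mean_fragment_length(50, 2000, dutp_fraction=0.20)
many_molecules = mean_fragment_length(200, 2000, dutp_fraction=0.20)
less_dutp = mean_fragment_length(200, 2000, dutp_fraction=0.05)
```

In this model the mean fragment length is governed by the per-base cut probability, so changing the number of simulated molecules shifts it only by sampling noise, whereas lowering the dUTP fraction lengthens the fragments.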
- Figure 10 shows the results for the two different template concentrations after USER treatment: for all dUTP concentrations, the fragment distribution was very similar regardless of the template concentration. This shows that the proposed method has the distinctive feature that the statistical size of the nucleic acid fragments does not depend on the input amount. Instead, the fragment size can be fine-tuned by adjusting the relative abundance of dUTP in an amplification reaction. This property facilitates workflows that do not depend on accurate quantification of the starting material or intermediate products.
- In this example, we re-amplified the same template used in examples 1 and 2 in 25 µl reactions (template concentration: 1 pg/µl). Amplification and USER treatment were conducted as described in example 1, and the relative amounts of dUTP were identical to example 2: condition 1: 20% dUTP, 80% dTTP; condition 2: 10% dUTP, 90% dTTP; condition 3: 5% dUTP, 95% dTTP; condition 4: 2.5% dUTP, 97.5% dTTP.
- 10 µl out of the 25 + 1 µl were subjected to an end repair and A-tailing reaction using the NEBNext® Ultra™ II End Repair/dA-Tailing Module (Cat. No. E7546, New England Biolabs, Ipswich, MA, USA) following the manufacturer's instructions (at half scale), followed by the ligation of the 10x Genomics Adaptor Mix (PN 220026, 10x Genomics, Pleasanton, CA, USA) using the NEBNext® Ultra™ II Ligation Module (Cat. No. E7595, New England Biolabs, Ipswich, MA, USA; also at half scale).
- The remaining 16 µl of the amplification/Thermolabile USER II reaction were purified using 1x SPRIselect beads (Cat. No. B23317, Beckman Coulter, Brea, CA, USA). The samples were eluted in 25 µl elution buffer and subjected to the same procedure of end repair, A-tailing and adaptor ligation as described for the non-purified counterpart.
- Two µl of each pair of samples were subjected to 10 cycles of sample index PCR using the reagents taken from the Chromium Single Cell 5' Library Construction Kit (PN-1000020, 10x Genomics, Pleasanton, CA, USA). Additionally, 10µl of the ligation product derived from the sample already purified after the amplification/ USER II step was also purified using 1x SPRIselect beads (Cat. No. B23317, Beckman Coulter, Brea, CA, USA) and subjected to the same sample index PCR protocol.
- All samples were finally purified using 0.8x SPRIselect beads (Cat. No. B23317, Beckman Coulter, Brea, CA, USA).
- The result summarized in Figure 11 shows that libraries can be generated from nucleic acid fragments produced with the proposed method, and that the library size is inversely proportional to the relative fraction of dUTP in the initial PCR reaction. The following three workflow variants gave rise to libraries of similar size distribution:
- No purification after amplification/USER II treatment, no purification after ligation
- Purification after amplification/USER II treatment, no purification after ligation
- Purification after amplification/USER II treatment, purification after ligation
- This observation is of great importance, as it exemplifies an additional unexpected advantage of the proposed method: during the different workflow steps, few undesired artifacts are generated that compete with the amplification of the final library (sample index PCR), so the majority of amplicons generated is specific. Because of this, only a single purification (cleanup) is required to deplete fragments that are too small.
- In contrast, methods of the art like the Chromium Single Cell 5' Library Construction Kit (PN-1000020, 10x Genomics, Pleasanton, CA, USA) require a total of three cleanup steps and one size selection step, which are time consuming and lead to challenges when automating a next generation sequencing workflow.
Claims (14)
- A method for obtaining a nucleic acid library of a sample comprising polynucleotides comprising the steps
a. multiplying the polynucleotides by a polymerase
b. fragmenting the multiplied polynucleotides by creating nicks
c. coupling an oligonucleotide sequence to the nicks to create the target library
characterized in that
step a) is performed by providing A, T, G, C and U nucleotides wherein the molar ratio of T and U is between 150:1 and 25:1 and step b) is performed by excision of the U nucleotides.
- Method according to claim 1 characterized in that the A, T, G, C and U nucleotides are provided as Adenosine 5'-Triphosphate (ATP), 2'-Deoxyadenosine 5'-Triphosphate (dATP), Thymidine 5'-Triphosphate (TTP), 2'-Deoxythymidine 5'-Triphosphate (dTTP), Guanosine 5'-Triphosphate (GTP), 2'-Deoxyguanosine 5'-Triphosphate (dGTP), Cytidine 5'-Triphosphate (CTP) and 2'-Deoxycytidine 5'-Triphosphate (dCTP), 2'-Deoxyuridine 5'-Triphosphate (dUTP) or Uridine-5'-triphosphate (UTP) or a derivative thereof.
- Method according to claim 1 or 2 characterized in that after step b), the nicks are provided with a polymerase exhibiting 5' → 3' exonuclease activity, thereby filling in the 3' recessing ends and removing the 5' overhangs of the nicks.
- Method according to any of the claims 1 to 3 characterized in that step c) is performed by providing a ligase.
- Method according to any of the claims 1 to 4 characterized in that after step b), the nicks are denatured into single strand nicks and the single strand nicks are provided with a ligase which couples oligonucleotide sequences to the 3' end of the single strand nicks.
- Method according to any of the claims 1 to 4 characterized in that after step b), the nicks are denatured into single strand nicks and the single strand nicks are provided with a terminal transferase which couples homonucleotides comprising 2 to 20 nucleic acids as oligonucleotide sequences to the 3' end of the single strand nicks.
- Method according to any of the claims 1 to 6 characterized in that the oligonucleotide sequences coupled to the nicks are primer sequences.
- Method according to any of the claims 1 to 7 characterized in that the polynucleotides are derived from synthetic or genomic DNA or RNA or a plurality of DNA or RNA molecules comprising 50 to 2000 nucleotides.
- Method according to any of the claims 1 to 8 characterized in that before step a, the polynucleotides are provided at the 3' and 5' ends with primer sequences for amplification.
- Method according to claim 8 characterized in that the primer sequences are same or different than the oligonucleotide sequences.
- Method according to any of the claims 1 to 10 characterized in that after step c), the target library is amplified.
- Method according to any of the claims 1 to 11 characterized in that multiplying the polynucleotides in step a) is conducted by polymerase chain reaction.
- Method according to any of the claims 1 to 12 characterized in that nicks are generated by a providing one or more enzymes selected from the group consisting of DNA glycosylases, endonucleases, engineered recombinant proteins and thermolabile USER II enzyme.
- Method according to any of the claims 1 to 13 characterized in that the nucleic acid library is sequenced.
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP21154220.4A EP4036248A1 (en) | 2021-01-29 | 2021-01-29 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
US18/274,541 US20240076803A1 (en) | 2021-01-29 | 2022-01-28 | Method for Library Preparation in Next Generation Sequencing by Enzymatic DNA Fragmentation |
EP22703586.2A EP4284943A1 (en) | 2021-01-29 | 2022-01-28 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
PCT/EP2022/051979 WO2022162109A1 (en) | 2021-01-29 | 2022-01-28 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP21154220.4A EP4036248A1 (en) | 2021-01-29 | 2021-01-29 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
Publications (1)
Publication Number | Publication Date |
---|---|
EP4036248A1 (en) | 2022-08-03 |
Family
ID=74418216
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP21154220.4A Withdrawn EP4036248A1 (en) | 2021-01-29 | 2021-01-29 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
EP22703586.2A Pending EP4284943A1 (en) | 2021-01-29 | 2022-01-28 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP22703586.2A Pending EP4284943A1 (en) | 2021-01-29 | 2022-01-28 | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240076803A1 (en) |
EP (2) | EP4036248A1 (en) |
WO (1) | WO2022162109A1 (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010148039A2 (en) * | 2009-06-15 | 2010-12-23 | Complete Genomics, Inc. | Methods and compositions for long fragment read sequencing |
WO2011019964A1 (en) * | 2009-08-12 | 2011-02-17 | Nugen Technologies, Inc. | Methods, compositions, and kits for generating nucleic acid products substantially free of template nucleic acid |
WO2015200541A1 (en) * | 2014-06-24 | 2015-12-30 | Bio-Rad Laboratories, Inc. | Digital pcr barcoding |
WO2016114970A1 (en) * | 2015-01-12 | 2016-07-21 | 10X Genomics, Inc. | Processes and systems for preparing nucleic acid sequencing libraries and libraries prepared using same |
CN108486100A (en) * | 2018-03-22 | 2018-09-04 | 苏州泰康吉安仪器科技有限公司 | A kind of controllable fragmentation methods of DNA length and its application in building library |
2021
- 2021-01-29 EP EP21154220.4A patent/EP4036248A1/en not_active Withdrawn
2022
- 2022-01-28 EP EP22703586.2A patent/EP4284943A1/en active Pending
- 2022-01-28 US US18/274,541 patent/US20240076803A1/en active Pending
- 2022-01-28 WO PCT/EP2022/051979 patent/WO2022162109A1/en active Application Filing
Non-Patent Citations (9)
Title |
---|
CHENCHIK, A., ZHU, Y.Y., DIATCHENKO, L., LI, R., HILL, J., SIEBERT, P.D.: "Gene Cloning and Analysis by RT-PCR. Biotechniques Books, Natick, MA", 1998, article "Generation and use of high-quality cDNA from small amounts of total RNA by SMART PCR", pages: 305 - 319 |
EPICENTRE: "Terminal Deoxynucleotidyl Transferase, Recombinant", EPILIT328 REV. A., 1 December 2012 (2012-12-01), XP055408867, Retrieved from the Internet <URL:https://www.promega.de/-/media/files/resources/protocols/product-information-sheets/g/terminal-deoxynucleotidyl-transferase-recombinant-protocol.pdf?la=en> [retrieved on 20170921] * |
HEAD SR, KOMORI HK, LAMERE SA ET AL.: "Library construction for next-generation sequencing: overviews and challenges", BIOTECHNIQUES, vol. 56, no. 2, 1 February 2014 (2014-02-01), pages 61, XP055544232, DOI: 10.2144/000114133 |
HESS JF, KOHL TA, KOTROVA M, RONSCH K, PAPROTKA T, MOHR V, HUTZENLAUB T, BRÜGGEMANN M, ZENGERLE R, NIEMANN S: "Library preparation for next generation sequencing: A review of automation strategies", BIOTECHNOL ADV, vol. 41, 19 March 2020 (2020-03-19), pages 107537, XP086193204, DOI: 10.1016/j.biotechadv.2020.107537 |
JIANG D, HATAHET Z, MELAMEDE RJ, KOW YW, WALLACE SS: "Characterization of Escherichia coli endonuclease VIII", J BIOL CHEM, vol. 272, no. 51, 19 December 1997 (1997-12-19), pages 32230 - 9, XP055204534, DOI: 10.1074/jbc.272.51.32230 |
MELAMEDE RJ, HATAHET Z, KOW YW, IDE H, WALLACE SS: "Isolation and characterization of endonuclease VIII from Escherichia coli", BIOCHEMISTRY, vol. 33, no. 5, 8 February 1994 (1994-02-08), pages 1255 - 64, XP008131904, DOI: 10.1021/bi00171a028 |
MORGANE BOONE ET AL: "Capturing the 'ome': the expanding molecular toolbox for RNA and DNA library construction", NUCLEIC ACIDS RESEARCH, vol. 46, no. 6, 5 March 2018 (2018-03-05), GB, pages 2701 - 2721, XP055681576, ISSN: 0305-1048, DOI: 10.1093/nar/gky167 * |
SILANDER K, SAARELA J: "Whole genome amplification with Phi29 DNA polymerase to enable genetic or genomic analysis of samples of low DNA yield", METHODS MOL BIOL, vol. 439, 2008, pages 1 - 18, XP008098919, DOI: 10.1007/978-1-59745-188-8_1 |
TURCHINOVICH A ET AL: "Capture and Amplification by Tailing and Switching (CATS). An ultrasensitive ligation-independent method for generation of DNA libraries for deep sequencing from picogram amounts of DNA and RNA", RNA BIOLOGY, vol. 11, no. 7, 1 July 2014 (2014-07-01), pages 817 - 828, XP002742135, ISSN: 1547-6286, [retrieved on 20140612], DOI: 10.4161/RNA.29304 * |
Also Published As
Publication number | Publication date |
---|---|
WO2022162109A1 (en) | 2022-08-04 |
EP4284943A1 (en) | 2023-12-06 |
US20240076803A1 (en) | 2024-03-07 |
Similar Documents
Publication | Title |
---|---|
US20210222236A1 (en) | Template Switch-Based Methods for Producing a Product Nucleic Acid |
US10876108B2 (en) | Compositions and methods for targeted nucleic acid sequence enrichment and high efficiency library generation |
US10301660B2 (en) | Methods and compositions for repair of DNA ends by multiple enzymatic activities |
EP3842544A1 (en) | Preparation of adapter-ligated amplicons |
WO2016135300A1 (en) | Efficiency improving methods for gene library generation |
JP2022518917A (en) | Nucleic acid detection method and primer design method |
US20240150753A1 (en) | Methods of isothermal complementary dna and library preparation |
EP3198064B1 (en) | Methods for sample preparation |
EP4036248A1 (en) | Method for library preparation in next generation sequencing by enzymatic dna fragmentation |
US20200208299A1 (en) | Rapid library construction for high throughput sequencing |
EP4279590A1 (en) | Method for generation of a nucleic acid library |
WO2018009677A1 (en) | Fast target enrichment by multiplexed relay pcr with modified bubble primers |
WO2024073034A1 (en) | Simplified sequencing library preparation for dna |
JP2024502293A (en) | Sequencing of non-denaturing inserts and identifiers |
JP2024538122A (en) | Methods for generating DNA libraries and uses thereof |
CN112074612A (en) | Nucleic acid amplification method with higher specificity |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under Article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED |
AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
18D | Application deemed to be withdrawn | Effective date: 20230204 |