[go: up one dir, main page]

EP4573207A1 - Detection of epigenetic modifications - Google Patents

Detection of epigenetic modifications

Info

Publication number
EP4573207A1
EP4573207A1 EP23757600.4A EP23757600A EP4573207A1 EP 4573207 A1 EP4573207 A1 EP 4573207A1 EP 23757600 A EP23757600 A EP 23757600A EP 4573207 A1 EP4573207 A1 EP 4573207A1
Authority
EP
European Patent Office
Prior art keywords
group
sequencing
5hmc
reactive group
nucleic acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP23757600.4A
Other languages
German (de)
French (fr)
Inventor
Frank Bergmann
Dieter Heindl
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
F Hoffmann La Roche AG
Roche Diagnostics GmbH
Original Assignee
F Hoffmann La Roche AG
Roche Diagnostics GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by F Hoffmann La Roche AG, Roche Diagnostics GmbH filed Critical F Hoffmann La Roche AG
Publication of EP4573207A1 publication Critical patent/EP4573207A1/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay

Definitions

  • the invention relates to the field mapping the epigenetic modifications of DNA and RNA, which has become increasingly important, because these modifications play a role in several biological processes and diseases including development, aging and last but not least, cancer.
  • C cytosine
  • 5mC 5-methylcytosine
  • oxidation can also be performed by chemical means using e.g. potassium perruthenate (KRuCh).
  • KRuCh potassium perruthenate
  • WO 2022/096751 also disclosed a method for generating a dihydrothymine (DHT) or a dihydrouracil (DHU) residue from a nucleoside or a polynucleotide containing 5-methylcytosine (5mC) or 5-carboxyl-cytosine (5caC) by using a radical initiator which may be used together with a nucleophilic compound.
  • DHT dihydrothymine
  • DHU dihydrouracil
  • the present invention is based on the basic idea that the introduction of a nucleophile at 5fC, 5hmC or caC and the subsequent intramolecular addition at C- 6 of a 5-substitued cytosine is enabling a deamination at C-4 of a cytosine derivative.
  • the present invention is directed to a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
  • step d) The newly formed cyclic structures in step d) are 5-7 membered rings
  • the method further comprises step f), i.e. reversing the ring formation of step d) thereby obtaining a 5-substituted Uracil.
  • the method may also comprise a step of converting 5mC to 5fC, 5hmC or 5caC prior to step a).
  • the present invention is facilitating the methylation status analysis of naturally occurring DNA.
  • said nucleic acid comprises at least one 5fC residue.
  • the first reactive group may be a C-H acidic group, an amine or a phosphorus ylide.
  • said nucleic acid comprises at least one 5hmC residue. Then, the first reactive group may constitute a Glycosyl Transferase substrate.
  • said nucleic acid comprises a 5caC residue. Then, the first reactive group may be an amine.
  • the said second reactive group may be a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
  • All methods as disclosed above may further comprise a subsequent step of amplifying said nucleic acid, which may be preferably executed by means of PCR amplification.
  • all methods disclosed above may further comprise the step of a subsequent step of sequencing said nucleic acid either with or without a prior amplification step, which is preferably a PCR amplification.
  • Fig. 1 provides a scheme for detection of methylated sequences according to the present invention.
  • Fig. 2 shows a reaction scheme for modification of 5fC by means of a Knoevenagel condensation using a reactant with a C-H acidic group
  • Fig. 3 shows a reaction scheme for modification of 5fC by means of an aldol condensation reaction with a CH-acidic reagent comprising a sulfinate moiety
  • Fig. 4 shows a reaction scheme for modification of 5fC by means of a Wittig reaction.
  • Fig. 5 shows a specific example for a Wittig reaction according to Fig. 4 with a sulfinate modified Wittig reagent.
  • Fig: 7 shows a specific example for a reductive amination as in Fig. 6, where 5fC is reacted with 2-aminomethyl sulfinate
  • Fig. 8 shows a reaction scheme for the modification of hmC using UDP-glucose substituted at the 2-position of the glucose moiety with a nucleophile as a reagent and executing a subsequent beta-glucosyl-transferase reaction.
  • Fig. 9 shows a similar reaction scheme as example 8, wherein 2-thio-glucose-UDP is used.
  • Fig. 10 shows an example for modification of the carboxyl group of caC.
  • a nucleophilic group is a functional group that is attractive to electron-poor or positively charged centers and donates electrons, in particular donating an electron pair to an electrophile or an electrophilic center to form a covalent bond.
  • Nucleophilic groups are for example alcohols, alcoholates, thiols, thiolates, amines, sulfinates or carbanions.
  • deamination refers herein to a substitution reaction of an exocyclic amino group by a hydroxyl group, in particular substitution of the amino group at the C-4 position of a cytosine nucleobase to convert into a uracil nucleobase. Deamination at C-4 position is enhanced in 5,6-dihydrocytosines to give 5,6-dihydrouracil derivatives, as e.g. being applied after bisulfite addition to the C5- C6 double bond of cytosine.
  • C-H acidic group is used herein for a carbon atom within a compound which carries at least one electron-withdrawing group and thus being more easily deprotonated by a base.
  • Electron-withdrawing groups are for example carbonyl containing groups, a nitro group, a cyano group, a sulfoxide or sulfone group.
  • Wittig reagent or phosphorus ylide refers to neutral dipolar molecules containing a formally negatively charged carbon atom (a carbanion) directly attached to a phosphorus atom with a formal positive charge.
  • phosphorus ylides phosphonium ylides
  • Phosphorus ylides are thus 1,2- dipolar compounds and a subclass of zwitterions.
  • triphenylphosphonium ylides are reacted with aldehydes or ketones to give an alkene bond.
  • a Homer-Wadsworth-Emmons reaction stabilized phosphonate carbanions are used as reagent.
  • a Glycosyl Transferase Substrate refers to a reagent comprising an activation group, e.g. a UDP moiety, and a sugar residue, for example a glucose or substituted glucose.
  • Glycosyltransferases catalyze the transfer of the sugar moiety from an activated donor sugar (the glycosyl transferase substrate) onto a saccharide or non- saccharide acceptor.
  • the non-saccharide acceptor can be for example 5- hydroxymethyl-dC.
  • T4 Phage B-glucosyltransferase for example from New England Biolabs, specifically transfers a glucose moiety of uridine diphosphoglucose to the 5-hydroxymethylcytosine (5-hmC) residues in double-stranded DNA, making beta- glucosyl-5-hydroxymethylcytosine.
  • FIG. 1 A general scheme, how methylated C-residues mC are detected is schematically shown in Fig. 1.
  • the epigenetic dC modifications 5-methyl-dC (mC), 5-hydroxymethyl-dC (hmC), 5-formyl-dC (fC) and 5-carboxyl- dC (caC) are read as unmodified dC when amplified and/or sequenced.
  • oxidation of mC by enzymatic and/or chemical means results in oxidized mC modifications hmC, fC and caC (general: xC).
  • Oxidation or conversion of 5mC to 5fC can be achieved by using TET enzymes (ten- eleven translocation (TET) methylcytosine dioxygenases).
  • TET ten- eleven translocation
  • the same class of enzymes are suitable for converting 5mC to 5hmC.
  • 5hmC can be further oxidized to 5fC enzymatically by a laccase enzyme or chemically using either KRuO4 or Cu(II)/TEMPO. Further details are disclosed in Pfeifer et al. Epigenetics & Chromatin 2013, 6: 10, p.1-9.
  • a reagent of the invention comprising two reactive groups, the first reactive group forming a covalent bond with hmC, fC or caC and thus introducing the second reactive group which comprises a nucleophile.
  • the present also provides a method for analyzing the methylation status of a naturally occurring DNA comprising the steps of a) converting 5mC to 5fC, 5hmC or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to
  • the sample is derived from a subject or a patient.
  • the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy.
  • the sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples).
  • nucleic acids e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid,
  • the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated.
  • the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi.
  • the present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art (see, Sambrook et al., & quot; Molecular Cloning: A Laboratory Manual, & quot; 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N. Y.).
  • kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
  • the present invention involves detecting an epigenetic modification, specifically, an epigenetic cytosine modification in nucleic acids (including, but not limited to, cytosine methylation).
  • the nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein.
  • the same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC).
  • a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai, et al.
  • the method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described, e.g., in Shull, et al. “Sequencing the cancer methylome,” Methods Mol Biol. 1238:627-635 (2015).
  • the method of detecting epigenetic modifications includes sequencing.
  • the nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction.
  • the use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368.
  • a unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR).
  • the unique molecular barcode has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, et al., “An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage,” Nature Medicine doi: 10.1038/nm.3519 (2014)).
  • Nanopore sequencing is a unique, scalable technology that enables direct, real-time analysis of long DNA or RNA fragments. It works by monitoring changes to an electrical current as nucleic acids are passed through a protein nanopore. The resulting signal is decoded to provide the specific DNA or RNA sequence. Nanopore-based sequencing technology detects the unique electrical signals of different molecules as they pass through the nanopore with a semiconductor-based electronic detection system. This technology makes for a high throughput, cost effective sequencing solution.
  • the biological nanopore a protein pore embedded in a membrane, while the brains of the technology lie in the electronics of a semiconductor integrated circuit and proprietary chemistries.
  • unique molecular barcodes are used for sequencing error correction.
  • the entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family.
  • a variation in the sequence not shared by all members of the barcoded family is discarded as an artefact.
  • Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, et al. “Integrated digital error suppression for improved detection of circulating tumor DNA,” Nature Biotechnology 34:547 (2016)).
  • the method involves forming a library comprising nucleic acids from a sample.
  • the library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR.
  • a library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library.
  • the library is the input nucleic acid in which methylation is detected by the method described herein.
  • the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
  • Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Patent Publication Nos. US2013/0244340, US2013/0264207, US2014/0134616, US2015/0119259 and US2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears, et al., Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman, et al., Methods Mol.
  • sequencing-by- hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by- synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.).
  • Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology, Roche Sequencing Solutions, Santa Clara, Cal.), Oxford Nanopore Technologies (Oxford, UK), and sequencing by expansion (Stratos Genomics, Roche Sequencing Solutions).
  • the sequencing step involves sequence aligning.
  • a consensus sequence is determined from a plurality of sequences all having an identical UID.
  • the sequences having an identical UID are presumed to derive from the same original molecule through amplification.
  • UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
  • the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID).
  • MID multiplex sample ID
  • Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID.
  • a person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence.
  • the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result.
  • the desired depth is 5-50 reads per UID. Modification of 5fC
  • Modification of 5fC according to the present invention can be achieved if the first reactive group of the reagent according to the present invention is a C-H acidic group, an amine or a phosphorus ylide.
  • the reaction can be executed as a Knoevenagel condensation, which is schematically depicted in Fig. 2.
  • the CH-acidic reagent comprising a nucleophile group is added via Knoevenagel conditions to fC, then subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at C4 to obtain a “T-equivalent”.
  • Yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by a photoisomerization of the double bond and/or by applying a radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions.
  • an aldol condensation reaction of 5fC with a CH-acidic reagent comprising a sulfinate moiety e.g. 1 -(methyl sulfonyl)methane sulfinate
  • a subsequent addition of a sulfinate moiety at the C-6 position of the substituted cytosine followed by deamination and desulfmation (5fC to T conversion) can be executed (Fig. 3)
  • a desulfmation step under alkaline conditions may even not be required.
  • a thiol moiety can also be applied as nucleophile
  • addition reaction to the C5-C6 double bond can be achieved by means of applying thiol-ene chemistry using a radical initiator or a catalyst.
  • the yields may be increased by photoisomerization of the newly formed double bond.
  • Modification of 5fC according to the present invention can also be done by means of a Wittig reaction (Fig. 4).
  • the reagent according to the present invention is a phosphorus ylide comprising a nucleophilic group.
  • the reagent is added via Wittig conditions to 5fC, then a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling a deamination at the C4 position in order to obtain a “T-equivalent”.
  • Yields of intramolecular addition of the nucleophile to the C5-C6 double bond may be increased by photoisomerization of the newly formed double bond, and/or by applying radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions. Alternatively, the reaction can be performed using Horner-Wadsworth-Emmons reaction conditions, in particular using a sulfinate group as nucleophile.
  • reagent of the invention comprising an amine and a further nucleophilic group is added via reductive amination to 5fC. Subsequently there is an intramolecular addition of the nucleophile to the C5-C6 double bond, which enables the deamination at the C4 position to obtain a “T- equivalent”. Reversal of ring closure can be achieved under alkaline conditions, if necessary. Yields can be increased by applying radical initiation or a catalyst.
  • a more detailed example for implementing a reductive amination reaction is illustrated in Fig. 7.
  • 5fC is reacted with 2-aminomethyl sulfinate, and subsequently addition of the sulfinate moiety at C-6 position of substituted cytosine takes place.
  • This is followed by a deamination and desulfination step, which converts 5fC to T.
  • a desulfination step under alkaline conditions may not be needed.
  • the sulfinate moiety may be replaced by a thiol moiety using thiol-ene chemistry for the addition reaction to double bond C5-C6, which optionally includes the use of a radical initiator or a catalyst.
  • Fig. 8 schematically shows a reaction scheme for the modification of hmC.
  • the reagent used according to the present invention is a UDP -glucose, which is modified at the 2-position with a nucleophilic group. It is added via a B-glucosyl transferase enzyme to hmC. Then, a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”.
  • yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by applying radical initiation or a catalyst and a reversal of the ring closure can be achieved under alkaline conditions, if necessary.
  • Fig. 9 An alternative example is shown in Fig. 9.
  • 5hmC is reacted with 2-thio- glucose-UDP and B-glucosyltransferase.
  • an addition of the thiol moiety at the C6 position of substituted cytosine applying thiol-ene chemistry and optionally using a radical initiator or a catalyst is taking place, which is followed by a deamination reaction, resulting in a 5hmC to T conversion.
  • 2- Sulfinato-glucose-UDP can also be applied as substrate for B-glucosyltransferase.
  • the method of the present invention uses a reagent comprising an amine and a further nucleophilic group. It is added after an EDC/NHS activation of the carboxyl group of caC. This results in caC forming an amide bond, and a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”.
  • the yield of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by means of applying radical initiation or a catalyst and reversal of the ring closure can be achieved under alkaline conditions.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Immunology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

The present invention provides a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC, b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6-di-hydro Cytosine entity, and e) deaminating said 5,6-di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.

Description

Detection of epigenetic modifications
Field of the invention
The invention relates to the field mapping the epigenetic modifications of DNA and RNA, which has become increasingly important, because these modifications play a role in several biological processes and diseases including development, aging and last but not least, cancer.
Technical background
Most relevant to identify epigenetic modifications is detection by sequencing based methods. Differentiation between cytosine (C) and 5-methylcytosine (5mC) is described by direct sequencing (without preamplification) of DNA through a nanopore (e.g. by Pacific Bioscience Single Molecule, Real-Time (SMRT) Sequencing technology) and thereby reading out the different kinetics of nucleotide incorporation opposite C vs. mC by polymerase.
Other techniques are using methods to convert C or mC into a T(U) equivalent and subsequent amplification to allow identification when comparing untreated and converted sample DNA (see review: L. Zhao et al., Protein Cell 2020, 11, 792- 808).
Most important methods are bisulfite sequencing (conversion of C to U by bisulfite treatment), NEB’s EM-seq method (oxidation of mC by TET2 enzyme and glucosylation by B-glucosyltransferase (to block enzymatic deaminase reaction) and convert C to U by APOBEC deaminase), TAPS method (TET-assisted pyridine borane sequencing) applying TET enzyme oxidation of mC and subsequent reduction of oxidized mC species by pyridine borane to obtain dihydrocytosine nucleoside which is easily deaminated to give dihydrouracil (T equivalent) and the CLEVER method which is based on oxidation of mC by TET enzyme and subsequent reaction of 5-formyl-C with malononitrile to give an adduct which acts predominantly as a T equivalent in subsequent PCR amplification (C. Zhu et al., Cell Stem Cell 2017, 20, 720-731). Instead of enzymatic oxidation of mC by TET enzymes oxidation can also be performed by chemical means using e.g. potassium perruthenate (KRuCh).
There are also methods to differentiate the modifications 5-hydroxymethyl-dC (5hmC), 5-formyl (5fC) or 5-carboxyl-C (5caC) by partly modifying the methods described above.
Recently, WO 2022/096751 also disclosed a method for generating a dihydrothymine (DHT) or a dihydrouracil (DHU) residue from a nucleoside or a polynucleotide containing 5-methylcytosine (5mC) or 5-carboxyl-cytosine (5caC) by using a radical initiator which may be used together with a nucleophilic compound.
However, all the methods disclosed have some shortcomings. Bisulfite sequencing encompasses harsh conditions which degrades the majority of input DNA, only indirect detection of methylated cytosines is possible. Likewise, Enzymatic Methyl Sequencing (EM-Seq) via enzymatic deamination which is inefficient because the majority of cytosines which are non-methylated have to be converted. TAPS requires a toxic reagent needed for reduction such as pyridine borane, and the dihydrouridine being labile under slightly acidic conditions cannot easily be converted back to the more stable Uridine derivative. For the CLEVER method, the conversion yield of the condensation reaction of 5fC and malononitrile is far below 100%, because the enzymatic incorporation of nucleotides by polymerase when the condensation product of 5fC with malononitrile acts as substrate is not highly accurate, and incorporation of dA is only favoured, but not exclusive (see also: F. Galardi et al., Biomolecules 2020, 10, 1677). The method of WO 2022/096751 is not entirely specific for caC but also converts unmodified C to some extent. Another disadvantage is that di-hydro Uridine derivative cannot easily be converted back to the more stable Uridine derivative. Summary of the invention:
The present invention is based on the basic idea that the introduction of a nucleophile at 5fC, 5hmC or caC and the subsequent intramolecular addition at C- 6 of a 5-substitued cytosine is enabling a deamination at C-4 of a cytosine derivative.
Thus, in a first aspect, the present invention is directed to a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
The newly formed cyclic structures in step d) are 5-7 membered rings
In one embodiment, the method further comprises step f), i.e. reversing the ring formation of step d) thereby obtaining a 5-substituted Uracil.
Since the naturally occurring methylation of DNA is 5mC, the method may also comprise a step of converting 5mC to 5fC, 5hmC or 5caC prior to step a). Thus, the present invention is facilitating the methylation status analysis of naturally occurring DNA.
In a specific embodiment, said nucleic acid comprises at least one 5fC residue. In this case, the first reactive group may be a C-H acidic group, an amine or a phosphorus ylide. In another specific embodiment, said nucleic acid comprises at least one 5hmC residue. Then, the first reactive group may constitute a Glycosyl Transferase substrate. In still another embodiment, said nucleic acid comprises a 5caC residue. Then, the first reactive group may be an amine.
In all cases, the said second reactive group may be a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
All methods as disclosed above may further comprise a subsequent step of amplifying said nucleic acid, which may be preferably executed by means of PCR amplification. As well, all methods disclosed above may further comprise the step of a subsequent step of sequencing said nucleic acid either with or without a prior amplification step, which is preferably a PCR amplification.
Brief description of Figures
Fig. 1 provides a scheme for detection of methylated sequences according to the present invention.
Fig. 2 shows a reaction scheme for modification of 5fC by means of a Knoevenagel condensation using a reactant with a C-H acidic group
Fig. 3 shows a reaction scheme for modification of 5fC by means of an aldol condensation reaction with a CH-acidic reagent comprising a sulfinate moiety
Fig. 4 shows a reaction scheme for modification of 5fC by means of a Wittig reaction.
Fig. 5 shows a specific example for a Wittig reaction according to Fig. 4 with a sulfinate modified Wittig reagent.
Fig. 6 shows a reaction scheme for modification of 5fC by means of reductive amination
Fig: 7 shows a specific example for a reductive amination as in Fig. 6, where 5fC is reacted with 2-aminomethyl sulfinate Fig. 8 shows a reaction scheme for the modification of hmC using UDP-glucose substituted at the 2-position of the glucose moiety with a nucleophile as a reagent and executing a subsequent beta-glucosyl-transferase reaction.
Fig. 9 shows a similar reaction scheme as example 8, wherein 2-thio-glucose-UDP is used.
Fig. 10 shows an example for modification of the carboxyl group of caC.
Detailed description of the invention
Abbreviations
C - cytosine or cytidine
T - thymine or thymidine
U - uracil or uridine
DHU - dihydrouracil or dihydrouridine
5mC - 5-methylcytosine or 5-methylcytidine
5hmC - 5 -hydroxymethyl cytosine or 5 -hydroxymethyl cytidine
5fC - 5-formylcytosine or 5-formylcytidine
5caC - 5-carboxylcytosine or 5-carboxylcytidine dC - 2’ -deoxy cytidine dU - 2’ -deoxyuridine
TET - ten-eleven translocation dioxygenase
TAPS - TET-assisted pyridine-borane sequencing
CAPS - chemically-assisted pyridine-borane sequencing Definitions
A nucleophilic group is a functional group that is attractive to electron-poor or positively charged centers and donates electrons, in particular donating an electron pair to an electrophile or an electrophilic center to form a covalent bond. Nucleophilic groups are for example alcohols, alcoholates, thiols, thiolates, amines, sulfinates or carbanions. The term deamination refers herein to a substitution reaction of an exocyclic amino group by a hydroxyl group, in particular substitution of the amino group at the C-4 position of a cytosine nucleobase to convert into a uracil nucleobase. Deamination at C-4 position is enhanced in 5,6-dihydrocytosines to give 5,6-dihydrouracil derivatives, as e.g. being applied after bisulfite addition to the C5- C6 double bond of cytosine.
The term C-H acidic group is used herein for a carbon atom within a compound which carries at least one electron-withdrawing group and thus being more easily deprotonated by a base. Electron-withdrawing groups are for example carbonyl containing groups, a nitro group, a cyano group, a sulfoxide or sulfone group.
The term Wittig reagent or phosphorus ylide, as used herein, refers to neutral dipolar molecules containing a formally negatively charged carbon atom (a carbanion) directly attached to a phosphorus atom with a formal positive charge. In phosphorus ylides (phosphonium ylides) the two adjacent carbon and phosphorus atoms are connected by both a covalent and an ionic bond. Phosphorus ylides are thus 1,2- dipolar compounds and a subclass of zwitterions. In a Wittig reaction triphenylphosphonium ylides are reacted with aldehydes or ketones to give an alkene bond. In a Homer-Wadsworth-Emmons reaction stabilized phosphonate carbanions are used as reagent.
A Glycosyl Transferase Substrate, as used herein, refers to a reagent comprising an activation group, e.g. a UDP moiety, and a sugar residue, for example a glucose or substituted glucose. Glycosyltransferases catalyze the transfer of the sugar moiety from an activated donor sugar (the glycosyl transferase substrate) onto a saccharide or non- saccharide acceptor. The non-saccharide acceptor can be for example 5- hydroxymethyl-dC. T4 Phage B-glucosyltransferase, for example from New England Biolabs, specifically transfers a glucose moiety of uridine diphosphoglucose to the 5-hydroxymethylcytosine (5-hmC) residues in double-stranded DNA, making beta- glucosyl-5-hydroxymethylcytosine.
Methods for DNA methylation status analysis by means of modification, followed by sequencing
A general scheme, how methylated C-residues mC are detected is schematically shown in Fig. 1. Without any particular treatment, the epigenetic dC modifications 5-methyl-dC (mC), 5-hydroxymethyl-dC (hmC), 5-formyl-dC (fC) and 5-carboxyl- dC (caC) are read as unmodified dC when amplified and/or sequenced. However, oxidation of mC by enzymatic and/or chemical means results in oxidized mC modifications hmC, fC and caC (general: xC).
Oxidation or conversion of 5mC to 5fC can be achieved by using TET enzymes (ten- eleven translocation (TET) methylcytosine dioxygenases). The same class of enzymes are suitable for converting 5mC to 5hmC. 5hmC can be further oxidized to 5fC enzymatically by a laccase enzyme or chemically using either KRuO4 or Cu(II)/TEMPO. Further details are disclosed in Pfeifer et al. Epigenetics & Chromatin 2013, 6: 10, p.1-9.
Subsequently, 5fC, 5hmC or caC are reacted with a reagent of the invention comprising two reactive groups, the first reactive group forming a covalent bond with hmC, fC or caC and thus introducing the second reactive group which comprises a nucleophile. By intramolecular addition of the nucleophile to the C5-C6 double bond of hmC, fC or caC deamination at C4 position is initiated resulting in a modified dU derivative which is read in a subsequent amplification or sequencing step as a T, thus enabling detection of epigenetic modification.
Such a sequencing according to the present invention allows the generation of data in the methylation status of the DNA of interest. Thus, the present also provides a method for analyzing the methylation status of a naturally occurring DNA comprising the steps of a) converting 5mC to 5fC, 5hmC or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
In some embodiments, the sample is derived from a subject or a patient. In some embodiments the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy. The sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples). In other embodiments, the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated. In some embodiments, the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi. The present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art (see, Sambrook et al., & quot; Molecular Cloning: A Laboratory Manual, & quot; 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N. Y.). A variety of kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more. The present invention involves detecting an epigenetic modification, specifically, an epigenetic cytosine modification in nucleic acids (including, but not limited to, cytosine methylation). The nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein. The same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC). In some embodiments, a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai, et al. “Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas” PLOS ONE 10(8): e0133836 (2015), and in Onwuka, et al., “A panel of DNA methylation signature from peripheral blood may predict colorectal cancer susceptibility,” BMC Cancer 20, 692 (2020), methylation of biomarkers in a panel of methylation biomarkers is indicative of the presence of colorectal cancer in the patient. Accordingly, testing any known or future panels of methylation biomarkers for prognostic or diagnostic purposes is envisioned with the method disclosed herein. In some embodiments, the entire genome of an organism is probed for the presence of methylation. The method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described, e.g., in Shull, et al. “Sequencing the cancer methylome,” Methods Mol Biol. 1238:627-635 (2015).
In some embodiments, the method of detecting epigenetic modifications includes sequencing. The nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction. The use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368. A unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR). The unique molecular barcode (UID) has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, et al., “An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage,” Nature Medicine doi: 10.1038/nm.3519 (2014)).
Nanopore sequencing is a unique, scalable technology that enables direct, real-time analysis of long DNA or RNA fragments. It works by monitoring changes to an electrical current as nucleic acids are passed through a protein nanopore. The resulting signal is decoded to provide the specific DNA or RNA sequence. Nanopore-based sequencing technology detects the unique electrical signals of different molecules as they pass through the nanopore with a semiconductor-based electronic detection system. This technology makes for a high throughput, cost effective sequencing solution. At the heart of the technology is the biological nanopore, a protein pore embedded in a membrane, while the brains of the technology lie in the electronics of a semiconductor integrated circuit and proprietary chemistries. The electronic sensor technology embedded in the chip enables automatic membrane assembly and nanopore insertion, while allowing for active control of individual sensors on the circuit. Different sequencing chemistries can be paired with the nanopore and electronic sensor technology to enable high throughput, high accuracy sequencing with faster time-to-data.
In some embodiments, unique molecular barcodes (UIDs) are used for sequencing error correction. The entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family. A variation in the sequence not shared by all members of the barcoded family is discarded as an artefact. Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, et al. “Integrated digital error suppression for improved detection of circulating tumor DNA,” Nature Biotechnology 34:547 (2016)).
In some embodiments, the method involves forming a library comprising nucleic acids from a sample. The library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR. A library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library. In some embodiments, the library is the input nucleic acid in which methylation is detected by the method described herein. In other embodiments, the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
In some embodiments, the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized. The term “Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Patent Publication Nos. US2013/0244340, US2013/0264207, US2014/0134616, US2015/0119259 and US2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears, et al., Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman, et al., Methods Mol. Cell Biol., 3:39-42 (1992)), sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu, et al., Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeq™, MiSeq™, or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiD™, Life Technologies), ion semiconductor sequencing (e.g., Ion Torrent™, Life Technologies), and SMRT® sequencing (e.g., Pacific Biosciences).
Commercially available sequencing technologies include: sequencing-by- hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by- synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.). Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology, Roche Sequencing Solutions, Santa Clara, Cal.), Oxford Nanopore Technologies (Oxford, UK), and sequencing by expansion (Stratos Genomics, Roche Sequencing Solutions). In some embodiments, the sequencing step involves sequence aligning. In some embodiments, aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID). The molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step. In some embodiments, a UID is present in the 5 ’-portion of the RT primer. Similarly, a UID can be present in the 5 ’-end of the last barcode subunit to be added to the compound barcode. In other embodiments, a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
In some embodiments, a consensus sequence is determined from a plurality of sequences all having an identical UID. The sequences having an identical UID are presumed to derive from the same original molecule through amplification. In other embodiments, UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
In some embodiments, the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID). Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID. A person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence. In some embodiments, the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result. In some embodiments, the desired depth is 5-50 reads per UID. Modification of 5fC
Modification of 5fC according to the present invention can be achieved if the first reactive group of the reagent according to the present invention is a C-H acidic group, an amine or a phosphorus ylide.
If the first reactive group is a C-H acidic group, the reaction can be executed as a Knoevenagel condensation, which is schematically depicted in Fig. 2. The CH-acidic reagent comprising a nucleophile group is added via Knoevenagel conditions to fC, then subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at C4 to obtain a “T-equivalent”. Yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by a photoisomerization of the double bond and/or by applying a radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions.
Alternatively, an aldol condensation reaction of 5fC with a CH-acidic reagent comprising a sulfinate moiety (e.g. 1 -(methyl sulfonyl)methane sulfinate), a subsequent addition of a sulfinate moiety at the C-6 position of the substituted cytosine, followed by deamination and desulfmation (5fC to T conversion) can be executed (Fig. 3) A desulfmation step under alkaline conditions may even not be required. Instead of a sulfinate moiety, a thiol moiety can also be applied as nucleophile, addition reaction to the C5-C6 double bond can be achieved by means of applying thiol-ene chemistry using a radical initiator or a catalyst. The yields may be increased by photoisomerization of the newly formed double bond.
Modification of 5fC according to the present invention can also be done by means of a Wittig reaction (Fig. 4). I this case, the reagent according to the present invention is a phosphorus ylide comprising a nucleophilic group. The reagent is added via Wittig conditions to 5fC, then a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling a deamination at the C4 position in order to obtain a “T-equivalent”.
Details of the reaction are shown for a particular example in Fig. 5. In this case, 5fC is reacted with a sulfinate modified Wittig reagent. Subsequently there is an addition of the sulfinate moiety at the C-6 position of substituted cytosine, followed by a deamination and desulfination (5fC to T conversion). The desulfination step under alkaline conditions may not be needed necessarily. Alternatively, instead of a sulfinate moiety, a thiol moiety can also be used by means of applying thiol-ene chemistry, using a radical initiator or a catalyst. Yields of intramolecular addition of the nucleophile to the C5-C6 double bond may be increased by photoisomerization of the newly formed double bond, and/or by applying radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions. Alternatively, the reaction can be performed using Horner-Wadsworth-Emmons reaction conditions, in particular using a sulfinate group as nucleophile.
Yet another alternative is the modification of 5fC by means of reductive amination as shown in Fig. 6. In this case, the reagent of the invention comprising an amine and a further nucleophilic group is added via reductive amination to 5fC. Subsequently there is an intramolecular addition of the nucleophile to the C5-C6 double bond, which enables the deamination at the C4 position to obtain a “T- equivalent”. Reversal of ring closure can be achieved under alkaline conditions, if necessary. Yields can be increased by applying radical initiation or a catalyst.
A more detailed example for implementing a reductive amination reaction is illustrated in Fig. 7. 5fC is reacted with 2-aminomethyl sulfinate, and subsequently addition of the sulfinate moiety at C-6 position of substituted cytosine takes place. This is followed by a deamination and desulfination step, which converts 5fC to T. Again, a desulfination step under alkaline conditions may not be needed. Also again, the sulfinate moiety may be replaced by a thiol moiety using thiol-ene chemistry for the addition reaction to double bond C5-C6, which optionally includes the use of a radical initiator or a catalyst.
Modification of hmC:
Fig. 8 schematically shows a reaction scheme for the modification of hmC. In this case, the reagent used according to the present invention is a UDP -glucose, which is modified at the 2-position with a nucleophilic group. It is added via a B-glucosyl transferase enzyme to hmC. Then, a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”. As discussed above, yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by applying radical initiation or a catalyst and a reversal of the ring closure can be achieved under alkaline conditions, if necessary.
An alternative example is shown in Fig. 9. In this case, 5hmC is reacted with 2-thio- glucose-UDP and B-glucosyltransferase. Subsequently, an addition of the thiol moiety at the C6 position of substituted cytosine applying thiol-ene chemistry and optionally using a radical initiator or a catalyst is taking place, which is followed by a deamination reaction, resulting in a 5hmC to T conversion. Alternatively, 2- Sulfinato-glucose-UDP can also be applied as substrate for B-glucosyltransferase.
Modification of caC:
For this embodiment, which is shown in Fig. 10, the method of the present invention uses a reagent comprising an amine and a further nucleophilic group. It is added after an EDC/NHS activation of the carboxyl group of caC. This results in caC forming an amide bond, and a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”. Again, the yield of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by means of applying radical initiation or a catalyst and reversal of the ring closure can be achieved under alkaline conditions.

Claims

Patent Claims
1. A method comprising a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl, or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6-di-hydro Cytosine entity, and e) deaminating said 5,6-di-hydro Cytosine entity to a 5,6-di-hydro-Uracil entity.
2. Method according to claim 1, further comprising step f) reversing the ring formation of step d) thereby obtaining a 5- substituted Uracil.
3. Method of claim 1-2, further comprising the step of converting 5mC to 5fC, 5hmC or 5caC prior to step a) .
4. A method according to claim 1-3, wherein said nucleic acid comprises 5fC.
5. Method according to claim 4, wherein the first reactive group is a C-H acidic group, an amine or a phosphorus ylide.
6. Method according to claim 1-3, wherein said nucleic acid comprises 5hmC.
7. Method according to claim 6, wherein the first reactive group is a Glycosyl Transferase substrate.
8. Method according to claim 1-3, wherein said nucleic acid comprises 5caC.
9. Method according to claim 8, wherein the first reactive group is an amine.
10. Method according to claims 1-9, wherein said second reactive group is a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
11. Method according to claims 1-10, further comprising a subsequent step of amplifying said nucleic acid.
12. Method according to claims 1-11 further comprising the subsequent step of sequencing said nucleic acid.
13. A composition comprising a nucleic acid treated according to the method of claims 1-10.
14. A kit for performing a method according to claims 1-12.
EP23757600.4A 2022-08-18 2023-08-16 Detection of epigenetic modifications Pending EP4573207A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP22191052 2022-08-18
PCT/EP2023/072497 WO2024038069A1 (en) 2022-08-18 2023-08-16 Detection of epigenetic modifications

Publications (1)

Publication Number Publication Date
EP4573207A1 true EP4573207A1 (en) 2025-06-25

Family

ID=83004497

Family Applications (1)

Application Number Title Priority Date Filing Date
EP23757600.4A Pending EP4573207A1 (en) 2022-08-18 2023-08-16 Detection of epigenetic modifications

Country Status (4)

Country Link
EP (1) EP4573207A1 (en)
JP (1) JP2025527591A (en)
CN (1) CN119731336A (en)
WO (1) WO2024038069A1 (en)

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7393665B2 (en) 2005-02-10 2008-07-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
ES2690753T3 (en) 2010-09-21 2018-11-22 Agilent Technologies, Inc. Increased confidence in allele identifications with molecular count
US10443096B2 (en) 2010-12-17 2019-10-15 The Trustees Of Columbia University In The City Of New York DNA sequencing by synthesis using modified nucleotides and nanopore detection
US20130244340A1 (en) 2012-01-20 2013-09-19 Genia Technologies, Inc. Nanopore Based Molecular Detection and Sequencing
ES2732256T3 (en) 2012-02-16 2019-11-21 Genia Tech Inc Procedures for creating bilayers for use with nanopore sensors
US20150119259A1 (en) 2012-06-20 2015-04-30 Jingyue Ju Nucleic acid sequencing by nanopore detection of tag molecules
US9605309B2 (en) 2012-11-09 2017-03-28 Genia Technologies, Inc. Nucleic acid sequencing using tags
GB202017653D0 (en) 2020-11-09 2020-12-23 Cambridge Entpr Ltd Methods for detection of nucleotide modification

Also Published As

Publication number Publication date
CN119731336A (en) 2025-03-28
JP2025527591A (en) 2025-08-22
WO2024038069A1 (en) 2024-02-22

Similar Documents

Publication Publication Date Title
US12351859B2 (en) Methods for the epigenetic analysis of DNA, particularly cell-free DNA
US10808244B2 (en) Method of normalizing biological samples
EP3146075B1 (en) Ion sensor dna and rna sequencing by synthesis using nucleotide reversible terminators
JP2023535636A (en) Compositions and methods for nucleic acid analysis
EP1117838B1 (en) A method for analyzing polynucleotides
TW202302861A (en) Methods for accurate parallel quantification of nucleic acids in dilute or non-purified samples
JP7646850B2 (en) Methods for base level detection of methylation in nucleic acids - Patents.com
US20240301466A1 (en) Efficient duplex sequencing using high fidelity next generation sequencing reads
US20200040390A1 (en) Methods for Sequencing Repetitive Genomic Regions
EP4573207A1 (en) Detection of epigenetic modifications
CN120019160A (en) TET-assisted pyridine borane sequencing
US20250361549A1 (en) Detection of epigenetic cytosine modification
Shetty et al. Introduction to nucleic acid sequencing
US20250115960A1 (en) Nucleic acid testing method and system
WO2026024692A1 (en) Direct methylation single-base resolution sequencing of enriched unmethylated dna
WO2026024956A1 (en) Selecting low technical noise biomarkers for sensitive molecular diagnostics
WO2024168196A1 (en) Systems and methods for enzymatic synthesis of polynucleotides containing non-standard nucleotide basepairs
WO2025104431A1 (en) Profiling method for determining epigenetic modifications
CN120350100A (en) Method for detecting low frequency mutation of nucleic acid
WO2025002393A1 (en) Method for detecting nucleic acid and use thereof
WO2025257103A1 (en) Methods for double stranded dna sequencing
WO2024213788A1 (en) Method of dna sequencing
HK40093334A (en) Methylated dna fragment enrichment methods, compositions and kits
Honisch The World of Nucleic Acid-based Mass Spectrometry for Microbial and Viral Detection

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: UNKNOWN

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20250318

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)