WO2025188906A1

WO2025188906A1 - Modified adenosine nucleotides

Info

Publication number: WO2025188906A1
Application number: PCT/US2025/018587
Authority: WO
Inventors: Elena CRESSINA; Francesca Manni; Alexandre HOFER; Xiaolin Wu
Original assignee: Illumina Inc
Current assignee: Illumina Inc
Priority date: 2024-03-08
Filing date: 2025-03-05
Publication date: 2025-09-12
Anticipated expiration: 2026-09-08
Also published as: WO2025188906A8

Abstract

Embodiments of the present disclosure relate to modified adenosine nucleosides and nucleotides. Also provided herein are methods and kits for sequencing applications using such nucleotides.

Description

MODIFIED ADENOSINE NUCLEOTIDES

BACKGROUND

Field

[0001] The present disclosure generally relates to compositions, kits and methods for polynucleotide sequencing. The present disclosure also relates to modified adenosine nucleotides for sequencing.

Description of the Related Art

[0002] Advances in the study of molecules have been led, in part, by improvement in technologies used to characterize the molecules or their biological reactions. In particular, the study of the nucleic acids DNA and RNA has benefited from developing technologies used for sequence analysis and the study of hybridization events.

[0003] An example of the technologies that have improved the study of nucleic acids is the development of fabricated arrays of immobilized nucleic acids. These arrays consist typically of a high-density matrix of polynucleotides immobilized onto a solid support material. See, e.g., Fodor et al., Trends Biotech. 12: 19-26, 1994, which describes ways of assembling the nucleic acids using a chemically sensitized glass surface protected by a mask, but exposed at defined areas to allow attachment of suitably modified nucleotide phosphoramidites. Fabricated arrays can also be manufactured by the technique of “spotting” known polynucleotides onto a solid support at predetermined positions (e.g., Stimpson et al., Proc. Natl. Acad. Sei. 92: 6379- 6383, 1995).

[0004] One way of determining the nucleotide sequence of a nucleic acid bound to an array is called “sequencing by synthesis” or “SBS.” This technique for determining the sequence of DNA ideally requires the controlled (i.e., one at a time) incorporation of the correct complementary nucleotide opposite the nucleic acid being sequenced. This allows for accurate sequencing by adding nucleotides in multiple cycles as each nucleotide residue is sequenced one at a time, thus preventing an uncontrolled series of incorporations from occurring. The incorporated nucleotide is read using an appropriate label attached thereto before removal of the label moiety and the subsequent next round of sequencing.

[0005] DNA damage is a well-known problem occurring during SBS sequencing. Damage to template and primer strands can manifest in the form of high phasing, high signal decay, poor resynthesis during pair-end turn, and may result in poor data quality, especially on longer reads. One major factor that causes DNA damage is the exposure to light in the presence of photosensitizers (organic dyes). The mechanisms and the factors affecting photo-induced damage are complex, it has been observed that purine nucleobases such as guanosine (G) and adenosine (A) are more sensitive than pyrimidine nucleobases to photo-induced oxidative damage. Thus, it is desirable to develop modified purine nucleobases with improved photo-stabilities.

SUMMARY

[0006] One aspect of the present disclosure relates to a nucleotide comprising a ribose or deoxyribose, and a modified adenine moiety having the structure of formula (I) or (II): (II), wherein each X is independently N or CR²; each of R¹ and R² is independently H or an electron withdrawing group;

Z¹ is absent, CR^aR^b, NR^C, O, S, C(=O), C(=O)NR^C, S(=O), S(=O)₂, or S(=O)₂NR^C; ring A is a Ce-Cio arylene or a five to 10 membered heteroarylene, each optionally substituted with one or more electron withdrawing groups;

L¹ is absent, -(CH₂)_mNH-, -(CH₂)_mC(=O)NH(CH₂)_n-,

-(CH₂)_mNHC(=O)(CH₂)_n-, -NHC(=O)(CH₂)_kNH-, -C(=O)NH(CH₂)_kNH- -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)m-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N;

Z² is absent, -CH=CH-, -C=C-, -C(=O)-CH=CH-, -C(=O)-C=C-, -C(R^dR^e)-, -C(R^dR^e)-CH=CH-, -C(R^dR^e)-C=C-, unsubstituted or substituted C3-C7 cycloalkylene, unsubstituted or substituted 4 to 7 membered heterocyclylene, -C(=O)(unsubstituted or substituted C3-C7 cycloalkylene), or -C(=O)(unsubstituted or substituted 4 to 7 membered heterocyclylene);

L² is -(CH₂)_mNH-, -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)_m-

(CH₂)mC(=O)NH(CH₂)_n-, -(CH₂)_mNHC(=O)(CH₂)n-, -NHC(=O)(CH₂)_kNH-, -C(=O)NH(CH₂)_kNH-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N ; each of R^a, R^b and R^c is independently H, or unsubstituted or substituted Ci-Cs alkyl; each of R^d and R^e is independently H or an electron withdrawing group, provided that at least one of R^d and R^e is an electron withdrawing group; each of m and n is independently 0, 1, 2, 3, 4, 5 or 6; each k is independently 1, 2, 3, 4, 5, or 6;

* indicates the point of attachment of the modified adenine moiety to the ribose or 2' deoxyribose; and

** indicates the point of the attachment of the modified adenine moiety to a functional moiety or a detectable label, optionally via a cleavable linker; provided that when Z² is absent, then L² is -(CFh -eNH-, -(CH2)_mS(=O)2NH-, -(CH2)mC(=O)NH(CH2)n- -(CH2)_mNHC(=O)(CH2)n- -NHC(=O)(CH₂)_kNH-,

-C(=0)NH(CH2)kNH-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N ; and when -Z -L²- comprises , then X is N or at least one of R¹ and R² is an electron withdrawing group (i.e., at least one of R¹ and R² is not H).

[0007] Another aspect of the present disclosure relates to an oligonucleotide or polynucleotide comprising a nucleotide as described herein incorporated thereto.

[0008] Another aspect of the present disclosure relates to a kit comprising a first type of nucleotide as described in the present disclosure.

[0009] Another aspect of the present disclosure relates to a method of determining the sequences of a plurality of different target polynucleotides, comprising:

(a) contacting a solid support with a solution comprising sequencing primers under hybridization conditions, wherein the solid support comprises a plurality of different target polynucleotides immobilized thereon; and the sequencing primers are complementary to at least a portion of the target polynucleotides;

(b) contacting the solid support with an aqueous incorporation mixture comprising DNA polymerase and one or more of four different types of nucleotides (A, C, G and T or U; dATP, dCTP, dGTP and dTTP or dUTP) under conditions suitable for DNA polymerase-mediated primer extension, and incorporating one type of nucleotides into the sequencing primers to produce extended copy polynucleotides, wherein the A nucleotide is the nucleotide triphosphate as described in the present disclosure carrying a detectable label through a cleavable linker, and wherein each of the one or more of four different type of nucleotides comprises a 3' blocking group covalently attached to the 2' deoxyribose of the nucleotide;

(c) imaging and performing one or more fluorescent measurements of the extended copy polynucleotides; (d) removing the 3' blocking group from the nucleotides incorporated into the extended copy polynucleotides;

(e) washing the solid support after the removal of the 3' blocking group from the incorporated nucleotides; and

(f) repeating steps (b) to (e) until the sequences of at least a portion of the target polynucleotides are determined.

[0010] Another aspect of the present disclosure relates to a method of determining the sequences of a plurality of different target polynucleotides, comprising:

(b) contacting the solid support with an aqueous incorporation mixture comprising DNA polymerase and one or more of four different types of nucleotides (A, C, G and T or U; dATP, dCTP, dGTP and dTTP or dUTP) under conditions suitable for DNA polymerase-mediated primer extension, and incorporating one type of nucleotides into the sequencing primers to produce extended copy polynucleotides, wherein at least two types of nucleotides are unlabeled and at least one type of unlabeled nucleotide is a unlabeled A nucleotide triphosphate as described the present disclosure having a first functional moiety, and wherein each of the one or more of four different type of nucleotides comprises a 3' blocking group covalently attached to the 2' deoxyribose of the nucleotide;

(c) contacting the extended copy polynucleotides with an aqueous labeling mixture comprising a first labeling reagent, wherein the first labeling reagent comprises one or more first detectable labels and a first binding moiety that is capable of specific binding to the first functional moiety of the unlabeled nucleotide;

(d) imaging the solid support and performing one or more fluorescent measurements;

(e) removing the 3' blocking group from the nucleotides incorporated into the extended copy polynucleotides;

(f) washing the solid support after the removal of the 3' blocking group from the incorporated nucleotides; and

(g) repeating steps (b) to (f) until the sequences of at least a portion of the target polynucleotides are determined.

[0011] A further aspect of the present disclosure relates to a method of preparing a growing polynucleotide complementary to a target single-stranded polynucleotide (e.g., in a sequencing application), comprising incorporating an adenosine nucleotide described herein into the growing complementary polynucleotide. In some embodiments, the adenosine nucleotide described herein comprises a 3 ' blocking group and the incorporation of the nucleotide prevents the introduction of any subsequent nucleotide into the growing complementary polynucleotide. In some embodiments, the incorporation of the nucleotide is accomplished by a polymerase, a terminal deoxynucleotidyl transferase (TdT), or a reverse transcriptase. In one embodiment, the incorporation is accomplished by a polymerase (e.g., a DNA polymerase).

DETAILED DESCRIPTION

[0012] Embodiments of the present disclosure relate to modified adenosine nucleotides with improved stability from oxidative damage and degradation during SBS. The modifications may also reduce dye quenching and result in better signal to noise ratio and improved SBS error rates.

Definitions

[0013] The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described.

[0014] It is noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless expressly and unequivocally limited to one referent. It will be apparent to those skilled in the art that various modifications and variations can be made to various embodiments described herein without departing from the spirit or scope of the present teachings. Thus, it is intended that the various embodiments described herein cover other modifications and variations within the scope of the appended claims and their equivalents.

[0015] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of ordinary skill in the art. The use of the term “including” as well as other forms, such as “include,” “includes,” and “included,” is not limiting. The use of the term “having” as well as other forms, such as “have,” “has,” and “had,” is not limiting. As used in this specification, whether in a transitional phrase or in the body of the claim, the terms “comprise(s)” and “comprising” are to be interpreted as having an open-ended meaning. That is, the above terms are to be interpreted synonymously with the phrases “having at least” or “including at least.” For example, when used in the context of a process, the term “comprising” means that the process includes at least the recited steps but may include additional steps. When used in the context of a compound, composition, or device, the term “comprising” means that the compound, composition, or device includes at least the recited features or components, but may also include additional features or components. [0016] As used herein, common organic abbreviations are defined as follows:

°C Temperature in degrees Centigrade dATP Deoxyadenosine triphosphate dCTP Deoxycytidine triphosphate dGTP Deoxyguanosine triphosphate dTTP Deoxythymidine triphosphate ddNTP Dideoxynucleotide triphosphate ffA Fully functionalized A nucleotide or fully functionalized A nucleoside triphosphate ffC Fully functionalized C nucleotide or fully functionalized C nucleoside triphosphate ffG Fully functionalized G nucleotide or fully functionalized G nucleoside triphosphate ffN Fully functionalized nucleotide or fully functionalized nucleoside triphosphate ffT Fully functionalized T nucleotide or fully functionalized T nucleoside triphosphate h Hour(s)

RT Room temperature

SBS Sequencing by Synthesis

USM Universal scan mix

[0017] As used herein, the term “array” refers to a population of different probe molecules that are attached to one or more substrates such that the different probe molecules can be differentiated from each other according to relative location. An array can include different probe molecules that are each located at a different addressable location on a substrate. Alternatively, or additionally, an array can include separate substrates each bearing a different probe molecule, wherein the different probe molecules can be identified according to the locations of the substrates on a surface to which the substrates are attached or according to the locations of the substrates in a liquid. Exemplary arrays in which separate substrates are located on a surface include, without limitation, those including beads in wells as described, for example, in U.S. Patent No. 6,355,431 Bl, US 2002/0102578 and PCT Publication No. WO 00/63437. Exemplary formats that can be used in the invention to distinguish beads in a liquid array, for example, using a microfluidic device, such as a fluorescent activated cell sorter (FACS), are described, for example, in US Pat. No. 6,524,793. Further examples of arrays that can be used in the invention include, without limitation, those described in U.S. Pat Nos. 5,429,807; 5,436,327; 5,561,071;

5,583,211; 5,658,734; 5,837,858; 5,874,219; 5,919,523; 6,136,269; 6,287,768; 6,287,776; 6,288,220; 6,297,006; 6,291,193; 6,346,413; 6,416,949; 6,482,591; 6,514,751 and 6,610,482; and WO 93/17126; WO 95/11995; WO 95/35505; EP 742 287; and EP 799 897.

[0018] As used herein, the term “covalently attached” or “covalently bonded” refers to the forming of a chemical bonding that is characterized by the sharing of pairs of electrons between atoms. For example, a covalently attached polymer coating refers to a polymer coating that forms chemical bonds with a functionalized surface of a substrate, as compared to attachment to the surface via other means, for example, adhesion or electrostatic interaction. It will be appreciated that polymers that are attached covalently to a surface can also be bonded via other means in addition to covalent attachment.

[0019] The term “halogen” or “halo,” as used herein, means any one of the radio-stable atoms of column 7 of the Periodic Table of the Elements, e.g., fluorine, chlorine, bromine, or iodine, with fluorine and chlorine being preferred.

[0020] As used herein, “C_a to Ct>” in which “a” and “b” are integers refer to the number of carbon atoms in an alkyl, alkenyl or alkynyl group, or the number of ring atoms of a cycloalkyl or aryl group. That is, the alkyl, the alkenyl, the alkynyl, the ring of the cycloalkyl, and ring of the aryl can contain from “a” to “b”, inclusive, carbon atoms. For example, a “Ci to C4 alkyl” group refers to all alkyl groups having from 1 to 4 carbons, that is, CH3-, CH3CH2-, CH3CH2CH2- , (CH₃)₂CH-, CH3CH₂CH₂CH₂-, CH₃CH₂CH(CH3)- and (CH₃)₃C-; a C₃ to C₄ cycloalkyl group refers to all cycloalkyl groups having from 3 to 4 carbon atoms, that is, cyclopropyl and cyclobutyl. Similarly, a “4 to 6 membered heterocyclyl” group refers to all heterocyclyl groups with 4 to 6 total ring atoms, for example, azetidine, oxetane, oxazoline, pyrrolidine, piperidine, piperazine, morpholine, and the like. If no “a” and “b” are designated with regard to an alkyl, alkenyl, alkynyl, cycloalkyl, or aryl group, the broadest range described in these definitions is to be assumed. As used herein, the term “Ci-Ce” includes Ci, C₂, C3, C₄, C5 and Ce, and a range defined by any of the two numbers. For example, Ci-Ce alkyl includes Ci, C₂, C3, C₄, C5 and Ce alkyl, C₂-Ce alkyl, C1-C3 alkyl, etc. Similarly, C₂-Ce alkenyl includes C₂, C3, C₄, C5 and Ce alkenyl, C2-C5 alkenyl, Ci-C₄ alkenyl, etc.; and C2-C6 alkynyl includes C₂, C3, C₄, C5 and Ce alkynyl, C2- C5 alkynyl, C3-C1 alkynyl, etc. C3-C8 cycloalkyl each includes hydrocarbon ring containing 3, 4, 5, 6, 7 and 8 carbon atoms, or a range defined by any of the two numbers, such as C3-C7 cycloalkyl or Cs-Ce cycloalkyl.

[0021] As used herein, “alkyl” refers to a straight or branched hydrocarbon chain that is fully saturated (i.e., contains no double or triple bonds). The alkyl group may have 1 to 20 carbon atoms (whenever it appears herein, a numerical range such as “1 to 20” refers to each integer in the given range; e.g., “1 to 20 carbon atoms” means that the alkyl group may consist of 1 carbon atom, 2 carbon atoms, 3 carbon atoms, etc., up to and including 20 carbon atoms, although the present definition also covers the occurrence of the term “alkyl” where no numerical range is designated). The alkyl group may also be a medium size alkyl having 1 to 9 carbon atoms. The alkyl group could also be a lower alkyl having 1 to 6 carbon atoms. By way of example only, “Ci -6 alkyl” or “Ci-Ce alkyl” indicates that there are one to six carbon atoms in the alkyl chain, i.e., the alkyl chain is selected from the group consisting of methyl, ethyl, propyl, iso-propyl, n- butyl, iso-butyl, sec-butyl, and t-butyl. Typical alkyl groups include, but are in no way limited to, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, tertiary butyl, pentyl, hexyl, and the like.

[0022] As used herein, “alkoxy” refers to the formula -OR wherein R is an alkyl as is defined above, such as “C1-9 alkoxy” or “C1-C9 alkoxy”, including but not limited to methoxy, ethoxy, n-propoxy, 1 -methylethoxy (isopropoxy), n-butoxy, iso-butoxy, sec-butoxy, and tertbutoxy, and the like.

[0023] As used herein, “alkenyl” refers to a straight or branched hydrocarbon chain containing one or more double bonds. The alkenyl group may have 2 to 20 carbon atoms, although the present definition also covers the occurrence of the term “alkenyl” where no numerical range is designated. The alkenyl group may also be a medium size alkenyl having 2 to 9 carbon atoms. The alkenyl group could also be a lower alkenyl having 2 to 6 carbon atoms. By way of example only, “C2-C6 alkenyl” or “C2-6 alkenyl” indicates that there are two to six carbon atoms in the alkenyl chain, i.e., the alkenyl chain is selected from the group consisting of ethenyl, propen- 1-yl, propen-2-yl, propen-3-yl, buten-l-yl, buten-2-yl, buten-3-yl, buten-4-yl, 1-methyl-propen-l-yl, 2-methyl-propen-l-yl, 1-ethyl-ethen-l-yl, 2-methyl-propen-3-yl, buta-1, 3-dienyl, buta-1, 2,- dienyl, and buta-1, 2-dien-4-yl. Typical alkenyl groups include, but are in no way limited to, ethenyl, propenyl, butenyl, pentenyl, and hexenyl, and the like.

[0024] As used herein, “alkynyl” refers to a straight or branched hydrocarbon chain containing one or more triple bonds. The alkynyl group may have 2 to 20 carbon atoms, although the present definition also covers the occurrence of the term “alkynyl” where no numerical range is designated. The alkynyl group may also be a medium size alkynyl having 2 to 9 carbon atoms. The alkynyl group could also be a lower alkynyl having 2 to 6 carbon atoms. By way of example only, “C2-6 alkynyl” or “C2-C6 alkenyl” indicates that there are two to six carbon atoms in the alkynyl chain, i.e., the alkynyl chain is selected from the group consisting of ethynyl, propyn-1- yl, propyn-2-yl, butyn-l-yl, butyn-3-yl, butyn-4-yl, and 2-butynyl. Typical alkynyl groups include, but are in no way limited to, ethynyl, propynyl, butynyl, pentynyl, and hexynyl, and the like.

[0025] The term “aromatic” refers to a ring or ring system having a conjugated pi electron system and includes both carbocyclic aromatic (e.g., phenyl) and heterocyclic aromatic groups (e.g., pyridine). The term includes monocyclic or fused-ring polycyclic (i.e., rings which share adjacent pairs of atoms) groups provided that the entire ring system is aromatic.

[0026] As used herein, “aryl” refers to an aromatic ring or ring system (i.e., two or more fused rings that share two adjacent carbon atoms) containing only carbon in the ring backbone. When the aryl is a ring system, every ring in the system is aromatic. The aryl group may have 6 to 18 carbon atoms, although the present definition also covers the occurrence of the term “aryl” where no numerical range is designated. In some embodiments, the aryl group has 6 to 10 carbon atoms. The aryl group may be designated as “Ce-Cio aryl,” “Ce or Cio aryl,” or similar designations. Examples of aryl groups include, but are not limited to, phenyl, naphthyl, azulenyl, and anthracenyl.

[0027] An “aralkyl” or “arylalkyl” is an aryl group connected, as a substituent, via an alkylene group, such as “C7 -14 aralkyl” and the like, including but not limited to benzyl, 2- phenylethyl, 3 -phenylpropyl, and naphthylalkyl. In some cases, the alkylene group is a lower alkylene group (i.e., a C1-6 alkylene group).

[0028] As used herein, “heteroaryl” refers to an aromatic ring or ring system (i.e., two or more fused rings that share two adjacent atoms) that contain(s) one or more heteroatoms, that is, an element other than carbon, including but not limited to, nitrogen, oxygen and sulfur, in the ring backbone. When the heteroaryl is a ring system, every ring in the system is aromatic. The heteroaryl group may have 5-18 ring members (i.e., the number of atoms making up the ring backbone, including carbon atoms and heteroatoms), although the present definition also covers the occurrence of the term “heteroaryl” where no numerical range is designated. In some embodiments, the heteroaryl group has 5 to 10 ring members or 5 to 7 ring members. The heteroaryl group may be designated as “5-7 membered heteroaryl,” “5-10 membered heteroaryl,” or similar designations. Examples of heteroaryl rings include, but are not limited to, furyl, thienyl, phthalazinyl, pyrrolyl, oxazolyl, thiazolyl, imidazolyl, pyrazolyl, isoxazolyl, isothiazolyl, triazolyl, thiadiazolyl, pyridinyl, pyridazinyl, pyrimidinyl, pyrazinyl, triazinyl, quinolinyl, isoquinolinyl, benzimidazolyl, benzoxazolyl, benzothiazolyl, indolyl, isoindolyl, and benzothienyl.

[0029] A “heteroaralkyl” or “heteroarylalkyl” is heteroaryl group connected, as a substituent, via an alkylene group. Examples include but are not limited to 2-thienylmethyl, 3- thienylmethyl, furylmethyl, thienylethyl, pyrrolylalkyl, pyridylalkyl, isoxazolylalkyl, and imidazolylalkyl. In some cases, the alkylene group is a lower alkylene group (i.e., a C1-6 alkylene group).

[0030] As used herein, “carbocyclyl” means a non-aromatic cyclic ring or ring system containing only carbon atoms in the ring system backbone. When the carbocyclyl is a ring system, two or more rings may be joined together in a fused, bridged or spiro-connected fashion. Carbocyclyls may have any degree of saturation provided that at least one ring in a ring system is not aromatic. Thus, carbocyclyls include cycloalkyls, cycloalkenyls, and cycloalkynyls. The carbocyclyl group may have 3 to 20 carbon atoms, although the present definition also covers the occurrence of the term “carbocyclyl” where no numerical range is designated. The carbocyclyl group may also be a medium size carbocyclyl having 3 to 10 carbon atoms. The carbocyclyl group could also be a carbocyclyl having 3 to 6 carbon atoms. The carbocyclyl group may be designated as “C3-6 carbocyclyl”, “C3-C6 carbocyclyl” or similar designations. Examples of carbocyclyl rings include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cyclohexenyl,

2.3-dihydro-indene, bicycle[2.2.2]octanyl, adamantyl, and spiro[4.4]nonanyl.

[0031] As used herein, “cycloalkyl” means a fully saturated carbocyclyl ring or ring system. Examples include cyclopropyl, cyclobutyl, cyclopentyl, and cyclohexyl.

[0032] As used herein, “heterocyclyl” means a non-aromatic cyclic ring or ring system containing at least one heteroatom in the ring backbone. Heterocyclyls may be joined together in a fused, bridged or spiro-connected fashion. Heterocyclyls may have any degree of saturation provided that at least one ring in the ring system is not aromatic. The heteroatom(s) may be present in either a non-aromatic or aromatic ring in the ring system. The heterocyclyl group may have 3 to 20 ring members (i.e., the number of atoms making up the ring backbone, including carbon atoms and heteroatoms), although the present definition also covers the occurrence of the term “heterocyclyl” where no numerical range is designated. The heterocyclyl group may also be a medium size heterocyclyl having 3 to 10 ring members. The heterocyclyl group could also be a heterocyclyl having 3 to 6 ring members. The heterocyclyl group may be designated as “3-6 membered heterocyclyl” or similar designations. In preferred six membered monocyclic heterocyclyls, the heteroatom(s) are selected from one up to three of O, N or S, and in preferred five membered monocyclic heterocyclyls, the heteroatom(s) are selected from one or two heteroatoms selected from O, N, or S. Examples of heterocyclyl rings include, but are not limited to, aziridinyl, azetidinyl, azepanyl, azepinyl, acridinyl, carbazolyl, cinnolinyl, dioxolanyl, imidazolinyl, imidazolidinyl, morpholinyl, oxiranyl, oxepanyl, thiepanyl, piperidinyl, piperazinyl, dioxopiperazinyl, pyrrolidinyl, pyrrolidonyl, pyrrolidionyl, 4-piperidonyl, pyrazolinyl, pyrazolidinyl, 1,3-dioxinyl, 1,3-dioxanyl, 1,4-dioxinyl, 1 ,4-dioxanyl, 1,3-oxathianyl, 1,4- oxathiinyl, 1,4-oxathianyl, 2H- 1 ,2-oxazinyl, trioxanyl, hexahydro-1, 3, 5-triazinyl, 1,3-dioxolyl,

1.3-dioxolanyl, 1,3-dithiolyl, 1,3-dithiolanyl, isoxazolinyl, isoxazolidinyl, oxazolinyl, oxazolidinyl, oxazolidinonyl, thiazolinyl, thiazolidinyl, 1,3-oxathiolanyl, indolinyl, isoindolinyl, tetrahydrofuranyl, tetrahydropyranyl, tetrahydrothiophenyl, tetrahydrothiopyranyl, tetrahydro-

1.4-thiazinyl, thiamorpholinyl, dihydrobenzofuranyl, benzimidazolidinyl, and tetrahydroquinoline.

[0033] As used herein, “alkylene” refers to a branched, or straight chain fully saturated di-radical chemical group containing only carbon and hydrogen that is attached to the rest of the molecule via two points of attachment. By way of example only, “Ci-Ce alkylene” indicates that there are one to six carbon atoms in the alkylene chain. Non-limiting examples include methylene (-CH2-), ethylene (-CH2CH2-), propylene (-CH2CH2CH2-), butylene (-CH2CH2CH2CH2-), and pentylene (-CH2CH2CH2CH2CH2-).

[0034] As used herein, “heteroalkylene” refers to an alkylene group, as defined herein, containing one or more heteroatoms in the carbon back bone (i.e., an alkylene group in which one or more carbon atoms is replaced with a heteroatom, for example, nitrogen atom (N), oxygen atom (S) or sulfur atom (S)). For example, a -CH2- may be replaced with -O-, -S-, or -NH-. Heteroalkylene groups include, but are not limited to ether, thioether, amino-alkylene, and alky lene-amino- alkylene moieties. In some embodiments, the heteroalkylene may include one, two, three, four, or five

-CH2CH2O- unit(s). Alternatively and/or additionally, one or more carbon atoms can also be substituted with an oxo (=0) to become a carbonyl. For example, a -CH2- may be replaced with -C(=O)-. It is understood that when a carbon atom is replaced with a carbonyl group, it refers to the replacement of -CH2- with -C(=O)-. When a carbon atom is replaced with a nitrogen atom, it refers to the replacement of -CH- with -N-. When a carbon atom is replaced with an oxygen or sulfur atom, it refers to the replacement of -CH2- with -O- or -S-.

[0035] As used herein, (cycloalkyl) alkyl refers to a cycloalkyl group connected via an alkylene group, such as (C3-C7 cycloalkyl)Ci-C6 alkyl. Non-limiting examples include cyclohexyl-(CH2)i-6-, cyclopentyl-(CH2)i-6-, or cyclopropyl-(CH2)i-6-.

[0036] As used herein, (aryl)alkyl refers to an aryl group connected via an alkylene group, such as (Ce-Cio aryl)Ci-Ce alkyl. Non-limiting example includes phenyl-(CH2)i-6-

[0037] As used herein, (heteroaryl)alkyl refers to a heteroaryl group connected via an alkylene group, such as (5 to 10 membered heteroaryl)Ci-C6 alkyl. Non-limiting examples include pyridyl-(CH2)i-6-, pyrimidinyl-(CH2)i-6-, or pyrrolyl-(CH2)i-6-.

[0038] As used herein, (heterocyclyl) alkyl refers to a heterocyclyl group connected via an alkylene group, such as (3 to 10 membered heterocyclyl)Ci-Ce alkyl. Non-limiting examples include morpholinyl-(CH2)i-6-, piperidinyl-(CH2)i-6-, piperazinyl-(CH2)i-6-, pyrrolidinyl-(CH2)i-6-, or azetidinyl-(CH2)i-6-.

[0039] As used herein, “alkoxyalkyl” or “(alkoxy)alkyl” refers to an alkoxy group connected via an alkylene group, such as C2-C8 alkoxyalkyl, or (Ci-Ce alkoxy)Ci-Ce alkyl, for example, -(CH2)I-3-OCH3.

[0040] An “O-carboxy” group refers to a “-OC(=O)R” group in which R is selected from hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Ce-io aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein.

[0041] A “C-carboxy” group refers to a “-C(=O)OR” group in which R is selected from the group consisting of hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Ce- 10 aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein. A nonlimiting example includes carboxyl (i.e. , -C(=O)OH).

[0042] A “sulfonyl” group refers to an “-SO2R” group in which R is selected from hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Cg-io aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein.

[0043] A “sulfino” group refers to a “-S(=O)OH” group.

[0044] A “sulfo” group refers to a“-S(=O)2OH” or “-SO3H” group.

[0045] A “sulfonate” group refers to a “-SO3 ” group.

[0046] A “sulfate” group refers to “-SO4 ” group.

[0047] A “S-sulfonamido” group refers to a “-SO2NRARB” group in which RA and RB are each independently selected from hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Ce-io aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein.

[0048] An “N-sulfonamido” group refers to a “-N(RA)SO2RB” group in which RA and Rb are each independently selected from hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Ce-io aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein.

[0049] A “C-amido” group refers to a “-C(=O)NRARB” group in which RA and RB are each independently selected from hydrogen, Ci-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Ce-io aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein.

[0050] An “N-amido” group refers to a “-N( RA)C(=O)RB” group in which RA and RB are each independently selected from hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, Ce-io aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein.

[0051] An “amino” group refers to a “-NRARB” group in which RA and RB are each independently selected from hydrogen, C1-6 alkyl, C2-6 alkenyl, C2-6 alkynyl, C3-7 carbocyclyl, C&- 10 aryl, 5-10 membered heteroaryl, and 3-10 membered heterocyclyl, as defined herein. A nonlimiting example includes free amino (i.e., -NH2).

[0052] An “aminoalkyl” group refers to an amino group connected via an alkylene group.

[0053] An “alkoxyalkyl” group refers to an alkoxy group connected via an alkylene group, such as a “C2-C8 alkoxyalkyl” and the like.

[0054] As used herein, “-OAc” or “-O-acyl” refers to acetyloxy with the structure - O-C(=O)CH₃. [0055] When a group is described as “optionally substituted” it may be either unsubstituted or substituted. Likewise, when a group is described as being “substituted”, the substituent may be selected from one or more of the indicated substituents. As used herein, a substituted group is derived from the unsubstituted parent group in which there has been an exchange of one or more hydrogen atoms for another atom or group. Unless otherwise indicated, when a group is deemed to be “substituted,” it is meant that the group is substituted with one or more substituents independently selected from Ci-Ce alkyl, Ci-Ce alkenyl, Ci-Cr, alkynyl, C3-C7 carbocyclyl (optionally substituted with halo, Ci-Ce alkyl, C 1 -C > alkoxy, Ci-Ce haloalkyl, and Ci- Ce haloalkoxy), Cs-Cv-carbocyclyl-Ci-Ce-alkyl (optionally substituted with halo, Ci-Ce alkyl, Ci- Ce alkoxy, Ci-Ce haloalkyl, and Ci-Ce haloalkoxy), 3-10 membered heterocyclyl (optionally substituted with halo, Ci-Ce alkyl, Ci-Ce alkoxy, Ci-Ce haloalkyl, and Ci-Ce haloalkoxy), 3-10 membered heterocyclyl-Ci-Ce-alkyl (optionally substituted with halo, Ci-Ce alkyl, Ci-Ce alkoxy, Ci-Ce haloalkyl, and Ci-Ce haloalkoxy), aryl (optionally substituted with halo, Ci-Ce alkyl, Ci-Ce alkoxy, Ci-Ce haloalkyl, and Ci-Ce haloalkoxy), aryl(Ci-Ce)alkyl (optionally substituted with halo, Ci-Ce alkyl, Ci-Ce alkoxy, Ci-Ce haloalkyl, and Ci-Ce haloalkoxy), 5-10 membered heteroaryl (optionally substituted with halo, Ci-Ce alkyl, Ci-Ce alkoxy, Ci-Ce haloalkyl, and Ci- Ce haloalkoxy), 5-10 membered heteroaryl(Ci-Ce)alkyl (optionally substituted with halo, Ci-Ce alkyl, Ci-Ce alkoxy, Ci-Ce haloalkyl, and Ci-Ce haloalkoxy), halo, -CN, hydroxy, Ci-Ce alkoxy, Ci-Ce alkoxy (Ci-Ce)alkyl (i.e., ether), aryloxy, sulfhydryl (mercapto), halo(Ci-Ce)alkyl (e.g., - CF3), halo(Ci-Ce)alkoxy (e.g., -OCF3), Ci-Ce alkylthio, arylthio, amino, amino(Ci-Ce) alkyl, nitro, O-carbamyl, N-carbamyl, O-thiocarbamyl, N-thiocarbamyl, C-amido, N-amido, S-sulfonamido, N-sulfonamido, C-carboxy, O-carboxy, acyl, cyanato, isocyanato, thiocyanate, isothiocyanate, sulfinyl, sulfonyl, -SO3H, sulfonate, sulfate, sulfino, -OSChCi-Caalkyl, oxo (=0) and thioxo (=S). Wherever a group is described as “optionally substituted” that group can be substituted with the above substituents. In some embodiments, when an alkyl, alkenyl, alkynyl, aryl, heteroaryl, carbocyclyl or heterocyclyl group is substituted, each is independently substituted with one or more substituents selected from the group consisting of halo, -CN, -SO3 , -OSO3 , -SO3H, -SR^A, are each independently selected from H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, and substituted aryl.

[0056] As understood by one of ordinary skill in the art, a compound described herein may exist in ionized form, e.g., -CO2 , -SO3 , or -O-SCh- . If a compound contains a positively or negatively charged substituent group, for example, -SO3 , it may also contain a negatively or positively charged counterion such that the compound as a whole is neutral. For example, a compound may contain both -CO2 and a quaternary ammonium cation, or both -SO3 and a quaternary ammonium cation. In other aspects, the compound may exist in a salt form, where the counterion is provided by a conjugate acid or base. It is understood that when the compound described herein is substituted with -SO3H, such substituent may exist in its anionic form -SO3 in aqueous solution.

[0057] It is to be understood that certain radical naming conventions can include either a mono-radical or a di-radical, depending on the context. For example, where a substituent requires two points of attachment to the rest of the molecule, it is understood that the substituent is a di-radical. For example, a substituent identified as alkyl that requires two points of attachment includes di-radicals such as -CH2-, -CH2CH2-, -CH2CH(CH3)CH2-, and the like. Other radical naming conventions clearly indicate that the radical is a di-radical such as “alkylene” or “alkenylene.”

[0058] When two “adjacent” R groups are said to form a ring “together with the atom to which they are attached,” it is meant that the collective unit of the atoms, intervening bonds, and the two R groups are the recited ring. For example, when the following substructure is present: and R¹ and R² are defined as selected from the group consisting of hydrogen and alkyl, or R¹ and R² together with the atoms to which they are attached form an aryl or carbocyclyl, it is meant that R¹ and R² can be selected from hydrogen or alkyl, or alternatively, the substructure has structure: where A is an aryl ring or a carbocyclyl containing the depicted double bond.

[0059] Wherever a substituent is depicted as a di-radical (i.e., has two points of attachment to the rest of the molecule), it is to be understood that the substituent can be attached in any directional configuration unless otherwise indicated. Thus, for example, a substituent A depicted as -AE- or "A ^E includes the substituent being oriented such that the A is attached at the leftmost attachment point of the molecule as well as the case in which A is attached at the rightmost attachment point of the molecule. In addition, if a group or substituent is depicted as , and L is defined an optionally present moiety such as a linker moiety; when L is A not present (or absent), such group or substituent is equivalent to 'A ^E

[0060] As used herein, a “nucleotide” includes a nitrogen containing heterocyclic base, a sugar, and one or more phosphate groups. They are monomeric units of a nucleic acid sequence. In RNA, the sugar is a ribose, and in DNA a deoxyribose, i.e., a sugar lacking a hydroxy group that is present in ribose. The nitrogen containing heterocyclic base can be purine, deazapurine, or pyrimidine base. Purine bases include adenine (A) and guanine (G), and modified derivatives or analogs thereof, such as 7-deaza adenine or 7-deaza guanine. Pyrimidine bases include cytosine (C), thymine (T), and uracil (U), and modified derivatives or analogs thereof. The C-l atom of deoxyribose is bonded to N-l of a pyrimidine or N-9 of a purine.

[0061] As used herein, a “nucleoside” is structurally similar to a nucleotide, but is missing the phosphate moieties. An example of a nucleoside analogue would be one in which the label is linked to the base and there is no phosphate group attached to the sugar molecule. The term “nucleoside” is used herein in its ordinary sense as understood by those skilled in the art. Examples include, but are not limited to, a ribonucleoside comprising a ribose moiety and a deoxyribonucleoside comprising a deoxyribose moiety. A modified pentose moiety is a pentose moiety in which an oxygen atom has been replaced with a carbon and/or a carbon has been replaced with a sulfur or an oxygen atom. A “nucleoside” is a monomer that can have a substituted base and/or sugar moiety. Additionally, a nucleoside can be incorporated into larger DNA and/or RNA polymers and oligomers.

[0062] The term “purine base” is used herein in its ordinary sense as understood by those skilled in the art, and includes its tautomers. Similarly, the term “pyrimidine base” is used herein in its ordinary sense as understood by those skilled in the art, and includes its tautomers. A non-limiting list of optionally substituted purine-bases includes purine, adenine, guanine, deazapurine, 7-deaza adenine, 7-deaza guanine, hypoxanthine, xanthine, alloxanthine, 7- alkylguanine (e.g., 7-methylguanine), theobromine, caffeine, uric acid and isoguanine. Examples of pyrimidine bases include, but are not limited to, cytosine, thymine, uracil, 5,6-dihydrouracil and 5-alkylcytosine (e.g., 5 -methylcytosine).

[0063] As used herein, when an oligonucleotide or polynucleotide is described as “comprising” a nucleoside or nucleotide described herein, it means that the nucleoside or nucleotide described herein forms a covalent bond with the oligonucleotide or polynucleotide. Similarly, when a nucleoside or nucleotide is described as part of an oligonucleotide or polynucleotide, such as “incorporated into” an oligonucleotide or polynucleotide, it means that the nucleoside or nucleotide described herein forms a covalent bond with the oligonucleotide or polynucleotide. In some such embodiments, the covalent bond is formed between a 3' hydroxy group of the oligonucleotide or polynucleotide with the 5' phosphate group of a nucleotide described herein as a phosphodiester bond between the 3' carbon atom of the oligonucleotide or polynucleotide and the 5' carbon atom of the nucleotide.

[0064] As used herein, the term “cleavable linker” is not meant to imply that the whole linker is required to be removed. The cleavage site can be located at a position on the linker that ensures that part of the linker remains attached to the detectable label and/or nucleoside or nucleotide moiety after cleavage.

[0065] As used herein, “derivative” or “analog” means a synthetic nucleotide or nucleoside derivative having modified base moieties and/or modified sugar moi eties. Such derivatives and analogs are discussed in, e.g., Scheit, Nucleotide Analogs (John Wiley & Son, 1980) and Uhlman et al., Chemical Reviews 90:543-584, 1990. Nucleotide analogs can also comprise modified phosphodiester linkages, including phosphorothioate, phosphorodithioate, alkyl-phosphonate, phosphoranilidate and phosphoramidate linkages. “Derivative,” “analog,” and “modified” as used herein, may be used interchangeably, and are encompassed by the terms “nucleotide” and “nucleoside” defined herein.

[0066] As used herein, the term “phosphate” is used in its ordinary sense as understood OH

O=P — O— by those skilled in the art, and includes its protonated forms (for example, O' and used herein, the terms “monophosphate,” “diphosphate,” and “triphosphate” are used in their ordinary sense as understood by those skilled in the art, and include protonated forms.

[0067] As used herein, the term “thiophosphate” is used in its ordinary sense as understood by those skilled in the art, refers to a phosphate group in which one or more oxygens atom has been replaced with a sulfur atom. As used herein, the terms “thiodiphosphate,” and “thiotriphosphate” are used in their ordinary sense as understood by those skilled in the art, and include protonated forms, refers to a diphosphate or a triphosphate group in which one or more oxygens atom has been replaced with a sulfur atom.

[0068] As used herein, the term “phasing” refers to a phenomenon in SBS that is caused by incomplete removal of the 3' terminators and fluorophores, and/or failure to complete the incorporation of a portion of DNA strands within clusters by polymerases at a given sequencing cycle. Prephasing is caused by the incorporation of nucleotides without effective 3' terminators, wherein the incorporation event goes 1 cycle ahead due to a termination failure. Phasing and prephasing cause the measured signal intensities for a specific cycle to consist of the signal from the current cycle as well as noise from the preceding and following cycles. As the number of cycles increases, the fraction of sequences per cluster affected by phasing and prephasing increases, hampering the identification of the correct base. Prephasing can be caused by the presence of a trace amount of unprotected or unblocked 3'-OH nucleotides during sequencing by synthesis (SBS). The unprotected 3'-OH nucleotides could be generated during the manufacturing processes or possibly during the storage and reagent handling processes. Accordingly, the discovery of nucleotide analogues which decrease the incidence of prephasing is surprising and provides a great advantage in SBS applications over existing nucleotide analogues. For example, the nucleotide analogues provided can result in faster SBS cycle time, lower phasing and prephasing values, and longer sequencing read lengths.

Modified Adenosine Nucleosides and Nucleotides

[0069] One aspect of the present disclosure relates to a nucleoside or nucleotide comprising a ribose or 2" deoxyribose, and a modified adenine moiety having the structure of formula (I) or (II):

(I), (II), wherein each X is independently N or CR²; each of R¹ and R² is independently H or an electron withdrawing group;

L¹ is absent, -(CH₂)_mNH-, -(CH₂)_mC(=O)NH(CH₂)_n-,

-(CH₂)_mNHC(=O)(CH₂)n-, -NHC(=O)(CH₂)_kNH-, -C(=O)NH(CH₂)_kNH- -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)m- or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N;

Z² is absent, -CH=CH-, -C=C-, -C(=O)-CH=CH-, -C(=0)-OC-, -C(R^dR^e)-, -C(R^dR^e)-CH=CH-, -C(R^dR^e)-C=C-, unsubstituted or substituted C3-C7 cycloalkylene, unsubstituted or substituted 4 to 7 membered heterocyclylene, -C(=O)(unsubstituted or substituted C -C7 cycloalkylene), or -C(=O)(unsubstituted or substituted 4 to 7 membered heterocyclylene);

L² is -(CH₂)_mNH-, -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)_m-

(CH₂)_mC(=O)NH(CH₂)_n-,

-(CH₂)_mNHC(=O)(CH₂)_n-, -NHC(=O)(CH₂)_kNH- -C(=O)NH(CH₂)_kNH- or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N; each of R^a, R^b and R^c is independently H, or unsubstituted or substituted Ci-G, alkyl; each of R^d and R^c is independently H or an electron withdrawing group, provided that at least one of R^d and R^e is an electron withdrawing group; each of m and n is independently 0, 1, 2, 3, 4, 5 or 6; each k is independently 1, 2, 3, 4, 5, or 6;

** indicates the point of the attachment of the modified adenine moiety to a functional moiety or a detectable label, optionally via a cleavable linker. In some embodiments, when Z² is absent, then L² is -(CH₂)i-eNH-, -(CH₂)_mS(=O)₂NH-, -(CH₂)_mC(=O)NH(CH₂)n-, -(CH₂)_mNHC(=O)(CH₂)_n-, -NHC(=O)(CH₂)_kNH-

-C(=O)NH(CH₂)_kNH-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N. In some embodiments, when -Z²-L²- comprises or is , then X is N or at least one of R and R is an electron withdrawing group (i.e., at least one of R¹ and R² is not H).

[0070] In some embodiments of the nucleotide described herein, the nucleoside or nucleotide has the structure of formula (I- A) or (II- A): wherein

R³ is OH, a protected hydroxy, monophosphate, diphosphate, triphosphate, thiophosphate, thiodiphosphate, or thiotriphosphate;

PG is a 3' hydroxy blocking group; L^c is the cleavable linker; and

R^x is the functional moiety or the detectable label.

[0071] In some embodiments of the nucleoside or nucleotide described herein, X is N. In other embodiments, X is CR². In some such embodiments, R² is H. In some other embodiments,

R2 is an electron withdrawing group including but not limited to halo (F, Cl, Br or 1), CN, NO2 , acetal (-C(=O)H), carboxy (-COOH), or an ester group), or a Ci-Ce alkyl substituted with one or more halo, CN, NO2 , acetal, carboxy or ester. For example, R² is F, CN, NO2, CHF2, CH2F, or CF3. In some embodiments, R¹ is H. In some other embodiments, R¹ is an electron withdrawing group described herein, such as F, CN, NO2, CHF2, CH2F, or CF3.

[0072] In some embodiments of the nucleoside or the nucleotide of formula (I) or (I-

A), ring A is phenylene, pyridylene, pyrimidylene, triazolylene, pyrazolylene, pyrrolylene, furylene, thienylene, imidazolylene, thiazolylene, isothiazolylene, oxazolylene, or isoxazolylene, each optionally substituted with one or more electron withdrawing groups described herein. In herein (such as one or more substituents selected from F, CN, NO2, CHF2, CH2F, or CF3). In some embodiments, Z¹ is absent. In other embodiments, Z¹ is CR^aR^b, NR^C, O or S. In still other embodiments, Z¹ is S(=O), S(=O)2, or S(=O)2NR^C. In still other embodiments, Z¹ is C(=O) or C(=O)NR^C. In some such embodiments, each of R^a, R^b and R^c is H. In some embodiments, L¹ comprises or is -CH2NH-, -CH₂C(=O)NH-

-C(=O)NH-,-CH₂NHC(=O)CH2-, -CH₂S(=O)₂NH-, -S(=O)₂CH₂-, or a 3 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N (e.g., C3, C4, C5, Ce, C7 or Cs alkylene where one, two or three carbon atom is replaced with a heteroatom such as -O- , -S-,

-N- or =0).

[0073] In some embodiments of the nucleoside or the nucleotide of formula (II) or (II- A), Z² is -CH=CH-. In another embodiment, Z² is -C(=O)-CH=CH-. In other embodiment, Z² is

-C(R^dR^e)-, -C(R^dR^e)-CH=CH-, or -C(R^dR^e)-C=C-, where one or both of R^d and R^e is an electron withdrawing group as described herein, for example, Z² is -CF₂- or -CF₂-CH=CH-. In other embodiments, Z² is unsubstituted or substituted cyclopropylene, unsubstituted or substituted cyclobutylene, or unsubstituted or substituted azetidinylene. For example, Z² is , , each unsubstituted or substituted with an electron withdrawing group as described herein. In some embodiments, L² comprises or is -CH₂NH-, -S(=O)₂NH-, or - CH₂C(=O)NH-.

[0074] In some other embodiments of the nucleoside or the nucleotide of formula (II) or (II-A), Z² is absent and L² is -(CH₂)_mNH-, -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)_m-,

-(CH₂)_mC(=O)NH(CH₂)_n-, -(CH₂)_inNHC(=O)(CH₂)_n-, -NHC(=O)(CH₂)_kNH-

-C(=O)NH(CH₂)kNH-, or 3 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N. In some such embodiments, L² comprises or is -S(=O)₂NH-, — S(=O)₂CH₂— , -C(=O)NH-, -NHC(=O)CH₂NH- or -C(=O)NHCH₂NH-.

[0075] In some embodiments of the nucleoside or the nucleotide described herein (e.g., formula (I-A) or (II-A)), PG is azidomethyl (-CH₂N3), allyl (-CH₂CH=CH₂), or - CH₂OCH₂CH=CH₂ (“AOM”). In some embodiments, the cleavable linker L^c and the 3' hydroxy blocking group are cleavable under the same reaction condition. In other embodiments, PG is a glycoside blocking group that is subject to enzymatic cleavage. For example, -OPG may have the structure the squiggle line indicates the point of 3'- oxygen’s attachment to the 3 '-carbon atom.

[0076] In another aspect, the present disclosure provides for an oligonucleotide or polynucleotide comprising the nucleotide as described herein incorporated thereto. In some embodiments, the oligonucleotide or polynucleotide is at least partially complementary and hybridized to a target polynucleotide immobilized on a surface of a solid support. In further embodiments, the solid support comprises an array of different target polynucleotides immobilized thereon.

[0077] Additional embodiments of the present disclosure relate to a solid support comprising an array of a plurality of immobilized template or target polynucleotides and at least a portion of such immobilized template or target polynucleotides is hybridized to an oligonucleotide or a polynucleotide comprising the nucleotide described herein. Labeled Nucleotides

[0078] According to an aspect of the disclosure, the modified adenosine nucleotide described herein also comprises a detectable label, (e.g., R^x). Such a nucleotide is referred to herein as a “labeled nucleotide.” The label (e.g., a fluorescent dye) can be conjugated via an optionally present linker by a variety of means including hydrophobic attraction, ionic attraction, and covalent attachment. In some aspect, the detectable label is conjugated to the substrate by covalent attachment. More particularly, the covalent attachment is by means of a linker group, such as a cleavable linker.

[0079] Various fluorescent dyes may be used in the present disclosure as detectable labels, in particularly those dyes that may be excitation by a blue light (e.g., about 450 nm to about 460 nm) or a green light (e.g., about 520 nm to about 540 nm), or a light source having a wavelength in between blue and green. These dyes may also be referred to as “blue dyes” and “green dyes” respectively. Examples of various type of blue dyes, including but not limited to coumarin dyes, chromenoquinoline dyes, bisboron containing heterocycles, and naphthalimide dyes that are disclosed in U.S. Publication Nos. 2018/0094140, 2018/0201981, 2020/0277529, 2020/0277670, 2021/0188832, 2022/0033900, 2022/0195196, 2022/0195517, 2022/0380389, 2023/0313292, and 2023/0416279, and U.S. Ser. Nos. 63/492896 and 63/593489, each of which is incorporated by reference in its entirety. Examples of green dyes including cyanine or polymethine dyes disclosed in International Publication Nos. W02013/041117, WO2014/135221, WO 2016/189287, W02017/051201 and W02018/060482A1, as well as U.S. Ser. No. 63/616289, each of which is incorporated by reference in its entirety.

[0080] Labeled nucleosides and nucleotides are useful for labeling polynucleotides formed by enzymatic synthesis, such as, by way of non-limiting example, in PCR amplification, isothermal amplification, solid phase amplification, polynucleotide sequencing (e.g., solid phase sequencing), nick translation reactions and the like.

[0081] In some embodiments, the dye may be covalently attached to oligonucleotides or nucleotides via the nucleotide base. For example, the labeled nucleotide or oligonucleotide may have the label attached to the C5 position of a pyrimidine base or the C7 position of a 7-deaza purine base through a linker moiety.

[0082] Alternatively, the modified adenosine nucleotide may be unlabeled but include a functional moiety for use in a post-incorporation labeling SBS method. Post- incorporation SBS kits and methods have been described in U.S. Publication No. 2023/0383342 Al, which is incorporated by reference in its entirety. Non-limiting examples of noncovalent interaction between a functional moiety (e.g., R^x) of the nucleotide and a binding moiety of the labeling reagent include but are not limited to avidin (e.g., streptavidin or neutravidin) and biotin; dinitrophenyl (DNP) moiety and anti-DNP antibody; digoxigenin (DIG) and anti-DIG antibody; P-N-acetyl glucosamine (O-GlcNAc) and WGA (lectin); alkyl guanine moiety and SNAP-Tag®, alkyl chloride moiety and HaloTag®; 3 -nitrotyrosine and anti-nitrotyrosine antibody; nickel or cobalt complex such as Ni-nitrilotriacetic acid (NTA) and His-Tag; zinc complex and oligoaspartate protein. Non-limiting examples of covalent interaction between a functional moiety (e.g., R^x) and a labeling reagent include but are not limited to a biorthogonal reaction selected from the group consisting of a [3+2] dipolar cycloaddition, a Diels-Alder cycloaddition, a [4+1] cycloaddition, a phosphine ligation, or condensation with 2-acylphenyl boronic acid. For example, one of the functional moiety and the binding moiety comprises or is norbornene, transcyclooctene (TCO), dibenzocyclooctyne (DBCO), or bicyclo[6.1.0]nonyne (BCN), and the other one of the functional moiety and the binding moiety comprises or is azido. In some other embodiments, one of the functional moiety and the binding moiety comprises or is TCO, and the other one of the functional moiety and the binding moiety comprises or is an optionally substituted 1, 2,4,5 - tetrazine moiety.

Linkers

[0083] In some embodiments described herein, the adenosine base of the nucleotide or nucleoside molecules described herein can be linked to a detectable label or a functional moiety that can be attached to a labeling reagent or a functional moiety (e.g., R^x) as described above. In some such embodiments, the linkers used are cleavable. The use of a cleavable linker ensures that the label can, if required, be removed after detection, avoiding any interfering signal with any labeled nucleotide or nucleoside incorporated subsequently. In some embodiments, the cleavable linker comprises an azido moiety, a -O-C2-C6 alkenyl moiety (e.g., -O-allyl), a disulfide moiety, an acetal moiety (same or similar to the 3' blocking group AOM described herein), or a thiocarbamate moiety.

[0084] In some other embodiments, the linkers used are non-cleavable. For instance, where a nucleotide of the present disclosure is incorporated, no nucleotides need to be subsequently incorporated and thus the label need not be removed from the nucleotide.

[0085] Cleavable linkers are known in the art, and conventional chemistry can be applied to attach a linker to a nucleotide base and a label. The linker can be cleaved by any suitable method, including exposure to acids, bases, nucleophiles, electrophiles, radicals, metals, reducing or oxidizing agents, light, temperature, enzymes etc. The linker as discussed herein may also (or alternatively) be cleaved using an enzyme. In some examples, the same enzymatic catalyst used to cleave the 3'-O-blocking group bond can be used to cleave the linker. In some further embodiments, the linker group may be enzymatically cleavable by the same enzyme that can cleave the blocking group (e.g., PG) in accordance with the present disclosure. In other embodiments, the linker group may be enzymatically cleavable by a different enzyme than the enzyme capable of cleaving the blocking group. Suitable linkers can be adapted from standard chemical protecting groups, as disclosed in Greene & Wuts, Protective Groups in Organic Synthesis, John Wiley & Sons. Further suitable cleavable linkers used in solid-phase synthesis are disclosed in Guillier et al. (Chem. Rev. 100:2092-2157, 2000).

[0086] Where the detectable label is attached to the base, the linker can be attached at any position on the nucleotide base provided that Watson-Crick base pairing can still be carried out. In the context of purine bases, it is preferred if the linker is attached via the 7-position of the purine or the preferred deazapurine analogue, via an 8-modified purine, via an N-6 modified adenosine.

[0087] In some embodiments, the cleavable linker can further comprise a spacer unit. It may be desirable that the length of the linker holds the label at a sufficient distance from the nucleotide so as not to interfere with any interaction between the nucleotide and an enzyme, for example, a polymerase.

[0088] In some embodiments, the cleavable linker may consist of the similar functionality as the 3'-OH protecting group. This may make the deprotection and deprotecting process more efficient, as only a single treatment will be required to remove both the label and the protecting group.

[0089] Use of the term “cleavable linker” is not meant to imply that the whole linker is required to be removed. The cleavage site can be located at a position on the linker that ensures that part of the linker remains attached to the dye and/or substrate moiety after cleavage. Cleavable linkers may be, by way of non-limiting example, electrophilically cleavable linkers, nucleophilically cleavable linkers, photocleavable linkers, cleavable under reductive conditions (for example disulfide or azide containing linkers), oxidative conditions, cleavable via use of safety-catch linkers and cleavable by elimination mechanisms. The use of a cleavable linker to attach the dye compound to a substrate moiety ensures that the label can, if required, be removed after detection, avoiding any interfering signal in downstream steps.

[0090] Useful linker groups may be found in PCT Publication No. W02004/018493 (herein incorporated by reference), examples of which include linkers that may be cleaved using water-soluble phosphines or water-soluble transition metal catalysts formed from a transition metal and at least partially water-soluble ligands, for example, a Pd(II) complex and THP. In aqueous solution the latter form at least partially water-soluble transition metal complexes. Such cleavable linkers can be used to connect bases of nucleotides to labels such as the dyes set forth herein. [0091] Particular linkers include those disclosed in PCT Publication No. W02004/018493 (herein incorporated by reference) such as those that include moieties of the formulae:

(wherein X is selected from the group comprising O, S, NH and NQ wherein Q is a Ci-io substituted or unsubstituted alkyl group, Y is selected from the group comprising O, S, NH and N(allyl), T is hydrogen or a Ci-Cio substituted or unsubstituted alkyl group and * indicates where the moiety is connected to the remainder of the nucleotide or nucleoside). In some aspect, the linkers connect the bases of nucleotides to labels such as, for example, the dye compounds described herein.

[0092] Additional examples of linkers include those disclosed in U.S. Publication No.

2016/0040225 (herein incorporated by reference), such as those include moieties of the formulae:

The linker moieties illustrated herein may comprise the whole or partial linker structure between the nucleotides/nucleosides and the labels.

[0093] Additional examples of linkers are disclosed in U.S. Publication No.

2020/0216891 Al, which is incorporated by reference in its entirety:

is 1, 2, 3, 4, 5; k is 1; Z is -N3 (azido), -O-Ci-Ce alkyl, -O-C2-C6 alkenyl, or -O-C2-C6 alkynyl; and R comprises a detectable label (e.g., a fluorescent dye described herein) or a functional moiety for post-incorporation labeling, which may contain additional linker and/or spacer structure. One of ordinary skill in the art understands that the detectable label or the binding moiety described herein is covalently bound to the linker by reacting a functional group of the binding moiety containing compound (e.g., carboxyl) with a functional group of the linker (e.g., amino) to form an amide bond. In one embodiment, the cleavable linker comprises (“AOL” linker moiety) where Z is

-O-allyl. For the purpose of the present disclosure, the nucleotide may contain multiple cleavable linkers repeating units (e.g., k is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10). The linker can also comprise a spacer unit, such as one or more PEG unit(s) (-OCFECFE-ln, where n is an integer of 1-20, for example 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15. The spacer distances, for example, the nucleotide base from a cleavage site or label.

[0094] In particular embodiments, the length of the linker between a fluorescent dye (fluorophore) and a nucleobase can be altered, for example, by introducing a polyethylene glycol spacer group, thereby increasing the fluorescence intensity compared to the same fluorophore attached to the guanine base through other linkages known in the art. Exemplary linkers and their properties are set forth in PCT Publication No. WO 2007/020457 (herein incorporated by reference). The design of linkers, and especially their increased length, can allow improvements in the brightness of fluorophores attached to the guanine bases of guanosine nucleotides when incorporated into polynucleotides such as DNA. Thus, when the dye is for use in any method of analysis which requires detection of a fluorescent dye label attached to a guanine-containing nucleotide, it is advantageous if the linker comprises a spacer group of formula -((CHz Ojn-, wherein n is an integer between 2 and 50, as described in WO 2007/020457.

[0095] In particular embodiments the labeled nucleotide may be enzymatically incorporable and enzymatically extendable. Accordingly, a linker moiety may be of sufficient length to connect the nucleotide to the compound such that the compound does not significantly interfere with the overall binding and recognition of the nucleotide by a nucleic acid replication enzyme. Thus, the linker can also comprise a spacer unit. The spacer distances, for example, the nucleotide base from a cleavage site or label.

[0096] In a particular embodiment, the linker (between dye and nucleotide) and blocking group are both present and are separate moieties. In particular embodiments, the linker and blocking group are both cleavable under substantially similar conditions. Thus, deprotection and deblocking processes may be more efficient because only a single treatment will be required to remove both the dye compound and the blocking group. However, in some embodiments a linker and blocking group need not be cleavable under similar conditions, instead being individually cleavable under distinct conditions.

[0097] In some embodiments, non-limiting exemplary fully functionalized nucleotide conjugates are shown below: is an integer of 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, p is 1 or 4. In some embodiments, q is 2 or 5. In further embodiments, the nucleotide may be attached to the functional moiety via more than one of the same cleavable linkers , where t is 2, 3, 4, 5, 6, 7 or 8. In addition, the linker may further include additional PEG spacers as described herein, for example, between R^x the cleavable linker.

Kits

[0098] One aspect of the present disclosure relates to a kit comprising one or more nucleotides, wherein one type of nucleotide is the modified adenosine nucleotide described herein.

[0099] In some embodiments, the kit may contain four types of labeled nucleotides (A, C, G and T or U; dATP, dCTP, dGTP and dTTP or dUTP), where one or more of the four types of nucleotides is labeled. In some embodiments, the first type of nucleotide is the modified adenosine nucleotide described herein, which carries a first detectable label. In further embodiments, the kit comprises four types of nucleotides, wherein a second type of nucleotide carries a second detectable label, a third type of nucleotide carries a third detectable label, and a fourth type of nucleotide is unlabeled (dark), and wherein each of the detectable labels has a distinct emission maximum that is distinguishable from the other detectable labels. In other embodiments, a second type of nucleotide carries a second detectable label, a third type of nucleotide comprises a mixture of the third type of nucleotide carrying the first detectable label and the third type of nucleotide carrying the second detectable label, and a fourth type of nucleotide is unlabeled (dark). Thus, one or more of the label nucleotides can have a distinct absorbance maximum and/or emission maximum such that the compound(s) is(are) distinguishable from other compounds. For example, each compound can have a distinct absorbance maximum and/or emission maximum such that each of the compounds is spectrally distinguishable from the other three compounds (or two compounds if the fourth nucleotide is unlabeled). It will be understood that parts of the absorbance spectrum and/or emission spectrum other than the maxima can differ and these differences can be exploited to distinguish the compounds. The kit may be such that two or more of the compounds have a distinct absorbance maximum.

[0100] The compounds, nucleotides, or kits that are set forth herein may be used to detect, measure, or identify a biological system (including, for example, processes or components thereof). Exemplary techniques that can employ the compounds, nucleotides or kits include sequencing, expression analysis, hybridization analysis, genetic analysis, RNA analysis, cellular assay (e.g., cell binding or cell function analysis), or protein assay (e.g., protein binding assay or protein activity assay). The use may be on an automated instrument for carrying out a particular technique, such as an automated sequencing instrument. The sequencing instrument may contain two light sources operating at different wavelengths. [0101] In a particular embodiment, the labeled nucleotide(s) described herein may be supplied in combination with unlabeled or native nucleotides, or any combination thereof. Combinations of nucleotides may be provided as separate individual components (e.g., one nucleotide type per vessel or tube) or as nucleotide mixtures (e.g., two or more nucleotides mixed in the same vessel or tube).

[0102] Where kits comprise a plurality, particularly two, or three, or more particularly four, nucleotides, the different nucleotides may be labeled with different dye compounds, or one may be dark, with no dye compounds. Where the different nucleotides are labeled with different dye compounds, it is a feature of the kits that the dye compounds are spectrally distinguishable fluorescent dyes. As used herein, the term "spectrally distinguishable fluorescent dyes" refers to fluorescent dyes that emit fluorescent energy at wavelengths that can be distinguished by fluorescent detection equipment (for example, a commercial capillary-based DNA sequencing platform) when two or more such dyes are present in one sample. When two nucleotides labeled with fluorescent dye compounds are supplied in kit form, it is a feature of some embodiments that the spectrally distinguishable fluorescent dyes can be excited at the same wavelength, such as, for example by the same light source. When four nucleotides labeled with fluorescent dye compounds are supplied in kit form, it is a feature of some embodiments that two of the spectrally distinguishable fluorescent dyes can both be excited at one wavelength and the other two spectrally distinguishable dyes can both be excited at another wavelength. Particular excitation wavelengths for the dyes are between 450-460 nm, 490-500 nm, or 520 nm or above (e.g., 532 nm).

[0103] In some embodiments, the kit further comprises a DNA polymerase and one or more buffer compositions.

Post-incorporation labeling kits

[0104] In other embodiments, one or more types of nucleotides (A, G, C, and T or U; dATP, dCTP, dGTP and dTTP or dUTP) are unlabeled. In some embodiments, the first type of nucleotide is the unlabeled modified adenosine nucleotide described herein and has a first functional moiety that can attach to a labeling reagent. In further embodiments, the kit further comprises a first labeling reagent, wherein the first labeling reagent comprises one or more first detectable labels and a first binding moiety that is capable of specific binding to the first functional moiety of the first type of unlabeled nucleotide. In some such embodiments, the first functional moiety of the first type of unlabeled nucleotide is bound to the first labeling reagent by either covalent bonding or noncovalent interaction via a cleavable linker described herein. In some embodiments, each of the four types of nucleotides is unlabeled, and wherein the second type of unlabeled nucleotides comprises a second functional moiety, and the kit further comprises a second labeling reagent, wherein the second labeling reagent comprises one or more second detectable labels and a second binding moiety that is capable of specific binding to the second functional moiety of the second type of unlabeled nucleotide. In some such embodiments, the second functional moiety of the second type of unlabeled nucleotide is bound to the second labeling reagent by either covalent bonding or noncovalent interaction via a cleavable linker described herein. In some further embodiments, the third type of unlabeled nucleotides comprises a third functional moiety, and the kit further comprises a third labeling reagent, wherein the third labeling reagent comprises one or more third detectable labels and a third binding moiety that is capable of specific binding to the third functional moiety of the third type of unlabeled nucleotide. In some such embodiments, the third functional moiety of the third type of unlabeled nucleotide is bound to the third labeling reagent by either covalent bonding or noncovalent interaction via a cleavable linker described herein. In other embodiments, the third type of unlabeled nucleotides comprises a mixture of the third type of unlabeled nucleotides comprising the first functional moiety and the third type of unlabeled nucleotides comprising the second functional moiety, and wherein both the first labeling reagent and the second labeling reagent are capable of specific binding to the third type of unlabeled nucleotides. The fourth type of unlabeled nucleotides is not capable of specific binding with any of the first, second, or third labeling reagent.

[0105] Post-incorporation labeling kits and methods have been described in U.S. Publication No. 2023/0383342 Al, which is incorporated by reference in its entirety. Non-limiting examples of noncovalent interaction between a functional moiety of the nucleotide and a binding moiety of the labeling reagent include but are not limited to avidin (e.g., streptavidin or neutravidin) and biotin; dinitrophenyl (DNP) moiety and anti-DNP antibody; digoxigenin (DIG) and anti-DIG antibody; P-N-acetyl glucosamine (O-GlcNAc) and WGA (lectin); alkyl guanine moiety and SNAP-Tag®, alkyl chloride moiety and HaloTag®; 3 -nitrotyrosine and anti- nitrotyrosine antibody; nickel or cobalt complex such as Ni-nitrilotriacetic acid (NTA) and His- Tag; zinc complex and oligo- aspartate protein. Non-limiting examples of covalent interaction between a functional moiety and a labeling reagent include but are not limited to a biorthogonal reaction selected from the group consisting of a [3+2] dipolar cycloaddition, a Diels-Alder cycloaddition, a [4+1] cycloaddition, a phosphine ligation, or condensation with 2-acylphenyl boronic acid. For example, one of the functional moiety and the binding moiety comprises or is norbornene, transcyclooctene (TCO), dibenzocyclooctyne (DBCO), or bicyclo[6.1.0]nonyne (BCN), and the other one of the functional moiety and the binding moiety comprises or is azido. In some other embodiments, one of the functional moiety and the binding moiety comprises or is TCO, and the other one of the functional moiety and the binding moiety comprises or is an optionally substituted 1,2,4,5-tetrazine moiety.

[0106] Some additional embodiments of the present disclosure relate to a sequencing kit comprising:

(a) an incorporation mixture comprising DNA polymerase and nucleotides A, G, C, and T or U (e.g. dATP, dCTP, dGTP and dTTP or dUTP) as described herein, wherein the DNA polymerase is an altered archaeal DNA polymerase;

(b) an aqueous deblocking solution comprising a palladium catalyst, tris(hydroxyalkyl)phosphine, and one or more buffer reagents that is suitable to chemically remove (i) 3' blocking groups from incorporated nucleotides to expose a 3 ’-OH group for further nucleotide incorporation on the solid support, and (ii) any detectable labels attached via cleavable linkers; and

(c) an aqueous wash solution comprising a Pd(II) scavenger; wherein said kit is configured for performing at least about 50, 100, 150, 200, 250, 300, 350, 400, 450 or 500 cycles of sequencing-by-syn thesis.

[0107] In some further embodiment, the kit may comprise four types of labeled nucleotides of fully functionalized nucleotides described herein (A, C, T and G), where each type of nucleotide comprises the 3' blocking group (such as the AOM blocking group). In further embodiments, G is unlabeled and does not comprise any cleavable linker. In some further embodiments, at least one type of the nucleotides comprises a base that is attached to a detectable label via a cleavable linker described herein.

Methods of Sequencing

[0108] An aspect of the present disclosure relates to a method of determining the sequences of a plurality of different target polynucleotides, comprising:

(b) contacting the solid support with an aqueous incorporation mixture comprising DNA polymerase and one or more of four different types of nucleotides (A, G, C, and T or U; dATP, dGTP, dCTP and dTTP or dUTP) under conditions suitable for DNA polymerase-mediated primer extension, and incorporating one type of nucleotides into the sequencing primers to produce extended copy polynucleotides, wherein the A nucleotide is the modified adenosine nucleotide as described herein carrying a detectable label through a cleavable linker, and wherein each of the one or more of four different type of nucleotides comprises a 3' blocking group covalently attached to the 2' deoxyribose of the nucleotide;

(c) imaging and performing one or more fluorescent measurements of the extended copy polynucleotides to determine the identity of incorporated nucleotides;

(d) removing the 3' blocking group from the nucleotides incorporated into the extended copy polynucleotides;

[0109] In some embodiments, steps (b) to (e) are repeated at least 50, 100, 150, 200, 250, 300, 350, 400, 450 or 500 cycles. In some embodiments, the removal of the 3' blocking group also removes the detectable label of the incorporated nucleotides.

Post-Incorporation Labelins SBS Workflow

[0110] Another aspect of the present disclosure relates to a method of determining the sequences of a plurality of different target polynucleotides, comprising:

(b) contacting the solid support with an aqueous incorporation mixture comprising DNA polymerase and one or more of four different types of nucleotides (A, G, C, and T or U; dATP, dGTP, dCTP and dTTP or dUTP) under conditions suitable for DNA polymerase-mediated primer extension, and incorporating one type of nucleotides into the sequencing primers to produce extended copy polynucleotides, wherein at least two types of nucleotides are unlabeled and at least one type of unlabeled nucleotide is a unlabeled modified adenosine nucleotide as described herein having a first functional moiety, and wherein each of the one or more of four different type of nucleotides comprises a 3' blocking group covalently attached to the 2' deoxyribose of the nucleotide;

(c) contacting the extended copy polynucleotides with an aqueous labeling mixture comprising a first labeling reagent, wherein the first labeling reagent comprises one or more first detectable labels and a first binding moiety that is capable of specific binding to the first functional moiety of the unlabeled nucleotide; (d) imaging the solid support and performing one or more fluorescent measurements to determine the identity of incorporated nucleotides;

[0111] In some embodiments, steps (b) to (f) are repeated at least 50, 100, 150, 200, 250, 300, 350, 400, 450 or 500 cycles. In some embodiments, the first functional moiety of the first type of unlabeled nucleotide is bound to the first labeling reagent by either covalent bonding or noncovalent interaction via a cleavable linker. In some embodiments, the removal of the 3' blocking group also removes the first labeling reagent. In some embodiments, the method is performed on an automated sequencing instrument comprising a single light source. In further embodiments, the single light source operates at a wavelength from about 500 nm to about 540 nm, or about 520 nm to about 525 nm. In some embodiments, the method is performed on an automated sequencing instrument comprising two light sources operating at two different wavelengths. In further embodiments, one light source operates at a wavelength from about 450 nm to about 460 nm, and the other light source operates at a wavelength from about 520 nm to about 525 nm.

[0112] In some embodiments, step (c) is performed under conditions where the first labeling reagent binds specifically to the incorporated unlabeled first type of nucleotides to provide labeled extended copy polynucleotides. In some embodiments, the first functional moiety of the first type of unlabeled nucleotide is bound to the first labeling reagent by covalent bonding, optionally via the cleavable linker as described herein. In some other embodiments, the first functional moiety of the first type of unlabeled nucleotide is bound to the first labeling reagent by noncovalent interaction, optionally via the cleavable linker as described herein.

[0113] In some embodiments, each of the four types of nucleotides in the aqueous incorporation mixture is unlabeled, the second type of unlabeled nucleotides comprises a second functional moiety, wherein the aqueous labeling mixture comprises a second labeling reagent, and the second labeling reagent comprises one or more second detectable labels and a second binding moiety that is capable of specific binding to the second functional moiety of the second type of unlabeled nucleotides. In some such embodiments, step (c) is performed under conditions where the first labeling reagent binds specifically to the incorporated unlabeled first type of nucleotides, and the second labeling reagent binds specifically to the incorporated unlabeled second type of nucleotides to provide labeled extended copy polynucleotides. In some such embodiments, the first functional moiety of the first type of unlabeled nucleotide is bound to the first labeling reagent by covalent bonding, optionally via a cleavable linker as described herein. In some other embodiments, the second functional moiety of the first type of unlabeled nucleotide is bound to the second labeling reagent by noncovalent interaction, optionally via the cleavable linker as described herein. In some such embodiments, the third type of unlabeled nucleotides comprises a third functional moiety, wherein the aqueous labeling mixture comprises a third labeling reagent, and the third labeling reagent comprises one or more third detectable labels and a third binding moiety that is capable of specific binding to the third functional moiety of the third type of unlabeled nucleotides. In some such embodiments, step (c) is performed under conditions where the first labeling reagent binds specifically to the incorporated unlabeled first type of nucleotides, the second labeling reagent binds specifically to the incorporated unlabeled second type of nucleotides, and the third labeling reagent binds specifically to the incorporated unlabeled third type of nucleotides to provide labeled extended copy polynucleotides. In some other embodiments, the third type of unlabeled nucleotide comprises a mixture of the third type of unlabeled nucleotides comprising the first functional moiety and the third type of unlabeled nucleotides comprising the second functional moiety, and wherein both the first labeling reagent and the second labeling reagent are capable of specific binding to the third type of unlabeled nucleotides. In further embodiments, the fourth type of unlabeled nucleotides is not capable of specific binding with any of the first, second, or third labeling reagent. In some embodiments of the any of the sequencing methods described herein, each type of nucleotides has a terminally modified triphosphate group as described herein.

[0114] In any embodiments of the sequencing methods described herein, the solid support comprises at least 500,000, 1,000,000, 2,000,000, 3,000,000, 4,000,000, or 5,000,000 spatially distinguishable sites/cm² that comprise multiple copies of target polynucleotides.

Incorporation Mix

[0115] In some embodiments of the method described herein, step (b) is also referred to as the incorporation step, includes contacting a mixture containing one or more nucleotides (e.g., dATP, dCTP, dGTP, and dTTP or dUTP) with a copy polynucleotide/target polynucleotide complex in an incorporation solution comprising a polymerase and one or more buffering agents. In some such embodiments, the polymerase is a DNA polymerase, such as a mutant of 9°N polymerase (e.g., those disclosed in WO 2005/024010, which is incorporated by reference), for example, Pol 812, Pol 1901, Pol 1558 or Pol 963. The amino acid sequences of Pol 812, Pol 1901, Pol 1558 or Pol 963 DNA polymerases are described, for example, in U.S. Patent Publication Nos. 2020/0131484 Al and 2020/0181587 Al, both of which are incorporated by reference herein. Additional polymerases that may be used in the method include those disclosed in U.S. Ser. Nos. 63/412,241 and 63/433,971, both of which are incorporated by reference. In some embodiments, the one or more buffering agents comprise a primary amine, a secondary amine, a tertiary amine, a natural amino acid, or a non-natural amino acid, or combinations thereof. In further embodiments, the buffering agents comprise ethanolamine or glycine, or a combination thereof. In one embodiment, the buffer agent comprises or is glycine. In further embodiments, the mutant of 9°N polymerase may be engineered for high efficient incorporation of the nucleotide in accordance with the present disclosure.

Cleavage Mix

[0116] In some embodiments of the method described herein, step (d) of the standard SBS or step (e) of the post-incorporation labeling SBS, also referred to as the cleaving step, includes contacting the incorporated nucleotide and the copy polynucleotide strand with a cleavage solution comprising a catalyst (e.g., a Pd(II) catalyst or a glycoside hydrolase or glycosidase such as an arabinofuranosidase, a glucosidase, a mannosidase, a xylosidase, a galactosidase, an N-acetyl-glucosaminidase, or a glucuronidase). In some embodiments, the cleavage solution comprises a catalyst capable of cleaving the linker group in accordance with the present disclosure. In further embodiments, the 3'-OH blocking group and the detectable label are removed in a single step of reaction.

Embodiments and Alternatives of Sequencing-By-Synthesis

[0117] Alternatively, the sequencing methods described herein may also be carried out using unlabeled nucleotides and affinity reagents containing a fluorescent dye described herein. For example, one, two, three or each of the four different types of nucleotides (e.g., dATP, dCTP, dGTP and dTTP or dUTP) in the incorporation mixture of step (1) may be unlabeled. Each of the four types of nucleotides (e.g., dNTPs) has a 3' blocking group to ensure that only a single base can be added by a polymerase to the 3' end of the primer polynucleotide. After incorporation of an unlabeled nucleotide, the remaining unincorporated nucleotides are washed away. An affinity reagent is then introduced that specifically recognizes and binds to the incorporated dNTP to provide a labeled extension product comprising the incorporated dNTP. Uses of unlabeled nucleotides and affinity reagents in sequencing by synthesis have been disclosed in WO 2018/129214 and WO 2020/097607. A modified sequencing method of the present disclosure using unlabeled nucleotides may include the following steps:

(1) contacting a solid support with sequencing primers under hybridization conditions, wherein the solid support comprises a plurality of target polynucleotides immobilized thereon; and the sequencing primers are complementary to at least a portion of the target polynucleotides;

(2) contacting the solid support with an aqueous solution comprising DNA polymerase and one or more of four different types of unlabeled nucleotides (A, G, C and T or U) under conditions suitable for DNA polymerase- mediated primer extension, wherein each of the nucleotides comprises a 3' blocking group and at least one type of nucleotide comprising a blocking group as described herein;

(3) contacting the extended copy polynucleotides with a set of affinity reagents under conditions wherein one affinity reagent binds specifically to the incorporated unlabeled nucleotides to provide labeled extended copy polynucleotides;

(4) imaging the solid support to determine the identity of the incorporated nucleotides (e.g., by performing one or more fluorescent measurements of the extended copy polynucleotides); and

(5) contacting the solid support with an aqueous deblocking solution comprising a palladium catalyst and tris(hydroxyalkyl)phosphine under conditions suitable to chemically remove 3' allyl blocking groups from incorporated nucleotides to expose a 3’- OH group for further nucleotide incorporation on the solid support; and

(6) repeating steps (2)-(5) to determine target polynucleotide sequences.

[0118] In some embodiments of the modified sequencing method described herein, the method further comprises removing the affinity reagents from the incorporated nucleotides. In still further embodiments, the 3' blocking group and the affinity reagent are removed in the same reaction. In some embodiments, the method further comprises a step (6) washing the solid support with a third aqueous wash solution. In further embodiments, steps (2) through (6) are repeated at least 50, 100, 150, 200, 250, or 300 cycles to determine the target polynucleotide sequences. In some embodiments, the set of affinity reagents may comprise a first affinity reagent that binds specifically to the first type of nucleotide, a second affinity reagent that binds specifically to the second type of nucleotide, and a third affinity reagent that binds specifically to the third type of nucleotide. In some further embodiments, each of the first, second and the third affinity reagents comprises a detectable labeled that is spectrally distinguishable. In some embodiments, the affinity reagents may include protein tags, antibodies (including but not limited to binding fragments of antibodies, single chain antibodies, bispecific antibodies, and the like), aptamers, knottins, affimers, or any other known agent that binds an incorporated nucleotide with a suitable specificity and affinity. In one embodiment, at least one affinity reagent is an antibody or a protein tag. In another embodiment, at least one of the first type, the second type, and the third type of affinity reagents is an antibody or a protein tag comprising one or more detectable labels (e.g., multiple copies of the same detectable label), wherein the detectable label is or comprises a bis-boron dye moiety described herein.

[0119] Some embodiments include pyrosequencing techniques. Pyrosequencing detects the release of inorganic pyrophosphate (PPi) as particular nucleotides are incorporated into the nascent strand (Ronaghi, M., Karamohamed, S., Pettersson, B., Uhlen, M. and Nyren, P. (1996) “Real-time DNA sequencing using detection of pyrophosphate release.” Analytical Biochemistry 242(1), 84-9; Ronaghi, M. (2001) “Pyrosequencing sheds light on DNA sequencing.” Genome Res. 11(1), 3-11; Ronaghi, M., Uhlen, M. and Nyren, P. (1998) “A sequencing method based on real-time pyrophosphate.” Science 281(5375), 363; U.S. Pat. Nos. 6,210,891; 6,258,568 and 6,274,320, the disclosures of which are incorporated herein by reference in their entireties). In pyrosequencing, released PPi can be detected by being immediately converted to adenosine triphosphate (ATP) by ATP sulfurase, and the level of ATP generated is detected via luciferase-produced photons. The nucleic acids to be sequenced can be attached to features in an array and the array can be imaged to capture the chemiluminescent signals that are produced due to incorporation of a nucleotides at the features of the array. An image can be obtained after the array is treated with a particular nucleotide type (e.g., A, T, C or G). Images obtained after addition of each nucleotide type will differ with regard to which features in the array are detected. These differences in the image reflect the different sequence content of the features on the array. However, the relative locations of each feature will remain unchanged in the images. The images can be stored, processed and analyzed using the methods set forth herein. For example, images obtained after treatment of the array with each different nucleotide type can be handled in the same way as exemplified herein for images obtained from different detection channels for reversible terminator-based sequencing methods.

[0120] In another exemplary type of SBS, cycle sequencing is accomplished by stepwise addition of reversible terminator nucleotides containing, for example, a cleavable or photobleachable dye label as described, for example, in WO 04/018497 and U.S. Pat. No. 7,057,026, the disclosures of which are incorporated herein by reference. This approach is being commercialized by Solexa (now Illumina, Inc.), and is also described in WO 91/06678 and WO 07/123,744, each of which is incorporated herein by reference. The availability of fluorescently- labeled terminators in which both the termination can be reversed, and the fluorescent label cleaved facilitates efficient cyclic reversible termination (CRT) sequencing. Polymerases can also be co-engineered to efficiently incorporate and extend from these modified nucleotides.

[0121] Preferably in reversible terminator-based sequencing embodiments, the labels do not substantially inhibit extension under SBS reaction conditions. However, the detection labels can be removable, for example, by cleavage or degradation. Images can be captured following incorporation of labels into arrayed nucleic acid features. In particular embodiments, each cycle involves simultaneous delivery of four different nucleotide types to the array and each nucleotide type has a spectrally distinct label. Four images can then be obtained, each using a detection channel that is selective for one of the four different labels. Alternatively, different nucleotide types can be added sequentially, and an image of the array can be obtained between each addition step. In such embodiments each image will show nucleic acid features that have incorporated nucleotides of a particular type. Different features will be present or absent in the different images due the different sequence content of each feature. However, the relative position of the features will remain unchanged in the images. Images obtained from such reversible terminator-SBS methods can be stored, processed and analyzed as set forth herein. Following the image capture step, labels can be removed, and reversible terminator moieties can be removed for subsequent cycles of nucleotide addition and detection. Removal of the labels after they have been detected in a particular cycle and prior to a subsequent cycle can provide the advantage of reducing background signal and crosstalk between cycles. Examples of useful labels and removal methods are set forth below.

[0122] Some embodiments can utilize detection of four different nucleotides using fewer than four different labels. For example, SBS can be performed utilizing methods and systems described in the incorporated materials of U.S. Pub. No. 2013/0079232. As a first example, a pair of nucleotide types can be detected at the same wavelength, but distinguished based on a difference in intensity for one member of the pair compared to the other, or based on a change to one member of the pair (e.g., via chemical modification, photochemical modification or physical modification) that causes apparent signal to appear or disappear compared to the signal detected for the other member of the pair. As a second example, three of four different nucleotide types can be detected under particular conditions while a fourth nucleotide type lacks a label that is detectable under those conditions, or is minimally detected under those conditions (e.g. , minimal detection due to background fluorescence, etc.). Incorporation of the first three nucleotide types into a nucleic acid can be determined based on presence of their respective signals and incorporation of the fourth nucleotide type into the nucleic acid can be determined based on absence or minimal detection of any signal. As a third example, one nucleotide type can include label(s) that are detected in two different channels, whereas other nucleotide types are detected in no more than one of the channels. The aforementioned three exemplary configurations are not considered mutually exclusive and can be used in various combinations. An exemplary embodiment that combines all three examples, is a fluorescent-based SBS method that uses a first nucleotide type that is detected in a first channel (e.g., dATP having a label that is detected in the first channel when excited by a first excitation wavelength), a second nucleotide type that is detected in a second channel (e.g., dCTP having a label that is detected in the second channel when excited by a second excitation wavelength), a third nucleotide type that is detected in both the first and the second channel (e.g., dTTP having at least one label that is detected in both channels when excited by the first and/or second excitation wavelength) and a fourth nucleotide type that lacks a label that is not, or minimally, detected in either channel (e.g., dGTP having no label).

[0123] Further, as described in the incorporated materials of U.S. Pub. No. 2013/0079232, sequencing data can be obtained using a single channel. In such so-called one-dye sequencing approaches, the first nucleotide type is labeled but the label is removed after the first image is generated, and the second nucleotide type is labeled only after a first image is generated. The third nucleotide type retains its label in both the first and second images, and the fourth nucleotide type remains unlabeled in both images.

[0124] Some embodiments can utilize sequencing by ligation techniques. Such techniques utilize DNA ligase to incorporate oligonucleotides and identify the incorporation of such oligonucleotides. The oligonucleotides typically have different labels that are correlated with the identity of a particular nucleotide in a sequence to which the oligonucleotides hybridize. As with other SBS methods, images can be obtained following treatment of an array of nucleic acid features with the labeled sequencing reagents. Each image will show nucleic acid features that have incorporated labels of a particular type. Different features will be present or absent in the different images due the different sequence content of each feature, but the relative position of the features will remain unchanged in the images. Images obtained from ligation-based sequencing methods can be stored, processed and analyzed as set forth herein. Exemplary SBS systems and methods which can be utilized with the methods and systems described herein are described in U.S. Pat. Nos. 6,969,488, 6,172,218, and 6,306,597, the disclosures of which are incorporated herein by reference in their entireties.

[0125] Some embodiments can utilize nanopore sequencing (Deamer, D. W. & Akeson, M. “Nanopores and nucleic acids: prospects for ultrarapid sequencing.” Trends Biotechnol. 18, 147-151 (2000); Deamer, D. and D. Branton, “Characterization of nucleic acids by nanopore analysis”, Ace. Chem. Res. 35:817-825 (2002); Li, J., M. Gershow, D. Stein, E. Brandin, and J. A. Golovchenko, “DNA molecules and configurations in a solid-state nanopore microscope” Nat. Mater. 2:611-615 (2003), the disclosures of which are incorporated herein by reference in their entireties). In such embodiments, the target nucleic acid passes through a nanopore. The nanopore can be a synthetic pore or biological membrane protein, such as a- hemolysin. As the target nucleic acid passes through the nanopore, each base-pair can be identified by measuring fluctuations in the electrical conductance of the pore. (U.S. Pat. No. 7,001,792; Soni, G. V. & Meller, A. “Progress toward ultrafast DNA sequencing using solid-state nanopores.” Clin. Chem. 53, 1996-2001 (2007); Healy, K. “Nanopore-based single-molecule DNA analysis.” Nanomed. 2, 459-481 (2007); Cockroft, S. L., Chu, J., Amorin, M. & Ghadiri, M. R. “A singlemolecule nanopore device detects DNA polymerase activity with single-nucleotide resolution.” J. Am. Chem. Soc. 130, 818-820 (2008), the disclosures of which are incorporated herein by reference in their entireties). Data obtained from nanopore sequencing can be stored, processed and analyzed as set forth herein. In particular, the data can be treated as an image in accordance with the exemplary treatment of optical images and other images that is set forth herein.

[0126] Some other embodiments of sequencing methods involve the use the 3' blocked nucleotide described herein in nanoball sequencing technique, such as those described in U.S. Patent No. 9,222,132, the disclosure of which is incorporated by reference. Through the process of rolling circle amplification (RCA), a large number of discrete DNA nanoballs may be generated. The nanoball mixture is then distributed onto a patterned slide surface containing features that allow a single nanoball to associate with each location. In DNA nanoball generation, DNA is fragmented and ligated to the first of four adapter sequences. The template is amplified, circularized and cleaved with a type II endonuclease. A second set of adapters is added, followed by amplification, circularization and cleavage. This process is repeated for the remaining two adapters. The final product is a circular template with four adapters, each separated by a template sequence. Library molecules undergo a rolling circle amplification step, generating a large mass of concatemers called DNA nanoballs, which are then deposited on a flow cell. Goodwin et al., “Coming of age: ten years of next-generation sequencing technologies,” Nat Rev Genet. 2016; 17(6):333-51.

[0127] Some embodiments can utilize methods involving the real-time monitoring of DNA polymerase activity. Nucleotide incorporations can be detected through fluorescence resonance energy transfer (FRET) interactions between a fluorophore-bearing polymerase and y- phosphate-labeled nucleotides as described, for example, in U.S. Pat. Nos. 7,329,492 and 7,211,414, both of which are incorporated herein by reference, or nucleotide incorporations can be detected with zero-mode waveguides as described, for example, in U.S. Pat. No. 7,315,019, which is incorporated herein by reference, and using fluorescent nucleotide analogs and engineered polymerases as described, for example, in U.S. Pat. No. 7,405,281 and U.S. Pub. No. 2008/0108082, both of which are incorporated herein by reference. The illumination can be restricted to a zeptoliter-scale volume around a surface-tethered polymerase such that incorporation of fluorescently labeled nucleotides can be observed with low background (Levene, M. J. et al. “Zero-mode waveguides for single-molecule analysis at high concentrations.” Science 299, 682-686 (2003); Lundquist, P. M. et al. “Parallel confocal detection of single molecules in real time.” Opt. Lett. 33, 1026-1028 (2008); Korlach, J. etal. “Selective aluminum passivation for targeted immobilization of single DNA polymerase molecules in zero-mode waveguide nano structures.” Proc. Natl. Acad. Sci. USA 105, 1176-1181 (2008), the disclosures of which are incorporated herein by reference in their entireties). Images obtained from such methods can be stored, processed and analyzed as set forth herein.

[0128] Some SBS embodiments include detection of a proton released upon incorporation of a nucleotide into an extension product. For example, sequencing based on detection of released protons can use an electrical detector and associated techniques that are commercially available from Ion Torrent (Guilford, CT, a Life Technologies subsidiary) or sequencing methods and systems described in U.S. Pub. Nos. 2009/0026082; 2009/0127589; 2010/0137143; and 2010/0282617, all of which are incorporated herein by reference. Methods set forth herein for amplifying target nucleic acids using kinetic exclusion can be readily applied to substrates used for detecting protons. More specifically, methods set forth herein can be used to produce clonal populations of amplicons that are used to detect protons.

[0129] The above SBS methods can be advantageously carried out in multiplex formats such that multiple different target nucleic acids are manipulated simultaneously. In particular embodiments, different target nucleic acids can be treated in a common reaction vessel or on a surface of a particular substrate. This allows convenient delivery of sequencing reagents, removal of unreacted reagents and detection of incorporation events in a multiplex manner. In embodiments using surface-bound target nucleic acids, the target nucleic acids can be in an array format. In an array format, the target nucleic acids can be typically bound to a surface in a spatially distinguishable manner. The target nucleic acids can be bound by direct covalent attachment, attachment to a bead or other particle or binding to a polymerase or other molecule that is attached to the surface. The array can include a single copy of a target nucleic acid at each site (also referred to as a feature) or multiple copies having the same sequence can be present at each site or feature. Multiple copies can be produced by amplification methods such as, bridge amplification or emulsion PCR as described in further detail below.

[0130] The methods set forth herein can use arrays having features at any of a variety of densities including, for example, at least about 10 features/cm², 100 features/cm², 500 features/cm², 1,000 features/cm², 5,000 features/cm², 10,000 features/cm², 50,000 features/cm², 100,000 features/cm², 1,000,000 features/cm², 5,000,000 features/cm², or higher.

[0131] An advantage of the methods set forth herein is that they provide for rapid and efficient detection of a plurality of target nucleic acid in parallel. Accordingly, the present disclosure provides integrated systems capable of preparing and detecting nucleic acids using techniques known in the art such as those exemplified above. Thus, an integrated system of the present disclosure can include fluidic components capable of delivering amplification reagents and/or sequencing reagents to one or more immobilized DNA fragments, the system comprising components such as pumps, valves, reservoirs, fluidic lines and the like. A flow cell can be configured and/or used in an integrated system for detection of target nucleic acids. Exemplary flow cells are described, for example, in U.S. Pub. No. 2010/0111768 and US Ser. No. 13/273,666, each of which is incorporated herein by reference. As exemplified for flow cells, one or more of the fluidic components of an integrated system can be used for an amplification method and for a detection method. Taking a nucleic acid sequencing embodiment as an example, one or more of the fluidic components of an integrated system can be used for an amplification method set forth herein and for the delivery of sequencing reagents in a sequencing method such as those exemplified above. Alternatively, an integrated system can include separate fluidic systems to carry out amplification methods and to carry out detection methods. Examples of integrated sequencing systems that are capable of creating amplified nucleic acids and also determining the sequence of the nucleic acids include, without limitation, the MiSeq™ platform (Illumina, Inc., San Diego, CA) and devices described in US Ser. No. 13/273,666, which is incorporated herein by reference.

[0132] Arrays in which polynucleotides have been directly attached to silica-based supports are those for example disclosed in WO 00/06770 (incorporated herein by reference), wherein polynucleotides are immobilized on a glass support by reaction between a pendant epoxide group on the glass with an internal amino group on the polynucleotide. In addition, polynucleotides can be attached to a solid support by reaction of a sulfur-based nucleophile with the solid support, for example, as described in WO 2005/047301 (incorporated herein by reference). A still further example of solid-supported template polynucleotides is where the template polynucleotides are attached to hydrogel supported upon silica-based or other solid supports, for example, as described in WO 00/31148, WO 01/01143, WO 02/12566, WO 03/014392, U.S. Pat. No. 6,465,178 and WO 00/53812, each of which is incorporated herein by reference.

[0133] A particular surface to which template polynucleotides may be immobilized is a polyacrylamide hydrogel. Polyacrylamide hydrogels are described in the references cited above and in WO 2005/065814, which is incorporated herein by reference. Specific hydrogels that may be used include those described in WO 2005/065814 and U.S. Pub. No. 2014/0079923. In one embodiment, the hydrogel is PAZAM (poly(N-(5-azidoacetamidylpentyl) acrylamide-co- acrylamide)).

[0134] DNA template molecules can be attached to beads or microparticles, for example, as described in U.S. Pat. No. 6,172,218 (which is incorporated herein by reference). Attachment to beads or microparticles can be useful for sequencing applications. Bead libraries can be prepared where each bead contains different DNA sequences. Exemplary libraries and methods for their creation are described in Nature, 437, 376-380 (2005); Science, 309, 5741, 1728- 1732 (2005), each of which is incorporated herein by reference. Sequencing of arrays of such beads using nucleotides set forth herein is within the scope of the disclosure.

[0135] Templates that are to be sequenced may form part of an “array” on a solid support, in which case the array may take any convenient form. Thus, the method of the disclosure is applicable to all types of high-density arrays, including single-molecule arrays, clustered arrays, and bead arrays. Labeled nucleotides of the present disclosure may be used for sequencing templates on essentially any type of array, including but not limited to those formed by immobilization of nucleic acid molecules on a solid support.

[0136] However, labeled nucleotides of the disclosure are particularly advantageous in the context of sequencing of clustered arrays. In clustered arrays, distinct regions on the array (often referred to as sites, or features) comprise multiple polynucleotide template molecules. Generally, the multiple polynucleotide molecules are not individually resolvable by optical means and are instead detected as an ensemble. Depending on how the array is formed, each site on the array may comprise multiple copies of one individual polynucleotide molecule (e.g., the site is homogenous for a particular single- or double-stranded nucleic acid species) or even multiple copies of a small number of different polynucleotide molecules (e.g., multiple copies of two different nucleic acid species). Clustered arrays of nucleic acid molecules may be produced using techniques generally known in the art. By way of example, WO 98/44151 and WO 00/18957, each of which is incorporated herein, describe methods of amplification of nucleic acids wherein both the template and amplification products remain immobilized on a solid support in order to form arrays comprised of clusters or “colonies” of immobilized nucleic acid molecules. The nucleic acid molecules present on the clustered arrays prepared according to these methods are suitable templates for sequencing using the nucleotides labeled with dye compounds of the disclosure.

[0137] The labeled nucleotides of the present disclosure are also useful in sequencing of templates on single molecule arrays. The term “single molecule array” or “SMA” as used herein refers to a population of polynucleotide molecules, distributed (or arrayed) over a solid support, wherein the spacing of any individual polynucleotide from all others of the population is such that it is possible to individually resolve the individual polynucleotide molecules. The target nucleic acid molecules immobilized onto the surface of the solid support can thus be capable of being resolved by optical means in some embodiments. This means that one or more distinct signals, each representing one polynucleotide, will occur within the resolvable area of the particular imaging device used.

[0138] Single molecule detection may be achieved wherein the spacing between adjacent polynucleotide molecules on an array is at least 100 nm, more particularly at least 250 nm, still more particularly at least 300 nm, even more particularly at least 350 nm. Thus, each molecule is individually resolvable and detectable as a single molecule fluorescent point, and fluorescence from said single molecule fluorescent point also exhibits single step photobleaching.

[0139] The terms “individually resolved” and “individual resolution” are used herein to specify that, when visualized, it is possible to distinguish one molecule on the array from its neighboring molecules. Separation between individual molecules on the array will be determined, in part, by the particular technique used to resolve the individual molecules. The general features of single molecule arrays will be understood by reference to published applications WO 00/06770 and WO 01/57248, each of which is incorporated herein by reference. Although one use of the nucleotides of the disclosure is in sequencing-by-synthesis reactions, the utility of the nucleotides is not limited to such methods. In fact, the nucleotides may be used advantageously in any sequencing methodology which requires detection of fluorescent labels attached to nucleotides incorporated into a polynucleotide.

[0140] In particular, the labeled nucleotides of the disclosure may be used in automated fluorescent sequencing protocols, particularly fluorescent dye-terminator cycle sequencing based on the chain termination sequencing method of Sanger and co-workers. Such methods generally use enzymes and cycle sequencing to incorporate fluorescently labeled dideoxynucleotides in a primer extension sequencing reaction. So-called Sanger sequencing methods, and related protocols (Sanger-type), utilize randomized chain termination with labeled dideoxynucleo tides .

[0141] Thus, the present disclosure also encompasses labeled nucleotides which are dideoxynucleotides lacking hydroxyl groups at both of the 3' and 2' positions, such dideoxynucleotides being suitable for use in Sanger type sequencing methods and the like.

[0142] Labeled nucleotides of the present disclosure incorporating 3' blocking groups, it will be recognized, may also be of utility in Sanger methods and related protocols since the same effect achieved by using dideoxy nucleotides may be achieved by using nucleotides having 3'-OH blocking groups: both prevent incorporation of subsequent nucleotides. Where nucleotides according to the present disclosure, and having a 3' blocking group are to be used in Sanger-type sequencing methods it will be appreciated that the dye compounds or detectable labels attached to the nucleotides need not be connected via cleavable linkers, since in each instance where a labeled nucleotide of the disclosure is incorporated; no nucleotides need to be subsequently incorporated and thus the label need not be removed from the nucleotide.

EXAMPLES

[0143] Additional embodiments are disclosed in further detail in the following examples, which are not in any way intended to limit the scope of the claims.

Example 1. Synthesis of modified adenosine nucleosides

Synthesis of 7-deaza-7-(ethylacryloyl)-2’-deoxyadenosine (Compound II-l)

[0144] Firstly, the catalyst solution was prepared by dissolving 2-amino-4,6- dihydroxypyrimidine (52 mg, 0.4 mmol) in 3 mL of water and 200 pL of 4 M NaOH solution. Then Pd(OAc)2 (44 mg, 0.2 mmol) was added and the suspension was heated to 65 °C for 2 hours. After this time a clear brown-pink solution was formed, then 800 pL of water were added to obtain a 50 mM stock of catalyst. 7-Iodo-7-deaza-2’ deoxy adenosine (88 mg, 0.234 mmol) was suspended in 3 mL of acetonitrile. Ethyl 3-(4,4,5,5,-tetramethyl-l,3,2-dioxaborolan-2-yl)acrylate (90 mg, 0.4 mmol) was added then the suspension was degassed by bubbling N2 gas for 1 hour. Sodium carbonate (50 mg, 0.468 mmol) dissolved in 2 mL of water was added, followed by 530 pL of the palladium catalyst solution. The mixture was bubbled with N2 gas for further 5 minutes, then it was heated to 60 °C. After 18 hours, 1.5 eq. of ethyl 3-(4,4,5,5-tetramethyl-l,3,2- dioxaborolan-2-yl)acrylate and 530 pL of Pd catalyst solution were added and the reaction was stirred at 65 °C for further 2 hours. The crude was cooled, evaporated to dryness under reduced pressure and purified by flash chromatography on silica gel. Yield: 44 mg (0.126 mmol), 54%. 'H NMR (400 MHz, A -DMSO): 8 (ppm) 8.13 (s, 1H, H-2), 8.12 (s, 1H, H-8), 7.94 (dd, J = 15.6, 0.7 Hz, 1H, CH=COO ), 6.86 (s, 2H, NH₂), 6.52 (dd, J = 7.9, 6.0 Hz, 1H, I’-CH), 6.42 (d, J = 15.7 Hz, 1H, Ar-CH=), 5.27 (d, J = 4.1 Hz, 1H, 3’-OH), 5.04 (t, J = 5.7 Hz, 1H, 5’-OH), 4.36 (m, 1H, 3’-CH), 4.16 (q, J = 7.0 Hz, 2H, CH₂ Et), 3.83 (td, J = 4.6, 2.6 Hz, 1H, 4’-CH), 3.83 (td, J = 4.6, 2.6 Hz, 1H), 3.60 (dt, J = 11.7, 5.1 Hz, 1H, 5’-C/7H), 3.52 (ddd, J = 11.7, 6.0, 4.6 Hz, 1H, 5’- CHH), 2.20 (ddd, J = 13.1, 6.0, 2.9 Hz, 1H, 2’-C7/H), 1.26 (t, J = 7.1 Hz, 3H, CH₃ Et). LC-MS (ESI): (positive ion) m/z 349 (M+H⁺), 390 (M+H⁺ +CH3CN); (negative ion) m/z 347 (M-H⁺). Synthesis of 4-(trifluoracetamidomethyl)phenylboronic acid pinacol ester

[0145] 4-(Aminomethyl)phenylboronic acid pinacol ester hydrochloride (1 g, 3.71 mmol) was suspended in anhydrous THF/DCM 1:2. Triethylamine was added, followed by methyl trifluoroacetate (410 pL, 4.08 mmol). The suspension was stirred for 3 hours at room temperature, then filtered on a plug of silica gel. The silica was washed with -400 mL of petroleum/ethyl acetate 1:1. The filtrate was evaporated to dryness under reduced pressure to afford the title product in quantitative yield (1.23 g, 3.7 mmol). H NMR (400 MHz, ₆ -DMSO): 5 (ppm) 10.02 (t, J = 6.0 Hz, 1H, NH), 7.67 (d, J = 8.0 Hz, 2H, Ar), 7.29 (d, J = 8.1 Hz, 2H, Ar), 4.41 (d, J = 5.9 Hz, 2H, CH2), 1.29 (s, 12H, CH3). LC-MS (ESI): (positive ion) m/z 347 (M+H30⁺); (negative ion) m/z 328 (M-H⁺).

Synthesis of 7-deaza-7-(4-(trifluoroacetamidomethyl)benzyl)-2’-deoxyadenosine (Compound I- 11

[0146] 7-Iodo-7-deaza-2’ deoxyadenosine (100 mg, 0.266 mmol), 4-

(trifluoracetamidomethyl)phenylboronic acid pinacol ester (262 mg, 0.8 mmol) and sodium carbonate (42 mg, 0.4 mmol) were suspended in 9 mL of acetonitrile/water 1:2. The solution was degassed by bubbling N2 gas for 10 minutes. Tris(3,3',3"- phosphinidynetris(benzenesulfonato)palladium(0) nonasodium salt nonahydrate (51 mg, 0.026 mmol) was added and the suspension was degassed for further 5 minutes, then heated to 80 °C. Upon completion, the reaction was cooled down in water/ice bath and the volatiles were removed under reduced pressure. The crude was purified by flash chromatography on silica gel. Yield: 46 mg (0.102 mmol). ^]H NMR (400 MHz, d₆-DMSO): 5 (ppm) 10.03 (t, J = 6.0 Hz, 1H, NH-COCF3), 8.15 (s, 1H, H-2), 7.52 (s, 1H, H-8), TAI (d, J = 8.2 Hz, 2H, Ar), 7.39 (d, J = 8.2 Hz, 2H, Ar), 6.58 (dd, J = 8.2, 5.9 Hz, 1H, l’-CH), 6.11 (s, 2H, NH₂), 5.26 (d, J = 4.1 Hz, 1H, 3’-OH), 5.04 (t, J = 5.6 Hz, 1H, 5’-OH), 4.44 (d, J = 5.8 Hz, 2H, CH2-NH-COCF3), 4.36 (m, 1H, 3’-OH), 3.83 (td, J = 4.4, 2.4 Hz, 1H, 4’-CH), 3.57 (dt, J = 11.7, 4.9 Hz, 1H, 5’-CHH), 3.51 (ddd, J = 11.7, 5.9, 4.3 Hz, 1H, 5’-C//H), 2.63 - 2.53 (m, 1H, 2’-CHH), 2.20 (ddd, J = 13.1, 6.0, 2.6 Hz, 1H, 2-CH77). ¹⁹F NMR (376 MHz, -DMSO): 5 (ppm) -74.23. LC-MS (ESI): (positive ion) m/z 452 (M+H⁺); (negative ion) m/z 450 (M-H⁺).

Synthesis of 7-deaza-7-(4-methoxycarbonylphenyl)-2’-deoxyadenosine (Compound 1-2)

[0147] 7-Iodo-7-deaza- mg, 0.25 mmol), 4- methoxycarbonyphenylboronic acid pinacol ester (196 mg, 0.75 mmol) and sodium carbonate (40 mg, 0.374 mmol) were suspended in 9 mL of acetonitrile/water 1:2. The solution was degassed by bubbling N2 gas for 5 minutes. Tris(3,3',3''-phosphinidynetris(benzenesulfonato)palladium(0) nonasodium salt nonahydrate (49 mg, 0.025 mmol) was added and the suspension was degassed for further 5 minutes, then heated to 80 °C. After 3.5 hours, the reaction was cooled down and the volatiles were removed under reduced pressure. The crude was purified by flash chromatography on silica gel. Yield: 34 mg (0.088 mmol). ^]H NMR (400 MHz, de -DMSO): 5 (ppm) 8.17 (s, 1H, H-2), 8.08 - 8.01 (m, 2H, Ar), 7.68 (s, 1H, H-8), 7.66 - 7.58 (m, 2H, Ar), 6.60 (dd, J = 8.2, 5.9 Hz, 1H, l’-CH), 6.28 - 6.22 (br s, 2H,NH₂), 5.27 (d, J = 4.1 Hz, 1H. 3’-OH), 5.04 (t, J = 5.6 Hz, 1H, 5’-OH), 4.36 (m, 1H, 3’-CH), 3.88 (s, 3H, CH₃), 3.84 (td, J = 4.4, 2.5 Hz, 1H, 4’-CH), 3.59 (dt, J = 11.7, 4.9 Hz, 1H, 5’-C//H), 3.51 (ddd, J = 11.7, 5.9, 4.4 Hz, 1H, 5’-CHH), 2.57 (ddd, J = 13.6, 8.2, 5.8 Hz, 1H, 2’-Cffii), 2.21 (ddd, J = 13.1, 6.0, 2.7 Hz, 1H, 2’-CHfl). LC-MS (ESI): (positive ion) m/z 385 (M+H⁺); (negative ion) m/z 383 (M-H⁺).

Synthesis of 8-aza-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-l-enyl]-2’-deoxyadenosine

(Compound II-2)

[0148] In a 50-mL round-bottomed flask, 7-iodo-7-deaza-8-aza-2’-deoxyadenosine (0.070 g, 0.18 mmol) was suspended in 30 mL of a 1:1 mixture of water/acetonitrile at room temperature, and 3-(2,2,2-trifluoroacetamido)-prop-l-enylboronic acid pinacol ester (0.104 g, 0.37 mmol) was added to the stirring solution, along with A,A^r-di isopropyl ethyl amine (0.1 mL, 0.56 mmol) and Pd(OAc)₂ (0.001 g, 5 pmol). The reaction was stirred under nitrogen atmosphere at 45 °C for 50 minutes. The reaction was allowed to cool down to room temperature and concentrated under reduced pressure. The resulting crude was purified via flash chromatography on silica gel. Yield: 0.062 g (87%). ^]H NMR (400 MHz, tfc -DMSO): 5 (ppm) 9.73 (t, J = 5.5 Hz, 1H, NH-TFA), 8.18 (s, 1H, H-2), 7.46 (br s, 2H, NH₂), 7.15 (dt, J = 15.4, 1.6 Hz, 1H, C7/=CH- CH₂), 6.55 (t, I = 6.4 Hz, 1H, l’-CH), 6.49 - 6.40 (m, 1H, CH=CH-CH₂), 5.25 (d, J = 4.6 Hz, 1H, 3’-OH), 4.79 (dd, J = 6.2, 5.4 Hz, 1H, 5’-OH), 4.49 - 4.41 (m, 1H, CHH-NH), 4.08 (m, 1H, CH/7 NH), 3.86 - 3.78 (m, 1H, 4’-OH), 3.57 - 3.49 (m, 2H, 5’-CH₂), 2.84 - 2.73 (m, 1H, 2’-CflH), 2.28 - 2. 18 (m, 1H, 2’-CHH). LC-MS (ES and CI): (positive ion) m/z 385 (M+H⁺); (negative ion) m/z 383 (M-H⁺).¹⁹F NMR (376 MHz, d₆ -DMSO): 5 (ppm) -74.13. LC-MS (ESI): (negative ion) m/z 401 (M-H⁺).

Synthesis of 8-aza-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-l-ynyl1-2’ -deoxyadenosine

(Compound II-3)

[0149] A solution of 7-iodo-7-deaza-8-aza-2’-deoxyadenosine (0.150 g, 0.400 mmol) in anhydrous THF (15 mL) was prepared under nitrogen, in a 25 mL round-bottomed flask. 2,2,2- trifluoro-A-prop-2-ynyl-acetamide (0.090 g, 0.6 mmol), Cui (0.008 g, 32 pmol) were subsequently added to the stirring solution, along with Pd(PPh3)4 (0.03 7g, 32 pmol). Lastly, El₂N (0.201 g, 280 pmol, 2 mmol) was added to the solution. The reaction was stirred at 35 °C under nitrogen overnight. Upon complete consumption of the starting material, the reaction was concentrated to dryness under reduced pressure. The crude material was purified via flash chromatography on silica gel. ^]H NMR (400 MHz, 7₆ -DMSO): 5 (ppm) 10.15 (t, 1H, NH-TFA), 8.24 (s, 1H, H-2), 8.15 (br s, 2H, NH₂), 6.54 (t, J = 6.4 Hz, 1H, I’-CH), 5.27 (d, 7 = 4.6 Hz, 1H, 3’-OH), 4.75 (t, 7 = 5.7 Hz, 1H, 5’-OH), 4.41 (d, 7 = 5.2 Hz, 2H, CH₂-NH), 3.83 - 3.78 (m, 1H, 3’-CH), 3.54 - 3.46 (m, 1H, 4’-CH), 3.39 - 3.34 (m, 2H, 5’-CH2), 2.80 - 2.72 (m, 1H, 2’-CHH), 2.29 - 2.20 (m, 1H, 2’-CH/7). ¹⁹F NMR (376 MHz, d₆ -DMSO): 5 (ppm) -77.33. LC-MS (ESI): (negative ion) m/z 399 (M-H⁺). Example 2. Cyclic voltammetry evaluation of nucleoside redox potentials

[0150] Electrochemical characterization was performed using a Metrohm Autolab potentiostat with a scan speed of 100 mV/s. Cyclic voltammetry experiments (CV) were performed in anhydrous acetonitrile with 0.1 M TBAPFe (tetrabutylammonium hexafluorophosphate) as supporting electrolyte. Analytical solutions were degassed with a gentle flow of N2 for 10 minutes prior to acquisition of CV traces. A typical three-electrode setup was used with glassy carbon (GC, WE), Pt wire (CE) and Ag wire (pseudo-reference electrode). All potentials were registered against Fc/Fc⁺ as an external standard and then converted into NHE (Fc/Fc⁺ = 0.63 V vs NHE). The results are shown in Table 1.

Table 1. Redox potential of adenosine derivatives

[0151] The redox potential of the adenosine derivatives measured by CV indicated that strong electron-donating substituents at 7-position of 7 -deazadenosines (for example, 7-iodo-7- deazadenosine, 7-deazaA-DB and Compound II-l) make the aromatic ring more prone to oxidation. Similarly, electron withdrawing groups at position 7- (for example, 7-deazaA-PA and Compound El and Compound E2) increase the oxidation potential, making the base less prone to oxidation. In the 8-aza-7-deaza-adenosines series (Compounds II-2 and II-3), the introduction of a nitrogen atom decreases the electronic density of the aromatic ring, therefore Compounds II-2 and II- 3 have an oxidation potential very similar to the one of natural adenosine. The trends observed in the redox potential measurement are consistent with those obtained in the nucleotide ROS damage assay described below. dA Natural

Example 3. Nucleotide Reactive Oxygen Species damage assay

[0152] To test nucleoside stability in the presence of ROS, the nucleosides were presented in an aqueous mixture with a dye generating ROS when irradiated. The mixture was irradiated with light at 405 nm wavelength for a period of time and the stability of the nucleoside was monitored at set time points by analytical HPLC.

[0153] Experimental procedure: A mixture containing the nucleoside of interest (90 pM) with an internal standard (thymidine triphosphate, 90 pM) and a ROS-producing dye (8- methoxypyrene-l,3,6-trisulfonate trisodium salt, 450 pM) in Tris buffer (10 mM Tris, pH 8) was irradiated with a violet LED (405 nm) at room temperature. The remaining nucleoside fraction was determined at different time points by analytical HPLC, using the internal standard peak as reference. The results are presented in Table 2. From the kinetic curves the half-life of the nucleosides under the reaction conditions were calculated and summarized in Table 3.

Table 2.

Table 3.

[0154] The obtained data indicates that 7-deazaA has a limited stability under high- ROS condition in comparison to the natural deoxynucleoside. Varying the pendant arm at C7 on 7-deazaA further affects ROS stability. In comparison, the 7-deaza-8-aza nucleosides were significantly more stable under the tested conditions. The trends observed in this experiment are consistent with the redox potential trends obtained by cyclic voltammetry analysis.

Claims

WHAT IS CLAIMED IS:

1. A nucleotide comprising a ribose or 2' deoxyribose, and a modified adenine moiety having the structure of formula (I) or (II): wherein each X is independently N or CR²; each of R¹ and R² is independently H or an electron withdrawing group;

Z¹ is absent, CR^aR^b, NR^C, 0, S, C(=0), C(=O)NR^C, S(=0), S(=0)₂, or S(=O)₂NR^C; ring A is a Ce-Cio arylene or a five to 10 membered heteroarylene, each optionally substituted with one or more electron withdrawing groups;

L¹ is absent, -(CH₂)_mNH-, -(CH₂)_mC(=O)NH(CH₂)_n- -(Cl I₂)_mNHC(=O)(CI I₂)_n- •>

-NHC(=O)(CH₂)kNH-, -C(=O)NH(CH₂)kNH- -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)_m-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N;

L² is -(CH₂)_mNH-, -(CH₂)_mS(=O)₂NH-, -S(=O)₂(CH₂)_m-,

(CH₂)_mC(=O)NH(CH₂)n-, -(CH₂)_mNHC(=O)(CH₂)_n-, -NHC(=O)(CH₂)_kNH-, -C(=O)NH(CH₂)_kNH-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N; each of R^a, R^b and R^c is independently H, or unsubstituted or substituted Ci-Ce alkyl; each of R^d and R^e is independently H or an electron withdrawing group, provided that at least one of R^d and R^e is an electron withdrawing group; each of m and n is independently 0, 1, 2, 3, 4, 5 or 6; each k is independently 1, 2, 3, 4, 5, or 6;

* indicates the point of attachment of the modified adenine moiety to the ribose or 2' deoxyribose; and ** indicates the point of the attachment of the modified adenine moiety to a functional moiety or a detectable label, optionally via a cleavable linker; provided that when Z² is absent, then L² is -(CFb -eNH-, -(CH₂)mS(=O)2NH-, -(CH₂)_mC(=O)NH(CH₂)n-, -(CH₂)_mNHC(=O)(CH₂)n-, -NHC(=O)(CH₂)_kNH-

-C(=O)NH(CH₂)kNH-, or 2 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N; and when then X is N or at least one of R¹ and R² is an electron withdrawing group.

2. The nucleotide of claim 1 , wherein the nucleotide has the structure of formula (I- A) or

(II-A): wherein

R³ is monophosphate, diphosphate, triphosphate, thiophosphate, thiodiphosphate, or thiotriphosphate;

PG is a 3' hydroxy blocking group;

L^c is the cleavable linker; and

R^x is the functional moiety or the detectable label.

3. The nucleotide of claim 1 or 2, wherein X is N.

4. The nucleotide of claim 1 or 2, wherein X is CR², and R² is H, F, CN, NO₂, CHF₂, CH₂F, or CF₃.

5. The nucleotide of any one of claims 1 to 4, wherein R¹ is H, F, CN, NO₂, CHF₂, CH₂F, or CF₃.

6. The nucleotide of any one of claims 1 to 5, wherein ring A is phenylene, pyridylene, pyrimidylene, triazolylene, pyrazolylene, pyrrolylene, furylene, thienylene, imidazolylene, thiazolylene, isothiazolylene, oxazolylene, or isoxazolylene, each optionally substituted with one or more electron withdrawing groups.

7. The nucleotide of claim 6, wherein ring A is , each optionally substituted with one or more substituents selected from F, CN, NO₂, CHF₂, CH₂F, or CF₃.

8. The nucleotide of claim 6 or 7, wherein Z¹ is absent.

9. The nucleotide of claim 6 or 7, wherein Z¹ is CH₂, NH, O or S.

10. The nucleotide of any one of claims 6 to 9, wherein L¹ is -CH₂NH-, -CH₂C(=O)NH- -C(=O)NH- -CH₂NHC(=O)CH₂- or a 3 to 8 membered heteroalkylene containing one or more heteroatoms selected from O, S and N.

11. The nucleotide of any one of claims 1 to 5, wherein Z² is -CH=CH-, -C(=O)- CH=CH-, -CF₂-, -CF₂-CH=CH-, unsubstituted or substituted cyclopropylene, unsubstituted or substituted cyclobutylene, or unsubstituted or substituted azetidinylene.

12. The nucleotide of claim 11, wherein L² is -CH₂NH-, -S(=O)₂NH-, or -CH₂C(=O)NH-.

13. The nucleotide of any one of claims 1 to 5, wherein Z² is absent.

14. The nucleotide of claim 13, wherein L² is -S(=O)₂NH-, -S(=O)₂CH₂-, -C(=O)NH-, -NHC(=O)CH₂NH- or -C(=O)NHCH₂NH-.

15. The nucleotide of any one of claims 2 to 14, wherein PG is azidomethyl (-CH₂N₃), allyl (-CH₂CH=CH₂), or -CH₂OCH₂CH=CH₂.

16. The nucleotide of any one of claims 2 to 15, wherein R³ is a triphosphate.

17. An oligonucleotide or polynucleotide comprising the nucleotide of claim 16 incorporated thereto.

18. The oligonucleotide or polynucleotide of claim 17, wherein the oligonucleotide or polynucleotide is at least partially complementary and hybridized to a target polynucleotide immobilized on a surface of a solid support.

19. The oligonucleotide or polynucleotide of claim 18, wherein the solid support comprises an array of different target polynucleotides immobilized thereon.

20. A kit comprising a first type of nucleotide according to claim 16.

21. The kit of claim 20, wherein the first type of nucleotide carries a first detectable label.

22. The kit of claim 21, comprising four types of nucleotides, wherein a second type of nucleotide carries a second detectable label, a third type of nucleotide carries a third detectable label, and a fourth type of nucleotide is unlabeled (dark), and wherein each of the detectable labels has a distinct emission maximum that is distinguishable from the other detectable labels.

23. The kit of claim 21, comprising four types of nucleotides, wherein a second type of nucleotide carries a second detectable label, a third type of nucleotide comprises a mixture of the third type of nucleotide carrying the first detectable label and the third type of nucleotide carrying the second detectable label, and a fourth type of nucleotide is unlabeled (dark).

24. The kit of claim 20, wherein the first type of nucleotide is unlabeled and has a first functional moiety that can attach to a labeling reagent.

25. The kit of claim 24, wherein the kit further comprises a first labeling reagent, wherein the first labeling reagent comprises one or more first detectable labels and a first binding moiety that is capable of specific binding to the first functional moiety of the first type of unlabeled nucleotide.

26. The kit of claim 24 or 25, wherein each of the four types of nucleotides is unlabeled, and wherein the second type of unlabeled nucleotides comprises a second functional moiety, and the kit further comprises a second labeling reagent, wherein the second labeling reagent comprises one or more second detectable labels and a second binding moiety that is capable of specific binding to the second functional moiety of the second type of unlabeled nucleotide.

27. The kit of claim 26, wherein the third type of unlabeled nucleotides comprises a third functional moiety, and the kit further comprises a third labeling reagent, wherein the third labeling reagent comprises one or more third detectable labels and a third binding moiety that is capable of specific binding to the third functional moiety of the third type of unlabeled nucleotide.

28. The kit of claim 26, wherein the third type of unlabeled nucleotides comprises a mixture of the third type of unlabeled nucleotides comprising the first functional moiety and the third type of unlabeled nucleotides comprising the second functional moiety, and wherein both the first labeling reagent and the second labeling reagent are capable of specific binding to the third type of unlabeled nucleotides.

29. The kit of claim 27 or 28, wherein the fourth type of unlabeled nucleotides is not capable of specific binding with any of the first, second, or third labeling reagent.

30. The kit according to any one of claims 20 to 29, further comprising a DNA polymerase and one or more buffer compositions.

31. A method of determining the sequences of a plurality of different target polynucleotides, comprising: (a) contacting a solid support with a solution comprising sequencing primers under hybridization conditions, wherein the solid support comprises a plurality of different target polynucleotides immobilized thereon; and the sequencing primers are complementary to at least a portion of the target polynucleotides;

(b) contacting the solid support with an aqueous incorporation mixture comprising DNA polymerase and one or more of four different types of nucleotides (A, C, G and T or U) under conditions suitable for DNA polymerase-mediated primer extension, and incorporating one type of nucleotides into the sequencing primers to produce extended copy polynucleotides, wherein the A nucleotide is the nucleotide of claim 16 carrying a detectable label through a cleavable linker, and wherein each of the one or more of four different type of nucleotides comprises a 3' blocking group covalently attached to the 2" deoxyribose of the nucleotide;

(c) imaging and performing one or more fluorescent measurements of the extended copy polynucleotides;

32. The method of claim 31, wherein steps (b) to (e) are repeated at least 50, 100, 150, 200, 250, 300, 350, 400, 450 or 500 cycles.

33. The method of claim 31 or 32, wherein the removal of the 3' blocking group also removes the detectable label.

34. A method of determining the sequences of a plurality of different target polynucleotides, comprising:

(b) contacting the solid support with an aqueous incorporation mixture comprising DNA polymerase and one or more of four different types of nucleotides (A, C, G and T or U) under conditions suitable for DNA polymerase-mediated primer extension, and incorporating one type of nucleotides into the sequencing primers to produce extended copy polynucleotides, wherein at least two types of nucleotides are unlabeled and at least one type of unlabeled nucleotide is a unlabeled A nucleotide of claim 16 having a first functional moiety, and wherein each of the one or more of four different type of nucleotides comprises a 3' blocking group covalently attached to the deoxyribose of the nucleotide;

(c) contacting the extended copy polynucleotides with an aqueous labeling mixture comprising a first labeling reagent, wherein the first labeling reagent comprises one or more first detectable labels and a first binding moiety that is capable of specific binding to the first functional moiety of the unlabeled A nucleotide;

35. The method of claim 34, wherein steps (b) to (f) are repeated at least 50, 100, 150, 200, 250, 300, 350, 400, 450 or 500 cycles.

36. The method of claim 34 or 35, wherein the first functional moiety of the first type of unlabeled A nucleotide is bound to the first labeling reagent by either covalent bonding or noncovalent interaction via a cleavable linker.

37. The method of any one of claim 34 to 36, wherein the removal of the 3' blocking group also removes the first labeling reagent.

38. The method of any one of claims 31 to 36, wherein the method is performed on an automated sequencing instrument comprising a single light source.

39. The method of claim 38, wherein the single light source operates at a wavelength from about 500 nm to about 540 nm, or about 520 nm to about 525 nm.

40. The method of any one of claims 31 to 36, wherein the method is performed on an automated sequencing instrument comprising two light sources operating at two different wavelengths.

41. The method of claim 40, wherein one light source operates at a wavelength from about 450 nm to about 460 nm, and the other light source operates at a wavelength from about 520 nm to about 525 nm.

42. A method of preparing a growing polynucleotide complementary to a target singlestranded polynucleotide, comprising incorporating a nucleotide of claim 16 into the growing complementary polynucleotide, wherein the incorporation of the nucleotide prevents the introduction of any subsequent nucleotide into the growing complementary polynucleotide.

43. The method of claim 42, wherein the incorporation of the nucleotide is accomplished by a polymerase, a terminal deoxynucleotidyl transferase (TdT), or a reverse transcriptase.